FAS Research Computing - Historique des incidents

Expérimenter des performances partiellement dégradées

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours


The colors shown in the bars below were chosen to increase visibility for color-blind visitors.
For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.

Performances dégradées

SLURM Scheduler - Cannon - Performances dégradées

Cannon Compute Cluster (Holyoke) - Performances dégradées

Boston Compute Nodes - Performances dégradées

GPU nodes (Holyoke) - Performances dégradées

seas_compute - Performances dégradées

Opérationnel

SLURM Scheduler - FASSE - Opérationnel

FASSE Compute Cluster (Holyoke) - Opérationnel

Opérationnel

Kempner Cluster CPU - Opérationnel

Kempner Cluster GPU - Opérationnel

Opérationnel

FASSE login nodes - Opérationnel

Opérationnel

Cannon Open OnDemand/VDI - Opérationnel

FASSE Open OnDemand/VDI - Opérationnel

Opérationnel

Netscratch (Global Scratch) - Opérationnel

Home Directory Storage - Boston - Opérationnel

Tape - (Tier 3) - Opérationnel

Holylabs - Opérationnel

Isilon Storage Holyoke (Tier 1) - Opérationnel

Holystore01 (Tier 0) - Opérationnel

HolyLFS04 (Tier 0) - Opérationnel

HolyLFS05 (Tier 0) - Opérationnel

HolyLFS06 (Tier 0) - Opérationnel

Holyoke Tier 2 NFS (new) - Opérationnel

Holyoke Specialty Storage - Opérationnel

holECS - Opérationnel

Isilon Storage Boston (Tier 1) - Opérationnel

BosLFS02 (Tier 0) - Opérationnel

Boston Tier 2 NFS (new) - Opérationnel

CEPH Storage Boston (Tier 2) - Opérationnel

Boston Specialty Storage - Opérationnel

bosECS - Opérationnel

Samba Cluster - Opérationnel

Globus Data Transfer - Opérationnel

Historique des incidents

sept. 2025

août 2025

SMB access to shares on the FASRC samba cluster)
  • Résolu
    Résolu

    SMB access has been restored. Please disconnect and retry if you have a failed mapped drive. If you still cannot connect to a share, please contact rchelp@rc.fas.harvard.edu and let us know your username and exactly which share you are attempting to map.

  • Identifié
    Identifié

    We are continuing to work on a fix for this incident. No ETA.

  • Détecté
    Détecté

    Drive mapping to some shares may fail if those shares use the Samba Cluster. This includes but is not limited to share paths that begin with \\smbip.

    Known affected shares:

    anderson_lab

    arlotta_lab

    bellono_lab

    bertoldi_lab c

    apellini_lab

    dasch14

    dasch15

    dasch16

    denic_lab

    dobbie_lab

    engert_lab

    ferreira_lab

    fortune_lab

    friedman_lab

    girguis_lab

    grad_lab

    hausmann_lab

    hays_lab

    hbs_liran

    hbs_rcs huh

    illumina

    jessicacohen_lab

    lichtman_boslfs02

    mallet_lab

    mason_lab

    mckinley_lab

    mcz

    mitrano_lab

    moorcroftfs5

    murraylab

    nmr_large

    nmr_small

    novitsky_lab

    pooling

    qbrc_center

    ramachandran_lab

    schnapp_lab

    schrag_lab

    srivastava_lab

    whited_lab

    yau2_lab

juil. 2025

FASRC Monthly maintenance July 7, 2025 9AM-1PM
  • Terminé
    juillet 07, 2025 à 17:00
    Terminé
    juillet 07, 2025 à 17:00
    Maintenance has completed successfully
  • En cours
    juillet 07, 2025 à 13:00
    En cours
    juillet 07, 2025 à 13:00
    Maintenance is now in progress
  • Pas encore commencé
    juillet 07, 2025 à 13:00
    Pas encore commencé
    juillet 07, 2025 à 13:00

    FASRC monthly maintenance will take place Monday July 7th, 2025 from 9am-1pm

    NOTICES

    • ​New Quota tool available (/usr/local/sbin/quota) - Works on all filesystem types (home directory, lustre, isilon, netscratch, etc.)
      Type quota -h to see the full instructions for usage o visit the usage doc.

    • Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at https://www.rc.fas.harvard.edu/upcoming-training/

    • Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at https://status.rc.fas.harvard.edu/ (click Get Updates for options).

    • Upcoming holidays:​ Juneteenth - ​T​hur. June 19​ / Independence Day - Fri​. July 4

    MAINTENANCE TASKS
    Cannon cluster will be paused during this maintenance?: YES
    FASSE cluster will be paused during this maintenance?: YES

    • Slurm Upgrade to 24.11.5

      • Audience: All cluster users

      • Impact: Jobs and the scheduler will be paused during this upgrade

    • Login node ​OS ​upgrades

      • Audience: Anyone logged into a FASRC Cannon or FASSE login node

      • Impact: All login nodes will ​upgraded ​and unavailable during this maintenance window

    • ​Start of cluster OS upgrades - July 7 -10

      • Audience: All cluster users

      • Impact: Over 4 days, July 7 through 10, we will upgrade the OS on 25% of the cluster each day. During that time, total capacity will be reduced across the cluster by 1/4 each day. This will require draining each sub-set of nodes ahead of time. 

    • Netscratch cleanup ( https://docs.rc.fas.harvard.edu/kb/policy-scratch/ )

      • Audience: Cluster users

      • Impact: Files older than 90 days will be removed. Please note that retention cleanup can and does run at any time, not just during the maintenance window.

    Thank you,
    FAS Research Computing
    https://docs.rc.fas.harvard.edu/
    https://www.rc.fas.harvard.edu/

Rolling cluster OS upgrades July 7 - 10
  • Terminé
    juillet 11, 2025 à 16:02
    Terminé
    juillet 11, 2025 à 16:02

    All upgrades are complete. A small number of nodes need clean-up, but the cluster is back to normal operation with all nodes running Rocky 8.10. Thanks for your patience.

  • Mettre à jour
    juillet 07, 2025 à 13:00
    Mettre à jour
    juillet 07, 2025 à 13:00

    Cannon rolling upgrades are in progress. Not all nodes are available.

    https://www.rc.fas.harvard.edu/blog/2025-compute-os-upgrade/

  • En cours
    juillet 07, 2025 à 13:00
    En cours
    juillet 07, 2025 à 13:00

    UPDATE: 7/7/25 6M FASSE is operational.

    Please be aware that FASSE jobs cannot be launched at this time due to the upgrades.
    We will return all FASSE nodes to normal services as soon as possible.

    https://www.rc.fas.harvard.edu/blog/2025-compute-os-upgrade/

  • Pas encore commencé
    juillet 07, 2025 à 13:00
    Pas encore commencé
    juillet 07, 2025 à 13:00

    Cluster OS upgrades - July 7 -10

    • Audience: All cluster users

    • Impact: Over 4 days, July 7 through 10, we will upgrade the OS on 25% of the cluster each day.
      During that time, total capacity will be reduced across the cluster by 1/4 each day.
      This will require draining each sub-set of nodes ahead of time. 

    Work begins during the July 7th maintenance (login nogdes will be upgraded during the 7/7 maintenance window) and will continue through July 10th.

    Additional details and a breakdown of each phase: 2025 Compute OS Upgrade

juil. 2025 à sept. 2025

Suivant