FAS Research Computing - Notice History

Experiencing partially degraded performance

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours


The colors shown in the bars below were chosen to increase visibility for color-blind visitors.
For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.

Degraded Performance

SLURM Scheduler - Cannon - Degraded Performance

Cannon Compute Cluster (Holyoke) - Degraded Performance

Boston Compute Nodes - Degraded Performance

GPU nodes (Holyoke) - Degraded Performance

seas_compute - Degraded Performance

Operational

SLURM Scheduler - FASSE - Operational

FASSE Compute Cluster (Holyoke) - Operational

Operational

Kempner Cluster CPU - Operational

Kempner Cluster GPU - Operational

Operational

FASSE login nodes - Operational

Operational

Cannon Open OnDemand - Operational

FASSE Open OnDemand - Operational

Operational

Netscratch (Global Scratch) - Operational

Home Directory Storage - Boston - Operational

Tape - (Tier 3) - Operational

Holylabs - Operational

Isilon Storage Holyoke (Tier 1) - Operational

Holystore01 (Tier 0) - Operational

HolyLFS04 (Tier 0) - Operational

HolyLFS05 (Tier 0) - Operational

HolyLFS06 (Tier 0) - Operational

Holyoke Tier 2 NFS (new) - Operational

Holyoke Specialty Storage - Operational

holECS - Operational

Isilon Storage Boston (Tier 1) - Operational

BosLFS02 (Tier 0) - Operational

Boston Tier 2 NFS (new) - Operational

CEPH Storage Boston (Tier 2) - Operational

Boston Specialty Storage - Operational

bosECS - Operational

Samba Cluster - Operational

Globus Data Transfer - Operational

Notice History

March 2024

FASRC maintenance update - All jobs requeued (Cannon and FASSE)
  • Resolved
    This incident has been resolved.
  • Monitoring

    Informational Notice

    The Slurm upgrade to 23.11.4 was completed successfully during maintenance. However, a complication with the automation of Slurm's cryptographic keys occurred during the upgrade, which caused nodes to lose the ability to talk to the Slurm master. The Slurm master therefore viewed those nodes as down and requeued their jobs.

    All jobs on Cannon and FASSE were requeued.

    This is deeply regrettable, but the chain of events that caused it could not be foreseen.

    To check the status of your jobs, see the common Slurm commands at:

    https://docs.rc.fas.harvard.edu/kb/convenient-slurm-commands/#Information_on_jobs
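
As a rough illustration of checking on requeued jobs, the sketch below counts your jobs by state using `squeue`. It assumes it is run on a login node where `squeue` is on the PATH and supports the `--me` option; the helper names `parse_squeue`, `summarize`, and `my_jobs` are hypothetical and not part of any FASRC tooling.

```python
import subprocess
from collections import Counter

# squeue format fields: %i = job ID, %T = job state (long form), %r = reason.
# The pipe separator is an arbitrary choice made for easy parsing.
SQUEUE_FORMAT = "%i|%T|%r"


def parse_squeue(text):
    """Parse pipe-separated squeue output into (job_id, state, reason) tuples."""
    jobs = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        job_id, state, reason = line.split("|", 2)
        jobs.append((job_id, state, reason))
    return jobs


def summarize(jobs):
    """Count jobs per state, for example PENDING versus RUNNING."""
    return Counter(state for _, state, _ in jobs)


def my_jobs():
    """Run squeue for the current user and return the parsed jobs.

    Assumption: squeue is installed and reachable (a Slurm login node).
    """
    result = subprocess.run(
        ["squeue", "--me", "--noheader", f"--format={SQUEUE_FORMAT}"],
        check=True,
        capture_output=True,
        text=True,
    )
    return parse_squeue(result.stdout)
```

Calling `summarize(my_jobs())` on a login node would return a per-state count. Jobs that were requeued and have not yet restarted would normally appear as PENDING.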

    FAS Research Computing

    https://docs.rc.fas.harvard.edu/

    rchelp@rc.fas.harvard.edu

Feb. 2024

Ceph instability - Affects Boston VMs (Virtual Machines) and Tier2 Ceph shares
  • Resolved

    The Ceph instability has been resolved. Ceph Tier2 shares, VDI, and VMs should be back to their normal state.

    If your VM, /net/fs-[labname] share, or VDI session is still impacted, please contact rchelp@rc.fas.harvard.edu

  • Identified

    The infrastructure behind Tier2 Ceph shares and VMs is unstable.
    This also affects VDI/OOD, which relies on virtual machines.

    /net/fs-[labname] shares, new OOD/VDI sessions, and VMs are affected and may be inaccessible until this is resolved.

    Thanks for your patience.

Jan. 2024

Ceph instability - Affects Boston VMs (Virtual Machines) and Tier2 Ceph shares
  • Resolved

    The Ceph instability has been resolved. Ceph Tier2 shares, VDI, and VMs should be back to their normal state.

    If your VM, /net/fs-[labname] share, or VDI session is still impacted, please contact rchelp@rc.fas.harvard.edu

  • Identified

    The infrastructure behind Tier2 Ceph shares and VMs is unstable.
    This also affects VDI/OOD, which relies on virtual machines.

    /net/fs-[labname] shares, new OOD/VDI sessions, and VMs are affected and may be inaccessible until this is resolved.

    Thanks for your patience.

Ceph instability - Affects Boston VMs (Virtual Machines) and Tier2 Ceph shares
  • Resolved

    The Ceph instability has been resolved. Ceph Tier2 shares, VDI, and VMs should be back to their normal state.

    If your VM, /net/fs-[labname] share, or VDI session is still impacted, please contact rchelp@rc.fas.harvard.edu

  • Identified

    The infrastructure behind Tier2 Ceph shares and VMs is unstable.
    This also affects VDI/OOD, which relies on virtual machines.

    /net/fs-[labname] shares, new OOD/VDI sessions, and VMs are affected and may be inaccessible until this is resolved.

    Thanks for your patience.

Ceph instability - Affects Boston VMs (Virtual Machines) and Tier2 Ceph shares
  • Resolved

    The Ceph instability has been resolved. Ceph Tier2 shares, VDI, and VMs should be back to their normal state.

    If your VM, /net/fs-[labname] share, or VDI session is still impacted, please contact rchelp@rc.fas.harvard.edu

  • Identified

    The infrastructure behind Tier2 Ceph shares and VMs is unstable.
    This also affects VDI/OOD, which relies on virtual machines.

    /net/fs-[labname] shares, new OOD/VDI sessions, and VMs are affected and may be inaccessible until this is resolved.

    Thanks for your patience.

Jan. 2024 to March 2024
