FAS Research Computing - Slurm - critical security patch – 故障详情

目前部分性能下降

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours


The colors shown in the bars below were chosen to increase visibility for color-blind visitors.
For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.

Slurm - critical security patch

已解决
严重故障
开始于 超过 2 年前持续 大约 1 小时

受到影响

Cannon Cluster

严重故障 从 3:02 PM 至 4:26 PM

SLURM Scheduler - Cannon

严重故障 从 3:02 PM 至 4:26 PM

Cannon Compute Cluster (Holyoke)

严重故障 从 3:02 PM 至 4:26 PM

Boston Compute Nodes

严重故障 从 3:02 PM 至 4:26 PM

GPU nodes (Holyoke)

严重故障 从 3:02 PM 至 4:26 PM

FASSE Cluster

严重故障 从 3:02 PM 至 4:26 PM

更新
  • 已解决
    已解决

    The security patch has been applied, and all clusters are accepting jobs at this time.

  • 调查中
    调查中

    SchedMD (the maintainers of Slurm) have discovered a critical security flaw in Slurm. Due to the nature and severity of the issue, we will be immediately applying this patch.

    Cannon and FASSE schedulers will remain down for the duration of the patching. All running jobs will be paused, and new jobs will not be accepted until the scheduler is back up.

    ETA is expected to be approximately one hour.