FAS Research Computing - Partial compute outage due to cooling failure – Incident details
All systems operational
Status page for the Harvard FAS Research Computing cluster and other resources.
Cluster Utilization (VPN and FASRC login required): Cannon | FASSE
Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).
The colors shown in the bars below were chosen to increase visibility for color-blind visitors. For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.
Partial compute outage due to cooling failure
Resolved
Operational
Started more than 3 years ago. Lasted about 12 hours.
Affected
Cannon Cluster
Operational from 2:52 AM to 3:06 PM
Cannon Compute Cluster (Holyoke)
Operational from 2:52 AM to 3:06 PM
FASSE Cluster
Operational from 2:52 AM to 3:06 PM
FASSE Compute Cluster (Holyoke)
Operational from 2:52 AM to 3:06 PM
Updates
Resolved
The cooling issue for the affected nodes has been resolved and they are back in service.
Update
The following FASSE partitions are fully down:
fasse
fasse_bigmem
test
The following Cannon partitions are fully down:
test
gpu
gpu_mig
bigmem
unrestricted
seas
itc_cluster
imasc
huce_cascade
geophysics
conroy
davies
edwards
giribet
ortegahernandez
pehlevan
xlin
zon
We are continuing to work on a fix for this incident.
Investigating
We are currently investigating an outage tied to a cooling failure in row 7c.
We will share more details as we learn them.
There is no ETA at this time for returning the down nodes to service.