Status page for the Harvard FAS Research Computing cluster and other resources.
Please scroll down to see details on any Incidents.
No incidents reported
UPDATE: 10:45: Regal is back to normal.
The Regal scratch filesystem is experiencing performance issues, causing jobs to get stuck and nodes to be closed. We are actively working on this issue.
UPDATE: 5:12 PM: The Lustre management tools have caught up and are reporting normally. Moving status to operational.
UPDATE: 5:02 PM: The reboot is being reconsidered, as performance seems OK. We suspect that the diagnostics are not reporting up-to-date information.
UPDATE: 4:50 PM: A full reboot of Regal is necessary at this point. Jobs will be suspended, but because timeouts have likely already occurred, some jobs reading from or writing to Regal may fail.
Regal scratch is experiencing performance issues again. We are actively working on the problem. Updates to follow.
UPDATE: Regal appears to be back to normal. We will continue to monitor for issues. NOTE: a plan to replace Regal is already under way. Thank you for your understanding and patience.
UPDATE: Regal is responding, but may become slow again as we restart various storage modules.
Regal scratch is experiencing performance issues. We are actively working on the problem.