Notice history

Operational

Partial outage

Operational

Nov 2025

Resolved
November 19, 2025 at 6:25 PM
Resolved
November 19, 2025 at 6:25 PM
This alert was a false alarm. Coldfront is operating normally.
Investigating
November 19, 2025 at 5:14 PM
Investigating
November 19, 2025 at 5:14 PM
Coldfront cannot be accessed at the moment. This incident was created by an automated monitoring service.

Resolved
November 19, 2025 at 4:31 PM
Resolved
November 19, 2025 at 4:31 PM
This test incident has been resolved.
Investigating
November 19, 2025 at 4:30 PM
Investigating
November 19, 2025 at 4:30 PM
Authentication issues with openauth/radius. This incident was created by an automated monitoring service.

Resolved
November 18, 2025 at 8:35 PM
Resolved
November 18, 2025 at 8:35 PM
The Slurm scheduler is back online.
Identified
November 18, 2025 at 8:28 PM
Identified
November 18, 2025 at 8:28 PM
The Slurm scheduler is experiencing a problem and we need to take it down briefly to clear this up. Should be short duration, but no ETA yet. This will not affect running jobs but will mean you cannot talk to the scheduler or submit new jobs.

Starfish dashboard maintenance Nov. 14th 5-6PM

Completed
November 14, 2025 at 11:00 PM
Completed
November 14, 2025 at 11:00 PM
Maintenance has completed successfully
In progress
November 14, 2025 at 10:00 PM
In progress
November 14, 2025 at 10:00 PM
Maintenance is now in progress
Planned
November 14, 2025 at 10:00 PM
Planned
November 14, 2025 at 10:00 PM
There is a planned upgrade of the Starfish dashboard scheduled for Friday November 14th starting at 5PM.
The dashboard will be down for an hour while the upgrade is performed.

Resolved
November 14, 2025 at 3:05 PM
Resolved
November 14, 2025 at 3:05 PM
The Slurm scheduler is back up and new jobs can be scheduled at this time.
This incident has been resolved.
Identified
November 14, 2025 at 2:49 PM
Identified
November 14, 2025 at 2:49 PM
The Slurm job scheduler will be offline temporarily while we resolve some minor issues. Jobs will continue to run but you will be unable to access the scheduler or submit new jobs until we return it to service.

Oct 2025

Resolved
October 08, 2025 at 4:05 PM
Resolved
October 08, 2025 at 4:05 PM
This incident has been resolved.
Investigating
October 08, 2025 at 3:32 PM
Investigating
October 08, 2025 at 3:32 PM
We are currently investigating an issue with holystore01. Service may be degraded. Updates will be posted here as we identify the root cause.

Resolved
October 07, 2025 at 7:46 PM
Resolved
October 07, 2025 at 7:46 PM
The license server is responding normally again.
Update
October 07, 2025 at 7:01 PM
Update
October 07, 2025 at 7:01 PM
We are currently investigating this incident.
Investigating
October 07, 2025 at 6:57 PM
Investigating
October 07, 2025 at 6:57 PM
rclic is currently down. Software licenses may not be accessible, particularly for the following:
- Matlab
- Gurobi
- Mathematica
We are currently investigating this incident.

FASRC monthly maintenance Monday October 6th, 2025 9am-1pm

Completed
October 06, 2025 at 5:00 PM
Completed
October 06, 2025 at 5:00 PM
Maintenance is now in progress
In progress
October 06, 2025 at 1:00 PM
In progress
October 06, 2025 at 1:00 PM
Maintenance is now in progress
Planned
October 06, 2025 at 1:00 PM
Planned
October 06, 2025 at 1:00 PM
FASRC monthly maintenance will take place Monday October 6th, 2025 from 9am-1pm
NOTICES
- Training: Upcoming training from FASRC and other sources can be found on our Training Calendar. at https://www.rc.fas.harvard.edu/upcoming-training/
- Status Page: You can subscribe to our status to receive notifications of maintenance, incidents, and their resolution at https://status.rc.fas.harvard.edu/ (click Get Updates for options).
- Upcoming holidays: Columbus / Indigenous Peoples’ Day - October 13
MAINTENANCE TASKS
Cannon cluster will be paused during this maintenance?: NO
FASSE cluster will be paused during this maintenance?: NO
- DNS server reboots
  Audience: All FASRC services
  Impact: Rolling reboot should have no impact
- Login node reboots
  Audience: Anyone logged into a FASRC Cannon or FASSE login node
  Impact: All login nodes will rebooted during this maintenance window
- Netscratch cleanup ( https://docs.rc.fas.harvard.edu/kb/policy-scratch/ )
  Audience: Cluster users
  Impact: Files older than 90 days will be removed. Please note that retention cleanup can and does run at any time, not just during the maintenance window.
Thank you,
FAS Research Computing
https://docs.rc.fas.harvard.edu/
https://www.rc.fas.harvard.edu/

Resolved
October 03, 2025 at 3:45 PM
Resolved
October 03, 2025 at 3:45 PM
This incident has been resolved and login to FASRC sites and services, including OOD and Spinal, is restored.
Identified
October 03, 2025 at 3:24 PM
Identified
October 03, 2025 at 3:24 PM
Some FASRC webpages and services (such as OOD) that require authentication may not load.
We have identified the cause and are working on a solution

Sep 2025

Resolved
September 26, 2025 at 1:06 PM
Resolved
September 26, 2025 at 1:06 PM
The root cause was found and a fish has been implemented.
Investigating
September 26, 2025 at 10:02 AM
Investigating
September 26, 2025 at 10:02 AM
The changelog (which tracks changes on the filesystem) on holylfs05 is throwing errors creating instability on the filesystem. We are investigating.

Resolved
September 24, 2025 at 4:03 PM
Resolved
September 24, 2025 at 4:03 PM
The gpu_test partition is back to normal service.
This incident has been resolved.
Investigating
September 24, 2025 at 1:44 PM
Investigating
September 24, 2025 at 1:44 PM
The gpu_test partition is currently down. Please use our other public gpu partitions in the meanwhile.
We are currently investigating this incident. incident.

Resolved
September 23, 2025 at 4:38 PM
Resolved
September 23, 2025 at 4:38 PM
The gpu_test partition is back in normal service.
Investigating
September 23, 2025 at 2:45 PM
Investigating
September 23, 2025 at 2:45 PM
The gpu_test partition is currently down due to a networking issue in that row. Please use our other public gpu partitions in the meanwhile.
We are currently investigating this incident.

Resolved
September 22, 2025 at 1:01 PM
Resolved
September 22, 2025 at 1:01 PM
holylfs05 is back online. Thank you for your patience.
Update
September 20, 2025 at 4:32 PM
Update
September 20, 2025 at 4:32 PM
holylfs05 is undergoing filesystem checks. Due to the size of the system, this process will take the rest of the weekend at a minimum.
No ETA at this time.
We are continuing to work on a fix for this incident. Our sincere apologies for the unexpected disruption.
Identified
September 20, 2025 at 12:08 AM
Identified
September 20, 2025 at 12:08 AM
holylfs05 will be inaccessible for an extended period of time while our staff continues to troubleshoot the underlying cause We are continuing to work on a fix for this incident.
Investigating
September 19, 2025 at 8:22 PM
Investigating
September 19, 2025 at 8:22 PM
Writing to holylfs05 may result in errors. We are currently investigating this incident.

Resolved
September 09, 2025 at 2:23 PM
Resolved
September 09, 2025 at 2:23 PM
This incident has been resolved. Coldfront is back online.
Investigating
September 09, 2025 at 1:31 PM
Investigating
September 09, 2025 at 1:31 PM
Coldfront is not operating normally. We are investigating.

Sep 2025 to Nov 2025

FAS Research Computing - Notice history

Notice history

Nov 2025

Oct 2025

Sep 2025