All systems are operational

About This Site

Status page for Harvard FAS Research Computing and the Odyssey3 Cluster.

Please scroll down to see details on any Incidents.

Scheduled Maintenance
Winter Maintenance Downtime - Dec 17th

Date: Monday December 17th Time: TBD - All day event

FAS RC will be performing maintenance on several system, primarily, but not limited to, most resources housed in our Boston data center. Please plan accordingly as this all-day event will likely affect all users and jobs.

This downtime will interrupt many services including, but not limited to, storage, home directories, authentication, accounts and portal, as well as licensing and virtual machines.

A more detailed list of resources affected and the tasks being performed will be posted closer to the event date. https://www.rc.fas.harvard.edu/winter-maintenance

Past Incidents

8th September 2018

No incidents reported

7th September 2018

No incidents reported

6th September 2018

No incidents reported

5th September 2018

No incidents reported

4th September 2018

Regal Scratch Filesystem Regal Issues

Hello Odyssey Users,

We are seeing continued performance issues with Regal after the long weekend. This is related to one of the OST (Object Storage Targets) nodes reaching a stuck state. We are actively trying to identify the cause and correcting this, likely the OST node will need to restarted which may cause further performance issues with Regal during this time. Likely this should be done around 12pm or 1pm today, barring any unexpected issues. No data loss is expected from this. We will also continue to update this page as progress is made.

Thank you, FASRC

Update @ 12:16pm: The engineer working with Regal has been able to restore functionality. Performance should be returning to normal. We have opened all the nodes that were previously closed to Regal. Additionally we are also running a retention of regal, this should help get the usage down that helps alleviate further performance issues (use 95% and over causes issues).

Update @ 2:57pm: Since this morning Regal has crept up past 95%, this is causing ongoing performance issues for the time being. We are currently running retention to free up more space. The retention should finish some time this evening or possibly tomorrow morning. Feel free to check back here for status updates.

Update @ 9/5 12:00pm: Retention finished overnight and Regal seems to be returning to a more normal state. We will continue to watch usage and report back.

3rd September 2018

No incidents reported

2nd September 2018

No incidents reported