FAS Research Computing - OOD/VDI apps down, holystore01 performance issues – Incident details

seas_compute under maintenance

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours


The colors shown in the bars below were chosen to increase visibility for color-blind visitors.
For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.

OOD/VDI apps down, holystore01 performance issues

Resolved
Major outage
Started 6 days agoLasted about 2 hours

Affected

VDI/OpenOnDemand

Major outage from 3:16 PM to 5:08 PM

Cannon Open OnDemand/VDI

Major outage from 3:16 PM to 5:08 PM

FASSE Open OnDemand/VDI

Major outage from 3:16 PM to 5:08 PM

Storage

Major outage from 3:16 PM to 5:08 PM

Holystore01 (Tier 0)

Major outage from 3:16 PM to 5:08 PM

Updates
  • Resolved
    Resolved

    The root cause is related to a filesystem outage. The filesystem is available again and load has dropped. The clusters and OOD are responding again. If a job failed, you will need to restart it.

  • Investigating
    Investigating

    OOD apps that rely on Singularity images are currently unavailable due to holystore01 performance issues. The following apps may have trouble launching from the virtual desktop:

    Cannon:

    • RStudio Server

    • Containerized Remote Desktop

    FASSE

    • RStudio Server

    • Containerized Remote Desktop

    • HeavyAI

    Files on holystore01 may be intermittently inaccessible as well. We are currently investigating this incident.