FAS Research Computing - June 5-6 MGHPCC pod 7c cooling updates - See partition list below – Detalji popravke

U toku je gora performansa

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours


The colors shown in the bars below were chosen to increase visibility for color-blind visitors.
For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.

June 5-6 MGHPCC pod 7c cooling updates - See partition list below

Završeno
Zakazano za June 05, 2025 u 11:00 AM – 7:03 PM

Utiče na

Cannon Cluster

Popravka u toku undefined 11:00 AM do 7:03 PM

Cannon Compute Cluster (Holyoke)

Popravka u toku undefined 11:00 AM do 7:03 PM

seas_compute

Popravka u toku undefined 11:00 AM do 7:03 PM

Ажурирања
  • Završeno
    June 05, 2025 u 7:03 PM
    Završeno
    June 05, 2025 u 7:03 PM

    The work on row 7c is complete. Returning idled nodes to normal service.

  • U toku
    June 05, 2025 u 11:00 AM
    U toku
    June 05, 2025 u 11:00 AM
    Maintenance is now in progress
  • Planirano
    June 05, 2025 u 11:00 AM
    Planirano
    June 05, 2025 u 11:00 AM

    There will be additional scheduled maintenance at MGHPCC between June 5th and 6th.

    As part of the work during the MGHPCC Outage, one of the Cooling Distribution Unit (CDU) in Pod 7c will be replaced. This will allow for future expansion into this space.

    This work will run from Thursday Jun 5th until the evening of Friday June 6th. This means nodes whose names begin with holy7c02, 04, 06, 08, 10, 12 will not come back online after the outage and will remain down until this CDU update is complete.

    This impacts the following partitions. If you are using one of those partitions please use the public sapphire partition while your equipment is being serviced. These nodes will be returned to service once the CDU work is complete:

    • blackhole

    • blackhole_priority

    • davies

    • desai

    • eddy

    • huce_cascade

    • huce_cascade_priority

    • huttenhower

    • jacobsen2

    • janson

    • janson_cascade

    • ke

    • lukin

    • nguyen

    • seas_compute

    • shared

    • tambe

    • vishwanath

    • whipple

    • xlin