FAS Research Computing - Winter maintenance - January 5th 8am-5pm – Maintenance details

All systems operational

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours


The colors shown in the bars below were chosen to increase visibility for color-blind visitors.
For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.

Winter maintenance - January 5th 8am-5pm

Completed
Scheduled for January 05, 2022 at 1:00 PM – 7:56 PM

Affects

Cannon Cluster

Under maintenance from 1:00 PM to 7:56 PM

SLURM Scheduler - Cannon

Under maintenance from 1:00 PM to 7:56 PM

Cannon Compute Cluster (Holyoke)

Under maintenance from 1:00 PM to 7:56 PM

Boston Compute Nodes

Under maintenance from 1:00 PM to 7:56 PM

Login Nodes

Under maintenance from 1:00 PM to 7:56 PM

login.rc.fas.harvard.edu

Under maintenance from 1:00 PM to 7:56 PM

Updates
  • Completed
    January 05, 2022 at 7:56 PM

    Maintenance has completed successfully.

  • Notice
    January 05, 2022 at 7:52 PM

    All data center work is complete.

    Remaining tasks are:

    • rebooting login and VDI nodes
    • running scratch 90-day retention
  • In progress
    January 05, 2022 at 5:51 PM

    Maintenance is on track.

    Several smaller tasks are complete, but water cooling maintenance is the larger task today and is still ongoing.

  • Planned
    January 05, 2022 at 1:00 PM

    Our winter maintenance will take place on January 5th, from 8am to 5pm. This is an all-day event: jobs will be suspended for the duration, and scratch (holyscratch01) will be offline for updates.

    Storage firmware will also be updated that day, potentially in both data centers, so short storage interruptions may occur.

    This is one of the two times each year we can plan maintenance around our Boston data center and also make changes that involve maintaining, modifying, or adding to the water-cooling system in MGHPCC.

    Tasks will include:

    • Water cooling maintenance in MGHPCC row 7-C
    • Slurm 21.08.5 Upgrade (jobs paused during maintenance)
    • Firmware upgrades
    • Scratch (holyscratch01) maintenance (offline during maintenance)
    • Login and VDI node reboots
    • Scratch 90-day retention cleanup