FAS Research Computing - MGHPCC power work - Part 2 May 18 – Maintenance Details

Partially degraded performance

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours



MGHPCC power work - Part 2 May 18

Completed
Scheduled for May 18, 2026 at 11:00 – May 22, 2026 at 00:14

Affects

Cannon Cluster

Maintenance from 11:00 AM to 12:14 AM

SLURM Scheduler - Cannon

Maintenance from 11:00 AM to 12:14 AM

Cannon Compute Cluster (Holyoke)

Maintenance from 11:00 AM to 12:14 AM

Boston Compute Nodes

Maintenance from 11:00 AM to 12:14 AM

GPU nodes (Holyoke)

Maintenance from 11:00 AM to 12:14 AM

seas_compute

Maintenance from 11:00 AM to 12:14 AM

Updates
  • Completed
    May 22, 2026 at 00:14

    The power work has completed successfully. All nodes have been returned to normal service.

  • In progress
    May 18, 2026 at 11:00

    Maintenance is now in progress.

  • Update
    May 18, 2026 at 11:00

    Rescheduled to May 18

  • Planned
    May 18, 2026 at 11:00

    Our Holyoke data center, MGHPCC, will be doing power work on Row 8A. This work, which is being completed over the course of 2 weeks, will bring another power feed online and increase power capacity.

    To do this work, we will need to idle half the nodes in Row 8A for the duration of the week. This means all partitions in this row will be at half capacity. Existing jobs should drain naturally, and no jobs should need to be canceled.

    The impacted partitions are:

    arguelles_delgado_h100
    bigmem
    bigmem_intermediate
    blackhole_gpu
    dvorkin
    eddy
    enos
    gershman
    gpu
    gpu_h200
    gpu_requeue
    hejazi
    hernquist_ice
    hoekstra
    hsph
    hsph_gpu
    huce_ice
    iaifi_gpu_requeue
    intermediate
    itc_cluster
    itc_gpu
    janson_sapphire
    joonholee
    jshapiro
    kempner
    kempner_priority
    kempner_dev
    kempner_eng
    kempner_h200_priority
    kempner_h100
    kempner_h100_priority
    kempner_h100_priority2
    kempner_h100_priority3
    kempner_h100_priority4
    kempner_interactive
    kovac
    kozinsky
    kozinsky_gpu
    kozinsky_requeue
    murphy_ice
    mweber_compute
    mweber_gpu
    olveczky_sapphire
    ortegahernandez_ice
    rivas
    sapphire
    seas_compute
    seas_gpu
    siag
    siag_combo
    test
    yao
    yao_priority
    zhuang