FAS Research Computing - gpu_test partition down – Incident details

Partially degraded performance

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours


gpu_test partition down

Resolved
Degraded performance
Started about 2 years ago, lasted 43 minutes

Affected

Cannon Cluster

Degraded performance from 2:15 PM to 2:58 PM, operational from 2:15 PM to 2:58 PM

SLURM Scheduler - Cannon

Operational from 2:15 PM to 2:58 PM

Cannon Compute Cluster (Holyoke)

Operational from 2:15 PM to 2:58 PM

Boston Compute Nodes

Operational from 2:15 PM to 2:58 PM

GPU nodes (Holyoke)

Degraded performance from 2:15 PM to 2:58 PM

Updates
  • Resolved

    gpu_test is now operational and can be used.

    This incident has been resolved.

  • Investigating

    An InfiniBand switch has failed in the cabinet housing the gpu_test nodes.

    Please use gpu_requeue or gpu in the meantime.

    We will update this incident when we have an ETA.