FAS Research Computing - Slurm responding slowly - WIP – جزئیات حادثه

همه سیستم‌ها عملیاتی هستند

Status page for the Harvard FAS Research Computing cluster and other resources.

Cluster Utilization (VPN and FASRC login required): Cannon | FASSE


Please scroll down to see details on any Incidents or maintenance notices.
Monthly maintenance occurs on the first Monday of the month (except holidays).

GETTING HELP
Documentation: https://docs.rc.fas.harvard.edu | Account Portal https://portal.rc.fas.harvard.edu
Email: rchelp@rc.fas.harvard.edu | Support Hours


The colors shown in the bars below were chosen to increase visibility for color-blind visitors.
For higher contrast, switch to light mode at the bottom of this page if the background is dark and colors are muted.

Slurm responding slowly - WIP

حل شد
افت عملکرد
آغاز شد بیشتر از 1 سال قبلطول کشید 7 روز

تحت تأثیر

Cannon Cluster

عملیاتی از 3:06 PM تا 3:16 PM, افت عملکرد از 3:16 PM تا 4:30 PM

SLURM Scheduler - Cannon

عملیاتی از 3:06 PM تا 3:16 PM, افت عملکرد از 3:16 PM تا 4:30 PM

Cannon Compute Cluster (Holyoke)

عملیاتی از 3:06 PM تا 3:16 PM, افت عملکرد از 3:16 PM تا 4:30 PM

Boston Compute Nodes

عملیاتی از 3:06 PM تا 3:16 PM, افت عملکرد از 3:16 PM تا 4:30 PM

GPU nodes (Holyoke)

عملیاتی از 3:06 PM تا 3:16 PM, افت عملکرد از 3:16 PM تا 4:30 PM

seas_compute

عملیاتی از 3:06 PM تا 3:16 PM, افت عملکرد از 3:16 PM تا 4:30 PM

به‌روزرسانی‌ها
  • حل شد
    حل شد

    After another patch of the scheduler, Slurm should be much more stable now

  • به‌روزرسانی
    به‌روزرسانی

    We are still investigating this incident, we are working with SchedMD to find a solution.

  • در حال بررسی
    در حال بررسی
    We are currently investigating this incident.
  • شناسایی شد
    شناسایی شد
    We are continuing to work on a fix for this incident.
  • در حال بررسی
    در حال بررسی
    We are currently investigating this incident.