Approximately 100 compute nodes in row 7c (holy7cxxxxx) came back up at the wrong clock speed. This is a known issue with some older nodes that can occur after power dip or loss. These nodes will need to be physically reset in the chassis to clear this issue.
Jobs are running on these nodes so they will need to be drained of jobs before we can do this. As such we are marking the cluster partially degraded.
However, please note that this is a fraction of the total nodes and should not impact the average user.
This affects some specific owned partitions:
doshi-velez - 2 of 2 nodes
shakhnovich 2 of 3 nodes
blackhole/blackhole_priority - some % nodes
hernquist - some % nodes
pehlevan - some % nodes
seas_compute - some % nodes
tambe - some % nodes
shared - some % nodes