All Systems Operational

Login ? Operational
90 days ago
99.67 % uptime
Today
Storage ? Operational
90 days ago
99.65 % uptime
Today
File transfer node ? Operational
90 days ago
100.0 % uptime
Today
high2,med2,low2 ? Operational
90 days ago
99.97 % uptime
Today
high,med,low ? Operational
90 days ago
99.97 % uptime
Today
bmh,bmm ? Operational
90 days ago
99.97 % uptime
Today
bigmemh,bigmemm ? Operational
90 days ago
99.97 % uptime
Today
bgpu ? Operational
90 days ago
99.97 % uptime
Today
gpuh,gpum ? Operational
90 days ago
99.97 % uptime
Today
Email ? Operational
90 days ago
100.0 % uptime
Today
Virtualization Operational
90 days ago
100.0 % uptime
Today
Proxmox Virtualization Nodes Operational
90 days ago
100.0 % uptime
Today
Ganetti cluster ? Operational
90 days ago
100.0 % uptime
Today
Slurm ? Operational
90 days ago
100.0 % uptime
Today
Software Operational
90 days ago
84.01 % uptime
Today
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Major outage
Partial outage
No downtime recorded on this day.
No data exists for this day.
had a major outage.
had a partial outage.
Mar 28, 2025

No incidents reported today.

Mar 27, 2025
Resolved - nas-5-3 is once again correctly serving data.
Mar 27, 11:59 PDT
Monitoring - nas-5-2 has been rebooted and verified to be back in service. It is taking a very high load of writes, so access will be sluggish until backed-up jobs catch up.
Mar 27, 09:39 PDT
Identified - nas-5-2 has crashed. Any home directories, or group directories, shared from there are currently hung. Admins are investigating.
Mar 27, 09:08 PDT
Mar 26, 2025

No incidents reported.

Mar 25, 2025

No incidents reported.

Mar 24, 2025

No incidents reported.

Mar 23, 2025

No incidents reported.

Mar 22, 2025

No incidents reported.

Mar 21, 2025

No incidents reported.

Mar 20, 2025

No incidents reported.

Mar 19, 2025

No incidents reported.

Mar 18, 2025

No incidents reported.

Mar 17, 2025

No incidents reported.

Mar 16, 2025

No incidents reported.

Mar 15, 2025

No incidents reported.

Mar 14, 2025
Resolved - Continued monitoring shows that disabling the broken hard drive has resolved the issue.
Mar 14, 09:55 PDT
Update - We are continuing to monitor for any further issues.
Mar 13, 12:42 PDT
Monitoring - A faulty hard drive has been identified as the most likely cause of the NAS outage. This drive has been forced offline and admins are monitoring.
Mar 13, 12:41 PDT
Investigating - Farm's nas-6-1 is currently having issues. Admins are investigating.
Mar 13, 11:37 PDT