At 00:57 CEST on Monday, May 29th a power outage caused the cooling system at Ångström Laboratory to shut down, leading to a rapid increase in temperature within the compute hall. To prevent further temperature escalation and safeguard the equipment, all systems in the compute hall were forcefully powered off. The cooling system was restored at approximately 05:00.
Due to the elevated temperatures experienced during the outage, additional inspections are required to ensure the compute hall, compute, storage, and network hardware are functioning as expected. Currently, we have identified an issue with one of the two UPS units.
Throughout the day, we will provide regular updates regarding the progress of the recovery efforts and the status of the affected equipment. We are working diligently to resolve any issues and restore normal operations as soon as possible.
Update 2023-05-29 11:00
The compute hall is fully operational again. We are now working on restoring systems.