[Latest] at bottom.
23 May, 24 2:05:20 PM
Here’s a summary of the email:
Summary: End of Life for CentOS 7 and Debian 10
CentOS 7 and Debian 10 will reach their end of life on June 30, 2024.
Key Actions:
Alternatives:
+ContainerOS="Rocky8"
) – Available since July 2021.+ContainerOS="Rocky9"
) – Available since February 2023, similar to AlmaLinux 9.+ContainerOS="Debian11"
) – Currently in use on desktop machines.+ContainerOS="Debian12"
) – Will be available soon.Recommendations:
Cheers and happy computing, Your BAF Operators
24 Jun, 24 9:31:52 AM
Here’s a summary of the email:
Reminder: ContainerOS Removal on Friday, 28th June
This is a reminder that on Friday, 28th June, the ContainerOS for Debian10 and CentOS7 will be discontinued.
Key Points:
Cheers, Your BAF Operators
24 Jun, 24 10:56:31 PM
Here’s a summary of the email:
Summary: Emergency Cooling System Failure
About an hour ago, the redundant cooling system at Nußallee 12 began to fail, causing temperatures to rise.
Key Points:
Updates will be provided as the situation develops or if further systems need to be shut down.
Cheers, Your IT Team
24 Jun, 24 11:50:28 PM
Here’s a summary of the email:
Summary: Cooling System Status Update
The technical department inspected the cooling system and found one redundant device completely unresponsive and the other in error.
Key Points:
Cheers and have a good night, Oliver
25 Jun, 24 1:46:20 AM
Here’s a summary of the email:
Summary: Ongoing Cooling System Issues
The BAF team reports that the cooling system reset did not resolve the issue, and temperatures are still rising.
Key Points:
More updates will be provided in the morning.
Cheers, Oliver
25 Jun, 24 1:52:04 PM
Here’s a summary of the email:
Summary: Cooling System Update
The BAF team provides an update on the cooling system issues. One cooling machine remains broken, and the other one, despite showing 100% operational status, isn’t providing any cooling power.
Key Points:
Further updates will be provided as soon as available.
Cheers, BAF Operators
25 Jun, 24 2:23:45 PM
Here’s a summary of the email:
Summary: Upcoming CephFS Storage Upgrade
While technicians work on the cooling system (currently with reduced capacity), the BAF team will upgrade the CephFS storage (BAF User Data Directory / BUDDY, i.e., /cephfs, not home storage). This upgrade includes several major updates missed previously due to the need for extensive testing.
Key Points:
The team will keep users updated on the progress.
Cheers, BAF Operators
25 Jun, 24 4:48:16 PM:
Here’s a summary of the email:
Summary: Filesystem Upgrade Update
The BAF team has completed today’s filesystem upgrade.
Key Points:
Cheers, BAF Operators
25 Jun, 24 4:48:30 PM:
Here’s a summary of the email:
Summary: Filesystem Upgrade Update
The BAF team has completed today’s filesystem upgrade.
Key Points:
Cheers, BAF Operators
25 Jun, 24 7:07:37 PM:
Here’s a summary of the email:
Summary: /cephfs Access Issues on Desktops
BAF has received reports of access to /cephfs from desktops hanging under high activity. The issue does not affect access from within the cluster or interactive jobs.
Key Points:
Cheers, BAF Operators
27 Jun, 24 4:24:00 PM:
Here’s a summary of the email:
Summary: BAF System Upgrade Progress
The BAF team is prioritizing the OS upgrade for worker nodes to eventually upgrade the OS of file servers, aiming to restore /cephfs on desktops. This process will be gradual.
Key Points:
Cheers, BAF Operators
28 Jun, 24 4:02:11 PM:
Here’s a summary of the email:
Summary: High I/O Nodes and CephFS Update
BAF has reintroduced the first high I/O worker nodes with the new OS and updated CephFS clients. High-IO jobs are now running on four nodes, with interactive and batch job slots available.
Key Points:
Future Steps:
Note: Recovery will take several weeks due to other commitments, including a move to a new building. Users relying on /cephfs from desktops should use provided workarounds.
Cheers and have a nice weekend, BAF Operators
02 Jul, 24 5:24:06 PM:
Here’s a summary of the email:
Summary: BAF System Upgrade Update
The BAF team has upgraded most of the worker nodes, including many “medium” I/O nodes. However, due to low job activity, the stability of the cooling system under increased load and large-scale testing of the new OS haven’t been fully tested. Consequently, most “high” I/O nodes aren’t yet in regular use.
Key Points:
Happy computing, BAF Operators