BwUniCluster2.0/Maintenance/2023-03: Difference between revisions

From bwHPC Wiki
Jump to navigation Jump to search
No edit summary
 
(8 intermediate revisions by the same user not shown)
Line 1: Line 1:
The following changes will be introduced during the maintenance interval between on 20.03.2023 (Monday) 08:30 and 24.03.2023 (Friday) 15:00.
The following changes were introduced during the maintenance interval between on 20.03.2023 (Monday) 08:30 and 24.03.2023 (Friday) 15:00.


The host key of the system will not change. You should not receive any warnings by your SSH client(s), but if there should be a warning or if you want to check that you are connecting to the correct system, you can verify the key hashes using the following list:
The host key of the system has not changed. You should not receive any warnings by your SSH client(s), but if there should be a warning or if you want to check that you are connecting to the correct system, you can verify the key hashes using the following list:


{|class="wikitable"
{|class="wikitable"
Line 23: Line 23:
= Hardware =
= Hardware =


* All firmware versions on all components will be upgraded.
* All firmware versions on all components were upgraded.


= Operating system =
= Operating system =


* The operating system will be upgraded from RHEL 8.4 EUS to RHEL 8.6 EUS. We recommend to re-compile all applications after the upgrade.
* The operating system was upgraded from RHEL 8.4 EUS to RHEL 8.6 EUS. We recommend to re-compile all applications after the upgrade.


* The Mellanox OFED InfiniBand Stack will be updated.
* The Mellanox OFED InfiniBand Stack was updated.


= Compilers, Libaries and Runtime Environments =
= Compilers, Libaries and Runtime Environments =
Line 36: Line 36:
= Userspace tools =
= Userspace tools =


* pigz and pbzip are not supported anymore. Please use pzstd instead.
* The hpc-workspace tools will be updated to version 1.4.0.


= Software Modules =
= Software Modules =


* The Lmod module system will be upgraded.
* The Lmod module system was upgraded.


* Old openmpi 4.0 modules will be removed. Please use openmpi 4.1.
* Old openmpi 4.0 modules were removed. Please use openmpi 4.1.


= Batch system =
= Batch system =


* The Slurm version will be upgraded to version 23.02.0.
* The Slurm version was upgraded to version 23.02.0.

* The Slurm partitions have changed:
** multiple and multiple_il maximum number of nodes is now 80
** the amount of available nodes in partition single has been increased
** the amount of available nodes in partition multiple has been decreased


= Storage =
= Storage =


* Lustre client, BeeGFS client and Spectrum Scale client will be updated.
* Lustre client, BeeGFS client and Spectrum Scale client were updated.


= Graphics stack =
= Graphics stack =


* The NVIDIA driver will be upgraded.
* The NVIDIA driver was upgraded.


= Containers =
= Containers =


* Enroot will be upgraded.
* Enroot was upgraded.


* Singularity will be replaced with its successor Apptainer. (the command 'singularity' will still work)
* Singularity was replaced with its successor Apptainer. (the command 'singularity' still works)


= JupyterHub =
= JupyterHub =


* Jupyterhub will be upgraded to version 3.1.1
* Jupyterhub was upgraded to version 3.1.1

* python3.9 is now used as the default

= Resource Limits on Login Nodes =


* After the maintenance the following per-user limits apply (via cgroups):
* python3.9 will used as the default
** 48 GB phyisical memory
** 400% CPU cycles (100% equals 1 thread)

Latest revision as of 14:39, 24 March 2023

The following changes were introduced during the maintenance interval between on 20.03.2023 (Monday) 08:30 and 24.03.2023 (Friday) 15:00.

The host key of the system has not changed. You should not receive any warnings by your SSH client(s), but if there should be a warning or if you want to check that you are connecting to the correct system, you can verify the key hashes using the following list:

Algorithm Hash (SHA256) Hash (MD5)
RSA p6Ion2YKZr5cnzf6L6DS1xGnIwnC1BhLbOEmDdp7FA0 59:2a:67:44:4a:d7:89:6c:c0:0d:74:ba:3c:c4:63:6d
ECDSA k8l1JnfLf1y1Qi55IQmo11+/NZx06Rbze7akT5R7tE8 85:d4:d9:97:e0:f0:43:30:6e:66:8e:d0:b6:9b:85:d1
ED25519 yEe5nJ5hZZ1YbgieWr+phqRZKYbrV7zRe8OR3X03cn0 42:d2:0d:ab:87:48:fc:1d:5d:b3:7c:bf:22:c3:5f:b7

Hardware

  • All firmware versions on all components were upgraded.

Operating system

  • The operating system was upgraded from RHEL 8.4 EUS to RHEL 8.6 EUS. We recommend to re-compile all applications after the upgrade.
  • The Mellanox OFED InfiniBand Stack was updated.

Compilers, Libaries and Runtime Environments

Userspace tools

  • pigz and pbzip are not supported anymore. Please use pzstd instead.

Software Modules

  • The Lmod module system was upgraded.
  • Old openmpi 4.0 modules were removed. Please use openmpi 4.1.

Batch system

  • The Slurm version was upgraded to version 23.02.0.
  • The Slurm partitions have changed:
    • multiple and multiple_il maximum number of nodes is now 80
    • the amount of available nodes in partition single has been increased
    • the amount of available nodes in partition multiple has been decreased

Storage

  • Lustre client, BeeGFS client and Spectrum Scale client were updated.

Graphics stack

  • The NVIDIA driver was upgraded.

Containers

  • Enroot was upgraded.
  • Singularity was replaced with its successor Apptainer. (the command 'singularity' still works)

JupyterHub

  • Jupyterhub was upgraded to version 3.1.1
  • python3.9 is now used as the default

Resource Limits on Login Nodes

  • After the maintenance the following per-user limits apply (via cgroups):
    • 48 GB phyisical memory
    • 400% CPU cycles (100% equals 1 thread)