Helix/Hardware

From bwHPC Wiki
Jump to navigation Jump to search

System Architecture

The bwForCluster Helix is a high performance supercomputer with high speed interconnect. Is composed of login nodes, compute nodes and parallel storage systems connected by fast data networks. It is connected to the Internet via Baden Württemberg's extended LAN BelWü.

Operating System and Software

  • Operating system: RedHat
  • Queuing system: Slurm
  • Access to application software: Environment Modules

Compute Nodes

AMD Nodes

Common features of alle AMD nodes:

  • Processors: 2 x AMD Milan EPYC 7513
  • Processor Frequency: GHz
  • Number of Cores per Node: 64
  • Local disk space: None
CPU Nodes GPU Nodes
Node Type cpu fat gpu4 gpu8
Quantity xxx xxx xxx xxx 2
Working Memory (GB) 256 2048 256 256 2048
Interconnect 1x HDR100 1x HDR100 2x HDR100 2x HDR200 4x HDR200

Intel Nodes

Some Intel nodes (Skylake and Cascade Lake) from the predecessor system will be integrated. Details will follow.

Storage Architecture

There is one storage system providing a large parallel file system based on IBM Spectrum Scale for $HOME, for workspaces, and for temporary job data.

Network

The components of the cluster are connected via two independent networks, a management network (Ethernet and IPMI) and an Infiniband fabric for MPI communication and storage access. The Infiniband backbone is a fully non-blocking fabric with 200 GB/s data speed. The compute nodes are connected with different data speeds according to the requirements of the configuration.