1. Overview computing services#
The provision of IT and scientific computing related services for MPSD is split between
MPSD’s Computational Science Unit and overview of HPC computing resources
1.1. High Performance Computing (HPC)#
Researchers at MPSD have access to multiple HPC compute resources, which are hosted at the Max Planck Compute and Data Facility (MPCDF), the GWDG, and at MPSD itself.
Raven (since 2020)
Based on Intel Xeon IceLake-SP processors and Nvidia A100 GPUs. 1592 CPU compute nodes, 114,624 CPU-cores, 375 TB RAM (DDR4), 7.5 PFlop/s theoretical peak performance (FP64), 192 GPU-accelerated nodes providing 768 Nvidia A100 GPUs, 30 TB GPU RAM (HBM2). Shared MPG resource.
See Supercomputing services for details.
Viper (since June 2024)
768 compute nodes with AMD EPYC Genoa 9554 CPUs with 128 cores and at least 512 GB RAM per node. A subset of 609 nodes is equipped with 512 GB RAM (16 memory channels), 90 nodes with 768 GB RAM (24 memory channels), 66 nodes with 1024 GB RAM (16 memory channels), and 3 nodes with 2304 GB RAM (24 memory channels).
In addition, Viper will provide 228 GPU compute nodes (to be deployed in the course of 2024) each with 2 AMD Instinct MI300A APUs and 256 GB of high-bandwidth memory (HBM3). The nodes are interconnected with a NVIDIA/Mellanox NDR InfiniBand network using a fat-tree topology with two non-blocking islands, one for the CPU nodes (NDR200, 200 Gb/s), and one for the GPU nodes (NDR, 400 Gb/s).
See Supercomputing services for details.
-
Dedicated GPU-based HPC machine for PKS and MPSD with 72 GPU nodes. Each GPU node hosts 4 A100 GPUs, two Intel Xeon IceLake-SP 8360Y CPUs (72 cores in total) and 1 TB RAM.
See Ada documentation for details.
-
Hardware resources located at MPSD.
To get access to Raven, Viper and Ada, request an account here. For software on Raven, Viper and Ada please check the Raven user guide, Viper user guide and Ada user guide. To use Jupyter notebooks on Viper and Raven please check https://docs.mpcdf.mpg.de/doc/visualization/index.html .
For software on the MPSD HPC system please refer to Software.
1.2. SSH key-based authentication GWDG#
For many GWDG services, it is required to deposit ssh keys with GWDG (for example archiv, GitLab, HPC):
1.3. Training Opportunities#
Courses from the GWDG: https://www.gwdg.de/academy
Courses from the MPCDF: https://www.mpcdf.mpg.de/services/training