When to use a HPC Cluster
Revisions compared: H Winkhardt (edit summary: Formatting) and H Winkhardt (no edit summary)
Revision as of 11:51, 17 June 2025
I have calculations to do - should I try using one of the bwHPC clusters for my tasks?
When to use HPC computers
You can benefit from an HPC cluster if:
- You can run many separate calculations in parallel
- Software can efficiently divide your problem into many smaller problems that run in parallel
- You need more RAM (memory) than your computer can provide
- You need more GPUs and have no other source
- You need to process large amounts of data that do not fit on your computer
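The first case in the list, many separate calculations that do not depend on each other, can be sketched in Python. This is only an illustration on a single machine: `run_one` is a hypothetical stand-in for one calculation, and the worker count of 4 is an arbitrary assumption. On a cluster, the scheduler would typically distribute such calculations as separate jobs instead.

```python
from concurrent.futures import ProcessPoolExecutor


def run_one(parameter: int) -> int:
    """Hypothetical stand-in for one independent calculation."""
    return sum(i * i for i in range(parameter))


def run_serial(parameters):
    """Run the calculations one after another on a single core."""
    return [run_one(p) for p in parameters]


def run_parallel(parameters, workers=4):
    """Run the same calculations in separate processes.

    This only pays off because no calculation needs the result of another.
    """
    with ProcessPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_one, parameters))


if __name__ == "__main__":
    parameters = [10_000, 20_000, 30_000, 40_000]
    # Both approaches give the same results; only the elapsed time differs.
    assert run_parallel(parameters) == run_serial(parameters)
    print(run_parallel(parameters))
```

The more such independent calculations you have, the more compute cores you can keep busy at the same time.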
When not to use HPC computers
You will probably not benefit much from using HPC when:
- Your calculation runs in serial (only one compute core is used, nothing can run in parallel) and you have only a few calculations
- Your workflow requires that you run one calculation, then retrieve data to analyze it locally, then run the next calculation
Exception: if you need more RAM or more GPUs than you have available elsewhere (see "When to use HPC computers"), you may have to use the clusters anyway.
Misconceptions
"Everything will be faster"
The performance of a single compute core is not higher than that of your typical desktop computer. The main things we can offer are:
- Many compute cores
- Much RAM (memory)
- GPUs
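Why more cores do not automatically mean "faster" can be estimated with Amdahl's law, which is not part of the original page and is added here as an illustration. The sketch assumes that a fixed fraction of the run time can be parallelized perfectly and ignores communication overhead, so real speedups are usually lower. The function name and the example fractions are hypothetical.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Ideal speedup when only part of a program can run in parallel.

    parallel_fraction: share of the serial run time that parallelizes (0 to 1).
    cores: number of compute cores used.
    """
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if cores < 1:
        raise ValueError("cores must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / cores)


if __name__ == "__main__":
    # Compare a mostly parallel program with a mostly serial one.
    for fraction in (0.95, 0.50):
        for cores in (1, 8, 64):
            speedup = amdahl_speedup(fraction, cores)
            print(f"parallel fraction {fraction:.2f}, {cores:3d} cores: "
                  f"speedup {speedup:.2f}x")
```

A purely serial calculation (parallel fraction 0) gains nothing from additional cores, which is the point of this misconception.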
"It's just a computer, I know how to use a computer"
While the clusters are indeed "just" many Linux computers, they are also multi-million-euro instruments. They have many similarities with a Linux desktop, but in some respects it is better to think of them as you would of other expensive instruments you use in your research. HPC clusters contain hardware components you may never have encountered in the consumer market, and they typically have more than one CPU socket, which among other things makes memory management more complicated.
"I want to use bwHPC to host servers"
On the bwHPC clusters, you submit jobs that are scheduled and processed over time. The clusters are not the place for permanent server hosting. If you need self-service virtual machines, check out the bwCloud (https://www.bw-cloud.org/).
What are the Costs?
Monetary
No cost to you as the end user.
(The HPC systems are, of course, bought for millions of euros, and the power cost of running calculations is of the same order of magnitude as the purchase price.)
Effort / Time
There is quite a learning curve before you can start calculating.
- Linux shell knowledge is a major part
- HPC-specific knowledge:
  - Software module system: software supplied by the cluster maintainers (see below)
  - How to use the scheduler and write job scripts: how to send calculations to a computer on the cluster (see below)
- Wait times: a good cluster is always busy. Expect a waiting time between 1 hour and 2 days until your calculations start
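Whether the wait time is worth it can be estimated with simple arithmetic. The following sketch is a rough back-of-the-envelope model, not an official estimate: the function names are hypothetical, the speedup is something you would have to measure or guess for your own software, and the learning effort described above is not included.

```python
def cluster_turnaround_hours(local_hours: float, speedup: float,
                             wait_hours: float) -> float:
    """Queue wait plus run time on the cluster, in hours.

    local_hours: run time on your own computer.
    speedup: assumed factor by which the cluster shortens the run time.
    wait_hours: time spent waiting in the queue (the page suggests 1 to 48).
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    if local_hours < 0 or wait_hours < 0:
        raise ValueError("times must not be negative")
    return wait_hours + local_hours / speedup


def cluster_is_faster(local_hours: float, speedup: float,
                      wait_hours: float) -> bool:
    """True if results arrive sooner from the cluster than from your computer."""
    return cluster_turnaround_hours(local_hours, speedup, wait_hours) < local_hours


if __name__ == "__main__":
    # Hypothetical numbers: a long and a short calculation, worst-case wait.
    for local_hours in (100.0, 2.0):
        total = cluster_turnaround_hours(local_hours, speedup=20.0, wait_hours=48.0)
        print(f"local {local_hours:.0f} h, cluster incl. wait {total:.1f} h, "
              f"faster on cluster: {cluster_is_faster(local_hours, 20.0, 48.0)}")
```

Under these assumptions, long calculations absorb even a two-day wait, while short ones are finished locally before the cluster job has started.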
Compute Workflow
A short description of how running calculations works can be found under Running Calculations; it can give you a general idea of the workflow.
Software
Basic usage is very simple: you run module load module_name and use the software. Important documentation and examples are also built into the modules.
The usage of the software on the cluster is described in Environment Modules.
Still With Us?
If you feel your calculations meet the requirements and you will save a lot of time despite some learning overhead, proceed to the Registration page.
