BwUniCluster2.0/First Steps: Difference between revisions

From bwHPC Wiki
Jump to navigation Jump to search
 
(9 intermediate revisions by 5 users not shown)
Line 1: Line 1:
<!--
== Command line interface ==
== Command line interface ==


Any work is to be done via command line interface. A quick guide can be found [https://indico.scc.kit.edu/indico/event/278/material/slides/10.pdf here].
Any work is to be done via command line interface. A quick guide can be found [https://indico.scc.kit.edu/indico/event/278/material/slides/10.pdf '''here'''].


== Software setup ==
== Software setup ==


Softwares provided by bwHPC have to be loaded according to your need via the [[BwUniCluster_2.0_SoftwareModules|software module system]].
Softwares provided by bwHPC have to be loaded according to your need via the [[BwUniCluster_2.0_Software|'''Software Module System''']].


== Computation ==
== Computation ==


Any kind of computation on any HPC cluster is done as [[bwUniCluster_2.0_Slurm_common_Features|batch jobs]].
Any kind of computation on any HPC cluster is done as [[bwUniCluster_2.0_Slurm_common_Features|'''batch jobs''']].


<!--
For setting up your environment to use compilers and installed software please visit:
For setting up your environment to use compilers and installed software please visit:
* [[BwUniCluster_Environment_Modules|bwUniCluster user environment]]
* [[BwUniCluster_Environment_Modules|bwUniCluster user environment]]
For guides on how to submit compute jobs to bwUniCluster please visit:
For guides on how to submit compute jobs to bwUniCluster please visit:
* [[bwUniCluster_2.0_Slurm_common_Features|bwUniCluster 2.0 batch jobs]]-->
* [[bwUniCluster_2.0_Slurm_common_Features|bwUniCluster 2.0 batch jobs]]
-->

== General Workflow of Running a Calculation ==

On a compute cluster, you do not simply run log in and your software, but you write a "job script" that contains all commands to run and process your job and send this into a waiting queue to be run on one of several hundred computers.

How this is done is described in a little more detail here: [[Running Calculations]]


== Get access to the cluster ==

Follow the registration process for the bwUniCluster. &rarr; [[Registration/bwUniCluster|bwUniCluster registration]]

== Login to the cluster ==

Setup service password and 2FA token and login to the cluster. &rarr; [[BwUniCluster2.0/Login|Login and security measures]]

== Transfer your data to the cluster ==

Get familiar with available file systems on the cluster. &rarr; [[BwUniCluster2.0/Hardware_and_Architecture|File Systems and Workspaces]]

Transfer your data to the cluster using appropriate tools. &rarr; [[Data Transfer|Data Transfer]]

== Find information about installed software and examples ==

Compiler, Libraries and application software are provided as software modules. Learn how to work with [[Environment_Modules|software modules]]. &rarr; [[BwUniCluster2.0/Software|Software]]

== Submit your application as a batch job ==

Get familiar with available nodes types on the cluster. &rarr; [[BwUniCluster2.0/Hardware and Architecture|Hardware and Architecture]]

Submit and monitor your jobs with Slurm commands. &rarr; [[BwUniCluster2.0/Slurm|Batch System Slurm]]

== Learn about Scaling your Job ==

How many compute-cores should my job use? This depends on the software and the problem you are trying to solve. But if you use too few cores, your computation may take much too long - if you use too many cores, they will not improve the speed of your computation and all you do by using more cores is wasting compute resources and energy.

If you run hundreds or thousands of similar calculations, you should look at this carefully before starting.

How to do this is described in: [[Scaling]]

== Acknowledge the cluster ==

Remember to mention the cluster in your publications. &rarr; [[BwUniCluster2.0/Acknowledgement|Acknowledgement]]

Latest revision as of 11:59, 11 September 2024


General Workflow of Running a Calculation

On a compute cluster, you do not simply run log in and your software, but you write a "job script" that contains all commands to run and process your job and send this into a waiting queue to be run on one of several hundred computers.

How this is done is described in a little more detail here: Running Calculations


Get access to the cluster

Follow the registration process for the bwUniCluster. → bwUniCluster registration

Login to the cluster

Setup service password and 2FA token and login to the cluster. → Login and security measures

Transfer your data to the cluster

Get familiar with available file systems on the cluster. → File Systems and Workspaces

Transfer your data to the cluster using appropriate tools. → Data Transfer

Find information about installed software and examples

Compiler, Libraries and application software are provided as software modules. Learn how to work with software modules. → Software

Submit your application as a batch job

Get familiar with available nodes types on the cluster. → Hardware and Architecture

Submit and monitor your jobs with Slurm commands. → Batch System Slurm

Learn about Scaling your Job

How many compute-cores should my job use? This depends on the software and the problem you are trying to solve. But if you use too few cores, your computation may take much too long - if you use too many cores, they will not improve the speed of your computation and all you do by using more cores is wasting compute resources and energy.

If you run hundreds or thousands of similar calculations, you should look at this carefully before starting.

How to do this is described in: Scaling

Acknowledge the cluster

Remember to mention the cluster in your publications. → Acknowledgement