Difference between revisions of "Software Modules Lmod"

From bwHPC Wiki
(Lmod commands)
 
(175 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
Software Module System - Lmod
 
Software Module System - Lmod
   
= '''Preface''' =
+
= Preface =
   
This guide describes the software environment and tools used to run applications on the JUSTUS 2 system. It is intended as a general overview of and introduction to software management on JUSTUS 2, both for new users and for experienced users coming from sites that deploy a different Environment Modules package, such as the JUSTUS 1 system.
+
This guide provides a general overview of and introduction to software management via Lmod on JUSTUS 2, both for new users and for experienced users coming from sites that deploy a different environment modules system.
   
  +
= Scientific Software management through the module system Lmod =
= '''For users migrating from JUSTUS 1 to JUSTUS 2''' =
 
  +
The following sections cover the basic module commands needed to find and load the scientific applications installed on JUSTUS 2.<br>
Several software implementations for the dynamic management of environment variables exist throughout the HPC world.
 
  +
JUSTUS 1 hosted the Tcl-based Environment Modules package, while JUSTUS 2 uses the Lua-based module system Lmod (https://lmod.readthedocs.io/en/latest/#).
 
  +
JUSTUS 2 runs the Linux operating system. The standard Linux packages are installed on the front-end nodes (login and visualisation nodes).
Despite the different software solutions, the difference in module handling remains small from the user's point of view. The biggest change is the partial software hierarchy introduced in the module system on JUSTUS 2. Besides that, Lmod extends the basic environment modules with predefined shortcuts and a more powerful search capability.
 
  +
The '''scientific software is accessible via the so-called module system'''.<br>
<br>
 
  +
'''To find and load a scientific application, one needs to use module commands.''' For example, the following command sequence:
<br>
 
=== Lmod commands ===
 
Most module commands and their functionality remain the same on JUSTUS 2, although [https://lmod.readthedocs.io/en/latest/# Lmod] offers new functions, for example ''module spider'' as an extension of the "old" ''module avail'' functionality.
 
<br>
 
Thus, e.g. the default Gaussian module is initialized by
 
 
<pre>
 
<pre>
$ module load chem/gaussian
+
module load chem/gaussian/g16.C.01
  +
module list
  +
module help chem/gaussian/g16.C.01
  +
cp $GAUSSIAN_EXA_DIR/bwforcluster-gaussian-example.sbatch .
 
</pre>
 
</pre>
which is pretty much the same as on JUSTUS 1
 
or by using the Lmod shorthand version
 
 
<pre>
 
<pre>
  +
1. loads the module with gaussian software package of version 16, revision C.01
$ ml chem/gaussian
 
  +
2. prints out the list of currently loaded modules
  +
3. provides the user help for the particular gaussian module
  +
4. copies the template batch script, which was specifically designed for submitting g16 jobs to the SLURM workload manager on JUSTUS 2.
 
</pre>
 
</pre>
  +
<br>
 
  +
The full list of Lmod module commands is available online at https://lmod.readthedocs.io/en/latest/010_user.html
 
  +
=== Why do we use a module system? Modules load scientific software ===
<br>
 
  +
The module system on JUSTUS 2 is managed by '''Lmod''' (https://lmod.readthedocs.io/en/latest/).
<br>
 
  +
The module system incorporates the majority of the computational software available. This includes, among others, compilers, MPI libraries, numerical libraries, computational
Rather than differences in module commands, JUSTUS 1 users have to get a grasp of the modulefile hierarchy built into the JUSTUS 2 software layout.
 
  +
chemistry packages, Python-specific libraries, etc. <br><br>'''Programs managed by the module system are not usable by default. A module has to be "loaded" before its programs become executable.'''
The layout is no longer flat as on JUSTUS 1; instead, JUSTUS 2 adopts a partial module hierarchy for MPI modules.
 
  +
<br><br>
  +
The '''module system''' provides, among others, the following '''functionalities''':
  +
  +
'''1.''' When '''loading a module''', it automatically '''sets the appropriate environment variables required by the application to run properly'''.<br>
  +
'''2.''' It also takes care of '''module dependencies'''. It either loads all additional modules required for the application, or it informs the user if additional dependency modules need to be loaded manually.<br>
  +
'''3.''' It '''prevents loading of modules''' that '''conflict with each other''' and could '''cause instability or unexpected behavior'''.<br>
 
<br>
 
<br>
   
  +
One of the main functions of Lmod is '''module load''', which makes the variety of software packages pre-installed on the cluster accessible. '''This takes only a single command''':
=== Consequences of the partial module hierarchy for mpi modules ===
 
Due to the flat layout of software modules on JUSTUS 1, the ''module avail'' command showed all pre-installed software packages, including those that directly depend on other modules.
 
This is no longer true on JUSTUS 2, in particular for MPI modules. MPI modules remain invisible to a user running ''module avail'' until a compiler module has been loaded. Once a compiler module has been activated, the corresponding MPI modules, i.e. those built with that particular compiler, become visible.<br>
 
 
<br>
 
<br>
E.g., with an initially empty list of loaded modules, the module command
 
 
<pre>
 
<pre>
$ module avail
+
module load <module_name>
 
</pre>
 
</pre>
  +
The activation is realized by '''dynamic modification of the user's shell environment'''. This includes '''adding new paths''' to the bin directories of the specific software '''to the PATH environment variable'''. Typically, Lmod modifies '''PATH''' and '''LD_LIBRARY_PATH''' and '''sets new variables''' such as '''<SOFTWARE_NAME>_EXA_DIR''', which contains the path to the directory with examples for the specific software.
or its shorthand analogue
 
<pre>
+
<br><br>
  +
Example: compare the content of the $PATH environment variable before and after loading the gaussian module:
$ ml av
 
  +
<br>Before loading the gaussian module:
  +
<pre>
  +
echo $PATH
  +
/home/software/common/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin
 
</pre>
 
</pre>
displays no mpi module available. After running
 
 
<pre>
 
<pre>
  +
which g16
ml compiler/intel/19.1
 
  +
/usr/bin/which: no g16 in (/home/software/common/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin)
ml av
 
 
</pre>
 
</pre>
MPI packages compatible with the Intel 19.1 compiler become visible
 
 
<pre>
 
<pre>
  +
ml chem/gaussian
------------ /opt/bwhpc/common/modulefiles/Compiler/intel/19.1 --------------------
 
  +
</pre>
mpi/impi/2019.7 mpi/openmpi/4.0
 
  +
After gaussian is loaded:
  +
<pre>
  +
echo $PATH
  +
/.../chem/gaussian/g16.C.01/x86_64-Intel-avx2-source/g16/bsd:/.../chem/gaussian/g16.C.01/x86_64-Intel-avx2-source/g16:/home/software/common/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin
  +
</pre>
  +
<pre>
  +
which g16
  +
/.../chem/gaussian/g16.C.01/x86_64-Intel-avx2-source/g16/g16
 
</pre>
 
</pre>
in the list of the available software.
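The effect of loading and unloading on the search path can be imitated in plain shell. The following is only a simplified sketch: the directory in demo_bin is a hypothetical install location, and the real Lmod bookkeeping is more involved than a prefix being added and stripped.

```shell
# Hypothetical install directory, used only for illustration.
demo_bin="/opt/demo/chem/example/1.0/bin"

# Remember the search path as it is before the "load".
path_before="$PATH"

# "Loading": the software's bin directory is put in front of the search path.
demo_path="$demo_bin:$path_before"
echo "after load:   $demo_path"

# "Unloading": the same leading entry is stripped again.
path_restored="${demo_path#"$demo_bin":}"
echo "after unload: $path_restored"
```

The sketch works on a copy (demo_path) instead of PATH itself, so running it does not change the current session.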
 
   
  +
=== Basic functions of Lmod and commands ===
= '''Lmod Module System on JUSTUS 2''' =
 
  +
<br>The module system has other useful capabilities than just managing the environment.<br><br>
  +
''' (i) to list available software'''<br>
  +
<pre>
  +
module available
  +
</pre>
  +
'''<br> (ii) to load (activate) modules (particular software)'''<br>
  +
<pre>
  +
module load
  +
</pre>
  +
'''<br> (iii) to unload (inactivate) modules (particular software)'''<br>
  +
<pre>
  +
module unload
  +
</pre>
  +
<pre>
  +
module purge
  +
</pre>
  +
''' (iv) to list currently loaded modules'''<br>
  +
<pre>
  +
module list
  +
</pre>
   
  +
'''<br> (v) to search through all packages within the module system'''<br>
The JUSTUS 2 system uses the [https://lmod.readthedocs.io/en/latest/# Lmod (the Lua Based Module System)] environment management package to support dynamic modification of the user environment via modulefiles. Each modulefile contains the information needed to configure the shell, i.e. its environment variables, to run a particular software application. To make major changes to the user's environment, such as modifications of the PATH, LD_LIBRARY_PATH, CC or FC variables when, e.g., switching to a different compiler, use the appropriate module command or sequence of module commands instead of reconfiguring the shell by hand.
 
<br>
 
<br>
 
E.g. in order to activate the default vasp module run a single command:
 
 
<pre>
 
<pre>
$ module load chem/vasp
+
module available
 
</pre>
 
</pre>
or its shorthand variant
 
 
<pre>
 
<pre>
  +
module spider
$ml chem/vasp
 
 
</pre>
 
</pre>
To activate the Intel MPI module built with the corresponding Intel compiler, a sequence of two module commands
 
 
<pre>
 
<pre>
$ module load compiler/intel
+
module keyword
  +
</pre>
$ module load mpi/impi
 
  +
'''<br> (vi) to provide specific help information for a particular module<br>'''
  +
<pre>
  +
module help
  +
</pre>
  +
<pre>
  +
module whatis
 
</pre>
 
</pre>
has to be run.
 
   
=== Online user guide of Lmod ===
+
== Elementary Lmod Commands ==
  +
The module commands below can be used interactively (in the current shell session) as well as in shell scripts, in particular in the sbatch scripts used to submit computational jobs to SLURM (the workload manager on JUSTUS 2).
The complete user guide can be found on the Lmod website https://lmod.readthedocs.io/en/latest/010_user.html
 
  +
=== List of available software modules ===
  +
<pre>
  +
module available
  +
</pre>
  +
alternatively also in a short form
  +
<pre>
  +
ml av
  +
</pre>
   
  +
=== Module naming convention: Category/Name/Version ===
=== Which shells support module commands? ===
 
  +
On JUSTUS 2 (as on other bwHPC sites), the software modules are grouped into
So far, Bash is the only shell supported on JUSTUS 2 for interpreting module commands.
 
  +
several categories:
  +
<!--* [[:Category:Chemistry_software|chem]]-->
  +
* chem
  +
<!--* [[:Category:Compiler_software|compiler]]-->
  +
* compiler
  +
<!--* [[:Category:Debugger_software|devel]]-->
  +
* devel
  +
<!--* [[:Category:Libraries|lib]]-->
  +
* lib
  +
<!--* [[:Category:Numerical libraries|numlib]]-->
  +
* numlib
  +
<!--* [[:Category:Physics software|phys]]-->
  +
* phys
  +
<!--* [[:Category:System software|system]]-->
  +
* system
  +
<!--* [[:Category:Visualization|vis]]-->
  +
* vis
  +
<!--* [[:Category:Mathematical ecosystems|math]]-->
  +
* math
   
  +
This makes it easier for
=== Basic Module System tasks ===
 
  +
users to get oriented within the module system. For example Gaussian 16 program allowing
Lmod offers more than 25 sub-commands plus various options to manage the modulefile system installed on JUSTUS 2. The large majority of users will use only a few of them. A complete list of module sub-commands can be displayed by entering the "module --help" command or found in the [https://lmod.readthedocs.io/en/latest/010_user.html Lmod online documentation]. The following text lists only a few of them.
 
  +
to calculate electronic structure of molecules is found in the category chem (together with other
  +
programs used by theoretical chemists).
  +
<br>
  +
Each category is further divided by software package, and each package by software version.
  +
<br>
  +
The full '''name of a module''' always consists of three parts '''category, name, and version''' separated by slash '''category/name/version''' . Consequently, the
  +
full name of the module with the Gaussian 16 package is '''chem/gaussian/g16.C.01'''. Analogously, the GNU compiler version 10.2 is addressed as '''compiler/gnu/10.2'''.
  +
<br><br>
  +
See, for example, all modules of category chem with:
  +
<pre>
  +
ml av chem
  +
</pre>
  +
<pre>
  +
---------------------------------------------- /opt/bwhpc/common/modulefiles/Core----------------------------------------------------------------------------------
  +
chem/adf/2019.304 chem/gaussian/g16.C.01 chem/molpro/2020.1 (D) chem/orca/5.0.1-xtb-6.4.1 chem/tmolex/4.6 (D)
  +
chem/ams/2020.101 chem/gaussview/6.1.1 chem/namd/2.14 chem/orca/5.0.1 (D) chem/turbomole/7.4.1
  +
chem/ams/2020.103 chem/gromacs/2020.2 chem/nbo/6.0.18_i4 chem/quantum_espresso/6.5 chem/turbomole/7.5 (D)
  +
chem/ams/2021.102 (D) chem/gromacs/2020.4 chem/nbo/6.0.18_i8 (D) chem/quantum_espresso/6.7_openmp-5 chem/vasp/5.4.4.3.16052018
  +
chem/cfour/2.1_openmpi chem/gromacs/2021.1 (D) chem/openbabel/3.1.1 chem/quantum_espresso/6.7 (D) chem/vmd/1.9.3
  +
chem/cp2k/7.1 chem/jmol/14.31.3 chem/openmolcas/19.11 chem/schrodinger/2020-2 chem/xtb/6.3.3
  +
chem/cp2k/8.0_devel (D) chem/lammps/stable_3Mar2020 chem/openmolcas/21.06 (D) chem/schrodinger/2021-1 (D) chem/xtb/6.4.1 (D)
  +
chem/dalton/2020.0 chem/molcas/8.4 chem/openmolcas/21.10 chem/siesta/4.1-b4
  +
chem/dftbplus/20.2.1-cpu chem/molden/5.9 chem/orca/4.2.1-xtb-6.3.3 chem/siesta/4.1.5 (D)
  +
chem/gamess/2020.2 chem/molpro/2019.2.3 chem/orca/4.2.1 chem/tmolex/4.5.2
  +
</pre>
  +
or, analogously, all available versions of intel compilers:
  +
<br>
  +
<pre>
  +
ml av compiler/intel
  +
</pre>
  +
<pre>
  +
--------------------------------------------------------------------------------/opt/bwhpc/common/modulefiles/Core--------------------------------------------------------------
  +
compiler/intel/19.0 compiler/intel/19.1 compiler/intel/19.1.2 (D)
  +
</pre>
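The category/name/version convention can be taken apart with plain shell string handling. The function below is an illustrative sketch, not part of Lmod; the name split_module_name and the three variable names are hypothetical.

```shell
# Split "category/name/version" into three shell variables.
# If the version is omitted (e.g. "compiler/intel"), mod_version stays empty.
split_module_name() {
    IFS=/ read -r mod_category mod_name mod_version <<EOF
$1
EOF
}

split_module_name "chem/gaussian/g16.C.01"
echo "category=$mod_category name=$mod_name version=$mod_version"
```

An empty mod_version indicates that the identifier relies on whatever version is currently marked as default.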
   
==== Lmod shortcuts ====
+
=== Load specific software ===
Lmod extends the basic environment modules by adding simple shortcuts. The ''ml'' shortcut can be used in place of ''module'', ''av'' instead of ''avail'', a bare ''ml'' instead of ''module list'', and a leading ''-'' in place of ''unload''.<br>

For instance, the following combinations are interchangeable:
 
 
<pre>
 
<pre>
module list <==> ml
+
module load <module_name>
 
</pre>
 
</pre>
  +
or shortly
 
<pre>
 
<pre>
  +
ml <module_name>
module avail <==> ml av
 
 
</pre>
 
</pre>
  +
For example, to load Gaussian version 16, one has to run
 
<pre>
 
<pre>
  +
ml chem/gaussian/g16.C.01
module load compiler/intel <==> ml compiler/intel
 
 
</pre>
 
</pre>
  +
=== List of the loaded modules ===
 
<pre>
 
<pre>
  +
module list
module unload compiler/intel <==> ml -compiler/intel
 
  +
</pre>
  +
or simply
  +
<pre>
  +
ml
 
</pre>
 
</pre>
   
==== Loading and unloading modulefiles ====
+
=== Default module version ===
  +
If multiple versions of a software package exist, one version is always predetermined as the '''default'''
To load and unload a module of a specific package, use "module load category/package" and "module unload category/package", respectively. Thus, e.g., to load the default version of VASP, which belongs to the category chem, enter
 
  +
version. To address the default version, ''version'' can be omitted from the module identifier.
  +
For example, the default Intel compiler module is loaded via
 
<pre>
 
<pre>
  +
ml compiler/intel
$ module load chem/vasp
 
 
</pre>
 
</pre>
and to unload the package use
 
 
<pre>
 
<pre>
  +
ml
$ module unload chem/vasp
 
  +
  +
Currently Loaded Modules:
  +
1) compiler/intel/19.1.2
 
</pre>
 
</pre>
   
  +
=== Unload a specific software from the environment ===
==== Complete unloading of all modulefiles ====
 
  +
<pre>
To clean the environment of all entries set by Lmod, use the "module purge" command.
 
  +
module unload <module_name>
  +
</pre>
  +
or equivalently
  +
<pre>
  +
ml -<module_name>
  +
</pre>
  +
For example, to unload the previously loaded VASP module chem/vasp/5.4.4.3.16052018,
  +
use
  +
<pre>
  +
ml -chem/vasp/5.4.4.3.16052018
  +
</pre>
  +
=== Unload all the loaded modules ===
 
<pre>
 
<pre>
 
$ module purge
 
$ module purge
 
</pre>
 
</pre>
  +
or
Beware that "module purge" does not always restore the environment variables as they existed before the first "module load" command was entered. That is, entries in PATH or other environment variables that originally existed in the environment might be removed by "module purge" if they duplicate values set by any modulefile that was loaded.
 
 
==== What is Loaded Now? ====
 
To see which modules are currently loaded in your environment, enter the command "module list".
 
 
<pre>
 
<pre>
  +
ml purge
$ module list
 
 
</pre>
 
</pre>
  +
=== Providing a specific help for a particular module ===
 
==== Which modulefiles/software is available? ====
 
To see what modulefiles are available on JUSTUS 2, you can enter the following command:
 
 
<pre>
 
<pre>
$ module avail
+
module help <module_name>
 
</pre>
 
</pre>
  +
or
or alternatively
 
 
<pre>
 
<pre>
  +
ml help <module_name>
$ ml av
 
 
</pre>
 
</pre>
  +
=== Software job examples and batch script templates ===
which comes with Lmod to save typing.
 
  +
The majority of software modules provide examples, including job queueing system examples (batch scripts)
  +
for SLURM. The full path to the directory with examples is normally contained in the <SOFTWARE_NAME>_EXA_DIR
  +
environment variable. For example, the location of the examples for Gromacs-2021.1 can be displayed after loading the module:
  +
<pre>
  +
ml chem/gromacs/2021.1
  +
</pre>
  +
<pre>
  +
echo $GROMACS_EXA_DIR/
  +
/opt/bwhpc/common/chem/gromacs/2021.1-openmpi-4.0/bwhpc-examples/
  +
</pre>
  +
<pre>
  +
ls $GROMACS_EXA_DIR
  +
GROMACS_TestCaseA Performance-Tuning-and-Optimization-of-GROMACS.pdf README
  +
</pre>
  +
<pre>
  +
ls $GROMACS_EXA_DIR/GROMACS_TestCaseA/
  +
gromacs-2021.1_gpu.slurm gromacs-2021.1.slurm ion_channel.tpr
  +
</pre>
  +
Users may make a copy of these examples and use it as a template for their own job scripts:
  +
<pre>
  +
cp $GROMACS_EXA_DIR/GROMACS_TestCaseA/gromacs-2021.1_gpu.slurm .
  +
</pre>
  +
'''Note: All the batch script examples are fully functional, i.e. the example scripts can be submitted directly to the queuing system to launch a test job.'''
  +
Typically, the scripts launch a short, simple calculation of the given software. Moreover, most of the sbatch scripts contain
  +
general submit instructions, as well as hints specific to the particular program.
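Copying the examples can be wrapped in a small helper that first checks whether the examples directory is actually set, which catches the case of a forgotten module load. This is a sketch only; the function name copy_examples is hypothetical and not provided by the cluster.

```shell
# copy_examples SRC_DIR [DEST_DIR]
# Copies the content of an examples directory (e.g. "$GROMACS_EXA_DIR")
# into DEST_DIR (default: current directory). DEST_DIR must already exist.
copy_examples() {
    src="$1"
    dest="${2:-.}"
    if [ -z "$src" ] || [ ! -d "$src" ]; then
        echo "examples directory not set or missing; is the module loaded?" >&2
        return 1
    fi
    cp -r "$src"/. "$dest"/
}
```

On the cluster, a call such as copy_examples "$GROMACS_EXA_DIR" after loading the Gromacs module would place the templates in the current directory.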
   
==== Module specific help ====
+
=== Searching through module names ===
To access the module specific help, use the "module help" command. For example, to see the module help associated with the default VASP module, enter this command:
 
 
<pre>
 
<pre>
$ module help chem/vasp
+
module available <module_name>
 
</pre>
 
</pre>
  +
or shortly
It is always good practice to read through all the information provided by a specific module help, since it serves as an important communication channel between the user and the person who installed the software.
 
  +
<pre>
  +
ml av <module_name>
  +
</pre>
  +
For example, searching for python modules is realized via
  +
<pre>
  +
ml av python
  +
</pre>
  +
with the following output:
  +
<pre>
  +
----------------------------------------------------------------------------------- /opt/bwhpc/common/modulefiles/Core ------------------------------------------------------------------------------------
  +
devel/python/3.8.3 lib/python_matplotlib/3.2.2_numpy-1.19.0_python-3.8.3 numlib/python_numpy/1.19.0_python-3.8.3 numlib/python_scipy/1.5.0_numpy-1.19.0_python-3.8.3
   
  +
Use "module spider" to find all possible modules and extensions.
== Semi hierarchical layout of modules on JUSTUS 2 ==
 
  +
Use "module keyword key1 key2 ..." to search for all possible modules matching any of the "keys"
  +
</pre>
   
  +
=== What does this software do? Command when you don't know this software ===
=== Module hierarchy in Lmod ===
 
  +
<pre>
The software modules on JUSTUS 2 use a "semi" hierarchical structure. This is slightly different from what can be seen on other HPC systems with a "full" hierarchical structure. Typical systems with a full hierarchy put compiler modules (e.g., intel, gcc) on the uppermost (Core) level, dependent libraries (e.g., MPI) on the second level, and further dependent libraries on a third level. As a consequence, not all modules contained in the module system are initially visible, namely the modules placed in the second and third layers. Only after a compiler module has been loaded do the second-layer modules that directly depend on that compiler become available. Similarly, loading an MPI module makes the third-layer modules that depend on the loaded MPI library visible.<br>
 
  +
ml whatis <modulename>
  +
</pre>
  +
provides the short description of the software package.
   
  +
=== Finding detailed information about a specific module ===
=== Semi hierarchy of software stack on JUSTUS 2 ===
 
JUSTUS 2 adopts the hierarchical module layout only partially. In particular, only the "Core" and the "second" level are present, and the second level contains only MPI modules. All other modules, for example those from the "chem" sub-category such as ''vasp'', ''turbomole'', or ''gaussian'', or those located in the "numlib" sub-category such as ''mkl'' or ''python_numpy'', reside in the "Core" level.
 
<br>
 
 
=== Module dependency ===
 
The adopted hierarchy model is not the only tool for handling module dependencies. As a matter of fact, most of the modules on JUSTUS 2 require functionality provided by other modules, albeit ones located in the "Core" level. Such provisioning is implemented in a modulefile in one of two ways: either automatically, without any action from the user (the dependent modulefile loads all additional modules while loading), or the dependent modulefile informs the user while loading that additional modules must be pre-loaded if they have not been activated yet (in this case the user must repeat the loading operation). Which solution is applied is decided by the person who built the particular module.<br>
 
<br>
 
An example of a module with automated pre-loading is the ''orca'' module. With an initially empty list of loaded modules, i.e.
 
 
<pre>
 
<pre>
  +
module spider <searching_pattern>
$ ml
 
 
</pre>
 
</pre>
  +
or just
shows
 
 
<pre>
 
<pre>
  +
ml spider <searching_pattern>
No modules loaded
 
 
</pre>
 
</pre>
  +
, the command sequence
 
  +
=== Extended searching through entire module system ===
 
<pre>
 
<pre>
  +
module keyword <searching_pattern>
$ ml chem/orca
 
$ ml
 
 
</pre>
 
</pre>
  +
For example, to find out which modules contain fftw library:
shows
 
 
<pre>
 
<pre>
  +
ml keyword fftw
Currently Loaded Modules:
 
1) compiler/intel/19.1 2) chem/orca/4.2.1
 
 
</pre>
 
</pre>
  +
which gives the following info:
I.e., loading of the ''intel'' compiler is built into the ''orca'' module.
 
  +
<pre>
  +
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
   
  +
The following modules match your search criteria: "fftw"
== Other topics ==
 
  +
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
   
  +
numlib/mkl: numlib/mkl/2019, numlib/mkl/2020, numlib/mkl/2020.2
=== Complete list of Lmod options and sub-commands ===
 
  +
The whole list of module options and all commands available can be displayed by running
 
  +
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
  +
</pre>
  +
  +
== Best Practices when working with modules ==
  +
=== Always load modules with the entire module name ===
  +
The software stack is updated regularly. Adding a new software version usually changes which version is marked as default.
  +
Newer software is not always backwards compatible with existing scripts, workflows, or even input files.
  +
Therefore, it is strongly recommended to avoid loading modules by category and software name alone. Instead, always use the entire module name (including the version) to make sure the same module is loaded each time.
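Scripts can guard against version-less identifiers with a simple pattern check. The following sketch is a heuristic only; the function name is_fully_qualified is hypothetical, and it merely counts slash-separated parts without asking Lmod whether the module exists.

```shell
# Succeeds only if the argument has at least three non-empty,
# slash-separated parts (category/name/version).
is_fully_qualified() {
    case "$1" in
        ?*/?*/?*) return 0 ;;
        *)        return 1 ;;
    esac
}

for spec in chem/vasp chem/vasp/5.4.4.3.16052018; do
    if is_fully_qualified "$spec"; then
        echo "$spec: ok"
    else
        echo "$spec: version missing, would follow the current default"
    fi
done
```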
  +
  +
=== Load only those modules that are needed for the current application ===
  +
Only load modules that are needed for the current script or workflow you are running, to reduce the chance of unexpected behavior caused by module conflicts.
  +
  +
A '''typical error''' sometimes seen on the cluster when loading the vasp module is this:
 
<pre>
 
<pre>
  +
ml compiler/intel/19.1.2
man module
 
  +
ml mpi/impi/2019.8
  +
ml chem/vasp/5.4.4.3.16052018
 
</pre>
 
</pre>
  +
with the output showing
 
  +
'''The correct way''' is simply:
 
<pre>
 
<pre>
  +
ml chem/vasp/5.4.4.3.16052018
Usage: module [options] sub-command [args ...]
 
  +
</pre>
   
  +
=== Do not use module commands in .bashrc, .bash_profile etc. scripts ===
Options:
 
  +
Avoid including “module load” commands in your .bashrc or .bash_profile files. As an alternative, create a bash script with the module load commands and source it each time, to load the modules needed.
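One possible way to set up such a script is sketched here. The file name vasp-modules.sh and its location are hypothetical choices for illustration; the module name is the VASP module mentioned elsewhere on this page.

```shell
# Hypothetical location of the setup file.
setup_file="${TMPDIR:-/tmp}/vasp-modules.sh"

# Write the module commands into the file once.
cat > "$setup_file" <<'EOF'
# Source this file on demand instead of loading modules in .bashrc
module purge
module load chem/vasp/5.4.4.3.16052018
EOF

echo "created $setup_file"
# Later, in a session on the cluster, the modules are loaded on demand with:
#   . "$setup_file"
```

Because the file is sourced explicitly, login shells and batch jobs start without any preloaded modules.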
-h -? -H --help This help message
 
-s availStyle --style=availStyle Site controlled avail style: system
 
(default: system)
 
--regression_testing Lmod regression testing
 
-D Program tracing written to stderr
 
--debug=dbglvl Program tracing written to stderr (where
 
dbglvl is a number 1,2,3)
 
--pin_versions=pinVersions When doing a restore use specified
 
version, do not follow defaults
 
-d --default List default modules only when used with
 
avail
 
-q --quiet Do not print out warnings
 
--expert Expert mode
 
-t --terse Write out in machine readable format for
 
commands: list, avail, spider, savelist
 
--initial_load loading Lmod for first time in a user
 
shell
 
--latest Load latest (ignore default)
 
--ignore_cache Treat the cache file(s) as out-of-date
 
--novice Turn off expert and quiet flag
 
--raw Print modulefile in raw output when used
 
with show
 
-w twidth --width=twidth Use this as max term width
 
-v --version Print version info and quit
 
-r --regexp use regular expression match
 
--gitversion Dump git version in a machine readable
 
way and quit
 
--dumpversion Dump version in a machine readable way
 
and quit
 
--check_syntax --checkSyntax Checking module command syntax: do not
 
load
 
--config Report Lmod Configuration
 
--config_json Report Lmod Configuration in json format
 
--mt Report Module Table State
 
--timer report run times
 
--force force removal of a sticky module or save
 
an empty collection
 
--redirect Send the output of list, avail, spider
 
to stdout (not stderr)
 
--no_redirect Force output of list, avail and spider
 
to stderr
 
--show_hidden Avail and spider will report hidden
 
modules
 
--spider_timeout=timeout a timeout for spider
 
-T --trace
 
   
module [options] sub-command [args ...]
+
=== Use 'module help' command ===
  +
see [[Software_Modules_Lmod#Providing_a_specific_help_for_a_particular_module]]
   
  +
=== Check content of $<SOFTWARE_NAME>_EXA_DIR folder ===
Help sub-commands:
 
  +
see [[Software_Modules_Lmod#Software_job_examples_and_batch_script_templates]]
------------------
 
help prints this message
 
help module [...] print help message from module(s)
 
   
  +
=== Use 'ml purge' in sbatch scripts before the first 'ml load' ===
Loading/Unloading sub-commands:
 
  +
The environment in effect at the time the sbatch, salloc, or srun command is executed is propagated
-------------------------------
 
  +
to the spawned processes, i.e. also to the job script. Consequently, if some module is loaded at the
load | add module [...] load module(s)
 
  +
time of the 'sbatch <job-script>' command execution, its state, i.e. "loaded", as well as the values of
try-load | try-add module [...] Add module(s), do not complain if not
 
  +
the set environmental variables will be propagated with the job.<br><br>
found
 
  +
Thus, consider putting the 'ml purge' command as the first module command when designing your job scripts.
del | unload module [...] Remove module(s), do not complain if not
 
  +
This can prevent a variety of module conflict situations.
found
 
swap | sw | switch m1 m2 unload m1 and load m2
 
purge unload all modules
 
refresh reload aliases from current list of
 
modules.
 
update reload all currently loaded modules.
 
   
  +
For example, in the following scenario
Listing / Searching sub-commands:
 
  +
<pre>
---------------------------------
 
  +
On the login node:
list List loaded modules
 
list s1 s2 ... List loaded modules that match the
 
pattern
 
avail | av List available modules
 
avail | av string List available modules that contain
 
"string".
 
spider List all possible modules
 
spider module List all possible version of that module
 
file
 
spider string List all module that contain the
 
"string".
 
spider name/version Detailed information about that version
 
of the module.
 
whatis module Print whatis information about module
 
keyword | key string Search all name and whatis that contain
 
"string".
 
   
  +
ml compiler/intel/19.1.2
Searching with Lmod:
 
  +
salloc --nodes=1 --ntasks-per-node=1
--------------------
 
All searching (spider, list, avail, keyword) support regular expressions:
 
   
  +
... waiting for the allocation of the resources ...
-r spider '^p' Finds all the modules that start with
 
  +
Once on the compute node execute
`p' or `P'
 
-r spider mpi Finds all modules that have "mpi" in
 
their name.
 
-r spider 'mpi$ Finds all modules that end with "mpi" in
 
their name.
 
   
  +
ml compiler/gnu/10.2
Handling a collection of modules:
 
  +
</pre>
--------------------------------
 
  +
the load of the compiler/gnu/10.2 module on the compute node fails with the following error:
save | s Save the current list of modules to a
 
  +
<pre>
user defined "default" collection.
 
  +
Lmod has detected the following error: Cannot load module "compiler/gnu/10.2" because these module(s) are loaded:
save | s name Save the current list of modules to
 
  +
compiler/intel
"name" collection.
 
reset The same as "restore system"
 
restore | r Restore modules from the user's
 
"default" or system default.
 
restore | r name Restore modules from "name" collection.
 
restore system Restore module state to system defaults.
 
savelist List of saved collections.
 
describe | mcc name Describe the contents of a module
 
collection.
 
disable name Disable (i.e. remove) a collection.
 
   
  +
While processing the following module(s):
Deprecated commands:
 
  +
Module fullname Module Filename
--------------------
 
  +
--------------- ---------------
getdefault [name] load name collection of modules or
 
  +
compiler/gnu/10.2 /opt/bwhpc/common/modulefiles/Core/compiler/gnu/10.2.lua
user's "default" if no name given.
 
===> Use "restore" instead <====
 
setdefault [name] Save current list of modules to name if
 
given, otherwise save as the default
 
list for you the user.
 
===> Use "save" instead. <====
 
   
  +
[ul_l_tkz12@login02 ~]$ ml
Miscellaneous sub-commands:
 
---------------------------
 
is-loaded modulefile return a true status if module is loaded
 
is-avail modulefile return a true status if module can be
 
loaded
 
show modulefile show the commands in the module file.
 
use [-a] path Prepend or Append path to MODULEPATH.
 
unuse path remove path from MODULEPATH.
 
tablelist output list of active modules as a lua
 
table.
 
   
  +
Currently Loaded Modules:
Important Environment Variables:
 
  +
1) compiler/intel/19.1.2
--------------------------------
 
  +
</pre>
LMOD_COLORIZE If defined to be "YES" then Lmod prints
 
properties and warning in color.
 
   
  +
== Useful Extras ==
--------------------------------------------------------------------------
 
  +
=== Conflicts between modules ===
  +
Some modules cannot be loaded together at the same time. For example two different versions of the same package cannot
  +
be activated simultaneously. The modules might already built-in this functionality. In such circumstances, Lmod, during the loading, either
  +
prints an error message and no module is loaded, or the module is reloaded - the old module is unloaded and only the new module is become activated.<br><br>
  +
'''Example of two versions of the intel compiler - module reload:'''
  +
<pre>
  +
ml compiler/intel/19.1
  +
</pre>
  +
<pre>
  +
ml
   
  +
Currently Loaded Modules:
Lmod Web Sites
 
  +
1) compiler/intel/19.1
  +
</pre>
  +
<pre>
  +
ml compiler/intel/19.1.2
   
  +
The following have been reloaded with a version change:
Documentation: http://lmod.readthedocs.org
 
  +
1) compiler/intel/19.1 => compiler/intel/19.1.2
Github: https://github.com/TACC/Lmod
 
  +
</pre>
Sourceforge: https://lmod.sf.net
 
  +
<pre>
TACC Homepage: https://www.tacc.utexas.edu/research-development/tacc-projects/lmod
 
  +
ml
   
  +
Currently Loaded Modules:
To report a bug please read http://lmod.readthedocs.io/en/latest/075_bug_reporting.html
 
  +
1) compiler/intel/19.1.2
--------------------------------------------------------------------------
 
  +
</pre>
  +
<br><br>
  +
'''Example of two different compilers intel and gnu triggers the module conflict with the error during the load of gnu - the new module is not loaded:'''
   
  +
<pre>
Modules based on Lua: Version 8.2.7 2019-11-30 09:41 -06:00
 
  +
ml compiler/intel/19.1.2
by Robert McLay mclay@tacc.utexas.edu
 
 
</pre>
 
</pre>
  +
<pre>
  +
ml
   
  +
Currently Loaded Modules:
=== Module categories, versions and defaults ===
 
  +
1) compiler/intel/19.1.2
Software stack on bwHPC systems is commonly classified into following categories:
 
  +
</pre>
<!--* [[:Category:Chemistry_software|chem]]-->
 
  +
<pre>
* chem
 
  +
ml compiler/gnu/10.2
<!--* [[:Category:Compiler_software|compiler]]-->
 
  +
Lmod has detected the following error: Cannot load module "compiler/gnu/10.2" because these module(s) are loaded:
* compiler
 
  +
compiler/intel
<!--* [[:Category:Debugger_software|devel]]-->
 
* devel
 
<!--* [[:Category:Libraries|lib]]-->
 
* lib
 
<!--* [[:Category:Numerical libraries|numlib]]-->
 
* numlib
 
<!--* [[:Category:Physics software|phys]]-->
 
* phys
 
<!--* [[:Category:System software|system]]-->
 
* system
 
<!--* [[:Category:Visualization|vis]]-->
 
* vis
 
<!--* [[:Category:Mathematical ecosystems|math]]-->
 
* math
 
   
  +
While processing the following module(s):
Each the category is further divided according to software packages and those finally according to software versions.
 
  +
Module fullname Module Filename
Similarly, the module identifier has the format: ''category/softwarename/version''
 
  +
--------------- ---------------
For instance, ''gnu'' compiler of the version ''10.1'' is unabiguosly addressed as ''compiler/gnu/10.1'' in, e.g. the load command:
 
  +
compiler/gnu/10.2 /opt/bwhpc/common/modulefiles/Core/compiler/gnu/10.2.lua
  +
</pre>
 
<pre>
 
<pre>
  +
ml
$ ml compiler/gnu/10.1
 
  +
  +
Currently Loaded Modules:
  +
1) compiler/intel/19.1.2
 
</pre>
 
</pre>
 
<br>
 
<br>
  +
'''Solution:''' intel compiler must be unloaded prior the load of gnu module.
In case of there is multiple software versions, one version is always pre-determined as the '''default'''
 
version. To address a default version, ''version'' can be omitted in the module identifier.
 
For example, the loading of the default intel compiler module is realized via
 
 
<pre>
 
<pre>
$ ml compiler/intel
+
ml -compiler/intel/19.1.2
 
</pre>
 
</pre>
  +
<pre>
  +
ml compiler/gnu/10.2
  +
</pre>
  +
<pre>
  +
ml
   
  +
Currently Loaded Modules:
=== Selective searching ===
 
  +
1) compiler/gnu/10.2
It is possible to perform searching through only selected category by adding name of the cathegory to ''module avail'' or ''ml av'' command.
 
  +
</pre>
For example, to only explore the compilers is realized with
 
  +
  +
=== Module dependencies (why there is no mpi module?) ===
  +
Some modules can depends on other modules. Typically, many modules depends on mpi library, Mpi library depends on a compiler, etc.. The user does not need
  +
to care about these fundamental dependencies. Majority of modules automatically take care about loading of all necessary packages it is depending on.
  +
However, there is an eminent exception - mpi library. While the most of the installed parallel applications exist for only one compiler-mpi combination,
  +
there are variety of mpi libraries of the same versions built with different compilers. For example there are two sets of OpenMPI 4.0 modules for intel and gnu compilers.
  +
Thus, an user who wants to load a specific mpi must chose (load) a particular compiler prior the mpi module load. Note, that the mpi modules also remains
  +
"invisible" for "module av <mpi_name>" command until a certain compiler is not loaded. This due to the module hierarchy of Lmod. More details about the hierarchy
  +
is below in [[Software_Modules_Lmod#Semi_hierarchical_layout_of_modules_on_JUSTUS_2]]
  +
  +
  +
==== Consequences of the partial module hierarchy for mpi modules ====
  +
Mpi modules remains invisible for a user (prompted ''module avail'') until some compiler module has been loaded. Once the compiler module has been activated corresponding mpi modules, i.e. built with the particular compiler, become visible.<br>
  +
<br>
  +
E.g., with the originally empty list of the loaded modules, the module command
 
<pre>
 
<pre>
$ module avail compiler/
+
$ module avail
 
</pre>
 
</pre>
  +
or its shorthand analogue
  +
<pre>
  +
$ ml av
  +
</pre>
  +
displays no mpi module available. After running
  +
<pre>
  +
ml compiler/intel/19.1
  +
ml av
  +
</pre>
  +
mpi packages compatible with the intel 19.1 compiler becomes visible
  +
<pre>
  +
------------ /opt/bwhpc/common/modulefiles/Compiler/intel/19.1 --------------------
  +
mpi/impi/2019.7 mpi/openmpi/4.0
  +
</pre>
  +
in the list of the available software.
   
  +
=== Online user guide of Lmod ===
  +
The complete user guide can be found on Lmod websites [https://lmod.readthedocs.io/en/latest/010_user.html]
   
  +
=== Additional Module System tasks ===
=== Searching capabilities of ''module avail'' commands ===
 
  +
Lmod offers more than 25 sub-commands plus various options to manage the modulefile system installed on JUSTUS 2. See, e.g. output of "module --help" command. Large majority of users will use only couple of them. A complete list of module sub-commands can be displayed by entering "module --help" command or in [https://lmod.readthedocs.io/en/latest/010_user.html Lmod online documentation]. The following text lists only a couple of them.
The command ''module avail'' has a search capabilities. For example
 
  +
  +
== Other topics ==
  +
=== Which shells supports module commands? ===
  +
So far Bash is only supported shell on JUSTUS 2 to interpret module commands.
  +
  +
=== Semi hierarchical layout of modules on JUSTUS 2 ===
  +
  +
==== Module hierarchy in Lmod ====
  +
The structure of software modules on JUSTUS 2 exploits a "semi" hierarchical structure. This is slightly different from what can be seen on another HPC systems with "full" hierarchical structure. The typical systems with full hierarchy put compiler modules (i.e., intel, gcc) in the uppermost (Core) level, depending libraries (e.g., MPI) on the second level, and more depending libraries on a third level. As a consequence, not all the modules contained in the module system are initially visible, namely the modules putted in the second and third layer. Only after a loading a compiler module, the modules of the second layer directly depending on the particular compiler will become available. And similarly, loading an MPI module will make the modules of the third layer depending on the loaded MPI library visible.<br>
  +
  +
==== Semi hierarchy of software stack on JUSTUS 2 ====
  +
JUSTUS 2 adopted the hierarchical structure of the modules layout only partially. In particular, there is only "Core" and the "second" level presented and there are only mpi modules contained in the second level. All other modules, i.e. for example those from the "chem" sub-cathegory such as ''vasp'', ''turbomole'', or ''gaussian'', or those located in the "numlib" sub-cathegory such as ''mkl'' or ''python_numpy'', are embodied in the "Core" level.
  +
<br>
  +
  +
==== Module dependency ====
  +
The adopted hierarchy models is not the only tool handling the module dependency. As a matter of fact, most of the modules on JUSTUS 2 require a provision of functionalities from another modules, albeit located in the "Core" level. Such provisioning is implemented in a modulefile either automatically without a need of any action from the user (the depending modulefile, while loading, loads all additional modules automatically) or the depending modulefile, while loading, informs the user about necessity to pre-load additional modules if those has not been activated yet (in this case the user must repeat the loading operation). Which of the solution is applied rests with the decision of the person who built the particular module.<br>
  +
<br>
  +
An example of module with the implemented automated pre-loading is ''orca'' module. With the pre-emptied list of the loading modules, i.e.
 
<pre>
 
<pre>
  +
ml
$ ml av gnu
 
  +
</pre>
  +
shows
  +
<pre>
  +
No modules loaded
  +
</pre>
  +
, the command sequence
  +
<pre>
  +
ml chem/orca
  +
ml
  +
</pre>
  +
shows
  +
<pre>
  +
Currently Loaded Modules:
  +
1) compiler/intel/19.1 2) chem/orca/4.2.1
  +
</pre>
  +
I.e., loading of the ''intel'' compiler is built-in the ''orca'' module.
  +
  +
=== Complete list of Lmod options and sub-commands ===
  +
The whole list of module options and all commands available can be displayed by running
  +
<pre>
  +
man module
 
</pre>
 
</pre>
  +
or
will list for any modulefile with the name containing the string "gnu", i.e. for instance
 
 
<pre>
 
<pre>
  +
module --help
--------------------------- /opt/bwhpc/common/modulefiles/Core ---------------------------------
 
compiler/gnu/system compiler/gnu/9.3 compiler/gnu/10.1 (D) vis/gnuplot/5.2
 
 
</pre>
 
</pre>
   
Line 399: Line 551:
 
You can view its content using:
 
You can view its content using:
 
<pre>
 
<pre>
$ type module
+
type module
 
</pre>
 
</pre>
 
and you will get the following result:
 
and you will get the following result:
 
<pre>
 
<pre>
$ type module
+
type module
 
module is a function
 
module is a function
 
module ()
 
module ()
Line 415: Line 567:
 
<br>
 
<br>
 
----
 
----
[[Category:bwForCluster_JUSTUS_2|JUSTUS 2]]
 
 
[[#top|Back to top]]
 
[[#top|Back to top]]

Latest revision as of 10:07, 23 April 2024

Software Module System - Lmod

1 Preface

This guide gives a general overview of, and introduction to, software management with Lmod on JUSTUS 2. It is aimed at new users as well as experienced users coming, e.g., from sites that use a different environment modules system.

2 Scientific Software management through the module system Lmod

The following sections cover the basic module commands needed to find and load the scientific applications installed on JUSTUS 2.

JUSTUS 2 runs the Linux operating system. The standard Linux packages are installed on the front end nodes (login and visualisation nodes). The scientific software is accessible via the so-called module system.
To find and load a scientific application, one needs to use module commands. For example, the following command sequence:

module load chem/gaussian/g16.C.01
module list
module help chem/gaussian/g16.C.01
cp $GAUSSIAN_EXA_DIR/bwforcluster-gaussian-example.sbatch .
1. loads the module with the gaussian software package, version 16, revision C.01
2. prints the list of currently loaded modules
3. shows the user help for the particular gaussian module
4. copies the template batch script that was specifically designed for submitting g16 jobs to the SLURM workload manager on JUSTUS 2.


2.1 Why do we use a module system? Modules load scientific software

The module system on JUSTUS 2 is managed by Lmod (https://lmod.readthedocs.io/en/latest/). It covers the majority of the computational software available, including compilers, mpi libraries, numerical libraries, computational chemistry packages, python-specific libraries, etc.

Programs managed by the module system are not usable by default. A module has to be "loaded" before its programs can be executed.

Among other things, the module system provides the following functionality:

1. When a module is loaded, it automatically sets the environment variables that the application needs to run properly.
2. It takes care of module dependencies. It either loads all additional modules required by the application, or it informs the user which additional modules need to be loaded manually.
3. It prevents the loading of modules that conflict with each other and could cause instability or unexpected behavior.

One of the main functions of Lmod is module load, which makes the variety of software packages pre-installed on the cluster accessible. This takes only a single command:

module load <module_name>

The activation is realized by dynamically modifying the user's shell environment. In the simplest case, this means adding the paths of the bin directories of the specific software to the PATH environment variable. Typically, Lmod modifies PATH and LD_LIBRARY_PATH and sets new variables such as <SOFTWARE_NAME>_EXA_DIR, which contains the path to the directory with examples for the specific software.

Example: compare the content of the $PATH environment variable before and after loading the gaussian module.
Before the gaussian module is loaded:

echo $PATH
/home/software/common/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin
which g16
/usr/bin/which: no g16 in (/home/software/common/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin)
ml chem/gaussian

After gaussian is loaded:

echo $PATH
/.../chem/gaussian/g16.C.01/x86_64-Intel-avx2-source/g16/bsd:/.../chem/gaussian/g16.C.01/x86_64-Intel-avx2-source/g16:/home/software/common/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin
which g16
/.../chem/gaussian/g16.C.01/x86_64-Intel-avx2-source/g16/g16
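
The same comparison can be scripted. The following is a minimal sketch, not part of the JUSTUS 2 tooling: the helper name path_added is hypothetical, and it only prints the entries that occur in a new PATH value but not in an old one.

```shell
# path_added OLD NEW: print every colon-separated entry of NEW that does not
# occur in OLD, one per line. Hypothetical helper, for illustration only.
path_added() {
    local old=":$1:" rest="$2:" entry
    while [ -n "$rest" ]; do
        entry=${rest%%:*}   # first remaining entry
        rest=${rest#*:}     # drop it together with its colon
        [ -n "$entry" ] || continue
        case "$old" in
            *":$entry:"*) ;;                 # already present in OLD
            *) printf '%s\n' "$entry" ;;     # new entry
        esac
    done
}

# Usage on the cluster (requires the module command):
#   before=$PATH
#   module load chem/gaussian/g16.C.01
#   path_added "$before" "$PATH"
```

Saving PATH in a variable before the load and passing both values to the helper lists just the directories the module prepended.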

2.2 Basic functions of Lmod and commands


The module system has other useful capabilities than just managing the environment.

(i) to list available software

module available


(ii) to load (activate) modules (particular software)

module load


(iii) to unload (deactivate) modules (particular software)

module unload
module purge

(iv) to list currently loaded modules

module list


(v) to search through all packages within the module system

module available
module spider
module keyword


(vi) to provide specific help information for a particular module

module help
module whatis

2.3 Elementary Lmod Commands

The module commands below can be used interactively (in the current shell session) as well as in shell scripts, in particular in the sbatch scripts used to submit computational jobs to SLURM (the workload manager on JUSTUS 2).

2.3.1 List of available software modules

module available

or, in short form,

ml av

2.3.2 Module naming convention: Category/Name/Version

On JUSTUS 2 (as on other bwHPC sites), the software modules are grouped into several categories:

  • chem
  • compiler
  • devel
  • lib
  • numlib
  • phys
  • system
  • vis
  • math

This makes it easier for users to find their way around the module system. For example, the Gaussian 16 program, which calculates the electronic structure of molecules, is found in the category chem (together with other programs used by theoretical chemists).
Each category is further divided by software package, and each package by software version.
The full name of a module always consists of three parts, category, name, and version, separated by slashes: category/name/version. Consequently, the full name of the module with the Gaussian 16 package is chem/gaussian/g16.C.01. Analogously, the gnu compiler of version 10.2 is addressed as compiler/gnu/10.2.
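
The three-part convention can be taken apart with plain shell parameter expansion. This is a small sketch with a hypothetical helper name (module_parts); it assumes the name really has the category/name/version form described above.

```shell
# module_parts FULLNAME: split category/name/version into its three parts.
# Hypothetical helper; it does not call Lmod at all.
module_parts() {
    local full="$1" category name version
    category="${full%%/*}"   # text before the first slash
    version="${full##*/}"    # text after the last slash
    name="${full#*/}"        # strip the category ...
    name="${name%/*}"        # ... and the version
    printf 'category=%s name=%s version=%s\n' "$category" "$name" "$version"
}

module_parts chem/gaussian/g16.C.01
# prints: category=chem name=gaussian version=g16.C.01
```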

See, for example, all modules of category chem with:

ml av chem
---------------------------------------------- /opt/bwhpc/common/modulefiles/Core----------------------------------------------------------------------------------
   chem/adf/2019.304               chem/gaussian/g16.C.01             chem/molpro/2020.1        (D)    chem/orca/5.0.1-xtb-6.4.1                 chem/tmolex/4.6            (D)
   chem/ams/2020.101               chem/gaussview/6.1.1               chem/namd/2.14                   chem/orca/5.0.1                    (D)    chem/turbomole/7.4.1
   chem/ams/2020.103               chem/gromacs/2020.2                chem/nbo/6.0.18_i4               chem/quantum_espresso/6.5                 chem/turbomole/7.5         (D)
   chem/ams/2021.102        (D)    chem/gromacs/2020.4                chem/nbo/6.0.18_i8        (D)    chem/quantum_espresso/6.7_openmp-5        chem/vasp/5.4.4.3.16052018
   chem/cfour/2.1_openmpi          chem/gromacs/2021.1         (D)    chem/openbabel/3.1.1             chem/quantum_espresso/6.7          (D)    chem/vmd/1.9.3
   chem/cp2k/7.1                   chem/jmol/14.31.3                  chem/openmolcas/19.11            chem/schrodinger/2020-2                   chem/xtb/6.3.3
   chem/cp2k/8.0_devel      (D)    chem/lammps/stable_3Mar2020        chem/openmolcas/21.06     (D)    chem/schrodinger/2021-1            (D)    chem/xtb/6.4.1             (D)
   chem/dalton/2020.0              chem/molcas/8.4                    chem/openmolcas/21.10            chem/siesta/4.1-b4
   chem/dftbplus/20.2.1-cpu        chem/molden/5.9                    chem/orca/4.2.1-xtb-6.3.3        chem/siesta/4.1.5                  (D)
   chem/gamess/2020.2              chem/molpro/2019.2.3               chem/orca/4.2.1                  chem/tmolex/4.5.2

or, analogously, all available versions of intel compilers:

ml av compiler/intel
--------------------------------------------------------------------------------/opt/bwhpc/common/modulefiles/Core--------------------------------------------------------------
   compiler/intel/19.0    compiler/intel/19.1    compiler/intel/19.1.2 (D)

2.3.3 Load specific software

module load <module_name>

or, for short,

ml <module_name>

For example, to load gaussian version 16, run

ml chem/gaussian/g16.C.01

2.3.4 List of the loaded modules

module list

or simply

ml

2.3.5 Default module version

If multiple versions of a software package exist, one version is always pre-determined as the default version. To address the default version, the version can be omitted from the module identifier. For example, the default intel compiler module is loaded via

ml compiler/intel
ml

Currently Loaded Modules:
  1) compiler/intel/19.1.2

2.3.6 Unload a specific software from the environment

module unload <module_name>

or equivalently

ml -<module_name>

For example, to unload the previously loaded vasp module chem/vasp/5.4.4.3.16052018, use

ml -chem/vasp/5.4.4.3.16052018

2.3.7 Unload all the loaded modules

module purge

or

ml purge

2.3.8 Providing a specific help for a particular module

module help <module_name>

or

ml help <module_name>

2.3.9 Software job examples and batch script templates

The majority of the software modules provide examples, including batch scripts for the slurm job queueing system. The full path to the directory with the examples is normally contained in the <SOFTWARE_NAME>_EXA_DIR environment variable. For example, the examples for Gromacs-2021.1 can be located after loading the module:

ml chem/gromacs/2021.1
echo $GROMACS_EXA_DIR/
/opt/bwhpc/common/chem/gromacs/2021.1-openmpi-4.0/bwhpc-examples/
ls $GROMACS_EXA_DIR
GROMACS_TestCaseA  Performance-Tuning-and-Optimization-of-GROMACS.pdf  README
ls $GROMACS_EXA_DIR/GROMACS_TestCaseA/
gromacs-2021.1_gpu.slurm  gromacs-2021.1.slurm  ion_channel.tpr

Users may copy these examples and use them as templates for their own job scripts:

cp $GROMACS_EXA_DIR/GROMACS_TestCaseA/gromacs-2021.1_gpu.slurm .

Note: All the example batch scripts are fully functional, i.e. they can be submitted directly to the queueing system to launch a test job. Typically, the scripts launch a short, simple calculation with the given software. Moreover, most of the sbatch scripts contain general submission instructions as well as hints specific to the particular program.
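
Looking up the examples directory can be wrapped in a small function. This is only a sketch: the helper name exa_dir is hypothetical, and the rule that the variable is the upper-case software name followed by _EXA_DIR is an assumption inferred from GAUSSIAN_EXA_DIR and GROMACS_EXA_DIR; individual modules may deviate from it.

```shell
# exa_dir SOFTWARE: print the examples directory of an already loaded module.
# Assumes the variable is called <SOFTWARE_NAME>_EXA_DIR in upper case.
exa_dir() {
    local var dir
    var="$(printf '%s' "$1" | tr '[:lower:]' '[:upper:]')_EXA_DIR"
    case "$var" in
        *[!A-Z0-9_]*)   # refuse names that are not valid variable names
            echo "unexpected software name: $1" >&2
            return 1 ;;
    esac
    eval "dir=\${$var:-}"   # indirect lookup of the variable named in $var
    if [ -z "$dir" ]; then
        echo "$var is not set; is the module loaded?" >&2
        return 1
    fi
    printf '%s\n' "$dir"
}

# Usage on the cluster, after "ml chem/gromacs/2021.1":
#   cp "$(exa_dir gromacs)"/GROMACS_TestCaseA/gromacs-2021.1.slurm .
```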

2.3.10 Searching through module names

module available <module_name>

or, for short,

ml av <module_name>

For example, to search for python modules, run

ml av python

with the following output:

----------------------------------------------------------------------------------- /opt/bwhpc/common/modulefiles/Core ------------------------------------------------------------------------------------
   devel/python/3.8.3    lib/python_matplotlib/3.2.2_numpy-1.19.0_python-3.8.3    numlib/python_numpy/1.19.0_python-3.8.3    numlib/python_scipy/1.5.0_numpy-1.19.0_python-3.8.3

Use "module spider" to find all possible modules and extensions.
Use "module keyword key1 key2 ..." to search for all possible modules matching any of the "keys"

2.3.11 What does this software do? A command for unfamiliar software

ml whatis <module_name>

prints a short description of the software package.

2.3.12 Finding detailed information about a specific module

module spider <searching_pattern>

or just

ml spider <searching_pattern>

2.3.13 Extended searching through entire module system

module keyword <searching_pattern>

For example, to find out which modules provide the fftw library:

ml keyword fftw

which gives the following info:

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

The following modules match your search criteria: "fftw"
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

  numlib/mkl: numlib/mkl/2019, numlib/mkl/2020, numlib/mkl/2020.2

-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

2.4 Best Practices when working with modules

2.4.1 Always load modules with the entire module name

The software stack is updated regularly. Adding a new software version usually changes which version is marked as default. Newer software is not always backwards compatible with existing scripts, workflows, or even input files. Therefore, it is strongly recommended to avoid loading modules by category and software name only. Instead, always use the entire module name (including the version) to make sure the same module is loaded each time.
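
A job script can check this rule before loading anything. The following sketch uses a hypothetical helper, has_version, which is not an Lmod command; it only tests whether a name has the three-part category/name/version form.

```shell
# has_version NAME: succeed only if NAME has the form category/name/version.
# Hypothetical helper for job scripts; it does not query Lmod.
has_version() {
    case "$1" in
        */*/*/*) return 1 ;;   # more than three parts
        */*/?*)  return 0 ;;   # category/name/version
        *)       return 1 ;;   # version missing
    esac
}

for m in chem/vasp/5.4.4.3.16052018 compiler/intel; do
    if has_version "$m"; then
        echo "$m: version pinned"
    else
        echo "$m: no version given, the current default would be loaded"
    fi
done
```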

2.4.2 Load only those modules that are needed for the current application

Only load modules that are needed for the current script or workflow you are running, to reduce the chance of unexpected behavior caused by module conflicts.

A typical mistake seen on the cluster when loading the vasp module is:

ml compiler/intel/19.1.2
ml mpi/impi/2019.8
ml chem/vasp/5.4.4.3.16052018

The correct way is simply:

ml chem/vasp/5.4.4.3.16052018

2.4.3 Do not use module commands in .bashrc, .bash_profile etc. scripts

Avoid including “module load” commands in your .bashrc or .bash_profile files. As an alternative, create a bash script with the module load commands and source it each time, to load the modules needed.

2.4.4 Use 'module help' command

See section 2.3.8, Providing a specific help for a particular module.

2.4.5 Check content of $<SOFTWARE_NAME>_EXA_DIR folder

See section 2.3.9, Software job examples and batch script templates.

2.4.6 Use 'ml purge' in sbatch scripts before the first 'ml load'

The environment in effect at the time the sbatch, salloc, or srun command is executed is propagated to the spawned processes, i.e. also to the job script. Consequently, if a module is loaded when 'sbatch <job-script>' is executed, its "loaded" state as well as the values of the environment variables it set are propagated with the job.

Therefore, consider putting the 'ml purge' command first among the module commands when designing your job scripts. This can prevent a variety of module conflicts.

Consider, for example, the following scenario.

On the login node:

ml compiler/intel/19.1.2
salloc --nodes=1 --ntasks-per-node=1

... waiting for the allocation of the resources ...
Once on the compute node, execute

ml compiler/gnu/10.2

The load of the compiler/gnu/10.2 module on the compute node fails with the following error:

Lmod has detected the following error:  Cannot load module "compiler/gnu/10.2" because these module(s) are loaded:
   compiler/intel

While processing the following module(s):
    Module fullname    Module Filename
    ---------------    ---------------
    compiler/gnu/10.2  /opt/bwhpc/common/modulefiles/Core/compiler/gnu/10.2.lua

[ul_l_tkz12@login02 ~]$ ml

Currently Loaded Modules:
  1) compiler/intel/19.1.2
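
A job script that follows this recommendation could start as sketched below. The helper name clean_load is hypothetical and the #SBATCH values simply repeat the salloc options from the scenario; only module purge and module load are taken from the text above.

```shell
#!/bin/bash
#SBATCH --nodes=1
#SBATCH --ntasks-per-node=1

# clean_load MODULE...: drop every module inherited from the submitting shell,
# then load only the modules named on the command line.
# Hypothetical helper; it wraps the two module commands discussed above.
clean_load() {
    module purge && module load "$@"
}

# In a real job script this call would follow, e.g.:
#   clean_load compiler/gnu/10.2
```

With the purge in place, the intel compiler loaded on the login node no longer blocks the gnu compiler inside the job.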

2.5 Useful Extras

2.5.1 Conflicts between modules

Some modules cannot be loaded at the same time. For example, two different versions of the same package cannot be active simultaneously. Modules may have such restrictions built in. In these cases, Lmod either prints an error message during loading and loads no module, or it reloads the module: the old module is unloaded and only the new module becomes active.

Example of two versions of the intel compiler - module reload:

ml compiler/intel/19.1
ml

Currently Loaded Modules:
  1) compiler/intel/19.1
ml compiler/intel/19.1.2 

The following have been reloaded with a version change:
  1) compiler/intel/19.1 => compiler/intel/19.1.2
ml

Currently Loaded Modules:
  1) compiler/intel/19.1.2



Example of two different compilers, intel and gnu: loading gnu triggers a module conflict error and the new module is not loaded:

ml compiler/intel/19.1.2
ml

Currently Loaded Modules:
  1) compiler/intel/19.1.2
ml compiler/gnu/10.2 
Lmod has detected the following error:  Cannot load module "compiler/gnu/10.2" because these module(s) are loaded:
   compiler/intel

While processing the following module(s):
    Module fullname    Module Filename
    ---------------    ---------------
    compiler/gnu/10.2  /opt/bwhpc/common/modulefiles/Core/compiler/gnu/10.2.lua
ml

Currently Loaded Modules:
  1) compiler/intel/19.1.2


Solution: the intel compiler must be unloaded before the gnu module is loaded.

ml -compiler/intel/19.1.2
ml compiler/gnu/10.2
ml

Currently Loaded Modules:
  1) compiler/gnu/10.2
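
The two steps of this solution can be combined in a small function. This is a sketch with a hypothetical name, switch_compiler; it uses only the unload and load sub-commands. Lmod also has a swap sub-command, but whether it resolves this particular conflict on JUSTUS 2 has not been verified here.

```shell
# switch_compiler OLD NEW: unload the compiler module OLD, then load NEW.
# NEW is loaded only if the unload succeeded. Hypothetical helper.
switch_compiler() {
    module unload "$1" && module load "$2"
}

# Usage on the cluster:
#   switch_compiler compiler/intel/19.1.2 compiler/gnu/10.2
```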

2.5.2 Module dependencies (why is there no mpi module?)

Some modules depend on other modules. Typically, many modules depend on an mpi library, the mpi library depends on a compiler, etc. The user does not need to care about these fundamental dependencies. The majority of modules automatically load all the packages they depend on. However, there is one prominent exception: the mpi library. While most of the installed parallel applications exist for only one compiler-mpi combination, there is a variety of mpi libraries of the same version built with different compilers. For example, there are two sets of OpenMPI 4.0 modules, one for the intel and one for the gnu compilers. Thus, a user who wants to load a specific mpi module must first choose (load) a particular compiler. Note that the mpi modules also remain "invisible" to the "module av <mpi_name>" command until a compiler is loaded. This is due to the module hierarchy of Lmod. More details about the hierarchy are given below in section 2.6.2, Semi hierarchical layout of modules on JUSTUS 2.


2.5.2.1 Consequences of the partial module hierarchy for mpi modules

Mpi modules remain invisible to the user (in the output of module avail) until a compiler module has been loaded. Once the compiler module has been activated, the corresponding mpi modules, i.e. those built with that particular compiler, become visible.

E.g., with an initially empty list of loaded modules, the module command

module avail

or its shorthand analogue

ml av

displays no mpi module available. After running

ml compiler/intel/19.1
ml av

the mpi packages compatible with the intel 19.1 compiler become visible

------------ /opt/bwhpc/common/modulefiles/Compiler/intel/19.1 --------------------
   mpi/impi/2019.7    mpi/openmpi/4.0

in the list of the available software.
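
Visibility can also be tested from a script with the is-avail sub-command, which Lmod's help describes as returning a true status if a module can be loaded. The wrapper below is a sketch with a hypothetical name, mpi_visible; the messages are illustrative and the behaviour on JUSTUS 2 is assumed to follow the hierarchy described above.

```shell
# mpi_visible MODULE: report whether MODULE can be loaded right now.
# Hypothetical helper built on the Lmod sub-command "is-avail".
mpi_visible() {
    if module is-avail "$1"; then
        echo "$1 can be loaded now"
    else
        echo "$1 is not available yet; load a compiler module first"
    fi
}

# Usage on the cluster:
#   mpi_visible mpi/openmpi/4.0        # expected: not available yet
#   module load compiler/intel/19.1
#   mpi_visible mpi/openmpi/4.0        # expected: can be loaded now
```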

2.5.3 Online user guide of Lmod

The complete user guide can be found on the Lmod website: https://lmod.readthedocs.io/en/latest/010_user.html

2.5.4 Additional Module System tasks

Lmod offers more than 25 sub-commands plus various options to manage the modulefile system installed on JUSTUS 2. The large majority of users will need only a few of them. A complete list of module sub-commands can be displayed with the "module --help" command or found in the Lmod online documentation.

2.6 Other topics

2.6.1 Which shells support module commands?

So far, Bash is the only shell supported on JUSTUS 2 for interpreting module commands.

2.6.2 Semi hierarchical layout of modules on JUSTUS 2

2.6.2.1 Module hierarchy in Lmod

The software modules on JUSTUS 2 use a "semi" hierarchical structure. This is slightly different from what can be seen on other HPC systems with a "full" hierarchical structure. Typical systems with a full hierarchy put compiler modules (e.g., intel, gcc) in the uppermost (Core) level, libraries that depend on them (e.g., MPI) on the second level, and libraries with further dependencies on a third level. As a consequence, not all modules contained in the module system are initially visible, namely those placed in the second and third layers. Only after a compiler module is loaded do the second-layer modules that depend directly on that compiler become available. Similarly, loading an MPI module makes the third-layer modules that depend on the loaded MPI library visible.

2.6.2.2 Semi hierarchy of software stack on JUSTUS 2

JUSTUS 2 adopted the hierarchical module layout only partially. In particular, only the "Core" and the "second" level are present, and the second level contains only mpi modules. All other modules, for example those from the "chem" category such as vasp, turbomole, or gaussian, or those from the "numlib" category such as mkl or python_numpy, are placed in the "Core" level.

2.6.2.3 Module dependency

The adopted hierarchy model is not the only tool for handling module dependencies. As a matter of fact, most of the modules on JUSTUS 2 require functionality provided by other modules, even though these are located in the "Core" level. Such dependencies are handled in a modulefile in one of two ways: either automatically, without any action from the user (the dependent modulefile loads all additional modules while loading), or the dependent modulefile informs the user that additional modules must be pre-loaded if they have not been activated yet (in this case the user must repeat the loading operation). Which solution is applied is decided by the person who built the particular module.

An example of a module with automated pre-loading is the orca module. Starting from an empty list of loaded modules, i.e.

ml

shows

No modules loaded

, the command sequence

ml chem/orca
ml

shows

Currently Loaded Modules:
  1) compiler/intel/19.1   2) chem/orca/4.2.1 

That is, loading of the intel compiler is built into the orca module.

2.6.3 Complete list of Lmod options and sub-commands

The whole list of module options and all commands available can be displayed by running

man module

or

module --help

2.6.4 How do Modules work?

The default shell on the bwHPC clusters is bash, so explanations and examples will be shown for bash. In general, programs cannot modify the environment of the shell they are being run from, so how can the module command do exactly that?
The module command is not a program, but a bash-function. You can view its content using:

type module

and you will get the following result:

type module
module is a function
module ()
{
    eval $($LMOD_CMD bash "$@");
    [ $? = 0 ] && eval $(${LMOD_SETTARG_CMD:-:} -s sh)
}

In this function, lmod is called. Its output to stdout is then executed inside your current shell using the bash-internal eval command. As a consequence, all output that you see from the module is transmitted via stderr (output handle 2) or in so
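
The mechanism can be reproduced without Lmod. In this sketch, fake_lmod and fake_module are hypothetical stand-ins for $LMOD_CMD and the module function; the point is only that a child process cannot change the calling shell, while a function that evals generated commands can.

```shell
unset DEMO_VAR

# A child process cannot change the environment of the calling shell ...
sh -c 'DEMO_VAR=child'
echo "after child process: ${DEMO_VAR:-unset}"

# ... but a shell function that evals generated commands can.
# fake_lmod stands in for $LMOD_CMD: it only prints shell code to stdout.
fake_lmod() {
    echo "export DEMO_VAR=loaded;"
}
fake_module() {
    eval "$(fake_lmod "$@")"
}

fake_module load demo
echo "after function call: ${DEMO_VAR:-unset}"
```

The first echo reports the variable as unset, the second reports the value set by the evaluated output.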


Back to top