Difference between revisions of "Development/General compiler usage"

From bwHPC Wiki
Jump to: navigation, search
(Makefiles have nothing to do with General Compiler Usage ,-))
(Update General compiler usage to include further Information on Warnings and static code Analysis)
Line 4: Line 4:
 
|-
 
|-
 
| module load
 
| module load
| compiler/gnu|intel|llvm|pgi|...
+
| compiler/gnu or compiler/intel or compiler/llvm and others...
 
|-
 
|-
 
| License
 
| License
| [[Intel_Compiler|Intel]]: Commercial | [[GCC|GNU]]: GPL | LLVM: Apache 2 | PGI: Commercial
+
| [[Intel_Compiler|Intel]]: Commercial | [[GCC|GNU]]: GPL | LLVM: Apache 2 | PGI/NVIDIA: Commercial
 
|}
 
|}
   
Line 13: Line 13:
 
= Description =
 
= Description =
   
  +
Basically, compilers translate human-readable source code (e.g. C++ interpreted as adhering to ISO/IEC 14882:2014, encoded in UTF-8 text) into binary byte code (e.g. x86-64 with Linux ABI in ELF-format).
The basic operations can be performed with the same commands for all available compilers. For advanced usage such as optimization and profiling you should consult the best practice guide of the compiler you intend to use ([[BwHPC_BPG_Compiler#GCC|GCC]], [[BwHPC_BPG_Compiler#Intel Suite|Intel Suite]]).
 
  +
Compilers are complex software and have become very powerful in the last decades, to '''guide''' you as a programmer writing better, more portable, more performant programs. Use the compiler as a tool -- and best use multiple compilers on the same source code for best results.
  +
The basic operations and hints can be performed with the same or similar commands on all available compilers. For advanced usage such as optimization and profiling you should consult the best practice guide of the compiler you intend to use ([[BwHPC_BPG_Compiler#GCC|GCC]], [[BwHPC_BPG_Compiler#Intel Suite|Intel Suite]]).
   
 
More information about the MPI versions of the GNU and Intel Compilers is available here:
 
More information about the MPI versions of the GNU and Intel Compilers is available here:
Line 19: Line 21:
   
   
  +
= Loading compilers as modules =
= Versions and availability =
 
   
  +
Modules and loading of modules is described [Software_Modules_Lmod|here for Lmod] and [Environment_Modules|here for traditional Environment Modules].
A list of versions currently available compilers on the bwHPC-C5-Clusters can be obtained from the CIS system. All the available versions are listed at the end of this page.
 
   
  +
However, modules need to be mentioned, since on any system there's a pre-installed set of compilers (for C, C++ and usually Fortran), which are provided by the Linux distribution -- the so-called system compilers. Which however may lack certain Options for optimization, for warnings or other features. On RedHat Enterprise Linux this is GNU compiler v8.3.1.
On the command line interface of any bwHPC cluster you'll get a list of available versions
 
  +
Be advised to check out the newer compilers available as modules.
by using the command
 
''''module avail compiler''''.
 
<pre>
 
$ : bwUniCluster 2.0
 
$ module avail compiler
 
---------------------- /opt/bwhpc/common/modulefiles/Core -----------------------
 
compiler/clang/9.0 compiler/gnu/8.3.1
 
compiler/gnu/9.3 compiler/gnu/10.2 (D)
 
compiler/intel/19.0 compiler/intel/19.1 (L,D)
 
compiler/llvm/10.0 compiler/pgi/2020
 
   
  +
Since Fortran (and very old C++) requires compiling and linking libraries with the very same compiler, many libraries, first-and-foremost the MPI libraries need to be provided for specific versions of a compiler.
Please note, that further libraries, like MPI Libraries are dependent on the compiler and are only
 
visible, if a compiler has been loaded.
+
On bwUniCluster, these provided libraries will only be visible to <kbd>module avail<kbd>, once a compiler is loaded.
  +
Hence, check out loading
Due to this reason, the default system compiler (here '''compiler/gnu/8.3.1''') has it's own module file.
 
 
$ : bwForCluster (Justus)
 
$ module avail compiler
 
---------------------- /opt/bwhpc/common/modulefiles -----------------------
 
compiler/gnu/4.5 compiler/intel/15.0(default)
 
compiler/gnu/4.7(default) compiler/pgi/12.10(default)
 
compiler/gnu/4.8 compiler/pgi/12.10_static
 
compiler/gnu/4.9 compiler/pgi/13.7
 
compiler/gnu/5.2 compiler/pgi/13.7_static
 
compiler/intel/12.1 compiler/pgi/14.10
 
compiler/intel/13.1 compiler/pgi/14.10_static
 
compiler/intel/14.0
 
</pre>
 
 
 
= Loading the module =
 
 
== Default Version ==
 
 
You can load the default version of the a compiler with the command<br>
 
'''module load compiler'''/'''name-of-the-compiler-suite'''.
 
 
<u>Example with Intel on bwUniCluster</u>
 
 
<pre>
 
<pre>
 
$ module avail compiler/intel
 
$ module avail compiler/intel
  +
...
---------------------- /opt/bwhpc/common/modulefiles/Core -----------------------
 
compiler/intel/19.0 compiler/intel/19.1 (L,D)
+
$ module load compiler/intel/2021.4.0
  +
...
 
$ module load compiler/intel
+
$ module avail
$ module list
 
Currently Loaded Modulefiles:
 
1) compiler/intel/19.1(default)
 
 
</pre>
 
</pre>
  +
to see the available modules.
Here, we got the "default" version 19.1 (example).
 
 
The module will try to load modules it needs to function.
 
If loading the module fails, check if you have already loaded the module with ''''module list''''.
 
 
 
== Specific (newer or older) Version ==
 
 
If you wish to load a specific compiler version and release (if available), you can do so using<br>
 
'''module load compiler''/''name-of-the-compiler-suite''/''version-of-the-compiler-suite'''<br>
 
to load the version you desires.
 
 
<u>Example with Intel compiler, version 19.0 on bwUniCluster</u>
 
<pre>
 
$ module avail compiler/intel
 
---------------------- /opt/bwhpc/common/modulefiles -----------------------
 
compiler/intel/19.0 compiler/intel/19.1 (L,D)
 
 
$ module load compiler/intel/19.0
 
$ module list
 
Currently Loaded Modulefiles:
 
1) compiler/intel/19.0
 
</pre>
 
Intel C-Compiler "version 14.0" is loaded now (example).
 
 
   
 
All Intel, GCC and PGI have compilers for different languages which will be available
 
All Intel, GCC and PGI have compilers for different languages which will be available
Line 101: Line 44:
   
   
== Linux Original Compiler ==
+
== Linux Default Compiler ==
   
The original Compiler installed on all compute nodes is GNU.
+
The default Compiler installed on all compute nodes is the GNU Compiler Collection (GCC) or in short GNU compiler.
 
* Don't get distracted with the available compiler modules.
 
* Don't get distracted with the available compiler modules.
 
* Only the modules are loading the complete environments needed.
 
* Only the modules are loading the complete environments needed.
Line 154: Line 97:
 
| gfortran
 
| gfortran
 
|-
 
|-
| style="vertical-align:top;" rowspan="3" | <font color=green><big>PGI</big></font>
+
| style="vertical-align:top;" rowspan="3" | <font color=green><big>LLVM</big></font>
  +
| C
  +
| clang
  +
|-
  +
| C++
  +
| clang++
  +
|-
  +
| Fortran 77/90
  +
| flang
  +
|-
  +
| style="vertical-align:top;" rowspan="3" | <font color=green><big>PGI/NVIDIA</big></font>
 
| C
 
| C
 
| pgcc
 
| pgcc
Line 168: Line 121:
 
== MPI compiler and Underlying Compilers ==
 
== MPI compiler and Underlying Compilers ==
   
  +
MPI implementations such as MPIch, Intel-MPI (derived from MPIch) or Open MPI provide compiler wrappers, easing the usage of MPI by providing the Include-Directory <kbd>-I</kbd> and require libraries for linking.
 
The following table lists available MPI compiler commands and the underlying compilers, compiler families, languages, and application binary interfaces (ABIs) that they support.
 
The following table lists available MPI compiler commands and the underlying compilers, compiler families, languages, and application binary interfaces (ABIs) that they support.
   
Line 210: Line 164:
 
== Commands ==
 
== Commands ==
   
When ''hello.c'' is a C source code file such as
+
The typical introduction is a "Hello World" program. The following C source code shows best practices:
  +
<source lang="c">
<pre>
 
#include <stdio.h>
+
#include <stdio.h> // for printf
  +
#include <stdlib.h> // for EXIT_SUCCESS and EXIT_FAILURE
int main() {
 
  +
int main (int argc, char * argv[]) { // std. definition of a program taking arguments
printf("Hello world\n");
 
  +
printf("Hello World\n"); // Unix Output is line-buffered, end line with New-line.
return 0;
 
  +
return EXIT_SUCCESS; // End program by returning 0 (No Error)
}
 
</pre>
+
}</source>
it can be compiled and linked with the single command
+
It may be compiled and linked with the single command
 
<pre>$ icc hello.c -o hello</pre>
 
<pre>$ icc hello.c -o hello</pre>
 
to produce an executable named ''hello''.
 
to produce an executable named ''hello''.
Line 229: Line 183:
 
</pre>
 
</pre>
 
When using libraries you must sometimes specify where the
 
When using libraries you must sometimes specify where the
* include files are (option '''-I''') and where the
+
* include files are (option <kbd>-I</kbd>) and where the
* library files are (option '''-L''').
+
* library files are (option <kbd>-L</kbd>).
 
In addition you have to tell the compiler which
 
In addition you have to tell the compiler which
* library you want to use (option '''-l''').
+
* library you want to use (option <kbd>-l</kbd>).
 
For example after loading the module numlib/fftw you can compile code for fftw using
 
For example after loading the module numlib/fftw you can compile code for fftw using
 
<pre>
 
<pre>
Line 239: Line 193:
 
</pre>
 
</pre>
 
When the program crashes or doesn't produce the expected output the compiler can
 
When the program crashes or doesn't produce the expected output the compiler can
help you by printing warning messages:
+
help you by printing all warning messages <kbd>-Wall</kbd> and adding flags for debugging <kbd>-g</kbd>:
<pre>$ icc -Wall hello.c -o hello</pre>
+
<pre>$ icc -Wall -g hello.c -o hello</pre>
   
   
Line 248: Line 202:
 
does [[BwHPC_BPG_Debugger|using a debugger]].
 
does [[BwHPC_BPG_Debugger|using a debugger]].
   
<font color=green>To use the debugger properly with your program you have to compile it with debug information (option -g)</font>:
+
<font color=green>To use the debugger properly with your program you have to compile it with debug information (option <kbd>-g</kbd>)</font>:
   
 
<u>Example</u>
 
<u>Example</u>
 
<pre>$ icc -g hello.c -o hello</pre>
 
<pre>$ icc -g hello.c -o hello</pre>
Although -Wall should always be set, the -g option should only be stated when you want
+
Although the compiler option <kbd>-Wall</kbd> (and possibly others) should always be set, the <kbd>-g</kbd> option should only be passed for
  +
debugging purposes to find bugs.
to find bugs, since it may slow down execution and enlarges the binary due
 
to debugging symbols.
+
It may slow down execution and enlarges the binary due to debugging symbols.
   
   
Line 266: Line 220:
   
   
Both compilers offer a multitude of options (with regard to the above and others),
+
All compilers offer a multitude of optimization options,
one may check the complete list of options with short explanation on [[GCC|GCC]] and
+
one may check the complete list of options with short explanation on [[GCC|GCC]], [[LLVM|LLVM]] and
 
[[Intel_Compiler|Intel Suite]] using option '''-v''' '''--help''':
 
[[Intel_Compiler|Intel Suite]] using option '''-v''' '''--help''':
  +
<pre>
<pre>$ icc -v --help hello.c -o hello</pre>
 
  +
$ icc -v --help | less
  +
$ gcc -v --help | less
  +
$ clang -v --help | less
  +
</pre>
   
 
Please note, that the optimization level <kbd>-O2</kbd> produces code for a general instruction set.
 
Please note, that the optimization level <kbd>-O2</kbd> produces code for a general instruction set.
Line 288: Line 246:
 
where <kbd>-mfma</kbd> is the setting for allowing fused-multiply-add.
 
where <kbd>-mfma</kbd> is the setting for allowing fused-multiply-add.
 
These options may provide considerable speed-up to your code as is.
 
These options may provide considerable speed-up to your code as is.
  +
Please note however, that Cascade Lake may throttle the processor's clock speed, when executing AVX-512 instructions, hence running slower than
  +
(older) AVX2 code paths would have.
   
 
For GCC the options in use are best visible by calling <kbd>gcc -O2 -fverbose-asm -S -o hello.S hello.c</kbd>.
 
For GCC the options in use are best visible by calling <kbd>gcc -O2 -fverbose-asm -S -o hello.S hello.c</kbd>.
Line 297: Line 257:
 
while GCC offers this information using <kbd>-fopt-info-all</kbd>
 
while GCC offers this information using <kbd>-fopt-info-all</kbd>
   
  +
== Warnings and Error detection ==
  +
All compilers have improved tremendously with regards to analyzing and detecting suspicious code: do make '''use''' of such warnings and hints.
  +
The amount of false positives has reduced and it will make your code more accessible, less error-prone and more portable.
  +
  +
The typical warning flags are <kbd>-Wall<kbd> to turn on ''all'' warnings.
  +
However, there's multiple other worthwhile warnings, which are not covered (since they might increase false positives, or since they are not yet considered so prominent).
  +
E.g. <kbd>-Wextra</kbd> turns on several other warnings, which will in the above example show that neither <kbd>argc</kbd> nor <kbd>argv</kbd> have been used inside of <kbd>main</kbd>.
  +
  +
For LLVM's <kbd>clang</kbd> the flag <kbd>-Weverything</kbd> turns on all available warnings, albeit leading to many warnings on larger projects.
  +
However, the fix-it hints are very helpful as well.
  +
  +
[[File:static_code_analysis.png|right|border|300px|Copyright: HS Esslingen)]]
  +
Another powerful feature available in GNU- and LLVM-compilers are static code analysis, otherwise only available in Commercial tools, like [https://www.synopsys.com/software-integrity/security-testing/static-analysis-sast.html|Coverity].
  +
Static code analysis evaluates '''each and every''' code path, making assumptions on input values and branches taken, detecting corner cases which might lead to real errors -- without having to actually execute this code path.
  +
For GCC this is turned on using <kbd>-fanalyzer</kbd> which will detect e.g. cases of memory usage after a <kbd>free()</kbd> of said memory and many others.
  +
For LLVM recompile your project using <kbd>scan-build</kbd>, e.g.:
  +
<pre>
  +
$ scan-build make
  +
</pre>
  +
  +
This produces warnings on <kbd>stdout</kbd>, but more importantly scan reports in directory <kbd>/scratch/scan-build-XXX</kbd>, where XXX is date and time of the build.
  +
For example the output of Open MPI includes real issues of missed memory releases in error code paths:
   
   

Revision as of 21:58, 23 February 2022

Description Content
module load compiler/gnu or compiler/intel or compiler/llvm and others...
License Intel: Commercial | GNU: GPL | LLVM: Apache 2 | PGI/NVIDIA: Commercial


1 Description

Basically, compilers translate human-readable source code (e.g. C++ interpreted as adhering to ISO/IEC 14882:2014, encoded in UTF-8 text) into binary byte code (e.g. x86-64 with Linux ABI in ELF-format). Compilers are complex software and have become very powerful in the last decades, to guide you as a programmer writing better, more portable, more performant programs. Use the compiler as a tool -- and best use multiple compilers on the same source code for best results. The basic operations and hints can be performed with the same or similar commands on all available compilers. For advanced usage such as optimization and profiling you should consult the best practice guide of the compiler you intend to use (GCC, Intel Suite).

More information about the MPI versions of the GNU and Intel Compilers is available here:


2 Loading compilers as modules

Modules and loading of modules is described [Software_Modules_Lmod|here for Lmod] and [Environment_Modules|here for traditional Environment Modules].

However, modules need to be mentioned, since on any system there's a pre-installed set of compilers (for C, C++ and usually Fortran), which are provided by the Linux distribution -- the so-called system compilers. Which however may lack certain Options for optimization, for warnings or other features. On RedHat Enterprise Linux this is GNU compiler v8.3.1. Be advised to check out the newer compilers available as modules.

Since Fortran (and very old C++) requires compiling and linking libraries with the very same compiler, many libraries, first-and-foremost the MPI libraries need to be provided for specific versions of a compiler. On bwUniCluster, these provided libraries will only be visible to module avail, once a compiler is loaded. Hence, check out loading

$ module avail compiler/intel
...
$ module load compiler/intel/2021.4.0
...
$ module avail

to see the available modules.

All Intel, GCC and PGI have compilers for different languages which will be available after the module is loaded.


2.1 Linux Default Compiler

The default Compiler installed on all compute nodes is the GNU Compiler Collection (GCC) or in short GNU compiler.

  • Don't get distracted with the available compiler modules.
  • Only the modules are loading the complete environments needed.

Example

$ module purge                     # unload all modules
$ module list                      # control
No Modulefiles Currently Loaded.
$ gcc --version                    # see version of default Linux GNU compiler
gcc (GCC) 8.3.1 20191121 (Red Hat 8.3.1-5)
[...]
$ module load compiler/gnu         # load default GNU compiler module
$ module list                      # control
Currently Loaded Modulefiles:
  1) compiler/gnu/10.2(default)
$ gcc --version                    # now, check the current (loaded) module
gcc (GCC) 10.2.0
[...]


3 Synoptical Tables

3.1 Compilers (no MPI)

Compiler Suite Language Command
Intel Composer

• Best Practice Guides on Intel Compiler Software
C icc
C++ icpc
Fortran ifort
GCC

• Best Practice Guides on GNU Compiler Software
C gcc
C++ g++
Fortran gfortran
LLVM C clang
C++ clang++
Fortran 77/90 flang
PGI/NVIDIA C pgcc
C++ pgCC
Fortran 77/90 pgf77 or pgf90


3.2 MPI compiler and Underlying Compilers

MPI implementations such as MPIch, Intel-MPI (derived from MPIch) or Open MPI provide compiler wrappers, easing the usage of MPI by providing the Include-Directory -I and require libraries for linking. The following table lists available MPI compiler commands and the underlying compilers, compiler families, languages, and application binary interfaces (ABIs) that they support.

MPI Compiler Command Default Compiler Supported Language(s) Supported ABI's
Generic Compilers
mpicc gcc, cc C 32/64 bit
mpicxx g++ C/C++ 32/64 bit
mpifc gfortran Fortran77/Fortran 95 32/64 bit
GNU Compiler Versions 3 and higher
mpigcc gcc C 32/64 bit
mpigxx g++ C/C++ 32/64 bit
mpif77 g77 Fortran 77 32/64 bit
mpif90 gfortran Fortran 95 32/64 bit
Intel Fortran, C++ Compilers Versions 13.1 through 14.0 and Higher
mpiicc icc C 32/64 bit
mpiicpc icpc C++ 32/64 bit
impiifort ifort Fortran77/Fortran 95 32/64 bit


4 How to use

The following compiler commands work for all the compilers in the list above even though the examples will be for icc only.

4.1 Commands

The typical introduction is a "Hello World" program. The following C source code shows best practices:

#include <stdio.h>                   // for printf
#include <stdlib.h>                  // for EXIT_SUCCESS and EXIT_FAILURE
int main (int argc, char * argv[]) { // std. definition of a program taking arguments
    printf("Hello World\n");         // Unix Output is line-buffered, end line with New-line.
    return EXIT_SUCCESS;             // End program by returning 0 (No Error)
}

It may be compiled and linked with the single command

$ icc hello.c -o hello

to produce an executable named hello.


This process can be divided into two steps:

$ icc -c hello.c
$ icc hello.o -o hello

When using libraries you must sometimes specify where the

  • include files are (option -I) and where the
  • library files are (option -L).

In addition you have to tell the compiler which

  • library you want to use (option -l).

For example after loading the module numlib/fftw you can compile code for fftw using

$ icc -c hello.c -I$FFTW_INC_DIR
$ icc hello.o -o hello -L$FFTW_LIB_DIR -lfftw3

When the program crashes or doesn't produce the expected output the compiler can help you by printing all warning messages -Wall and adding flags for debugging -g:

$ icc -Wall -g hello.c -o hello


4.2 Debugger

If the problem can't be solved this way you can inspect what exactly your program does using a debugger.

To use the debugger properly with your program you have to compile it with debug information (option -g):

Example

$ icc -g hello.c -o hello

Although the compiler option -Wall (and possibly others) should always be set, the -g option should only be passed for debugging purposes to find bugs. It may slow down execution and enlarges the binary due to debugging symbols.


4.3 Optimization

The usual and common way to compile your source is to apply compiler optimization.

Since there are many optimization options, as a start for now the optimization level -O2 is recommended:

$ icc -O2 hello.c -o hello

Beware: The optimization-flag used is a capital-O (like Otto) and not a 0 (Zero)!


All compilers offer a multitude of optimization options, one may check the complete list of options with short explanation on GCC, LLVM and Intel Suite using option -v --help:

$ icc -v --help | less
$ gcc -v --help | less
$ clang -v --help | less

Please note, that the optimization level -O2 produces code for a general instruction set. If You want to set the instruction set available, and take advantage of AVX2 or AVX512f, You have to either add the machine-dependent -mavx512f or set the specific architecture of your target processor. For BwUniCluster_2.0 this depends on whether You run your application on any node, then You would select the older Broadwell CPU, or whether You target the newer HPC nodes (which feature Xeon Gold 6230, aka "Cascade Lake" architecture.

$ gcc -O2 -o hello hello.c                        # General optimization for any architecture
$ gcc -O2 -march=broadwell -o hello hello.c       # Will work on any compute node on bwUniCluster 2.0
$ gcc -O2 -march=cascadelake -o hello hello.c     # This may not run on Broadwell nodes

While adding -march=broadwell adds the compiler options such as -mavx -mavx2 -msse3 -msse4 -msse4.1 -msse4.2 -mssse3, adding -march=cascadelake will further this by -mavx512bw -mavx512cd -mavx512dq -mavx512f -mavx512vl -mavx512vnni -mfma, where -mfma is the setting for allowing fused-multiply-add. These options may provide considerable speed-up to your code as is. Please note however, that Cascade Lake may throttle the processor's clock speed, when executing AVX-512 instructions, hence running slower than (older) AVX2 code paths would have.

For GCC the options in use are best visible by calling gcc -O2 -fverbose-asm -S -o hello.S hello.c. The option -fverbose-asm stores all the options in the assembler file hello.S.

You should then pay attention to vectorization attained by the compiler -- and concentrate on the time-consuming loops, where the compiler was not able to vectorize. This information is available with the Intel compiler using -qopt-report=5 producing a lot of output in hello.optrpt, while GCC offers this information using -fopt-info-all

4.4 Warnings and Error detection

All compilers have improved tremendously with regards to analyzing and detecting suspicious code: do make use of such warnings and hints. The amount of false positives has reduced and it will make your code more accessible, less error-prone and more portable.

The typical warning flags are -Wall to turn on all warnings. However, there's multiple other worthwhile warnings, which are not covered (since they might increase false positives, or since they are not yet considered so prominent). E.g. -Wextra turns on several other warnings, which will in the above example show that neither argc nor argv have been used inside of main.

For LLVM's clang the flag -Weverything turns on all available warnings, albeit leading to many warnings on larger projects. However, the fix-it hints are very helpful as well.

Copyright: HS Esslingen)

Another powerful feature available in GNU- and LLVM-compilers are static code analysis, otherwise only available in Commercial tools, like [1]. Static code analysis evaluates each and every code path, making assumptions on input values and branches taken, detecting corner cases which might lead to real errors -- without having to actually execute this code path. For GCC this is turned on using -fanalyzer which will detect e.g. cases of memory usage after a free() of said memory and many others. For LLVM recompile your project using scan-build, e.g.:

$ scan-build make

This produces warnings on stdout, but more importantly scan reports in directory /scratch/scan-build-XXX, where XXX is date and time of the build. For example the output of Open MPI includes real issues of missed memory releases in error code paths: