We installed Code_Saturne 7.0.0 on CENTOS 7 and everything is working just fine when running in serial mode.
The problem happens when the case is launched on multiple cores :
1 - On the GUI terminal, I had the following error (cf code_saturne_error).
2 - I tried changing the mpi module that is loaded by adding
setenv OMPI_MCA_pml “ucx”
setenv OMPI_MCA_btl “^vader,tcp,openib”
Results : I don’t have any error, but there is no listing file. Just the run.cfg which tells that I ran on 2 procs, the setup.xml and the summary file.
Unfortunately, there is no function in Scotch that I know of (unless I missed recent updates) for querying whether Scotch was build with threading support or not, and adapt the MPI_Initialize method accordingly. And since this can degrade performance on may MPI implementations, I would rather lose a bit of performance with unthreaded Scotch one per run rather than in the code_saturne algorithms for the whole run…
On the cluster, is the post-install configured to use SLURM ? Do you have any messages in the slurm job logs ?
We did both : with one built on our own, then with one installed with the semi-automatic installation (the library installed with the python script were used for the compilation of code_saturne).
In each case, we don’t have any error when running in multi-procs, but there is no listing nor error in the RESU folder.
I tried compile it without the SCOTCH library, just with parametis, and the same thing occurs.
I really don’t know why there isn’t error messages anymore.
With slurm, I add the “code_saturne.cfg” file and just modified “batch = SLURM”. It works fine on one proc, but nothing on 2 or more.
I have no listing file, nothing written in the error slurm log, and here is what I get on the output log file :