I ran into some problems when using the ‘openmpi’ package from ‘devel:libraries:c_c++’ on openSUSE 11.4 x86_64.
I recently started using the OBS to compile and package some CFD/visualization software (see my repo [1]). Since I wanted a fairly recent ‘openmpi’ version, preferably the same one on 11.4 and 12.1, I use the ‘openmpi’ package from ‘devel:libraries:c_c++’ (that is, I aggregated the ‘openmpi’ package into my repository).
Now the problem: when I use the software, i.e. run an MPI job with ‘mpirun’, I get errors (see the end of this mail) and the job crashes. The problem disappeared when I installed the ‘openmpi-devel’ package.
Now my question:
a) Am I making a mistake when compiling the software,
b) do I always have to install the ‘openmpi-devel’ package to use openmpi, or
c) is there a bug in the ‘openmpi’ package?
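Since the log below fails in orte_ess_base_select, one thing that might help narrow this down is checking whether the ESS plugins are present at all without ‘openmpi-devel’ installed. Here is a minimal sketch in Python. It rests on two assumptions I have not verified for this package: that the plugins are files named mca_ess_*.so, and that they live in a directory like the one hard-coded below. The function name find_ess_components is only for illustration.

```python
import glob
import os


def find_ess_components(plugin_dir):
    """Return the sorted file names of ESS plugins found in plugin_dir.

    Assumes plugins follow the naming scheme mca_ess_*.so.
    """
    pattern = os.path.join(plugin_dir, "mca_ess_*.so")
    return sorted(os.path.basename(path) for path in glob.glob(pattern))


if __name__ == "__main__":
    # Assumed plugin directory; adjust it to the actual installation.
    plugin_dir = "/usr/lib64/mpi/gcc/openmpi/lib64/openmpi"
    components = find_ess_components(plugin_dir)
    if components:
        print("ESS components found:", ", ".join(components))
    else:
        print("No ESS components found in", plugin_dir)
```

Running this once with only ‘openmpi’ installed and once with ‘openmpi-devel’ added should show whether the plugin files (or links to them) are what the devel package contributes.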
#################################
[kolmogorov:11597] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file runtime/orte_init.c at line 125
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here’s some additional information (which may only be relevant to an
Open MPI developer):
orte_ess_base_select failed
→ Returned value Not found (-13) instead of ORTE_SUCCESS
It looks like MPI_INIT failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during MPI_INIT; some of which are due to configuration or environment
problems. This failure appears to be an internal failure; here’s some
additional information (which may only be relevant to an Open MPI
developer):
ompi_mpi_init: orte_init failed
→ Returned “Not found” (-13) instead of “Success” (0)
*** The MPI_Init() function was called before MPI_INIT was invoked.
*** This is disallowed by the MPI standard.
*** Your MPI job will now abort.
[kolmogorov:11597] Abort before MPI_INIT completed successfully; not able to guarantee that all other processes were killed!
########################
Case 2: OpenFOAM
########################
[kolmogorov:09951] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file runtime/orte_init.c at line 125
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here’s some additional information (which may only be relevant to an
Open MPI developer):
orte_ess_base_select failed
→ Returned value Not found (-13) instead of ORTE_SUCCESS
[kolmogorov:09951] [[INVALID],INVALID] ORTE_ERROR_LOG: Not found in file orterun.c at line 543