If an error like the following occurs:
====================================================================================================================
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
--------------------------------------------------------------------------
The OpenFabrics (openib) BTL failed to initialize while trying to
allocate some locked memory. This typically can indicate that the
memlock limits are set too low. For most HPC installations, the
memlock limits should be set to "unlimited". The failure occured
here:
Local host: compute-0-10.local
OMPI source: btl_openib_component.c:1066
Function: ompi_free_list_init_ex_new()
Device: mlx4_0
Memlock limit: 32768
You may need to consult with your system administrator to get this
problem fixed. This FAQ entry on the Open MPI web site may also be
helpful:
http://www.open-mpi.org/faq/?category=openfabrics#ib-locked-pages
--------------------------------------------------------------------------
--------------------------------------------------------------------------
WARNING: There was an error initializing an OpenFabrics device.
Local host: compute-0-10.local
Local device: mlx4_0
--------------------------------------------------------------------------
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
libibverbs: Warning: RLIMIT_MEMLOCK is 32768 bytes.
This will severely limit memory registrations.
[compute-0-10.local:02981] 7 more processes have sent help message help-mpi-btl-openib.txt / init-fail-no-mem
[compute-0-10.local:02981] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages
[compute-0-10.local:02981] 7 more processes have sent help message help-mpi-btl-openib.txt / error in device init
====================================================================================================================
1. Set the memlock limits to unlimited in limits.conf
# vi /etc/security/limits.conf
* soft memlock unlimited
* hard memlock unlimited
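
To confirm the new limit, a small check like the one below may help. It is only a sketch: the function name check_memlock is made up for this note, and it assumes bash. Keep in mind that `ulimit -l` reports KiB, so the 32768 bytes shown in the log above appears as 32, and that limits.conf is normally applied by PAM at login, so the check is best run from a fresh login session on the compute node.

```shell
#!/bin/bash
# check_memlock VALUE (hypothetical helper): classify a value as printed
# by `ulimit -l`, which is either "unlimited" or a size in KiB.
check_memlock() {
    if [ "$1" = "unlimited" ]; then
        echo "OK: memlock is unlimited"
    else
        echo "LOW: memlock is $1 KiB"
    fi
}

# Check the soft and hard limits of the current shell.
check_memlock "$(ulimit -S -l)"
check_memlock "$(ulimit -H -l)"
```

If either line reports LOW after a new login, the limits.conf entries have probably not been picked up on that node.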
2. To use mpirun inside an SGE session, also set H_MEMORYLOCKED in the global configuration
# qconf -mconf
======================================================
execd_params H_MEMORYLOCKED=infinity
======================================================
root@ngscluster1.local modified "global" in configuration list
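
To verify that the setting was saved, the global configuration can be printed with `qconf -sconf` and searched for the parameter. The snippet below is a sketch under assumptions: has_memlock_param is a made-up helper name, bash is assumed, and the pattern expects the execd_params line to look the way it was entered above.

```shell
#!/bin/bash
# has_memlock_param (hypothetical helper): read SGE configuration text on
# stdin and report whether execd_params contains H_MEMORYLOCKED=infinity.
has_memlock_param() {
    if grep -Eq '^execd_params[[:space:]].*H_MEMORYLOCKED=infinity'; then
        echo "set"
    else
        echo "missing"
    fi
}

# Query the cluster only when the SGE client tools are installed.
if command -v qconf >/dev/null 2>&1; then
    qconf -sconf | has_memlock_param
fi
```

Submitting a trivial job that runs `ulimit -l` is another way to confirm the limit that processes actually see inside an SGE session.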