You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
LAMMPS, AthenaK, XGC, and other Kokkos-based applications using versions starting with 4.2.00 in Nov 2023 are affected by an incompatibility with Cray MPICH (based on an older UCX) of the new default option:
-DKokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC=ON
A runtime error is thrown by CUDA-aware Cray MPICH if you try to use Kokkos with that option enabled:
(GTL DEBUG: 2) cuIpcGetMemHandle: invalid argument, CUDA_ERROR_INVALID_VALUE, line no 148
I assume the 3x prebuilt Kokkos modules were all compiled with that option disabled @zippylab ?
The kokkos/4.3.01_shared_PEg8.5.0_cv12.3_ct12.2.91 module build was built with cmake flag -DKokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC=OFF
The two kokkos/4.2.01 module builds were built with the default, -DKokkos_ENABLE_IMPL_CUDA_MALLOC_ASYNC=ON. I believe those will have trouble if you enable GPU-aware MPICH.
LAMMPS, AthenaK, XGC, and other Kokkos-based applications using versions starting with 4.2.00 in Nov 2023 are affected by an incompatibility with Cray MPICH (based on an older UCX) of the new default option:
A runtime error is thrown by CUDA-aware Cray MPICH if you try to use Kokkos with that option enabled:
I assume the 3x prebuilt Kokkos modules were all compiled with that option disabled @zippylab ?
There is a discussion to potentially revert the change to the default in 4.5.x
kokkos/kokkos#7353
The text was updated successfully, but these errors were encountered: