Overview

Request 992639 accepted

- update to 4.1.4:
* Fix possible length integer overflow in numerous non-blocking collective
operations.
* Fix segmentation fault in UCX if MPI Tool interface is finalized before
MPI_Init is called.
* Remove /usr/bin/python dependency in configure.
* Fix OMPIO issue with long double etypes.
* Update treematch topology component to fix numerous correctness issues.
* Fix memory leak in UCX MCA parameter registration.
* Fix long operation closing file descriptors on non-Linux systems that
can appear as a hang to users.
* Fix for attribute handling on GCC 11 due to pointer aliasing.
* Fix multithreaded race in UCX PML's datatype handling.
* Fix a correctness issue in CUDA Reduce algorithm.
* Fix compilation issue with CUDA GPUDirect RDMA support.
* Fix to make shmem_calloc(..., 0) conform to the OpenSHMEM specification.
* Add UCC collectives component.
* Fix divide by zero issue in OMPI IO component.
* Fix compile issue with libnl when not in standard search locations.
* Fixed a seg fault in the smcuda BTL. Thanks to Moritz Kreutzer and
@Stadik for reporting the issue.
* Added support for ELEMENTAL to the MPI handle comparison functions
in the mpi_f08 module. Thanks to Salvatore Filippone for raising
the issue.
* Minor datatype performance improvements in the CUDA-based code paths.
* Fix MPI_ALLTOALLV when used with MPI_IN_PLACE.
* Fix MPI_BOTTOM handling for non-blocking collectives. Thanks to
Lisandro Dalcin for reporting the problem.
* Enable OPAL memory hooks by default for UCX.
* Many compiler warnings fixes, particularly for newer versions of

Loading...

Request History
Dirk Mueller's avatar

dirkmueller created request

- update to 4.1.4:
* Fix possible length integer overflow in numerous non-blocking collective
operations.
* Fix segmentation fault in UCX if MPI Tool interface is finalized before
MPI_Init is called.
* Remove /usr/bin/python dependency in configure.
* Fix OMPIO issue with long double etypes.
* Update treematch topology component to fix numerous correctness issues.
* Fix memory leak in UCX MCA parameter registration.
* Fix long operation closing file descriptors on non-Linux systems that
can appear as a hang to users.
* Fix for attribute handling on GCC 11 due to pointer aliasing.
* Fix multithreaded race in UCX PML's datatype handling.
* Fix a correctness issue in CUDA Reduce algorithm.
* Fix compilation issue with CUDA GPUDirect RDMA support.
* Fix to make shmem_calloc(..., 0) conform to the OpenSHMEM specification.
* Add UCC collectives component.
* Fix divide by zero issue in OMPI IO component.
* Fix compile issue with libnl when not in standard search locations.
* Fixed a seg fault in the smcuda BTL. Thanks to Moritz Kreutzer and
@Stadik for reporting the issue.
* Added support for ELEMENTAL to the MPI handle comparison functions
in the mpi_f08 module. Thanks to Salvatore Filippone for raising
the issue.
* Minor datatype performance improvements in the CUDA-based code paths.
* Fix MPI_ALLTOALLV when used with MPI_IN_PLACE.
* Fix MPI_BOTTOM handling for non-blocking collectives. Thanks to
Lisandro Dalcin for reporting the problem.
* Enable OPAL memory hooks by default for UCX.
* Many compiler warnings fixes, particularly for newer versions of


Nicolas Morey-Chaisemartin's avatar

NMoreyChaisemartin accepted request

openSUSE Build Service is sponsored by