Revisions of libfabric

buildservice-autocommit accepted request 1034518 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 82)
baserev update by copy to link target
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 1034517 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 81)
- Add prov-net-fix-error-path-in-xnet_enable_rdm.patch to fix a deadlock
  when no network interfaces are available (bsc#1205139)
buildservice-autocommit accepted request 1012024 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 80)
baserev update by copy to link target
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 1012023 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 79)
- Update to 1.16.1
  - Core
    - Fix windows implementation to remove fd from poll set
  - PSM3
    - Add missing files to release tarball
  - Util
    - Handle NULL address insertion to fi_av_insert
- Drop prov-rxm-Disable-128-bit-atomics.patch which was merged upstream

buildservice-autocommit accepted request 1008574 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 78)
baserev update by copy to link target
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 1008573 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 77)
- Add prov-rxm-Disable-128-bit-atomics.patch to fix a potential
  segfault on misaligned buffers.
buildservice-autocommit accepted request 1007632 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 76)
baserev update by copy to link target
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 1007631 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 75)
- Update to 1.16.0 (jsc#PED-351, jsc#PED-190)
  - Core
    - Added HMEM IPC cache
    - Use exact string comparison checks for network interfaces
    - Restructuring of poll/epoll abstraction
    - Add ability to disable locks completely in debug builds
    - Serialize access to modifying the logging calls
    - Minor fixes to fi_tostr text formatting
    - Add hmem interface checks to memory registration
  - EFA
    - Added support of Synapse AI memory.
    - Improved error message
  - Net
    - Temporarily forked, optimized version of tcp provider
    - Focused on improved performance and scalability over tcp sockets
    - Fork ensures tcp provider stability while net provider is developed
    - Shares the tcp provider protocol and base implementation for msg endpoints
    - Integrates direct support for rdm endpoints, using a derivative from rxm
    - Implements own protocol for rdm endpoints, separate from rxm;tcp
  - OPX
    - Added initial support for SDMA
    - General performance enhancements
    - Performance improvements to reliability protocol
    - Improved deferred work pending complete
    - Added support for OPX_AV=runtime
    - Support iov memory registration ops
    - Added DAOS RPC support
    - Atomic ops enhancements
    - Improved documentation
    - Debug build enhancements
    - Fixed compiler warnings
    - Reduced time to compile prov/opx code
    - General bug fixes
    - Fixed PSN wrapping scaling
    - Added intranode fence
    - Addressed bugs discovered by coverity scan
  - PSM2
    - Fix sending CQ data in some instances of fi_tsendmsg
  - PSM3
    - Updated to match Intel Ethernet Fabric Suite (IEFS) 11.3 release
  - RxM
    - Update to read multiple completions at once from msg provider
    - Move RxM AV implementation to util code to share with net provider
    - Minor code cleanups
  - SHM
    - Implement and use ipc_cache
    - Add log messages for debugging and error tracking
    - Fix check for FI_MR_HMEM mr_mode
    - Move shm signal handlers initialization to EP
    - Added log messages for errors detected
  - TCP
    - Fix incorrect signaling of the CQ
    - Increase max number of poll events to retrieve
    - Acquire ep lock prior to flushing socket in shutdown
    - Verify ep state prior to progressing socket data
    - Read cm error data when receiving connreq response
    - Log error on connect failure
    - Fix assertion failure in CQ progress function
  - Util
    - Fix text in log of UFFD ioctl failure
    - Introduce cuda ipc monitor
    - Fix CQ memory leak handling overflow
    - Fix MR mode bit check for ver 1.5 and greater
    - Add max_array_size to track/check array overflow
    - Always progress transfers when reading from a CQ
    - Handle NULL address insertion
    - Try IPv4 before IPv6 addresses when starting name server
    - Fix IP util av default address length
    - Fix util IP getinfo path to read hints->addr_format
    - Fix debug print mismatch
    - Fix return code when memory allocation fails.
    - Fix build sign warning in ofi_bufpool_region_alloc
    - Minor code cleanups
    - Print warning if an addr is inserted into an AV again
  - Verbs
    - Fix support of FI_SOCKADDR_IB when requested by the application
    - Ensure all posted receives are flushed to the application
    - Update ofi_mr_cache_search API for hmem IPC support
    - Reduce logging verbosity for "no active ports"
    - Fix incorrect length used in memory registration
    - Various minor bug fixes for test failures
    - Fix a memory leak getting IB address
    - Implement verbs provider on Windows over NetworkDirect API
    - Set and check address format correctly
    - Only close qp if it was initialized
    - Portable detection of loopback device
  - Fabtests
    - multi_ep: Separate EP resources and fix MR registration
    - multi_recv: Fix possible crash and check for valid buffer
    - unexpected_msg: Fix printf compiler warning
    - dgram_pingpong.c: Use out-of-band sync
    - multinode: Make multinode tests platform agnostic, fix formatting
    - ubertest: Fix string comparison to include length, fix writedata completion check
    - av_test: add support for -e <ep_type>
    - New tests:
      - dmabuf-rdma: Component level test for dma-buf RDMA
      - sock_test: Component level performance test of poll, epoll, and select
      - rdm_stress: Multi-threaded, multi-process stress test for RDM endpoints
      - sighandler_test: Regression test for signal handler restoration
- Drop patches fixed upstream:
  - prov-opx-Correctly-disable-OPX-if-unsupported.patch
  - disable-flatten-attr.patch
buildservice-autocommit accepted request 998811 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 74)
baserev update by copy to link target
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 998810 from Martin Liška (marxin) (revision 73)
- Add disable-flatten-attr.patch, which drops the flatten attribute.
  The flatten attribute causes a huge compile-time cost in the
  inliner, and the resulting binary would likewise be huge.
- Use %make_build and enable LTO (boo#1133235).
- Synchronize used Patches.

buildservice-autocommit accepted request 989962 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 72)
baserev update by copy to link target
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 989191 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 71)
- Update to 1.15.1
  - Core
    - Fix fi_info indentation error in fi_tostr
    - hmem_ze: Add runtime option to choose specific copy engine
    - Cleanup of configure HMEM checks
    - Fixed stringop-truncation in ofi_ifaddr_get_speed
    - Add utility provider log suffix to make logs easier to read
    - Fix truncation of ipv6 addressing
    - hmem: add support for AWS Trainium devices
    - Fix potential sscanf overflows
    - hmem: pass through device and flags when querying memory interface
    - Rework locking in several areas to convert spinlocks to mutexes
    - Add new locking abstractions to select lock types at runtime
    - Add new FI_PROTO_RXM_TCP for optimized rxm over tcp path
    - Fix windows implementation to remove fd from poll set
  - EFA
    - Added windows support through efawin (https://github.com/aws/efawin)
    - Added support of AWS neuron.
    - Added support of using gdrcopy to copy data from host to device.
    - Fixed a bug that caused 0 byte read to fail.
    - Fixed a memory corruption issue that can cause a forked process to crash.
    - Extended testing coverage through new pytest based testing framework.
  - HOOKS
    - Add new hooking provider dmabuf_peer_mem
    - Enable DL build of hooking providers
    - Add HMEM memory registration hook
  - OPX
    - New provider supporting Cornelis Networks Omni-path hardware
  - PSM3
    - Updated psm3 to match IEFS 11.2.0.0 release
    - Added support for sockets (TCP/UDP) via a runtime selectable Hardware
      Abstraction Layer (HAL)
    - Added support for IPv6 addressing in RoCE and sockets
    - Added various NIC selection filtering options (wildcarded NIC name,
      address format, wildcarded IP subnet, link speed)
    - Performance tuning in conjunction with OneAPI and OneCCL
    - Improved PSM3_IDENTIFY output
    - Rename most internal symbols to psm3_
    - Corrected vulnerabilities found during Coverity scans
    - configure options refined and help text improved
    - PSM3_MULTI_EP has been deprecated (recommend always enabled, default
      is enabled [same default as previous releases])
    - Various bug fixes
  - RxM
    - Add check that atomic size is valid
    - Add support to passthru calls to tcp provider in specific
  - TCP
    - Add assert to verify RMA source/target msg sizes match
    - Wake-up threads blocked on CQ to update their poll events
    - Fix use of incorrect events in progress handler
    - Fixes for various compile warnings, mostly on Windows
    - Add support for FI_RMA_EVENT capability
    - Add support for completion counters
    - Fix check for CQ data in tagged messages
    - Add cancel support to shared rx context
    - Add src_addr receive buffer matching
    - Add provider control to assign a src_addr with an ep
    - Handle trecv with FI_PEEK flag
    - Allow binding a CQ with an SRX
    - Restructuring of code in source files
    - Handle EWOULDBLOCK returned by send call
    - Add hot (active) pollfd
  - SHM
    - Properly chain the original signal handlers
    - Avoid uninitialized variable with invalid atomic parameters
    - Fix 0 byte SAR read
    - Initialize len parameter to accept
    - Refactor and simplify protocol code
    - Remove broken support for 128-bit atomics
    - Fix FI_INJECT flag support
    - Add assert to verify RMA source/target msg sizes match
    - Set domain threading to thread safe
    - Fix possible use of uninitialized var in av_insert
  - Util
    - Fix sign warning in ofi_bufpool_region_alloc
    - Remove unused variable from ofi_bufpool_destroy
    - Fix check for valid datatype in ofi_atomic_valid
    - Return with error if util_coll_sched_copy fails
    - Fix use of uninitialized variable in ofi_ep_allreduce
    - Fix memory access in ip_av_insertsym
    - Track ep per collective operation not with multicast
    - Restructure collective av set creation/destruction
    - Change most locks from spin locks to mutexes
    - Allow selection of spinlocks for CQ and domain objects
    - Fix AV default addrlen
    - Update fi_getinfo checks to include hints->addr_
    - Handle NULL address insertion to fi_av_insert
  - Verbs
    - Initial changes for compiling on Windows (via NetworkDirect)
    - Add a failover path to dma-buf based memory registration
    - Replace use of spin locks with mutexes
    - Check for valid qp prior to cleanup
    - Set and check for correct address format in fi_getinfo
  - Fabtests
    - hmem_cuda: use device-allocated host buffer to fill device buffer
    - Add python scripts to control test execution
    - test_configs: include util provider in core config file
    - Add option "--pin-core"
    - Only call nrt_init once
    - Fix a bug in ft_neuron_cleanup
    - Correct help for unit test programs
    - Remove duplicate help prints from fi_mcast
    - configure.ac: fix --enable-debug=no not properly detected
    - msg_inject: handle the case where ft_tsendmsg returns -FI_EAGAIN
    - Add AWS Trainium device support
    - fi_inj_complete: Add FI_INJECT to fabtests
    - inj_complete.c: Make arguments align with the other tests
    - dgram_pingpong: handle the error return of fi_recv
    - recv_cancel: Remove requirement for unexpected msg handling
    - poll: Fix crash if unable to allocate pollset
    - ubertest: Add GPU testing and validation support
    - Add HMEM options parsing support
    - Update and re-enable fi_multi_ep test
- Add prov-opx-Correctly-disable-OPX-if-unsupported.patch to disable
  OPX compilation on non x86_64 systems
buildservice-autocommit accepted request 971080 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 70)
baserev update by copy to link target
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 971079 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 69)
- Update to 1.14.1
  - Core
    - Use non-shared memory allocations to use MADV_DONTFORK safely
    - Fix incorrect use of gdr_copy_from_mapping
    - Ensure proper timeout time for pollfds to avoid early exit
  - EFA
    - Handle read completion properly for multi_recv
    - Use shm's inject write when possible
    - Support 0 byte read
  - RxM
    - Ensure signaling the CQ fd after writing completion
    - Fix inject path for sending tagged messages with cq data
    - Negotiate credit based flow control support over CM
    - Add PID to CM messages to detect stale vs duplicate connections
    - Fix race handling unexpected messages from unknown peers
    - Fix possible leak of stack data in cm_accept
    - Restrict reported caps based on core provider
    - Delay starting listen until endpoint fully initialized
    - Verify valid atomic size
  - Sockets
    - Fix coverity reports on uninitialized data
    - Check for NULL pointers passed to memcpy
    - Add missing error return code from sock_ep_enable
  - TCP
    - Fix performance regression resulting from sparse pollfd sets
    - Fix assertion failure in CQ progress function
    - Do not generate error completions for inject msgs
    - Fix use of incorrect event names in progress handler
    - Fix check for CQ data in tagged messages
    - Make start_op array a static to reduce memory
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 932983 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 68)
- Update to 1.14.0
  - Add time stamps to log messages
  - Fix gdrcopy calculation of memory region size when aligned
  - Allow user to disable use of p2p transfers
  - Update fi_tostr to print FI_SHARED_CONTEXT text instead of value
  - Update fi_tostr to output field names matching header file names
  - Fix narrow race condition in ofi_init
  - Add new fi_log_sparse API to rate limit repeated log output
  - Define memory registration for buffers used for collective operations
  - EFA, SHM, TCP, RXM, and verbs fixes
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 928694 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 66)
- Update to 1.13.2
  - Sort DL providers to ensure consistent load ordering
  - Update hooking providers to handle fi_open_ops calls to avoid crashes
  - Replace cassert with assert.h to avoid C++ headers in C code
  - Enhance serialization for memory monitors to handle external monitors
  - EFA, SHM, TCP, RxM and verbs fixes
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 917134 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 65)
- Update to 1.13.1
  - Enable loading ZE library with dlopen()
  - Add IPv6 support to fi_pingpong
  - EFA, PSM3 and SHM fixes
Nicolas Morey-Chaisemartin (NMoreyChaisemartin) accepted request 905235 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 64)
- Update to 1.13.0
  - Fix behavior of fi_param_get parsing an invalid boolean value
  - Add new APIs to open, export, and import specialized fid's
  - Define ability to import a monitor into the registration cache
  - Add API support for INT128/UINT128 atomics
  - Fix incorrect check for provider name in getinfo filtering path
  - Allow core providers to return default attributes which are lower than
    maximum supported attributes in getinfo call
  - Add option to prefer external providers (in order discovered) over
    internal providers, regardless of provider version
  - Separate Ze (level-0) and DRM dependencies
  - Always maintain a list of all discovered providers
  - Fix incorrect CUDA warnings
  - Fix bug in cuda init/cleanup checking for gdrcopy support
  - Change the order in which providers are called in fi_getinfo: move psm2
    ahead of psm3 and efa ahead of psmX
  - See NEWS.md for changelog
buildservice-autocommit accepted request 882724 from Nicolas Morey-Chaisemartin (NMoreyChaisemartin) (revision 63)
baserev update by copy to link target