Simple Linux Utility for Resource Management
SLURM is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for Linux clusters
containing up to 65,536 nodes. Components include machine status, partition management, job management, scheduling and accounting modules.
- Developed at network:cluster
- Sources inherited from project openSUSE:Factory
-
3
derived packages
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout home:seife:Factory/slurm && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
Fix-test-21.41.patch | 0000002334 2.28 KB | |
Fix-test-32.8.patch | 0000000740 740 Bytes | |
Fix-test7.2-to-find-libpmix-under-lib64-as-well.pa |
0000000948 948 Bytes | |
Keep-logs-of-skipped-test-when-running-test-cases- |
0000001113 1.09 KB | |
README_Testsuite.md | 0000004747 4.64 KB | |
Remove-rpath-from-build.patch | 0000002941 2.87 KB | |
_service | 0000000133 133 Bytes | |
pam_slurm-Initialize-arrays-and-pass-sizes.patch | 0000002851 2.78 KB | |
slurm-23.02.4.tar.bz2 | 0007259848 6.92 MB | |
slurm-rpmlintrc | 0000002979 2.91 KB | |
slurm.changes | 0000184784 180 KB | |
slurm.spec | 0000046719 45.6 KB | |
slurmctld.xml | 0000000256 256 Bytes | |
slurmd.xml | 0000000262 262 Bytes | |
slurmdbd.xml | 0000000255 255 Bytes | |
test_setup.tar.gz | 0000003165 3.09 KB |
Revision 93 (latest revision is 105)
Ana Guerrero (anag+factory)
accepted
request 1110259
from
Egbert Eich (eeich)
(revision 93)
- Updated to 23.02.4 with the following changes: * Bug Fixes: + Fix main scheduler loop not starting after a failover to backup controller. Avoid slurmctld segfault when specifying `AccountingStorageExternalHost` (bsc#1214983). + Fix sbatch return code when `--wait` is requested on a job array. + Fix collected `GPUUtilization` values for `acct_gather_profile` plugins. + Fix `slurmrestd` handling of job hold/release operations. + Fix step running indefinitely when slurmctld takes more than `MessageTimeout` to respond. Now, `slurmctld` will cancel the step when detected, preventing following steps from getting stuck waiting for resources to be released. + Fix regression to make `job_desc.min_cpus` accurate again in `job_submit` when requesting a job with `--ntasks-per-node`. + Fix handling of `ArrayTaskThrottle` in backfill. + Fix regression in 23.02.2 when checking gres state on `slurmctld` startup or reconfigure. Gres changes in the configuration were not updated on slurmctld startup. On startup or reconfigure, these messages were present in the log: `"error: Attempt to change gres/gpu Count`". + Fix potential double count of gres when dealing with limits. + Fix `slurmstepd` segfault when `ContainerPath` is not set in `oci.conf` + Fixed an issue where jobs requesting licenses were incorrectly rejected. + `scrontab` - Fix cutting off the final character of quoted variables. + `smail` - Fix issues where e-mails at job completion were not being sent. + `scontrol/slurmctld` - fix comma parsing when updating a reservation's nodes. + Fix `--gpu-bind=single binding` tasks to wrong gpus, leading to some gpus having more tasks than they should and other gpus being unused. + Fix regression in 23.02 that causes slurmstepd to crash when `srun` requests more than `TreeWidth` nodes in a step and uses the pmi2 or
Comments 1
Slurm 18.08 has been released: https://www.schedmd.com/downloads.php