- 13 Oct, 2017 5 commits
-
-
John-Paul Robinson authored
-
John-Paul Robinson authored
Split the cluster into gpu and non-gpu nodes with two partitions. Add a QOS for each partition, defined with:
sudo sacctmgr add qos maxcpu set maxtresperuser=cpu=4
sudo sacctmgr add qos maxgpucpu set maxtresperuser=cpu=2
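One way the two partitions might be tied to those QOS names in slurm.conf is sketched below. The node names, GPU count and partition names are hypothetical, and the snippet only writes the fragment to a scratch file for review rather than touching a live config.

```shell
# Write an example partition fragment to a scratch file.
# Assumptions: node names c[1-4] and gpu[1-2], two GPUs per GPU node,
# and the partition names "cpu" and "gpu" are made up for illustration.
frag="$(mktemp)"
cat > "$frag" <<'EOF'
# Assumed prerequisite: QOS limits are only enforced when accounting
# enforcement is enabled.
AccountingStorageEnforce=limits,qos
GresTypes=gpu
NodeName=c[1-4] State=UNKNOWN
NodeName=gpu[1-2] Gres=gpu:2 State=UNKNOWN
PartitionName=cpu Nodes=c[1-4] Default=YES QOS=maxcpu State=UP
PartitionName=gpu Nodes=gpu[1-2] QOS=maxgpucpu State=UP
EOF
cat "$frag"
```

Attaching the QOS at the partition level means the per-user CPU cap applies to every job submitted to that partition without users having to request the QOS themselves.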
-
John-Paul Robinson authored
-
John-Paul Robinson authored
-
John-Paul Robinson authored
Update the gpu config to have a type flag and cpu affinity. This feature doesn't work unless select/cons_res is active.
-
- 12 Oct, 2017 7 commits
-
-
John-Paul Robinson authored
-
John-Paul Robinson authored
The Type parameter for GPUs in gres.conf and on the NodeName lines in slurm.conf doesn't work unless select/cons_res is active. In other words, it doesn't work under select/linear, where nodes are allocated as whole units. The --gres=gpu flag works with select/linear, but gpu allocation fails completely for type-based requests such as --gres=gpu:p100:1.
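A minimal sketch of what a typed GPU definition with select/cons_res could look like follows. Only the p100 type name comes from the message above; the node name, device paths, CPU range and the CR_Core setting are assumptions, and the fragments are written to scratch files rather than to a live config.

```shell
# Write example fragments to scratch files; nothing live is modified.
# Assumptions: node name gpu1, device paths, CPU range and CR_Core are
# illustrative; only the p100 type name comes from the commit message.
slurm_frag="$(mktemp)"
gres_frag="$(mktemp)"

# slurm.conf side: consumable-resource selection plus a typed Gres entry.
cat > "$slurm_frag" <<'EOF'
SelectType=select/cons_res
SelectTypeParameters=CR_Core
GresTypes=gpu
NodeName=gpu1 Gres=gpu:p100:2 State=UNKNOWN
EOF

# gres.conf side: the matching Type plus a CPU affinity for the devices.
cat > "$gres_frag" <<'EOF'
# CPUs= sets the GPU-to-CPU affinity; assumed: newer Slurm releases
# document this option as Cores=.
NodeName=gpu1 Name=gpu Type=p100 File=/dev/nvidia[0-1] CPUs=0-3
EOF

cat "$slurm_frag" "$gres_frag"
```

With a config along these lines, a typed request such as --gres=gpu:p100:1 is the form that the message reports as failing under select/linear.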
-
John-Paul Robinson authored
-
John-Paul Robinson authored
slurmctld logs a warning that pgid is a slow method and recommends linuxproc.
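The warning presumably refers to the ProctrackType setting in slurm.conf, where proctrack/pgid and proctrack/linuxproc are two of the available plugins. A small helper to spot the pgid setting in a config file might look like this; the function name is hypothetical.

```shell
# Print the ProctrackType line(s) from a slurm.conf-style file and return
# non-zero if the pgid plugin is selected. $1 = path to the config file.
check_proctrack() {
    conf="$1"
    # Show the current setting, if any (no match is not an error here).
    grep -Ei '^[[:space:]]*ProctrackType[[:space:]]*=' "$conf" || true
    if grep -Eiq '^[[:space:]]*ProctrackType[[:space:]]*=[[:space:]]*proctrack/pgid' "$conf"; then
        echo "consider ProctrackType=proctrack/linuxproc" >&2
        return 1
    fi
    return 0
}
```

It could be run as `check_proctrack /etc/slurm/slurm.conf`, assuming that is where the config lives on the cluster.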
-
John-Paul Robinson authored
The dedicated state dir needs to be created and owned exclusively by slurm so that slurmctld doesn't complain about a security error at start.
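Creating such a directory could be sketched as below. The function name is hypothetical, and the real path should be whatever StateSaveLocation points to in slurm.conf.

```shell
# Create a state directory accessible only by its owner.
# $1 = directory path, $2 = owning user (slurm on a real cluster).
make_state_dir() {
    dir="$1"
    owner="$2"
    mkdir -p "$dir" &&
    chown "$owner" "$dir" &&
    chmod 0700 "$dir"
}
```

On the controller this would be run as root, for example `make_state_dir /var/spool/slurm/ctld slurm`, where the path is only an assumed example.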
-
John-Paul Robinson authored
-
John-Paul Robinson authored
-
- 11 Oct, 2017 5 commits
-
-
John-Paul Robinson authored
-
John-Paul Robinson authored
-
John-Paul Robinson authored
Note: gpu type and cpu affinities don't work with this basic slurm config. If set, slurm will report that the resources are unavailable.
-
John-Paul Robinson authored
This is the default slurm.conf from the OpenHPC repo slurm packages with localization of cluster name and nodename list.
-
John-Paul Robinson authored
-