Way to the science

slurm简明使用教程

1. 显示所有节点信息

sinfo / sinfo -Nl

2.取消123号job

scancel 123

3. 显示node1目前状态

scontrol show node node1

CPUAlloc=32 CPUEfctv=72 CPUTot=72 CPULoad=287.96
AvailableFeatures=(null)
ActiveFeatures=(null)
Gres=(null)
NodeAddr=node1 NodeHostName=node1 Version=22.05.2
OS=Linux 5.4.0-99-generic #112-Ubuntu SMP Thu Feb 3 13:50:55 UTC 2022
RealMemory=1 AllocMem=0 FreeMem=54971 Sockets=2 Boards=1
State=MIXED ThreadsPerCore=2 TmpDisk=0 Weight=1 Owner=N/A MCS_label=N/A
Partitions=xeon72_broadwell
BootTime=2022-10-02T10:05:46 SlurmdStartTime=2022-10-02T10:06:11
LastBusyTime=2022-10-05T09:27:54
CfgTRES=cpu=72,mem=1M,billing=72
AllocTRES=cpu=32
CapWatts=n/a
CurrentWatts=0 AveWatts=0
ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s

4. 显示node1目前资源分配情况

scontrol show node node1 | grep Alloc

grep CPUAlloc=32 CPUEfctv=72 CPUTot=72 CPULoad=288.48
RealMemory=1 AllocMem=0 FreeMem=54976 Sockets=2 Boards=1
AllocTRES=cpu=32

5. 使用srun命令直接传自定义命令到指定节点:
例:查看4号节点/tmp下目录
srun nodelist=node4 ls -ahl /tmp/lym@1234

Leave a comment

Your email address will not be published. Required fields are marked *