Wilson Cluster SLURM Status

Refresh Rate=5 mins. Last updated on: 07-09-2020 10:51 PM

SLURM Job Status

JobIDJob NameUserAccount# of NodesPartitionQoSList of Nodes -or-
(Reason for job queued)
GPU ResourcesStart TimeRequested Walltime
(DAY-HH:MM:SS)
Time Remaining
(DAY-HH:MM:SS)
Job State
145457vertex_cnnbjargowsnnet1gpunormalgpu4gpu:12020-07-09T12:00:001-00:00:0013:08:59RUNNING
145524vertex_cnnbjargowsnnet1gpunormalgpu4gpu:12020-07-09T15:52:001-00:00:0017:00:59RUNNING
145536wscnnmelkinsnnet1gpunormalgpu4gpu:12020-07-09T22:12:561-00:00:0023:21:55RUNNING
145535bashmelkinsnnet1gpunormalgpu1gpu:12020-07-09T22:06:558:00:007:15:54RUNNING
145466ewphtyikwangewpht-s8amd32normaltev[0403-0410](null)2020-07-09T02:32:591-00:00:003:41:58RUNNING
145465ewphtyikwangewpht-s8amd32normaltev[0411-0412,0501-0502,0504-0507](null)2020-07-09T02:08:491-00:00:003:17:48RUNNING
145528cache_train_wrap.shtorbunovnnet1gpunormalgpu4gpu:12020-07-09T17:44:151-00:00:0018:53:14RUNNING
145526cache_train_wrap.shtorbunovnnet1gpunormalgpu4gpu:12020-07-09T17:43:261-00:00:0018:52:25RUNNING
145531cache_train_wrap.shtorbunovnnet1gpunormalgpu4gpu:12020-07-09T19:21:051-00:00:0020:30:04RUNNING
145530cache_train_wrap.shtorbunovnnet1gpunormalgpu4gpu:12020-07-09T19:20:531-00:00:0020:29:52RUNNING
145496lifetracbcatheyiota4amd32normaltev[0307-0310](null)2020-07-09T10:00:471-00:00:0011:09:46RUNNING
145495lifetracbcatheyiota4amd32normaltev[0301-0304](null)2020-07-09T10:00:371-00:00:0011:09:36RUNNING

SLURM Usage Plots

SLURM Usage plots (** Requires SERVICES account **)

MRTG Usage plots

SLURM Resource Status

PartitionAvailabilityNode ListStateError# of NodesCores per NodeThreads per CoreGeneric Resource:Type:Numberof
gpuupgpu1mixednone1121gpu:k20:2
gpuupgpu4mixednone1161gpu:p100:8
gpuupgpu2idlenone1161gpu:k40:1
gpuupgpu3downNode unexpectedly rebooted1562gpu:p100nvlink:2
ppc64upibmpower9idlenone11281gpu:v100nvlinkppc64:4
phiupphi[1-4]idlenone4121(null)
intel12uptev[0101-0102,0104-0113,0201-0209,0211,0213]idlenone23121(null)
amd32uptev[0301-0304,0307-0310,0403-0412,0501-0502,0504-0507]allocatednone24321(null)
amd32uptev[0305-0306,0311-0312,0401-0402,0508-0510]idlenone9321(null)
amd32_g4perfuptev[0403,0406-0412,0501-0502,0504-0507]allocatednone14321(null)
amd32_g4perfuptev[0312,0401-0402,0508-0510]idlenone6321(null)
amd32_g4valuptev[0301-0304,0307-0310,0403-0412,0501-0502,0504-0507]allocatednone24321(null)
amd32_g4valuptev[0305-0306,0311-0312,0401-0402,0508-0510]idlenone9321(null)
amd32_g4val_slowuptev[0301-0304,0307-0310,0404-0405]allocatednone10321(null)
amd32_g4val_slowuptev[0305-0306,0311]idlenone3321(null)
knlupknl1idlenone12564(null)
tevnfsg4uptevnfsg4idlenone181(null)