Wilson Cluster SLURM Status

Refresh Rate=5 mins. Last updated on: 08-08-2020 4:10 AM

SLURM Job Status

JobIDJob NameUserAccount# of NodesPartitionQoSList of Nodes -or-
(Reason for job queued)
GPU ResourcesStart TimeRequested Walltime
(DAY-HH:MM:SS)
Time Remaining
(DAY-HH:MM:SS)
Job State
147387bashayankeledune1gpunormalgpu4gpu:12020-08-07T20:39:171-00:00:0016:29:16RUNNING
147378pd_sp_cnnbjargowsdune1gpunormalgpu4gpu:12020-08-07T16:51:221-00:00:0012:41:21RUNNING
147390VFomorenopminervag1gpunormalgpu4gpu:12020-08-07T23:44:241-00:00:0019:34:23RUNNING
147386ls_1250_un-sing_enyxiaonnet1gpunormalgpu4gpu:12020-08-07T20:21:531-00:00:0016:11:52RUNNING
147388ls_muon_catcher_vertexyxiaonnet1gpunormalgpu4gpu:12020-08-07T20:52:451-00:00:0016:42:44RUNNING
147376bashtneumannqcdloop22intel12normaltev[0101-0102,0104-0113,0201-0209,0211](null)2020-08-07T16:19:291-00:00:0012:09:28RUNNING

SLURM Usage Plots

SLURM Usage plots (** Requires SERVICES account **)

MRTG Usage plots

SLURM Resource Status

PartitionAvailabilityNode ListStateError# of NodesCores per NodeThreads per CoreGeneric Resource:Type:Numberof
gpuupgpu4mixednone1161gpu:p100:8
gpuupgpu1idlenone1121gpu:k20:2
gpuupgpu2idlenone1161gpu:k40:1
gpuupgpu3idlenone1562gpu:p100nvlink:2
ppc64upibmpower9idlenone11281gpu:v100nvlinkppc64:4
phiupphi[1-4]idlenone4121(null)
intel12uptev[0101-0102,0104-0113,0201-0209,0211]allocatednone22121(null)
intel12uptev0213idlenone1121(null)
amd32uptev0304drainedAmitoj: something wrong high load1321(null)
amd32uptev[0301-0303,0305-0312,0401-0412,0501-0502,0504-0510]idlenone32321(null)
amd32_g4perfuptev[0312,0401-0403,0406-0412,0501-0502,0504-0510]idlenone20321(null)
amd32_g4valuptev0304drainedAmitoj: something wrong high load1321(null)
amd32_g4valuptev[0301-0303,0305-0312,0401-0412,0501-0502,0504-0510]idlenone32321(null)
amd32_g4val_slowuptev0304drainedAmitoj: something wrong high load1321(null)
amd32_g4val_slowuptev[0301-0303,0305-0311,0404-0405]idlenone12321(null)
knlupknl1idlenone12564(null)
tevnfsg4uptevnfsg4idlenone181(null)