Wilson Cluster SLURM Status

Refresh Rate=5 mins. Last updated on: 05-28-2020 1:57 PM

SLURM Job Status

| JobID | Job Name | User | Account | # of Nodes | Partition | QoS | List of Nodes -or- (Reason for job queued) | GPU Resources | Start Time | Requested Walltime (DAY-HH:MM:SS) | Time Remaining (DAY-HH:MM:SS) | Job State |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 140890 | bash | paterno | numint | 1 | ppc64 | normal | ibmpower9 | gpu:1 | 2020-05-28T13:08:24 | 8:00:00 | 7:11:23 | RUNNING |
| 140885 | DAResNet | jbonilla | minervag | 1 | gpu | normal | gpu4 | gpu:1 | 2020-05-28T10:23:09 | 1-00:00:00 | 20:26:08 | RUNNING |
| 140521_1 | cmsExpMT_amd_e-_50.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0504 | (null) | 2020-05-28T08:30:27 | 1-00:00:00 | 18:33:26 | RUNNING |
| 140516_3 | run_ossusertime.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0312 | (null) | 2020-05-28T08:19:43 | 1-00:00:00 | 18:22:42 | RUNNING |
| 140519_1 | cmsExpMT_amd_pi-_50.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0411 | (null) | 2020-05-28T08:21:01 | 1-00:00:00 | 18:24:00 | RUNNING |
| 140516_2 | run_ossusertime.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0403 | (null) | 2020-05-28T07:53:53 | 1-00:00:00 | 17:56:52 | RUNNING |
| 140516_1 | run_ossusertime.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0505 | (null) | 2020-05-28T07:49:10 | 1-00:00:00 | 17:52:09 | RUNNING |
| 140513_3 | run_ossusertime.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0412 | (null) | 2020-05-28T07:39:00 | 1-00:00:00 | 17:41:59 | RUNNING |
| 140513_2 | run_ossusertime.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0507 | (null) | 2020-05-28T07:33:54 | 1-00:00:00 | 17:36:53 | RUNNING |
| 140513_1 | run_ossusertime.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0409 | (null) | 2020-05-28T07:33:35 | 1-00:00:00 | 17:36:34 | RUNNING |
| 140512_2 | run_osspcsamp.sh | g4p | g4p | 1 | amd32_g4perf | g4perf | tev0401 | (null) | 2020-05-28T07:30:03 | 1-00:00:00 | 17:33:02 | RUNNING |
| 140882 | bash | melkins | nnet | 1 | gpu | normal | gpu4 | gpu:1 | 2020-05-28T09:00:18 | 8:00:00 | 3:03:17 | RUNNING |
| 140644 | ewpht | yikwang | ewpht-s | 6 | amd32 | normal | tev[0307-0311,0404] | (null) | 2020-05-27T23:40:40 | 1-00:00:00 | 9:43:39 | RUNNING |
| 140892 | run_lo.sh | isaacson | genmc | 12 | amd32 | normal | tev[0301-0305,0407-0408,0410,0501-0502,0508,0510] | (null) | 2020-05-28T13:49:16 | 1-00:00:00 | 23:52:15 | RUNNING |
| 140891 | analyze_batch.sh | nblinov | tbfacllp | 1 | amd32 | normal | tev0506 | (null) | 2020-05-28T13:30:05 | 1-00:00:00 | 23:33:04 | RUNNING |
| 140887 | singularity | nblinov | tbfacllp | 1 | amd32 | normal | tev0405 | (null) | 2020-05-28T11:38:49 | 1-00:00:00 | 21:41:48 | RUNNING |
| 140888 | bash | tneumann | qcdloop | 20 | intel12 | normal | tev[0101,0104-0113,0201-0209] | (null) | 2020-05-28T11:58:09 | 1-00:00:00 | 22:01:08 | RUNNING |
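
The Time Remaining column follows from the Start Time and Requested Walltime columns. The sketch below shows that arithmetic for the two duration formats that appear in this table (`H:MM:SS` and `D-HH:MM:SS`). The function names are hypothetical, other SLURM duration formats are not handled, and the snapshot time of 13:57:01 is an assumption inferred from the rows (the page itself only states 1:57 PM).

```python
from datetime import datetime, timedelta


def parse_slurm_duration(text):
    """Parse 'D-HH:MM:SS' or 'H:MM:SS' (the formats seen in the table) into a timedelta."""
    days = 0
    if "-" in text:
        day_part, text = text.split("-", 1)
        days = int(day_part)
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(days=days, hours=hours, minutes=minutes, seconds=seconds)


def format_slurm_duration(delta):
    """Format a timedelta the way the table prints it: days are shown only when non-zero."""
    total = int(delta.total_seconds())
    days, rem = divmod(total, 86400)
    hours, rem = divmod(rem, 3600)
    minutes, seconds = divmod(rem, 60)
    if days:
        return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"
    return f"{hours}:{minutes:02d}:{seconds:02d}"


def time_remaining(start, walltime, now):
    """Return walltime left at 'now', clamped at zero for jobs past their limit."""
    end = datetime.fromisoformat(start) + parse_slurm_duration(walltime)
    return max(end - datetime.fromisoformat(now), timedelta(0))


# Assumed snapshot time (inferred, not stated to the second on the page).
SNAPSHOT = "2020-05-28T13:57:01"

# Job 140890: started 13:08:24 with an 8-hour limit.
print(format_slurm_duration(time_remaining("2020-05-28T13:08:24", "8:00:00", SNAPSHOT)))
```

With the assumed snapshot time, the same calculation for job 140885 (start 10:23:09, limit `1-00:00:00`) gives 20:26:08, matching its row.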

SLURM Usage Plots

SLURM Usage plots (** Requires SERVICES account **)

MRTG Usage plots

SLURM Resource Status

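| Partition | Availability | Node List | State | Error | # of Nodes | Cores per Node | Threads per Core | Generic Resource:Type:Number of |
|---|---|---|---|---|---|---|---|---|
| gpu | up | gpu3 | drained* | SC for gpu issues | 1 | 56 | 2 | gpu:p100nvlink:2 |
| gpu | up | gpu4 | mixed | none | 1 | 16 | 1 | gpu:p100:8 |
| gpu | up | gpu1 | idle | none | 1 | 12 | 1 | gpu:k20:2 |
| gpu | up | gpu2 | idle | none | 1 | 16 | 1 | gpu:k40:1 |
| ppc64 | up | ibmpower9 | mixed | none | 1 | 128 | 1 | gpu:v100nvlinkppc64:4 |
| phi | up | phi[1-4] | idle | none | 4 | 12 | 1 | (null) |
| intel12 | up | tev0103 | down* | DEAD keeps powering off | 1 | 12 | 1 | (null) |
| intel12 | up | tev[0101,0104-0113,0201-0209] | allocated | none | 20 | 12 | 1 | (null) |
| intel12 | up | tev[0102,0211,0213] | idle | none | 3 | 12 | 1 | (null) |
| amd32 | up | tev[0301-0305,0307-0312,0401,0403-0405,0407-0412,0501-0502,0504-0508,0510] | allocated | none | 29 | 32 | 1 | (null) |
| amd32 | up | tev[0306,0402,0406,0509] | idle | none | 4 | 32 | 1 | (null) |
| amd32_g4perf | up | tev[0312,0401,0403,0407-0412,0501-0502,0504-0508,0510] | allocated | none | 17 | 32 | 1 | (null) |
| amd32_g4perf | up | tev[0402,0406,0509] | idle | none | 3 | 32 | 1 | (null) |
| amd32_g4val | up | tev[0301-0305,0307-0312,0401,0403-0405,0407-0412,0501-0502,0504-0508,0510] | allocated | none | 29 | 32 | 1 | (null) |
| amd32_g4val | up | tev[0306,0402,0406,0509] | idle | none | 4 | 32 | 1 | (null) |
| amd32_g4val_slow | up | tev[0301-0305,0307-0311,0404-0405] | allocated | none | 12 | 32 | 1 | (null) |
| amd32_g4val_slow | up | tev0306 | idle | none | 1 | 32 | 1 | (null) |
| knl | up | knl1 | idle | none | 1 | 256 | 4 | (null) |
| tevnfsg4 | up | tevnfsg4 | idle | none | 1 | 8 | 1 | (null) |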
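
The node lists above use SLURM's compressed hostlist notation, for example `tev[0101,0104-0113,0201-0209]`. As an illustration, the sketch below expands that notation so a node list can be checked against the # of Nodes column. The function `expand_hostlist` is hypothetical and handles only the single-bracket form that appears on this page. On a system with SLURM installed, `scontrol show hostnames` performs the same expansion.

```python
import re


def expand_hostlist(expr):
    """Expand a single-bracket hostlist such as 'tev[0101,0104-0106]' into host names."""
    match = re.fullmatch(r"([^\[\]]+)\[([^\[\]]+)\]", expr)
    if match is None:
        return [expr]  # plain host name with no brackets, e.g. 'gpu4'
    prefix, body = match.groups()
    hosts = []
    for item in body.split(","):
        if "-" in item:
            low, high = item.split("-")
            width = len(low)  # keep the zero padding used in the range
            hosts.extend(
                f"{prefix}{number:0{width}d}"
                for number in range(int(low), int(high) + 1)
            )
        else:
            hosts.append(prefix + item)
    return hosts


# The allocated intel12 node list should expand to the 20 nodes reported in the table.
nodes = expand_hostlist("tev[0101,0104-0113,0201-0209]")
print(len(nodes), nodes[:3])
```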