Wilson Cluster SLURM Status

Refresh rate: 5 minutes. Last updated: 04-08-2020 1:36 PM

SLURM Job Status

JobID | Job Name | User | Account | # of Nodes | Partition | QoS | List of Nodes -or- (Reason for job queued) | GPU Resources | Start Time | Requested Walltime (DAY-HH:MM:SS) | Time Remaining (DAY-HH:MM:SS) | Job State
134453 | bash | mosnunes | lartpcnnet | 1 | gpu | normal | (ReqNodeNotAvail, UnavailableNodes:gpu3,tev[0103,0210,0503]) | gpu:p100nvlink:1 | N/A | 8:00:00 | 8:00:00 | PENDING
134434 | job_124365 | shoeche | sherpa | 1 | amd32 | normal | tev0403 | (null) | 2020-04-07T21:30:31 | 18:00:00 | 1:54:30 | RUNNING
134437 | job_124635 | shoeche | sherpa | 1 | amd32 | normal | tev0406 | (null) | 2020-04-07T21:30:31 | 18:00:00 | 1:54:30 | RUNNING
134407 | DARN-Net | jbonilla | minervag | 1 | gpu | normal | gpu4 | gpu:1 | 2020-04-07T14:10:46 | 1-00:00:00 | 34:45 | RUNNING
134420 | DARN-Net | jbonilla | minervag | 1 | gpu | normal | gpu4 | gpu:1 | 2020-04-07T17:26:49 | 1-00:00:00 | 3:50:48 | RUNNING
134455 | ignasso_record_active_gpus | ignasso | nnet | 1 | gpu | normal | gpu1 | gpu:1 | 2020-04-08T09:54:33 | 1-00:00:00 | 20:18:32 | RUNNING
134459 | cauchy | luisals | deepskies | 1 | gpu | normal | gpu4 | gpu:1 | 2020-04-08T11:53:16 | 1-00:00:00 | 22:17:15 | RUNNING
134444 | ewpht | yikwang | ewpht-s | 6 | amd32 | normal | tev[0504-0509] | (null) | 2020-04-07T22:54:48 | 1-00:00:00 | 9:18:47 | RUNNING
134443 | ewpht | yikwang | ewpht-s | 6 | amd32 | normal | tev[0307-0312] | (null) | 2020-04-07T22:53:49 | 1-00:00:00 | 9:17:48 | RUNNING
134456 | bash | ddoyle | nnet | 1 | amd32 | normal | tev0401 | (null) | 2020-04-08T10:26:00 | 8:00:00 | 4:49:59 | RUNNING
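
The Time Remaining column above appears to be the start time plus the requested walltime, minus the time the page was generated. The following is a minimal sketch of that arithmetic, not the code that produces this page. The helper name parse_walltime is hypothetical, and it handles only the forms that occur in this table (MM:SS, HH:MM:SS, and DAY-HH:MM:SS), not every time format SLURM accepts.

```python
from datetime import datetime, timedelta


def parse_walltime(text):
    """Parse 'MM:SS', 'HH:MM:SS' or 'DAY-HH:MM:SS' into a timedelta.

    Hypothetical helper: covers only the forms seen in the job table.
    """
    days = 0
    if "-" in text:
        day_part, text = text.split("-", 1)
        days = int(day_part)
    fields = [int(f) for f in text.split(":")]
    # Pad on the left so that 'MM:SS' becomes 0 hours, MM minutes, SS seconds.
    while len(fields) < 3:
        fields.insert(0, 0)
    hours, minutes, seconds = fields
    return timedelta(days=days, hours=hours, minutes=minutes, seconds=seconds)


# Job 134434: started 2020-04-07T21:30:31 with an 18:00:00 walltime.
start = datetime.fromisoformat("2020-04-07T21:30:31")
limit = parse_walltime("18:00:00")
# The page timestamp has minute resolution only (04-08-2020 1:36 PM).
now = datetime(2020, 4, 8, 13, 36)
remaining = start + limit - now
print(remaining)
```

Because the page timestamp is rounded to the minute, the result can differ from the Time Remaining column by up to a minute.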

SLURM Usage Plots

SLURM Usage plots (** Requires SERVICES account **)

MRTG Usage plots

SLURM Resource Status

Partition | Availability | Node List | State | Error | # of Nodes | Cores per Node | Threads per Core | Generic Resource:Type:Number of
gpu | up | gpu3 | drained | SC for gpu issues | 1 | 56 | 2 | gpu:p100nvlink:2
gpu | up | gpu1 | mixed | none | 1 | 12 | 1 | gpu:k20:2
gpu | up | gpu4 | allocated | none | 1 | 16 | 1 | gpu:p100:8
gpu | up | gpu2 | idle | none | 1 | 16 | 1 | gpu:k40:1
ppc64 | up | ibmpower9 | idle | none | 1 | 128 | 1 | gpu:v100nvlinkppc64:4
phi | up | phi[1-4] | idle | none | 4 | 12 | 1 | (null)
intel12 | up | tev0103 | down* | DEAD keeps powering off | 1 | 12 | 1 | (null)
intel12 | up | tev[0101-0102,0104-0113,0201-0209,0211,0213] | idle | none | 23 | 12 | 1 | (null)
amd32 | up | tev[0307-0312,0401,0403,0406,0504-0509] | allocated | none | 15 | 32 | 1 | (null)
amd32 | up | tev[0301-0306,0402,0404-0405,0407-0412,0501-0502,0510] | idle | none | 18 | 32 | 1 | (null)
amd32_g4perf | up | tev[0312,0401,0403,0406,0504-0509] | allocated | none | 10 | 32 | 1 | (null)
amd32_g4perf | up | tev[0402,0407-0412,0501-0502,0510] | idle | none | 10 | 32 | 1 | (null)
amd32_g4val | up | tev[0307-0312,0401,0403,0406,0504-0509] | allocated | none | 15 | 32 | 1 | (null)
amd32_g4val | up | tev[0301-0306,0402,0404-0405,0407-0412,0501-0502,0510] | idle | none | 18 | 32 | 1 | (null)
amd32_g4val_slow | up | tev[0307-0311] | allocated | none | 5 | 32 | 1 | (null)
amd32_g4val_slow | up | tev[0301-0306,0404-0405] | idle | none | 8 | 32 | 1 | (null)
knl | up | knl1 | idle | none | 1 | 256 | 4 | (null)
tevnfsg4 | up | tevnfsg4 | idle | none | 1 | 8 | 1 | (null)
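
The node lists above use a compact bracket notation in which tev[0307-0312,0401] stands for tev0307 through tev0312 plus tev0401. The sketch below expands such an expression so the result can be checked against the "# of Nodes" column. The function name expand_nodelist is hypothetical, and it assumes a single prefix followed by at most one bracket group, which covers every entry on this page but not every expression SLURM accepts. On a machine with SLURM installed, `scontrol show hostnames` performs the same expansion.

```python
import re


def expand_nodelist(expr):
    """Expand 'prefix[a-b,c,...]' into a list of individual node names.

    Hypothetical helper: assumes one prefix and at most one bracket group.
    """
    match = re.fullmatch(r"([^\[\]]+)\[([^\[\]]+)\]", expr)
    if match is None:
        return [expr]  # plain name such as 'gpu4'
    prefix, body = match.groups()
    names = []
    for part in body.split(","):
        lo, _, hi = part.partition("-")
        hi = hi or lo  # a single number is a range of length one
        width = len(lo)  # keep the zero padding, e.g. '0307'
        for n in range(int(lo), int(hi) + 1):
            names.append(f"{prefix}{n:0{width}d}")
    return names


allocated = expand_nodelist("tev[0307-0312,0401,0403,0406,0504-0509]")
print(len(allocated))  # matches the 15 nodes listed for the allocated amd32 row
```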