abstract:mills:status

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

abstract:mills:status [2017-10-24 09:56] – created sraskarabstract:mills:status [2018-05-21 19:57] (current) sraskar
Line 11: Line 11:
 To opt-in the node status notification service for your workgroup(s), send an e-mail to consult@udel.edu with subject="Node notification opt-in Mills" and make the first line of the message body be To opt-in the node status notification service for your workgroup(s), send an e-mail to consult@udel.edu with subject="Node notification opt-in Mills" and make the first line of the message body be
       Type=Cluster       Type=Cluster
 +     
 +==== Live Resources ====
 +[[http://mills.hpc.udel.edu|mills.hpc.udel.edu]] has live resources: system status, job stats, system alerts.
 +
 +==== Machine Information ====
 +[[http://www.hpc.udel.edu/systems/mills|UD IT HPC]] has Mills machine information: attributes including a database of node information, milestones, offline nodes and nodes disabled for maintenance.
 +
 +==== Ganglia Cluster Monitoring ====
 +[[http://mills.hpc.udel.edu/ganglia/|Cluster monitoring]] for Mills uses [[http://ganglia.sourceforge.net/|Ganglia]] to monitor its hardware components.
 +
 +==== System Alerts ====
 +[[https://www.hpc.udel.edu/mantis/default/project_page.php?project_id=4|System Alerts]] for Mills is an opt-in service notifying you about status changes on any of your workgroup's nodes. 
 +
 +
 +==== Job Statistics ====
 +[[http://mills.hpc.udel.edu/jobstats/|Job statistics]]: Check here for the total number of jobs that ended on each day over a range (week, 2 weeks, month, 6 months, year) with an overlay of the total number of jobs which the job scheduler classified as "failed."
  • abstract/mills/status.1508853412.txt.gz
  • Last modified: 2017-10-24 09:56
  • by sraskar