This shows you the differences between two versions of the page.
abstract:farber:status [2017-10-23 16:00] – created sraskar | abstract:farber:status [2018-07-09 17:35] – [Node Status Notification] anita |
---|
==== Node status notification for Farber ==== | ==== Node Status Notification for Caviness ==== |
| |
An opt-in node status notification service is available for Farber users. This service sends an email notification when any nodes in your workgroup transition between two of the following states: | An opt-in node status notification service is available for Farber users. This service sends an email notification when any nodes in your workgroup transition between two of the following states: |
| |
To opt in to the node status notification service for your workgroup(s), send an email to consult@udel.edu with subject="Node notification opt-in Farber" and use the following as the first line of the message body: | To opt in to the node status notification service for your workgroup(s), send an email to consult@udel.edu with subject="Node notification opt-in Farber" and use the following as the first line of the message body: |
Type=Cluster | Type=Cluster |
| |
| ==== Live Resources ==== |
| [[http://farber.hpc.udel.edu|farber.hpc.udel.edu]] has live resources: system status, job stats, system alerts. |
| |
| ==== Machine Information ==== |
| [[http://www.hpc.udel.edu/systems/farber|UD IT HPC]] has Farber machine information: attributes including a database of node information, milestones, offline nodes and nodes disabled for maintenance. |
| |
| ==== Ganglia Cluster Monitoring ==== |
| [[http://farber.hpc.udel.edu/ganglia/|Cluster monitoring]] for Farber uses [[http://ganglia.sourceforge.net/|Ganglia]] to monitor its hardware components. |
| |
| ==== System Alerts ==== |
| [[https://www.hpc.udel.edu/mantis/default/my_view_page.php|System alerts]]: Check here first if you are experiencing problems with the cluster. |
| |
| |
| ==== Job Statistics ==== |
| [[http://farber.hpc.udel.edu/jobstats/|Job statistics]]: Check here for the total number of jobs that ended on each day over a selected range (week, 2 weeks, month, 6 months, year), overlaid with the number of those jobs that the job scheduler classified as "failed."
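
The opt-in email described under the node status notification heading can be sketched in Python with the standard library's ''email.message.EmailMessage''. This is only an illustration under stated assumptions: the sender address and the helper name ''build_opt_in_message'' are hypothetical, this page gives only the first line of the message body (any further lines are not shown here and are not guessed at), and the message is built but not sent.

```python
from email.message import EmailMessage


def build_opt_in_message(sender: str) -> EmailMessage:
    """Build (but do not send) the opt-in request described on this page.

    `sender` is a placeholder for your own address (assumption).
    """
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = "consult@udel.edu"
    # Subject text as given on this page.
    msg["Subject"] = "Node notification opt-in Farber"
    # Only the first body line is documented here; add any other
    # lines the service requires after it.
    msg.set_content("Type=Cluster\n")
    return msg


# Hypothetical sender address, for illustration only.
message = build_opt_in_message("user@example.edu")
print(message["Subject"])
print(message.get_content())
```

Sending the message is left to whatever mail client or SMTP setup you normally use; pasting the subject and first body line into a regular email works just as well.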