We have five file systems to monitor for disk space. One for the /home directory because this does impact Cloverleaf processing in some rare cases. But most importantly, the /hci file system, and our three application file systems:
/prod – this is where our sites are; the /hci integrator directory has links to here
/prodarch – this holds backups of Smat DB files going back 6 weeks
/prodfiles – this holds our batch infrastructure
Our Alerts are named for the file system – one of the alerts is named “disk hci75”
The source is the filesystem – ex: “/hci”
Source Count is “all” ; Duration we have set to “once”.
Comparing is >= 75 which means that when the disk space percentage used is 75 or greater, the alert will fire.
We have two alerts per file system, so for example we have disk hci75 and disk hci90. So we would get emails for both alerts, but when we see the email for the higher number, we can gauge how fast it went from 75% to 90% and tell how serious the situation is.