I will try the dirparse UPOC. We are on 5.8.4, and I’ll let you know how an empty directory is returned.
As I add new monitors, like checking for messages in the error or recovery DB, or checking for successful cron jobs, I make sure to append one line to a daily monitor summary file that is emailed every morning, which will list each monitor that ran, a return code and an an error description. I only want emails for individual monitors to go out if there is an error, otherwise, everybody only gets the one summary email. To do this, my monitors need to open the summary file for append and write a record to it. So this dirparse UPOC needs to evaluate the directory for a specified list of files and report errors on each one (not found), send an email if specified with enough information to identify the problem file and append a line of data to the daily report summary file. Hopefully thats not too much activity in a dirparse UPOC.
This is an example of the daily report summary email. The email font is fixed when it is delivered.
Daily Batch Monitor Report – 02/28/2012
time interface check for report file RC status
—
07:15:02 audit_purge cron execution /home/hci/logs/purge_audit.log 0 ok – cron file has been updated within the last 30 minutes
12:15:01 smatr_audit_12 cron execution /home/hci/logs/smatr_audit_1207.log 0 ok – cron file has been updated within the last 30 minutes
17:15:03 smatr_audit_17 cron execution /home/hci/logs/smatr_audit_1707.log 0 ok – cron file has been updated within the last 30 minutes
01:00:03 smatr_audit_00 cron execution /home/hci/logs/smatr_audit_0050.log 0 ok – cron file has been updated within the last 30 minutes
01:40:04 clean_smat cron execution /prod/crons/smatArchive.log 0 ok – cron file has been updated within the last 500 minutes
02:15:04 uds_ftp cron execution /tmp/udsftp.adt.chbthprd.ftp 0 ok – cron file has been updated within the last 500 minutes
02:40:00 clean_local_smat cron execution /prod/crons/localsmatArchive.log 0 ok – cron file has been updated within the last 500 minutes
05:50:04 smatr_audit_05 cron execution /home/hci/logs/smatr_audit_0540.log 0 ok – cron file has been updated within the last 30 minutes
06:30:02 Error DB messages queued /tmp/hci_rpt_chonlprd_err_db 8 error – 18 msgs found in Err DB for chonlprd – see email
06:30:03 Error DB messages queued /tmp/hci_rpt_cmonlprd_err_db 0 ok – no messages found in Error DB for cmonlprd
06:30:03 Error DB messages queued /tmp/hci_rpt_chbthprd_err_db 0 ok – no messages found in Error DB for chbthprd
06:30:04 Recov DB messages queued /tmp/hci_rpt_chbthprd_rec_db 0 ok – no messages found in Recovery DB for chbthprd
06:40:00 save_RHIO_smat cron execution /home/hci/logs/saveSmatArchive.log 0 ok – cron file has been updated within the last 500 minutes
Peter Heggie
PeterHeggie@crouse.org