alerts still not alerting
Daniel J McDonald
dan.mcdonald at austinenergy.com
Sat Mar 19 17:33:09 CET 2005
I'm still flummoxed by hobbit-alerts. I'm certain I broke something,
because I am not getting any alerts from the box.
The only logs in /var/log/hobbit/page.log are
2005-03-11 07:49:30 Tried to down BOARDBUSY: Invalid argument
2005-03-14 17:24:21 Tried to down BOARDBUSY: Invalid argument
I see a couple of those in the hobbitlaunch.log file as well, I also see
the following error:
2005-03-19 10:14:21 Task bbdisplay started with PID 7417
2005-03-19 10:14:21 Task bbretest started with PID 7418
2005-03-19 10:14:29 Our child has failed and will not talk to us
2005-03-19 10:14:36 Our child has failed and will not talk to us
Not knowning which child makes it difficult to figure out what is going
on. bbpage is aparently running - the logfile says process 5892 is
bbpage, and there is a process 5892 still running.
I fixed the "unmatched" syntax error I had before.
Here is a sample host that is not paging. The info page lists:
Alerting: Service Recipient 1st Delay Stop after Repeat Time of Day
Colors
conn dan.mcdonald at austinenergy.com (R) 30m - 5d - red
telnet dan.mcdonald at austinenergy.com (R) 30m - 5d - red
Both telnet and conn have been down on this host for over two hours.
The salient rule is:
HOST=%.
MAIL=dan.mcdonald at austinenergy.com REPEAT=140h DURATION>30m
RECOVERED COLOR="red" UNMATCHED
I imagine I'm doing something terribly silly, but I'm just not clear
what it might be.
More information about the Xymon
mailing list