alerts still not alerting

Daniel J McDonald dan.mcdonald at austinenergy.com
Sat Mar 19 17:33:09 CET 2005


I'm still flummoxed by hobbit-alerts.  I'm certain I broke something,
because I am not getting any alerts from the box.

The only logs in /var/log/hobbit/page.log are 
2005-03-11 07:49:30 Tried to down BOARDBUSY: Invalid argument
2005-03-14 17:24:21 Tried to down BOARDBUSY: Invalid argument

I see a couple of those in the hobbitlaunch.log file as well, I also see
the following error:
2005-03-19 10:14:21 Task bbdisplay started with PID 7417
2005-03-19 10:14:21 Task bbretest started with PID 7418
2005-03-19 10:14:29 Our child has failed and will not talk to us
2005-03-19 10:14:36 Our child has failed and will not talk to us

Not knowning which child makes it difficult to figure out what is going
on.  bbpage is aparently running - the logfile says process 5892 is
bbpage, and there is a process 5892 still running.

I fixed the "unmatched" syntax error I had before.

Here is a sample host that is not paging.  The info page lists:
Alerting: Service Recipient 1st Delay Stop after Repeat Time of Day
Colors 
conn dan.mcdonald at austinenergy.com (R) 30m  - 5d  - red 
telnet dan.mcdonald at austinenergy.com (R) 30m  - 5d  - red 

Both telnet and conn have been down on this host for over two hours.

The salient rule is:
HOST=%.
        MAIL=dan.mcdonald at austinenergy.com REPEAT=140h DURATION>30m
RECOVERED COLOR="red" UNMATCHED

I imagine I'm doing something terribly silly, but I'm just not clear
what it might be.



More information about the Xymon mailing list