[Xymon] xymon hostdata module going rogue

John Thurston john.thurston at alaska.gov
Tue Dec 1 18:32:58 CET 2015


I was bit by this in the middle of November, and didn't notice it until 
a customer alerted me today to a shortage of email messages.

To recap:

Some alerts get sent correctly, but in other cases the alert daemon 
aborts message processing and no alert is sent. In the cases where the 
daemon stops processing, my debug log begins to accumulate messages of 
the sort:
> 1730 2015-12-01 07:58:39.501785 Checking criteria for host 'upsjdc.state.ak.us', which is not defined

There is sometimes a <defunct> process left hanging around. At other 
times there is not.

Performing a "xymon.sh restart" makes it all work again.

Today, I had a process tree something like:
> 29118 /opt/xymon/server/bin/xymonlaunch --config=/opt/xymon/server/etc/tasks.cfg --en
>   29119 xymond --pidfile=/var/log/xymon/xymond.pid --restart=/opt/xymon/server/tmp/xymo
>   29120 /opt/xymon/server/bin/xymonfetch --id=1 --interval=79 --no-daemon --pidfile=/va
>   29144 xymond_channel --channel=stachg --log=/var/log/xymon/history.log xymond_history
>     29201 xymond_history --pidfile=/var/log/xymon/xymond_history.pid
>   29145 xymond_channel --channel=page --log=/var/log/xymon/alert.log xymond_alert --deb
>     29307 xymond_alert --debug --checkpoint-file=/opt/xymon/server/tmp/alert.chk --checkp
>       1588  <defunct>

I killed off PID 29145, it was recreated, and the alerts began flowing 
again.

In this occurrence, it does not appear to be related to a "drop" 
message. My last recorded "drop" was at 20151103-0846 and the alert 
process didn't start logging "which is not defined" until 20151120-0007

The only thing I can think to do now is make my xymon client monitor the 
alert.log and warn me when "which is not defined" start appearing so I 
can manually kill/restart the process.
-- 
    Do things because you should, not just because you can.

John Thurston    907-465-8591
John.Thurston at alaska.gov
Enterprise Technology Services
Department of Administration
State of Alaska



More information about the Xymon mailing list