[Xymon] Alerting - I'm not doing it right...

henrik at hswn.dk henrik at hswn.dk
Thu Dec 15 12:36:12 CET 2011


On Thu, 15 Dec 2011 10:02:43 +0000, Carl Inglis <Carl.Inglis at rakon.com>
wrote:
> alerts.cfg
> 
> $EMAIL_ALERT=carl.inglis at rakon.com
> $LIN_WINDOWS_PROBLEMS=$EMAIL_ALERT
> 
> HOST=%lin(.*) SERVICE=%win(.*)
>         MAIL $LIN_WINDOWS_PROBLEMS REPEAT=24h DURATION>1d RECOVERED STOP
> 
> HOST=* EXPAGE=printers
>         MAIL $EMAIL_ALERT REPEAT=1h RECOVERED UNMATCHED STOP
> 
> When the host "lin-apps-01" has a yellow alert on it's "winUpdates"
> services, I expect it to shout about it once every 24h. It is, however,
> shouting about it once every hour.

There may be some confusion about "service" here. 

When you refer to "winUpdates" - is that a status-column in Xymon, or a
Windows Service that you are monitoring with a client on the Windows
machine? The latter would typically show up in a "svcs" (services) status
column on Xymon.

The SERVICE=... setting in alerts.cfg refer to the status-column, not a
Windows service. So to catch a "Windows updates" service that is not
running, you would have 'SERVICE=svcs' in alerts.cfg.

What the first part of your alerts.cfg says, is "if you have a host whose
name contains 'lin', and that host has a status-column that contains 'win',
then send an alert after 1 day, and repeat every 24 hours".

The second part of your configuration says "Any status that has an error -
except those on the 'printers' page, and those handled by other rules -
trigger an alert that is repeated once an hour". Pretty broad definition, I
think.


Hope that removes a bit of confusion.


Regards,
Henrik




More information about the Xymon mailing list