[hobbit] EXHOST usage

Charles Jones jonescr at cisco.com
Wed Oct 26 18:52:40 CEST 2005


Henrik Stoerner wrote:

>On Wed, Oct 26, 2005 at 04:12:00AM -0700, Charles Jones wrote:
>  
>
>>Perhaps it's because I'm working on this at 4am, but I'm having a 
>>problem with the EXHOST option: according to hobbitd_alert --test it 
>>isn't working. I'm also not sure how to do a particular host/service 
>>exclusion.
>>
>>Here's basically what my alert config below is meant to accomplish.
>>1. For any alerts on any servers, send alerts to an alert email address.
>>2. For 2 particular web servers (web5.mydomain.com and 
>>web6.mydomain.com), send an alert to one person, but *not* the alert alias.
>>3. For a set of oracle servers, send an extra alert message to an 
>>alternate email address/cellphone.
>>    
>>
>
>One way of doing these would be:
>
># 2 special webservers, that ONLY get this alert (2)
>HOST=$WEB_SERVERS SERVICE=msgs COLOR=red
>	MAIL webdev at mydomain.com STOP
>
># Oracle alerts (3)
>HOST=$ORACLE_SERVERS SERVICE=msgs,oradb,orasys COLOR=red FORMAT=sms
>	MAIL dbacell at cellphone.com
>
># Default rule (1)
>HOST=$ALL_HOSTS SERVICE=* COLOR=red 
>	MAIL alert at mydomain.com
>
>  
>
>>4. After hours (from 5pm until 8am), only send alerts to an alternate 
>>email address (but still need the separate alert for the web5 and web6 
>>hosts described in #2).
>>5. After hours (from 5pm until 8am), send an alert to my cellphone for 
>>any hosts and services being red for more than 30 mins.
>>    
>>
>
>For these, modify the default rule marked (1) to use different alerts
>based on time. E.g.
>
># Default rule (1)
>HOST=$ALL_HOSTS SERVICE=* COLOR=red 
>	MAIL alert at mydomain.com TIME=*:0800:1700
>	# Outside office hours, mail alerts to a different address (4)
>	MAIL alternate at mydomain.com TIME=*:1700:0800
>	# Outside office hours, send to my cell phone (5)
>	MAIL mycell at cellphone.com FORMAT=sms DURATION>30 TIME=*:1700:0800
>
>  
>
Ahh! I didn't realize you could make multiple TIME 
specifications...that's the main thing I was missing.

>>6. Do not alert for high load average on a particular server from 6-10am.
>>    
>>
>
>There's no really elegant way of doing that ... it makes me think that
>perhaps there should be some way of defining a "no-action" rule: "For
>these conditions, do NOT send any alerts, and stop looking for more
>alert recipients".
>
That would be nice. I hereby dub it the BLACKHOLE option ;-)

> But for now, you'll have to modify the default rule
>to exclude that host, then set up specific rules for that host. So your
>default rule becomes
>
># Default rule (1)
>HOST=$ALL_HOSTS SERVICE=* COLOR=red EXHOST=dataproc1.mydomain.com
>	MAIL alert at mydomain.com TIME=*:0800:1700
>	# Outside office hours, mail alerts to a different address (4)
>	MAIL alternate at mydomain.com TIME=*:1700:0800
>	# Outside office hours, send to my cell phone (5)
>	MAIL mycell at cellphone.com FORMAT=sms DURATION>30 TIME=*:1700:0800
>
>and the specific rules for that host:
>
># Load avg alerts only from 10am -> 6am
>HOST=dataproc1.mydomain.com SERVICE=la TIME=*:1000:0600
>	MAIL alert at mydomain.com TIME=*:0800:1700
>	MAIL alternate at mydomain.com TIME=*:1700:0800
>	MAIL mycell at cellphone.com FORMAT=sms DURATION>30 TIME=*:1700:0800
># All other services alert like the normal default rule.
>HOST=dataproc1.mydomain.com EXSERVICE=la
>	MAIL alert at mydomain.com TIME=*:0800:1700
>	MAIL alternate at mydomain.com TIME=*:1700:0800
>	MAIL mycell at cellphone.com FORMAT=sms DURATION>30 TIME=*:1700:0800
>
>
>  
>
This has me a bit confused. The default rule I understand: it's the 
normal rule, except that it's excluding the dataproc1 host. The first of 
the specific rules, though, has a TIME specification in the HOST= line, 
indicating from 6am-10am, but then the MAIL lines following it specify 
times outside that window. Is that basically a way to trick hobbit into 
not sending a mail at all?
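
One way to reason about how the two levels of TIME interact is a small 
Python sketch. It assumes (nothing in this thread confirms it) that a 
window whose start is later than its end wraps past midnight, as the 
"10am -> 6am" comment suggests, and that a recipient is mailed only when 
both the rule-level window and the recipient-level window match. The 
function names and the half-open boundaries are illustrative only, not 
Hobbit's actual code, and the DURATION/sms recipient is left out.

```python
def in_window(hhmm: int, start: int, end: int) -> bool:
    """True if hhmm (e.g. 1430) is in [start, end).

    Assumption: when start > end the window wraps past midnight.
    """
    if start <= end:
        return start <= hhmm < end
    return hhmm >= start or hhmm < end


# Rule-level window from the dataproc1 "la" rule (TIME=*:1000:0600).
RULE_WINDOW = (1000, 600)

# Per-recipient windows from the MAIL lines under that rule.
RECIPIENTS = [
    ("alert", (800, 1700)),
    ("alternate", (1700, 800)),
]


def recipients_at(hhmm: int) -> list[str]:
    """Recipients mailed for a red "la" status on dataproc1 at time hhmm."""
    if not in_window(hhmm, *RULE_WINDOW):
        return []  # rule itself does not match, so nobody is mailed
    return [name for name, win in RECIPIENTS if in_window(hhmm, *win)]


for t in (300, 700, 1200, 2000):
    print(f"{t:04d}", recipients_at(t))
```

Under those assumptions the rule-level window covers everything except 
06:00-10:00, so the MAIL lines still apply for the rest of the day and 
only the morning window goes silent.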

Note: the way I handle this in BigBrother is via an exclude rule. 
When you define a rule with a ! in front of it, it removes that 
host/service from the FINAL match list. Hopefully you can implement 
something in Hobbit for a similar effect.

# Don't wake OnCall person every morning about dataproc1 cpu/load being high
!dataproc1.mydomain.com;;cpu;;*;0600-1000;alert at mydomain.com

I also use the same technique on BigBrother to remove alerts during 
certain hours:
# Don't send alerts about web errors during non-working hours.
!web*.mydomain.com;;msgs;;*;0000-0800;alert at mydomain.com
!web*.mydomain.com;;msgs;;*;1700-0000;alert at mydomain.com
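
To make the "remove from the FINAL match list" idea concrete, here is a 
rough Python sketch of the semantics I'm describing. It is not 
BigBrother's code: the rule tuples, the catch-all first rule, and the 
function name are all made up for illustration, and times are plain 
same-day HHMM ranges.

```python
from fnmatch import fnmatch

# Hypothetical, simplified rules:
# (is_exclude, host glob, service, (start, end), recipient)
RULES = [
    (False, "*", "*", (0, 2400), "alert"),
    (True, "dataproc1.mydomain.com", "cpu", (600, 1000), "alert"),
]


def final_recipients(host: str, service: str, hhmm: int) -> list[str]:
    """Collect recipients from normal rules, then drop those hit by ! rules."""
    matched = set()
    excluded = set()
    for is_exclude, host_glob, svc, (start, end), rcpt in RULES:
        if not fnmatch(host, host_glob):
            continue
        if svc != "*" and svc != service:
            continue
        if not (start <= hhmm < end):
            continue
        (excluded if is_exclude else matched).add(rcpt)
    # Exclusions are applied last, against the complete match list.
    return sorted(matched - excluded)


print(final_recipients("dataproc1.mydomain.com", "cpu", 700))
print(final_recipients("dataproc1.mydomain.com", "cpu", 1100))
```

The point of applying exclusions at the end is that the order of the 
rules does not matter: an exclude always wins over any normal rule that 
matched the same recipient.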

-Charles