
hobbit-alerts.cfg question



Hi all

The alerting is starting to take shape but I've a question regarding
how the alerting works.  If I have a stanza similar to the following,
how is it evaluated?  Once for all hosts, or for one host at a time?
---
HOST=%.*
        # Proliant tests
        MAIL sms (at) somecompany.com SERVICE=proliant FORMAT=SMS REPEAT=1440m
        MAIL sms (at) somecompany.com SERVICE=proliant FORMAT=SMS RECOVERED

        # conn where status is RED
        MAIL sms (at) somecompany.com COLOR=red SERVICE=conn EXPAGE=dev REPEAT=1440m
        MAIL sms (at) somecompany.com COLOR=red SERVICE=conn EXPAGE=dev RECOVERED

        # conn where status is RED (dev/test)
        MAIL email (at) somecompany.com COLOR=red SERVICE=conn PAGE=dev REPEAT=1440m
        MAIL email (at) somecompany.com COLOR=red SERVICE=conn PAGE=dev RECOVERED

        # cpu,disk,memory where status is RED
        MAIL sms (at) somecompany.com COLOR=red SERVICE=cpu,disk,memory EXPAGE=dev REPEAT=1440m
        MAIL sms (at) somecompany.com COLOR=red SERVICE=cpu,disk,memory EXPAGE=dev RECOVERED

        # Dev servers
        MAIL email (at) somecompany.com COLOR=red SERVICE=cpu,disk,memory PAGE=dev REPEAT=1440m
        MAIL email (at) somecompany.com COLOR=red SERVICE=cpu,disk,memory PAGE=dev RECOVERED

        # Non-dev status YELLOW
        MAIL email (at) somecompany.com COLOR=yellow SERVICE=cpu,disk,memory REPEAT=1440m DURATION>30m
        MAIL email (at) somecompany.com COLOR=yellow SERVICE=cpu,disk,memory RECOVERED
---

Also, I've noticed that when a fault occurs I get two emails (or SMS
messages), and another when the fault is rectified.  I think this is
because of the 'RECOVERED' line, but I thought that line would only
trigger when the fault clears.  Have I misunderstood?

Thanks

CC

-- 
RHCE#805007969328369