[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [hobbit] possible alerting bug in RC2?



On Mon, Feb 14, 2005 at 01:28:28PM -0800, Bruce Lysik wrote:

> So I installed RC2 this morning.  Later on, I noticed an alert email
>  for a monitor going into yellow.  I had disabled this previously
>  with --alertcolors=red,purple in hobbitlaunch.cfg.

[config from another mail]

>HOST=<snip list of about 100 hosts>
>        SCRIPT /opt/bb/server/ext/email bruce_mail FORMAT=SCRIPT REPEAT=30 DURATION>6 RECOVERED

> And here's the alert I just received:
> 
> im68:cpu yellow [-1]
> yellow Mon Feb 14 13:13:56 PST 2005 up: 208 day(s), 1 users, 115 procs, load=529

The alert you show here looks like a recovery-notice (the "-1" I
assume is the acknowledgment cookie, and this value indicates that
there is no active alert).

If you look in the ~/data/ack/notifications.log file for these
notifications, you can tell if it's an alert message or a recovery
message by the number of columns in the file. E.g. in my log I have

Wed Feb 16 13:08:43 2005 www.sslug.dk.smtp (130.228.2.150) henrik (at) hswn.dk 1108555723 725
Wed Feb 16 13:09:44 2005 www.sslug.dk.smtp (130.228.2.150)henrik (at) hswn.dk 1108555784 725 61

The first one is the alert message, the second is the recovery
message. The recovery has an extra field "61", which is the duration
of the event (in seconds).


Could you check the following in hobbitlaunch.cfg:

* The "hobbitd" command has "--alertcolors=red,purple --okcolors=green"
* The "hobbitd_alert" command has "--alertcolors=red,purple"

This setup should give you alerts when a status is red (or purple),
and recovery notices only when they go green (after being red or
purple).


Regards,
Henrik