[hobbit] False Process Down Alerts

Chris Naude chris.naude.0 at gmail.com
Mon Jan 18 00:11:44 CET 2010


The problem has suddenly become much much worse. I verified with tcpdump
that the data coming from the client is 100% correct. It seems something on
the Xymon server side is not handling the client data correctly. Anyone have
any other ideas?

[image: red] 89%     /testdb3 (37771472% used) has reached the PANIC level (95%)

Filesystem            1024-blocks  Used  Available Capacity Mounted on
/dev/vgtestdb1/lvol1    107844344 70901816 36942528    66%     /testdb1
/dev/vgtestdb2/lvol1    35962064 25453128 10508936    71%     /testdb2
/dev/vgtestdb4/lvol1    970909400 825006344 145903056    85%     /testdb4
/dev/vgtestdb3/lv
l1 ]  338788224 301016752 37771472    89%     /testdb3
/dev/vgtestdb5/lvol1    179789048 150553912 29235136    84%     /testdb5
/dev/vg00/lvol8       24580711    74501 24506210     1%     /home
/dev/vg00/lvol4       10226680  6339283  3887397    62%     /opt



On Sat, Jan 16, 2010 at 10:44 AM, Chris Naude <chris.naude.0 at gmail.com>wrote:

> That makes a lot of sense. I did have some issues with the startup scripts
> on HP-UX. I'll check it out later tonight. Hopefully i can get it fixed
> before it goes live tonight. Thanks!
>
>
> On Sat, Jan 16, 2010 at 7:56 AM, Lars Ebeling <
> lars.ebeling at leopg9.no-ip.org> wrote:
>
>>  It looks like two instances of the client are writing to the file at the
>> same time or almost ;)
>>
>> Lars
>>
>> ----- Original Message -----
>> *From:* Chris Naude <chris.naude.0 at gmail.com>
>> *To:* hobbit at hswn.dk
>> *Sent:* Saturday, January 16, 2010 4:59 AM
>> *Subject:* [hobbit] False Process Down Alerts
>>
>> I'm run into a strange problem with my Xymon server. I noticed today that
>> I'm receiving random false alerts for processes being down. When I look at
>> the process list output in the alert it looks as if the data coming from the
>> clients isn't correct. Here is an example. Has anyone seen anything like
>> this?
>>
>>  9613  1944 root      Jan 11  S 154  0.00 00:00:00    6128 cmclconfd -c
>> 10389  1944 root      Jan 11  S 154  0.00 00:00:00    6128 cmclconfd -c
>>  9794     1 oracle   10:55:57 S 154  0.00 00:00:0
>>   217600]oracleTEST (LOCAL=NO)
>>  1592     1 oracle    Jan 11  S 154  0.00 00:00:11  217136 ora_mman_TEST
>> 12751  1944 root      Jan 11  S 154  0.00 00:00:00    6128 cmclconfd -c
>>  8965  1944 root      Jan 11  S 154  0.00 00:00:00    6128 cmclconfd -c
>>
>>
>> 11819     1 oracle    Jan 12  S 154  0.00 00:00:07  217280 ora_j015_TEST
>>  2711     1 roo
>>       ]ec  4  S 120  0.04 00:02:16     868 /usr/sbin/xntpd
>>  3547     1 xymon     Dec  4  S 168  0.00 00:00:43     268 /opt/xymon/client/bin/hobbitlaunch --config=/opt/xymon/client/etc/clientlaunch.cfg --log=/opt/xymon/client/logs/clientlaunch.log --pidfile=/opt/xymon/client/logs/clientlaunch.101.example.com.pid
>>  3728     1 root      Dec  4  R 152  0.00 00:00:37    4208 /usr/sbin/stm/uut/bin/tools/monitor/WbemWrapperMonitor
>>
>>
>>
>> Xymon version: 4.3.0-0.beta2
>> Xymon server: CentOS 5.4 32 bit
>>
>> Client: HP-UX 11.31 Itanium
>>
>> --
>> Chris Naude
>>
>>
>
>
> --
> Chris Naude
>



-- 
Chris Naude
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20100117/0d3dd1d3/attachment.html>


More information about the Xymon mailing list