[hobbit] False Process Down Alerts
Josh Luthman
josh at imaginenetworksllc.com
Mon Jan 18 00:21:15 CET 2010
Is there only one client sending data as this name? I don't think you
answered Lars' email.
What does the alert read and what does the data say? Missing process? Too
high of a load?
Josh Luthman
Office: 937-552-2340
Direct: 937-552-2343
1100 Wayne St
Suite 1337
Troy, OH 45373
"The secret to creativity is knowing how to hide your sources."
--- Albert Einstein
On Sun, Jan 17, 2010 at 6:11 PM, Chris Naude <chris.naude.0 at gmail.com>wrote:
> The problem has suddenly become much much worse. I verified with tcpdump
> that the data coming from the client is 100% correct. It seems something on
> the Xymon server side is not handling the client data correctly. Anyone have
> any other ideas?
>
> [image: red] 89% /testdb3 (37771472% used) has reached the PANIC level (95%)
>
> Filesystem 1024-blocks Used Available Capacity Mounted on
> /dev/vgtestdb1/lvol1 107844344 70901816 36942528 66% /testdb1
> /dev/vgtestdb2/lvol1 35962064 25453128 10508936 71% /testdb2
> /dev/vgtestdb4/lvol1 970909400 825006344 145903056 85% /testdb4
> /dev/vgtestdb3/lv
> l1 ] 338788224 301016752 37771472 89% /testdb3
> /dev/vgtestdb5/lvol1 179789048 150553912 29235136 84% /testdb5
> /dev/vg00/lvol8 24580711 74501 24506210 1% /home
> /dev/vg00/lvol4 10226680 6339283 3887397 62% /opt
>
>
>
> On Sat, Jan 16, 2010 at 10:44 AM, Chris Naude <chris.naude.0 at gmail.com>wrote:
>
>> That makes a lot of sense. I did have some issues with the startup scripts
>> on HP-UX. I'll check it out later tonight. Hopefully i can get it fixed
>> before it goes live tonight. Thanks!
>>
>>
>> On Sat, Jan 16, 2010 at 7:56 AM, Lars Ebeling <
>> lars.ebeling at leopg9.no-ip.org> wrote:
>>
>>> It looks like two instances of the client are writing to the file at
>>> the same time or almost ;)
>>>
>>> Lars
>>>
>>> ----- Original Message -----
>>> *From:* Chris Naude <chris.naude.0 at gmail.com>
>>> *To:* hobbit at hswn.dk
>>> *Sent:* Saturday, January 16, 2010 4:59 AM
>>> *Subject:* [hobbit] False Process Down Alerts
>>>
>>> I'm run into a strange problem with my Xymon server. I noticed today that
>>> I'm receiving random false alerts for processes being down. When I look at
>>> the process list output in the alert it looks as if the data coming from the
>>> clients isn't correct. Here is an example. Has anyone seen anything like
>>> this?
>>>
>>> 9613 1944 root Jan 11 S 154 0.00 00:00:00 6128 cmclconfd -c
>>> 10389 1944 root Jan 11 S 154 0.00 00:00:00 6128 cmclconfd -c
>>> 9794 1 oracle 10:55:57 S 154 0.00 00:00:0
>>> 217600]oracleTEST (LOCAL=NO)
>>> 1592 1 oracle Jan 11 S 154 0.00 00:00:11 217136 ora_mman_TEST
>>> 12751 1944 root Jan 11 S 154 0.00 00:00:00 6128 cmclconfd -c
>>> 8965 1944 root Jan 11 S 154 0.00 00:00:00 6128 cmclconfd -c
>>>
>>>
>>> 11819 1 oracle Jan 12 S 154 0.00 00:00:07 217280 ora_j015_TEST
>>> 2711 1 roo
>>> ]ec 4 S 120 0.04 00:02:16 868 /usr/sbin/xntpd
>>> 3547 1 xymon Dec 4 S 168 0.00 00:00:43 268 /opt/xymon/client/bin/hobbitlaunch --config=/opt/xymon/client/etc/clientlaunch.cfg --log=/opt/xymon/client/logs/clientlaunch.log --pidfile=/opt/xymon/client/logs/clientlaunch.101.example.com.pid
>>> 3728 1 root Dec 4 R 152 0.00 00:00:37 4208 /usr/sbin/stm/uut/bin/tools/monitor/WbemWrapperMonitor
>>>
>>>
>>>
>>> Xymon version: 4.3.0-0.beta2
>>> Xymon server: CentOS 5.4 32 bit
>>>
>>> Client: HP-UX 11.31 Itanium
>>>
>>> --
>>> Chris Naude
>>>
>>>
>>
>>
>> --
>> Chris Naude
>>
>
>
>
> --
> Chris Naude
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20100117/a8f9b167/attachment.html>
More information about the Xymon
mailing list