[Xymon] FW: Troubleshooting Purple CONN and HTTP Tests in Xymon 4.3.10

Larry Barber lebarber at gmail.com
Tue Nov 6 15:47:55 CET 2012


Did you check whether a xymonnet process is (or was) still running? If a
process hangs for some reason, xymonlaunch won't start a new one. I had
this happen to me once, but only once. There is also a --debug flag for
xymonnet; it produces a _lot_ of output, but it might give you some idea
of what is going on.
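
A minimal sketch of that check, assuming a ps that supports the "etime"
output column. The helper names (etime_to_seconds, list_stale) and the
300-second limit are illustrative assumptions, not part of Xymon; pick a
limit a bit above your own xymonnet interval.

```shell
#!/bin/sh
# Convert a ps "etime" value ([[dd-]hh:]mm:ss) to seconds.
etime_to_seconds() {
    echo "$1" | awk -F'[-:]' '{
        if (NF == 4)      s = $1*86400 + $2*3600 + $3*60 + $4   # dd-hh:mm:ss
        else if (NF == 3) s = $1*3600 + $2*60 + $3              # hh:mm:ss
        else              s = $1*60 + $2                        # mm:ss
        print s
    }'
}

# Print "PID SECONDS" for every process whose command name is $1
# and which has been running for more than $2 seconds.
list_stale() {
    name=$1
    limit=$2
    ps -eo pid=,etime=,comm= | while read -r pid etime comm; do
        [ "$comm" = "$name" ] || continue
        secs=$(etime_to_seconds "$etime")
        [ "$secs" -gt "$limit" ] && echo "$pid $secs"
    done
    return 0
}

# Assumed threshold: 300 seconds ("more than a few minutes").
list_stale xymonnet 300
```

No output means no xymonnet process has been running longer than the
limit. Any PID it does print is a candidate for a closer look (and
possibly a kill) before xymonlaunch will start a fresh run.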

Thanks,
Larry Barber

On Tue, Nov 6, 2012 at 8:02 AM, Don Kuhlman <Don.Kuhlman at schawk.com> wrote:

>  Thanks Larry. Looks like everything went purple again at 6:45 this
> morning.  The logs still show 0 bytes.
> Any other suggestions for trying to figure this out?
>
>  Regards,
>
>  Don
>
>   From: Larry Barber <lebarber at gmail.com>
> Date: Mon, 5 Nov 2012 17:19:53 -0600
> To: Don Kuhlman <don.kuhlman at schawk.com>
>
> Subject: Re: [Xymon] FW: Troubleshooting Purple CONN and HTTP Tests in
> Xymon 4.3.10
>
>  Xymonnet tends to be pretty quiet unless something goes wrong. You won't
> be able to tell for sure until you get one of your purple storms.
>
>  Alerts are handled by a different module. Look in tasks.cfg to find it.
>
>  Thanks,
> Larry Barber
>
> On Mon, Nov 5, 2012 at 3:53 PM, Don Kuhlman <Don.Kuhlman at schawk.com> wrote:
>
>>  Hi Larry/all.  I've noticed that the xymonnet.log and
>> xymonnet-again.log files are staying at 0 bytes.  Does that seem to be
>> indicating a problem?
>> (and Xymon hasn't gone purple all day, but I'm still not sending any
>> email alerts to anyone).
>>
>> -rw-rw-rw- 1 xymon xymon        0 Nov  5 15:05 /var/log/xymon/xymonnet-again.log
>> -rw-rw-rw- 1 xymon xymon        0 Nov  5 15:07 /var/log/xymon/xymonnet.log
>>
>>  Thanks
>>
>>  Don K
>>
>>
>>
>>   From: Larry Barber <lebarber at gmail.com>
>> Date: Mon, 5 Nov 2012 11:19:32 -0600
>> To: Don Kuhlman <don.kuhlman at schawk.com>
>>  Cc: Xymon Email List <xymon at xymon.com>
>> Subject: Re: [Xymon] FW: Troubleshooting Purple CONN and HTTP Tests in
>> Xymon 4.3.10
>>
>>  All the server-side Xymon logs are in /var/log/xymon by default. Since
>> you are getting purple storms for the conn and http tests, the problem is
>> likely with your xymonnet process. Check the xymonnet log, and when you
>> see the purples, check whether a xymonnet instance is running. If that
>> instance has been running for more than a few minutes, kill it. If the
>> xymonnet process is hanging, you might want to set the MAXTIME parameter
>> for xymonnet in tasks.cfg. That doesn't really fix the problem, but it
>> will at least stop things from going purple.
>>
>>  Thanks,
>> Larry Barber
>>
>> On Mon, Nov 5, 2012 at 10:01 AM, Don Kuhlman <Don.Kuhlman at schawk.com> wrote:
>>
>>>  Update to this. While googling further, I saw a thread titled
>>> "[hobbit] stale alerts".  This mentioned that there could be an external
>>> script that I created which may cause issues for xymon when it runs.  I do
>>> have a diskstat.sh script that may be causing problems. For now, I'm
>>> setting it to DISABLED in the tasks.cfg file.
>>>
>>>  Is there a way to see log information in xymon to try and verify
>>> something like this?
>>>
>>>  Thanks
>>>
>>>  Don K
>>>
>>>   From: Don Kuhlman <don.kuhlman at schawk.com>
>>> Date: Mon, 5 Nov 2012 08:34:29 -0600
>>> To: Xymon Email List <xymon at xymon.com>
>>> Subject: Troubleshooting Purple CONN and HTTP Tests in Xymon 4.3.10
>>>
>>>   Hi folks.  We've been running xymon for about 10 months now. It's
>>> been fine all this time.
>>>
>>>  However, last week around Wednesday we started getting purple storms on
>>> the CONN and HTTP tests for all our hosts.
>>> If I stop Xymon and restart it, or reboot the server (Linux 5.x), it
>>> comes back OK.
>>> This also happened Thursday, and then again Saturday around 2 PM CST.
>>>
>>>  Anyone have a link or source for which logs to look in on the server
>>> or xymon to see what may be causing the CONN and HTTP tests to randomly
>>> start failing like this or where to start troubleshooting?
>>>
>>>  Can I use xymonlaunch --debug like this to see what is happening?
>>>          /usr/lib64/xymon/server/bin/xymonlaunch --debug \
>>>              --config=/usr/lib64/xymon/server/etc/tasks.cfg \
>>>              --env=/usr/lib64/etc/xymonserver.cfg
>>>
>>>
>>>
>>>  While searching the xymon forum and message boards, I saw some things
>>> that say it may be disk space or inodes, but it seems like we are ok there -
>>>  df -i
>>> Filesystem            Inodes   IUsed   IFree IUse% Mounted on
>>> /dev/sda2            3899392  204731 3694661    6% /
>>> tmpfs                 490139       6  490133    1% /dev/shm
>>> /dev/sda1              32768      51   32717    1% /boot
>>>
>>>  df
>>>  Filesystem           1K-blocks      Used Available Use% Mounted on
>>> /dev/sda2             61312028   5748784  52448700  10% /
>>> tmpfs                  1960556       188   1960368   1% /dev/shm
>>> /dev/sda1               516040     87716    402112  18% /boot
>>>
>>>  DNS also seems fine.
>>>
>>>  Thanks
>>>
>>>  Don K
>>>
>>> _______________________________________________
>>> Xymon mailing list
>>> Xymon at xymon.com
>>> http://lists.xymon.com/mailman/listinfo/xymon
>>>
>>>
>>
>