[Xymon] New server causing issues with CONN test

Poppy, Ben poppy.ben at marshfieldclinic.org
Mon Aug 15 23:25:44 CEST 2011


The new server went into a "flapping" state.

During my next test, I'll try stopping the tests on the new server and see what happens.. 

-----Original Message-----
From: xymon-bounces at xymon.com [mailto:xymon-bounces at xymon.com] On Behalf Of Henrik Størner
Sent: Monday, August 15, 2011 4:17 PM
To: xymon at xymon.com
Subject: Re: [Xymon] New server causing issues with CONN test

On 15-08-2011 22:46, Poppy, Ben wrote:
> I'm having a pretty strange issue. We have our existing hobbit servers
> running on Fedora servers running hobbit 4.2.0. I'm working on
> installing brand new servers that will be running CentOS 6 64-bit and
> the latest version of xymon (4.3.3 before I saw 4.3.4 today).

[installs and starts 4.3 version]

> Within a few minutes, 4 servers turn to red alerts on CONN on the
> existing Fedora based Hobbit servers. They begin flapping on and off of
> red alert until I shutdown the new CentOS xymon server. Within a few
> minutes of the new server being shut down, the alerts go away for good.
>
> I have tried going to Centos 5 32-bit, 64-bit, even trying xymon 4.2.3,
> or all the way back to hobbit 4.2.0 all with the same result, and the
> exact same 4 servers each time.

As I understand, you were running both versions simultaneously. Did 
those servers also go red on the new Xymon version, or only on the old 
one? If they were red also on the new server, did you try stopping 
network tests on the old server and did that make a difference ?

Which ping-tool are you using - xymonping or fping ?

I haven't heard of anything like this before, but I suspect it may be an 
issue with the way "ping" works. When routing traffic, most systems will 
pass ping-traffic with a low priority, so it is quite easy for 
ping-requests and -responses to be dropped. Since xymonping and fping 
pump out a lot of ping-traffic rather quickly, maybe the new server just 
happened to be more "lucky" with its data than the old one - perhaps due 
to the switch port it is on, or the speed of the network interface and 
so on.

It might be worthwhile to make sure that the old and the new system does 
not run the network tests at the same time - keep an eye (with "ps" on 
when the network test runs on the old system, and don't start Xymon on 
the new system until about 30 secs after the old system completes the 
network tests. (Assuming your network tests don't take more than a 
couple of minutes, so there is time for both systems to run their tests 
within the default 5 minute interval).


Regards,
Henrik
_______________________________________________
Xymon mailing list
Xymon at xymon.com
http://lists.xymon.com/mailman/listinfo/xymon

______________________________________________________________________
The contents of this message may contain private, protected and/or privileged information.  If you received this message in error, you should destroy the e-mail message and any attachments or copies, and you are prohibited from retaining, distributing, disclosing or using any information contained within.  Please contact the sender and advise of the erroneous delivery by return e-mail or telephone.  Thank you for your cooperation.



More information about the Xymon mailing list