[Xymon] XYMOND Crash with IPCS issue

Gautier Begin gbegin at csc.com
Fri May 9 08:39:21 CEST 2014


Hello,

I finally solved the problem. The Xymon crash was caused by a flood of data 
coming from the proxy. The solution was to throttle the data flow on the 
proxy by tuning MAXMSGSPERCOMBO and SLEEPBETWEENMSGS in the 
xymonserver.cfg file.
Unfortunately, the xymonserver.cfg man page is much less explicit than 
the xymonnet one about how to use these values.

So I lowered MAXMSGSPERCOMBO and raised SLEEPBETWEENMSGS, and that 
solved the problem. 

MAXMSGSPERCOMBO="50"            # Default 100  - 0 =>unlimited
SLEEPBETWEENMSGS="5000"         # microseconds
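
As a rough way to reason about these two values, the sketch below estimates how many combo messages a batch of status messages is split into and how much pause time that adds. It assumes one pause after every combo message sent, which is my reading of the xymonnet man page and not something I checked in the source code. The function name and the batch of 2000 status messages are made up for illustration.

```python
import math


def combo_pacing(status_count, max_msgs_per_combo, sleep_between_msgs_us):
    """Return (combo_messages, total_pause_seconds) for one batch of statuses.

    Assumption: one pause of sleep_between_msgs_us microseconds follows
    every combo message sent. max_msgs_per_combo == 0 means "unlimited",
    i.e. the whole batch goes out in a single combo message.
    """
    if status_count <= 0:
        return 0, 0.0
    if max_msgs_per_combo <= 0:
        combos = 1
    else:
        combos = math.ceil(status_count / max_msgs_per_combo)
    return combos, combos * sleep_between_msgs_us / 1_000_000


if __name__ == "__main__":
    # Hypothetical batch of 2000 status messages with the settings above.
    combos, pause = combo_pacing(2000, 50, 5000)
    print(f"{combos} combo messages, {pause:.3f} s of added pause")
```

Under that assumption, 2000 status messages with the settings above become 40 combo messages with about 0.2 s of added pause in total, so the pause mostly spreads the load out and does not delay it by much.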

The result can be seen in the graphs on the xymonproxy and xymond test 
pages.

One issue remains: during the high-flow period, xymonnet on the proxy 
doesn't send any data. Should I keep lowering MAXMSGSPERCOMBO and raising 
SLEEPBETWEENMSGS?


Regards,

Gautier BEGIN

System Tools Team Lead
CACEIS and APERAM accounts
CSC Computer Sciences Luxembourg S.A.
12D Impasse Drosbach
L-1882 Luxembourg

Global Outsourcing Service | p:+352 24 834 276 | m:+352 621 229 172 | 
gbegin at csc.com | www.csc.com


CSC • This is a PRIVATE message. If you are not the intended recipient, 
please delete without copying and kindly advise us by e-mail of the 
mistake in delivery.  NOTE: Regardless of content, this e-mail shall not 
operate to bind CSC to any order or other contract unless pursuant to 
explicit written agreement or government initiative expressly permitting 
the use of e-mail for such purpose
 • 
CSC Computer Sciences SAS • Registered Office: Immeuble Le Balzac, 10 
Place des Vosges, 92072 Paris La Défense Cedex, France • Registered in 
France: RCS Nanterre B 315 268 664



From:   Gautier Begin/LUX/CSC at CSC
To:     "xymon at xymon.com" <xymon at xymon.com>
Cc:     "Xymon" <xymon-bounces at xymon.com>
Date:   05/08/2014 10:22 AM
Subject:        Re: [Xymon] XYMOND Crash with IPCS issue
Sent by:        "Xymon" <xymon-bounces at xymon.com>



One more piece of information: 

I have to empty the server/tmp directory, where the checkpoint files are 
stored, before I can restart Xymon. 
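
For anyone who needs to do the same clean-up, here is a minimal sketch that lists the checkpoint files and removes them only when asked to. The file name pattern xymond.chk.<timestamp> and the directory are taken from the log message quoted further down in this thread. The function names are mine, and I am assuming that Xymon is stopped before anything is deleted.

```python
import glob
import os


def find_checkpoints(tmp_dir, prefix="xymond.chk."):
    """Return checkpoint files in tmp_dir, oldest first (by modification time)."""
    pattern = os.path.join(glob.escape(tmp_dir), prefix + "*")
    return sorted(glob.glob(pattern), key=os.path.getmtime)


def clear_checkpoints(tmp_dir, dry_run=True):
    """Return the checkpoint files found; delete them unless dry_run is True."""
    found = find_checkpoints(tmp_dir)
    if not dry_run:
        for path in found:
            os.remove(path)
    return found


if __name__ == "__main__":
    # Path taken from the log message in this thread; adjust for your install.
    tmp = "/project/xymon0/refer/xymon_cur-vers/server/tmp"
    for path in clear_checkpoints(tmp, dry_run=True):
        print("would remove", path)
```

The dry run is the default on purpose, so that the list can be checked before calling clear_checkpoints(tmp, dry_run=False).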





From:        Gautier Begin/LUX/CSC at CSC 
To:        Jeremy Laidman <jlaidman at rebel-it.com.au> 
Cc:        "xymon at xymon.com" <xymon at xymon.com> 
Date:        05/08/2014 08:52 AM 
Subject:        Re: [Xymon] XYMOND Crash with IPCS issue 
Sent by:        "Xymon" <xymon-bounces at xymon.com> 



Hello, 

Yes, it is. This is the root directory of the Xymon server. 

The server crashed again during the night. I found these messages in the 
xymonlaunch log: 
2014-05-08 00:10:14 Fatal error in select: Invalid argument 
2014-05-08 00:10:14 Cannot open checkpoint file /project/xymon0/refer/xymon_cur-vers/server/tmp/xymond.chk.1399500614 : Too many open files 

After that, the logs of all the channels show: 
2014-05-08 00:10:14 Tried to down BOARDBUSY: Invalid argument            
8941 2014-05-08 00:10:14 Semaphore wait aborted: Invalid argument 
8941 2014-05-08 00:10:14 Semaphore wait aborted: Invalid argument 


The current limit on open files is 4096. Right now, however, xymonlaunch 
has only 3 files open and xymond 4. The other Xymon channel processes 
have 4 each, so the total is no more than 100. 
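
Since the counts look harmless when checked by hand, it may be more useful to sample them regularly and catch the moment they climb. Below is a minimal sketch for that, assuming a Linux-style /proc file system where every open descriptor of a process appears as an entry under /proc/<pid>/fd. The function names are hypothetical, and reading another user's process needs the matching privileges.

```python
import os
import resource


def count_open_fds(pid, proc_root="/proc"):
    """Count entries in <proc_root>/<pid>/fd (Linux-style procfs assumed)."""
    return len(os.listdir(os.path.join(proc_root, str(pid), "fd")))


def fd_headroom(pid, soft_limit, proc_root="/proc"):
    """Return (open_fds, remaining) relative to the given soft limit."""
    used = count_open_fds(pid, proc_root)
    return used, soft_limit - used


if __name__ == "__main__":
    # The limit read here belongs to this Python process; a daemon started
    # from another shell or init script may run with a different one.
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    used, left = fd_headroom(os.getpid(), soft)
    print(f"pid {os.getpid()}: {used} descriptors open, {left} below limit {soft}")
```

Run from cron or a loop with the pid of xymond in place of os.getpid(), this would show whether the descriptor count really stays near 4 up to the crash.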




From:        Jeremy Laidman <jlaidman at rebel-it.com.au> 
To:        Gautier Begin/LUX/CSC at CSC 
Cc:        "xymon at xymon.com" <xymon at xymon.com> 
Date:        05/08/2014 02:22 AM 
Subject:        Re: [Xymon] XYMOND Crash with IPCS issue 



On 7 May 2014 19:36, Gautier Begin <gbegin at csc.com> wrote: 
calling ftok('/project/xymon0/refer/xymon_cur-vers/server',4) 
ftok() returns: 0x400FD56 
Could not get shm of size 2621440: No such file or directory 

Does this exist: /project/xymon0/refer/xymon_cur-vers/ 

Is it readable by the Xymon user? 
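
Both questions can be checked with a short script run as the Xymon user. The sketch below also recomputes the key in the way glibc's ftok() builds it, as far as I know: low 8 bits of the project id, low 8 bits of the device number, low 16 bits of the inode number. Other C libraries may combine them differently, and the function names are made up.

```python
import os


def ftok_key(st_dev, st_ino, proj_id):
    """Combine device, inode and project id the way glibc's ftok() does."""
    return ((proj_id & 0xFF) << 24) | ((st_dev & 0xFF) << 16) | (st_ino & 0xFFFF)


def check_ipc_path(path, proj_id):
    """Report whether path is usable as an ftok() path for the current user."""
    info = {
        "exists": os.path.isdir(path),
        "readable": os.access(path, os.R_OK | os.X_OK),
        "key": None,
    }
    if info["exists"]:
        st = os.stat(path)
        info["key"] = hex(ftok_key(st.st_dev, st.st_ino, proj_id))
    return info


if __name__ == "__main__":
    # Path and project id taken from the ftok() call quoted above.
    print(check_ipc_path("/project/xymon0/refer/xymon_cur-vers/server", 4))
```

With that layout, the key 0x400FD56 quoted above decodes to project id 4 and an inode number ending in 0xFD56, so if the script prints the same key the directory is the one ftok() saw.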

Cheers 
Jeremy 
_______________________________________________
Xymon mailing list
Xymon at xymon.com
http://lists.xymon.com/mailman/listinfo/xymon

