[Xymon] xymond_rrd program crashed

Tom Schmidt (tschmidt) tschmidt at micron.com
Wed Aug 7 01:04:52 CEST 2019


Andreas,
    I likewise am seeing xymond_rrd program crashing with 4.3.29 but it did not with 4.3.28.  If you look at the differences in the xymond/rrd/do_netstat.c between 4.3.28 and 4.3.29 you will see that the code was updated to handle newer Linux systems net-tools format.  Your logs also show that it is not recognizing the OS for your "rackstation*" hosts.  I do not know what type of OS these systems are, but you may have to update your netstat monitor or the xymond/rrd/do_netstat.c code to recognize their netstat output format.

    For me, xymond_rrd is crashing when trying to extract RRD data from some of our temperature reports.  Some of our older temperature monitors returned data in this format:

Device                  Temp(C)  Temp(F)  Threshold(C)
------------------------------------------------------
&green System Board Inlet Temp    21       69     ( 42)
&green CPU1 Temp               32       89     ( 93)
&green CPU2 Temp               31       87     ( 93)
------------------------------------------------------

Xymon 4.3.28 handled this properly by treating the data in the parenthesis as comments.  Attached is my patch to fix xymond/rrd/do_temperature.c to handle temperature reports like the above.  Most of this code was already in 4.3.28 but I do not know why it was removed.

Japheth,
   Please consider adding this patch for xymond/rrd/do_temperature.c  to the next release.  It also includes the stripping off of any leading bold and italic HTML tags from sensor names that I submitted earlier.


Thanks...Tom


[http://collab.micron.com/corp/brand/SiteAssets/Micron.png]<http://www.micron.com/>
Tom Schmidt
Sr Manager, IT, Product Engineering
IT ETD Eng Sites US
Micron Technology, Inc.
Office: +1 (208) 368-4058  Fax: (208)368-2807
Email: tschmidt at micron.com<mailto:tschmidt at micron.com>  Website: micron.com<http://www.micron.com/>
Micron Technology, Inc., Confidential and Proprietary.


From: Xymon <xymon-bounces at xymon.com> On Behalf Of Andreas Kunberger
Sent: Monday, August 5, 2019 4:13 AM
To: 'Xymon at xymon.com' <Xymon at xymon.com>
Subject: [EXT] [Xymon] xymond_rrd program crashed

Since we  have updated to Xymon 4.3.29-1.el7.terabithia

We get the xymond status:
    xymond_rrd program crashed
    Fatal signal caught!

In the /var/log/messages we have:
Aug  4 06:59:33 suse abrt-hook-ccpp[9966]: Process 8736 (xymond_rrd) of user 1000 killed by SIGABRT - dumping core
Aug  4 06:59:34 suse abrt-server[9968]: Package 'xymon' isn't signed with proper key
Aug  4 06:59:34 suse abrt-server[9968]: 'post-create' on '/var/spool/abrt/ccpp-2019-08-04-06:59:33-8736' exited with 1
Aug  4 06:59:34 suse abrt-server[9968]: Deleting problem directory '/var/spool/abrt/ccpp-2019-08-04-06:59:33-8736'

and in /var/log/xymon/rrd-status
2019-08-04 06:58:43.164318 net-janus/ntpd.rrd: Bug - duplicate RRD data with same timestamp 1564894723, different data
2019-08-04 06:59:33.750946 Host 'rackstation' reports netstat for an unknown OS
2019-08-04 06:59:34.621163 xymond_channel: Child process 8736 died: Signal 6
2019-08-04 06:59:34.739280 xymond_channel: Peer at 0.0.0.0:0 failed: Broken pipe
2019-08-04 06:59:34.986178 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:56.727302 Host 'rackstation3' reports netstat for an unknown OS
2019-08-04 06:59:57.603455 xymond_channel: Child process 9971 died: Signal 6
2019-08-04 06:59:58.136009 xymond_channel: Peer at 0.0.0.0:0 failed: Broken pipe
2019-08-04 06:59:58.136109 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:58.136150 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:58.136192 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:58.136316 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:58.136374 xymond_channel: Peer not up, flushing message queue
....

The Server runs on CentOS Linux release 7.6.1810 (Core)

Thanks in advance!

i.A. Andreas Kunberger

--

Andreas Kunberger
ZD/IT

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20190806/471efbac/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 5852 bytes
Desc: image001.png
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20190806/471efbac/attachment.png>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: Patch-4.3.29_temperature.txt
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20190806/471efbac/attachment.txt>


More information about the Xymon mailing list