[Xymon] xymond_rrd program crashed
Tom Schmidt (tschmidt)
tschmidt at micron.com
Wed Aug 7 01:04:52 CEST 2019
Andreas,
I am also seeing xymond_rrd crash with 4.3.29; it did not crash with 4.3.28. If you compare xymond/rrd/do_netstat.c between 4.3.28 and 4.3.29, you will see that the code was updated to handle the net-tools output format of newer Linux systems. Your logs also show that it does not recognize the OS of your "rackstation*" hosts. I do not know what OS these systems run, but you may have to update either your netstat monitor or the xymond/rrd/do_netstat.c code so that their netstat output format is recognized.
For me, xymond_rrd is crashing when trying to extract RRD data from some of our temperature reports. Some of our older temperature monitors returned data in this format:
Device                          Temp(C)  Temp(F)  Threshold(C)
------------------------------------------------------
&green System Board Inlet Temp       21       69         ( 42)
&green CPU1 Temp                     32       89         ( 93)
&green CPU2 Temp                     31       87         ( 93)
------------------------------------------------------
Xymon 4.3.28 handled this properly by treating the data in parentheses as comments. Attached is my patch for xymond/rrd/do_temperature.c so that it handles temperature reports like the one above. Most of this code was already in 4.3.28; I do not know why it was removed.
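For readers without the attachment, here is a minimal sketch of the idea in Python. It is not the code from xymond/rrd/do_temperature.c or from the patch (both are C); the function name `parse_temperature_line` and the exact parsing rules are assumptions for illustration. It drops the leading color marker, treats anything in parentheses as a comment, and takes the last two numeric columns as Celsius and Fahrenheit.

```python
import re

# Hypothetical helper for illustration only; not taken from Xymon's sources.
_COLOR_MARKER = re.compile(r"^&(?:green|yellow|red|clear|purple|blue)\s+")
_PARENTHESIZED = re.compile(r"\([^)]*\)")
_NUMBER = re.compile(r"^-?\d+(?:\.\d+)?$")


def parse_temperature_line(line):
    """Return (sensor_name, temp_c, temp_f), or None if the line is not a reading."""
    text = _COLOR_MARKER.sub("", line.strip())
    # Treat anything in parentheses, e.g. "( 42)", as a comment and drop it.
    text = _PARENTHESIZED.sub("", text)
    tokens = text.split()
    # A reading needs at least a one-word name followed by two numeric columns.
    if len(tokens) < 3:
        return None
    if not (_NUMBER.match(tokens[-2]) and _NUMBER.match(tokens[-1])):
        return None
    name = " ".join(tokens[:-2])
    return name, float(tokens[-2]), float(tokens[-1])
```

With this sketch, the header and separator lines of the report return None, and only the three sensor lines produce readings.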
Japheth,
Please consider adding this patch for xymond/rrd/do_temperature.c to the next release. It also includes the stripping off of any leading bold and italic HTML tags from sensor names that I submitted earlier.
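As a rough illustration of the tag stripping mentioned above, the following Python sketch removes leading bold and italic tags from a sensor name. The name `strip_leading_markup` is hypothetical and the regular expression is an assumption about what "leading bold and italic HTML tags" covers; the actual patch is C and may behave differently.

```python
import re

# Hypothetical helper for illustration only; the real patch is written in C.
# Matches one or more leading <b> or <i> tags (any case), with optional spaces.
_LEADING_TAGS = re.compile(r"^(?:\s*<(?:b|i)>)+\s*", re.IGNORECASE)


def strip_leading_markup(name):
    """Remove leading <b>/<i> tags from a sensor name; trailing tags are left alone."""
    return _LEADING_TAGS.sub("", name)
```

Names without leading tags pass through unchanged, so the function is safe to apply to every sensor name.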
Thanks...Tom
Tom Schmidt
Sr Manager, IT, Product Engineering
IT ETD Eng Sites US
Micron Technology, Inc.
Office: +1 (208) 368-4058 Fax: (208)368-2807
Email: tschmidt at micron.com Website: micron.com
Micron Technology, Inc., Confidential and Proprietary.
From: Xymon <xymon-bounces at xymon.com> On Behalf Of Andreas Kunberger
Sent: Monday, August 5, 2019 4:13 AM
To: 'Xymon at xymon.com' <Xymon at xymon.com>
Subject: [EXT] [Xymon] xymond_rrd program crashed
Since we updated to Xymon 4.3.29-1.el7.terabithia, we get the following xymond status:
xymond_rrd program crashed
Fatal signal caught!
In /var/log/messages we have:
Aug 4 06:59:33 suse abrt-hook-ccpp[9966]: Process 8736 (xymond_rrd) of user 1000 killed by SIGABRT - dumping core
Aug 4 06:59:34 suse abrt-server[9968]: Package 'xymon' isn't signed with proper key
Aug 4 06:59:34 suse abrt-server[9968]: 'post-create' on '/var/spool/abrt/ccpp-2019-08-04-06:59:33-8736' exited with 1
Aug 4 06:59:34 suse abrt-server[9968]: Deleting problem directory '/var/spool/abrt/ccpp-2019-08-04-06:59:33-8736'
and in /var/log/xymon/rrd-status
2019-08-04 06:58:43.164318 net-janus/ntpd.rrd: Bug - duplicate RRD data with same timestamp 1564894723, different data
2019-08-04 06:59:33.750946 Host 'rackstation' reports netstat for an unknown OS
2019-08-04 06:59:34.621163 xymond_channel: Child process 8736 died: Signal 6
2019-08-04 06:59:34.739280 xymond_channel: Peer at 0.0.0.0:0 failed: Broken pipe
2019-08-04 06:59:34.986178 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:56.727302 Host 'rackstation3' reports netstat for an unknown OS
2019-08-04 06:59:57.603455 xymond_channel: Child process 9971 died: Signal 6
2019-08-04 06:59:58.136009 xymond_channel: Peer at 0.0.0.0:0 failed: Broken pipe
2019-08-04 06:59:58.136109 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:58.136150 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:58.136192 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:58.136316 xymond_channel: Peer not up, flushing message queue
2019-08-04 06:59:58.136374 xymond_channel: Peer not up, flushing message queue
....
The Server runs on CentOS Linux release 7.6.1810 (Core)
Thanks in advance!
i.A. Andreas Kunberger
--
Andreas Kunberger
ZD/IT
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: Patch-4.3.29_temperature.txt
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20190806/471efbac/attachment.txt>