[Xymon] xymond_rrd program crashed

Japheth Cleaver cleaver at terabithia.org
Wed Aug 7 23:17:27 CEST 2019


Thanks,

I thought I had actually added this one in, but it appears to have 
gotten lost in an edit.

Added in https://sourceforge.net/p/xymon/code/8073

Regards,
-jc

On 8/6/2019 4:04 PM, Tom Schmidt (tschmidt) wrote:
>
> Andreas,
>
>     I likewise am seeing xymond_rrd program crashing with 4.3.29 but 
> it did not with 4.3.28.  If you look at the differences in the 
> xymond/rrd/do_netstat.c between 4.3.28 and 4.3.29 you will see that 
> the code was updated to handle newer Linux systems net-tools format.  
> Your logs also show that it is not recognizing the OS for your 
> “rackstation*” hosts.  I do not know what type of OS these systems 
> are, but you may have to update your netstat monitor or the 
> xymond/rrd/do_netstat.c code to recognize their netstat output format.
>
>     For me, xymond_rrd is crashing when trying to extract RRD data 
> from some of our temperature reports.  Some of our older temperature 
> monitors returned data in this format:
>
> Device                  Temp(C)  Temp(F) Threshold(C)
>
> ------------------------------------------------------
>
> &green System Board Inlet Temp 21       69     ( 42)
>
> &green CPU1 Temp               32 89     ( 93)
>
> &green CPU2 Temp               31 87     ( 93)
>
> ------------------------------------------------------
>
> Xymon 4.3.28 handled this properly by treating the data in the 
> parenthesis as comments.  Attached is my patch to fix 
> xymond/rrd/do_temperature.c to handle temperature reports like the 
> above.  Most of this code was already in 4.3.28 but I do not know why 
> it was removed.
>
> *Japheth*,
>
>    Please consider adding this patch for xymond/rrd/do_temperature.c 
>  to the next release.  It also includes the stripping off of any 
> leading bold and italic HTML tags from sensor names that I submitted 
> earlier.
>
> Thanks…Tom
>
>
> 	
>
> http://collab.micron.com/corp/brand/SiteAssets/Micron.png 
> <http://www.micron.com/>
>
> 	
>
> Tom Schmidt
> Sr Manager, IT, Product Engineering
> IT ETD Eng Sites US
> Micron Technology, Inc.
>
> Office: +1 (208) 368-4058  Fax: (208)368-2807
>
> Email: tschmidt at micron.com <mailto:tschmidt at micron.com> Website: 
> micron.com <http://www.micron.com/>
> Micron Technology, Inc., Confidential and Proprietary.
>
> *From:* Xymon <xymon-bounces at xymon.com> *On Behalf Of *Andreas Kunberger
> *Sent:* Monday, August 5, 2019 4:13 AM
> *To:* 'Xymon at xymon.com' <Xymon at xymon.com>
> *Subject:* [EXT] [Xymon] xymond_rrd program crashed
>
> Since we  have updated to Xymon 4.3.29-1.el7.terabithia
>
> We get the xymond status:
>
>     xymond_rrd program crashed
>
>     Fatal signal caught!
>
> In the /var/log/messages we have:
>
> Aug  4 06:59:33 suse abrt-hook-ccpp[9966]: Process 8736 (xymond_rrd) 
> of user 1000 killed by SIGABRT - dumping core
>
> Aug  4 06:59:34 suse abrt-server[9968]: Package 'xymon' isn't signed 
> with proper key
>
> Aug  4 06:59:34 suse abrt-server[9968]: 'post-create' on 
> '/var/spool/abrt/ccpp-2019-08-04-06:59:33-8736' exited with 1
>
> Aug  4 06:59:34 suse abrt-server[9968]: Deleting problem directory 
> '/var/spool/abrt/ccpp-2019-08-04-06:59:33-8736'
>
> and in /var/log/xymon/rrd-status
>
> 2019-08-04 06:58:43.164318 net-janus/ntpd.rrd: Bug - duplicate RRD 
> data with same timestamp 1564894723, different data
>
> 2019-08-04 06:59:33.750946 Host 'rackstation' reports netstat for an 
> unknown OS
>
> 2019-08-04 06:59:34.621163 xymond_channel: Child process 8736 died: 
> Signal 6
>
> 2019-08-04 06:59:34.739280 xymond_channel: Peer at 0.0.0.0:0 failed: 
> Broken pipe
>
> 2019-08-04 06:59:34.986178 xymond_channel: Peer not up, flushing 
> message queue
>
> 2019-08-04 06:59:56.727302 Host 'rackstation3' reports netstat for an 
> unknown OS
>
> 2019-08-04 06:59:57.603455 xymond_channel: Child process 9971 died: 
> Signal 6
>
> 2019-08-04 06:59:58.136009 xymond_channel: Peer at 0.0.0.0:0 failed: 
> Broken pipe
>
> 2019-08-04 06:59:58.136109 xymond_channel: Peer not up, flushing 
> message queue
>
> 2019-08-04 06:59:58.136150 xymond_channel: Peer not up, flushing 
> message queue
>
> 2019-08-04 06:59:58.136192 xymond_channel: Peer not up, flushing 
> message queue
>
> 2019-08-04 06:59:58.136316 xymond_channel: Peer not up, flushing 
> message queue
>
> 2019-08-04 06:59:58.136374 xymond_channel: Peer not up, flushing 
> message queue
>
> ….
>
> The Server runs on CentOS Linux release 7.6.1810 (Core)
>
> Thanks in advance!
>
> i.A. Andreas Kunberger
>
> -- 
>
> Andreas Kunberger
>
> ZD/IT
>
>
> _______________________________________________
> Xymon mailing list
> Xymon at xymon.com
> http://lists.xymon.com/mailman/listinfo/xymon


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20190807/d3fe946d/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 5852 bytes
Desc: not available
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20190807/d3fe946d/attachment.png>


More information about the Xymon mailing list