[Xymon] Multiple Issues with 4.3.17 install

Neal, Jonathan W wes.neal at verizon.com
Mon Oct 20 18:51:33 CEST 2014


No core file on the system.  I think there is something else odd going on.  I removed all the data that belongs to the xymonems host from /data/* .  I restarted the system and xymond_rrd is still blue, even though it isn’t even disabled any longer.  It’s like it can’t or doesn’t know how to update the status for it.  I watched the xymond status for from yellow to green after the restarted, but xymond_rrd never changed.

Wes Neal
Group e-mail: SES-HSI.Team at one.verizon.com<mailto:SES-HSI.Team at one.verizon.com>
24x7 Helpdesk for EMS:  VZOES/OSC 1-800-634-5887 Opt3

From: Jeremy Laidman [mailto:jlaidman at rebel-it.com.au]
Sent: Sunday, October 19, 2014 4:08 PM
To: Neal, Jonathan W
Subject: Re: [Xymon] Multiple Issues with 4.3.17 install


Look for a core file, then use gdb to get a backtrace. This will tell us what it is doing when it crashes.

J
On 18/10/2014 7:31 AM, "Neal, Jonathan W via Xymon" <xymon at xymon.com<mailto:xymon at xymon.com>> wrote:
_______________________________________________
Xymon mailing list
Xymon at xymon.com<mailto:Xymon at xymon.com>
http://lists.xymon.com/mailman/listinfo/xymon


---------- Forwarded message ----------
From: "Neal, Jonathan W" <wes.neal at verizon.com<mailto:wes.neal at verizon.com>>
To: "xymon at xymon.com<mailto:xymon at xymon.com>" <xymon at xymon.com<mailto:xymon at xymon.com>>
Cc:
Date: Fri, 17 Oct 2014 16:31:06 -0400
Subject: RE: [Xymon] Multiple Issues with 4.3.17 install
I am unsure why it is even showing purple, but it definitely is and it keeps alerting on it.  If I drill down into a system I see data being graphed that is valid.  If I look at the processes on the system I see:

xymonems:xymon > ps -ef |grep xymond_rrd
   xymon 18720 18714   0 02:45:03 ?           0:00 xymond_channel --channel=data --log=/var/log/xymon/rrd-data.log xymond_rrd --rr
   xymon 18806 18720   0 02:45:12 ?           0:02 xymond_rrd --rrddir=/export/home/xymon/data/rrd
   xymon 18761 18719   0 02:45:04 ?           0:51 xymond_rrd --rrddir=/export/home/xymon/data/rrd
   xymon 18719 18714   0 02:45:03 ?           0:14 xymond_channel --channel=status --log=/var/log/xymon/rrd-status.log xymond_rrd

So to me it seems as if it is running.  What am I missing here?

Wes


---------- Forwarded message ----------
From: "Neal, Jonathan W" <wes.neal at verizon.com<mailto:wes.neal at verizon.com>>
To: "xymon at xymon.com<mailto:xymon at xymon.com>" <xymon at xymon.com<mailto:xymon at xymon.com>>
Cc:
Date: Thu, 16 Oct 2014 18:40:46 -0400
Subject: Multiple Issues with 4.3.17 install
I am coming from an early 4.2 install.  I merged my bb-hosts, hobbit-alerts.cfg and hobbit-clients.cfg files into the proper files in the new 4.3.17 install.  I also copied over the entire histlogs directory from data.   Currently xymon_rrd keeps dying and going purple with a Fatal signal error.

rrd-status log has this in it going back most of the day:

2014-10-16 19:23:00 Peer at 0.0.0.0:0<http://0.0.0.0:0> failed: Broken pipe
2014-10-16 19:23:00 Peer not up, flushing message queue
2014-10-16 19:24:43 Shutting down, flushing cached updates to disk
2014-10-16 19:28:39 Peer not up, flushing message queue
2014-10-16 20:00:35 Shutting down, flushing cached updates to disk
2014-10-16 20:00:36 Cache flush completed
2014-10-16 21:58:19 Peer not up, flushing message queue
2014-10-16 22:30:58 Shutting down, flushing cached updates to disk
2014-10-16 22:30:59 Cache flush completed
2014-10-16 22:31:14 Peer not up, flushing message queue

Xymond is also constantly going yellow and I again see that 0.0.0.0:1984<http://0.0.0.0:1984> that is mentioned above:

Statistics for Xymon daemon
Version: 4.3.17
Up since 16-Oct-2014 22:31:09 (0 days, 00:04:59)

Incoming messages      :        937
- status               :        885
- combo                :          1
- extcombo             :         22
- page                 :          0
- summary              :          0
- data                 :          6
- client               :          2
- notes                :          0
- enable               :          0
- disable              :          0
- ack                  :          0
- config               :          4
- query                :          0
- xymondboard          :          6
- xymondlog            :          5
- drop                 :          0
- rename               :          0
- dummy                :          1
- ping                 :          0
- notify               :          0
- schedule             :          0
- download             :          0
- Bogus/Timeouts       :          5
Incoming messages/sec  :          3 (average last 300 seconds)

status channel messages:        885 (1 readers)
stachg channel messages:        877 (1 readers)
page   channel messages:         37 (1 readers)
data   channel messages:          6 (1 readers)
notes  channel messages:          0 (0 readers)
enadis channel messages:          0 (0 readers)
client channel messages:          2 (1 readers)
clichg channel messages:          0 (1 readers)
user   channel messages:          0 (0 readers)
backfeed messages      :          0


Latest error messages:
Loading hostnames
Loading saved state
Setting up network listener on 0.0.0.0:1984<http://0.0.0.0:1984>
Setting up signal handlers
Setting up xymond channels
Setting up logfiles
Setup complete

Can anyone tell me what might be going on?
Thanks in advance!



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20141020/69b77977/attachment.html>


More information about the Xymon mailing list