[Xymon] Multiple Issues with 4.3.17 install
J.C. Cleaver
cleaver at terabithia.org
Tue Oct 21 23:08:46 CEST 2014
If you're seeing a crash alert ("Signal received", etc) and it's purple,
it was just the one-time note that something internal to xymon crashed.
(That, of course, isn't supposed to happen, but... :/ )
xymond_rrd normally doesn't send in a test about itself (none of the
processors launched via xymond_channel do by default, only the
xymonlaunch-ed daemons and run-once commands), so it won't clear even if
the system is running fine now.
Also, I'd have to check, but I believe re-enabling of disables doesn't
take effect right away -- there's a part that may not update until the
next status message is received for it.
In either case, just drop the now-spurious "xymond_rrd" test using
something like:
./xymon 0.0.0.0 "drop xymonems xymond_rrd"
HTH,
-jc
On Tue, October 21, 2014 6:57 am, Neal, Jonathan W wrote:
> Anyone got any ideas about this? The test does not show as disabled on
> the enable/disable page, but is still blue and doesnât seem to update at
> all. No xymond_rrd files are being created in /export/home/xymon/data/*
> anywhere. If I do a ./xymon 0.0.0.0 "enable xymonems.xymond_rrd" it
> doesnât change it at all. It is like the status is stuck somewhere, but
> I am not sure how or where.
>
>
> From: Neal, Jonathan W [mailto:wes.neal at verizon.com]
> Sent: Monday, October 20, 2014 12:52 PM
> To: Jeremy Laidman
> Cc: xymon at xymon.com
> Subject: RE: [Xymon] Multiple Issues with 4.3.17 install
>
> No core file on the system. I think there is something else odd going
> on. I removed all the data that belongs to the xymonems host from
> /data/* . I restarted the system and xymond_rrd is still blue, even
> though it isnât even disabled any longer. Itâs like it canât or
> doesnât know how to update the status for it. I watched the xymond
> status for from yellow to green after the restarted, but xymond_rrd never
> changed.
>
>
> From: Jeremy Laidman [mailto:jlaidman at rebel-it.com.au]
> Sent: Sunday, October 19, 2014 4:08 PM
> To: Neal, Jonathan W
> Subject: Re: [Xymon] Multiple Issues with 4.3.17 install
>
> Look for a core file, then use gdb to get a backtrace. This will tell us
> what it is doing when it crashes.
> J
> On 18/10/2014 7:31 AM, "Neal, Jonathan W via Xymon" <xymon at xymon.com>
> wrote:
> _______________________________________________
> Xymon mailing list
> Xymon at xymon.com
> http://lists.xymon.com/mailman/listinfo/xymon
>
>
> ---------- Forwarded message ----------
> From:Â "Neal, Jonathan W" <wes.neal at verizon.com>
> To:Â "xymon at xymon.com" <xymon at xymon.com>
> Cc:Â
> Date:Â Fri, 17 Oct 2014 16:31:06 -0400
> Subject:Â RE: [Xymon] Multiple Issues with 4.3.17 install
> I am unsure why it is even showing purple, but it definitely is and it
> keeps alerting on it. If I drill down into a system I see data being
> graphed that is valid. If I look at the processes on the system I see:
>
> xymonems:xymon > ps -ef |grep xymond_rrd
>   xymon 18720 18714  0 02:45:03 ?      0:00
> xymond_channel --channel=data --log=/var/log/xymon/rrd-data.log xymond_rrd
> --rr
>   xymon 18806 18720  0 02:45:12 ?      0:02 xymond_rrd
> --rrddir=/export/home/xymon/data/rrd
>   xymon 18761 18719  0 02:45:04 ?      0:51 xymond_rrd
> --rrddir=/export/home/xymon/data/rrd
>   xymon 18719 18714  0 02:45:03 ?      0:14
> xymond_channel --channel=status --log=/var/log/xymon/rrd-status.log
> xymond_rrd
>
> So to me it seems as if it is running. What am I missing here?
>
> Wes
>
>
> ---------- Forwarded message ----------
> From:Â "Neal, Jonathan W" <wes.neal at verizon.com>
> To:Â "xymon at xymon.com" <xymon at xymon.com>
> Cc:Â
> Date:Â Thu, 16 Oct 2014 18:40:46 -0400
> Subject:Â Multiple Issues with 4.3.17 install
> I am coming from an early 4.2 install. I merged my bb-hosts,
> hobbit-alerts.cfg and hobbit-clients.cfg files into the proper files in
> the new 4.3.17 install. I also copied over the entire histlogs directory
> from data.  Currently xymon_rrd keeps dying and going purple with a
> Fatal signal error.
> Â
> rrd-status log has this in it going back most of the day:
> Â
> 2014-10-16 19:23:00 Peer at 0.0.0.0:0 failed: Broken pipe
> 2014-10-16 19:23:00 Peer not up, flushing message queue
> 2014-10-16 19:24:43 Shutting down, flushing cached updates to disk
> 2014-10-16 19:28:39 Peer not up, flushing message queue
> 2014-10-16 20:00:35 Shutting down, flushing cached updates to disk
> 2014-10-16 20:00:36 Cache flush completed
> 2014-10-16 21:58:19 Peer not up, flushing message queue
> 2014-10-16 22:30:58 Shutting down, flushing cached updates to disk
> 2014-10-16 22:30:59 Cache flush completed
> 2014-10-16 22:31:14 Peer not up, flushing message queue
> Â
> Xymond is also constantly going yellow and I again see that 0.0.0.0:1984
> that is mentioned above:
> Â
> Statistics for Xymon daemon
> Version: 4.3.17
> Up since 16-Oct-2014 22:31:09 (0 days, 00:04:59)
> Â
> Incoming messages     :       937
> - status              :       885
> - combo               :         1
> - extcombo            :        22
> - page                :         0
> - summary             :         0
> - data                :         6
> - client              :         2
> - notes               :         0
> - enable              :         0
> - disable             :         0
> - ack                 :         0
> - config              :         4
> - query               :         0
> - xymondboard         :         6
> - xymondlog           :         5
> - drop                :         0
> - rename              :         0
> - dummy               :         1
> - ping                :         0
> - notify              :         0
> - schedule            :         0
> - download            :         0
> - Bogus/Timeouts      :         5
> Incoming messages/sec :         3 (average last 300 seconds)
> Â
> status channel messages:Â Â Â Â Â Â Â 885 (1 readers)
> stachg channel messages:Â Â Â Â Â Â Â 877 (1 readers)
> page  channel messages:        37 (1 readers)
> data  channel messages:         6 (1 readers)
> notes channel messages:         0 (0 readers)
> enadis channel messages:Â Â Â Â Â Â Â Â Â 0 (0 readers)
> client channel messages:Â Â Â Â Â Â Â Â Â 2 (1 readers)
> clichg channel messages:Â Â Â Â Â Â Â Â Â 0 (1 readers)
> user  channel messages:         0 (0 readers)
> backfeed messages     :         0
> Â
> Â
> Latest error messages:
> Loading hostnames
> Loading saved state
> Setting up network listener on 0.0.0.0:1984
> Setting up signal handlers
> Setting up xymond channels
> Setting up logfiles
> Setup complete
> Â
> Can anyone tell me what might be going on?
> Thanks in advance!
> Â
>
>
More information about the Xymon
mailing list