[hobbit] Network tests stopped graphing
Geoff Steer
gsteer at firstwave.com.au
Wed Sep 28 07:57:59 CEST 2005
On Wed, 2005-09-28 at 07:28 +0200, Henrik Stoerner wrote:
> On Wed, Sep 28, 2005 at 11:28:32AM +1000, Geoff Steer wrote:
> >
> > For no reason that I can see, my network tests are no longer being
> > graphed.
> > I run tests for ldap, smtp and ssh. All tests on all hosts are working
> > with no alerts being generated. Until about 2 days ago, there was a
> > single graph available for each host that showed the response times for
> > these three tests and the ping test. Now all the graphs show is the
> > value for ping.
>
> Any messages in /var/log/hobbit/rrd-status.log ?
>
> Could you show me the output from "ls -l ~hobbit/data/rrd/HOSTNAME" ?
>
> Are the graphs missing from both the individual status view (e.g. the
> "smtp" detailed status should have a graph at the bottom), and from
> the combined view on the "trends" page ? Or just one of them ?
tail of /var/log/hobbit/rrd-status.log:
2005-09-26 11:09:47 RRD error
updating /usr/local/hobbit/data/rrd/vwall.test.firstwave.com.au/tcp.ssh.rrd from 202.12.141.141: illegal attempt to update using time 1127696987 when last update time is 1127696987 (minimum one second step)
2005-09-26 11:09:47 RRD error
updating /usr/local/hobbit/data/rrd/admin5.firstwave.com.au/tcp.ssh.rrd
from 202.12.141.141: illegal attempt to update using time 1127696987
when last update time is 1127696987 (minimum one second step)
2005-09-26 11:09:47 RRD error
updating /usr/local/hobbit/data/rrd/admin3.firstwave.com.au/tcp.ssh.rrd
from 202.12.141.141: illegal attempt to update using time 1127696987
when last update time is 1127696987 (minimum one second step)
2005-09-26 11:09:47 RRD error
updating /usr/local/hobbit/data/rrd/vwall.test.firstwave.com.au/tcp.smtp.rrd from 202.12.141.141: illegal attempt to update using time 1127696987 when last update time is 1127696987 (minimum one second step)
2005-09-28 14:53:04 Tried to down BOARDBUSY: Invalid argument
2005-09-28 14:54:10 Tried to down BOARDBUSY: Invalid argument
2005-09-28 15:12:14 Tried to down BOARDBUSY: Invalid argument
2005-09-28 15:22:46 Tried to down BOARDBUSY: Invalid argument
2005-09-28 15:24:37 Tried to down BOARDBUSY: Invalid argument
2005-09-28 15:26:11 Tried to down BOARDBUSY: Invalid argument
NOTE: vwall.test.firstwave.com.au is not one of the hosts showing this
problem. Clocks are synced to a ntp server running on the hobbit server.
An ls -l of one host (only tcp related rrd's shown.
-rw-r--r-- 1 hobbit hobbit 19548 Sep 28 15:51 tcp.conn.rrd
-rw-r--r-- 1 hobbit hobbit 19548 Sep 28 15:51 tcp.ldap.rrd
-rw-r--r-- 1 hobbit hobbit 19548 Sep 28 15:51 tcp.smtp5000.rrd
-rw-r--r-- 1 hobbit hobbit 19548 Sep 28 15:51 tcp.smtp.rrd
-rw-r--r-- 1 hobbit hobbit 19548 Sep 28 15:51 tcp.ssh.rrd
The problem shows up in both the trends and the detailed graphs.
--
Geoff Steer <gsteer at firstwave.com.au>
-------------------------------Safe Stamp-----------------------------------
The sender's Anti-virus Service scanned this email. It is safe from known viruses.
More information about the Xymon
mailing list