[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [hobbit] Network tests stopped graphing



On Wed, 2005-09-28 at 07:28 +0200, Henrik Stoerner wrote: 
> On Wed, Sep 28, 2005 at 11:28:32AM +1000, Geoff Steer wrote:
> > 
> > For no reason that I can see, my network tests are no longer being
> > graphed. 
> > I run tests for ldap, smtp and ssh. All tests on all hosts are working
> > with no alerts being generated. Until about 2 days ago, there was a
> > single graph available for each host that showed the response times for
> > these three tests and the ping test. Now all the graphs show is the
> > value for ping.
> 
> Any messages in /var/log/hobbit/rrd-status.log ?
> 
> Could you show me the output from "ls -l ~hobbit/data/rrd/HOSTNAME" ?
> 
> Are the graphs missing from both the individual status view (e.g. the
> "smtp" detailed status should have a graph at the bottom), and from
> the combined view on the "trends" page ? Or just one of them ?

tail of  /var/log/hobbit/rrd-status.log:

2005-09-26 11:09:47 RRD error
updating /usr/local/hobbit/data/rrd/vwall.test.firstwave.com.au/tcp.ssh.rrd from 202.12.141.141: illegal attempt to update using time 1127696987 when last update time is 1127696987 (minimum one second step)
2005-09-26 11:09:47 RRD error
updating /usr/local/hobbit/data/rrd/admin5.firstwave.com.au/tcp.ssh.rrd
from 202.12.141.141: illegal attempt to update using time 1127696987
when last update time is 1127696987 (minimum one second step)
2005-09-26 11:09:47 RRD error
updating /usr/local/hobbit/data/rrd/admin3.firstwave.com.au/tcp.ssh.rrd
from 202.12.141.141: illegal attempt to update using time 1127696987
when last update time is 1127696987 (minimum one second step)
2005-09-26 11:09:47 RRD error
updating /usr/local/hobbit/data/rrd/vwall.test.firstwave.com.au/tcp.smtp.rrd from 202.12.141.141: illegal attempt to update using time 1127696987 when last update time is 1127696987 (minimum one second step)
2005-09-28 14:53:04 Tried to down BOARDBUSY: Invalid argument
2005-09-28 14:54:10 Tried to down BOARDBUSY: Invalid argument
2005-09-28 15:12:14 Tried to down BOARDBUSY: Invalid argument
2005-09-28 15:22:46 Tried to down BOARDBUSY: Invalid argument
2005-09-28 15:24:37 Tried to down BOARDBUSY: Invalid argument
2005-09-28 15:26:11 Tried to down BOARDBUSY: Invalid argument

NOTE:  vwall.test.firstwave.com.au is not one of the hosts showing this
problem. Clocks are synced to a ntp server running on the hobbit server.

An ls -l of one host (only tcp related rrd's shown.

-rw-r--r--  1 hobbit hobbit 19548 Sep 28 15:51 tcp.conn.rrd
-rw-r--r--  1 hobbit hobbit 19548 Sep 28 15:51 tcp.ldap.rrd
-rw-r--r--  1 hobbit hobbit 19548 Sep 28 15:51 tcp.smtp5000.rrd
-rw-r--r--  1 hobbit hobbit 19548 Sep 28 15:51 tcp.smtp.rrd
-rw-r--r--  1 hobbit hobbit 19548 Sep 28 15:51 tcp.ssh.rrd

The problem shows up in both the trends and the detailed graphs.



-- 
Geoff Steer <gsteer (at) firstwave.com.au>


-------------------------------Safe Stamp-----------------------------------
The sender's Anti-virus Service scanned this email. It is safe from known viruses.