
RE: [hobbit] Graphs stop update 24 hours after client reboot; start again 24 hours later.



> -----Original Message-----
> From: Henrik Størner [mailto:henrik (at) hswn.dk]
> Sent: Wednesday, January 28, 2009 7:23 AM
> To: hobbit (at) hswn.dk
> Subject: Re: [hobbit] Graphs stop update 24 hours after client reboot;
> start again 24 hours later.
> 
> In <E38DCD6606C55F499A4125611AB8D99605C8F6B1 (at) cvsexbpd2.Corp.CVS.com>
> "Brand, Thomas R." <TRBrand (at) cvs.com> writes:
> 
> >I need some help/suggestions to figure out why my "cpu load" and
> >"users & processes" graphs stop updating about 24 hours after the
> >systems reboot. The updates stop for anywhere from 12 to 24 hours,
> >then simply start back up again.
> >Only the "CPU load" and the "Users and Processes" graphs are having
> >the problem; disk, memory, cpu utilization, network traffic don't
> >miss a beat.
> 
> The only explanation I can come up with is that the format of
> some of the "cpu" status message is different for the first 24 hours
> after a reboot.
> 
> Could you send me an example of the cpu status shortly after a reboot,
> and one when the graphs are working ?
> 
> What OS are these boxes ?
> 
> 
> Regards,
> Henrik

Hi Henrik,

  I'm still struggling to understand why the graphs stop updating and
appreciate your taking the time to respond.

I'm not sure what you mean by 'send me an example of the cpu status'...
are you looking for a data file? a log file? or the 'client data' as
reported by http://lxadmin02/hobbit-cgi/bb-hostsvc.sh?CLIENT=s00766rxs?

Both the Hobbit server & client are running on a SUSE Linux Enterprise
Server 10 SP1 (SLES 10.1) OS.
It's pretty much a standard out-of-the-box OS install, nothing very odd.

Hobbit is version 4.2.0 with all-in-one patch.

To clarify: after the server reboots, the graphs update normally for 24
hours. They then stop for about 24 hours and start again on their own:
    reboot: Wed 00:30
    graphs show data until Thu 00:30, then stop
    graph data starts again, usually exactly 24 hours after stopping
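
One way to look into the theory that the status format changes with
uptime is to see how the `uptime` line itself changes shape over the
first days. This is only a minimal sketch, assuming the cpu status is
built from the client's `uptime` output (an assumption, not confirmed
in this thread). The sample lines are illustrative of typical Linux
output, not captured from these hosts, and `parse_uptime` is a
hypothetical helper:

```python
import re

# Illustrative samples of typical Linux `uptime` output at three ages.
# These are NOT taken from the hosts discussed in this thread.
SAMPLES = {
    "under 24h": " 09:30:01 up 23:50,  2 users,  load average: 0.10, 0.05, 0.01",
    "day 1": " 09:50:01 up 1 day, 10 min,  2 users,  load average: 0.12, 0.06, 0.02",
    "day 2": " 09:50:01 up 2 days, 10 min,  2 users,  load average: 0.08, 0.04, 0.01",
}

# Tolerant pattern: accepts any uptime text, then users and 1-minute load.
UPTIME_RE = re.compile(
    r"up\s+(?P<up>.+?),\s+(?P<users>\d+)\s+users?,"
    r"\s+load average:\s+(?P<load1>[\d.]+)"
)


def parse_uptime(line):
    """Return the uptime text, user count and 1-minute load, or None."""
    match = UPTIME_RE.search(line)
    if match is None:
        return None
    return {
        "up": match.group("up"),
        "users": int(match.group("users")),
        "load1": float(match.group("load1")),
    }


if __name__ == "__main__":
    for label, line in SAMPLES.items():
        print(label, parse_uptime(line))
```

In these samples the uptime field has three shapes: "23:50" before the
first full day, the singular "1 day, ..." during the second day, and
the plural "2 days, ..." after that. A parser that expects only some of
these shapes would drop the values for the others, so comparing the
real status text at each age should show whether that is happening
here.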


Here is a listing of the rrd directory for a system that rebooted
yesterday (Feb 15) at 9:40 am. The users, procs and la files stopped
updating at 9:40 am today, but the other data files are still updating:
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 09:40 users.rrd
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 09:40 procs.rrd
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 09:40 la.rrd
-rw-r--r-- 1 hobbit hobbit  57520 Feb 16 17:18 mysql.rrd
-rw-r--r-- 1 hobbit hobbit 323296 Feb 16 17:18 vmstat.rrd
-rw-r--r-- 1 hobbit hobbit 304312 Feb 16 17:18 netstat.rrd
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 17:18 memory.swap.rrd
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 17:18 memory.real.rrd
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 17:18 memory.actual.rrd
-rw-r--r-- 1 hobbit hobbit  38536 Feb 16 17:18 ifstat.eth0.rrd
-rw-r--r-- 1 hobbit hobbit  38536 Feb 16 17:18 disk,root.rrd
-rw-r--r-- 1 hobbit hobbit  38536 Feb 16 17:18 disk,cvsrx.rrd
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 17:18 clock.rrd
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 17:20 tcp.conn.rrd
-rw-r--r-- 1 hobbit hobbit  19552 Feb 16 17:20 tcp.ssh.rrd
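
The same check can be scripted rather than read off a directory
listing. This is a minimal sketch, assuming the rrd files are normally
rewritten every few minutes, so that anything untouched for 15 minutes
counts as stale. The threshold, the `stale_rrds` helper and the example
path are all hypothetical and would need adjusting for a real install:

```python
import os
import time


def stale_rrds(rrd_dir, max_age_seconds=900, now=None):
    """Return (filename, age_in_seconds) for .rrd files not updated recently."""
    now = time.time() if now is None else now
    stale = []
    for name in sorted(os.listdir(rrd_dir)):
        if not name.endswith(".rrd"):
            continue  # ignore anything that is not an rrd data file
        age = now - os.path.getmtime(os.path.join(rrd_dir, name))
        if age > max_age_seconds:
            stale.append((name, int(age)))
    return stale


if __name__ == "__main__":
    # Hypothetical location; substitute the host's real rrd directory.
    rrd_dir = "/path/to/hobbit/data/rrd/s00766rxs"
    if os.path.isdir(rrd_dir):
        for name, age in stale_rrds(rrd_dir):
            print(f"{name}: last updated {age // 60} minutes ago")
```

Run from cron, this would record the time at which users.rrd, procs.rrd
and la.rrd fall behind, which can then be matched against the reboot
time.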




Thanks for taking time to respond,
Tom