Re: [hobbit] strange graph behavior - random machines & graphs
- To: hobbit (at) hswn.dk
- Subject: Re: [hobbit] strange graph behavior - random machines & graphs
- From: "Gary Baluha" <gumby3203 (at) gmail.com>
- Date: Fri, 30 Nov 2007 13:51:01 -0500
On Nov 30, 2007 1:45 PM, Ralph Mitchell <ralphmitchell (at) gmail.com> wrote:
> On Nov 30, 2007 12:27 PM, Gary Baluha <gumby3203 (at) gmail.com> wrote:
>
> >
> > There wasn't anything useful in any of the logs, besides the usual
> > stuff. I turned on the --debug option, and here is a sample of the data for
> > one of the affected machines:
> >
> > 2007-11-30 13:14:07 hobbitd_rrd: Got message 562165
> > @@status#562165|1196446447.724393|192.168.232.110||danno|disk|1196448247|yellow||yellow|1196053505|0||0||1196446447
> > 2007-11-30 13:14:07 startpos 343968, fillpos 343968, endpos -1
> > 2007-11-30 13:14:07 RRD update param 00: 'rrdupdate'
> > 2007-11-30 13:14:07 RRD update param 01:
> > '/var/hobbit/data/rrd/danno/disk,dev,odm.rrd'
> > 2007-11-30 13:14:07 RRD update param 02: '-t'
> > 2007-11-30 13:14:07 RRD update param 03: 'pct:used'
> > 2007-11-30 13:14:07 RRD update param 04: '1196446447:0:0'
> >
> > I'm afraid I don't know how to interpret all of this, unfortunately. I
> > get that the "param 03" means the graph is showing "percentage [disk space]
> > used", and that "param 01" means it is updating that specific rrd file. And
> > I remember that "-t" in "param 02" is some rrdtool flag. But I don't know
> > what the numbers in "param 04" mean. I assume the first number is the number
> > of seconds since 1970, and the second number is the current value, but I don't
> > know what the last number means. Also, I'm not sure how to interpret all of
> > the data in the "@@status" line.
> >
> > By the way, this excerpt is from a machine that is having the graph
> > display problems. In this case, the data it is receiving is normal and
> > correct. I'm waiting for another update when the data is incorrect.
> >
>
> The "-t" option specifies the template to use, i.e. the list of data
> source names being updated, which is in param 03: "pct:used". So "pct"
> and "used" are two separate data sources in the rrd file. Param 04 is
> the actual data to insert, starting with the date/time in seconds
> (i.e. 1196446447), then zero for the "pct" value, then zero for the
> "used" value.
>
> This may not help much, but it might show where the data goes wrong:
> is hobbitd_rrd being handed bad data, or does the data get corrupted
> later on?
>
That's what I'm hoping. One other thing I noticed: on hosts where some
graphs are bad but others are still okay, the good graphs have a gap in
their data at precisely the times when the bad graphs show a data spike.
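
For reference, the "-t" template and the value string in the debug output quoted above pair up position by position. Here is a minimal Python sketch of that pairing. The helper name parse_rrd_update is hypothetical (it is not part of Hobbit or rrdtool), and it only handles the simple "timestamp:value:value" form seen in the log, plus "U" for an unknown value. It does not handle "N" as a timestamp.

```python
from datetime import datetime, timezone


def parse_rrd_update(template: str, values: str) -> dict:
    """Pair each data source name in the template with its value.

    Hypothetical helper for reading hobbitd_rrd --debug output;
    'template' is the argument after "-t" (param 03) and
    'values' is the update string (param 04).
    """
    names = template.split(":")
    timestamp, *fields = values.split(":")
    if len(fields) != len(names):
        raise ValueError(
            f"template has {len(names)} names but update has {len(fields)} values"
        )
    # The first field is the sample time in seconds since the Unix epoch.
    result = {"time": datetime.fromtimestamp(int(timestamp), tz=timezone.utc)}
    for name, raw in zip(names, fields):
        # rrdtool uses "U" to mark an unknown value.
        result[name] = None if raw == "U" else float(raw)
    return result


# Values taken from the debug lines quoted above.
parsed = parse_rrd_update("pct:used", "1196446447:0:0")
print(parsed["time"].isoformat(), parsed["pct"], parsed["used"])
```

Running the same check on a debug sample captured while a graph is spiking would show whether hobbitd_rrd was already handed the odd values, or whether they only appear later.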