Re: [hobbit] Loss of Apache graphs
Thomas,
Anyway, as long as the shared memory is not working, you will probably
have trouble.
Did you already go through the IPC/shared memory verification?
If not, take a look at the one described by Henrik at
http://www.hswn.dk/hobbiton/2005/10/msg00200.html
IPC also has a dedicated section in the install manual
(http://www.hswn.dk/hobbit/help/install.html).
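
As a quick first check, something like the sketch below could confirm that the kernel limits are not what is refusing the 262144-byte segment from your log. It assumes a Linux box that exposes the SysV limits under /proc/sys/kernel; the function names are made up for illustration and are not part of Hobbit. It only rules out a too-small limit. It does not tell you whether the Hobbit channels actually exist, so still check `ipcs -m` and the procedure in the links above.

```python
from pathlib import Path

# Segment size taken from the "Could not get shm of size 262144" log line.
REQUESTED_BYTES = 262144


def read_shm_limits(proc_dir="/proc/sys/kernel"):
    """Return the kernel's SysV shared memory limits as a dict of ints.

    Assumes a Linux-style /proc layout; files that are missing are skipped.
    """
    limits = {}
    for name in ("shmmax", "shmall", "shmmni"):
        path = Path(proc_dir) / name
        if path.is_file():
            limits[name] = int(path.read_text().split()[0])
    return limits


def segment_fits(limits, size=REQUESTED_BYTES):
    """True if one segment of `size` bytes is within the shmmax limit."""
    return "shmmax" in limits and size <= limits["shmmax"]


if __name__ == "__main__":
    limits = read_shm_limits()
    print(limits)
    print("262144-byte segment fits:", segment_fits(limits))
```

If the segment fits and the error persists, the limits are probably fine and the problem is more likely that the channel was never created or has been removed.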
Hope this helps.
-wm
On Thu, 23 Feb 2006 16:45:37 +0100
Thomas <tlp-hobbit (at) holme-pedersen.dk> wrote:
> Yes, that's right, mine are usually around 2 GB when I get into problems,
> so this is not it. Also, I don't have this in my logs
>
> 2006-02-09 15:24:36 Tried to down BOARDBUSY: Invalid argument
> 2006-02-09 15:31:54 Could not get shm of size 262144: No such file or
> directory
> 2006-02-09 15:31:54 Channel not available
>
> so something else is wrong.
>
> Rob Munsch wrote:
> > ... are these not the rrd logs you meant..? Can't really find any
> > others... also, if it's that, why is only one host affected?
> >
> > Rob Munsch wrote:
> >
> >> Hmm.
> >>
> >> Well, there are two webservers; one is showing apache graphs, the
> >> other isn't.
> >> Went ahead and added 'em to a (pretty aggressive) rotation schedule;
> >> rrd-data.log is 600k, rrd-status was about 3.5M. Just in case,
> >> rrd-status is now limited to 1M.
> >>
> >> Stopped and restarted the server, but no apparent effect. The server
> >> that had its Apache graphs still does, and that one that doesn't,
> >> doesn't.
> >>
> >> Here are some recent rrd log entries, if that sheds any light. "Mo"
> >> is the server with the graphs, "ws-1" is the one without:
> >>
> >> rrd-data.log
> >>
> >> 2006-02-03 02:55:17 RRD error updating
> >> /home/hobbit/data/rrd/ws-1/apache.rrd from 10.10.10.47: illegal
> >> attempt to update using time 1138953317 when last update time is
> >> 1138953317 (minimum one second step)
> >> 2006-02-03 02:55:17 RRD error updating
> >> /home/hobbit/data/rrd/mo/apache.rrd from 10.10.10.47: illegal attempt
> >> to update using time 1138953317 when last update time is 1138953317
> >> (minimum one second step)
> >> 2006-02-03 02:58:25 RRD error updating
> >> /home/hobbit/data/rrd/ws-1/apache.rrd from 10.10.10.47: illegal
> >> attempt to update using time 1138953505 when last update time is
> >> 1138953505 (minimum one second step)
> >> 2006-02-03 02:58:25 RRD error updating
> >> /home/hobbit/data/rrd/mo/apache.rrd from 10.10.10.47: illegal attempt
> >> to update using time 1138953505 when last update time is 1138953505
> >> (minimum one second step)
> >> 2006-02-09 04:04:12 Could not get shm of size 262144: No such file or
> >> directory
> >> 2006-02-09 04:04:12 Channel not available
> >> 2006-02-09 15:24:36 Tried to down BOARDBUSY: Invalid argument
> >> 2006-02-09 15:31:54 Could not get shm of size 262144: No such file or
> >> directory
> >> 2006-02-09 15:31:54 Channel not available
> >> 2006-02-16 11:18:57 Tried to down BOARDBUSY: Invalid argument
> >> 2006-02-16 11:18:57 Worker process died with exit code 0, terminating
> >> 2006-02-22 11:41:59 Tried to down BOARDBUSY: Invalid argument
> >> root (at) randomaccess /var/log/hobbit #
> >>
> >> rrd-status.log (the former 3.5M log - current is empty file with no
> >> entries post-rotate)
> >>
> >> 2006-02-09 15:24:36 Tried to down BOARDBUSY: Invalid argument
> >> 2006-02-09 15:31:54 Could not get shm of size 262144: No such file or
> >> directory
> >> 2006-02-09 15:31:54 Channel not available
> >> 2006-02-09 22:25:38 RRD error updating
> >> /home/hobbit/data/rrd/randomaccess/bbgen.rrd from 10.10.10.47:
> >> illegal attempt to update using time 1139541938 when last update time
> >> is 1139545383 (minimum one second step)
> >> 2006-02-16 11:18:57 Tried to down BOARDBUSY: Invalid argument
> >> root (at) randomaccess /var/log/hobbit #
> >>
> >> Not sure what's going on here.
> >>
> >> Thomas wrote:
> >>
> >>> Hi Rob,
> >>>
> >>> I don't know if this can help you, but every time I have had problems
> >>> with missing graphs it's been because the rrd logfiles were too big.
> >>>
> >>> Just an FYI.
> >>>
> >>> /Thomas
> >>>
> >>> Rob Munsch wrote:
> >>>
> >>>> Hello,
> >>>>
> >>>> There are two webservers being monitored by hobbit (among many
> >>>> other different servers).
> >>>> Both have bb-hosts entries that are nearly identical. Both have
> >>>> the same version of the client on them (4.1.2p1). Both seem to be
> >>>> working perfectly well in all other respects - both internal (CPU,
> >>>> disk etc) and external (conn, http) tests seem to be working, and
> >>>> have graphs.
> >>>>
> >>>> However on one, the apache trends show up as expected, and on the
> >>>> other, they have stopped graphing. Current values for the
> >>>> graphless one are good ol' "nan," but *just* for the 4 apache trend
> >>>> graphs - Utilization, Workers, CPU Ut and RPS.
> >>>>
> >>>> All other trend graphs are there.
> >>>>
> >>>> Historical data for before the sudden loss of graphing is there
> >>>> (i.e., about a week ago the graphing stopped - 12 day graph shows
> >>>> data before this cutoff).
> >>>>
> >>>> Nothing has changed, been added, or modified as far as i can tell.
> >>>>
> >>>> What am i missing..?
> >>>>
> >>>> Thanks!
> >>>>
> >>>
> >>>
> >>> To unsubscribe from the hobbit list, send an e-mail to
> >>> hobbit-unsubscribe (at) hswn.dk
> >>>
> >>>
> >>
> >>
> >
> >
>