[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [hobbit] hobbitd_larrd is crashing
No joy, it is still crashing, stack trace:
(gdb)
#0 0x0046260a in do_lookup_versioned () from /lib/ld-linux.so.2
#1 0x00461776 in _dl_lookup_versioned_symbol_internal () from /lib/ld-
linux.so.2
#2 0x00465473 in fixup () from /lib/ld-linux.so.2
#3 0x00465330 in _dl_runtime_resolve () from /lib/ld-linux.so.2
#4 0x08054c79 in sigsegv_handler (signum=11) at sig.c:51
#5 <signal handler called>
#6 0x004623da in do_lookup () from /lib/ld-linux.so.2
#7 0x00461103 in _dl_lookup_symbol_internal () from /lib/ld-linux.so.2
#8 0x0046540f in fixup () from /lib/ld-linux.so.2
#9 0x00465330 in _dl_runtime_resolve () from /lib/ld-linux.so.2
#10 0x0804a92b in create_and_update_rrd (hostname=0x7 <Address 0x7 out of
bounds>,
fn=0x805f6e0
"tcp.http.https:,,pws.tc.sc.egov.usda.gov,siteminderagent,dmsforms,login_banner.fcc?TYPE=33554433&REALMOID=06-d38f4375-a8bd-4190-b6f9-3c77f0901647&GUID=&SMAUTHREASON=0&METHOD=GET&SMAGENTNAME=$SM$hIspF3"...,
creparams=0x805e5c0, template=0x93f7b20 "sec") at do_rrd.c:145
#11 0x0804f2a0 in do_net_rrd (hostname=0xb755f036 "stellent_pre-prod_v-ip",
testname=0xb755f04d "http",
msg=0xb755f07b "status stellent_pre-prod_v-ip.http green Fri Jun 9
16:53:40 2006: OK ; OK\n\n&green
https://pws.tc.sc.egov.usda.gov/siteminderagent/dmsforms/login_banner.fcc?TYPE=33554433&REALMOID=06-d38f4375-a8bd-419"...,
tstamp=1149890052) at rrd/do_net.c:48
#12 0x08050256 in update_rrd (hostname=0xb755f036 "stellent_pre-prod_v-ip",
testname=0xb755f04d "http",
msg=0xb755f07b "status stellent_pre-prod_v-ip.http green Fri Jun 9
16:53:40 2006: OK ; OK\n\n&green
https://pws.tc.sc.egov.usda.gov/siteminderagent/dmsforms/login_banner.fcc?TYPE=33554433&REALMOID=06-d38f4375-a8bd-419"...,
tstamp=1149890052, sender=0x1ca3f <Address 0x1ca3f out of bounds>,
ldef=0x1ca3f) at do_rrd.c:293
#13 0x08049cf0 in main (argc=117311, argv=0xbfffab14) at hobbitd_rrd.c:199
I was looking at your patch, and it doesn't look to me like that new lines
are doing the same thing as the old:
- strcat(filedir, "/"); strcat(filedir, fn);
+ snprintf(filedir, sizeof(filedir)-1, "%s/%s/%s", rrddir, hostname, fn);
+ filedir[sizeof(filedir)-1] = '\0';
creparams[1] = filedir; /* Icky */
It looks like the original line creates something like "filedir/fn"
while the new lines create something like "filedir/hostname/fn". Is
this right?
Thanks,
Larry Barber
On 6/9/06, Henrik Stoerner <henrik (at) hswn.dk> wrote:
On Fri, Jun 09, 2006 at 04:21:56PM -0500, Larry Barber wrote:
> I loaded p1, and hobbitd_rrd is still dumping, the stack trace looks
like:
>
> #5 <signal handler called>
> #6 0x00dfe3da in do_lookup () from /lib/ld-linux.so.2
> #7 0x00dfd103 in _dl_lookup_symbol_internal () from /lib/ld-linux.so.2
> #8 0x00e0140f in fixup () from /lib/ld-linux.so.2
> #9 0x00e01330 in _dl_runtime_resolve () from /lib/ld-linux.so.2
> #10 0x0804a91f in create_and_update_rrd (hostname=0xb755d037
> "stellent_pre-prod_v-ip",
> fn=0x805f6e0
> "tcp.http.https:,,pws.tc.sc.egov.usda.gov
,siteminderagent,dmsforms,login_banner.fcc?TYPE=33554433&REALMOID=06-d38f4375-a8bd-4190-b6f9-3c77f0901647&GUID=&SMAUTHREASON=0&METHOD=GET&SMAGENTNAME=$SM$hIspF3"...,
> creparams=0x805e5c0, template=0x9cf6b20 "sec") at do_rrd.c:143
OK, the call trace looks sane so I think we can rule out simple memory
corruption here.
The crash happens when trying to print an error-message from the RRDtool
library, when trying to create a new RRD file for tracking a http test
response time (it has just called the rrd_create() function, which returns
an error and hobbit is trying to print out the error message when it
crashes.
The filename looks somewhat suspicious. It is generated from the URL
that is tested, and it is a very long filename beginning with
"tcp.http.https:,,pws.tc.sc.egov.usda.gov
,siteminderagent,dmsforms,login_banner.fcc?TYPE="
It's an http test for the host "stellent_pre-prod_v-ip"
My guess is that this filename is just too long. It *could* overflow the
buffer set aside for the RRD filename - in that case, the attached patch
against 4.1.2p1 should help.
> It just started doing this today, I can't think of anything that I have
done
> that could cause it.
I think You just added this http test for "stellent_pre-prod_v-ip".
Regards,
Henrik
To unsubscribe from the hobbit list, send an e-mail to
hobbit-unsubscribe (at) hswn.dk