hobbitd core dumps when receiving a status message on a host not in bb-hosts (snapshot 18 March 2008)
Sebastian
spa at syntec.co.uk
Wed Mar 19 20:33:52 CET 2008
Hi,
hobbitd core dumps when receiving a status message on a host not in bb-hosts
(snapshot 18 March 2008). This may, possibly, just be when the message
comes from a Big Brother server, since that was the case here. I haven't
tested when the message comes from another Hobbit server and I have now
downgraded again. However, if needed, I could try reupdating. Backtraces
follow...
[root at IVRA1 hobbit]# gdb hobbit-snapshot-18-Mar-08/hobbitd/hobbitd
server/tmp/core.683
GNU gdb Red Hat Linux (6.3.0.0-1.96rh)
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain
conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux-gnu"...Using host libthread_db
library "/lib/tls/libthread_db.so.1".
Core was generated by `hobbitd --pidfile=/var/log/hobbit/hobbitd.pid
--restart=/usr/local/hobbit/serve'.
Program terminated with signal 6, Aborted.
Reading symbols from /lib/libpcre.so.0...done.
Loaded symbols for /lib/libpcre.so.0
Reading symbols from /usr/lib/libz.so.1...done.
Loaded symbols for /usr/lib/libz.so.1
Reading symbols from /lib/tls/libc.so.6...done.
Loaded symbols for /lib/tls/libc.so.6
Reading symbols from /lib/ld-linux.so.2...done.
Loaded symbols for /lib/ld-linux.so.2
#0 0x005557a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
(gdb)
(gdb)
(gdb) bt
#0 0x005557a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
#1 0x005957f5 in raise () from /lib/tls/libc.so.6
#2 0x00597199 in abort () from /lib/tls/libc.so.6
#3 0x0805c132 in sigsegv_handler (signum=11) at sig.c:58
#4 <signal handler called>
#5 hostinfo (hostname=0x991a6f7 "Flaxman") at loadhosts.c:401
#6 0x0805d81d in check_downtime (hostname=0x991a6f7 "Flaxman",
testname=0x991a6ff "conn") at timefunc.c:285
#7 0x0804b8ff in get_hts (
msg=0xb7cb21ac "status Flaxman.conn red <!-- [flags:ordAstLe] --> Wed
Mar 19 16:13:13 2008 conn NOT ok \n\nService conn on Flaxman is not OK :
Host does not respond to ping\n\n\nSystem unreachable for 15720 poll periods
("..., sender=0xbff45fb0 "192.168.4.118", origin=0x805fb8e "",
host=0xbff44728, test=0xbff4472c, grouplist=0xbff44730, log=0xbff44734,
color=0xbff44738, downcause=0xbff4473c, alltests=0x0,
createhost=1, createlog=1) at hobbitd.c:1012
#8 0x0804f304 in do_message (msg=0x9926eb0, origin=0x805fb8e "") at
hobbitd.c:2787
#9 0x08055129 in main (argc=8, argv=0xbff48424) at hobbitd.c:4867
(gdb)
And
[root at IVRA1 hobbit]# gdb hobbit-snapshot-18-Mar-08/hobbitd/hobbitd
server/tmp/core.685
GNU gdb Red Hat Linux (6.3.0.0-1.96rh)
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain
conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB. Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux-gnu"...Using host libthread_db
library "/lib/tls/libthread_db.so.1".
Core was generated by `hobbitd --pidfile=/var/log/hobbit/hobbitd.pid
--restart=/usr/local/hobbit/serve'.
Program terminated with signal 6, Aborted.
Reading symbols from /lib/libpcre.so.0...done.
Loaded symbols for /lib/libpcre.so.0
Reading symbols from /usr/lib/libz.so.1...done.
Loaded symbols for /usr/lib/libz.so.1
Reading symbols from /lib/tls/libc.so.6...done.
Loaded symbols for /lib/tls/libc.so.6
Reading symbols from /lib/ld-linux.so.2...done.
Loaded symbols for /lib/ld-linux.so.2
#0 0x005557a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
(gdb)
(gdb)
(gdb) bt
#0 0x005557a2 in _dl_sysinfo_int80 () from /lib/ld-linux.so.2
#1 0x005957f5 in raise () from /lib/tls/libc.so.6
#2 0x00597199 in abort () from /lib/tls/libc.so.6
#3 0x0805c132 in sigsegv_handler (signum=11) at sig.c:58
#4 <signal handler called>
#5 hostinfo (hostname=0x8924897 "faxmonster") at loadhosts.c:401
#6 0x0805d81d in check_downtime (hostname=0x8924897 "faxmonster",
testname=0x89248a2 "conn") at timefunc.c:285
#7 0x0804b8ff in get_hts (
msg=0x8981748 "status faxmonster.conn red Wed Mar 19 16:16:44 2008 Can't
connect to faxmonster\r\nPING faxmonster (192.168.4.91): 56 data
bytes\r\nhost faxmonster unreachable\r\nhost faxmonster unreachable\r\nhost
faxmonste"..., sender=0xbfeffbd0 "192.168.4.101", origin=0x805fb8e "",
host=0xbfefe348, test=0xbfefe34c, grouplist=0xbfefe350, log=0xbfefe354,
color=0xbfefe358, downcause=0xbfefe35c,
alltests=0x0, createhost=1, createlog=1) at hobbitd.c:1012
#8 0x0804f845 in do_message (msg=0x88fb5a0, origin=0x805fb8e "") at
hobbitd.c:2854
#9 0x08055129 in main (argc=8, argv=0xbff02044) at hobbitd.c:4867
(gdb)
Etc. More very similar to these. It would appear that every single one of
these status messages causes a core dump as they happen at approx. 5 minute,
or less, intervals.
Both of these hosts are commented out in bb-hosts, but hobbitd is receiving
updates on them from a Big Brother server (where they should probably be
commented out too, but I found I didn't need to with hobbit 4.2.0, so long
as I don't mind getting the Ghost reports warning).
Kind regards,
Sebastian
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20080319/de5b62b8/attachment.html>
More information about the Xymon
mailing list