FW: Re: [hobbit] bbdisplay problems after adding some new hosts

Stefan Loos stefan_loos at hotmail.com
Thu May 12 14:31:44 CEST 2005


Hello,

Is the number of tests per host limited? I've cleaned up my bb-hosts, and
the problem comes back as soon as I add one host with many tests. I disabled
all customized tests and tried to find out whether a single test is
responsible. But each test run on its own didn't crash the hobbit server -
running them all together did!
The host runs about 20 tests.

Regards,

Stefan


>From: "Stefan Loos" <stefan_loos at hotmail.com>
>Reply-To: hobbit at hswn.dk
>To: hobbit at hswn.dk
>Subject: Re: [hobbit] bbdisplay problems after adding some new hosts
>Date: Wed, 11 May 2005 09:49:30 +0000
>
>Hi Henrik,
>
>now the errors in hobbitlaunch.log are gone, but the ones in
>bb-display.log are still there. And another strange thing - since I
>re-enabled bbdisplay this morning I don't see any hosts on the
>hobbit server! Just the subpages and groups are there.
>
>Regards,
>
>Stefan Loos
>
>
>>From: henrik at hswn.dk (Henrik Stoerner)
>>Reply-To: hobbit at hswn.dk
>>To: hobbit at hswn.dk
>>Subject: Re: [hobbit] bbdisplay problems after adding some new hosts
>>Date: Wed, 11 May 2005 11:06:13 +0200
>>
>>Could you try removing the "HEARTBEAT" line from hobbitlaunch.cfg and
>>see if things run OK after that ?
>>
>>
>>Regards,
>>Henrik
>>
>>On Wed, May 11, 2005 at 07:41:37AM +0000, Stefan Loos wrote:
>> > Hi,
>> >
>> > yesterday I added some new hosts to my hobbit-server and shortly after that
>> > hobbit had some problems.
>> > Here is what hobbitlaunch.log says:
>> >
>> > 2005-05-11 09:11:18 Heartbeat lost for task hobbitd, bouncing it
>> > 2005-05-11 09:11:18 Task bbretest started with PID 4523
>> > 2005-05-11 09:11:23 Heartbeat lost for task hobbitd, killing it
>> > 2005-05-11 09:11:23 Task bbdisplay started with PID 4524
>> > 2005-05-11 09:11:23 Task hobbitd terminated by signal 9
>> > 2005-05-11 09:11:23 Task hobbitd started with PID 4525
>> > 2005-05-11 09:11:23 Loading hostnames
>> > 2005-05-11 09:11:23 Loading saved state
>> > 2005-05-11 09:11:23 Setting up network listener on 0.0.0.0:1984
>> > 2005-05-11 09:11:23 Setting up signal handlers
>> > 2005-05-11 09:11:23 Setting up hobbitd channels
>> > 2005-05-11 09:11:23 Setting up logfiles
>> > 2005-05-11 09:11:28 Task bbhistory started with PID 4527
>> > 2005-05-11 09:11:28 Task bbenadis started with PID 4528
>> > 2005-05-11 09:11:28 Task bbpage started with PID 4530
>> > 2005-05-11 09:11:28 Task larrdstatus started with PID 4532
>> > 2005-05-11 09:11:28 Task larrddata started with PID 4534
>> > 2005-05-11 09:12:18 Task bbretest started with PID 4541
>> > 2005-05-11 09:12:23 Task bbdisplay started with PID 4542
>> > 2005-05-11 09:12:43 Heartbeat lost for task hobbitd, bouncing it
>> > 2005-05-11 09:12:48 Heartbeat lost for task hobbitd, killing it
>> > 2005-05-11 09:12:48 Task hobbitd terminated by signal 9
>> > 2005-05-11 09:12:48 Task bbdisplay terminated by signal 15
>> >
>> > So I tried to find out which component causes the problem and disabled
>> > everything in hobbitlaunch.cfg and re-enabled the tasks one by one.
>> > I found out that every time I enabled bbdisplay those errors occur.
>> > The bb-display.log looks like this:
>> >
>> > 2005-05-11 09:09:48 Whoops ! bb failed to send message - timeout
>> > 2005-05-11 09:09:48 hobbitd status-board not available
>> > 2005-05-11 09:09:53 Whoops ! bb failed to send message - timeout
>> > 2005-05-11 09:10:53 Whoops ! bb failed to send message - timeout
>> > 2005-05-11 09:10:53 hobbitd status-board not available
>> > 2005-05-11 09:11:23 Could not connect to bbd at 10.207.193.41:1984 - Connection refused
>> > 2005-05-11 09:11:23 Whoops ! bb failed to send message - Connection failed
>> > 2005-05-11 09:11:23 hobbitd status-board not available
>> > 2005-05-11 09:11:23 Could not connect to bbd at 10.207.193.41:1984 - Connection refused
>> > 2005-05-11 09:11:23 Whoops ! bb failed to send message - Connection failed
>> >
>> > I also found some core files in ~server/tmp but I'm pretty sure they came
>> > from killing hobbit - nevertheless I've run the gdb util:
>> >
>> > GNU gdb Red Hat Linux (6.1post-1.20040607.52rh)
>> > Copyright 2004 Free Software Foundation, Inc.
>> > GDB is free software, covered by the GNU General Public License, and you are
>> > welcome to change it and/or distribute copies of it under certain
>> > conditions.
>> > Type "show copying" to see the conditions.
>> > There is absolutely no warranty for GDB.  Type "show warranty" for details.
>> > This GDB was configured as "i386-redhat-linux-gnu"...Using host
>> > libthread_db library "/lib/tls/libthread_db.so.1".
>> >
>> > Core was generated by `hobbitd --debug
>> > --pidfile=/var/log/hobbit/hobbitd.pid --restart=/usr/local/hobb'.
>> > Program terminated with signal 6, Aborted.
>> > Reading symbols from /lib/tls/libc.so.6...done.
>> > Loaded symbols for /lib/tls/libc.so.6
>> > Reading symbols from /lib/ld-linux.so.2...done.
>> > Loaded symbols for /lib/ld-linux.so.2
>> > #0  0x00df4cef in raise () from /lib/tls/libc.so.6
>> > (gdb) bt
>> > #0  0x00df4cef in raise () from /lib/tls/libc.so.6
>> > #1  0x00df64f5 in abort () from /lib/tls/libc.so.6
>> > #2  0x08054126 in sigsegv_handler (signum=11) at sig.c:57
>> > #3  <signal handler called>
>> > #4  0x00e46cac in mempcpy () from /lib/tls/libc.so.6
>> > #5  0x00e3a4d2 in _IO_default_xsputn_internal () from /lib/tls/libc.so.6
>> > #6  0x00e13527 in vfprintf () from /lib/tls/libc.so.6
>> > #7  0x00e2f3dc in vsprintf () from /lib/tls/libc.so.6
>> > #8  0x00e1a03d in sprintf () from /lib/tls/libc.so.6
>> > #9  0x0804d7a4 in do_message (msg=0x9e0b3f8, origin=0x80554bb "") at
>> > hobbitd.c:1903
>> > #10 0x0804fcb5 in main (argc=8, argv=0xbfff9084) at hobbitd.c:2944
>> > (gdb)
>> >
>> > Now I'm trying to find out which of the new hosts - and which test -
>> > causes the problems...
>> >
>> > Regards,
>> >
>> > Stefan Loos
>> >
>> >
>> > To unsubscribe from the hobbit list, send an e-mail to
>> > hobbit-unsubscribe at hswn.dk
>>
>>--
>>Henrik Storner
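
For anyone chasing the same symptom, the restart cycle in the quoted hobbitlaunch.log can be tallied with a short script. This is only a minimal sketch, assuming the log lines look exactly like the ones quoted in this thread; the function name summarize_launch_log and the SAMPLE text are made up for illustration and are not part of hobbit.

```python
import re
from collections import Counter

# Line shapes taken from the hobbitlaunch.log excerpt quoted in this thread:
#   2005-05-11 09:11:18 Heartbeat lost for task hobbitd, bouncing it
#   2005-05-11 09:11:23 Task hobbitd terminated by signal 9
HEARTBEAT_RE = re.compile(
    r"^\S+ \S+ Heartbeat lost for task (\S+), (bouncing|killing) it$"
)
SIGNAL_RE = re.compile(r"^\S+ \S+ Task (\S+) terminated by signal (\d+)$")


def summarize_launch_log(lines):
    """Count heartbeat losses and signal terminations per task.

    Hypothetical helper: returns a Counter keyed by (task, event).
    Lines that match neither pattern are ignored.
    """
    counts = Counter()
    for raw in lines:
        line = raw.strip()
        match = HEARTBEAT_RE.match(line)
        if match:
            counts[(match.group(1), "heartbeat " + match.group(2))] += 1
            continue
        match = SIGNAL_RE.match(line)
        if match:
            counts[(match.group(1), "signal " + match.group(2))] += 1
    return counts


# A few lines copied from the quoted log, used here as sample input.
SAMPLE = """\
2005-05-11 09:11:18 Heartbeat lost for task hobbitd, bouncing it
2005-05-11 09:11:18 Task bbretest started with PID 4523
2005-05-11 09:11:23 Heartbeat lost for task hobbitd, killing it
2005-05-11 09:11:23 Task hobbitd terminated by signal 9
2005-05-11 09:12:43 Heartbeat lost for task hobbitd, bouncing it
2005-05-11 09:12:48 Heartbeat lost for task hobbitd, killing it
2005-05-11 09:12:48 Task hobbitd terminated by signal 9
2005-05-11 09:12:48 Task bbdisplay terminated by signal 15
"""

if __name__ == "__main__":
    for (task, event), count in sorted(summarize_launch_log(SAMPLE.splitlines()).items()):
        print(f"{task}: {event} x{count}")
```

Running it over the real log file instead of SAMPLE (for example, passing an open file object to the function) would show how often hobbitd is being bounced and killed while bisecting the host list.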
