FW: Re: [hobbit] bbdisplay problems after adding some new hosts
- To: hobbit (at) hswn.dk
- Subject: FW: Re: [hobbit] bbdisplay problems after adding some new hosts
- From: "Stefan Loos" <stefan_loos (at) hotmail.com>
- Date: Thu, 12 May 2005 12:31:44 +0000
Hello,
Is the number of tests per host limited? I've cleaned up my bb-hosts, and the
problem shows up as soon as I add one host with many tests. I disabled all
customized tests and tried to find out whether a single test is responsible.
But each test run by itself didn't crash the hobbit server - running them all
together did!
The host runs about 20 tests.
Regards,
Stefan

>From: "Stefan Loos" <stefan_loos (at) hotmail.com>
>Reply-To: hobbit (at) hswn.dk
>To: hobbit (at) hswn.dk
>Subject: Re: [hobbit] bbdisplay problems after adding some new hosts
>Date: Wed, 11 May 2005 09:49:30 +0000
>
>Hi Henrik,
>
>now the errors in the hobbitlaunch.log are gone, but those in
>bb-display.log are still there. And another strange thing - since I
>reenabled the bbdisplay this morning I didn't see any host at the
>hobbit server! Just the subpages and groups are there.
>
>Regards,
>
>Stefan Loos
>
>
>>From: henrik (at) hswn.dk (Henrik Stoerner)
>>Reply-To: hobbit (at) hswn.dk
>>To: hobbit (at) hswn.dk
>>Subject: Re: [hobbit] bbdisplay problems after adding some new hosts
>>Date: Wed, 11 May 2005 11:06:13 +0200
>>
>>Could you try removing the "HEARTBEAT" line from hobbitlaunch.cfg and
>>see if things run OK after that?
>>
>>
>>Regards,
>>Henrik
>>
>>On Wed, May 11, 2005 at 07:41:37AM +0000, Stefan Loos wrote:
>> > Hi,
>> >
>> > yesterday I added some new hosts to my hobbit-server and shortly after that
>> > hobbit had some problems.
>> > Here is what hobbitlaunch.log says:
>> >
>> > 2005-05-11 09:11:18 Heartbeat lost for task hobbitd, bouncing it
>> > 2005-05-11 09:11:18 Task bbretest started with PID 4523
>> > 2005-05-11 09:11:23 Heartbeat lost for task hobbitd, killing it
>> > 2005-05-11 09:11:23 Task bbdisplay started with PID 4524
>> > 2005-05-11 09:11:23 Task hobbitd terminated by signal 9
>> > 2005-05-11 09:11:23 Task hobbitd started with PID 4525
>> > 2005-05-11 09:11:23 Loading hostnames
>> > 2005-05-11 09:11:23 Loading saved state
>> > 2005-05-11 09:11:23 Setting up network listener on 0.0.0.0:1984
>> > 2005-05-11 09:11:23 Setting up signal handlers
>> > 2005-05-11 09:11:23 Setting up hobbitd channels
>> > 2005-05-11 09:11:23 Setting up logfiles
>> > 2005-05-11 09:11:28 Task bbhistory started with PID 4527
>> > 2005-05-11 09:11:28 Task bbenadis started with PID 4528
>> > 2005-05-11 09:11:28 Task bbpage started with PID 4530
>> > 2005-05-11 09:11:28 Task larrdstatus started with PID 4532
>> > 2005-05-11 09:11:28 Task larrddata started with PID 4534
>> > 2005-05-11 09:12:18 Task bbretest started with PID 4541
>> > 2005-05-11 09:12:23 Task bbdisplay started with PID 4542
>> > 2005-05-11 09:12:43 Heartbeat lost for task hobbitd, bouncing it
>> > 2005-05-11 09:12:48 Heartbeat lost for task hobbitd, killing it
>> > 2005-05-11 09:12:48 Task hobbitd terminated by signal 9
>> > 2005-05-11 09:12:48 Task bbdisplay terminated by signal 15
>> >
>> > So I tried to find out which component causes the problem and disabled
>> > everything in hobbitlaunch.cfg and reenabled one by one.
>> > I found out that every time I enabled bbdisplay those errors occur.
>> > The bb-display.log looks like this:
>> >
>> > 2005-05-11 09:09:48 Whoops ! bb failed to send message - timeout
>> > 2005-05-11 09:09:48 hobbitd status-board not available
>> > 2005-05-11 09:09:53 Whoops ! bb failed to send message - timeout
>> > 2005-05-11 09:10:53 Whoops ! bb failed to send message - timeout
>> > 2005-05-11 09:10:53 hobbitd status-board not available
>> > 2005-05-11 09:11:23 Could not connect to bbd (at) 10.207.193.41:1984 - Connection refused
>> > 2005-05-11 09:11:23 Whoops ! bb failed to send message - Connection failed
>> > 2005-05-11 09:11:23 hobbitd status-board not available
>> > 2005-05-11 09:11:23 Could not connect to bbd (at) 10.207.193.41:1984 - Connection refused
>> > 2005-05-11 09:11:23 Whoops ! bb failed to send message - Connection failed
>> >
>> > I also found some core files in ~server/tmp but I'm pretty sure they came
>> > from killing hobbit - nevertheless I've run the gdb util:
>> >
>> > GNU gdb Red Hat Linux (6.1post-1.20040607.52rh)
>> > Copyright 2004 Free Software Foundation, Inc.
>> > GDB is free software, covered by the GNU General Public License, and you are
>> > welcome to change it and/or distribute copies of it under certain
>> > conditions.
>> > Type "show copying" to see the conditions.
>> > There is absolutely no warranty for GDB. Type "show warranty" for details.
>> > This GDB was configured as "i386-redhat-linux-gnu"...Using host
>> > libthread_db library "/lib/tls/libthread_db.so.1".
>> >
>> > Core was generated by `hobbitd --debug --pidfile=/var/log/hobbit/hobbitd.pid --restart=/usr/local/hobb'.
>> > Program terminated with signal 6, Aborted.
>> > Reading symbols from /lib/tls/libc.so.6...done.
>> > Loaded symbols for /lib/tls/libc.so.6
>> > Reading symbols from /lib/ld-linux.so.2...done.
>> > Loaded symbols for /lib/ld-linux.so.2
>> > #0 0x00df4cef in raise () from /lib/tls/libc.so.6
>> > (gdb) bt
>> > #0 0x00df4cef in raise () from /lib/tls/libc.so.6
>> > #1 0x00df64f5 in abort () from /lib/tls/libc.so.6
>> > #2 0x08054126 in sigsegv_handler (signum=11) at sig.c:57
>> > #3 <signal handler called>
>> > #4 0x00e46cac in mempcpy () from /lib/tls/libc.so.6
>> > #5 0x00e3a4d2 in _IO_default_xsputn_internal () from /lib/tls/libc.so.6
>> > #6 0x00e13527 in vfprintf () from /lib/tls/libc.so.6
>> > #7 0x00e2f3dc in vsprintf () from /lib/tls/libc.so.6
>> > #8 0x00e1a03d in sprintf () from /lib/tls/libc.so.6
>> > #9 0x0804d7a4 in do_message (msg=0x9e0b3f8, origin=0x80554bb "") at hobbitd.c:1903
>> > #10 0x0804fcb5 in main (argc=8, argv=0xbfff9084) at hobbitd.c:2944
>> > (gdb)
>> >
>> > Now I try to find out which of the new hosts - and what test causes the
>> > problems...
>> >
>> > Regards,
>> >
>> > Stefan Loos
>> >
>> > To unsubscribe from the hobbit list, send an e-mail to
>> > hobbit-unsubscribe (at) hswn.dk
>>
>>--
>>Henrik Storner