[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [hobbit] Re: Status not available
Scratch that last message. Hobbitd appears to be dying independent of
hobbitlaunch.
On Mon, 28 Feb 2005 14:41:26 -0800, Brian Lynch <brianlynch (at) gmail.com> wrote:
> Is it possible that the SIGUSR2 signal is not being sent properly?
> What signal is sent to kill hobbitd from hobbitlaunch if it doesn't
> recieve notice?
>
> The HEARTBEAT keyword can only be used for one task, and is
> specifically aimed at monitoring the hobbitd(8) task. This
> task must send a SIGUSR2 signal to hobbitlaunch regularly;
> if this signal fails to arrive for more than 15 seconds,
> hobbitlaunch will kill the running task and start up a new
> one. hobbitd(8) will send this signal every 5 seconds.
>
>
> On Mon, 28 Feb 2005 14:26:55 -0800, Brian Lynch <brianlynch (at) gmail.com> wrote:
> > Enabled the -DDEBUG on compile and enabled --debug on runtime for hobbitd.
> > Same output from gdb..
> >
> > #0 0xff19fc14 in _libc_kill () from /usr/lib/libc.so.1
> > (gdb) backtrace
> > #0 0xff19fc14 in _libc_kill () from /usr/lib/libc.so.1
> > #1 0xff13598c in abort () from /usr/lib/libc.so.1
> > #2 0x000281f4 in sigsegv_handler (signum=0) at sig.c:57
> > #3 <signal handler called>
> >
> > Here are the sections of each log:
> >
> > hobbitlaunch.log
> >
> > 2005-02-28 22:24:56 Task bbhistory started with PID 2567
> > 2005-02-28 22:24:56 Task bbenadis started with PID 2572
> > 2005-02-28 22:24:56 Task bbpage started with PID 2575
> > 2005-02-28 22:24:56 Task larrdstatus started with PID 2578
> > 2005-02-28 22:24:56 Task larrddata started with PID 2580
> > 2005-02-28 22:24:56 Task bbdisplay started with PID 2582
> > 2005-02-28 22:24:56 Task bbnet started with PID 2584
> > 2005-02-28 22:24:56 Task bbretest started with PID 2585
> > 2005-02-28 22:24:58 Task larrdcolumn started with PID 2608
> > 2005-02-28 22:24:58 Task infocolumn started with PID 2609
> > 2005-02-28 22:25:16 Task hobbitd terminated by signal 6
> > 2005-02-28 22:25:17 Loading hostnames
> > 2005-02-28 22:25:17 Task hobbitd started with PID 2712
> > 2005-02-28 22:25:17 Loading saved state
> > 2005-02-28 22:25:17 Setting up network listener on 0.0.0.0:1984
> > 2005-02-28 22:25:17 Setting up signal handlers
> > 2005-02-28 22:25:17 Setting up hobbitd channels
> > 2005-02-28 22:25:17 Setting up status channel (id=1)
> > 2005-02-28 22:25:17 calling ftok('/opt/hobbit/server',1)
> > 2005-02-28 22:25:17 ftok() returns: 0x10078B4
> > 2005-02-28 22:25:17 shmget() returns: 0x8FC
> > 2005-02-28 22:25:17 Setting up stachg channel (id=2)
> > 2005-02-28 22:25:17 calling ftok('/opt/hobbit/server',2)
> > 2005-02-28 22:25:17 ftok() returns: 0x20078B4
> > 2005-02-28 22:25:17 shmget() returns: 0x8FD
> > 2005-02-28 22:25:17 Setting up page channel (id=3)
> > 2005-02-28 22:25:17 calling ftok('/opt/hobbit/server',3)
> > 2005-02-28 22:25:17 ftok() returns: 0x30078B4
> > 2005-02-28 22:25:17 shmget() returns: 0x8FE
> > 2005-02-28 22:25:17 Setting up data channel (id=4)
> > 2005-02-28 22:25:17 calling ftok('/opt/hobbit/server',4)
> > 2005-02-28 22:25:17 ftok() returns: 0x40078B4
> > 2005-02-28 22:25:17 shmget() returns: 0x8FF
> > 2005-02-28 22:25:17 Setting up notes channel (id=5)
> > 2005-02-28 22:25:17 calling ftok('/opt/hobbit/server',5)
> > 2005-02-28 22:25:17 ftok() returns: 0x50078B4
> > 2005-02-28 22:25:17 shmget() returns: 0x900
> > 2005-02-28 22:25:17 Setting up enadis channel (id=6)
> > 2005-02-28 22:25:17 calling ftok('/opt/hobbit/server',6)
> > 2005-02-28 22:25:17 ftok() returns: 0x60078B4
> > 2005-02-28 22:25:17 shmget() returns: 0xB59
> > 2005-02-28 22:25:17 Setting up logfiles
> > 2005-02-28 22:25:17 Task bbenadis terminated, status 1
> > 2005-02-28 22:25:22 Task bbhistory started with PID 2740
> > 2005-02-28 22:25:22 Task bbenadis started with PID 2743
> > 2005-02-28 22:25:22 Task bbpage started with PID 2747
> > 2005-02-28 22:25:22 Task larrdstatus started with PID 2750
> > 2005-02-28 22:25:22 Task larrddata started with PID 2752
> >
> > hobbitd.log
> >
> > 005-02-28 22:15:16 Posting message 12 to 1 readers
> > 2005-02-28 22:15:16 Message posted
> > 2005-02-28 22:15:16 Posting message 13 to 1 readers
> > 2005-02-28 22:15:16 Message posted
> > 2005-02-28 22:15:16 oldcolor=6, oldas=2, newcolor=0, newas=0
> > 2005-02-28 22:15:16 posting to stachg channel
> > 2005-02-28 22:15:17 Setup complete
> > 2005-02-28 22:15:17 Sending heartbeat to pid 28397
> > 2005-02-28 22:15:18 posting to status channel
> > 2005-02-28 22:15:18 Dropping message - no readers
> > 2005-02-28 22:15:19 posting to status channel
> > 2005-02-28 22:15:19 Dropping message - no readers
> > 2005-02-28 22:15:21 posting to status channel
> > 2005-02-28 22:15:21 Dropping message - no readers
> > 2005-02-28 22:15:22 Sending heartbeat to pid 28397
> > 2005-02-28 22:15:22 posting to status channel
> > 2005-02-28 22:15:22 Dropping message - no readers
> > 2005-02-28 22:15:23 posting to status channel
> > 2005-02-28 22:15:23 Posting message 1 to 1 readers
> > 2005-02-28 22:15:23 Message posted
> > 2005-02-28 22:15:24 posting to status channel
> > 2005-02-28 22:15:24 Posting message 2 to 1 readers
> > 2005-02-28 22:15:24 Message posted
> > 2005-02-28 22:15:26 posting alert to page channel
> > 2005-02-28 22:15:26 Posting message 1 to 1 readers
> > 2005-02-28 22:15:26 Message posted
> > 2005-02-28 22:15:26 posting to status channel
> > 2005-02-28 22:15:26 Posting message 3 to 1 readers
> > 2005-02-28 22:15:26 Message posted
> > 2005-02-28 22:15:28 Sending heartbeat to pid 28397
> >
> >
> > On Mon, 28 Feb 2005 14:11:43 -0800, Brian Lynch <brianlynch (at) gmail.com> wrote:
> > > Have it running with '--debug' now... Hopefully, this will yield more
> > > information.
> > >
> > > - Brian
> > >
> > >
> > > On Mon, 28 Feb 2005 23:10:14 +0100, Henrik Stoerner <henrik (at) hswn.dk> wrote:
> > > > On Mon, Feb 28, 2005 at 02:00:34PM -0800, Brian Lynch wrote:
> > > > > It is also dumping core... here is the gdb results from the core file.
> > > > [snip]
> > > > > (gdb) backtrace
> > > > > #0 0xff19fc14 in _libc_kill () from /usr/lib/libc.so.1
> > > > > #1 0xff13598c in abort () from /usr/lib/libc.so.1
> > > > > #2 0x0001cf54 in sigsegv_handler (signum=-14958584) at sig.c:57
> > > > > #3 <signal handler called>
> > > >
> > > > No more lines than that ? I'd expect a few more unless there's some
> > > > serious memory corruption taking place.
> > > >
> > > > Henrik
> > > >
> > > > To unsubscribe from the hobbit list, send an e-mail to
> > > > hobbit-unsubscribe (at) hswn.dk
> > > >
> > > >
> > >
> >
>