[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [hobbit] Advice on how to handle HA monitoring
- To: hobbit (at) hswn.dk
- Subject: Re: [hobbit] Advice on how to handle HA monitoring
- From: "Asif Iqbal" <vadud3 (at) gmail.com>
- Date: Wed, 26 Sep 2007 09:28:24 -0400
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=beta; h=domainkey-signature:received:received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; bh=1dyzx6rQpCjgpHC++ZDRvidPqHCPyzgQvZMQ0BmCJNA=; b=YpBzsMlYFkI9gvdnQwBYk1rT0JlBMfbbpKU+ratO5UoYOFdxn2MJyzL550GkxFwAFpxljsR7l5AxrD6XTJm0KVOcVlEzI9HGctl0y98LKg3yCIxYJln+i+i2v7msPx2a/QsExDz8WIY6uLEPW665X/0MPlwM0jL9bnq0A6sF34Q=
- Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=beta; h=received:message-id:date:from:to:subject:in-reply-to:mime-version:content-type:content-transfer-encoding:content-disposition:references; b=UQ8IveEQrj5gQB9pSyKfM+LWNLUrp90g7gICa6Ivo3I0xpbTOFp0I0sTVr5Q19CU0T4SQgqT7eBt/cCa/R227yFJzp24Ddd3+tkGM4UBAZpbzSgPtvW5qEEmeVGplAWOas2qcgko+inMhnMS2osPBYWX0b7VY3RpIFD+UcJ8Xvo=
- References: <46F4176E.9030306 (at) cisco.com> <9836EA7D7FDAE34099AED87A2D9C3A8D9894EC (at) 306181ANEX2.global.avaya.com> <46F4242E.4010707 (at) cisco.com>
On 9/21/07, Charles Jones <jonescr (at) cisco.com> wrote:
> So you are using a custom script to monitor instead of hobbitclient-*.sh ?
> This really isn't an option for me, since I have literally dozens of
> servers that all have the same hobbit homedir, although I have written
It does not matter. You can still write an external script like this
[external]
ENVFILE /path/to/local/etc/hobbitclient.cfg
CMD /path/to/local/external/script
LOGFILE /path/to/local/logs/external.log
INTERVAL 5m
That will work just fine even though you are sharing same hobbit client
> some custom scripts that check the hostname and only run if they are
> launched on the host that they need to be on.
>
> I think I will just have the oncall persons manually edit
> hobbit-clients.cfg in the case of a failover (oncall gets woken up
> anyhow). They can just uncomment/comment definitions for whichever host
> is the master.
Ouch!
>
> It would be nice if you could set dependencies for PROC tests, then I
> could just make all of the PROC tests dependant upon something, like one
> of the failover daemons, or a flag on the filesystem, etc.
>
> -Charles
>
> Haertig, David F (Dave) wrote:
> > I do this with a custom monitoring script (I don't use the standard
> > Hobbit 'procs' test).
> >
> > There should be something you can check via script that tells you if a
> > server is primary or not. In my case, a database filesystem is mounting
> > on the primary but not on the secondary. So my script uses 'df' to look
> > for that filesystem. You could use 'mount' as well. If that database
> > filesystem is mounted the script does the normal test for processes and
> > reports red/green. But if it's not mounted, the script reports a clear
> > condition.
> >
> > -----Original Message-----
> > From: Charles Jones [mailto:jonescr (at) cisco.com]
> > Sent: Friday, September 21, 2007 1:12 PM
> > To: hobbit (at) hswn.dk
> > Subject: [hobbit] Advice on how to handle HA monitoring
> >
> > We have 2 hosts, HostA and HostB. They are part of an HA cluster via HP
> > ServiceGuard. There is a virtual IP and DNS name of "virtual" that
> > automatically goes to whichever of HostA and HostB is the primary at the
> > time.
> >
> > I am currently monitoring both HostA and HostB via Hobbit. Currently
> > HostA is the primary, and I am doing various PROC checks. Currently on
> > HostB, I am not doing process checks.
> >
> > My problem is, how do I smoothly handle a failover scenario (HostB
> > becoming the primary)? When a failover occurs, all of the procs on
> > HostA are stopped (either by the server crashing, or manualy by
> > ServiceGuard), and the same procs are started up on HostB.
> >
> > I'm trying to think of ways to monitor both hosts, but only monitor
> > procs on the one that is primary. So far the best I can come up with is
> > to run the hobbit clients in local mode, and maybe have the ServiceGuard
> > scripts swap out the config files and restart the Hobbit clients when
> > there is a failover. That would probably work, BUT in this case the
> > Hobbit homdir is also the same (SAN mount) on both machines, so moving
> > or editing a file on one does the same on the other :(
> >
> > Simply shutting down the hobbit client on the non-primary is not an
> > option, as then it would no longer be monitored at all.
> >
> > Any ideas? :)
> >
> > -Charles
> >
> > To unsubscribe from the hobbit list, send an e-mail to
> > hobbit-unsubscribe (at) hswn.dk
> >
> > To unsubscribe from the hobbit list, send an e-mail to
> > hobbit-unsubscribe (at) hswn.dk
> >
>
> To unsubscribe from the hobbit list, send an e-mail to
> hobbit-unsubscribe (at) hswn.dk
>
>
>
--
Asif Iqbal
PGP Key: 0xE62693C5 KeyServer: pgp.mit.edu