[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Interesting question
- To: hobbit (at) hswn.dk
- Subject: Interesting question
- From: Jerald Sheets <questy (at) gmail.com>
- Date: Wed, 30 Dec 2009 11:34:12 -0500
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=45yaDTROqQYu3QL5bBxQC9vhetNd3eNReqWAhFQNiag=; b=SHhIVRrNu2AoEmmUHnhNEwCu+s9r5sCDcoJNSuNK5MixEiYa4e8ryv2P4RjZWZQ6vL ZC15ZZymdUyoJRumB+yoe/PnqRzlhTzyZ/gxIvRb7Osa7vE/E8r4xeC2qL3BJq2J3mIN 9pZYvuWYgSo9xm2rqH8XX+o7wEs8Y8ZR2QnO0=
- Domainkey-signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=b0UoElMnSxrI2c7J0aHBg5O1JU2gKw+hjw1/298uXG9a8p2wVfFYvHKj0SBjYfeDJA Tql8e2Z+JIXBKSL/5vC05VCXLgJ1zofZUGg01zBQbqAQI7HACWEbQJZD0EdJybqH6znX iZplrC/6y18+op7UauevN+Pp+TbaSAigmUmEk=
I'm about to undertake writing a series of monitors for a custom app we have
here in-house.
This app lives over a networked filesystem (think cvfs or gpfs) that is
managed by two master nodes, a master and a failover. As a result, only one
node at a time can answer the query I want to give it.
My conundrum:
If I make the query form my Xymon server on node1 and it fails over to
node2, node1 becomes completely unable to answer the question, and that
check would go "red". Same for node2 back to node1.
If I run the check locally on each system, while one is working, the other
will go "blue".
Have any of you ever written monitors for servers that carry a service in an
active/passive configuration, and been able to keep the individual servers
from going into some strange state as a result of failovers and such?
How did you handle it?
---
Jerald M. Sheets jr.