[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [hobbit] Hobbit client executing a script to be proactive if a problem occurs?

To: hobbit (at) hswn.dk
Subject: Re: [hobbit] Hobbit client executing a script to be proactive if a problem occurs?
From: Chris Wopat <chrisw (at) supranet.net>
Date: Fri, 11 Apr 2008 12:17:24 -0500
References: <47FF6DF2.4030306 (at) supranet.net> <258e9b160804110816v5e6dca48h6e430e43dc9fd97b (at) mail.gmail.com>
User-agent: Mozilla-Thunderbird 2.0.0.12 (X11/20080406)

Phil Wild wrote:

Hi Chris,
I think it really depends on what you are testing? If you are using thestandard hobbit client and the standard tests, most of the client sideis pretty basic, I guess you could call it a dumb client in a way as itdoes a simple job of pulling the data out and sends it on without anyintelligent decisions being made about thresholds etc.
To do what you want, you either have to do as you say (set up keys fromthe server etc and have the server perform an action after a thresholdbreach from a script initiated/configured in the hobbit-alerts file.

Bummer, I was hoping there was perhaps a barely documented feature thatwould let you exec a script on the client to make my life easier.

Or, which would be much simpler, put some code in your monitoring scriptto take action, but then you are starting to move away from thesimplicity of hobbit. It decomes even harder if you want to take actionbased on something picked up in the standard tests (like CPU that youmentioned in your post).


Indeed, my intent is for standard tests.

You may need to write your own test/new columnthat monitors the same metric but in a different light. In my view, anautomated action based on a detected event probably does not belong inthe monitoring system. If a failure can be expected and an automatedaction is known to fix the issue, perhaps that should be built into thestartup process of the application (a watchdog process etc).

Indeed, a daemon shouldn't fail and it should run properly or be fixednatively. However, in the case of what I'm trying to monitor is an itemthat has a series of dependencies - Postfix, depending on Amavis,depending on p0f, depending on ClamAV, depending on greylist software,depending on database, etc. Under certain circumstances if one of thesewere to go down, it ends up snowballing to have high CPU, in my case.

The better way for me to handle this is to likely search logs for items,instead of relying on high CPU.

Hobbit canthen be used to monitor the log for a restart event, or a failed restartevent etc. Actually, thinking about it more, building the intelligentaction into the agent is an ok idea and you also have the opportunity ofcapturing and transmitting additional information about why somethingdies if you run an action to fix and it failed etc.

For the scenario I laid out above, I intend to write a script that willrestart the daemons properly in the correct order, but this is the "ohshit" script, and wouldn't be a system startup script, for example.

I am waffling... Youstill need extra security based configuration steps on the client withsudo or ssh anyway to get around access permission to restart somethinganyway as your client is running as the hobbit userid so this brings theclient configuration closer to an ssh setup on the server. I don't thinkeither way is perfect but both would do what you want...

Indeed. It sounds like generally the two scenarios I'd mentioned in myemail are the way to get it to work, and whichever is most reliable/lesshack-ish would be the best way to do it.

Perhaps this is something that belongs on a request for feature list ofa future release of hobbit.
The hobbit client installation to configure sudo to allow it to runcommands as other users (on admins acceptance during the installation ofcourse).The ability of the hobbit server to send a series of actions to thehobbit client for execution via the hobbit communication channel. Soundslike something that could have lots of uses if done well...

I think this would be the perfect scenario. Something added to thehobbit client, that would go in 'localclient.cfg'. A simple 'SCRIPT'line that could be nested under a test, that would pass along whatevermeaningful variables that could be useful, such as PID.

The hobbit server *already uses* 'setuid root' for some binaries, suchas 'hobbitping'. The client would simply need to call some binary who'ssole purpose is to launch scripts as root so essentially anything wouldbe possible.

I can't think of any reason that some hook would have to exist for thescript to tell the hobbit client anything back, I think it can just waitfor the next poll period to see if it went back to green.


--Chris

Follow-Ups:
- Re: [hobbit] Hobbit client executing a script to be proactive if a problem occurs?
  - From: Josh Luthman

References:
- Hobbit client executing a script to be proactive if a problem occurs?
  - From: Chris Wopat
- Re: [hobbit] Hobbit client executing a script to be proactive if a problem occurs?
  - From: Phil Wild

Prev by Date: RE: [hobbit] Hobbit client executing a script to be proactive if a problem occurs?
Next by Date: Re: [hobbit] sms alert sample script - good solution?
Previous by thread: Re: [hobbit] Hobbit client executing a script to be proactive if a problem occurs?
Next by thread: Re: [hobbit] Hobbit client executing a script to be proactive if a problem occurs?
Index(es):
- Date
- Thread