[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [hobbit] nightly reboots



Even stranger is that they hit right when the hobbit-alerts file starts sending pages for the days.. 5am bleh...

-Gavin

From: Ralph Mitchell [mailto:ralphmitchell (at) gmail.com]
Sent: Tuesday, January 13, 2009 11:55 AM
To: hobbit (at) hswn.dk
Subject: Re: [hobbit] nightly reboots

How much work is the server doing??  The company that just laid me off has an old single-cpu, 733MHz DL380 running RedHat 7.2.  It runs a lot of bash scripts out of cron to fetch and check web pages, with the results being reported back to the same machine.  Last time I was able to see it, there were over 400 bb-hosts entries and over 2500 reports.  It has a fairly constant load average of around 5 or 6, spiking to maybe 10 or 11 whenever the planets align and a lot of stuff happens simultaneously.

As soon as they can figure out how to replace it, Hobbit'll be shutdown, as it's not one of the officially blessed monitoring systems.  However, even the folks in their Integration Labs admit they have nothing that can do quite what I've done with Hobbit, so I imagine they'll end up telling their customers the monitoring is being downgraded.  I'd love to be a fly on the wall for *those* conversations...  :)

Ralph Mitchell

On Tue, Jan 13, 2009 at 11:25 AM, Gavin Leonard <gleonard (at) progrexion.com<mailto:gleonard (at) progrexion.com>> wrote:
Ok.. so how do you delay the red alerts? I am wondering if I am just over loading this system... I may need to build another bb server so I can split up the work load a bit.. thanks in advance!!

-Gavin

-----Original Message-----
From: Josh Luthman [mailto:josh (at) imaginenetworksllc.com<mailto:josh (at) imaginenetworksllc.com>]
Sent: Tuesday, January 13, 2009 9:49 AM
To: hobbit (at) hswn.dk<mailto:hobbit (at) hswn.dk>
Subject: Re: [hobbit] nightly reboots

I have had the problem where the conn test goes bad for everything
(not every host, just groups based on bb-hosts) since I installed it
at the office.  No idea why :(

What I do is delay the red sms alerts by a few minutes as it is red
for only a few seconds, sometimes a minute.

On 1/13/09, Gavin Leonard <gleonard (at) progrexion.com<mailto:gleonard (at) progrexion.com>> wrote:
> All,
>                 I am having an issue where my hobbit server thinks that
> every server it monitors has been rebooted, so I get blasted with sms
> messages when this happens. And none of the servers have actually rebooted
> nor has there been any network outages.. ideas?thoughts?
>
>
>
>
>
> Gavin Leonard
>
> [cid:image001.gif@01C97562.5DF2D550]
>
> Director, Systems-Network Engineering
>
> T
>
>  801-828-1735
>
> F
>
>  801-828-1704
>
> E
>
>  gleonard (at) progrexion.com<mailto:gleonard (at) progrexion.com><mailto:gleonard (at) progrexion.com<mailto:gleonard (at) progrexion.com>>
>
>
>
>
>
>
>
>
> Research | Marketing | Sales Generation
>
> www.progrexion.com<http://www.progrexion.com><http://www.progrexion.com/>
>
>
>
>
> This email and its contents are confidential. If you are not the intended
> recipient, delete this email and do not use or disclose the information
> within this email or its attachments. Thank you.
>
>
>
>
>


--
Josh Luthman
Office: 937-552-2340
Direct: 937-552-2343
1100 Wayne St
Suite 1337
Troy, OH 45373

Those who don't understand UNIX are condemned to reinvent it, poorly.
--- Henry Spencer

To unsubscribe from the hobbit list, send an e-mail to
hobbit-unsubscribe (at) hswn.dk<mailto:hobbit-unsubscribe (at) hswn.dk>



To unsubscribe from the hobbit list, send an e-mail to
hobbit-unsubscribe (at) hswn.dk<mailto:hobbit-unsubscribe (at) hswn.dk>