[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [hobbit] nightly reboots



Maybe his NET is overloaded. We run some security scripts at night that hit the networks so heavy we get network problems .
Ralph Mitchell wrote:
How much work is the server doing?? The company that just laid me off has an old single-cpu, 733MHz DL380 running RedHat 7.2. It runs a lot of bash scripts out of cron to fetch and check web pages, with the results being reported back to the same machine. Last time I was able to see it, there were over 400 bb-hosts entries and over 2500 reports. It has a fairly constant load average of around 5 or 6, spiking to maybe 10 or 11 whenever the planets align and a lot of stuff happens simultaneously.

As soon as they can figure out how to replace it, Hobbit'll be shutdown, as it's not one of the officially blessed monitoring systems. However, even the folks in their Integration Labs admit they have nothing that can do quite what I've done with Hobbit, so I imagine they'll end up telling their customers the monitoring is being downgraded. I'd love to be a fly on the wall for *those* conversations... :)

Ralph Mitchell


On Tue, Jan 13, 2009 at 11:25 AM, Gavin Leonard <gleonard (at) progrexion.com <mailto:gleonard (at) progrexion.com>> wrote:

    Ok.. so how do you delay the red alerts? I am wondering if I am
    just over loading this system... I may need to build another bb
    server so I can split up the work load a bit.. thanks in advance!!

    -Gavin

    -----Original Message-----
    From: Josh Luthman [mailto:josh (at) imaginenetworksllc.com
    <mailto:josh (at) imaginenetworksllc.com>]
    Sent: Tuesday, January 13, 2009 9:49 AM
    To: hobbit (at) hswn.dk <mailto:hobbit (at) hswn.dk>
    Subject: Re: [hobbit] nightly reboots

    I have had the problem where the conn test goes bad for everything
    (not every host, just groups based on bb-hosts) since I installed it
    at the office.  No idea why :(

    What I do is delay the red sms alerts by a few minutes as it is red
    for only a few seconds, sometimes a minute.

    On 1/13/09, Gavin Leonard <gleonard (at) progrexion.com
    <mailto:gleonard (at) progrexion.com>> wrote:
    > All,
    >                 I am having an issue where my hobbit server
    thinks that
    > every server it monitors has been rebooted, so I get blasted
    with sms
    > messages when this happens. And none of the servers have
    actually rebooted
    > nor has there been any network outages.. ideas?thoughts?
    >
    >
    >
    >
    >
    > Gavin Leonard
    >
    > [cid:image001.gif@01C97562.5DF2D550]
    >
    > Director, Systems-Network Engineering
    >
    > T
    >
    >  801-828-1735
    >
    > F
    >
    >  801-828-1704
    >
    > E
    >
    >  gleonard (at) progrexion.com
    <mailto:gleonard (at) progrexion.com><mailto:gleonard (at) progrexion.com
    <mailto:gleonard (at) progrexion.com>>
    >
    >
    >
    >
    >
    >
    >
    >
    > Research | Marketing | Sales Generation
    >
    > www.progrexion.com
    <http://www.progrexion.com><http://www.progrexion.com/>
    >
    >
    >
    >
    > This email and its contents are confidential. If you are not the
    intended
    > recipient, delete this email and do not use or disclose the
    information
    > within this email or its attachments. Thank you.
    >
    >
    >
    >
    >


    --
    Josh Luthman
    Office: 937-552-2340
    Direct: 937-552-2343
    1100 Wayne St
    Suite 1337
    Troy, OH 45373

    Those who don't understand UNIX are condemned to reinvent it, poorly.
    --- Henry Spencer

    To unsubscribe from the hobbit list, send an e-mail to
    hobbit-unsubscribe (at) hswn.dk <mailto:hobbit-unsubscribe (at) hswn.dk>



    To unsubscribe from the hobbit list, send an e-mail to
    hobbit-unsubscribe (at) hswn.dk <mailto:hobbit-unsubscribe (at) hswn.dk>