[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
RE: [hobbit] nightly reboots
- To: "hobbit (at) hswn.dk" <hobbit (at) hswn.dk>
- Subject: RE: [hobbit] nightly reboots
- From: Gavin Leonard <gleonard (at) progrexion.com>
- Date: Tue, 27 Jan 2009 10:10:20 -0700
- Accept-language: en-US
- Acceptlanguage: en-US
- References: <C76DE1C678818B4E9D7722DA7D47897B75A4A56243 (at) OMAHA.pgx.local> <961092e10901130849v1bb72ea7x843b4e0d0a9bd4a6 (at) mail.gmail.com> <C76DE1C678818B4E9D7722DA7D47897B75A4A56249 (at) OMAHA.pgx.local> <961092e10901130930k209387f9q5e357abe0648db7c (at) mail.gmail.com>
- Thread-index: Acl1pNelqcMCvInaSBeMKbT6p3iVVgK/PtFg
- Thread-topic: [hobbit] nightly reboots
Update to this.. so I stopped getting pages for all my servers supposedly becoming unreachable, now I just get one that states that the bbtest had recovered.. looks like this. Does that shed any more light for those having this same problem?
green Tue Jan 27 07:31:00 2009
bbtest-net version 4.2.0
SSL library : OpenSSL 0.9.7f 22 Mar 2005 LDAP library: OpenLDAP 20223
Statistics:
Hosts total : 62
Hosts with no tests : 0
Total test count : 64
Status messages : 65
Alert status msgs : 0
Transmissions : 2
DNS statistics:
# hostnames resolved : 62
# succesful : 61
# failed : 1
# calls to dnsresolve : 64
TCP test statistics:
# TCP tests total : 2
# HTTP tests : 1
# Simple TCP tests : 1
# Connection attempts : 2
# bytes written : 133
# bytes read : 149174
TIME SPENT
Event Starttime Duration
bbtest-net startup 1233066660.603990 -
Service definitions loaded 1233066660.605545 0.001555
Tests loaded 1233066660.617314 0.011769
DNS lookups completed 1233066660.640820 0.023506
Test engine setup completed 1233066660.642001 0.001181
TCP tests completed 1233066660.643672 0.001671
PING test completed (62 hosts) 1233066663.593909 2.950237
PING test results sent 1233066663.594425 0.000516
Test result collection completed 1233066663.594433 0.000008
LDAP test engine setup completed 1233066663.594434 0.000001
LDAP tests executed 1233066663.594436 0.000002
LDAP tests result collection completed 1233066663.594437 0.000001
Test results transmitted 1233066663.594806 0.000369
bbtest-net completed 1233066663.596373 0.001567
TIME TOTAL 2.992383
-Gavin
From: Josh Luthman [mailto:josh (at) imaginenetworksllc.com]
Sent: Tuesday, January 13, 2009 10:31 AM
To: hobbit (at) hswn.dk
Subject: Re: [hobbit] nightly reboots
HOST=%.*\.imaginenetworksllc\.com
MAIL 1231231234 (at) txt.att.net<mailto:1231231234 (at) txt.att.net> COLOR=RED DURATION>2m REPEAT=60 RECOVERED FORMAT=SMS
Like that
Josh Luthman
Office: 937-552-2340
Direct: 937-552-2343
1100 Wayne St
Suite 1337
Troy, OH 45373
Those who don't understand UNIX are condemned to reinvent it, poorly.
--- Henry Spencer
On Tue, Jan 13, 2009 at 12:25 PM, Gavin Leonard <gleonard (at) progrexion.com<mailto:gleonard (at) progrexion.com>> wrote:
Ok.. so how do you delay the red alerts? I am wondering if I am just over loading this system... I may need to build another bb server so I can split up the work load a bit.. thanks in advance!!
-Gavin
-----Original Message-----
From: Josh Luthman [mailto:josh (at) imaginenetworksllc.com<mailto:josh (at) imaginenetworksllc.com>]
Sent: Tuesday, January 13, 2009 9:49 AM
To: hobbit (at) hswn.dk<mailto:hobbit (at) hswn.dk>
Subject: Re: [hobbit] nightly reboots
I have had the problem where the conn test goes bad for everything
(not every host, just groups based on bb-hosts) since I installed it
at the office. No idea why :(
What I do is delay the red sms alerts by a few minutes as it is red
for only a few seconds, sometimes a minute.
On 1/13/09, Gavin Leonard <gleonard (at) progrexion.com<mailto:gleonard (at) progrexion.com>> wrote:
> All,
> I am having an issue where my hobbit server thinks that
> every server it monitors has been rebooted, so I get blasted with sms
> messages when this happens. And none of the servers have actually rebooted
> nor has there been any network outages.. ideas?thoughts?
>
>
>
>
>
> Gavin Leonard
>
> [cid:image001.gif@01C97562.5DF2D550]
>
> Director, Systems-Network Engineering
>
> T
>
> 801-828-1735
>
> F
>
> 801-828-1704
>
> E
>
> gleonard (at) progrexion.com<mailto:gleonard (at) progrexion.com><mailto:gleonard (at) progrexion.com<mailto:gleonard (at) progrexion.com>>
>
>
>
>
>
>
>
>
> Research | Marketing | Sales Generation
>
> www.progrexion.com<http://www.progrexion.com><http://www.progrexion.com/>
>
>
>
>
> This email and its contents are confidential. If you are not the intended
> recipient, delete this email and do not use or disclose the information
> within this email or its attachments. Thank you.
>
>
>
>
>
--
Josh Luthman
Office: 937-552-2340
Direct: 937-552-2343
1100 Wayne St
Suite 1337
Troy, OH 45373
Those who don't understand UNIX are condemned to reinvent it, poorly.
--- Henry Spencer
To unsubscribe from the hobbit list, send an e-mail to
hobbit-unsubscribe (at) hswn.dk<mailto:hobbit-unsubscribe (at) hswn.dk>
To unsubscribe from the hobbit list, send an e-mail to
hobbit-unsubscribe (at) hswn.dk<mailto:hobbit-unsubscribe (at) hswn.dk>