[hobbit] bbtest - errors
Michael A. Price
mprice at sgt-inc.com
Mon Dec 31 16:39:49 CET 2007
Nice sleuthing...
It looks like the ball is back in my court. The trace command never seems
to end when run at the command line. I will do some research.
Thanks, michael
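
A minimal sketch (Python, not part of Hobbit) for checking how long a trace to the downed host takes without letting it run forever. The function name run_with_timeout, the 30-second limit and the traceroute command line in the closing comment are assumptions for illustration; substitute whatever trace command the server actually runs.

```python
import subprocess
import sys
import time


def run_with_timeout(cmd, timeout_s):
    """Run cmd and return (finished, elapsed_seconds, captured_stdout).

    finished is False when the command had to be killed after timeout_s.
    """
    start = time.monotonic()
    try:
        proc = subprocess.run(cmd, capture_output=True, text=True, timeout=timeout_s)
        finished, output = True, proc.stdout
    except subprocess.TimeoutExpired as exc:
        # subprocess.run has already killed the child at this point.
        finished, output = False, exc.stdout or ""
        if isinstance(output, bytes):
            # Partial output may arrive as bytes even with text=True.
            output = output.decode(errors="replace")
    elapsed = time.monotonic() - start
    return finished, elapsed, output


# Hypothetical usage (host name and command are placeholders):
#   finished, elapsed, out = run_with_timeout(["traceroute", "downed.host.example"], 30)
#   print("finished" if finished else "killed", "after %.1f s" % elapsed)
```

If the command only ends because it was killed, that would be consistent with the "Timeout waiting for data from child, killing it" lines in the reports quoted below.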
-----Original Message-----
From: Josh Luthman [mailto:josh at imaginenetworksllc.com]
Sent: Monday, December 31, 2007 10:27 AM
To: hobbit at hswn.dk
Subject: Re: [hobbit] bbtest - errors
Can you do a trace at the shell?
On 12/31/07, Michael A. Price <mprice at sgt-inc.com> wrote:
> Josh,
>
> I just figured out it's the #trace option. When I remove that option the
> errors go away...
>
> Thanks, michael
>
>
>
>
> -----Original Message-----
> From: Michael A. Price
> Sent: Monday, December 31, 2007 7:35 AM
> To: hobbit at hswn.dk
> Subject: RE: [hobbit] bbtest - errors
>
> Josh,
>
> Thanks for help, AGAIN.... One step closer...
>
> I have one host down, and I have the trace option on all of my hosts listed
> in bb-hosts. When I comment out that downed host, the errors clear up in
> bbtest. Take a look...
> -----------------------------------------------------
> Mon Dec 31 12:22:16 2007
>
> bbtest-net version 4.2.0
> SSL library : OpenSSL 0.9.7m 23 Feb 2007
> LDAP library: OpenLDAP 20213
>
> Statistics:
> Hosts total : 310
> Hosts with no tests : 7
> Total test count : 307
> Status messages : 308
> Alert status msgs : 0
> Transmissions : 5
>
> DNS statistics:
> # hostnames resolved : 303
> # succesful : 303
> # failed : 0
> # calls to dnsresolve : 307
>
> TCP test statistics:
> # TCP tests total : 2
> # HTTP tests : 1
> # Simple TCP tests : 1
> # Connection attempts : 2
> # bytes written : 135
> # bytes read : 553
>
>
> TIME SPENT
> Event                                   Starttime           Duration
> bbtest-net startup                      1199103736.384784          -
> Service definitions loaded              1199103736.385887   0.001103
> Tests loaded                            1199103736.768919   0.383032
> DNS lookups completed                   1199103736.768928   0.000009
> Test engine setup completed             1199103736.772261   0.003333
> TCP tests completed                     1199103736.773300   0.001039
> PING test completed (303 hosts)         1199103755.089536  18.316236
> PING test results sent                  1199103755.091233   0.001697
> Test result collection completed        1199103755.091241   0.000008
> LDAP test engine setup completed        1199103755.091245   0.000004
> LDAP tests executed                     1199103755.091249   0.000004
> LDAP tests result collection completed  1199103755.091252   0.000003
> NSLOOKUP tests executed                 1199103755.095923   0.004671
> Test results transmitted                1199103755.098103   0.002180
> bbtest-net completed                    1199103755.099180   0.001077
> TIME TOTAL                                                 18.714396
>
> --------------------------------------------
>
> But once I uncomment the host and the hobbit server tries to do a
> traceroute to it, the errors come back again, even if I disable
> alerting for that host. Take a look....
>
> ----------------------------------------
>
> Mon Dec 31 12:32:24 2007
>
> bbtest-net version 4.2.0
> SSL library : OpenSSL 0.9.7m 23 Feb 2007
> LDAP library: OpenLDAP 20213
>
> Statistics:
> Hosts total : 311
> Hosts with no tests : 7
> Total test count : 308
> Status messages : 309
> Alert status msgs : 0
> Transmissions : 5
>
> DNS statistics:
> # hostnames resolved : 304
> # succesful : 304
> # failed : 0
> # calls to dnsresolve : 308
>
> TCP test statistics:
> # TCP tests total : 2
> # HTTP tests : 1
> # Simple TCP tests : 1
> # Connection attempts : 2
> # bytes written : 135
> # bytes read : 553
>
>
> Error output:
> Timeout waiting for data from child, killing it
> Timeout waiting for data from child, killing it
> Child process terminated with signal 15
>
>
> TIME SPENT
> Event                                   Starttime           Duration
> bbtest-net startup                      1199104344.425092          -
> Service definitions loaded              1199104344.426152   0.001060
> Tests loaded                            1199104344.543955   0.117803
> DNS lookups completed                   1199104344.543964   0.000009
> Test engine setup completed             1199104344.547454   0.003490
> TCP tests completed                     1199104344.548434   0.000980
> PING test completed (304 hosts)         1199104369.082520  24.534086
> PING test results sent                  1199104399.089988  30.007468
> Test result collection completed        1199104399.090003   0.000015
> LDAP test engine setup completed        1199104399.090007   0.000004
> LDAP tests executed                     1199104399.090011   0.000004
> LDAP tests result collection completed  1199104399.090015   0.000004
> NSLOOKUP tests executed                 1199104399.095563   0.005548
> Test results transmitted                1199104399.097862   0.002299
> bbtest-net completed                    1199104399.098975   0.001113
> TIME TOTAL                                                 54.673883
>
> -------------------------------------
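
Comparing the two reports above, the step that changes most is "PING test results sent" (0.001697 s with the downed host commented out, 30.007468 s with it enabled). A small Python sketch for pulling the slowest steps out of a pasted TIME SPENT table; the names ROW, parse_time_spent and slowest are made up for this example and are not part of bbtest-net, and the sample rows are copied from the second report.

```python
import re

# One table row: event name, start time, then a duration or "-".
ROW = re.compile(r"^(?P<event>.+?)\s+(?P<start>\d+\.\d+)\s+(?P<duration>\d+\.\d+|-)\s*$")

SAMPLE = """\
> TCP tests completed               1199104344.548434   0.000980
> PING test completed (304 hosts)   1199104369.082520  24.534086
> PING test results sent            1199104399.089988  30.007468
> Test result collection completed  1199104399.090003   0.000015
> TIME TOTAL                                           54.673883
"""


def parse_time_spent(text):
    """Return a list of (event, duration_seconds) from a TIME SPENT table."""
    rows = []
    for line in text.splitlines():
        line = line.lstrip("> ").rstrip()  # drop mail quote markers
        m = ROW.match(line)
        # Header and TIME TOTAL lines do not match; the startup row has "-".
        if m and m.group("duration") != "-":
            rows.append((m.group("event"), float(m.group("duration"))))
    return rows


def slowest(rows, n=2):
    """Return the n rows with the largest durations, slowest first."""
    return sorted(rows, key=lambda r: r[1], reverse=True)[:n]


for event, seconds in slowest(parse_time_spent(SAMPLE)):
    print("%-34s %10.6f" % (event, seconds))
```

The parser strips leading ">" characters so a report can be pasted straight from a quoted mail.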
>
> Any ideas why it's doing this, or how to resolve it?
>
> Thanks, michael
>
>
>
>
>
>
> ________________________________________
> From: Josh Luthman [mailto:josh at imaginenetworksllc.com]
> Sent: Friday, December 28, 2007 5:30 PM
> To: hobbit at hswn.dk
> Subject: Re: [hobbit] bbtest - errors
>
> Try Henrik's fping command at the bottom of this page:
>
> http://www.hswn.dk/hobbiton/2007/11/msg00069.html
>
> and stick a time in front to see how long it takes.
> On 12/28/07, Michael A. Price <mprice at sgt-inc.com> wrote:
> Josh,
>
> Thanks for getting back to me so quickly. I updated my /etc/hosts file to
> have every single one of my monitored hosts, just as a test. I now have no
> 'failed hosts' in my DNS statistics, but my 'PING test results sent' times
> are still off the charts. I still can't figure out the problem...
>
> bbtest-net version 4.2.0
> SSL library : OpenSSL 0.9.7m 23 Feb 2007
> LDAP library: OpenLDAP 20213
>
> Statistics:
> Hosts total : 311
> Hosts with no tests : 7
> Total test count : 308
> Status messages : 309
> Alert status msgs : 0
> Transmissions : 5
>
> DNS statistics:
> # hostnames resolved : 304
> # succesful : 304
> # failed : 0
> # calls to dnsresolve : 308
>
> TCP test statistics:
> # TCP tests total : 2
> # HTTP tests : 1
> # Simple TCP tests : 1
> # Connection attempts : 2
> # bytes written : 135
> # bytes read : 553
>
>
> Error output:
> Timeout waiting for data from child, killing it
> Timeout waiting for data from child, killing it
> Child process terminated with signal 15
>
>
> TIME SPENT
> Event                                   Starttime           Duration
> bbtest-net startup                      1198875012.330887          -
> Service definitions loaded              1198875012.331984   0.001097
> Tests loaded                            1198875012.405015   0.073031
> DNS lookups completed                   1198875012.405024   0.000009
> Test engine setup completed             1198875012.408543   0.003519
> TCP tests completed                     1198875012.409325   0.000782
> PING test completed (304 hosts)         1198875037.083126  24.673801
> PING test results sent                  1198875067.092719  30.009593
> Test result collection completed        1198875067.092733   0.000014
> LDAP test engine setup completed        1198875067.092737   0.000004
> LDAP tests executed                     1198875067.092741   0.000004
> LDAP tests result collection completed  1198875067.092745   0.000004
> NSLOOKUP tests executed                 1198875067.096007   0.003262
> Test results transmitted                1198875067.098247   0.002240
> bbtest-net completed                    1198875067.099155   0.000908
> TIME TOTAL                                                 54.768268
>
> Thanks, michael
> ________________________________________
> From: Josh Luthman [mailto:josh at imaginenetworksllc.com]
> Sent: Thursday, December 27, 2007 11:15 AM
> To: hobbit at hswn.dk
> Subject: Re: [hobbit] bbtest - errors
>
> Michael,
>
> Try adding "testip" after the comment marker (#) in as many hosts as
> possible, e.g.:
>
> 10.0.0.250 myftp.server.com # testip
>
> Josh
> On 12/27/07, Michael A. Price <mprice at sgt-inc.com> wrote:
> I just modified the /etc/nsswitch.conf file to remove DNS.
>
> I find it interesting that whether the hobbit server uses DNS servers
> or local host files to look up the hosts, the 'PING test results sent'
> number is still off the charts.
>
> Thanks so much for getting back to me.
>
> Thanks, michael
>
>
>
> From: Josh Luthman [mailto: josh at imaginenetworksllc.com]
> Sent: Wednesday, December 26, 2007 6:00 PM
> To: hobbit at hswn.dk
> Subject: Re: [hobbit] bbtest - errors
>
> Your calls to dnsresolve went up one. How in the world did you "[update] the
> hobbit server to not use the DNS servers"?
>
> It looks to me like it is still doing the exact same stuff concerning DNS...
> On 12/26/07, Michael A. Price <mprice at sgt-inc.com> wrote:
> Thanks for getting back to me on this.
>
> I updated the hobbit server to not use the DNS servers and all that does is
> cause it to go from 100 failed hosts to 299 failed hosts.
>
> I think it's the large "PING test results sent" number; what else could be
> the problem?
>
> Here is another printout...
>
> Thanks, michael
>
> ---------------------------------------
>
>
> bbtest-net version 4.2.0
> SSL library : OpenSSL 0.9.7m 23 Feb 2007
> LDAP library: OpenLDAP 20213
>
> Statistics:
> Hosts total : 311
> Hosts with no tests : 7
> Total test count : 308
> Status messages : 309
> Alert status msgs : 0
> Transmissions : 5
>
> DNS statistics:
> # hostnames resolved : 304
> # succesful : 203
> # failed : 101
> # calls to dnsresolve : 308
>
> TCP test statistics:
> # TCP tests total : 2
> # HTTP tests : 1
> # Simple TCP tests : 1
> # Connection attempts : 2
> # bytes written : 135
> # bytes read : 553
>
>
> Error output:
> Timeout waiting for data from child, killing it
> Timeout waiting for data from child, killing it
> Child process terminated with signal 15
>
>
> TIME SPENT
> Event                                   Starttime           Duration
> bbtest-net startup                      1198691205.281738          -
> Service definitions loaded              1198691205.282850   0.001112
> Tests loaded                            1198691205.316420   0.033570
> DNS lookups completed                   1198691215.446830  10.130410
> Test engine setup completed             1198691215.450594   0.003764
> TCP tests completed                     1198691215.451393   0.000799
> PING test completed (304 hosts)         1198691240.081987  24.630594
> PING test results sent                  1198691270.090627  30.008640
> Test result collection completed        1198691270.090642   0.000015
> LDAP test engine setup completed        1198691270.090656   0.000014
> LDAP tests executed                     1198691270.090660   0.000004
> LDAP tests result collection completed  1198691270.090663   0.000003
> NSLOOKUP tests executed                 1198691270.146990   0.056327
> Test results transmitted                1198691270.149410   0.002420
> bbtest-net completed                    1198691270.150271   0.000861
> TIME TOTAL                                                 64.868533
>
> ________________________________________
> From: Josh Luthman [mailto: josh at imaginenetworksllc.com]
> Sent: Thursday, December 20, 2007 11:04 AM
> To: hobbit at hswn.dk
> Subject: Re: [hobbit] bbtest - errors
>
> If that was the only change you made recently try switching the DNS servers
> back to see if the problem disappears.
> On 12/20/07, Michael A. Price < mprice at sgt-inc.com> wrote:
> Thanks...
>
> Actually, I updated my DNS servers and went from 300 failed lookups to 100.
> So I thought I was going to improve....
>
> But it got worse!!!! Any other ideas???
>
> Thanks, michael
>
> ________________________________________
> From: Josh Luthman [mailto: josh at imaginenetworksllc.com]
> Sent: Thursday, December 20, 2007 8:10 AM
> To: hobbit at hswn.dk
> Subject: Re: [hobbit] bbtest - errors
>
> # failed : 100 <--- may be the cause, lots of failed DNS queries
> On 12/19/07, Michael A. Price < mprice at sgt-inc.com> wrote:
>
>
> My bbtest time went from 10 seconds to 89.0 seconds....
>
> Has anyone seen this before???
>
> Wed Dec 19 19:15:55 2007
>
> bbtest-net version 4.2.0
> SSL library : OpenSSL 0.9.7m 23 Feb 2007
> LDAP library: OpenLDAP 20213
>
> Statistics:
> Hosts total : 310
> Hosts with no tests : 7
> Total test count : 307
> Status messages : 308
> Alert status msgs : 0
> Transmissions : 5
>
> DNS statistics:
> # hostnames resolved : 303
> # succesful : 203
> # failed : 100
> # calls to dnsresolve : 307
>
> TCP test statistics:
> # TCP tests total : 2
> # HTTP tests : 1
> # Simple TCP tests : 1
> # Connection attempts : 2
> # bytes written : 135
> # bytes read : 553
>
>
> Error output:
> Timeout waiting for data from child, killing it
> Timeout waiting for data from child, killing it
> Child process terminated with signal 15
> Timeout waiting for data from child, killing it
> Timeout waiting for data from child, killing it
> Child process terminated with signal 15
>
>
> TIME SPENT
> Event                                   Starttime           Duration
> bbtest-net startup                      1198091755.294810          -
> Service definitions loaded              1198091755.297812   0.003002
> Tests loaded                            1198091755.346908   0.049096
> DNS lookups completed                   1198091765.439050  10.092142
> Test engine setup completed             1198091765.442685   0.003635
> TCP tests completed                     1198091765.443457   0.000772
> PING test completed (303 hosts)         1198091790.084027  24.640570
> PING test results sent                  1198091850.102236  60.018209
> Test result collection completed        1198091850.102455   0.000219
> LDAP test engine setup completed        1198091850.102472   0.000017
> LDAP tests executed                     1198091850.102475   0.000003
> LDAP tests result collection completed  1198091850.102482   0.000007
> NSLOOKUP tests executed                 1198091850.111523   0.009041
> Test results transmitted                1198091850.118622   0.007099
> bbtest-net completed                    1198091850.120484   0.001862
> TIME TOTAL                                                 94.825674
>
>
> Thanks, michael
>
> To unsubscribe from the hobbit list, send an e-mail to
> hobbit-unsubscribe at hswn.dk
>
>
>
> --
> Josh Luthman
> Office: 937-552-2340
> Direct: 937-552-2343
> 1100 Wayne St
> Suite 1337
> Troy, OH 45373
>
> Those who don't understand UNIX are condemned to reinvent it, poorly.
> --- Henry Spencer