[hobbit] Xymon crash after adding too many hosts?

Josh Luthman josh at imaginenetworksllc.com
Fri May 28 22:08:50 CEST 2010


Try just one page at a time?  Not sure if you need "ping" on there.

On 5/28/10, e-mail j.sansford <j.sansford at ntlworld.com> wrote:
> Sure, they are in 3 different configuration files which are included in the
> main bb-hosts file. Also note that they are currently commented out due to
> it causing the issue!:
>
>
> h1.conf:
> #subpage memcached Memcached
> #10.6.73.1 h1-mem01     # NET:H1 ping ssh
> #10.6.73.2 h1-mem02     # NET:H1 ping ssh
> #10.6.73.3 h1-mem03     # NET:H1 ping ssh
> #10.6.73.4 h1-mem04     # NET:H1 ping ssh
>
> h2.conf:
> #subpage memcached Memcached
> #10.7.73.1 h2-mem01      # NET:H2 ping ssh
> #10.7.73.2 h2-mem02      # NET:H2 ping ssh
> #10.7.73.3 h2-mem03      # NET:H2 ping ssh
> #10.7.73.4 h2-mem04      # NET:H2 ping ssh
>
> h3.conf:
> #subpage memcached Memcached
> #10.8.73.1 h2-mem01      # NET:H3 ping ssh
> #10.8.73.2 h2-mem02      # NET:H3 ping ssh
> #10.8.73.3 h2-mem03      # NET:H3 ping ssh
> #10.8.73.4 h2-mem04      # NET:H3 ping ssh
>
>
> (I thought there were 15 new hosts, but it looks like there are only 12). I
> can't see any syntax issues here?
>
> On 28 May 2010 20:27, Olivier Beau <obeau79 at gmail.com> wrote:
>
>> Hi,
>>
>> Could you give us those 15 hosts definitions ?
>> (i suspect there might be a syntax error..)
>>
>> try adding them one by one
>>
>>
>> Olivier.
>>
>> e-mail j.sansford a écrit :
>>
>>  Hi there,
>>>
>>> We run Xymon 4.2.3, using both the proxy and the server in a multisite
>>> configuration. We recently added 15 new hosts to our configuration, and
>>> since doing so the server application appears to keep crashing. Running
>>> "ps
>>> -ef | grep hobbitlaunch" shows that this process no longer exists.
>>> Looking
>>> in the log files, it appears "rrd status" kept terminating with status 1.
>>> Since removing these hosts from the configuration, the server has managed
>>> to
>>> start up again.
>>>
>>> I don't believe there is anything strange in the definition of these
>>> hosts
>>> - so have we reached a memory limit or otherwise in the application, and
>>> if
>>> so is there a fix for this? Such as increasing a variable somewhere?
>>>
>>>
>>> Here's the output of bbtest if it helps:
>>>
>>> bbtest-net version 4.2.3
>>> SSL library : OpenSSL 0.9.8k 25 Mar 2009
>>> LDAP library: OpenLDAP 20416
>>>
>>> Statistics:
>>>  Hosts total           :      159
>>>
>>>  Hosts with no tests   :        4
>>>  Total test count      :      332
>>>  Status messages       :      333
>>>  Alert status msgs     :        0
>>>  Transmissions         :        4
>>>
>>> DNS statistics:
>>>  # hostnames resolved  :      189
>>>
>>>  # succesful           :      154
>>>  # failed              :        1
>>>  # calls to dnsresolve :      332
>>>
>>> TCP test statistics:
>>>  # TCP tests total     :      174
>>>  # HTTP tests          :       34
>>>  # Simple TCP tests    :      140
>>>
>>>  # Connection attempts :      174
>>>  # bytes written       :     6828
>>>  # bytes read          :   585619
>>>
>>>
>>> TIME SPENT
>>> Event                                            Starttime
>>>  Duration
>>> bbtest-net startup                            97953.286121
>>> -
>>>
>>> Service definitions loaded                    97953.288188
>>>  0.002067 Tests loaded                                  97953.299493
>>>  0.011304 DNS lookups completed                         97953.328789
>>>  0.029296
>>> Test engine setup completed                   97953.344493
>>>  0.015703 TCP tests completed                           97965.450124
>>> 12.105631 PING test completed (155 hosts)               97965.451495
>>>  0.001370
>>> PING test results sent                        97965.454795
>>>  0.003299 Test result collection completed              97965.454907
>>>  0.000112 LDAP test engine setup completed              97965.454908
>>>  0.000000
>>> LDAP tests executed                           97965.454909
>>>  0.000000 LDAP tests result collection completed        97965.454909
>>>  0.000000 DNS tests executed                            97965.458490
>>>  0.003580
>>> NTP tests executed                            97965.572899
>>>  0.114409 Test results transmitted                      97965.577715
>>>  0.004815 bbtest-net completed                          97965.581260
>>>  0.003545
>>>
>>>
>>> Thanks!
>>> TIME TOTAL
>>> 12.295138
>>>
>>
>>
>> To unsubscribe from the hobbit list, send an e-mail to
>> hobbit-unsubscribe at hswn.dk
>>
>>
>>
>


-- 
Josh Luthman
Office: 937-552-2340
Direct: 937-552-2343
1100 Wayne St
Suite 1337
Troy, OH 45373

“Success is not final, failure is not fatal: it is the courage to
continue that counts.”
--- Winston Churchill



More information about the Xymon mailing list