[hobbit] Xymon crash after adding too many hosts?

e-mail j.sansford j.sansford at ntlworld.com
Fri May 28 22:06:53 CEST 2010


Oh how blind I am...I just noticed those in h3.conf have the same hostnames
as those in h2.conf! That makes a lot of sense! Think I have answered my own
question...

Thanks anyway!

On 28 May 2010 21:02, e-mail j.sansford <j.sansford at ntlworld.com> wrote:

> Sure, they are in 3 different configuration files which are included in the
> main bb-hosts file. Also note that they are currently commented out due to
> it causing the issue!:
>
>
> h1.conf:
> #subpage memcached Memcached
> #10.6.73.1 h1-mem01     # NET:H1 ping ssh
> #10.6.73.2 h1-mem02     # NET:H1 ping ssh
> #10.6.73.3 h1-mem03     # NET:H1 ping ssh
> #10.6.73.4 h1-mem04     # NET:H1 ping ssh
>
> h2.conf:
> #subpage memcached Memcached
> #10.7.73.1 h2-mem01      # NET:H2 ping ssh
> #10.7.73.2 h2-mem02      # NET:H2 ping ssh
> #10.7.73.3 h2-mem03      # NET:H2 ping ssh
> #10.7.73.4 h2-mem04      # NET:H2 ping ssh
>
> h3.conf:
> #subpage memcached Memcached
> #10.8.73.1 h2-mem01      # NET:H3 ping ssh
> #10.8.73.2 h2-mem02      # NET:H3 ping ssh
> #10.8.73.3 h2-mem03      # NET:H3 ping ssh
> #10.8.73.4 h2-mem04      # NET:H3 ping ssh
>
>
> (I thought there were 15 new hosts, but it looks like there are only 12). I
> can't see any syntax issues here?
>
>
> On 28 May 2010 20:27, Olivier Beau <obeau79 at gmail.com> wrote:
>
>> Hi,
>>
>> Could you give us those 15 hosts definitions ?
>> (i suspect there might be a syntax error..)
>>
>> try adding them one by one
>>
>>
>> Olivier.
>>
>> e-mail j.sansford a écrit :
>>
>>  Hi there,
>>>
>>> We run Xymon 4.2.3, using both the proxy and the server in a multisite
>>> configuration. We recently added 15 new hosts to our configuration, and
>>> since doing so the server application appears to keep crashing. Running "ps
>>> -ef | grep hobbitlaunch" shows that this process no longer exists. Looking
>>> in the log files, it appears "rrd status" kept terminating with status 1.
>>> Since removing these hosts from the configuration, the server has managed to
>>> start up again.
>>>
>>> I don't believe there is anything strange in the definition of these
>>> hosts - so have we reached a memory limit or otherwise in the application,
>>> and if so is there a fix for this? Such as increasing a variable somewhere?
>>>
>>>
>>> Here's the output of bbtest if it helps:
>>>
>>> bbtest-net version 4.2.3
>>> SSL library : OpenSSL 0.9.8k 25 Mar 2009
>>> LDAP library: OpenLDAP 20416
>>>
>>> Statistics:
>>>  Hosts total           :      159
>>>
>>>  Hosts with no tests   :        4
>>>  Total test count      :      332
>>>  Status messages       :      333
>>>  Alert status msgs     :        0
>>>  Transmissions         :        4
>>>
>>> DNS statistics:
>>>  # hostnames resolved  :      189
>>>
>>>  # succesful           :      154
>>>  # failed              :        1
>>>  # calls to dnsresolve :      332
>>>
>>> TCP test statistics:
>>>  # TCP tests total     :      174
>>>  # HTTP tests          :       34
>>>  # Simple TCP tests    :      140
>>>
>>>  # Connection attempts :      174
>>>  # bytes written       :     6828
>>>  # bytes read          :   585619
>>>
>>>
>>> TIME SPENT
>>> Event                                            Starttime
>>>  Duration
>>> bbtest-net startup                            97953.286121
>>>   -
>>>
>>> Service definitions loaded                    97953.288188
>>>  0.002067 Tests loaded                                  97953.299493
>>>  0.011304 DNS lookups completed                         97953.328789
>>>  0.029296
>>> Test engine setup completed                   97953.344493
>>>  0.015703 TCP tests completed                           97965.450124
>>> 12.105631 PING test completed (155 hosts)               97965.451495
>>>  0.001370
>>> PING test results sent                        97965.454795
>>>  0.003299 Test result collection completed              97965.454907
>>>  0.000112 LDAP test engine setup completed              97965.454908
>>>  0.000000
>>> LDAP tests executed                           97965.454909
>>>  0.000000 LDAP tests result collection completed        97965.454909
>>>  0.000000 DNS tests executed                            97965.458490
>>>  0.003580
>>> NTP tests executed                            97965.572899
>>>  0.114409 Test results transmitted                      97965.577715
>>>  0.004815 bbtest-net completed                          97965.581260
>>>  0.003545
>>>
>>>
>>> Thanks!
>>> TIME TOTAL
>>> 12.295138
>>>
>>
>>
>> To unsubscribe from the hobbit list, send an e-mail to
>> hobbit-unsubscribe at hswn.dk
>>
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20100528/98ff9e37/attachment.html>


More information about the Xymon mailing list