[hobbit] Problems with writing client data while running several clients

Rolf Schrittenlocher Schrittenlocher at rz.uni-frankfurt.de
Wed Jan 18 15:33:52 CET 2006


Hi Mathias,

there are no identical entries in bb-hosts. I'm almost sure that the 
client info send is using the correct names, once for the hostname and 
once for each alias, because there is the accurate data in 
hobbit/hobbit/data for both names before and after the period the log 
indicates failure. So it is a temporary problem (it occurred for about 
half an hour, when everything went green again). I have the impression 
that if two hosts are sending data at almost the same time hobbit 
doesn't like that. If they do so for a period>status bbdisplay goes 
purple. Once the point of time those two are sending differs for at 
least one second (if my interpretation of the message in the log is 
right) data is stored and everything goes back green. My proposition is 
to make the server wait for 1 second before writing if he can't instead 
of writing the information to a log but perhaps this is infeasable.

kind regards
Rolf 

>Hi Rolf,
>
>have you looked into your hobbit-log files?
>New with hobbit i had the same problem and found out
>that this happens when to identical (in my case conn-tests)
>run on the server and collect the information for the
>same machine:
>
>page core-router
>1.2.3.4  router1 #
>[..]
>page locationA
>1.2.3.4  router1 #
>[..]
>Maybe you find identical lines testing the same in your bb-hosts.
>
>Are the service you are monitoring for the virtual hosts equaly named?
>
>Best
>  Mathias
>
>In message <43CE350C.7020404 at rz.uni-frankfurt.de>, Rolf Schrittenlocher writes:
>  
>
>>Hi,
>>
>>we are running 1 to 3 clients on each host, one for $hostname and one 
>>for each virtual host defined on the machine in order to get client data 
>>like disk, cpu, etc. for $hostname and for each virtual host. Each 
>>client is started using
>>$HOBBITCLIENTHOME/bin/hobbitlaunch 
>>--config=$HOBBITCLIENTHOME/etc/clientlaunch.${MACHINE}.cfg 
>>--log=$HOBBITCLIENTHOME/logs/clientlaunch.${MACHINE}.log 
>>--pidfile=$HOBBITCLIENTHOME/logs/clientlaunch.${MACHINE}.pid
>>
>>Now I have the problem that client data of one or both clients is 
>>neglected by the server. In rrd-data.log it shows:
>>2006-01-18 13:03:40 RRD error updating 
>>/pica/ffm/com/sys/hobbit/hobbit/data/rrd/lbscgi2/netstat.rrd from 
>>141.2.164.215: illegal attempt to update using time 1137585820 when last 
>>update time is 1137585820 (minimum one second step)
>>In rrd-status.log are similar entries like
>>2006-01-18 12:48:38 RRD error updating 
>>/pica/ffm/com/sys/hobbit/hobbit/data/rrd/lbscgi2/memory.real.rrd from 
>>141.2.164.215: illegal attempt to update using time 1137584918 when last 
>>update time is 1137584918 (minimum one second step)
>>
>>The client data (disk, etc) for both becomes purple in bb.html.
>>Date for both clients is stored (sometimes) correctly:
>>hobbit/hobbit/data/rrd/otter
>>-rw-r--r--   1 bb       prod       19576 Jan 18 12:43 memory.real.rrd
>>hobbit/hobbit/data/rrd/lbscgi2
>>-rw-r--r--   1 bb       prod       19576 Jan 18 12:58 memory.real.rrd
>>
>>I tried starting the clients with some seconds difference, that helps 
>>for a certain time and then the problem reappears. What can I do?
>>
>>kind regards
>>Rolf
>>
>>-- 
>>Mit freundlichen Gruessen
>>Rolf Schrittenlocher
>>
>>HRZ/BDV, Senckenberganlage 31, 60054 Frankfurt 
>>Tel: (49) 69 - 798 28908   Fax: (49) 69 798 2881
>>LBS: lbs-f at mlist.uni-frankfurt.de
>>Persoenlich: schritte at rz.uni-frankfurt.de
>>
>>
>>
>>To unsubscribe from the hobbit list, send an e-mail to
>>hobbit-unsubscribe at hswn.dk
>>
>>
>>
>>    
>>
>
>To unsubscribe from the hobbit list, send an e-mail to
>hobbit-unsubscribe at hswn.dk
>
>
>
>  
>


-- 
Mit freundlichen Gruessen
Rolf Schrittenlocher

HRZ/BDV, Senckenberganlage 31, 60054 Frankfurt 
Tel: (49) 69 - 798 28908   Fax: (49) 69 798 2881
LBS: lbs-f at mlist.uni-frankfurt.de
Persoenlich: schritte at rz.uni-frankfurt.de





More information about the Xymon mailing list