[hobbit] hobbitd coredumping and purple trends

Terry Barnes tbarnes1 at hfhs.org
Sun Apr 3 20:21:25 CEST 2005


I experienced same thing after making some changes to hobbit - might be
a longshot, but here is what caused this for me.

After restarting hobbit and receiving the same as you, found that some
hobbit processes were hung. If I stopped hobbit - could still see most
processes were still running. Even after multiple attempt to do a
~/server/hobbit.sh stop, the processes continued to run. Killed those
processes and restart hobbit - problem solved.

Like I say - could be a longshot, but worth a look.

Terry Barnes
Siemens Com @ HFHS
248-853-4968 (Office)
586-405-8382 (Cellular)
248-844-3030 (Fax)
5864058382 at messaging.nextel.com (Text Pager)
tbarnes1 at hfhs.org

>>> rdeal at tigr.org 4/1/05 5:22:42 PM >>>
My hobbitd is core dumping every so often and less often but still
occasional the trends column turns purple.

Looking through the makefile the only oddity is MAXMSG=32768
Were my old BBd was set to #define MAXLINE  11264

I have core files in /tmp from hobbitd 

Logs :


> more bb-display.log 
2005-04-01 15:47:59 Whoops ! bb failed to send message - timeout
2005-04-01 16:02:59 Whoops ! bb failed to send message - timeout
2005-04-01 16:03:00 connect to bbd failed - Connection refused
2005-04-01 16:03:00 Whoops ! bb failed to send message - Connection
failed
2005-04-01 16:03:00 connect to bbd failed - Connection refused
2005-04-01 16:03:00 Whoops ! bb failed to send message - Connection
failed
2005-04-01 16:03:00 connect to bbd failed - Connection refused
2005-04-01 16:03:00 Whoops ! bb failed to send message - Connection
failed
2005-04-01 16:18:05 Whoops ! bb failed to send message - timeout
2005-04-01 17:03:08 Whoops ! bb failed to send message - timeout

> more hobbitd.log
2005-04-01 15:32:47 Setup complete
2005-04-01 15:32:54 Setup complete
2005-04-01 15:48:01 Setup complete
2005-04-01 16:03:01 Setup complete
2005-04-01 16:33:03 Setup complete
2005-04-01 16:48:04 Setup complete

I have a lot of these errors in larrd-data.log from various hosts.
2005-04-01 17:17:53 RRD error updating
/local/packages/IT/HOBBIT/hobbit/data/rrd/ray1.tigr.org/netstat.rrd
from
172.17.10.20: expected 12 data source readings (got 16) from
1112393873:597496849:203665680:0:1400608:474490:380897:4323:190:65584910
3:2750185864:9271815:54370878:358842800:919424657:55608:57615:...
2005-04-01 17:18:15 RRD error updating
/local/packages/IT/HOBBIT/hobbit/data/rrd/akela.tigr.org/netstat.rrd
from 172.17.10.87: expected 12 data source readings (got 16) from
1112393894:7278664:4601574:0:2187293:80558:15408:1028:18:3786687185:3319
9304:551592:3055134:392628802:534540232:12324:8938:...
2005-04-01 17:18:22 RRD error updating
/local/packages/IT/HOBBIT/hobbit/data/rrd/vader.tigr.org/netstat.rrd
from 172.16.4.50: expected 12 data source readings (got 16) from
1112393902:844147:844153:0:173177:11681993:15774:1756237:109:2946405093:
1171800154:1508:44541250:1263968085:53592252:29:1305303:...
2005-04-01 17:18:49 RRD error updating
/local/packages/IT/HOBBIT/hobbit/data/rrd/invino.tigr.org/netstat.rrd
from 172.17.10.29: expected 12 data source readings (got 16) from
1112393929:161474660:161355279:0:979032:1013326:8108:2751:26:3077107260:
3115145104:3779497608:1171327:3474031250:2366740414:176290878:15382:...

I used the moverrd.sh .


And these errors from lard-status.log:
005-04-01 17:18:10 RRD error updating
/local/packages/IT/HOBBIT/hobbit/data/rrd/IGR51RRTB.tigr.org/temperature
.module_6_asic-.rrd from 172.17.10.16: illegal attempt to update using
time 1112393889 when last update time is 1112393889 (minimum one
second
step)
2005-04-01 17:20:04 RRD error updating
/local/packages/IT/HOBBIT/hobbit/data/rrd/utah.tigr.org/disk.rrd from
172.17.10.79: illegal attempt to update using time 1112394004 when
last
update time is 1112394004 (minimum one second step)
2005-04-01 17:20:04 RRD error updating
/local/packages/IT/HOBBIT/hobbit/data/rrd/utah.tigr.org/disk.rrd from
172.17.10.79: illegal attempt to update using time 1112394004 when
last
update time is 1112394004 (minimum one second step)
2005-04-01 17:21:27 RRD error updating
/local/packages/IT/HOBBIT/hobbit/data/rrd/atlas.tigr.org/netstat.rrd
from 172.17.10.80: expected 11 data source readings (got 16) from
1112394087:23501770:2904610:0:97558:26724:76:17:8:U:U:U:U:226801128:2976
62863:U:956:...

any suggestions?
Thanks


To unsubscribe from the hobbit list, send an e-mail to
hobbit-unsubscribe at hswn.dk 



==============================================================================
CONFIDENTIALITY NOTICE: This email contains information from the sender that may be CONFIDENTIAL, LEGALLY PRIVILEGED, PROPRIETARY or otherwise protected from disclosure. This email is intended for use only by the person or entity to whom it is addressed.  If you are not the intended recipient, any use, disclosure, copying, distribution, printing, or any action taken in reliance on the contents of this email, is strictly prohibited. If you received this email in error, please contact the sending party by replying in an email to the sender, delete the email from your computer system and shred any paper copies of the email you printed.

Note to Patients: There are a number of risks you should consider before using e-mail to communicate with us. These risks are described in our Privacy Policy at http://henryford.com.  Review that policy carefully before continuing to communicate with us by e-mail. For greater Internet security, our policy describes the Henry Ford MyHealth electronic communication process - you may register at http://henryford.com.  If you do not believe that our policy gives you the privacy and security protection you need, do not send e-mail or Internet communications to us.


==============================================================================




More information about the Xymon mailing list