hobbitmon 4.2.3 client on esx 3.5.0 works first time only - then goes purple
Overy, Steve
steve.overy at gb.unisys.com
Wed Jul 22 14:35:30 CEST 2009
Hi,
I have hobbitmon clients running on various servers; this week turned my attentions to some ESX servers. The hobbitmon client configure/make went fine & after running "runclient.sh start" the status columns of the hobbitmon server display page went green (and expected data reported). However, 20 - 30 minutes later, those statuses went purple & inspecting the data showed (from the timestamp) that the client ran only the first time.
The ...client/logs show nothing untoward, no core files in ...client/tmp - and I opened the ESX firewall port!
Client start-up
[xymon at esx112 logs]$ ../runclient.sh start
Hobbit client for linux started on esx112
[xymon at esx112 logs]$ date
Wed Jul 22 11:32:04 BST 2009
Initial data reported (green smiley's) (server: http://W.X.Y.Z/xymon-cgi/bb-hostsvc.sh?HOST=esx112&SERVICE=cpu<http://w.x.y.z/xymon-cgi/bb-hostsvc.sh?HOST=esx112&SERVICE=cpu> )
Top of Form
Bottom of Form
Wed Jul 22 11:31:56 BST 2009 up: 113 days, 2 users, 72 procs, load=0.02
System clock is -220 seconds off
However, a little over 30 minutes later those smiley's went purple & the data timestamp was unchanged from the above.
The logs show little
log/hobbitclient.log # no entries even for the start-up
log/clientlaunch.log
2009-07-22 11:31:56 hobbitlaunch starting
2009-07-22 11:31:56 Loading tasklist configuration from /home/xymon/client/etc/clientlaunch.cfg
2009-07-22 11:31:56 Task client started with PID 31241
And there is nothing in: /var/log/messages
And no core files:
[root at esx112 xymon]# pwd
/home/xymon
[root at esx112 xymon]# find . -name "*core*"
[root at esx112 xymon]#
Looking at xymon processes I notice that there is a difference between a working linux client and this failed esx client.
Working SLES10-SP2 client:
xymon 5155 1 0 May28 ? 00:00:02 /home/xymon/client/bin/hobbitlau
nch --config=/home/xymon/client/etc/clientlaunch.cfg --log=/home/xymon/client/lo
gs/clientlaunch.log --pidfile=/home/xymon/client/logs/clientlaunch.sles10-SP2-do
mU-1.pid
xymon 18347 1 0 12:11 ? 00:00:00 sh -c vmstat 300 2 1>/home/xymon
/client/tmp/hobbit_vmstat.sles10-SP2-domU-1.18313 2>&1; mv /home/xymon/client/tm
p/hobbit_vmstat.sles10-SP2-domU-1.18313 /home/xymon/client/tmp/hobbit_vmstat.sle
s10-SP2-domU-1
xymon 18349 18347 0 12:11 ? 00:00:00 vmstat 300 2
Failed esx 3.5.0 client:
[root at esx112 xymon]# ps -ef|grep "^xymon"
xymon 30803 30801 0 09:38 ? 00:00:00 sshd: xymon at pts/0
xymon 30804 30803 0 09:38 pts/0 00:00:00 -bash
xymon 31240 1 0 11:31 ? 00:00:00 /home/xymon/client/bin/hobbitlaunch --verbose --config=/home/xymon/client/etc/clientlaunch.cfg --log=/home/xymon/client/logs/clientlaunch.log --pidfile=/home/xymon/client/logs/clientlaunch.esx112.pid
So vmstat seems to be a clue... so I ran hobbitclient-linux.sh - no errors reported in the output & a "0" return code.
Any ideas where I go next?
TIA & regards
steve overy | support analyst | EMEA Client Support Centre | Global Outsourcing and Infrastructure Services
Unisys Limited | Fox Milne | Tongwell Street, Milton Keynes, MK15 0YS
Registered in England Company No. 103709
Registered Office: Bakers Court, Bakers Road, Uxbridge, UB8 1RG
Phone: +44 (0) 1908 212306, net 741 2306
Mobile: +44 (0) 7808 391673, net 839 1673
Email: steve.overy at unisys.com
THIS COMMUNICATION MAY CONTAIN CONFIDENTIAL AND/OR OTHERWISE PROPRIETARY MATERIAL and is thus for use only by the intended recipient. If you received this in error, please contact the sender and delete the e-mail and its attachments from all computers.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20090722/eead5db8/attachment.html>
More information about the Xymon
mailing list