[xymon] CPU load average not being graphed for some servers

Ward, Martin Martin.Ward at colt.net
Thu Oct 14 16:46:28 CEST 2010


Thanks Tom, that got me sorted. It seems  that Solaris relies on the
BOOT_TIME record held in the /var/adm/utmpx file. This file has been
rotated out of the way in order to save disk space so I got no uptime
values at all.  It looks like this messed with the load average data
since the uptime output didn't have any uptime in it.

 

Like you I have hacked the hobbitclient-sunos.sh file and put in a small
perl scriptlet so that if there is no uptime it adds a fake value in
just to ensure that the load averages get stored properly.

 

|\/|

 

From: Brand, Thomas R. [mailto:TRBrand at cvs.com] 
Sent: 13 October 2010 17:53
To: xymon at xymon.com
Subject: RE: [xymon] CPU load average not being graphed for some servers

 

From: Ward, Martin [mailto:Martin.Ward at colt.net] 

 

Hi all,

 

I'm having trouble tracking this one down. For many of my
Xymon-monitored servers I can see the graphs on the CPU load page and
they have data in them. Yet for many others the CPU graphs are empty. I
can see the graphs but there is no data.

 

Investigation reveals that for those servers who are missing data the
.cpu file has not been updated in months, yet when I look at the client
data available it shows the load average in the uptime section, as in:

 

[uptime]

  5:07pm  2 users,  load average: 0.53, 0.71, 0.62

 

which is, I believe, where Xymon gets the LA from. 

 

 

Martin,

 

This may be related to an issue I came across last year.

 

The (cpu load and users & processes) graphs appear to be dependent on
the exact output of the 'uptime' command.

 

In my case, the graphs did show for the first 24 hours of uptime,
didn't show after 24 hours, then started showing again after 48 hours of
uptime.

I was finally able to isolate this to the output of 'uptime' for the
first full day - it showed 'day' not 'days'.

 

# uptime

 12:41pm  up 196 days 21:49,  5 users,  load average: 0.23, 0.19, 0.17

 

# uptime

 12:41pm  up 1 day 21:49,  2 users,  load average: 0.06, 0.02, 0.00

 

I 'fixed' this by updating hobbitclient-linux.sh and modified the output
of the 'uptime' command replacing 'day ' with 'days '.

 

echo "[uptime]"

uptime | perl -pe "s/^(.*) day (.*)/\1 days \2/"

 

It appears your uptime command does not show the 'up x days HH:MM'.

 

Cheers,

Tom Brand


[Colt Disclaimer]
The message is intended for the named addressee only and may not be disclosed
to or used by anyone else, nor may it be copied in any way. The contents of
this message and its attachments are confidential and may also be subject to
legal privilege. If you are not the named addressee and/or have received this
message in error, please advise us by e-mailing abuse at colt.net and delete the
message and any attachments without retaining any copies. Internet
communications are not secure and Colt does not accept responsibility for this
message, its contents nor responsibility for any viruses. No contracts can be
created or varied on behalf of Colt Technology Services, its subsidiaries,
group companies or affiliates ("Colt") and any other party by email
communications unless expressly agreed in writing with such other party.
Please note that incoming emails will be automatically scanned to eliminate
potential viruses and unsolicited promotional emails. For more information
refer to www.colt.net or contact us on +44(0)20 7390 3900

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.xymon.com/pipermail/xymon/attachments/20101014/f8046ff5/attachment.html>


More information about the Xymon mailing list