[hobbit] Setting this up and monitoring VM Servers?

Buchan Milne bgmilne at staff.telkomsa.net
Thu Jun 25 14:37:15 CEST 2009


On Thursday 25 June 2009 13:36:10 T.J. Yang wrote:
> ________________________________
>
> > From: Vernon.Everett at woodside.com.au
> > To: hobbit at hswn.dk
> > Date: Thu, 25 Jun 2009 09:23:00 +0800
> > Subject: RE: [hobbit] Setting this up and monitoring VM Servers?
> >
> >
> >
> >
> >
> >
> >
> >
> > I started putting something together for Devmon to pull
> > infor from the VMWare host using SNMP.
> >
> > Then we did a reorg here at work, and suddenly we had way
> > more work than we could handle, and no time to play with things like
> > Devmon modules.
> >
> > I posted my (incomplete) work on the list, in the hope
> > somebody would take over, but I know of no takers.
> >
> > Search the archive for me, Devmon and ESX. You should find
> > it there somewhere.
>
> I did try out devmon some time ago but I don't have urgent need to monitor
> snmp devices. and stop putting time into devmon since Henrik is working on
> snmp query/monitoring in 4.(3/4).0 version.

Well, besides the most basic network interface utilisation graphs, NBAR 
protocol utilisation graphs, QoS graphs, and monitoring and graphing of CPU 
and memory usage on all network devices, we use devmon for:

-monitoring HP ProLiant:
--RAID array status
--Fan status
--Temperature (with graphs)
--Integrated Management Log
--Power supply status

We have replaced a number of power supplies which had intermittent problems 
that would not otherwise have been easily picked up.

-Monitoring for Dell PowerEdge:
--RAID array status
--Fan status
--Temperature (with graphs)
--Power supply status

-Radius servers:
--Monitoring per-RADIUS-client accounting and authentication requests

-Linux servers:
--Red Hat cluster status
--Per-Disk IO graphs

-Firewalls:
--Graph of connections
--Monitoring of cluster status

-Load balancer (templates still in progress)
--Connections to each real server
--Connections to the virtual server
--Server farm status

Anti-spam appliances:
--CPU utilisation
--Fans
--Keys for licensed features (e.g., warn when expiration imminent)
--Memory
--Power supplies
--RIAD arrays
--Resources  (graphs of disk io, open files, and threads)
--Temperatures (with graphs)

One of my colleagues is using devmon for agent-less monitoring of Windows 
servers (using microsoft-win2k3server template available in svn). Some 
agentless monitoring is also available for Linux (e.g., linux-netsnmp or 
linux-openwrt templates)

In most cases, templates can be developed quite easily. The VMWare case seems 
to be a bit difficult, as the IDs they use as indexes in some of the SNMP tables 
don't match up to the IDs they use as indexes in other tables.

Now, while MRTG can do some of this, most of the more complex templates 
(ProLiant IML, RAID for ProLiant and PowerEdge, Cisco CSM load balancer) I 
think can not be done with MRTG, and while they can be done with Cacti with 
some effort, there's no decent integration with any monitoring system.

AFAICT, most of the work done on Xymon for SNMP is following the (not so 
great) MRTG model. And besides, I get all the features Xymon's SNMP support 
will bring in future right now.

And, using the data, I can do things like this:
http://staff.telkomsa.net/~bgmilne/weathermap/
(every link that is not grey would have a pup-up to the Hobbit graph of e.g. 
network utilisation, and any host with a background colour to the text will 
have popup of a Hobbit service graph - conn by default - it works fine from 
inside our network, but won't work from outside it now).

I am trying to finish up this code so I can publish a final version.

Besides that, I think we are just about monitoring everything we can monitor 
with SNMP, at this stage the only think I can do to monitor more things via 
SNMP with devmon is to start filing bugs on SNMP agents (e.g. net-snmp can 
monitor mail queues, but only for sendmail, and we are running postfix on 
anything with more than a trivial queue on it).

Regards,
Buchan



More information about the Xymon mailing list