
alerting with combo question




Hi !

I've got a difficult problem with an alerting rule that my customers expect ..

.. unfortunately I'm afraid that it can't work; maybe someone on the list has an idea .. :-)


I've got a Lustre quota check
which should run on 4 hosts for reliability reasons ...

If everything is OK, the output on each host looks like

Tue Nov 30 11:13:35 CET 2010 - LUSTRE Mount(s) on tcx060 OK

filesystem summary:        78.2T       58.0T       16.3T  74% /scratch/hh/lustre/atlas
filesystem summary:        19.9T       18.7T      233.8G  93% /scratch/zn/lustre/atlas

If the quota is reached, the output contains additional information about all user directories and their sizes ... up to this point everything is easy ..

Now, my customers want to get only one alert with the additional information about all user directories and their sizes, not four alerts, one per host.

I've tested it with an entry in bbcombotest.cfg, e.g.
AtlasLustre.lustre-atlas = (tcx040.lustre\-atlas + tcx060.lustre\-atlas + tcx080.lustre\-atlas + tcx120.lustre\-atlas )  >= 4
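
As far as I understand it, the combo expression above is evaluated roughly like the Python sketch below. This is only an illustration, not the actual combo-test code: the function name combo_is_green is made up, and the mapping (green counts as 1, everything else as 0) is a simplifying assumption.

```python
# Rough sketch of how the combo rule above is evaluated.
# Assumption: a green test counts as 1, any other color as 0.
# combo_is_green is a hypothetical name, not part of the monitoring tool.

def combo_is_green(colors, threshold=4):
    """Return True if at least `threshold` of the given tests are green."""
    total = sum(1 if color == "green" else 0 for color in colors.values())
    return total >= threshold


# Hypothetical example states for the four hosts.
colors = {
    "tcx040.lustre-atlas": "green",
    "tcx060.lustre-atlas": "green",
    "tcx080.lustre-atlas": "green",
    "tcx120.lustre-atlas": "red",
}

print("combo is", "green" if combo_is_green(colors) else "red")
```

With ">= 4" the combined test only stays green while all four hosts are green, so a single red host is enough to turn it red.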

This alarm works of course, but I only get this alert message

Red Mon Nov 29 14:37:07 2010

(tcx040.lustre\-atlas+tcx060.lustre\-atlas+tcx080.lustre\-atlas+tcx120.lustre\-atlas)>=4 = (1+1+1+1)>=4 = 0
&green tcx040.lustre-atlas
&green tcx060.lustre-atlas
&green tcx080.lustre-atlas
&red tcx120.lustre-atlas


It lacks the additional information about all user directories and their sizes, which is logical of course .. but it doesn't solve my problem :-(
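
To illustrate what I'm after, here is a rough Python sketch of the behaviour I'd like: one combined message that lists the per-host colors and carries the detailed output only once. The helper merge_reports and the sample detail text are made up for illustration; nothing like this exists in the tool as far as I know.

```python
# Sketch of the desired result: a single alert containing the detail text once.
# merge_reports is a hypothetical helper, and the report texts are invented.

def merge_reports(reports):
    """reports maps host -> (color, text); returns (color, combined message)."""
    hosts = sorted(reports)
    summary = "\n".join(f"&{reports[h][0]} {h}" for h in hosts)
    failing = [h for h in hosts if reports[h][0] != "green"]
    if not failing:
        return "green", summary
    # Every failing host reports the same quota details, so take them once.
    first = failing[0]
    return "red", f"{summary}\n\nDetails from {first}:\n{reports[first][1]}"


reports = {
    "tcx040": ("green", "LUSTRE Mount(s) on tcx040 OK"),
    "tcx120": ("red", "quota reached: <per-user directory sizes>"),
}
color, message = merge_reports(reports)
print(color)
print(message)
```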

Any hints are welcome !

Thanks & cheers

       Martin