[Xymon] server-side message pre-processor?

John Thurston john.thurston at alaska.gov
Tue Nov 3 20:11:23 CET 2015


On 11/3/2015 9:22 AM, Japheth Cleaver wrote:
- snip -
> I guess one question would be whether these are being caused by poorly
> configured local settings, or by people coming up with new tests they're
> reporting in that you don't want to hear about?

These are poorly managed hosts whose owners see none of the cost of 
flooding the server with garbage. "What? Your stupid server can't handle 
my 1,000 events per host per day?! What kind of crappy system is that?!"

They are treating my Xymon server like it was an instance of Splunk. I 
keep trying to tell them it is an _alerting_ system, not a _logging_ 
system. If there is yellow on the page, they should be looking to see if 
there really is a problem. If there is a red on the page, they should be 
scrambling to fix the problem. Some of the owners are just ignoring all 
colors on the page until a customer calls with a problem. Then they are 
using the 'msgs' test result to see the most recent contents of the 
event-log.

> Are you running in local config mode? If so, I might suggest using this
> as justification for migrating to central configuration. Put the
> thresholding control back into your hands unless the users are willing
> to accept responsibility for the types of things they're sending in.

But I don't know their business requirements ('course, it seems like 
they don't either). Nor do I have the time to personally handle 
configuration of 600+ hosts. Nor do I want to have access to those 
systems through the client distribution and management system.

Local-config mode works fine for us, except for the client's implicit 
ability to flood me with garbage. Its the "all or none" acceptance model 
which is painful.

- snip -
> Is the problem more along the lines of "I don't want to receive test
> 'xyz' on any host", or more "Here's a list of 38 different tests I want
> to reject on 18 / 400 servers".

It's more the latter. "The owner of host=foo can't configure its client 
correctly. This has been going on for months. Fine. I will only accept 
disk and cpu from them."

I don't see the former condition as being realistic. Since any client 
can send a message containing any possible string as their test-name, it 
would impossible filter noise. Further, 90% of my system owners are 
politely handling their 'msgs' test. I don't want to punish those folks.

- snip -
> As an initial step, I think adding hard-coded ignore-test records at
> xymond startup (by command option or by xymonserver.cfg) would probably
> be a pretty simple stop gap to create in the next rev.

I don't think this would be of any value to me. In general, I _want_ to 
accept messages of the name 'msgs'. There are just 10% of my hosts from 
which I want silence on messages of that type. The option of defining a 
per-host white-list is what I need.

But it is probably a very fringe need. I'll try getting my bat out 
again. Maybe I can beat some sense into the 10%.

-- 
    Do things because you should, not just because you can.

John Thurston    907-465-8591
John.Thurston at alaska.gov
Enterprise Technology Services
Department of Administration
State of Alaska



More information about the Xymon mailing list