[xymon] bbtest yellow-mystery

Josh Luthman josh at imaginenetworksllc.com
Mon Jul 12 21:44:56 CEST 2010


Are you prepared to share it?

Josh Luthman
Office: 937-552-2340
Direct: 937-552-2343
1100 Wayne St
Suite 1337
Troy, OH 45373



On Mon, Jul 12, 2010 at 3:39 PM, TJ Yang <tjyang2001 at gmail.com> wrote:
> On Mon, Jul 12, 2010 at 2:17 PM, Smith, Jim <JMSmith at stvincenthealth.com> wrote:
>> LOL!  Funny, but kinda rude.
>>
> Josh didn't mean to be rude.
>
> There was a book called "Hobbit RTFM" created using LaTeX by me. it is
> a book composed of Henrik's documentation(html,troff material etc)
> with Table of Contents and index.
> I showed him a while back.
> This is quite labor intensive even I only try to put the existing docs together.
>
> The book should be renamed to "Collection of Xymon Docs". so nobody
> will got offended.
>
>
> tj
>
>> -----Original Message-----
>> From: Josh Luthman [mailto:josh at imaginenetworksllc.com]
>> Sent: Monday, July 12, 2010 1:50 PM
>> To: xymon at xymon.com
>> Subject: Re: [xymon] bbtest yellow-mystery
>>
>> To my knowledge the best thing that relates to this is T.J.'s RTFM
>> guide.  I browsed through it when the IRC channel was first
>> created...long ago.
>>
>> You can always ask logical questions after searching the docs in IRC.
>>
>> Josh Luthman
>> Office: 937-552-2340
>> Direct: 937-552-2343
>> 1100 Wayne St
>> Suite 1337
>> Troy, OH 45373
>>
>>
>>
>> On Mon, Jul 12, 2010 at 2:45 PM, Carl Melgaard <Carl.Melgaard at stab.rm.dk> wrote:
>>> Hi,
>>>> Yes, that is only for disabling alerting to create a maintenance window.
>>>> It doesn't stop the checks from occurring. Thanks.
>>> Great, I feel like a beginner now. Is there any day-to-day operations-manual
>>> in the Xymon-documentation/wiki somewhere? I think I've missed the basic
>>> Xymon operation-guidelines somehow.
>>>
>>>
>>>
>>> /melgaard
>>>
>>> On Mon, Jul 12, 2010 at 1:50 PM, Carl Melgaard <Carl.Melgaard at stab.rm.dk>
>>> wrote:
>>>>
>>>> Hi,
>>>>
>>>>
>>>>
>>>> > Have you disabled alerting for that host or commented it out in your
>>>> > bb-hosts? The way to disable a "check" for a host is to remove the check or
>>>> > comment out the host in the bb-hosts. If you have just disabled the
>>>> > "alerting", then it will continue to do the "check", just not alert you.
>>>>
>>>>
>>>>
>>>> I've used the Enable/Disable-options in he Administration-part of Xymon,
>>>> to disable the tests (the hobbit-enadis.sh-script) - is that just
>>>> alert-disabling?
>>>>
>>>>
>>>>
>>>> /melgaard
>>>>
>>>>
>>>>
>>>> On Mon, Jul 12, 2010 at 1:13 PM, Carl Melgaard <Carl.Melgaard at stab.rm.dk>
>>>> wrote:
>>>>>
>>>>> >> Hi,
>>>>> >>
>>>>> >> One of our AD-servers went down for maintenance for 1 week, and I
>>>>> >> disabled
>>>>> >>  the conn+dns tests in Xymon for that period. Now the "bbtest" column
>>>>> >>  suddenly goes yellow, and indicates that DNS lookups takes 450+
>>>>> >> seconds
>>>>> >>  now (indicating that the DNS-check for the maintenanced server is
>>>>> >> still
>>>>> >>  active).
>>>>> >
>>>>> > No, it indicates that the Xymon server is taking a long time to resolve
>>>>> > host
>>>>> > names to IP addresses. You may need to configure the DNS client
>>>>> > settings on the
>>>>> > Xymon server (e.g. adjust the 'server' parameters in /etc/resolv.conf).
>>>>> >
>>>>> > If you have most IP addresses hardcoded correctly in bb-hosts, you may
>>>>> > want to
>>>>> > add the 'testip' flag to the relevant lines of bb-hosts.
>>>>>
>>>>> But why is the check still running against the AD-host I've disabled? It
>>>>> looks like its that exact dns-check that takes 450 seconds to run (timeout).
>>>>> Looking at the DNS-statistics:
>>>>>
>>>>> DNS tests executed                          4602546.200466
>>>>>  450.096498
>>>>>
>>>>> which is the normal checks + the 450 second timeout. All other dns-checks
>>>>> run fine.
>>>>>
>>>>> Shouldnt I except something like this (from the conn checks):
>>>>>
>>>>> "System unreachable for 764 poll periods (363317 seconds)"
>>>>>
>>>>> instead of:
>>>>>
>>>>> "Service dns on xxx07 is OK
>>>>> Dialup host/service, or test depends on another failed test
>>>>> Host appears to be down
>>>>> Timeout
>>>>> Seconds: 450.003"
>>>>>
>>>>> when I've disabled all the checks on the host, and the server is powered
>>>>> down? Or am I missing something here?
>>>>>
>>>>> /melgaard
>>>>> To unsubscribe from the xymon list, send an e-mail to
>>>>> xymon-unsubscribe at xymon.dk
>>>>>
>>>>>
>>>>
>>>
>>>
>>
>> To unsubscribe from the xymon list, send an e-mail to
>> xymon-unsubscribe at xymon.dk
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>> The information contained in this message is privileged
>> and confidential information intended for the review and
>> use of the individual and entity named above. If the
>> reader of this message is not the intended recipient, you
>> are hereby notified that any disclosure, dissemination,
>> distribution or copying of this communication or the
>> information contained herein is strictly prohibited. If
>> you have received this communication in error, please
>> immediately notify us.
>>
>>
>>
>>
>>
>> To unsubscribe from the xymon list, send an e-mail to
>> xymon-unsubscribe at xymon.dk
>>
>>
>>
>
>
>
> --
> T.J. Yang
>
> To unsubscribe from the xymon list, send an e-mail to
> xymon-unsubscribe at xymon.dk
>
>
>



More information about the Xymon mailing list