[Xymon] df hanging will cause xymon-client to hang.

Richard L. Hamilton rlhamil2 at gmail.com
Thu Jun 18 17:03:43 CEST 2015


A “hard” NFS mount will not give up on an access, but retry until the server comes back up (contrasted to a “soft” mount which will eventually give up, but can cause program crashes or even data loss on write operations when it gives up).  “soft” mounts are almost always Evil (TM).

An NFS mount with BOTH “hard” and “intr” options is as robust as a regular “hard” mount, but programs hung in an access to an unresponsive server can be killed.


> On Jun 18, 2015, at 09:45, Novosielski, Ryan <novosirj at ca.rutgers.edu> wrote:
> 
> I wouldn't be certain that would work. Hangs on NFS tend not to respond to CTRL-C or kill. I'm working from memory here, but it would be interesting to try.
> 
> --
> ____ *Note: UMDNJ is now Rutgers-Biomedical and Health Sciences*
> || \\UTGERS      |---------------------*O*---------------------
> ||_// Biomedical | Ryan Novosielski - Senior Technologist
> || \\ and Health | novosirj at rutgers.edu - 973/972.0922 (2x0922)
> ||  \\  Sciences | OIRT/High Perf & Res Comp - MSB C630, Newark
>      `'
> ________________________________________
> From: Steve Anderson [steve.anderson at bipsolutions.com]
> Sent: Thursday, June 18, 2015 9:23 AM
> To: Novosielski, Ryan; Cédric BRINER; xymon at xymon.com
> Subject: RE: [Xymon] df hanging will cause xymon-client to hang.
> 
> If you don't care about the nfs mounted volumes, you may get away with replacing the df with something like
> 
> df -l
> or
> df -x nfs
> 
> Those /should/ (depending on implementation) just skip the testing of the nfs volumes.
> 
> timeout 5s df
> 
> is another option, which should kill the df if it takes more than 5 seconds.
> 
> 
> Steve
> 
> 
> -----Original Message-----
> From: Xymon [mailto:xymon-bounces at xymon.com] On Behalf Of Novosielski, Ryan
> Sent: 18 June 2015 14:16
> To: Cédric BRINER; xymon at xymon.com
> Subject: Re: [Xymon] df hanging will cause xymon-client to hang.
> 
> You do eventually find out when the status turns purple.
> 
> This is a pretty hard one to deal with, from my experience. A hang on something NFS-related is pretty difficult to get out of. I've seen that mounting NFS with the "bg" option can improve this somewhat, but that might create other problems.
> 
> --
> ____ *Note: UMDNJ is now Rutgers-Biomedical and Health Sciences*
> || \\UTGERS      |---------------------*O*---------------------
> ||_// Biomedical | Ryan Novosielski - Senior Technologist
> || \\ and Health | novosirj at rutgers.edu - 973/972.0922 (2x0922)
> ||  \\  Sciences | OIRT/High Perf & Res Comp - MSB C630, Newark
>      `'
> ________________________________________
> From: Xymon [xymon-bounces at xymon.com] On Behalf Of Cédric BRINER [Cedric.BRINER at UniGE.ch]
> Sent: Thursday, June 18, 2015 9:11 AM
> To: xymon at xymon.com
> Subject: [Xymon] df hanging will cause xymon-client to hang.
> 
> Hello,
> 
> I'm running a xymon-client on a Debian Jessie.
> 
> OS: Debian
> OS-Release: 8 (Jessie)
> xymon-client version : 4.3.17
> 
> The error happend due to a nfs ressource not responding. I suppose that
> as xymon launch "df" to get information and as the nfs was hanging, the
> xymon-client is no able to detect that the df hangs and it does not send
> any data to the server. Worst, the xymon-client is not verbose at all,
> it does not tell this test (df) takes too long to accomplish.
> 
> Many thanks for xymon.
> 
> Regards.
> 
> Cédric BRINER
> _______________________________________________
> Xymon mailing list
> Xymon at xymon.com
> http://lists.xymon.com/mailman/listinfo/xymon
> 
> _______________________________________________
> Xymon mailing list
> Xymon at xymon.com
> http://lists.xymon.com/mailman/listinfo/xymon
> 
> BiP Solutions Limited is a company registered in Scotland with Company
> Number SC086146 and VAT number 383030966 and having its registered
> office at Medius, 60 Pacific Quay, Glasgow, G51 1DZ.
> 
> In order to improve the quality of the service we offer, calls may be recorded
> for quality management and training purposes.
> 
> ****************************************************************************
> This e-mail (and any attachment) is intended only for the attention of
> the addressee(s). Its unauthorised use, disclosure, storage or copying
> is not permitted. If you are not the intended recipient, please destroy
> all copies and inform the sender by return e-mail.
> This e-mail (whether you are the sender or the recipient) may be
> monitored, recorded and retained by BiP Solutions Ltd.
> E-mail monitoring/ blocking software may be used, and e-mail content may
> be read at any time.You have a responsibility to ensure laws are not
> broken when composing or forwarding e-mails and their contents.
> ****************************************************************************
> 
> _______________________________________________
> Xymon mailing list
> Xymon at xymon.com
> http://lists.xymon.com/mailman/listinfo/xymon




More information about the Xymon mailing list