[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: False red alerts for disk

To: hobbit (at) hswn.dk
Subject: Re: False red alerts for disk
From: Patrik Nilsson <patrik (at) jalbum.net>
Date: Thu, 11 Jun 2009 11:40:40 +0200
References: <8826b2830906100757y309f77ay9be8edf33cc800fb (at) mail.gmail.com>

I am now also seeing this with memory reports. There seem to be a
general but intermittent parsing error of client data.

T 2009][uname]
Linux tc1.jalbum.net 2.6.18-92.1
22.el5xen ]86_64 - Memory CRITICAL
   Memory              Used       Total  Percentage
red Physical          48576M          1M    4857600%
red Actual              819M          1M      81900%
green Swap                 80M       1983M          4%

Notice the messed up brackets.

The corresponing part of the actual client data reported is:

client tc1,hostnamechanged,net.linux linux
[date]
Thu Jun 11 11:31:36 CEST 2009
[uname]
Linux tc1.hostnamechanged.net 2.6.18-92.1.22.el5xen x86_64
[osversion]
CentOS release 5.2 (Final)
[uptime]
 11:31:36 up 26 days, 22:25,  1 user,  load average: 0.12, 0.10, 0.03
[who]
root     xvc0         May 15 13:09
[df]
Filesystem         1024-blocks      Used Available Capacity Mounted on
/dev/mapper/VolGroup00-LogVol00  10102072   5553636   4131628      58% /
/dev/xvda1              101086     20724     75143      22% /boot
[mount]
/dev/mapper/VolGroup00-LogVol00 on / type ext3 (rw)
proc on /proc type proc (rw)
sysfs on /sys type sysfs (rw)
devpts on /dev/pts type devpts (rw,gid=5,mode=620)
/dev/xvda1 on /boot type ext3 (rw)
tmpfs on /dev/shm type tmpfs (rw)
none on /proc/sys/fs/binfmt_misc type binfmt_misc (rw)
sunrpc on /var/lib/nfs/rpc_pipefs type rpc_pipefs (rw)
192.168.8.8:/mnt/share on /share type nfs (rw,addr=192.168.8.8)
[free]
             total       used       free     shared    buffers     cached
Mem:       1048576    1043172       5404          0       1936     201892
-/+ buffers/cache:     839344     209232
Swap:      2031608      82368    1949240
[ifconfig]

Patrik

On Wed, Jun 10, 2009 at 4:57 PM, Patrik Nilsson<patrik (at) jalbum.net> wrote:
> Hi,
>
> Running Xymon 4.3.0-0.beta2, I sometimes gets false red alerts from
> disk on a few servers (One of the servers is the xymon server itself).
>
> Usually disk status is reported green, as this:
>
> Wed Jun 10 16:29:17 CEST 2009 - Filesystems OK
>
> Filesystem         1024-blocks      Used Available Capacity Mounted on
> /dev/sda1            204603376   1616748 192593380       1% /
>
> But occasionally, I get red alerts, like this:
>
> - Filesystems NOT ok
>
> red 192593256       1% / (1616872% used) has reached the PANIC level (95%)
>
> Filesystem         1024-blocks
> Use] Available Capacity Mounted on
> /dev/sda1            204603376   1616872 192593256       1% /
>
> Somehow the parsing of the client data doesn't work right, resulting
> the disk blocks being interpreted as percent used.
>
> The corresponding df part in the actual client report looks like this:
>
>  [df]
> Filesystem         1024-blocks      Used Available Capacity Mounted on
> /dev/sda1            204603376   1616872 192593256       1% /
>
>
> On another server, the false red alert looks like this:
> Wed Jun 10 15:51:53 CEST 2009 - Filesystems NOT ok
>
> red 44% / (2778580% used) has reached the PANIC level (95%)
> red 6% /home (2167204% used) has reached the PANIC level (95%)
>
> Filesystem         1
> 24-]locks      Used Available Capacity Mounted on
> /dev/xvda2             5162828   2121988   2778580      44% /
> /dev/xvda3             24
> 7244  ] 136744   2167204       6% /home
>
> While it usually looks like this:
>  Wed Jun 10 15:56:54 CEST 2009 - Filesystems OK
>
> Filesystem         1024-blocks      Used Available Capacity Mounted on
> /dev/xvda2             5162828   2122012   2778556      44% /
> /dev/xvda3             2427244    136784   2167164       6% /home
>
>
> Slightly different, but once again, blocks used being interpreted as
> percentage used.
>
> Anyone has an idea of what might be causing this?
>
> Thanks,
>
> Patrik Nilsson
>

References:
- False red alerts for disk
  - From: Patrik Nilsson

Prev by Date: Re: [hobbit] Alert mail TIME tag
Next by Date: RE: [hobbit] Alert mail TIME tag
Previous by thread: False red alerts for disk
Next by thread: Re: False red alerts for disk
Index(es):
- Date
- Thread