
Re: [hobbit] bbdisplay problems after adding some new hosts



Hello Henrik,

I will send you the checkpoint file directly.
However, it turns out that a single test is responsible after all (the first time I didn't wait long enough).
Every time I send the attached status message to my hobbit server, the server crashes after 5-10 minutes.
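
For anyone who wants to try reproducing this, here is a minimal sketch of how a status message like the attached one could be sent to a test server. It is not the script that produced the attachment. It assumes the server listens on the default Big Brother/Hobbit TCP port 1984 and follows the BB convention of writing dots in host names as commas; the function names build_status_message and send_status are made up for this example.

```python
import socket
import time

BB_PORT = 1984  # assumed default Big Brother / Hobbit daemon port
VALID_COLORS = {"green", "yellow", "red", "clear", "purple", "blue"}


def build_status_message(host, test, color, text, now=None):
    """Build a 'status HOST.TEST COLOR TIMESTAMP TEXT' message string."""
    if color not in VALID_COLORS:
        raise ValueError("unknown color: %r" % (color,))
    # Assumption: BB-style messages write dots in the host name as commas,
    # because the dot separates the host name from the test name.
    bb_host = host.replace(".", ",")
    stamp = time.strftime("%a %b %d %H:%M:%S %Z %Y", time.localtime(now))
    return "status %s.%s %s %s %s" % (bb_host, test, color, stamp, text)


def send_status(server, message, port=BB_PORT, timeout=10.0):
    """Send one message over TCP, then close the write side of the socket."""
    with socket.create_connection((server, port), timeout=timeout) as sock:
        sock.sendall(message.encode("utf-8"))
        # Half-close so the server sees end-of-message.
        sock.shutdown(socket.SHUT_WR)
```

A call such as send_status("127.0.0.1", build_status_message("i-epc01", "hwmon", "green", body)), where body holds the HTML part of the attachment, should be roughly equivalent to sending the attachment with the bb client command. Only do this against a test server, since the message is the one that triggers the crash.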


Regards,

Stefan


>From: henrik (at) hswn.dk (Henrik Stoerner)
>Reply-To: hobbit (at) hswn.dk
>To: hobbit (at) hswn.dk
>Subject: Re: [hobbit] bbdisplay problems after adding some new hosts
>Date: Sun, 15 May 2005 08:46:01 +0200
>
>On Thu, May 12, 2005 at 12:31:44PM +0000, Stefan Loos wrote:
> >
> > is the number of tests per host limited? I've cleaned my bb-hosts and as I
> > add one host with many tests. I disabled all customized tests and tried to
> > find out if its a single test. But all test run by itself didn't crash the
> > hobbit server - running all together did!
> > The host runs about 20 tests.
>
>No, there is no limit (other than running out of memory, but I think we
>can rule out that one).
>
>It seems to crash while loading the checkpoint file. Could you send me
>that file - it's the ~hobbit/server/tmp/hobbitd.chk file ? I believe it
>has somehow become corrupted, but that still shouldn't crash the server.
>
>
>Regards,
>Henrik
>
>
>To unsubscribe from the hobbit list, send an e-mail to
>hobbit-unsubscribe (at) hswn.dk


status i-epc01.hwmon green Fri May 13 13:32:41 CEST 2005 Everything is ok (it seems ... :-))<br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=2>------------------ System Information ---------------</td> </tr> <tr> <td align=center>Info</td><td align=center>Content</td> </tr> <tr> <td>Serialnumber:</td><td>J061KYD327 </td> </tr> <tr> <td>Product:</td><td>ProLiant DL360 G3</td> </tr> <tr> <td>Systemid:</td><td>CPQ0733</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=6>------------------ HPLOG entries ---------------</td> </tr> <tr> <td>Color</td><td>Id</td><td>Severity</td><td>Inital time</td><td>Update time</td><td>Count</td> </tr> <tr> <td align=center>&green</td><td align=center>0</td><td align=center>Info</td><td>11.03.2005 13:54</td><td>11.03.2005 13:54</td><td align=center>1</td> </tr> <tr> <td> </td><td> </td><td colspan=4>IML Cleared (Message Log Cleared by root)</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=2>------------------ Status of HP UID (blue LED) ---------------</td> </tr> <tr> <td align=center>Color</td><td align=center>Status</td> </tr> <tr> <td align=center>&clear</td><td align=center>LED off</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=6>------------------ Temperature checks ---------------</td> </tr> <tr> <td>Color</td><td>Id</td><td>Location</td><td>Status</td><td align=center>Current</td><td align=center>Treshold</td> </tr> <tr> <td align=center>&green</td><td>1</td><td>Cpu</td><td>Normal</td><td>86F / 30C</td><td>138F / 59C</td> </tr> <tr> <td align=center>&green</td><td>2</td><td>Cpu</td><td>Normal</td><td>89F / 32C</td><td>163F / 73C</td> </tr> <tr> <td align=center>&green</td><td>3</td><td>IO Board</td><td>Normal</td><td>93F / 34C</td><td>140F / 60C</td> </tr> <tr> <td 
align=center>&green</td><td>4</td><td>Cpu</td><td>Normal</td><td>89F / 32C</td><td>163F / 73C</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=7>------------------ Fan checks ---------------</td> </tr> <tr> <td>Color</td><td>Id</td><td>Type</td><td>Location</td><td>Status</td><td>Redundant</td><td>Fan speed</td> </tr> <tr> <td align=center>&green</td><td align=center>1</td><td>Spin detect</td><td>Cpu</td><td align=center>normal</td><td align=center>NO</td><td align=center>normal</td> </tr> <tr> <td align=center>&green</td><td align=center>2</td><td>Spin detect</td><td>System</td><td align=center>normal</td><td align=center>NO</td><td align=center>normal</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=5>------------------ Power supply checks ---------------</td> </tr> <tr> <td>Color</td><td>Id</td><td align=center>Status</td><td align=center>Redundant</td><td align=center>Hotplug</td> </tr> <tr> <td align=center>&green</td><td align=center>1</td><td align=center>OK</td><td align=center>YES</td><td align=center>YES</td> </tr> <tr> <td align=center>&green</td><td align=center>2</td><td align=center>OK</td><td align=center>YES</td><td align=center>YES</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=5>------------------ Automatic server restart status ---------------</td> </tr> <tr> <td>Color</td><td align=center>Status</td><td align=center>Condition</td><td>Timeout</td><td align=center>Reboot<br>What / Limit / Count</td> </tr> <tr> <td align=center>&green</td><td align=center>ENABLED</td><td align=center>OK</td><td>10 Min</td><td>Boot OS / 10 / 0</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=2>------------------ Diskarray and disk state ---------------</td> </tr> <tr valign=top> <td 
align=center>&green</td><td colspan=7 align=left>Controller Sa-5i, Version: 2.36 Rev. B in Slot 0 - Status: OK - Controller condition: OK</td> </tr> <tr valign=top> <td align=center>&green</td><td colspan=7>Accelerator status: Enabled</td> </tr> <tr valign=top> <td align=center>&clear</td><td colspan=7>Battery: not present</td> </tr> <tr valign=top> <td align=center>&green</td><td colspan=7>Logical drive 1 (/dev/cciss/c0d0):<br>consists of phys. drive(s): <ul><li>Controller: 0 Disk: 0</li><li>Controller: 0 Disk: 1</li></ul> </tr> </table> <br> <table border=0 cellpadding=2 cellspacing=2> <tr> <td>Color</td><td>Bay</td><td>Status</td><td>Model</td><td colspan=2>Read Errors</td><td colspan=2>Write Errors</td><td>Smart Status</td> </tr> <tr> <td> </td><td> </td><td> </td><td> </td><td>Hard</td><td>Recv</td><td>Hard</td><td>Recv</td><td> </td> </tr> <tr> <td align=center>&green</td><td align=center>0</td><td align=center>OK</td><td>COMPAQ BD03685A24 </td><td align=center>0</td><td align=center>0</td><td align=center>0</td><td align=center>0</td><td align=center>OK</td> </tr> <tr> <td align=center>&green</td><td align=center>1</td><td align=center>OK</td><td>COMPAQ BD03686223 </td><td align=center>0</td><td align=center>0</td><td align=center>0</td><td align=center>1</td><td align=center>OK</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=4>------------------ Correctable Memory Error Status ---------------</td> </tr> <tr> <td align=center>&green</td><td colspan=3>Status: enabled, Condition: N/A, Errorcount: 0</td> </tr> </table> <br> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td align=center colspan=6>------------------ Memory state ---------------</td> </tr> <tr> <td>Color</td><td>Type</td><td>Status</td><td>Condition</td><td>Hotplug</td><td>Speed</td> </tr> <tr> <td align=center>&green</td><td align=center>advanced ECC</td><td align=center>protected</td><td 
align=center>OK</td><td align=center>NO</td><td align=center>266MHz</td> </tr> </table> <br> <table align=center border=0 cellpadding=2 cellspacing=2> <tr> <td>Color</td><td>Board-Id</td><td>Module Id</td><td>Status</td><td>Condition</td> </tr> <tr> <td align=center>&green</td><td align=center>0</td><td align=center>1</td><td align=center>good</td><td align=center>OK</td> </tr> <tr> <td align=center>&green</td><td align=center>0</td><td align=center>2</td><td align=center>good</td><td align=center>OK</td> </tr> <tr> <td align=center>&green</td><td align=center>0</td><td align=center>3</td><td align=center>good</td><td align=center>OK</td> </tr> <tr> <td align=center>&green</td><td align=center>0</td><td align=center>4</td><td align=center>good</td><td align=center>OK</td> </tr> </table> <br> <br>