Motherboard Forums


Reply
Thread Tools Display Modes

S5397 - i5400pw - BIOS 1.03 - Ubuntu system freeze

Mark Jeynes
      03-15-2009, 11:24 PM


I run a system with this mobo and Ubuntu 8.04 server 64-bit. Recently
the system has been hanging after about 20-30 mins, sometimes reporting
ATA disk errors.

If I reboot, the board boots as far as "Mouse Initialized" then hangs for
3 mins or so. After this it reports an error with disk 1.

After powering down for 3-4 mins and rebooting, the system will boot
successfully again. But it hangs 20 mins or so later and we're back
in the same loop. The hang seems to occur when there is substantial
SATA activity. After boot my 3 SATA disks start a RAID resync which
it says will take 240 mins to complete - all disk activity stops 15 mins or
so later and the system is frozen. Same result on a variety of recent
kernels.

The same condition occurs if I do not start the RAID software and run
concurrent but independent disk integrity checking software on the SATA
disks (i.e. I'm using the 'badblocks' utility under Linux).
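
For anyone wanting to reproduce this kind of concurrent load, here is a
minimal sketch, not the exact commands used above. It starts one read-only
'badblocks' scan per disk and waits for them all. The device names passed in
are hypothetical; -s (progress) and -v (verbose) are standard badblocks
options, and without -n or -w the scan only reads.

```python
import subprocess


def badblocks_cmd(device):
    """Build a read-only badblocks command line for one device."""
    # -s shows progress, -v is verbose; no -n/-w means a read-only scan.
    return ["badblocks", "-s", "-v", device]


def scan_concurrently(devices):
    """Start one badblocks process per device, then wait for all of them."""
    # Launch every scan first so the disks are exercised at the same time.
    procs = {dev: subprocess.Popen(badblocks_cmd(dev)) for dev in devices}
    # Collect each process's exit code, keyed by device name.
    return {dev: proc.wait() for dev, proc in procs.items()}
```

Something like scan_concurrently(["/dev/sdb", "/dev/sdc", "/dev/sdd"]) would
need root and the real device names for the machine in question.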

Right now I'm unsure if my issue lies with the disks or the mobo; it's
hard to isolate the problem when you are dependent on the mobo to test
the disks and vice versa!

Any words of wisdom from you clever people will be gratefully received!
 
 
Paul
      03-16-2009, 12:12 AM

One thing I've noticed here, as a home user, is that if a SATA disk
has a problem, there doesn't appear to be a mechanism to reset the
disk interface from the motherboard. When something similar happened
to me, I had to power cycle, before the hard drive was reset and
could be seen again.

The fact that, in your case, a reboot after a failure results in an
"error with disk 1" which is cleared by powering down suggests the
disk is the part that is hung up, rather than the motherboard.
The chipset should be resettable on the reboot, so I wouldn't expect
it to stay in a stuck state.

Have you tried downloading the disk diagnostic from the disk
manufacturer's website?

Is there a chance the disk(s) are overheating?

Does the power supply have enough 12V amps for all
the loads you have connected?

You could also try testing the disks as simple data disks on
another computer. You could use something like the free version
of HDTune for Windows as a test stimulus for the drives (i.e. the OS
doesn't need to see a file system on the drive in order to test it).
HDTune has a read benchmark that reads the disk surface, and also has
an error scan. It also reads drive temperature via SMART (as long as
the port the disk is connected to can issue SMART commands).

http://www.hdtune.com/download.html
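
On Linux, the same SMART temperature can usually be read with smartctl from
smartmontools. The following is a rough sketch only: it assumes the drive
reports temperature as attribute 194 (Temperature_Celsius), which many but
not all drives do, and that the raw value sits in the tenth column of the
`smartctl -A` table. The function names are made up for illustration.

```python
import subprocess


def parse_temperature(smartctl_output):
    """Return the raw value of SMART attribute 194 as an int, or None."""
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Attribute rows begin with the numeric ID; the raw value is column 10.
        if len(fields) >= 10 and fields[0] == "194":
            try:
                return int(fields[9])
            except ValueError:
                return None
    return None


def read_temperature(device):
    """Run `smartctl -A` on a device (assumed path) and extract the temperature."""
    result = subprocess.run(
        ["smartctl", "-A", device], capture_output=True, text=True
    )
    return parse_temperature(result.stdout)
```

Calling read_temperature("/dev/sda") needs root and a port that passes SMART
commands through; a None result means attribute 194 wasn't found.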

Paul
 
 
Mark Jeynes
      03-17-2009, 08:46 AM
Paul wrote:

Firstly I'd like to say a hearty thank you, Paul. Just seeing a reply
this morning made me feel I'm not alone on this planet. Cheers, mate.

>
> One thing I've noticed here, as a home user, is that if a SATA disk
> has a problem, there doesn't appear to be a mechanism to reset the
> disk interface from the motherboard. When something similar happened
> to me, I had to power cycle, before the hard drive was reset and
> could be seen again.
>
> The fact that a reboot after a failure in your case, results in an
> "error with disk1", which is cleared by powering down, suggests the
> disk is the part that is hung up, rather than the motherboard.
> The chipset should be resettable, on the reboot, so I wouldn't expect
> it to stay in a stuck state.
>
> Have you tried downloading the disk diagnostic from the disk
> manufacturer website ?


I did today ... on your advice (thank you! I'd not considered they would
offer such a thing). The tool says my disks are fine :-) (phew)

>
> Is there a chance the disk(s) are overheating ?


Possibly ... and I know they have in the past (smartctl told me). I did
have them stacked in one of those 5-in-3 backplane caddies. Probably
not a good idea when 3 neighbouring RAID disks decide to do a total
resync. There's not much room in there for airflow, so this kind of
load means things will get steamy - even though it's backed by a fan
that could suck a golfball through six feet of hose.

>
> Does the power supply have enough 12V amps for all
> the loads you have connected ?


I should say so ... it's a nice 750W supply from Silverstone.
Hopelessly overdone but you know how gadget-lust takes over when
shopping for machine parts.

>
> You could also try testing the disks as simple data disks on
> another computer. You could use something like the free version
> of HDTune for Windows, as a test stimulus for the drives (i.e. no need
> for the OS to see a file system on the drive, to test it). HDTune has
> a read benchmark, that reads the disk surface, and also has an error scan.
> It also reads drive temperature via SMART (that is, as long as
> the port the disk is connected to, can issue SMART commands).
>
> http://www.hdtune.com/download.html


Wow. I'm humbled by your knowledge of this topic and very, very
grateful (there you go, I said I would be).

I believe my main problem is my desire to silence the machine as far as
practical, so after turning off most case fans it was getting a bit hot in
there. Though I can't back this with hardcore science, I've discovered
that re-enabling a couple of case fans today gave me several hours of
uptime, enough to complete the resync. This brought my recovery task to
critical mass - with the resync done, disk activity mostly stops and the
heat that causes the problem subsides. That's my theory - but the
beautiful truth is it's still going now. Now I can turn to a
preventative course of action rather than a desperate recovery task.
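
One way to put some numbers behind a theory like this would be to log resync
progress while the disks are busy. A minimal sketch, assuming the array is
Linux md software RAID (the thread doesn't say which RAID software is in
use) and that /proc/mdstat shows the usual "resync = N% ... finish=Mmin"
progress line; the function names are hypothetical.

```python
import re

# Matches e.g. "resync =  6.2% (100/1600) finish=240.0min speed=100K/sec"
RESYNC_RE = re.compile(r"(resync|recovery)\s*=\s*([\d.]+)%.*?finish=([\d.]+)min")


def resync_status(mdstat_text):
    """Return action, percent done and minutes remaining, or None if idle."""
    match = RESYNC_RE.search(mdstat_text)
    if match is None:
        return None
    return {
        "action": match.group(1),
        "percent": float(match.group(2)),
        "finish_min": float(match.group(3)),
    }


def read_mdstat(path="/proc/mdstat"):
    """Read the kernel's md status file as text."""
    with open(path) as handle:
        return handle.read()
```

Polling resync_status(read_mdstat()) every minute or so, and noting when the
percentage stops moving, would show how far the resync gets before a hang.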

Thanks again.

>
> Paul

 