Disk failure or SCSI controller failure?

Discussion in 'Sun Hardware' started by Andrew Tyson, Apr 16, 2004.

    Hi,

    At work we have a U60 running a secondary PCI SCSI controller with a
    Maxtor 36GB 68-pin drive (the latter is only 6 months old). In the last
    day we have had two severe disk failures where a filesystem mounted on
    the secondary controller/drive fails (trying to 'ls' the FS causes an
    I/O error). Looking in /var/adm/messages, it can be seen that the disk
    starts reporting problems and eventually fails completely. The problem
    appears to be intermittent, because the FS can be fsck'ed and mounted
    again once the machine is power cycled. However, it has now occurred
    twice in a short space of time.

    I am wondering whether there is an easy way of diagnosing whether the
    problem is disk related or controller related. The easiest way would be
    to attach a second disk to the controller and see whether it is OK;
    however, I do not have the means to do this right now. Conversely, we
    could attach the disk to another controller, but the disk is required
    in a production system.

    I have attached selected output from /var/adm/messages:

    Apr 15 19:32:43 europa scsi: [ID 365881 kern.info] /[email protected],4000/[email protected] (glm2):
    Apr 15 19:32:43 europa Cmd (0x1554c48) dump for Target 0 Lun 0:
    Apr 15 19:32:43 europa scsi: [ID 365881 kern.info] /[email protected],4000/[email protected] (glm2):
    Apr 15 19:32:43 europa cdb=[ 0x28 0x0 0x1 0x81 0x13 0x2c 0x0 0x0 0x10 0x0 ]
    Apr 15 19:32:43 europa scsi: [ID 365881 kern.info] /[email protected],4000/[email protected] (glm2):
    Apr 15 19:32:43 europa pkt_flags=0x4000 pkt_statistics=0x61 pkt_state=0x7
    Apr 15 19:32:43 europa scsi: [ID 365881 kern.info] /[email protected],4000/[email protected] (glm2):
    Apr 15 19:32:43 europa pkt_scbp=0x0 cmd_flags=0x8e1
    Apr 15 19:32:43 europa scsi: [ID 107833 kern.warning] WARNING: /[email protected],4000/scsi@2 (glm2):
    Apr 15 19:32:43 europa Disconnected tagged cmd(s) (1) timeout for Target 0.0
    Apr 15 19:32:43 europa genunix: [ID 408822 kern.info] NOTICE: glm2: fault detected in device; service still available
    Apr 15 19:32:43 europa genunix: [ID 611667 kern.info] NOTICE: glm2: Disconnected tagged cmd(s) (1) timeout for Target 0.0
    Apr 15 19:32:43 europa glm: [ID 401478 kern.warning] WARNING: ID[SUNWpd.glm.cmd_timeout.6018]
    Apr 15 19:32:43 europa scsi: [ID 107833 kern.warning] WARNING: /[email protected],4000/scsi@2/[email protected],0 (sd30):
    Apr 15 19:32:43 europa SCSI transport failed: reason 'timeout': retrying command
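
For reference, the cdb bytes in the dump above can be decoded by hand. The following is only a rough sketch, assuming the standard 10-byte READ(10) layout (opcode 0x28, big-endian LBA in bytes 2-5, big-endian transfer length in bytes 7-8); the function name decode_read10 is made up for illustration and is not part of any Solaris tool.

```python
import struct

def decode_read10(cdb: bytes) -> dict:
    """Decode a 10-byte SCSI READ(10) command descriptor block."""
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a 10-byte READ(10) CDB")
    # Bytes 2-5 hold the starting logical block address (big-endian).
    (lba,) = struct.unpack(">I", cdb[2:6])
    # Bytes 7-8 hold the number of blocks to transfer (big-endian).
    (blocks,) = struct.unpack(">H", cdb[7:9])
    return {"opcode": "read(10)", "lba": lba, "blocks": blocks}

# The cdb bytes exactly as logged by glm2 in the excerpt above.
cdb = bytes([0x28, 0x0, 0x1, 0x81, 0x13, 0x2c, 0x0, 0x0, 0x10, 0x0])
info = decode_read10(cdb)
print(info)
```

So the command that timed out looks like an ordinary 16-block read, which by itself does not point at either the disk or the controller.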

    <snip>

    Apr 15 19:39:38 europa Error for Command: read(10)    Error Level: Fatal
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.notice] Requested Block: 28822540    Error Block: 28822540
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.notice] Vendor: MAXTOR    Serial Number: B2DC6SGM
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.notice] Sense Key: Not Ready
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.notice] ASC: 0x4 (<vendor unique code 0x4>), ASCQ: 0x2, FRU: 0x0
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.warning] WARNING: /[email protected],4000/scsi@2/[email protected],0 (sd30):
    Apr 15 19:39:38 europa Error for Command: read(10)    Error Level: Fatal
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.notice] Requested Block: 28822284    Error Block: 28822284
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.notice] Vendor: MAXTOR    Serial Number: B2DC6SGM
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.notice] Sense Key: Not Ready
    Apr 15 19:39:38 europa scsi: [ID 107833 kern.notice] ASC: 0x4 (<vendor unique code 0x4>), ASCQ: 0x1, FRU: 0x0
    Apr 15 20:12:09 europa scsi: [ID 107833 kern.warning] WARNING: /[email protected],4000/scsi@2/[email protected],0 (sd30):
    Apr 15 20:12:09 europa SCSI transport failed: reason 'incomplete': retrying command
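
The log prints the ASC as "vendor unique", but in the standard SCSI additional sense code tables ASC 0x04 is the "logical unit not ready" family. Below is a small sketch that pulls the ASC/ASCQ pair out of such a line and looks it up; the table covers only the codes relevant here, and the names NOT_READY_CODES and describe_sense are hypothetical helpers, not an existing utility.

```python
import re

# Subset of the standard ASC/ASCQ assignments for ASC 0x04.
NOT_READY_CODES = {
    (0x04, 0x00): "Logical unit not ready, cause not reportable",
    (0x04, 0x01): "Logical unit is in process of becoming ready",
    (0x04, 0x02): "Logical unit not ready, initializing command required",
}

SENSE_RE = re.compile(r"ASC:\s*(0x[0-9a-fA-F]+).*?ASCQ:\s*(0x[0-9a-fA-F]+)")

def describe_sense(line: str):
    """Return (asc, ascq, description) for a syslog line, or None if absent."""
    match = SENSE_RE.search(line)
    if match is None:
        return None
    asc = int(match.group(1), 16)
    ascq = int(match.group(2), 16)
    text = NOT_READY_CODES.get((asc, ascq), "not in this table")
    return asc, ascq, text

# The two distinct sense lines from the excerpt above.
first = "ASC: 0x4 (<vendor unique code 0x4>), ASCQ: 0x2, FRU: 0x0"
second = "ASC: 0x4 (<vendor unique code 0x4>), ASCQ: 0x1, FRU: 0x0"
print(describe_sense(first))
print(describe_sense(second))
```

If that reading is right, the drive reported that it needed a start command and then that it was coming ready, which would be consistent with the drive having reset or spun down. That is an interpretation of the sense codes only, not a confirmed diagnosis.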


    Thanks and regards,
    Andrew
     