Results 1 to 6 of 6

Thread: Issues with ATI SB700/SB800 SATA Controller

  1. #1

    Question Issues with ATI SB700/SB800 SATA Controller

    I wonder if this is a regression, I have seen info that this bug had been in the SB600 and worked around with 2.6.26:

    Jul 3 08:16:01 bigbox kernel: ata1: softreset failed (device not ready)
    Jul 3 08:16:01 bigbox kernel: ata1: failed due to HW bug, retry pmp=0
    Jul 3 08:16:02 bigbox kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
    Jul 3 08:16:02 bigbox smartd[4447]: Device: /dev/sda [SAT], not capable of SMART self-check
    Jul 3 08:16:02 bigbox kernel: ata1.00: configured for UDMA/133
    Jul 3 08:16:02 bigbox kernel: ata1: EH complete
    Jul 3 08:16:02 bigbox kernel: sd 0:0:0:0: [sda] 2930277168 512-byte hardware sectors: (1.50TB/1.36TiB)
    Jul 3 08:16:02 bigbox kernel: sd 0:0:0:0: [sda] Write Protect is off
    Jul 3 08:16:02 bigbox kernel: sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
    Jul 3 08:16:02 bigbox kernel: sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
    Jul 3 08:16:02 bigbox smartd[4447]: Device: /dev/sda [SAT], SMART Usage Attribute: 188 Unknown_Attribute changed from 62 to 61
    Jul 3 08:16:09 bigbox kernel: ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
    Jul 3 08:16:09 bigbox kernel: ata2.00: cmd b0/da:00:00:4f:c2/00:00:00:00:00/00 tag 0
    Jul 3 08:16:09 bigbox kernel: res 40/00:00:2f:7b:a8/00:00:ae:00:00/e0 Emask 0x4 (timeout)
    Jul 3 08:16:09 bigbox kernel: ata2.00: status: { DRDY }
    Jul 3 08:16:09 bigbox kernel: ata2: hard resetting link
    Jul 3 08:16:09 bigbox kernel: ata2: softreset failed (device not ready)
    Jul 3 08:16:09 bigbox kernel: ata2: failed due to HW bug, retry pmp=0
    Jul 3 08:16:10 bigbox kernel: ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
    Jul 3 08:16:11 bigbox smartd[4447]: Device: /dev/sdb [SAT], not capable of SMART self-check
    Jul 3 08:16:11 bigbox kernel: ata2.00: configured for UDMA/133
    Jul 3 08:16:11 bigbox kernel: ata2: EH complete
    here's my lspci:
    00:00.0 Host bridge: Advanced Micro Devices [AMD] RS780 Host Bridge
    00:01.0 PCI bridge: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge (int gfx)
    00:0a.0 PCI bridge: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge (PCIE port 5)
    00:11.0 SATA controller: ATI Technologies Inc SB700/SB800 SATA Controller [IDE mode]
    00:12.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0 Controller
    00:12.1 USB Controller: ATI Technologies Inc SB700 USB OHCI1 Controller
    00:12.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Controller
    00:13.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0 Controller
    00:13.1 USB Controller: ATI Technologies Inc SB700 USB OHCI1 Controller
    00:13.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Controller
    00:14.0 SMBus: ATI Technologies Inc SBx00 SMBus Controller (rev 3a)
    00:14.1 IDE interface: ATI Technologies Inc SB700/SB800 IDE Controller
    00:14.2 Audio device: ATI Technologies Inc SBx00 Azalia (Intel HDA)
    00:14.3 ISA bridge: ATI Technologies Inc SB700/SB800 LPC host controller
    00:14.4 PCI bridge: ATI Technologies Inc SBx00 PCI to PCI Bridge
    00:14.5 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI2 Controller
    00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
    00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
    00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
    00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
    01:05.0 VGA compatible controller: ATI Technologies Inc Radeon HD 3200 Graphics
    01:05.1 Audio device: ATI Technologies Inc RS780 Azalia controller
    02:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 02)
    uname:
    Linux bigbox 2.6.27.23-0.1-xen #1 SMP 2009-05-26 17:02:05 -0400 x86_64 x86_64 x86_64 GNU/Linux
    This is happening on three out of three SATA disks in the system and, even though I can't explain how, seems to have caused data loss on the mdraid they support.

    It would be great if someone has a good idea on this one.

    adding smartctl -a, just in case

    smartctl 5.39 2008-10-24 22:33 [x86_64-suse-linux-gnu] (openSUSE RPM)
    Copyright (C) 2002-8 by Bruce Allen, http://smartmontools.sourceforge.net

    === START OF INFORMATION SECTION ===
    Device Model: ST31500341AS
    Serial Number: 9VS11W0F
    Firmware Version: CC1H
    User Capacity: 1.500.301.910.016 bytes
    Device is: Not in smartctl database [for details use: -P showall]
    ATA Version is: 8
    ATA Standard is: ATA-8-ACS revision 4
    Local Time is: Fri Jul 3 08:35:58 2009 CEST
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled

    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED

    General SMART Values:
    Offline data collection status: (0x82) Offline data collection activity
    was completed without error.
    Auto Offline Data Collection: Enabled.
    Self-test execution status: ( 0) The previous self-test routine completed
    without error or no self-test has ever
    been run.
    Total time to complete Offline
    data collection: ( 609) seconds.
    Offline data collection
    capabilities: (0x7b) SMART execute Offline immediate.
    Auto Offline data collection on/off support.
    Suspend Offline collection upon new
    command.
    Offline surface scan supported.
    Self-test supported.
    Conveyance Self-test supported.
    Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
    power-saving mode.
    Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
    General Purpose Logging supported.
    Short self-test routine
    recommended polling time: ( 1) minutes.
    Extended self-test routine
    recommended polling time: ( 255) minutes.
    Conveyance self-test routine
    recommended polling time: ( 2) minutes.
    SCT capabilities: (0x103f) SCT Status supported.
    SCT Feature Control supported.
    SCT Data Table supported.

    SMART Attributes Data Structure revision number: 10
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x000f 118 099 006 Pre-fail Always - 167828869
    3 Spin_Up_Time 0x0003 092 092 000 Pre-fail Always - 0
    4 Start_Stop_Count 0x0032 098 098 020 Old_age Always - 2709
    5 Reallocated_Sector_Ct 0x0033 098 098 036 Pre-fail Always - 93
    7 Seek_Error_Rate 0x000f 072 060 030 Pre-fail Always - 20781990
    9 Power_On_Hours 0x0032 098 098 000 Old_age Always - 2315
    10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 16
    12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 29
    184 Unknown_Attribute 0x0032 100 100 099 Old_age Always - 0
    187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
    188 Unknown_Attribute 0x0032 061 001 000 Old_age Always - 6249330706724
    189 High_Fly_Writes 0x003a 083 083 000 Old_age Always - 17
    190 Airflow_Temperature_Cel 0x0022 065 053 045 Old_age Always - 35 (Lifetime Min/Max 31/44)
    194 Temperature_Celsius 0x0022 035 047 000 Old_age Always - 35 (0 20 0 0)
    195 Hardware_ECC_Recovered 0x001a 049 032 000 Old_age Always - 167828869
    197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
    240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 69814693398795
    241 Unknown_Attribute 0x0000 100 253 000 Old_age Offline - 1345796097
    242 Unknown_Attribute 0x0000 100 253 000 Old_age Offline - 290015065

    SMART Error Log Version: 1
    No Errors Logged

    SMART Self-test log structure revision number 1
    No self-tests have been logged. [To run self-tests, use: smartctl -t]


    SMART Selective self-test log data structure revision number 1
    SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
    1 0 0 Not_testing
    2 0 0 Not_testing
    3 0 0 Not_testing
    4 0 0 Not_testing
    5 0 0 Not_testing
    Selective self-test flags (0x0):
    After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.
    Last edited by pjakobs; 02-Jul-2009 at 23:37. Reason: added smartctl -a output

  2. #2

    Default Re: Issues with ATI SB700/SB800 SATA Controller

    just giving this a quick bump, wondering if I'm the only one who's ever seen this. If so, I'll just replace the motherboard with an nVidia based one.

    pj

  3. #3

    Default Re: Issues with ATI SB700/SB800 SATA Controller

    A quick google so no authority https://bugs.launchpad.net/ubuntu/+s...ux/+bug/389192 looks like it is cropping up seems to be the same controller.

    I noticed
    I again have to disable the write caching in order for the drives to function under heavy data loads
    Don't know what it means though

    Edit
    After reading a bit more I suspect the kernel mailing list maybe a better place to get assistance.
    Man first, have a try at Info, have a look at Wiki, if all that fails Scroogle!!!!!
    If I've helped click on the Rep button I don't know what it does but it sounds cool.

  4. #4

    Default Re: Issues with ATI SB700/SB800 SATA Controller

    well, the original fix was for the SB600 chipset, but indeed what I'm seeing for the SB700/SB800 seems to be the identical issue. I was hoping there was an openSuSE patch for this rather than to look at the lkml (which would potentially mean to run a stock kernel yadda yadda)

    pj

  5. #5

    Default Re: Issues with ATI SB700/SB800 SATA Controller

    You could try catching someone more knowledable in one of the suse irc channels but I wouldn't hold your breath if it is cropping up in the 2.6.30 kernel.
    Communicate/IRC - openSUSE
    Otherwise perhaps a bug report.

    You maybe lucky and one of the devs will see this here, but unlikely.

    Edit
    Or the kernel mailing list http://en.opensuse.org/Communicate/Mailinglists
    Man first, have a try at Info, have a look at Wiki, if all that fails Scroogle!!!!!
    If I've helped click on the Rep button I don't know what it does but it sounds cool.

  6. #6

    Unhappy Re: Issues with ATI SB700/SB800 SATA Controller

    I just bought the ASRock M3A770DE motherboard, with onboard ATI SB700/SB800 SATA Controller.

    I set the RAID controller to AHCI mode, for both RAID and Hot-Swap capability.

    OpenSuSE appears to see the hardware RAID mirrors correctly, and even lets me install the system.

    However, after building the system, when the system reboots for the first time, it doesn't see anything to boot from!

    It doesn't see a MBR, nor does it see a /boot partition.

    I built the system about 20 times last night.

    When I build the server just using the disks in JBOD mode, where each of the disks looks like a plain old drive, it boots OK.

    In fact, I'm writing this message from within a Foxfire session, on this computer!

    But I want the O/S drives to be mirrored, for redundancy, and right now they are not!

    Is there currently a linux / OpenSuSE linux driver for this RAID controller, that will fix this boot problem?

    TIA

    jxfish2

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •