SMART responses for SATA disks on SAS get interpreted as errors

Bug #583128 reported by mbailey
This bug affects 2 people
Affects          Status         Importance   Assigned to   Milestone
linux (Ubuntu)   Fix Released   Medium       Lee Jones
Lucid            Fix Released   Medium       Unassigned

Bug Description

Ubuntu Server Lucid 10.04

I am hitting a kernel bug as reported here:

https://bugzilla.kernel.org/show_bug.cgi?id=13594

It is fixed in 2.6.34, by this:

http://git.kernel.org/?p=linux/kernel/git/jejb/scsi-misc-2.6.git;a=commit;h=91b25002bd58f55207e4662a611a6cded4ef9834

tl;dr: SMART statistics returned by the controller are falsely interpreted as drive faults by the SCSI subsystem, which then kicks the drives out of md volumes.

mbailey (mbailey-system42) wrote :
Lee Jones (lag)
Changed in linux (Ubuntu):
assignee: nobody → Lee Jones (lag)
mbailey (mbailey-system42) wrote :

Workaround: use the maverick 2.6.34 kernel

True fix apparently coming in the 2.6.32.13 stable update for Lucid.

Lee Jones (lag) wrote :

I believe I have fixed this issue in the 2.6.32 version of the kernel.

The fixed kernels are available for download here: http://people.canonical.com/~ljones/lp583128-lucid

Would those of you affected by this bug be kind enough to test these fixes and report back here please?

Thanks in advance.

Lee Jones (lag)
Changed in linux (Ubuntu):
status: New → Incomplete
mbailey (mbailey-system42) wrote :

Still getting these in dmesg on 2.6.32-22 from your site:

[ 139.056987] sd 1:0:2:0: [sdk] Add. Sense: ATA pass through information available
[ 139.298118] sd 1:0:3:0: [sdl] Sense Key : Recovered Error [current] [descriptor]
[ 139.298122] Descriptor sense data with sense descriptors (in hex):
[ 139.298124] 72 01 00 1d 00 00 00 0e 09 0c 00 00 01 00 00 00
[ 139.298129] 00 4f 00 c2 00 50

I don't get these with .34-2

Thanks,
-mb
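
For readers who want to check what the sense bytes in the dmesg excerpt above actually say, here is a minimal sketch that decodes them. It assumes the standard descriptor-format sense layout (response code 0x72, sense key in byte 1, ASC/ASCQ in bytes 2 and 3, descriptors from byte 8) and the ATA Status Return descriptor (type 0x09). The function names are hypothetical and this is not the kernel's code path, only an illustration of why this particular reply is informational rather than a fault.

```python
# Sense bytes copied from the dmesg excerpt in this comment.
SENSE_HEX = "72 01 00 1d 00 00 00 0e 09 0c 00 00 01 00 00 00 00 4f 00 c2 00 50"

RECOVERED_ERROR = 0x1          # sense key reported in the log
ATA_RETURN_DESCRIPTOR = 0x09   # descriptor type carrying ATA registers


def parse_descriptor_sense(raw: bytes) -> dict:
    """Split descriptor-format sense data into its header and descriptors."""
    if (raw[0] & 0x7F) not in (0x72, 0x73):
        raise ValueError("not descriptor-format sense data")
    info = {
        "sense_key": raw[1] & 0x0F,
        "asc": raw[2],
        "ascq": raw[3],
        "descriptors": [],
    }
    end = min(8 + raw[7], len(raw))  # byte 7 = additional sense length
    pos = 8
    while pos + 2 <= end:
        dtype, dlen = raw[pos], raw[pos + 1]
        info["descriptors"].append((dtype, raw[pos + 2 : pos + 2 + dlen]))
        pos += 2 + dlen
    return info


def is_ata_passthrough_info(info: dict) -> bool:
    """True for Recovered Error with ASC/ASCQ 00h/1Dh, i.e.
    "ATA pass through information available" as printed in the log."""
    return (
        info["sense_key"] == RECOVERED_ERROR
        and info["asc"] == 0x00
        and info["ascq"] == 0x1D
    )


info = parse_descriptor_sense(bytes.fromhex(SENSE_HEX))
dtype, body = info["descriptors"][0]
lba_mid, lba_high, status = body[7], body[9], body[11]
print(is_ata_passthrough_info(info), hex(dtype), hex(lba_mid), hex(lba_high), hex(status))
```

In the ATA descriptor, LBA mid/high of 0x4f/0xc2 is the signature a drive returns for SMART RETURN STATUS when no threshold has been exceeded, and status 0x50 has no error bit set, so the reply only carries register contents back to the caller.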

mbailey (mbailey-system42) wrote :

Also, no md arrays de-synched with .32-22, but this is almost prod, so I'll probably stick with .34-2

Andy Whitcroft (apw)
tags: added: v2.6.23.13
tags: added: v2.6.32.13
removed: v2.6.23.13
Lee Jones (lag) wrote :

This is expected to be fixed by updates coming to Ubuntu via 2.6.32.13; we will await those fixes.

Stefan Bader (smb) wrote :

This bugfix has been included in the upstream stable update for Lucid to 2.6.32.13 (see Bug #583414).

Changed in linux (Ubuntu):
status: Incomplete → Fix Committed
importance: Undecided → Medium
status: Fix Committed → Fix Released
Changed in linux (Ubuntu Lucid):
importance: Undecided → Medium
status: New → Fix Committed
Martin Pitt (pitti) wrote : Please test proposed package

Accepted linux into lucid-proposed; the package will build now and be available in a few hours. Please test and give feedback here. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation on how to enable and use -proposed. Thank you in advance!

tags: added: verification-needed
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package linux - 2.6.32-23.37

---------------
linux (2.6.32-23.37) lucid-proposed; urgency=low

  [ Alex Deucher ]

  * SAUCE: drm/radeon/kms/atom: fix dual-link DVI on DCE3.2/4.0
    - LP: #564559

  [ Andy Whitcroft ]

  * [Config] ports -- build in dm-mod to enable LVM boot
    - LP: #560717
  * tools -- fix perf version extraction for multi-part flavours
    - LP: #555130
  * SAUCE: ACPI: EC: Allow multibyte access to EC (v3)
    - LP: #526354
  * [Config] enforce -- ensure dm_mod is built-in for LVM
    - LP: #560717
  * update to ubuntu-debian:7e708d33054c373faf41da23b73e8b48c342d958
    - LP: #570500, #576274

  [ Chase Douglas ]

  * Revert "(pre-stable): input: ALPS - Add signature for HP Pavilion dm3
    laptops"
    - LP: #550625
  * Enable ftrace function profiler
    - LP: #570389
  * enforce CONFIG_TMPFS_POSIX_ACL=y
    - LP: #575940

  [ Leann Ogasawara ]

  * Revert "staging/comdi -- disable"
    - LP: #563436
  * [Config] Enable multicast routing for sparc
    - LP: #416266
  * [Config] Add ahci.ko to virtual sub-flavour
    - LP: #570542

  [ Stefan Bader ]

  * Revert "SAUCE: drm/i915: Disable FBC on 915GM and 945GM"
    - LP: #588832

  [ Tim Gardner ]

  * ubuntu: rtl8192se -- update to version 0015.0127.2010
    - LP: #567016
  * [Config] Add atl1c to nic-modules udeb
    - LP: #557130

  [ Upstream Kernel Changes ]

  * Revert "(pre-stable) iwlwifi: fix nfreed--"
    - LP: #575853
  * Revert "backlight: mbp_nvidia_bl - add five more MacBook variants"
    - LP: #575853
  * Revert "(pre-stable) pata_via: Add VIA VX900 support"
    - LP: #575853
  * Revert "(pre-stable) x86-32, resume: do a global tlb flush in S4
    resume"
    - LP: #575853
  * Revert "x86: disable IOMMUs on kernel crash"
    - LP: #575853
  * Revert "sunrpc: fix peername failed on closed listener"
    - LP: #575853
  * Revert "sunrpc: move the close processing after do recvfrom method"
    - LP: #575853
  * Revert "(pre-stable) drm/edid: allow certain bogus edids to hit a fixup
    path rather than fail"
    - LP: #575853
  * Revert "drm/radeon/kms: don't print error on -ERESTARTSYS."
    - LP: #575853
  * Revert "ath9k: fix lockdep warning when unloading module" on stable
    kernels
    - LP: #588832
  * Staging: comedi: removed "depricated" from COMEDI_CB_BLOCK
    - LP: #483343
  * fat: fix buffer overflow in vfat_create_shortname()
    - LP: #575853
  * xfs: simplify inode teardown
    - LP: #575853
  * xfs: fix mmap_sem/iolock inversion in xfs_free_eofblocks
    - LP: #575853
  * xfs: I/O completion handlers must use NOFS allocations
    - LP: #575853
  * xfs: Wrapped journal record corruption on read at recovery
    - LP: #575853
  * xfs: Fix error return for fallocate() on XFS
    - LP: #575853
  * xfs: check for not fully initialized inodes in xfs_ireclaim
    - LP: #575853
  * xfs: fix timestamp handling in xfs_setattr
    - LP: #575853
  * xfs: Don't flush stale inodes
    - LP: #575853
  * xfs: Ensure we force all busy extents in range to disk
    - LP: #575853
  * xfs: reclaim inodes under a write lock
    - LP: #575853
  * xfs: Avoid inodes in reclaim when flushing from inode cache
    - LP: #575853
  * xfs: recla...

Changed in linux (Ubuntu Lucid):
status: Fix Committed → Fix Released