[i965] X freeze on karmic after resume from full screen application: i915_gem_retire_work_handler() / finish_task_switch()

Bug #388357 reported by Matt Zimmerman
78
This bug affects 11 people
Affects Status Importance Assigned to Milestone
xf86-video-intel
Fix Released
Critical
linux (Ubuntu)
Invalid
High
Unassigned
xserver-xorg-video-intel (Ubuntu)
Fix Released
High
Unassigned

Bug Description

Binary package hint: xserver-xorg-video-intel

I had finished watching a video in totem, and had been writing email using mutt and vim in a terminal for some time, when the screen stopped updating. My music was still playing, though; everything seemed to be running except for the X server (symptoms similar to bug 359392).

I was able to ssh in from another system and collect intel_gpu_dump output, which i will attach.

/proc/interrupts showed no change in the number of interrupts for i915.

The kernel logged a page allocation failure while intel_gpu_dump was running(!), which will be shown in the attached dmesg.

I've seen it happen twice now (in the span of 2 hours), and both times, dmesg shows:

[ 6000.528124] INFO: task events/1:10 blocked for more than 120 seconds.
[ 6000.528133] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 6000.528140] events/1 D 0000000100151496 0 10 2
[ 6000.528152] ffff8800bded1db0 0000000000000046 ffff8800bded1d30 0000000000013000
[ 6000.528163] ffff8800bdec83a8 0000000000013000 0000000000013000 0000000000013000
[ 6000.528173] 0000000000013000 0000000000013000 ffff8800bdec83a8 0000000000013000
[ 6000.528183] Call Trace:
[ 6000.528203] [<ffffffff806d9467>] __mutex_lock_slowpath+0xd7/0x160
[ 6000.528216] [<ffffffff802436b1>] ? finish_task_switch+0x51/0x110
[ 6000.528225] [<ffffffff806d9186>] mutex_lock+0x26/0x50
[ 6000.528260] [<ffffffffa0251ec8>] i915_gem_retire_work_handler+0x38/0x90 [i915]
[ 6000.528283] [<ffffffffa0251e90>] ? i915_gem_retire_work_handler+0x0/0x90 [i915]
[ 6000.528292] [<ffffffff802643d5>] run_workqueue+0x95/0x170
[ 6000.528300] [<ffffffff80264554>] worker_thread+0xa4/0x120
[ 6000.528310] [<ffffffff80268e90>] ? autoremove_wake_function+0x0/0x40
[ 6000.528318] [<ffffffff802644b0>] ? worker_thread+0x0/0x120
[ 6000.528327] [<ffffffff80268a35>] kthread+0x55/0xa0
[ 6000.528335] [<ffffffff802130ca>] child_rip+0xa/0x20
[ 6000.528344] [<ffffffff802689e0>] ? kthread+0x0/0xa0
[ 6000.528351] [<ffffffff802130c0>] ? child_rip+0x0/0x20

ProblemType: Bug
Architecture: amd64
Date: Wed Jun 17 10:20:15 2009
DistroRelease: Ubuntu 9.10
MachineType: LENOVO 6465CTO
Package: xserver-xorg-video-intel 2:2.7.99.1+git20090602.ec2fde7c-0ubuntu2
ProcCmdLine: root=UUID=305dde78-d20a-4248-aaf4-09447b7c5791 ro quiet splash
ProcEnviron:
 LC_COLLATE=C
 PATH=(custom, user)
 LANG=en_US.UTF-8
 SHELL=/bin/zsh
ProcVersionSignature: Ubuntu 2.6.30-9.10-generic
RelatedPackageVersions:
 xserver-xorg 1:7.4~5ubuntu21
 libgl1-mesa-glx 7.4.1-1ubuntu2
 libdrm2 2.4.11-0ubuntu1
 xserver-xorg-video-intel 2:2.7.99.1+git20090602.ec2fde7c-0ubuntu2
 xserver-xorg-video-ati 1:6.12.2-2ubuntu1
SourcePackage: xserver-xorg-video-intel
Uname: Linux 2.6.30-9-generic x86_64
dmi.bios.date: 01/21/2008
dmi.bios.vendor: LENOVO
dmi.bios.version: 7LETB0WW (2.10 )
dmi.board.name: 6465CTO
dmi.board.vendor: LENOVO
dmi.board.version: Not Available
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: Not Available
dmi.modalias: dmi:bvnLENOVO:bvr7LETB0WW(2.10):bd01/21/2008:svnLENOVO:pn6465CTO:pvrThinkPadT61:rvnLENOVO:rn6465CTO:rvrNotAvailable:cvnLENOVO:ct10:cvrNotAvailable:
dmi.product.name: 6465CTO
dmi.product.version: ThinkPad T61
dmi.sys.vendor: LENOVO
fglrx: Not loaded
system:
 distro: Ubuntu
 architecture: x86_64kernel: 2.6.30-9-generic

Revision history for this message
Matt Zimmerman (mdz) wrote :
Revision history for this message
Matt Zimmerman (mdz) wrote :
Revision history for this message
Matt Zimmerman (mdz) wrote :
Revision history for this message
Matt Zimmerman (mdz) wrote :

Here is a backtrace of the X server at the time of the hang:

#0 0x00007feedbda0ec7 in ioctl () from /lib/libc.so.6
#1 0x00007feeda9812e3 in drmIoctl () from /usr/lib/libdrm.so.2
#2 0x00007feeda9815e6 in drmCommandNone () from /usr/lib/libdrm.so.2
#3 0x00007feeda50b370 in I830BlockHandler (i=0,
    blockData=<value optimized out>, pTimeout=0x7fff7b671df8,
    pReadmask=0x7dff80) at ../../src/i830_driver.c:2281
#4 0x0000000000536885 in AnimCurScreenBlockHandler (
    screenNum=<value optimized out>, blockData=<value optimized out>,
    pTimeout=<value optimized out>, pReadmask=<value optimized out>)
    at ../../render/animcur.c:222
#5 0x0000000000500d86 in compBlockHandler (i=0, blockData=0x0,
    pTimeout=0x7fff7b671df8, pReadmask=<value optimized out>)
    at ../../composite/compinit.c:158
#6 0x00000000004520e0 in BlockHandler (pTimeout=0x7fff7b671df8,
    pReadmask=0x7dff80) at ../../dix/dixutils.c:384
#7 0x00000000004eed31 in WaitForSomething (
    pClientsReady=<value optimized out>) at ../../os/WaitFor.c:215
#8 0x000000000044dd52 in Dispatch () at ../../dix/dispatch.c:367
#9 0x0000000000433f15 in main (argc=<value optimized out>,
    argv=0x7fff7b672018, envp=<value optimized out>) at ../../dix/main.c:397

Revision history for this message
In , Bryce Harrington (bryce) wrote :
Download full text (3.7 KiB)

Created an attachment (id=26894)
intel_gpu_dump.txt.gz

Forwarding this Ubuntu bug:
https://bugs.edge.launchpad.net/ubuntu/+source/xserver-xorg-video-intel/+bug/388357

[Problem]
GPU hang with call trace in dmesg occurs subsequent to full screen activity (video playback); also seen by other users after resuming from screen blanking via DPMS and after resuming from screensaver.

[Call Trace]
[ 6000.528124] INFO: task events/1:10 blocked for more than 120 seconds.
[ 6000.528133] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 6000.528140] events/1 D 0000000100151496 0 10 2
[ 6000.528152] ffff8800bded1db0 0000000000000046 ffff8800bded1d30 0000000000013000
[ 6000.528163] ffff8800bdec83a8 0000000000013000 0000000000013000 0000000000013000
[ 6000.528173] 0000000000013000 0000000000013000 ffff8800bdec83a8 0000000000013000
[ 6000.528183] Call Trace:
[ 6000.528203] [<ffffffff806d9467>] __mutex_lock_slowpath+0xd7/0x160
[ 6000.528216] [<ffffffff802436b1>] ? finish_task_switch+0x51/0x110
[ 6000.528225] [<ffffffff806d9186>] mutex_lock+0x26/0x50
[ 6000.528260] [<ffffffffa0251ec8>] i915_gem_retire_work_handler+0x38/0x90 [i915]
[ 6000.528283] [<ffffffffa0251e90>] ? i915_gem_retire_work_handler+0x0/0x90 [i915]
[ 6000.528292] [<ffffffff802643d5>] run_workqueue+0x95/0x170
[ 6000.528300] [<ffffffff80264554>] worker_thread+0xa4/0x120
[ 6000.528310] [<ffffffff80268e90>] ? autoremove_wake_function+0x0/0x40
[ 6000.528318] [<ffffffff802644b0>] ? worker_thread+0x0/0x120
[ 6000.528327] [<ffffffff80268a35>] kthread+0x55/0xa0
[ 6000.528335] [<ffffffff802130ca>] child_rip+0xa/0x20
[ 6000.528344] [<ffffffff802689e0>] ? kthread+0x0/0xa0
[ 6000.528351] [<ffffffff802130c0>] ? child_rip+0x0/0x20

[Original Report]
I had finished watching a video in totem, and had been writing email using mutt and vim in a terminal for some time, when the screen stopped updating. My music was still playing, though; everything seemed to be running except for the X server (symptoms similar to bug 359392).

I was able to ssh in from another system and collect intel_gpu_dump output, which i will attach.

/proc/interrupts showed no change in the number of interrupts for i915.

The kernel logged a page allocation failure while intel_gpu_dump was running(!), which will be shown in the attached dmesg.

I've seen it happen twice now (in the span of 2 hours), and both times, dmesg shows the above trace.

ProblemType: Bug
Architecture: amd64
Date: Wed Jun 17 10:20:15 2009
DistroRelease: Ubuntu 9.10
MachineType: LENOVO 6465CTO
Package: xserver-xorg-video-intel 2:2.7.99.1+git20090602.ec2fde7c-0ubuntu2
ProcCmdLine: root=UUID=305dde78-d20a-4248-aaf4-09447b7c5791 ro quiet splash
ProcEnviron:
 LC_COLLATE=C
 PATH=(custom, user)
 LANG=en_US.UTF-8
 SHELL=/bin/zsh
ProcVersionSignature: Ubuntu 2.6.30-9.10-generic
RelatedPackageVersions:
 xserver-xorg 1:7.4~5ubuntu21
 libgl1-mesa-glx 7.4.1-1ubuntu2
 libdrm2 2.4.11-0ubuntu1
 xserver-xorg-video-intel 2:2.7.99.1+git20090602.ec2fde7c-0ubuntu2
 xserver-xorg-video-ati 1:6.12.2-2ubuntu1
SourcePackage: xserver-xorg-video-intel
Uname: Linux 2.6.30-9-generic x86_64
dmi.bios.date: 01/21/2008
dmi.bios.vendor: LENOVO
dmi.bios.version: 7LETB0WW (2.10 )
...

Read more...

Revision history for this message
In , Bryce Harrington (bryce) wrote :

Created an attachment (id=26895)
dmesg

Revision history for this message
In , Bryce Harrington (bryce) wrote :

Ubuntu bugs with similar backtraces which I suspect are dupes:

  https://bugs.edge.launchpad.net/ubuntu/+source/linux/+bug/383973
  Freeze when trying to resume from a blanked screen

  https://bugs.edge.launchpad.net/ubuntu/+source/xserver-xorg-video-intel/+bug/383822
  Freeze when trying to resume from screensaver

  https://bugs.edge.launchpad.net/ubuntu/+source/xserver-xorg-video-intel/+bug/384242
  Freeze after DPMS has kicked in

  https://bugs.edge.launchpad.net/ubuntu/+source/linux/+bug/384865
  kernel oops with intel graphics when screensaver turns screen off

Revision history for this message
In , Bryce Harrington (bryce) wrote :

*** Bug 22318 has been marked as a duplicate of this bug. ***

Revision history for this message
Matt Zimmerman (mdz) wrote : Re: [i965] GPU hang on i965 under karmic: INFO: task events/1:10 blocked for more than 120 seconds

seb128 mentioned that he sees similar dmesg output for bug bug 383822, so they may be related.

summary: - [i965] GPU hang on i965 under karmic
+ [i965] GPU hang on i965 under karmic: INFO: task events/1:10 blocked for
+ more than 120 seconds
description: updated
Revision history for this message
Matt Zimmerman (mdz) wrote :

I've turned off desktop effects for now, to see if it goes away.

Revision history for this message
In , Eric Anholt (eric-anholt) wrote :

That backtrace is a generic "the gpu is hung" backtrace. Don't use it for classifying bugs.

The dump in this report is broken because there were too many batchbuffers queued up and seqfile failed thanks to its use of kmalloc (the page allocation failure warning). If you can find a way to reliably reproduce the problem, and any 3D applications are in use, running them with INTEL_DEBUG=sync in the environment may help get successful dumping.

Revision history for this message
In , Bryce Harrington (bryce) wrote :

Here is a backtrace of the X server at the time of the hang:

#0 0x00007feedbda0ec7 in ioctl () from /lib/libc.so.6
#1 0x00007feeda9812e3 in drmIoctl () from /usr/lib/libdrm.so.2
#2 0x00007feeda9815e6 in drmCommandNone () from /usr/lib/libdrm.so.2
#3 0x00007feeda50b370 in I830BlockHandler (i=0,
    blockData=<value optimized out>, pTimeout=0x7fff7b671df8,
    pReadmask=0x7dff80) at ../../src/i830_driver.c:2281
#4 0x0000000000536885 in AnimCurScreenBlockHandler (
    screenNum=<value optimized out>, blockData=<value optimized out>,
    pTimeout=<value optimized out>, pReadmask=<value optimized out>)
    at ../../render/animcur.c:222
#5 0x0000000000500d86 in compBlockHandler (i=0, blockData=0x0,
    pTimeout=0x7fff7b671df8, pReadmask=<value optimized out>)
    at ../../composite/compinit.c:158
#6 0x00000000004520e0 in BlockHandler (pTimeout=0x7fff7b671df8,
    pReadmask=0x7dff80) at ../../dix/dixutils.c:384
#7 0x00000000004eed31 in WaitForSomething (
    pClientsReady=<value optimized out>) at ../../os/WaitFor.c:215
#8 0x000000000044dd52 in Dispatch () at ../../dix/dispatch.c:367
#9 0x0000000000433f15 in main (argc=<value optimized out>,
    argv=0x7fff7b672018, envp=<value optimized out>) at ../../dix/main.c:397

Looks the same as in
https://bugs.freedesktop.org/show_bug.cgi?id=20560

Revision history for this message
Matt Zimmerman (mdz) wrote : Re: [i965] GPU hang on i965 under karmic: INFO: task events/1:10 blocked for more than 120 seconds

The dmesg on bug 383973 shows the same message and backtrace

Revision history for this message
Matt Zimmerman (mdz) wrote :

After two hangs in two hours with desktop effects enabled, I haven't seen it happen again after 5 hours with them disabled. Not conclusive yet, but a data point.

Revision history for this message
Bryce Harrington (bryce) wrote :

<jbarnes> bryce: yeah that might be a dupe of the vblank sync bug

Changed in xserver-xorg-video-intel:
status: Unknown → Confirmed
Bryce Harrington (bryce)
summary: - [i965] GPU hang on i965 under karmic: INFO: task events/1:10 blocked for
- more than 120 seconds
+ [i965] X freeze on karmic after resume from full screen application:
+ i915_gem_retire_work_handler() / finish_task_switch()
Changed in xserver-xorg-video-intel (Ubuntu):
importance: Undecided → High
status: New → Triaged
Changed in linux (Ubuntu):
importance: Undecided → High
status: New → Triaged
Revision history for this message
Bryce Harrington (bryce) wrote :

Based on the upstream comment, I'm not 100% sure all the bugs I've duped to this one are indeed the same bug, but the data collected so far on each of them matches. We may need to undupe them if we find a better way to differentiate them, or find the fix to this one doesn't fix all of them.

Revision history for this message
Bryce Harrington (bryce) wrote :

This does sound similar to the vblank sync bug, but the symptoms differ, and we have already pulled in the patch for the vblank sync bug: 120_fix_dri2_vblank_syncing_segfault.patch (commit b8e360bf2b77d28559d15a7c0f9c766848eb6ced)

Revision history for this message
Bryce Harrington (bryce) wrote :

<tjaalton> Sarvatt thinks it's because of this change that got in after 2.6.30 rc8 http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=c9fb15f60eb517c958dec64dca9357bf62bf2201 ; -8 works fine, -9 doesn't, at least on my box

Revision history for this message
Bryce Harrington (bryce) wrote :

mdz, I notice you're listed as running 2.6.30-9-generic in this bug report. Did you only start seeing this issue after moving from -8?

Revision history for this message
Nicolò Chieffo (yelo3) wrote : Re: [Bug 388357] Re: [i965] X freeze on karmic after resume from full screen application: i915_gem_retire_work_handler() / finish_task_switch()

I'm seeing a dupe with -8 too

Revision history for this message
Bryce Harrington (bryce) wrote :

Found http://bugzilla.kernel.org/show_bug.cgi?id=13490 as well, which is another i965 gpu freeze on Ubuntu with the same call trace and backtrace, however he traced it to a commit in the -intel driver: ec2fde7c8250fdc30984f16c8a1d3587d70b0144 ("Sync DRI2 CopyRegion to vertical retrace"), and was seeing it on the -rc8 mainline kernel.

Revision history for this message
GiuseppeVerde (launchpad-digitasaru) wrote :

Sorry I can't update; I've gotten busy. I'll get back when I get a moment when it's not incredibly inconvenient to kill X.

Revision history for this message
Bryce Harrington (bryce) wrote :

The discussion around that kernel bug moved to https://bugs.freedesktop.org/show_bug.cgi?id=22283

Revision history for this message
Matt Zimmerman (mdz) wrote : Re: [Bug 388357] Re: [i965] X freeze on karmic after resume from full screen application: i915_gem_retire_work_handler() / finish_task_switch()

On Wed, Jun 17, 2009 at 07:25:42PM -0000, Bryce Harrington wrote:
> mdz, I notice you're listed as running 2.6.30-9-generic in this bug
> report. Did you only start seeing this issue after moving from -8?

I have only ever seen this issue these two times, and both were with
2.6.30-9.10.

--
 - mdz

Revision history for this message
Robert Hooker (sarvatt) wrote :

I should clarify about that comment tjaalton made that I just looked for changes between rc8 and .30 release that touched on the dpms handling that he said he was having problems with, not that I think that commit was the root of the problem or related to this bug.

Revision history for this message
Matt Zimmerman (mdz) wrote :

I've just seen another hang, where I wasn't running compiz. I didn't get the blocked task in dmesg, but otherwise the symptoms were similar. This happened after an hour or so of work in the morning, after dismissing a full-screen totem session (which I'd accidentally left running overnight when I meant to suspend).

Revision history for this message
Arvind (arvind0) wrote :

The hangs do not occur every time I blank the screen using xset dpms force off. Sometimes it occurs the 1st time I do it, sometimes after many blanks.
I'm on jaunty with the 2.6.28-11-generic kernel, with the latest xserver-xorg-core and have the xserver-xorg-video-intel packages from the xorg-edgers ppa.

On an X3100 - Mobile GM965/GL960 Integrated Graphics Controller [8086:2a02] (rev 0c)

Revision history for this message
In , Matt Zimmerman (mdz) wrote :

I have experienced this problem several times without the use of any 3D applications (I switched from compiz to metacity in hopes of a workaround).

I'll attach another GPU dump. This one was taken sooner after the hang, so perhaps it will be more useful. How can I tell a useful dump from a useless one?

Revision history for this message
In , Matt Zimmerman (mdz) wrote :

Created an attachment (id=26932)
intel_gpu_dump output from a subsequent hang

Revision history for this message
Matt Zimmerman (mdz) wrote :

The "full screen" aspect of this bug seems to be a red herring. It has happened to me several times today without any full-screen activity.

Revision history for this message
Bryce Harrington (bryce) wrote :

mdz, I've forwarded this bug upstream to https://bugs.freedesktop.org/show_bug.cgi?id=22336 - Eric wants the dump to be generated in a different way, could you follow his directions to collect a dump using INTEL_DEBUG=sync and post the results on the upstream bug?

Re the full screen aspect, it could be coincidental, but sort of jumped out. As Eric says, the only way to know exactly what's going on is to check the dump output.

Revision history for this message
Matt Zimmerman (mdz) wrote : Re: [Bug 388357] Re: [i965] X freeze on karmic after resume from full screen application: i915_gem_retire_work_handler() / finish_task_switch()

On Thu, Jun 18, 2009 at 08:37:16PM -0000, Bryce Harrington wrote:
> mdz, I've forwarded this bug upstream to
> https://bugs.freedesktop.org/show_bug.cgi?id=22336 - Eric wants the dump
> to be generated in a different way, could you follow his directions to
> collect a dump using INTEL_DEBUG=sync and post the results on the
> upstream bug?
>
> Re the full screen aspect, it could be coincidental, but sort of jumped
> out. As Eric says, the only way to know exactly what's going on is to
> check the dump output.

I don't think INTEL_DEBUG=sync applies, since I can get the crash without
any 3D applications. I attached a second dump to see if it's more useful
than the first.

--
 - mdz

Revision history for this message
Oleksij Rempel (olerem) wrote :

I have same symptoms on intel DG45ID (g45)... so probably the same bug. It is easy to reproduce: start video in full screen, in about 80 minutes xorg will crash.

Revision history for this message
Nicolò Chieffo (yelo3) wrote : Re: [Bug 388357] Re: [i965] X freeze on karmic after resume from full screen application: i915_gem_retire_work_handler() / finish_task_switch()

Do you see this bug also after a resume from suspend?

Revision history for this message
Matt Zimmerman (mdz) wrote : Re: [Bug 388357] Re: [i965] X freeze on karmic after resume from full screen application: i915_gem_retire_work_handler() / finish_task_switch()

On Fri, Jun 19, 2009 at 08:45:47AM -0000, Nicolò Chieffo wrote:
> Do you see this bug also after a resume from suspend?

Not directly after, but I do suspend every night.

--
 - mdz

Revision history for this message
Nicolò Chieffo (yelo3) wrote : Re: [Bug 388357] Re: [i965] X freeze on karmic after resume from full screen application: i915_gem_retire_work_handler() / finish_task_switch()

Immediately after a resume my screen remains black, so I think that my
bug is not a duplicate of this

Revision history for this message
In , Eric Anholt (eric-anholt) wrote :

The dump there looks pretty sane. Do you have a way to reproduce this bug?

Revision history for this message
Bryce Harrington (bryce) wrote :

Nicolo, I've unduped your bug 383973 and will follow up there.

Revision history for this message
In , Matt Zimmerman (mdz) wrote :

"sane" as in "not broken like the previous one" or "sane" as in "contains no indication of any problem"? Has this information provided any clue as to where the problem lies?

I've switched from compiz to metacity to get my life back, but was seeing this recur a couple of times per day while using compiz. I expect I could reproduce it by going back to compiz. Is there something more I can do to help diagnose the problem if indeed I can reproduce it? I am happy to try.

If you want to have a go at reproducing it on your own hardware, I recommend trying with Ubuntu Karmic alpha 2: http://cdimage.ubuntu.com/releases/karmic/alpha-2/

Revision history for this message
In , Eric Anholt (eric-anholt) wrote :

"contains no indication of any problem"

There are changes in intel-gpu-tools git that improve dump reporting and might have more information, but I don't expect it to help. We just need to figure out how to reliably reproduce the problem in a short period of time, so we can fix it.

Revision history for this message
GiuseppeVerde (launchpad-digitasaru) wrote :

I don't think my bug's a duplicate. I can still reproduce, even with xorg-edgers (ii xserver-xorg-video-intel 2:2.7.99.901+git20090624.5d80e24b-0ubuntu0sarvatt). Posting more info on my bug˙

Revision history for this message
GiuseppeVerde (launchpad-digitasaru) wrote :

Eh, maybe it is a dup. I'll include my info here; I just reproduced it this evening accidentally.

Revision history for this message
GiuseppeVerde (launchpad-digitasaru) wrote :

(It should be noted that xorg.conf is blank)

Revision history for this message
Martin Emrich (emme) wrote :

I disabled desktop effects a week ago, and since then I had no further crash after resuming von S3. So it's possible that it's related to 3D acceleration or compiz.

Ciao

Martin

Revision history for this message
Oleksij Rempel (olerem) wrote : Re: [Bug 388357] Re: [i965] X freeze on karmic after resume from full screen application: i915_gem_retire_work_handler() / finish_task_switch()

My video issue was fixed with latest driver (xserver-xorg-video-intel
2:2.7.99.901+git20090624.5d80e24b) ...

Revision history for this message
Robert Hooker (sarvatt) wrote :

Drop-in replacements for the intel driver in karmic that contains the fix are available in this PPA.

https://edge.launchpad.net/~ubuntu-x-swat/+archive/x-updates

Revision history for this message
In , Jeffrey Baker (jwbaker) wrote :

*** Bug 22624 has been marked as a duplicate of this bug. ***

Revision history for this message
In , Jeffrey Baker (jwbaker) wrote :

I have a 100% reliable way to reproduce this on Ubuntu Karmic x86_64. On any normal system with all defaults, kms and compiz enabled, just login and wait for the screen to blank. That hangs the display right there, and I get these same stacks in dmesg (see my duplicated bug, which actually has two hung stacks, not just the one noted here).

Revision history for this message
In , Jeffrey Baker (jwbaker) wrote :

Created an attachment (id=27401)
output of intel_gpu_dump.gz after hang

My GPU dump after hang ... looks similar to previous but I'm not smart enough to tell the difference.

Geir Ove Myhr (gomyhr)
tags: added: 965gm freeze karmic
Revision history for this message
In , Eric Anholt (eric-anholt) wrote :

jwbaker, you have a completely different bug. We still need to figure out how to reliably reproduce this one.

Revision history for this message
In , Matt Zimmerman (mdz) wrote :

What additional information can I provide? The best recipe I have so far is:

- Install a recent Ubuntu snapshot
- Boot the system
- Work normally in X for a while

I've re-enabled compiz to confirm that it still happens with the latest bits. However, unless you can provide instructions for diagnosis, the best I'll be able to do is run intel_gpu_dump and attach another (probably useless) dump.

Revision history for this message
Maxnux (maximo-monsalvo) wrote :

I can confirm the same error in a notebook asus z96f with i945 intel chipset,
x windows hang with black screen but music continuos playing, ssh etc .

Revision history for this message
Robert Hooker (sarvatt) wrote :

Maxnux please open a new bug for your problem, there is no way to verify that your system setup is the same as this bug reporters from what you stated.

Revision history for this message
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package xserver-xorg-video-intel - 2:2.7.99.901+git20090702.74227141-0ubuntu1

---------------
xserver-xorg-video-intel (2:2.7.99.901+git20090702.74227141-0ubuntu1) karmic; urgency=low

  [Robert Hooker]
  * Update to git 20090702 (master branch) up to commit 74227141
   - Treat disabled CRTCs as "not covering" for scanline wait purposes.
     Fixes problems returning from a dpms off state.
     (LP: #390917, #383973, #390633, #385448, #388357)
   - Add an xorg.conf option to control swapbuffers behavior.
   - Set hot plug interrupt to detect HDMI output.
   - Add a pipe-a quirk for Dell mini
     (LP: #395306)
   - Add a pipe-a quirk for thinkpad x30
     (LP: #304614)
   - Reenable XvMC support in UXA. Now includes vld support.
   - Fix EDID for LVDS output device to add the default modes in UMS.
     (LP: #385832)
   - Export EDID when using KMS, so Display options shows info correctly
     (LP: #395140)
   - Fixes for SDVO LVDS mode detection.
   - Fix major performance regression of trapezoid rendering in UXA.
   - Fixes for hangs when GL compositing is used with multiple monitors
     with different refresh rates.
   - Support for new IGDNG devices.
   - Only get the VBIOS in non-KMS mode
   - Add a few error messages for DRM initialization
   - Don't try to pin buffers in KMS mode
   - Fix 945GM VT switch in UMS
   - Clear the bo on the rotate scratch pixmap (Fixes problems with rotation)
   - uxa: Fix segfault on source-only picture usage with FallbackDebug.
   - Various XvMC fixes.
  * Merge with Debian experimental. Remaining Ubuntu changes:
   - Add lpia architecture
  * Drop patches since they're already upstream:
    - 110_quirk_hp_mini.patch
    - 117_quirk_thinkpad_x30.patch
    - 118_pixmap_inline_funcs.patch
    - 119_fix_drawable_abuse.patch
    - 120_fix_dri2_vblank_syncing_segfault.patch
    - 121_dont_change_blank_sync_width.patch
    - 122_wait_for_vblank_to_sub_border_color.patch
    - 123_kms_export_edid.patch

 -- Bryce Harrington <email address hidden> Thu, 09 Jul 2009 14:02:45 -0700

Changed in xserver-xorg-video-intel (Ubuntu):
status: Triaged → Fix Released
Revision history for this message
Maxnux (maximo-monsalvo) wrote :

I been upgraded today and error seems to be gone

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Closing linux kernel task.

Changed in linux (Ubuntu):
status: Triaged → Invalid
Revision history for this message
In , Achim (ach1m) wrote :

Created an attachment (id=28262)
intel gpu dump, dmesg and system info

I am not 100% sure if I have the same problem but I hope the attached information will help to clarify this.

How I produced the gpu hung.

(steps which were used to produce the gpu_dump)
1. compiz is in use
2. open 25-30 pictures 1920x1200 with EOG
3. press <Alt>+<Tap>
4. the screen should now be frozen except the mouse cursor, but it could be that the mouse cursor is also frozen.

I was not able to reproduce this with metacity as window manager (30 pictures).

Another way to hung the gpu is to change the wallpaper in gnome while compiz is active (1920x1200). The gpu doesn't always hung immediately.

1. compiz is in use
2. 1-3 workspaces with each a full-screen windows open
3. 4th workspace to change the wallpaper
4. the screen should be frozen immediately or after some time.

If it doesn't hung immediately you should choose different wallpaper until the system is frozen. I think that the system freezes much easier if I had it in use for some time, before I try to change the wallpaper.

I hope this information is somehow helpful.

Regards
Achim

Revision history for this message
In , Eric Anholt (eric-anholt) wrote :

I can't say for sure in your case, since you didn't mention using any other 3d apps, it looks like you've got a screen with an appropriately aligned height, and it looks like compiz doesn't use a depth buffer, but it may still be worth trying with this commit series:

xf86-video-intel:
commit e8f0763d405a8152c74c28792c52fe12c1d41dd5
Author: Eric Anholt <email address hidden>
Date: Fri Aug 7 18:24:44 2009 -0700

    Fix math in the tiling alignment fix.

commit 222b52ef16895823fbf3a0fc0be4eb23b930ed1b
Author: Eric Anholt <email address hidden>
Date: Fri Aug 7 18:05:29 2009 -0700

    Align tiled pixmap height so we don't address beyond the end of our buffers.

Mesa:
commit ceb8afcca5b0a52b005a782ea54b301beaee1a15
Author: Eric Anholt <email address hidden>
Date: Fri Aug 7 18:09:31 2009 -0700

    intel: Align region height as required for tiled regions.

    Otherwise, we would address beyond the end of our buffers. Fixes reliable
    GPU segfault with texture_tiling=true and oglconform shadow.c.

    Bug #22406.

Revision history for this message
In , Chris Wilson (ickle) wrote :

Created an attachment (id=29294)
Avoid wrapping mid-instruction.

The first gpu dump shows that we wrapped the ringbuffer mid-instruction, which is invalid according to the docs. I've posted this patch for review.

Revision history for this message
In , Gordon Jin (gordon-jin) wrote :

decreasing priority and not to block Q3 release, as lacking of response.

Revision history for this message
In , Eric Anholt (eric-anholt) wrote :

Closing this due to lack of response. If the problem continues with the components updated for the other hangs we've fixed, please reopen.

Changed in xserver-xorg-video-intel:
status: Confirmed → Fix Released
Changed in xserver-xorg-video-intel:
importance: Unknown → Critical
Changed in xserver-xorg-video-intel:
importance: Critical → Unknown
Changed in xserver-xorg-video-intel:
importance: Unknown → Critical
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.