hangs, windows become unresponsive without noapic

Bug #98999 reported by Chris Hermansen
4
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Fix Released
Undecided
Unassigned
linux-source-2.6.20 (Ubuntu)
Won't Fix
Medium
Unassigned

Bug Description

Hello, good people.

I am using up-to-date Edgy on a Toshiba Satellite A70 with 1Gb of ram.

Periodically (from once per week, to several times in a day) the system will hang up.

What I mean by this is that windows stop responding to mouse input. The mouse will continue to move around for a minute or so, then it disappears. Ctrl-Alt-BkSp does not restart the X server, though it sometimes will at least partly kill it. Alt-F1 through Alt-F6 will switch to the Ascii login screens if I try them soon enough (within 15-30 seconds), and I can type in my user id, press enter.... but I'm just seeing my typing echoed back to me at that point.

The only way out is to power-cycle the machine.

There are no traces of anything odd in /var/log/messages. Below you can see today's "hiccup":

    Jan 30 07:07:24 dabuntu gconfd (clh-4796): Resolved address "xml:readwrite:/home/clh/.gconf" to a writable configuration source at position 0
    Jan 30 07:18:26 dabuntu kernel: [17180307.316000] tun: Universal TUN/TAP device driver, 1.6
    Jan 30 07:18:26 dabuntu kernel: [17180307.316000] tun: (C) 1999-2004 Max Krasnyansky <email address hidden>
    Jan 30 07:46:41 dabuntu -- MARK --
    Jan 30 08:06:41 dabuntu -- MARK --
    Jan 30 08:26:41 dabuntu -- MARK --
    Jan 30 08:46:42 dabuntu -- MARK --
    Jan 30 09:06:42 dabuntu -- MARK --
    Jan 30 09:26:42 dabuntu -- MARK --
    Jan 30 09:46:42 dabuntu -- MARK --
    Jan 30 10:06:42 dabuntu -- MARK --
    Jan 30 10:26:43 dabuntu -- MARK --
    Jan 30 10:46:43 dabuntu -- MARK --
    Jan 30 11:06:43 dabuntu -- MARK --
    Jan 30 11:26:43 dabuntu -- MARK --
    Jan 30 11:46:43 dabuntu -- MARK --
    Jan 30 12:06:43 dabuntu -- MARK --
    Jan 30 12:26:44 dabuntu -- MARK --
    Jan 30 12:46:44 dabuntu -- MARK --
    Jan 30 13:06:44 dabuntu -- MARK --
    Jan 30 13:26:44 dabuntu -- MARK --
    Jan 30 13:43:42 dabuntu syslogd 1.4.1#18ubuntu6: restart.
    Jan 30 13:43:42 dabuntu kernel: Inspecting /boot/System.map-2.6.17-10-generic
    Jan 30 13:43:42 dabuntu kernel: Loaded 22826 symbols from /boot/System.map-2.6.17-10-generic.

This problem has dogged me ever since upgrading from 6.06 to Edgy, a few days after Edgy was released.

I have tried some different settings in my xorg.conf to no avail, which I can post if it's helpful. There are some warnings in the Xorg.log file:

(II) Reloading /usr/lib/xorg/modules/libi2c.so
(II) RADEON(0): I2C bus "DDC" initialized.
(II) RADEON(0): Legacy BIOS detected
(II) RADEON(0): LVDS port is not in connector table, added in.
(WW) RADEON(0): Unknown DDCType 5 found
(WW) RADEON(0): LCD DDC Info Table found!
(II) RADEON(0): Connector0: DDCType-0, DACType-1, TMDSType--1, ConnectorType-1
(II) RADEON(0): Connector1: DDCType-3, DACType-0, TMDSType--1, ConnectorType-2
(II) RADEON(0): I2C device "DDC:ddc2" registered at address 0xA0.
(II) RADEON(0): I2C device "DDC:ddc2" removed.
(II) RADEON(0): I2C device "DDC:ddc2" registered at address 0xA0.
(II) RADEON(0): I2C device "DDC:ddc2" removed.
(II) RADEON(0): I2C device "DDC:ddc2" registered at address 0xA0.
(II) RADEON(0): I2C device "DDC:ddc2" removed.
(II) RADEON(0): DDC Type: 3, Detected Type: 0

and

(II) RADEON(0): PLL parameters: rf=1432 rd=6 min=20000 max=35000; xclk=16662
(WW) RADEON(0): Failed to detect secondary monitor, MergedFB/Clone mode disabled
(==) RADEON(0): Using gamma correction (1.0, 1.0, 1.0)

and

(II) RADEON(0): [drm] Added 32 65536 byte vertex/indirect buffers
(II) RADEON(0): [drm] Mapped 32 vertex/indirect buffers
(II) RADEON(0): [drm] dma control initialized, using IRQ 177
(II) RADEON(0): [drm] Initialized kernel GART heap manager, 5111808
(WW) RADEON(0): DRI init changed memory map, adjusting ...
(WW) RADEON(0): MC_FB_LOCATION was: 0x3fff3800 is: 0x3fff3800
(WW) RADEON(0): MC_AGP_LOCATION was: 0xffffffc0 is: 0xd27fd200
(**) RADEON(0): GRPH_BUFFER_CNTL from 20037c7c to 20085c5c
(II) RADEON(0): Direct rendering enabled
(**) RADEON(0): Setting up final surfaces
(**) RADEON(0): Initializing Acceleration
(II) RADEON(0): Render acceleration enabled
(**) RADEON(0): EngineInit (32/32)
(**) RADEON(0): Pitch for acceleration = 160
(**) RADEON(0): EngineRestore (32/32)
(II) RADEON(0): Using XFree86 Acceleration Architecture (XAA)

drmOpenDevice: node name is /dev/dri/card0
drmOpenDevice: open result is 8, (OK)
drmOpenByBusid: Searching for BusID pci:0000:01:05.0
drmOpenDevice: node name is /dev/dri/card0
drmOpenDevice: open result is 8, (OK)
drmOpenByBusid: drmOpenMinor returns 8
drmOpenByBusid: drmGetBusid reports pci:0000:01:05.0
(WW) AIGLX: 3D driver claims to not support visual 0x23
(WW) AIGLX: 3D driver claims to not support visual 0x24
(WW) AIGLX: 3D driver claims to not support visual 0x25
(WW) AIGLX: 3D driver claims to not support visual 0x26
(WW) AIGLX: 3D driver claims to not support visual 0x27
(WW) AIGLX: 3D driver claims to not support visual 0x28
(WW) AIGLX: 3D driver claims to not support visual 0x29
(WW) AIGLX: 3D driver claims to not support visual 0x2a
(WW) AIGLX: 3D driver claims to not support visual 0x2b
(WW) AIGLX: 3D driver claims to not support visual 0x2c
(WW) AIGLX: 3D driver claims to not support visual 0x2d
(WW) AIGLX: 3D driver claims to not support visual 0x2e
(WW) AIGLX: 3D driver claims to not support visual 0x2f
(WW) AIGLX: 3D driver claims to not support visual 0x30
(WW) AIGLX: 3D driver claims to not support visual 0x31
(WW) AIGLX: 3D driver claims to not support visual 0x32
(II) AIGLX: Loaded and initialized /usr/lib/dri/r200_dri.so
(II) GLX: Initialized DRI GL provider for screen 0

plus the warnings that seem to be standard about missing font files (up at the front of the log file) and errors about being unable to open the WACOM, which I have made go away before but also to no avail.

I should point out that my chipset, the ATI Technologies Inc RS300M AGP [Radeon Mobility 9100IGP], is not fully supported by the ATI proprietary driver (doesn't support DRI, for instance), and in any case I've had the same kind of hangs while finding that out.

Not running Beryl or Compiz or anything like that.

I seem to be most often using Firefox when the system crashes, but I think that's a mere statistic.

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

Originally, I seem to have filed this as a support request. Not sure how that happened. Anyway, my apologies for the duplication

Revision history for this message
Brian Murray (brian-murray) wrote :

Thanks for taking the time to report this bug and helping to make Ubuntu better. Could you please add your '/etc/X11/xorg.conf' file and your '/var/log/Xorg.0.log' file as attachments to your bug report? Thanks in advance.

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

Thanks for the response, Brian. Here is /var/log/Xorg.0.log.

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

and here is /etc/X11/xorg.conf

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

I probably should mention that there is no system hang covered by the period of this Xorg.0.log

Revision history for this message
quacklin (quacklin) wrote :

and this is mine as requested

Revision history for this message
quacklin (quacklin) wrote :

and this is the log, just after a reboot due to a sudden block (probably skype fault)

Revision history for this message
Chris Hermansen (c-hermansen) wrote :
Download full text (3.2 KiB)

Hi Brian, quacklin;

Is quacklin having a similar problem to mine? I can't tell from the above...

Anyway, some more information.

Over the weekend, I tried Feisty beta. I had similar "hang up" problems, but much more frequently... I could not get more than one hour of work in before "the system would hang". This was true with both Kubuntu and Ubuntu. Note I was using a fresh install from CD on a brand-new hard drive of a slightly different type than I use for Edgy, in both cases.

So, back to Edgy.

Next, on Saturday (two days ago) I had a hang-up happen when I was using Gmail inside Firefox with Edgy; I was still able to type in the composition window for quite some time after I stopped being able to actually move windows around, close them, etc. Eventually, I would say after at least one minute, everything froze up.

Another thing - when windows become unresponsive in this "hang up" thing, I can usually (maybe always) CTRL-ALT-BACKSPACE, at which time all windows and menu bars disappear but the screen background image usually (maybe always) stays in place, with a frozen mouse pointer. Sometimes, at that point, I can get a partially operational console window, but as soon as I try to do something like "ps" it loses its communication with anything that matters (though I still get characters echoed back to the screen).

On the theory that there's something funny with my Xorg setup, I edited the /etc/X11/xorg.conf file to get rid of the incorrect font file paths and the references to the Wacom tablet. This gave me less warnings and errors in my /var/log/Xorg.0.log file. The remaining ones that seem to perhaps be of concern are:

(WW) RADEON(0): Unknown DDCType 5 found
(WW) RADEON(0): LCD DDC Info Table found!

(WW) RADEON(0): DRI init changed memory map, adjusting ...
(WW) RADEON(0): MC_FB_LOCATION was: 0x3fff3800 is: 0x3fff3800
(WW) RADEON(0): MC_AGP_LOCATION was: 0xffffffc0 is: 0xd27fd200

(WW) AIGLX: 3D driver claims to not support visual 0x23
(WW) AIGLX: 3D driver claims to not support visual 0x24
(WW) AIGLX: 3D driver claims to not support visual 0x25
(WW) AIGLX: 3D driver claims to not support visual 0x26
(WW) AIGLX: 3D driver claims to not support visual 0x27
(WW) AIGLX: 3D driver claims to not support visual 0x28
(WW) AIGLX: 3D driver claims to not support visual 0x29
(WW) AIGLX: 3D driver claims to not support visual 0x2a
(WW) AIGLX: 3D driver claims to not support visual 0x2b
(WW) AIGLX: 3D driver claims to not support visual 0x2c
(WW) AIGLX: 3D driver claims to not support visual 0x2d
(WW) AIGLX: 3D driver claims to not support visual 0x2e
(WW) AIGLX: 3D driver claims to not support visual 0x2f
(WW) AIGLX: 3D driver claims to not support visual 0x30
(WW) AIGLX: 3D driver claims to not support visual 0x31
(WW) AIGLX: 3D driver claims to not support visual 0x32

error opening security policy file /usr/lib/xserver/SecurityPolicy

Could not init font path element /usr/share/fonts/X11/TTF/, removing from list!
Could not init font path element /usr/share/fonts/X11/OTF, removing from list!
Could not init font path element /usr/share/fonts/X11/CID/, removing from list!

ProcXCloseDevice to close or not ?

I have attached the Xo...

Read more...

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

Here's the modified xorg.conf

Note that I have had one "hang-up" already today.

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

One "last" thing. When my machine hung up this morning, I did a CTRL-ALT-BACKSPACE (which made everything but the window background vanish) and then tried to get a console login with ALT-Fx. I was able to get a login prompt but not a password prompt. Finally I power-cycled the machine.

Once again, there are no error messages in /var/log/syslog covering that period of time:

Apr 16 08:56:16 dabuntu -- MARK --
Apr 16 09:16:17 dabuntu -- MARK --
Apr 16 09:17:01 dabuntu /USR/SBIN/CRON[6958]: (root) CMD ( run-parts --report /etc/cron.hourly)
Apr 16 09:36:17 dabuntu -- MARK --
Apr 16 09:56:17 dabuntu -- MARK --
Apr 16 10:02:01 dabuntu /USR/SBIN/CRON[8056]: (root) CMD (if [ -x /usr/sbin/pg_maintenance ]; then /usr/sbin/pg_maintenance --analyze >/dev/null; fi)
Apr 16 10:16:17 dabuntu -- MARK --
Apr 16 10:17:01 dabuntu /USR/SBIN/CRON[8427]: (root) CMD ( run-parts --report /etc/cron.hourly)
Apr 16 10:36:17 dabuntu -- MARK --
Apr 16 10:56:17 dabuntu -- MARK --
Apr 16 11:07:19 dabuntu syslogd 1.4.1#18ubuntu6: restart.

In conclusion, I'm assuming this is a problem caused by the video subsystem, but I'm not sure. And I really don't know what to do to diagnose it any further.

Revision history for this message
Timo Aaltonen (tjaalton) wrote :

Please run memtest to see if memory is broken. Let it run for a couple of cycles to be sure

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

I have run memtest more than twice, for more than four hours the last time. However, I will start it up tonight and let it run until the morning, in case.

On the subject of memory, is it possible there is some kind of conflict between what the graphics card thinks is its memory and what the rest of the system wants? Is there any way to check this kind of thing?

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

I ran memtest last night, from about 11PM until about 7AM today. It went through 17 passes and detected no errors.

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

I have tried Feisty and I seem to have the same problem there, too, only much worse; sometimes I can't even finish running the install app.

I did manage to get Feisty running for awhile. Here are some things that seemed odd, relative to this problem:

- once again, my screen saver would not "engage". though I finally got it to start working
  by setting it to lock the screen as well;

- at one point, I had the system activity monitor running, when I clicked on the Firefox icon.
  The screen froze before firefox could start, and the top process in the system activity
  monitor was Firefox in an "uninterruptible" state.

- at another point, the windows had frozen. I played around with the ctrl-alt-f1, f2, etc.
  I was able to get a login window, enter my name and password, but at that point I did
  not get a shell prompt, though returns were still echoed back to the screen. In some
  desperation, I tried my toshiba's Fn keys for suspend, resume, and so forth. When I
  pressed the "switch screen" key, I suddenly received a shell prompt, and was able to
  do a "ps" and a couple of other things - but then things froze again.

So, to sum up. Right now, I'm running Edgy, and I can pretty much count on one screen hangup per day. I cannot upgrade to Feisty because it will not run without hanging up for more than an hour or so.

This is a really serious problem. I have no idea what to do about it. I do not wish to stop using Ubuntu.

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

Hmm, I may have solved the problem.

As I mentioned above, I tried using Feisty and the problem seemed worse.

However, there was a message early in the boot phase that made me wonder...

MP-BIOS bug: 8254 timer not connected to IO-APIC

A bit of internet sleuthing led me to add "noapic" to the grub boot defaults. And, things seem to work just fine now - no hangups after a fairly extensive wringing-out yesterday and this morning. Right now I even have desktop effects enabled just to really try hard to break things!

Interesting that some of the Internet discussion of the 8254 timer problem relate back to Dapper. I don't recall ever seeing the message in Dapper, but that was some time ago now. The message certainly never showed up in Edgy, though it may have been concealed by the splash.

Anyway, I think I will close this bug report, and soldier optimistically on.

Revision history for this message
Tormod Volden (tormodvolden) wrote :

Can you please attach /var/log/dmesg ?

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

Hi Tormod;

The /var/log/dmesg is attached below. This is the Feisty - working I hope - /var/log/dmesg. I can fetch the Edgy one as well if you like, from two days ago.

Revision history for this message
Tormod Volden (tormodvolden) wrote :

Thanks, can you attach the output from "sudo dmidecode" as well, in order to precisely identify your hardware?

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

Hi Tormod;

Here is the output of sudo dmidecode. If there are any tests I can run here please let me know, it's no trouble!

Changed in linux-source-2.6.20:
status: Needs Info → Confirmed
Changed in linux-source-2.6.20:
assignee: brian-murray → ubuntu-kernel-team
importance: Undecided → Medium
Revision history for this message
jtholmes (jtholmes) wrote :

I have the exact same problem with the Toshiba A75-226 I will try the noapic
because I too cannot use Feisty reliably. However, my lockups are so random
as to be rediculious. However, It does sound like my hard drive is going offline
and not coming back.

Revision history for this message
jtholmes (jtholmes) wrote :

additionally i too have the 8254 timer message

Revision history for this message
jtholmes (jtholmes) wrote :

added noapic to boot command line and have been up for two hours
the longest I have been up on this laptop.

Note that I never had the apic problem with edgy for whatever reason.
And I havent had any lockups with Nvidia cards on Desktops. Only this
Toshiba A75-226 Laptop with the ATI 9100 card has had the Lockup problem. I will post to
this bug tomorrow after using Feisty all day to announce if it is still
staying up and not locking up.

I hope that was the problem as I really want to move forward to Feisty.

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

Just to confirm, but I have had no hangs in the month or so since I added noapic to my boot options.

Revision history for this message
Adolfo R. Brandes (arbrandes) wrote :

I have Feisty on a Toshiba A75-S206 with the same symptoms: "MP-BIOS bug: 8254 timer not connected to IO-APIC" at boot, hangs constantly from 1 to 10 minutes after logging in X. I can confirm that adding "noapic" to boot options fixes the problem.

Revision history for this message
Tormod Volden (tormodvolden) wrote :

Have any of you tried the 2.6.22 kernel from Gutsy? (Without the noapic option of course)

Revision history for this message
Launchpad Janitor (janitor) wrote : This bug is now reported against the 'linux' package

Beginning with the Hardy Heron 8.04 development cycle, all open Ubuntu kernel bugs need to be reported against the "linux" kernel package. We are automatically migrating this bug to the new "linux" package. However, development has already began for the upcoming Intrepid Ibex 8.10 release. It would be helpful if you could test the upcoming release and verify if this is still an issue - http://www.ubuntu.com/testing . If the issue still exists, please update this report by changing the Status of the "linux" task from "Incomplete" to "New". We appreciate your patience and understanding as we make this transition. Thanks!

Revision history for this message
Chris Hermansen (c-hermansen) wrote :

With Gutsy and now Hardy kernels, I no longer need to use the noapic option, as per Tormod's question above. However I still get the same MP-BIOS error when booting. I conclude from this that there is some kind of successful workaround in the kernel.

I think this bug should be marked "solved".

Revision history for this message
Tormod Volden (tormodvolden) wrote :

Thanks. If the message is just the kernel noticing there is a bug in the BIOS, that's ok.

Changed in linux:
status: Incomplete → Fix Released
Revision history for this message
Chris Hermansen (c-hermansen) wrote :

Tormod, just to confirm, I have experienced no system hangs of the type described above in either Gutsy or Hardy, and I no longer need to use the noapic option. It seems that the behaviour detected by the kernel is no longer harmful.

Revision history for this message
Launchpad Janitor (janitor) wrote : Kernel team bugs

Per a decision made by the Ubuntu Kernel Team, bugs will longer be assigned to the ubuntu-kernel-team in Launchpad as part of the bug triage process. The ubuntu-kernel-team is being unassigned from this bug report. Refer to https://wiki.ubuntu.com/KernelTeamBugPolicies for more information. Thanks.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Related questions

Remote bug watches

Bug watches keep track of this bug in other bug trackers.