[Feisty] Kernel timer losing clock ticks on amd64

Bug #102383 reported by edschofield
4
Affects Status Importance Assigned to Milestone
Linux
Fix Released
Undecided
Unassigned
linux-source-2.6.20 (Ubuntu)
Won't Fix
Undecided
Unassigned
linux-source-2.6.22 (Ubuntu)
Fix Released
Medium
Unassigned

Bug Description

Binary package hint: linux-image-2.6.20-13-generic

This may or may not be a duplicate of bug 26064, reported for Breezy (now closed).

We have four Core 2 duo machines with an Intel Q965 chipset that are all losing ticks at about three seconds per minute (three hours per day). This appears to be an upstream kernel bug with both 2.6.19 and 2.6.20, since this occurs for us with both Feisty and Fedora Core 6 (with either 2.6.19 or 2.6.20 kernel). This occurs both with and without the proprietary nvidia kernel module.

The full dmesg log is attached. The relevant lines seem to be:

[ 92.801333] Losing some ticks... checking if CPU frequency changed.
...
[ 2240.211414] warning: many lost ticks.
[ 2240.211416] Your time source seems to be instable or some driver is hogging i
nterupts
[ 2240.211425] rip _spin_unlock_irqrestore+0x8/0x10

As a stab in the dark, I've tried a few boot options, such as hpet=off, clocksource=acpi_pm, clock=jiffies, and acpi=off, but these seem to have no effect.

The problem seems to be with the upstream kernel sources, rather than the Ubuntu-specific patches, since it also occurs under Fedora Core 6 (with either 2.6.19 or 2.6.20). But the problem appears have been resolved in the mainline 2.6.21-rc5 kernel; the timer works normally on the same system with this kernel (self-compiled) but otherwise running Feisty.

edschofield (schofield)
description: updated
Revision history for this message
edschofield (schofield) wrote :

I should add that this is a huge problem for these machines (HP Compaq dc7700p desktops). The clock drift is far too severe for ntpd to correct for it. The only workaround I know is to turn off the ntp daemon and run ntpd -q every couple of minutes.

Revision history for this message
edschofield (schofield) wrote :

The same bug is present in the 2.6.20-14-generic kernel.

Revision history for this message
Shal (shal) wrote :

I have also the same problem on the same mothecard ( Intel DQ965F) with a Core 2 Duo (E6600).

The PC lost 1hour by day.
An external USB sound card can not run normally: every three second there is a blank in the sound.

I run Feisty with different kernel 2.6.19, 2.6.20 lowlantency and generic kernel => same problem.

Revision history for this message
John Stultz (jstultz) wrote :

I suspect this is an APIC/PIT bug causing the timer interrupt to trigger too slowly. 2.6.21 contains the x86_64 GENERIC_TIME work, so the timer interrupt issue won't manifest itself through gettimeofday, but will likely still be present (you might notice larger latencies on sleep, etc).

Could you try booting w/ noapic?

Revision history for this message
edschofield (schofield) wrote :

Booting with noapic doesn't seem to help; the clock is still losing three seconds every minute. I've attached the output of dmesg.

Shall I report this upstream?

Revision history for this message
edschofield (schofield) wrote :

Confirmed to affect kernel 2.6.20 on several different machines. Fixed by 2.6.21-rc5.

Changed in linux-source-2.6.20:
status: Unconfirmed → Confirmed
Revision history for this message
Shal (shal) wrote :

Yes, I confirm that bug is resolved with a 2.6.21 kernel.

Revision history for this message
Brian Murray (brian-murray) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. The issue that you reported is one that should be reproducable with the live environment of the Desktop CD of the development release - Gutsy Gibbon. It would help us greatly if you could test with it so we can work on getting it fixed in the actively developed kernel. You can find out more about the development release at http://www.ubuntu.com/testing/ . Thanks again and we appreciate your help.

Changed in linux-source-2.6.20:
assignee: nobody → brian-murray
status: Confirmed → Incomplete
Revision history for this message
edschofield (schofield) wrote : Re: [Bug 102383] Re: [Feisty] Kernel timer losing clock ticks on amd64

This bug is fixed by 2.6.21 and 2.6.22, presumably by the clockevents
patches. So Gutsy will be fine.

Revision history for this message
Brian Murray (brian-murray) wrote :

It would still be a good idea to test with it though.

Revision history for this message
edschofield (schofield) wrote :

> It would still be a good idea to test with it though.

Okay, if nobody beats me to it, I'll test it in about two weeks, when
I have access to one of those machines again. (Interestingly, it's
never been an issue on one G965 I have, but affects a whole lab-full
of HP Q965 machines. Presumably it only affects some BIOSes.)

-- Ed

Revision history for this message
edschofield (schofield) wrote :

Just to confirm: Gutsy's kernel is unaffected by this bug.

Changed in linux-source-2.6.22:
importance: Undecided → Medium
status: New → Fix Released
Changed in linux-source-2.6.20:
assignee: brian-murray → nobody
status: Incomplete → Won't Fix
Revision history for this message
Pietro Battiston (toobaz) wrote :

so this is fixed

Changed in linux:
status: New → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.