Frequent random KVM host kernel OOPS

Bug #361819 reported by Michael Robinson
26
This bug affects 1 person
Affects Status Importance Assigned to Milestone
kvm (Ubuntu)
Invalid
High
Unassigned
linux (Ubuntu)
Invalid
Medium
John Johansen

Bug Description

Binary package hint: kvm

Under jaunty:
kvm 1:84+dfsg-0ubuntu10
linux-image-2.6.28-11-generic 2.6.28-11.41

kern.log:
Apr 15 22:53:52 aethereal kernel: [ 9542.651947] kvm: 16340: cpu0 unhandled wrms
r: 0xc0010117 data 0
Apr 15 23:06:31 aethereal kernel: [10301.986575] kvm: 17236: cpu0 unhandled wrms
r: 0xc0010117 data 0
Apr 15 23:34:07 aethereal kernel: [11957.332259] BUG: unable to handle kernel pa
ging request at ffff8801a031bcbc
Apr 15 23:34:07 aethereal kernel: [11957.332272] IP: [<ffffffff8041c2ff>] rb_nex
t+0x4f/0x60
Apr 15 23:34:07 aethereal kernel: [11957.332288] PGD 202063 PUD 0
Apr 15 23:34:07 aethereal kernel: [11957.332295] Oops: 0000 [#1] SMP
Apr 15 23:34:07 aethereal kernel: [11957.332302] last sysfs file: /sys/devices/p
ci0000:00/0000:00:1c.1/0000:0c:00.0/rfkill/rfkill0/state
Apr 15 23:34:07 aethereal kernel: [11957.332309] Dumping ftrace buffer:
Apr 15 23:34:07 aethereal kernel: [11957.332315] (ftrace buffer empty)
Apr 15 23:34:07 aethereal kernel: [11957.332318] CPU 1
Apr 15 23:34:07 aethereal kernel: [11957.332323] Modules linked in: binfmt_misc
i915 drm ppdev bridge stp bnep input_polldev kvm_intel kvm snd_hwdep sbp2 lp par
port snd_hda_intel snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss s
nd_seq_midi pata_pcmcia arc4 snd_rawmidi snd_seq_midi_event ecb snd_seq snd_time
r snd_seq_device iwlagn iwlcore pcmcia snd led_class yenta_socket mac80211 sound
core psmouse rsrc_nonstatic pcmcia_core snd_page_alloc iTCO_wdt iTCO_vendor_supp
ort serio_raw pcspkr dcdbas cfg80211 btusb joydev sha256_generic aes_x86_64 aes_
generic cbc dm_crypt fbcon tileblit font bitblit softcursor squashfs unionfs nls
_iso8859_1 nls_cp437 vfat fat usbhid usb_storage ohci1394 ieee1394 tg3 video out
put intel_agp
Apr 15 23:34:07 aethereal kernel: [11957.332444] Pid: 6844, comm: kvm Not tainte
d 2.6.28-11-generic #41-Ubuntu
Apr 15 23:34:07 aethereal kernel: [11957.332449] RIP: 0010:[<ffffffff8041c2ff>]
 [<ffffffff8041c2ff>] rb_next+0x4f/0x60
Apr 15 23:34:07 aethereal kernel: [11957.332459] RSP: 0018:ffff8800b20b19b8 EFL
AGS: 00010286
Apr 15 23:34:07 aethereal kernel: [11957.332464] RAX: ffff8801a031bcb4 RBX: 07ae
3a8c63920000 RCX: ffff88011fdc0098
Apr 15 23:34:07 aethereal kernel: [11957.332469] RDX: 0000000000000000 RSI: ffff
8801a031bcb6 RDI: ffff8801a031bcb4
Apr 15 23:34:07 aethereal kernel: [11957.332474] RBP: ffff8800b20b19b8 R08: 0000
0000ffffffff R09: 00000000000000d0
Apr 15 23:34:07 aethereal kernel: [11957.332479] R10: ffff880042826000 R11: 0000000000000000 R12: ffffc20000000000
Apr 15 23:34:07 aethereal kernel: [11957.332484] R13: ffffffffffffe000 R14: ffffe1ffffffffff R15: 0000000000002000
Apr 15 23:34:07 aethereal kernel: [11957.332490] FS: 00007f822ec01950(0000) GS:ffff88011f803a80(0000) knlGS:0000000000000000
Apr 15 23:34:07 aethereal kernel: [11957.332495] CS: 0010 DS: 002b ES: 002b CR0: 000000008005003b
Apr 15 23:34:07 aethereal kernel: [11957.332500] CR2: ffff8801a031bcbc CR3: 000000003fd8e000 CR4: 00000000000026a0
Apr 15 23:34:07 aethereal kernel: [11957.332505] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Apr 15 23:34:07 aethereal kernel: [11957.332510] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Apr 15 23:34:07 aethereal kernel: [11957.332515] Process kvm (pid: 6844, threadinfo ffff8800b20b0000, task ffff8800b1c85980)
Apr 15 23:34:07 aethereal kernel: [11957.332520] Stack:
Apr 15 23:34:07 aethereal kernel: [11957.332523] ffff8800b20b1a48 ffffffff802d0d2d ffff8800b20b1a9c 0000000000001fff
Apr 15 23:34:07 aethereal kernel: [11957.332532] 0000000000002000 ffff8800155dc600 0000000034483040 ffff8800155dc600
Apr 15 23:34:07 aethereal kernel: [11957.332542] ffffc20000001fff ffffc20000002000 0000000000001fff 0000000000000286
Apr 15 23:34:07 aethereal kernel: [11957.332553] Call Trace:
Apr 15 23:34:07 aethereal kernel: [11957.332557] [<ffffffff802d0d2d>] alloc_vmap_area+0x13d/0x2c0
Apr 15 23:34:07 aethereal kernel: [11957.332567] [<ffffffff802d13d9>] __get_vm_area_node+0xc9/0x1c0
Apr 15 23:34:07 aethereal kernel: [11957.332575] [<ffffffff802d1541>] get_vm_area_caller+0x31/0x40
Apr 15 23:34:07 aethereal kernel: [11957.332583] [<ffffffffa03dc1cf>] ? pio_copy_data+0x3f/0x130 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332612] [<ffffffff802d15f9>] vmap+0x49/0x80
Apr 15 23:34:07 aethereal kernel: [11957.332620] [<ffffffffa03dc1cf>] pio_copy_data+0x3f/0x130 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332644] [<ffffffffa03e1728>] kvm_emulate_pio_string+0x2e8/0x440 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332670] [<ffffffffa03ea1cf>] x86_emulate_insn+0x132f/0x32e0 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332697] [<ffffffffa03e7474>] ? seg_override_base+0x24/0x50 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332722] [<ffffffffa03e8a8d>] ? x86_decode_insn+0x55d/0x970 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332746] [<ffffffffa03e023f>] emulate_instruction+0x15f/0x2f0 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332770] [<ffffffffa03ec35d>] ? pic_irq_request+0x2d/0x80 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332794] [<ffffffffa0405fd7>] handle_io+0x27/0x60 [kvm_intel]
Apr 15 23:34:07 aethereal kernel: [11957.332808] [<ffffffffa0407ca5>] kvm_handle_exit+0xb5/0x1d0 [kvm_intel]
Apr 15 23:34:07 aethereal kernel: [11957.332820] [<ffffffffa03db978>] vcpu_enter_guest+0x1f8/0x400 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332845] [<ffffffffa03ddc49>] __vcpu_run+0x69/0x2d0 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332869] [<ffffffffa03e190a>] kvm_arch_vcpu_ioctl_run+0x8a/0x1f0 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332892] [<ffffffffa03d9295>] ? kvm_vm_ioctl+0xd5/0x230 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332917] [<ffffffffa03d6582>] kvm_vcpu_ioctl+0x2e2/0x5a0 [kvm]
Apr 15 23:34:07 aethereal kernel: [11957.332940] [<ffffffff8069e559>] ? _spin_lock+0x9/0x10
Apr 15 23:34:07 aethereal kernel: [11957.332949] [<ffffffff80277718>] ? futex_wake+0xf8/0x130
Apr 15 23:34:07 aethereal kernel: [11957.332958] [<ffffffff802f62d1>] vfs_ioctl+0x31/0xa0
Apr 15 23:34:07 aethereal kernel: [11957.332967] [<ffffffff802f6685>] do_vfs_ioctl+0x75/0x230
Apr 15 23:34:07 aethereal kernel: [11957.332975] [<ffffffff802f68d9>] sys_ioctl+0x99/0xa0
Apr 15 23:34:07 aethereal kernel: [11957.332982] [<ffffffff8021253a>] system_call_fastpath+0x16/0x1b
Apr 15 23:34:07 aethereal kernel: [11957.332991] Code: 1f 44 00 00 48 89 c2 48 8b 42 10 48 85 c0 75 f4 48 89 d0 c9 c3 0f 1f 80 00 00 00 00 48 8b 37 48 89 f9 4889 f7 48 83 e7 fc 74 e5 <48> 39 4f 08 74 eb 48 89 fa eb da 66 0f 1f 44 00 00 5548 8b 37
Apr 15 23:34:07 aethereal kernel: [11957.333011] RIP [<ffffffff8041c2ff>] rb_next+0x4f/0x60
Apr 15 23:34:07 aethereal kernel: [11957.333011] RSP <ffff8800b20b19b8>
Apr 15 23:34:07 aethereal kernel: [11957.333011] CR2: ffff8801a031bcbc
Apr 15 23:34:07 aethereal kernel: [11957.333011] ---[ end trace 65dab77134cc8f1d ]---
Apr 15 23:34:25 aethereal kernel: [11975.271149] iwlagn: Microcode SW error detected. Restarting 0x2000000.

Revision history for this message
Dustin Kirkland  (kirkland) wrote :

Hello, thanks for the report.

Can you please install the kvm-source package, which will build the latest kvm module from source for your kernel, and install it? Please then reboot, and let me know if you can reproduce the error.

Thanks,
:-Dustin

Changed in kvm (Ubuntu):
importance: Undecided → High
Revision history for this message
Michael Robinson (robinson-netrinsics) wrote :
Download full text (5.7 KiB)

Hi, the frequency of this problem has been reduced significantly since I installed kvm-source, however, it has not been completely eliminated.

kvm 1:84+dfsg-0ubuntu11
kvm-source 1:84+dfsg-0ubuntu11
linux-image-2.6.28-11-generic 2.6.28-11.42

Apr 23 22:20:57 aethereal kernel: [19380.467724] wlan0: associated
Apr 23 22:26:40 aethereal kernel: [19723.700129] ------------[ cut here ]------------
Apr 23 22:26:40 aethereal kernel: [19723.700135] kernel BUG at /var/lib/dkms/kvm/84/build/x86/mmu.c:684!
Apr 23 22:26:40 aethereal kernel: [19723.700137] invalid opcode: 0000 [#1] SMP
Apr 23 22:26:40 aethereal kernel: [19723.700140] last sysfs file: /sys/devices/pci0000:00/0000:00:1c.1/0000:0c:00.0/rfkill/rfkill0/state
Apr 23 22:26:40 aethereal kernel: [19723.700143] Dumping ftrace buffer:
Apr 23 22:26:40 aethereal kernel: [19723.700146] (ftrace buffer empty)
Apr 23 22:26:40 aethereal kernel: [19723.700147] CPU 0
Apr 23 22:26:40 aethereal kernel: [19723.700149] Modules linked in: binfmt_misci915 drm ppdev bridge stp bnep input_polldev kvm_intel kvm snd_hwdep sbp2 lp parport snd_hda_intel snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy arc4 snd_seq_oss ecb snd_seq_midi pata_pcmcia snd_rawmidi snd_seq_midi_event iwlagn iwlcore snd_seq snd_timer snd_seq_device led_class snd pcmcia mac80211 soundcore yenta_socket rsrc_nonstatic pcmcia_core dcdbas iTCO_wdt iTCO_vendor_support psmouse btusb snd_page_alloc cfg80211 pcspkr serio_raw joydev sha256_generic aes_x86_64 aes_generic cbc dm_crypt fbcon tileblit font bitblit softcursor squashfs unionfs nls_iso8859_1 nls_cp437 vfat fat usbhid usb_storage ohci1394 ieee1394 tg3 video output intel_agp
Apr 23 22:26:40 aethereal kernel: [19723.700189] Pid: 10015, comm: kvm Tainted:G W 2.6.28-11-generic #42-Ubuntu
Apr 23 22:26:40 aethereal kernel: [19723.700191] RIP: 0010:[<ffffffffa03e7cd5>] [<ffffffffa03e7cd5>] rmap_write_protect+0x325/0x340 [kvm]
Apr 23 22:26:40 aethereal kernel: [19723.700207] RSP: 0018:ffff8800b5023a38 EFLAGS: 00010246
Apr 23 22:26:40 aethereal kernel: [19723.700209] RAX: ffffc20014d971f0 RBX: 000000000001493e RCX: ffff88011dc3f030
Apr 23 22:26:40 aethereal kernel: [19723.700210] RDX: 00000000edbfa63e RSI: 09bf570aedbfa63e RDI: ffff88005992dc30
Apr 23 22:26:40 aethereal kernel: [19723.700212] RBP: ffff8800b5023a58 R08: ffff880059c2f0a0 R09: ffff88005b3f8001
Apr 23 22:26:40 aethereal kernel: [19723.700213] R10: 0000000000000002 R11: 0000000000000000 R12: 0000000000000001
Apr 23 22:26:40 aethereal kernel: [19723.700215] R13: ffff88005dd48840 R14: ffff8800b5028000 R15: ffff88005d0b8000
Apr 23 22:26:40 aethereal kernel: [19723.700217] FS: 00007f28bcb21950(0000) GS:ffffffff80aa3000(0000) knlGS:0000000000000000
Apr 23 22:26:40 aethereal kernel: [19723.700219] CS: 0010 DS: 002b ES: 002b CR0: 0000000080050033
Apr 23 22:26:40 aethereal kernel: [19723.700220] CR2: 0000000007c6d38c CR3: 000000005dcfe000 CR4: 00000000000026a0
Apr 23 22:26:40 aethereal kernel: [19723.700222] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Apr 23 22:26:40 aethereal kernel: [19723.700223] DR3: 0000000000000000 DR6: 000000...

Read more...

Revision history for this message
Michael Robinson (robinson-netrinsics) wrote :
Download full text (6.2 KiB)

"unable to handle kernel NULL pointer dereference"

That doesn't sound very healthy.

May 3 08:02:41 aethereal kernel: [237003.338883] BUG: unable to handle kernel NULL pointer dereference at 0000000000000000
May 3 08:02:41 aethereal kernel: [237003.338891] IP: [<ffffffffa03e2872>] gfn_to_rmap+0x22/0x70 [kvm]
May 3 08:02:41 aethereal kernel: [237003.338910] PGD 38a4d067 PUD a19b067 PMD 0

May 3 08:02:41 aethereal kernel: [237003.338914] Oops: 0000 [#1] SMP
May 3 08:02:41 aethereal kernel: [237003.338917] last sysfs file: /sys/devices/
pci0000:00/0000:00:1c.1/0000:0c:00.0/rfkill/rfkill0/state
May 3 08:02:41 aethereal kernel: [237003.338921] Dumping ftrace buffer:
May 3 08:02:41 aethereal kernel: [237003.338923] (ftrace buffer empty)
May 3 08:02:41 aethereal kernel: [237003.338925] CPU 1
May 3 08:02:41 aethereal kernel: [237003.338927] Modules linked in: ppp_async c
rc_ccitt binfmt_misc i915 drm ppdev bridge stp bnep input_polldev kvm_intel kvm
snd_hwdep sbp2 lp parport snd_hda_intel snd_pcm_oss snd_mixer_oss snd_pcm snd_se
q_dummy arc4 snd_seq_oss ecb snd_seq_midi pata_pcmcia snd_rawmidi iwlagn snd_seq
_midi_event iwlcore snd_seq snd_timer snd_seq_device led_class mac80211 snd pcmc
ia soundcore dcdbas psmouse yenta_socket rsrc_nonstatic pcmcia_core iTCO_wdt iTC
O_vendor_support pcspkr snd_page_alloc cfg80211 serio_raw btusb joydev sha256_ge
neric aes_x86_64 aes_generic cbc dm_crypt fbcon tileblit font bitblit softcursor
 squashfs unionfs nls_iso8859_1 nls_cp437 vfat fat usbhid usb_storage ohci1394 i
eee1394 tg3 video output intel_agp
May 3 08:02:41 aethereal kernel: [237003.338974] Pid: 5624, comm: kvm Tainted:
G W 2.6.28-11-generic #42-Ubuntu
May 3 08:02:41 aethereal kernel: [237003.338976] RIP: 0010:[<ffffffffa03e2872>]
  [<ffffffffa03e2872>] gfn_to_rmap+0x22/0x70 [kvm]
May 3 08:02:41 aethereal kernel: [237003.338986] RSP: 0018:ffff8800ab5e79f8 EF
LAGS: 00010202
May 3 08:02:41 aethereal kernel: [237003.338988] RAX: 0000000000000000 RBX: 000
0000000000080 RCX: 0000000000000000
May 3 08:02:41 aethereal kernel: [237003.338990] RDX: 00000000000fee01 RSI: 000
0000000000022 RDI: fffffffffffff001
May 3 08:02:41 aethereal kernel: [237003.338991] RBP: ffff8800ab5e7a08 R08: 000
0000000000022 R09: 0000000000000000
May 3 08:02:41 aethereal kernel: [237003.338993] R10: ffff8800ab5e7ab8 R11: 0000000000000000 R12: fffffffffffff001
May 3 08:02:41 aethereal kernel: [237003.338995] R13: ffff8800ae47e160 R14: ffff88003f854000 R15: ffff8800ab5e7a88
May 3 08:02:41 aethereal kernel: [237003.338997] FS: 00007ffee209e950(0000) GS:ffff88011f803a80(0000) knlGS:0000000000000000
May 3 08:02:41 aethereal kernel: [237003.338999] CS: 0010 DS: 002b ES: 002b CR0: 0000000080050033
May 3 08:02:41 aethereal kernel: [237003.339001] CR2: 0000000000000000 CR3: 000000000cd94000 CR4: 00000000000026a0
May 3 08:02:41 aethereal kernel: [237003.339003] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
May 3 08:02:41 aethereal kernel: [237003.339005] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
May 3 08:02:41 aethereal kernel: [237003.339007] Process kvm (pid: 5624, threadinfo ffff8800ab5e6000, task ffff880013e08...

Read more...

Revision history for this message
Dustin Kirkland  (kirkland) wrote :

Thanks for the information, Michael. I'm adding the Ubuntu linux package, as this looks to be a kernel bug.

:-Dustin

Changed in kvm (Ubuntu):
status: New → Confirmed
Changed in linux (Ubuntu):
importance: Undecided → Medium
Revision history for this message
Dustin Kirkland  (kirkland) wrote :

Subscribing John Johansen, hoping you might have some insight :-)

:-Dustin

Changed in linux (Ubuntu):
assignee: nobody → John Johansen (jjohansen)
Revision history for this message
Michael Robinson (robinson-netrinsics) wrote :
Download full text (6.0 KiB)

Again.

May 7 01:20:10 aethereal kernel: [147379.785707] BUG: unable to handle kernel N
ULL pointer dereference at 0000000000000000
May 7 01:20:10 aethereal kernel: [147379.785713] IP: [<ffffffffa03e28a0>] gfn_t
o_rmap+0x50/0x70 [kvm]
May 7 01:20:10 aethereal kernel: [147379.785733] PGD cf107067 PUD c7165067 PMD
0
May 7 01:20:10 aethereal kernel: [147379.785736] Oops: 0000 [#1] SMP
May 7 01:20:10 aethereal kernel: [147379.785739] last sysfs file: /sys/devices/
pci0000:00/0000:00:1c.1/0000:0c:00.0/rfkill/rfkill0/state
May 7 01:20:10 aethereal kernel: [147379.785741] Dumping ftrace buffer:
May 7 01:20:10 aethereal kernel: [147379.785744] (ftrace buffer empty)
May 7 01:20:10 aethereal kernel: [147379.785745] CPU 0
May 7 01:20:10 aethereal kernel: [147379.785747] Modules linked in: binfmt_misc
 i915 drm ppdev bridge stp bnep input_polldev kvm_intel kvm snd_hwdep sbp2 lp pa
rport snd_hda_intel snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss
arc4 ecb snd_seq_midi pata_pcmcia snd_rawmidi snd_seq_midi_event iwlagn iwlcore
snd_seq snd_timer snd_seq_device led_class pcmcia snd mac80211 soundcore yenta_s
ocket rsrc_nonstatic pcmcia_core dcdbas psmouse iTCO_wdt iTCO_vendor_support pcs
pkr btusb snd_page_alloc cfg80211 serio_raw joydev sha256_generic aes_x86_64 aes
_generic cbc dm_crypt fbcon tileblit font bitblit softcursor squashfs unionfs nl
s_iso8859_1 nls_cp437 vfat fat usbhid usb_storage ohci1394 ieee1394 tg3 video ou
tput intel_agp
May 7 01:20:10 aethereal kernel: [147379.785786] Pid: 2053, comm: kvm Not taint
ed 2.6.28-11-generic #42-Ubuntu
May 7 01:20:10 aethereal kernel: [147379.785788] RIP: 0010:[<ffffffffa03e28a0>]
  [<ffffffffa03e28a0>] gfn_to_rmap+0x50/0x70 [kvm]
May 7 01:20:10 aethereal kernel: [147379.785797] RSP: 0018:ffff8800bb861968 EF
LAGS: 00010246
May 7 01:20:10 aethereal kernel: [147379.785799] RAX: 0000000000000000 RBX: 000
0000000000000 RCX: 0000000000000000
May 7 01:20:10 aethereal kernel: [147379.785800] RDX: 00000000000fee01 RSI: 000
0000000000022 RDI: fffffffffffff001
May 7 01:20:10 aethereal kernel: [147379.785802] RBP: ffff8800bb861978 R08: 000
0000000000022 R09: 0000000000000000
May 7 01:20:10 aethereal kernel: [147379.785803] R10: ffff8800bb8619f8 R11: 000
0000000000000 R12: fffffffffffff001
May 7 01:20:10 aethereal kernel: [147379.785805] R13: ffff880047182a50 R14: fff
f8800bb968000 R15: ffff8800bb8619f8
May 7 01:20:10 aethereal kernel: [147379.785807] FS: 00007f7b640ea950(0000) GS
:ffffffff80aa3000(0000) knlGS:0000000000000000
May 7 01:20:10 aethereal kernel: [147379.785809] CS: 0010 DS: 002b ES: 002b CR
0: 0000000080050033
May 7 01:20:10 aethereal kernel: [147379.785810] CR2: 0000000000000000 CR3: 000
00000bf568000 CR4: 00000000000026a0
May 7 01:20:10 aethereal kernel: [147379.785812] DR0: 0000000000000000 DR1: 000
0000000000000 DR2: 0000000000000000
May 7 01:20:10 aethereal kernel: [147379.785814] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
May 7 01:20:10 aethereal kernel: [147379.785816] Process kvm (pid: 2053, threadinfo ffff8800bb860000, task ffff8800bf5a9660)
May 7 01:20:10 aethereal kernel: [147379.785817] Stack:
May 7 01:20:10 aethereal kernel: [147...

Read more...

Revision history for this message
Dustin Kirkland  (kirkland) wrote :

Michael-

Do you have frequency scaling enabled on the host? If so, could you disable it, or pin your cpu and see if you see the oops again?

:-Dustin

Revision history for this message
Michael Robinson (robinson-netrinsics) wrote :

Dustin -
Could you be more specific about how I would go about "pinning my cpu"?

Thanks.

Revision history for this message
Michael Robinson (robinson-netrinsics) wrote :
Download full text (5.9 KiB)

Meanwhile, here's another. This crash seems like it may be correlated with suspend/resume somehow. It never happens immediately after a suspend/resume, but as I recall it also never happens until at least one suspend/resume.

May 12 22:40:01 aethereal kernel: [22670.242629] ------------[ cut here ]------------
May 12 22:40:01 aethereal kernel: [22670.242635] kernel BUG at /var/lib/dkms/kvm/84/build/x86/mmu.c:640!
May 12 22:40:01 aethereal kernel: [22670.242637] invalid opcode: 0000 [#1] SMP
May 12 22:40:01 aethereal kernel: [22670.242640] last sysfs file: /sys/devices/pci0000:00/0000:00:1c.1/0000:0c:00.0/rfkill/rfkill0/state
May 12 22:40:01 aethereal kernel: [22670.242642] Dumping ftrace buffer:
May 12 22:40:01 aethereal kernel: [22670.242644] (ftrace buffer empty)
May 12 22:40:01 aethereal kernel: [22670.242646] CPU 1
May 12 22:40:01 aethereal kernel: [22670.242648] Modules linked in: binfmt_misc i915 drm ppdev bridge stp bnep input_polldev kvm_intel kvm snd_hwdep sbp2 lp parport snd_hda_intel snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss arc4 snd_seq_midi ecb snd_rawmidi snd_seq_midi_event pata_pcmcia snd_seq iwlagn snd_timer iwlcore snd_seq_device led_class pcmcia snd mac80211 soundcore psmouse yenta_socket rsrc_nonstatic pcmcia_core dcdbas pcspkr iTCO_wdt iTCO_vendor_support snd_page_alloc cfg80211 serio_raw btusb joydev sha256_generic aes_x86_64 aes_generic cbc dm_crypt fbcon tileblit font bitblit softcursor squashfs unionfs nls_iso8859_1 nls_cp437 vfat fat usbhid usb_storage ohci1394 ieee1394 tg3 intel_agp video output
May 12 22:40:01 aethereal kernel: [22670.242687] Pid: 7022, comm: kvm Tainted: G W 2.6.28-11-generic #42-Ubuntu
May 12 22:40:01 aethereal kernel: [22670.242689] RIP: 0010:[<ffffffffa03e2a30>] [<ffffffffa03e2a30>] rmap_remove+0x170/0x230 [kvm]
May 12 22:40:01 aethereal kernel: [22670.242702] RSP: 0018:ffff8800b6905988 EFLAGS: 00010246
May 12 22:40:01 aethereal kernel: [22670.242704] RAX: 0000000000000000 RBX: 0000000bcb8e7cff RCX: 0000000000000008
May 12 22:40:01 aethereal kernel: [22670.242705] RDX: 0000000000000000 RSI: 0000000000000002 RDI: ffff88005910c320
May 12 22:40:01 aethereal kernel: [22670.242707] RBP: ffff8800b69059a8 R08: ffffc20014d86418 R09: ffff88005910c320
May 12 22:40:01 aethereal kernel: [22670.242709] R10: ffff8800b69059f8 R11: 0000000000000000 R12: ffff880119733000
May 12 22:40:01 aethereal kernel: [22670.242710] R13: ffff880001f35630 R14: ffff8800bd894000 R15: ffff8800b69059f8
May 12 22:40:01 aethereal kernel: [22670.242712] FS: 00007f6e52874950(0000) GS:ffff88011f803a80(0000) knlGS:0000000000000000
May 12 22:40:01 aethereal kernel: [22670.242714] CS: 0010 DS: 002b ES: 002b CR0: 0000000080050033
May 12 22:40:01 aethereal kernel: [22670.242715] CR2: 000000006d000000 CR3: 00000000acd9d000 CR4: 00000000000026a0
May 12 22:40:01 aethereal kernel: [22670.242717] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
May 12 22:40:01 aethereal kernel: [22670.242718] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
May 12 22:40:01 aethereal kernel: [22670.242720] Process kvm (pid: 7022, threadinfo ffff8800b6904000, task ffff88005c0dacc0)
May 12 22:40:0...

Read more...

Revision history for this message
Michael Robinson (robinson-netrinsics) wrote :
Download full text (6.3 KiB)

"but as I recall it also never happens until at least one suspend/resume"

Until today. Sigh. Nevermind.

May 13 16:38:45 aethereal kernel: [28098.589766] BUG: unable to handle kernel NULL pointer dereference at 0000000000000000
May 13 16:38:45 aethereal kernel: [28098.589774] IP: [<ffffffffa03e88a0>] gfn_to_rmap+0x50/0x70 [kvm]
May 13 16:38:45 aethereal kernel: [28098.589797] PGD cddd7067 PUD d98bc067 PMD 0
May 13 16:38:45 aethereal kernel: [28098.589802] Oops: 0000 [#1] SMP
May 13 16:38:45 aethereal kernel: [28098.589806] last sysfs file: /sys/devices/pci0000:00/0000:00:1c.1/0000:0c:00.0/rfkill/rfkill0/state
May 13 16:38:45 aethereal kernel: [28098.589810] Dumping ftrace buffer:
May 13 16:38:45 aethereal kernel: [28098.589813] (ftrace buffer empty)
May 13 16:38:45 aethereal kernel: [28098.589814] CPU 0
May 13 16:38:45 aethereal kernel: [28098.589817] Modules linked in: binfmt_misc i915 drm ppdev bridge stp bnep input_polldev kvm_intel kvm snd_hwdep sbp2 lp parport snd_hda_intel snd_pcm_oss
 snd_mixer_oss arc4 snd_pcm ecb pata_pcmcia snd_seq_dummy snd_seq_oss iwlagn iwlcore snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq led_class pcmcia snd_timer snd_seq_device mac80211 sn
d soundcore yenta_socket rsrc_nonstatic pcmcia_core iTCO_wdt iTCO_vendor_support snd_page_alloc psmouse btusb cfg80211 dcdbas pcspkr serio_raw joydev sha256_generic aes_x86_64 aes_generic cb
c dm_crypt fbcon tileblit font bitblit softcursor squashfs unionfs nls_iso8859_1 nls_cp437 vfat fat usbhid usb_storage ohci1394 ieee1394 tg3 video output intel_agp
May 13 16:38:45 aethereal kernel: [28098.589879] Pid: 6963, comm: kvm Not tainted 2.6.28-11-generic #42-Ubuntu
May 13 16:38:45 aethereal kernel: [28098.589881] RIP: 0010:[<ffffffffa03e88a0>] [<ffffffffa03e88a0>] gfn_to_rmap+0x50/0x70 [kvm]
May 13 16:38:45 aethereal kernel: [28098.589895] RSP: 0018:ffff8800b3887bd8 EFLAGS: 00010246
May 13 16:38:45 aethereal kernel: [28098.589897] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
May 13 16:38:45 aethereal kernel: [28098.589899] RDX: 00000000000fee01 RSI: 0000000000000022 RDI: fffffffffffff001
May 13 16:38:45 aethereal kernel: [28098.589902] RBP: ffff8800b3887be8 R08: 0000000000000022 R09: 0000000000000000
May 13 16:38:45 aethereal kernel: [28098.589904] R10: 0000000000000002 R11: 0000000000000000 R12: fffffffffffff001
May 13 16:38:45 aethereal kernel: [28098.589906] R13: ffff880013087420 R14: ffff8800c2508000 R15: 0000000000000000
May 13 16:38:45 aethereal kernel: [28098.589909] FS: 0000000000000000(0000) GS:ffffffff80aa3000(0000) knlGS:0000000000000000
May 13 16:38:45 aethereal kernel: [28098.589912] CS: 0010 DS: 002b ES: 002b CR0: 000000008005003b
May 13 16:38:45 aethereal kernel: [28098.589914] CR2: 0000000000000000 CR3: 00000000bb89d000 CR4: 00000000000026a0
May 13 16:38:45 aethereal kernel: [28098.589916] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
May 13 16:38:45 aethereal kernel: [28098.589919] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
May 13 16:38:45 aethereal kernel: [28098.589922] Process kvm (pid: 6963, threadinfo ffff8800b3886000, task ffff8800b89d0000)
May 13 16:38:45 aethereal kernel: [28...

Read more...

Revision history for this message
Michael Robinson (robinson-netrinsics) wrote :

Last week, I pulled a stock 2.6.29.3 kernel from kernel.org, built it with kpkg, and installed kvm-85 on top. It's been completely stable since, so I expect I will stick with this.

Changed in linux (Ubuntu):
status: New → Triaged
Revision history for this message
Dustin Kirkland  (kirkland) wrote :

For info on pinning your CPU, see the cpufreq-selector utility.

http://manpages.ubuntu.com/manpages/jaunty/en/man1/cpufreq-selector.1.html

:-Dustin

Revision history for this message
Dustin Kirkland  (kirkland) wrote :

John-

Any update on this? Can you confirm that it's a kernel issue, and not KVM userspace?

:-Dustin

Changed in kvm (Ubuntu):
status: Confirmed → Incomplete
Revision history for this message
John Johansen (jjohansen) wrote :

No I haven't been able to confirm it is a kernel issue yet, though I am planning on devoting some good time to this bug over the next couple days.

Revision history for this message
Dustin Kirkland  (kirkland) wrote :

Can anyone reproduce this on karmic?

:-Dustin

Revision history for this message
John Johansen (jjohansen) wrote :

I haven't, though it is possible I just haven't spent enough time testing it in karmic yet.

Revision history for this message
Jeremy Foshee (jeremyfoshee) wrote :

This bug report was marked as Triaged a while ago but has not had any updated comments for quite some time. Please let us know if this issue remains in the current Ubuntu release, http://www.ubuntu.com/getubuntu/download . If the issue remains, click on the current status under the Status column and change the status back to "New". Thanks.

[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

Thierry Carrez (ttx)
Changed in linux (Ubuntu):
status: Triaged → Incomplete
Revision history for this message
Vish (vish) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. We are closing this bug report because it lacks the information we need to investigate the problem, as described in the previous comments. Please reopen it if you can give us the missing information, and don't hesitate to submit bug reports in the future.
To reopen the bug report you can click on the current status, under the Status column, and change the Status back to "New".

Changed in linux (Ubuntu):
status: Incomplete → Invalid
Changed in kvm (Ubuntu):
status: Incomplete → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.