kernel BUG at lib/radix-tree.c:372! on a vmware node

Bug #58172 reported by Herbert Straub
Affects: linux-meta (Ubuntu)
Status: New
Importance: Undecided
Assigned to: Unassigned
Milestone: (none listed)

Bug Description

Binary package hint: linux-image-2.6-686

Situation: all VMware virtual machines are stored on an NFS-mounted filesystem. One VMware machine becomes unusable and consumes all the CPU time. A kill -KILL pid does not terminate the vmware process, and an ls -lh on the NFS mount hangs.

I'm using Dapper Drake, details:
2.6.15-26-686 #1 SMP PREEMPT Thu Aug 3 03:13:28 UTC 2006 i686 GNU/Linux
dpkg -l nfs-common
Desired=Unknown/Install/Remove/Purge/Hold
| Status=Not/Installed/Config-files/Unpacked/Failed-config/Half-installed
|/ Err?=(none)/Hold/Reinst-required/X=both-problems (Status,Err: uppercase=bad)
||/ Name Version Description
+++-==============-==============-============================================
ii nfs-common 1.0.7-3ubuntu2 NFS support files common to client and serve

I found the following sequence in /var/log/kern.log:

kernel: [17758077.728000] ------------[ cut here ]------------
kernel: [17758077.728000] kernel BUG at lib/radix-tree.c:372!
kernel: [17758077.728000] invalid operand: 0000 [#1]
kernel: [17758077.728000] PREEMPT SMP
kernel: [17758077.728000] Modules linked in: vmnet parport_pc vmmon nfs lockd sunrpc ipv6 md_mod dm_mod lp parport ide_disk tsdev serio_raw psmouse i2c_piix4 i2c_core floppy cfi_probe gen_probe pcspkr scb2_flash mtdcore chipreg tg3 sworks_agp agpgart map_funcs evdev ext3 jbd ide_generic ohci_hcd usbcore ide_cd cdrom serverworks generic cciss scsi_mod thermal processor fan capability commoncap vga16fb vgastate fbcon tileblit font bitblit softcursor
kernel: [17758077.728000] CPU: 0
kernel: [17758077.728000] EIP: 0060:[radix_tree_tag_set+147/160] Tainted: P VLI
kernel: [17758077.728000] EFLAGS: 00013046 (2.6.15-26-686)
kernel: [17758077.728000] EIP is at radix_tree_tag_set+0x93/0xa0
kernel: [17758077.728000] eax: 00000000 ebx: 00000000 ecx: f5ba81a0 edx: 00000006
kernel: [17758077.728000] esi: 00000000 edi: 00000004 ebp: 00000003 esp: ee27fd10
kernel: [17758077.728000] ds: 007b es: 007b ss: 0068
kernel: [17758077.728000] Process vmware-vmx (pid: 25602, threadinfo=ee27e000 task=ed392a90)
kernel: [17758077.728000] Stack: 00000008 c2a5da84 ef6ac730 00000000 ef6ac740 c015766c ef6ac734 00005b86
kernel: [17758077.728000] 00000001 00003202 e2e90c54 e2e90c60 ef5e84c0 ee27fd90 f8cf835d c2a5da84
kernel: [17758077.728000] 00000050 f8cf726d ef6ac630 f78e6410 00000000 e2e90b00 00000000 ef5e84c0
kernel: [17758077.728000] Call Trace:
kernel: [17758077.728000] [test_set_page_writeback+284/320] test_set_page_writeback+0x11c/0x140
kernel: [17758077.728000] [pg0+948556637/1069167616] nfs_flush_one+0xdd/0x190 [nfs]
kernel: [17758077.728000] [pg0+948552301/1069167616] nfs_find_request+0x3d/0x50 [nfs]
kernel: [17758077.728000] [pg0+948556909/1069167616] nfs_flush_list+0x5d/0xc0 [nfs]
kernel: [17758077.728000] [pg0+948559695/1069167616] nfs_flush_inode+0x8f/0xd0 [nfs]
kernel: [17758077.728000] [pg0+948551700/1069167616] nfs_writepages+0xb4/0x140 [nfs]
kernel: [17758077.728000] [__filemap_fdatawrite_range+107/128] __filemap_fdatawrite_range+0x6b/0x80
kernel: [17758077.728000] [filemap_fdatawrite+48/64] filemap_fdatawrite+0x30/0x40
kernel: [17758077.728000] [pg0+948518038/1069167616] nfs_sync_mapping+0x46/0x90 [nfs]
kernel: [17758077.728000] [pg0+948522552/1069167616] nfs_revalidate_mapping+0xa8/0xe0 [nfs]
kernel: [17758077.728000] [__up+28/32] __up+0x1c/0x20
kernel: [17758077.728000] [pg0+948511850/1069167616] nfs_file_write+0x9a/0x130 [nfs]
kernel: [17758077.728000] [do_sync_write+201/304] do_sync_write+0xc9/0x130
kernel: [17758077.728000] [update_atime+138/160] update_atime+0x8a/0xa0
kernel: [17758077.728000] [autoremove_wake_function+0/96] autoremove_wake_function+0x0/0x60
kernel: [17758077.728000] [vfs_write+214/432] vfs_write+0xd6/0x1b0
kernel: [17758077.728000] [sys_pwrite64+128/144] sys_pwrite64+0x80/0x90
kernel: [17758077.728000] [sysenter_past_esp+84/117] sysenter_past_esp+0x54/0x75
kernel: [17758077.728000] Code: 24 d3 ea 83 e2 3f 8d 8c 03 04 01 00 00 0f a3 11 19 c0 85 c0 75 03 0f ab 11 8b 5c 93 04 85 db 74 0a 45 83 ee 06 39 ef 75 cf eb a3 <0f> 0b 74 01 d4 a7 32 c0 eb ec 8d 76 00 55 57 56 53 83 ec 40 8b
kernel: [17758077.728000] <4>rtc: lost some interrupts at 2048Hz.
kernel: [17758078.152000] note: vmware-vmx[25602] exited with preempt_count 1
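
As a reading aid for the trace above, here is a minimal Python sketch that pulls the function names and owning modules out of the "Call Trace:" section of a kern.log excerpt. It is not part of the original report: the name parse_call_trace, the regular expression, and the "kernel" label for frames without a module tag are illustrative assumptions, and the pattern only targets the symbol+0xOFFSET/0xSIZE form seen in this log.

```python
import re

# Matches a frame such as "nfs_flush_one+0xdd/0x190 [nfs]" at the end of a line.
# The "[module]" suffix is optional; frames without it belong to the core kernel.
FRAME_RE = re.compile(
    r"(?P<func>[A-Za-z_][A-Za-z0-9_]*)"
    r"\+0x(?P<offset>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?:\s+\[(?P<module>[^\]]+)\])?\s*$"
)


def parse_call_trace(lines):
    """Return (function, module) pairs for the frames that follow 'Call Trace:'."""
    frames = []
    in_trace = False
    for line in lines:
        if "Call Trace:" in line:
            in_trace = True
            continue
        if not in_trace:
            continue
        match = FRAME_RE.search(line)
        if match is None:
            break  # the first non-frame line (e.g. "Code: ...") ends the trace
        # "kernel" is a label chosen here for frames with no module tag.
        frames.append((match.group("func"), match.group("module") or "kernel"))
    return frames


# A few lines copied from the log above, used as sample input.
SAMPLE = [
    "kernel: [17758077.728000] Call Trace:",
    "kernel: [17758077.728000] [test_set_page_writeback+284/320] test_set_page_writeback+0x11c/0x140",
    "kernel: [17758077.728000] [pg0+948556637/1069167616] nfs_flush_one+0xdd/0x190 [nfs]",
    "kernel: [17758077.728000] [sys_pwrite64+128/144] sys_pwrite64+0x80/0x90",
    "kernel: [17758077.728000] Code: 24 d3 ea 83 e2 3f",
]

for func, module in parse_call_trace(SAMPLE):
    print(f"{module:8s} {func}")
```

Run against the full log, this makes it easy to see that the write path goes from sys_pwrite64 through the nfs module before reaching test_set_page_writeback.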

A Google search for "kernel BUG at lib/radix-tree.c:372" points to an NFS file truncation race condition in combination with kernel 2.6.15.

Herbert Straub (herbert) wrote :

This is a duplicate of bug 58170...

Changed in linux-meta:
status: Unconfirmed → Rejected
Changed in linux-meta:
status: Rejected → Unconfirmed