kernel oops during heavy tar across nfs

Bug #140865 reported by schobes
Affects: linux-source-2.6.20 (Ubuntu)
Status: Invalid
Importance: Undecided
Assigned to: Unassigned
Milestone: (empty)

Bug Description

$ uname -a
Linux ghost 2.6.20-16-generic #2 SMP Fri Aug 31 00:55:27 UTC 2007 i686 GNU/Linux

I am making a full-system tar backup of the machine with the following command:
cd / && nice -n 19 tar -czf /mnt/sonofbeast/backup/ghost/.system-daily-20070918.tar.gz --exclude=cdrom --exclude=dev --exclude=home --exclude=media --exclude=mnt --exclude=proc --exclude=sys ./

The archive being created is written to an NFS-mounted directory.

Several minutes into the backup I receive this kernel oops:
Sep 18 23:04:57 ghost kernel: [892635.504354] BUG: unable to handle kernel NULL pointer dereference at virtual address 00000001
Sep 18 23:04:57 ghost kernel: [892635.504380] printing eip:
Sep 18 23:04:57 ghost kernel: [892635.504386] 00000001
Sep 18 23:04:57 ghost kernel: [892635.504388] *pde = 00000000
Sep 18 23:04:57 ghost kernel: [892635.504397] Oops: 0000 [#1]
Sep 18 23:04:57 ghost kernel: [892635.504402] SMP
Sep 18 23:04:57 ghost kernel: [892635.504411] Modules linked in: ppp_deflate zlib_deflate bsd_comp ppp_async ppp_generic slhc xt_tcpudp xt_state ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack
Sep 18 23:04:57 ghost kernel: bcore 8139too 8139cp mii thermal processor fan fbcon tileblit font bitblit softcursor vesafb capability commoncap
Sep 18 23:04:57 ghost kernel: [892635.504726] CPU: 0
Sep 18 23:04:57 ghost kernel: [892635.504727] EIP: 0060:[_proxy_pda+1/1024] Tainted: P VLI
Sep 18 23:04:57 ghost kernel: [892635.504729] EFLAGS: 00052682 (2.6.20-16-generic #2)
Sep 18 23:04:57 ghost kernel: [892635.504769] EIP is at 0x1
Sep 18 23:04:57 ghost kernel: [892635.504783] eax: 00000000 ebx: 00000282 ecx: 00000036 edx: f35ea7b8
Sep 18 23:04:57 ghost kernel: [892635.504803] esi: f35ea704 edi: f35ea5dc ebp: c71d8c20 esp: df8c7df0
Sep 18 23:04:57 ghost kernel: [892635.504822] ds: 007b es: 007b ss: 0068
Sep 18 23:04:57 ghost kernel: [892635.504838] Process pdflush (pid: 131, ti=df8c6000 task=df86e030 task.ti=df8c6000)
Sep 18 23:04:57 ghost kernel: [892635.504858] Stack: f92896f2 c1452940 c1452940 0000000a c1452940 f35ea704 f928ac7b df8c7ea4
Sep 18 23:04:57 ghost kernel: [892635.504901] df8c7f64 00000286 c1452940 0000000a df8c7f64 f35ea7a8 f928ad81 c1452940
Sep 18 23:04:57 ghost kernel: [892635.504943] 0000000a c015cd2d 00000000 0000000e 0000000e 00000000 0000000e f6d899a0
Sep 18 23:04:57 ghost kernel: [892635.504985] Call Trace:
Sep 18 23:04:57 ghost kernel: [892635.505009] [<f92896f2>] nfs_page_mark_flush+0xf2/0x100 [nfs]
Sep 18 23:04:57 ghost kernel: [892635.505076] [<f928ac7b>] nfs_writepage_locked+0x5b/0x150 [nfs]
Sep 18 23:04:57 ghost kernel: [892635.505114] [<f928ad81>] nfs_writepage+0x11/0x30 [nfs]
Sep 18 23:04:57 ghost kernel: [892635.505148] [generic_writepages+525/816] generic_writepages+0x20d/0x330
Sep 18 23:04:57 ghost kernel: [892635.505179] [<f928ad70>] nfs_writepage+0x0/0x30 [nfs]
Sep 18 23:04:57 ghost kernel: [892635.505227] [<f928b9a8>] nfs_writepages+0x38/0x130 [nfs]
Sep 18 23:04:57 ghost kernel: [892635.505261] [<f89763b8>] jfs_commit_inode+0x88/0x120 [jfs]
Sep 18 23:04:57 ghost kernel: [892635.505302] [do_writepages+43/80] do_writepages+0x2b/0x50
Sep 18 23:04:57 ghost kernel: [892635.505323] [__writeback_single_inode+147/960] __writeback_single_inode+0x93/0x3c0
Sep 18 23:04:57 ghost kernel: [892635.505352] [native_write_cr0+0/16] native_write_cr0+0x0/0x10
Sep 18 23:04:57 ghost kernel: [892635.505377] [schedule+765/2704] __sched_text_start+0x2fd/0xa90
Sep 18 23:04:57 ghost kernel: [892635.505402] [sync_sb_inodes+382/576] sync_sb_inodes+0x17e/0x240
Sep 18 23:04:57 ghost kernel: [892635.505426] [writeback_inodes+153/208] writeback_inodes+0x99/0xd0
Sep 18 23:04:57 ghost kernel: [892635.505448] [pdflush+0/512] pdflush+0x0/0x200
Sep 18 23:04:57 ghost kernel: [892635.505466] [wb_kupdate+133/240] wb_kupdate+0x85/0xf0
Sep 18 23:04:57 ghost kernel: [892635.505494] [pdflush+294/512] pdflush+0x126/0x200
Sep 18 23:04:57 ghost kernel: [892635.505514] [wb_kupdate+0/240] wb_kupdate+0x0/0xf0
Sep 18 23:04:57 ghost kernel: [892635.505537] [kthread+186/240] kthread+0xba/0xf0
Sep 18 23:04:57 ghost kernel: [892635.505559] [kthread+0/240] kthread+0x0/0xf0
Sep 18 23:04:57 ghost kernel: [892635.505579] [kernel_thread_helper+7/16] kernel_thread_helper+0x7/0x10
Sep 18 23:04:57 ghost kernel: [892635.505605] =======================
Sep 18 23:04:57 ghost kernel: [892635.505619] Code: Bad EIP value.
Sep 18 23:04:57 ghost kernel: [892635.505638] EIP: [_proxy_pda+1/1024] 0x1 SS:ESP 0068:df8c7df0

The oops is usually followed by normal operation, but with a sustained high load average even though the CPU is almost entirely idle. Other times I get a kernel panic and the machine freezes completely. This is what happened the last time the machine froze:

Sep 18 23:10:07 ghost kernel: [892635.505983] <1>BUG: unable to handle kernel NULL pointer dereference at virtual address 00000001
Sep 18 23:10:16 ghost kernel: [892945.092278] printing eip:
Sep 18 23:10:16 ghost kernel: [892945.092351] 00000001
Sep 18 23:10:16 ghost kernel: [892945.092417] *pde = 00000000
Sep 18 23:10:16 ghost kernel: [892945.092474] Oops: 0000 [#2]
Sep 18 23:10:16 ghost kernel: [892945.092528] SMP
Sep 18 23:10:16 ghost kernel: [892945.092663] Modules linked in: ppp_deflate zlib_deflate bsd_comp ppp_async ppp_generic slhc xt_tcpudp xt_state ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack
Sep 18 23:10:16 ghost kernel: bcore 8139too 8139cp mii thermal processor fan fbcon tileblit font bitblit softcursor vesafb capability commoncap
Sep 18 23:10:16 ghost kernel: [892945.097494] CPU: 0
Sep 18 23:10:16 ghost kernel: [892945.097495] EIP: 0060:[_proxy_pda+1/1024] Tainted: P VLI
Sep 18 23:10:16 ghost kernel: [892945.097496] EFLAGS: 00052686 (2.6.20-16-generic #2)
Sep 18 23:10:16 ghost kernel: [892945.097657] EIP is at 0x1
Sep 18 23:10:16 ghost kernel: [892945.097709] eax: 00000000 ebx: 00000282 ecx: 0000000d edx: f35ea7b8
Sep 18 23:10:16 ghost kernel: [892945.097767] esi: f35ea704 edi: f35ea5dc ebp: eba5bee0 esp: de489b8c
Sep 18 23:10:16 ghost kernel: [892945.097825] ds: 007b es: 007b ss: 0068
Sep 18 23:10:16 ghost kernel: [892945.097879] Process gzip (pid: 5415, ti=de488000 task=d2be8a90 task.ti=de488000)
Sep 18 23:10:16 ghost kernel: [892945.097937] Stack: f92896f2 c1733760 c1733760 0000000d c1733760 f35ea704 f928ac7b de489c40
Sep 18 23:10:16 ghost kernel: [892945.098362] de489d0c 00000286 c1733760 0000000d de489d0c f35ea7a8 f928ad81 c1733760
Sep 18 23:10:16 ghost kernel: [892945.098788] 0000000d c015cd2d 00000000 0000000e 00000001 00000000 0000000e f6d899a0
Sep 18 23:10:16 ghost kernel: [892945.099213] Call Trace:
Sep 18 23:10:16 ghost kernel: [892945.099313] [<f92896f2>] nfs_page_mark_flush+0xf2/0x100 [nfs]
Sep 18 23:10:16 ghost kernel: [892945.099458] [<f928ac7b>] nfs_writepage_locked+0x5b/0x150 [nfs]
Sep 18 23:10:16 ghost kernel: [892945.099574] [<f928ad81>] nfs_writepage+0x11/0x30 [nfs]
Sep 18 23:10:16 ghost kernel: [892945.099684] [generic_writepages+525/816] generic_writepages+0x20d/0x330
Sep 18 23:10:16 ghost kernel: [892945.099793] [<f928ad70>] nfs_writepage+0x0/0x30 [nfs]
Sep 18 23:10:16 ghost kernel: [892945.099916] [<f928b9a8>] nfs_writepages+0x38/0x130 [nfs]
Sep 18 23:10:16 ghost kernel: [892945.100030] [do_writepages+43/80] do_writepages+0x2b/0x50
Sep 18 23:10:16 ghost kernel: [892945.100127] [__writeback_single_inode+658/960] __writeback_single_inode+0x292/0x3c0
Sep 18 23:10:16 ghost kernel: [892945.100231] [<f9286140>] nfs_unlock_request+0x30/0x40 [nfs]
Sep 18 23:10:16 ghost kernel: [892945.100342] [<f928a881>] nfs_writepage_setup+0x151/0x4f0 [nfs]
Sep 18 23:10:16 ghost kernel: [892945.100457] [sync_sb_inodes+382/576] sync_sb_inodes+0x17e/0x240
Sep 18 23:10:16 ghost kernel: [892945.100558] [writeback_inodes+153/208] writeback_inodes+0x99/0xd0
Sep 18 23:10:16 ghost kernel: [892945.100657] [balance_dirty_pages_ratelimited_nr+177/480] balance_dirty_pages_ratelimited_nr+0xb1/0x1e0
Sep 18 23:10:16 ghost kernel: [892945.100768] [generic_file_buffered_write+963/1744] generic_file_buffered_write+0x3c3/0x6d0
Sep 18 23:10:16 ghost kernel: [892945.100893] [net_rx_action+201/544] net_rx_action+0xc9/0x220
Sep 18 23:10:16 ghost kernel: [892945.100994] [__generic_file_aio_write_nolock+749/1552] __generic_file_aio_write_nolock+0x2ed/0x610
Sep 18 23:10:16 ghost kernel: [892945.101099] [do_IRQ+69/128] do_IRQ+0x45/0x80
Sep 18 23:10:16 ghost kernel: [892945.101200] [common_interrupt+35/48] common_interrupt+0x23/0x30
Sep 18 23:10:16 ghost kernel: [892945.101306] [generic_file_aio_write+85/208] generic_file_aio_write+0x55/0xd0
Sep 18 23:10:16 ghost kernel: [892945.101404] [skb_copy_and_csum_bits+340/816] skb_copy_and_csum_bits+0x154/0x330
Sep 18 23:10:16 ghost kernel: [892945.101511] [<f9280ad4>] nfs_file_write+0xc4/0x180 [nfs]
Sep 18 23:10:16 ghost kernel: [892945.101625] [do_sync_write+213/288] do_sync_write+0xd5/0x120
Sep 18 23:10:16 ghost kernel: [892945.101724] [<f8852502>] rtl8139_start_xmit+0xc2/0x140 [8139too]
Sep 18 23:10:16 ghost kernel: [892945.101837] [dev_hard_start_xmit+471/688] dev_hard_start_xmit+0x1d7/0x2b0
Sep 18 23:10:16 ghost kernel: [892945.101934] [autoremove_wake_function+0/80] autoremove_wake_function+0x0/0x50
Sep 18 23:10:16 ghost kernel: [892945.102038] [__qdisc_run+128/464] __qdisc_run+0x80/0x1d0
Sep 18 23:10:16 ghost kernel: [892945.102142] [net_tx_action+228/272] net_tx_action+0xe4/0x110
Sep 18 23:10:16 ghost kernel: [892945.102242] [vfs_write+190/400] vfs_write+0xbe/0x190
Sep 18 23:10:16 ghost kernel: [892945.102338] [do_sync_write+0/288] do_sync_write+0x0/0x120
Sep 18 23:10:16 ghost kernel: [892945.102436] [sys_write+65/112] sys_write+0x41/0x70
Sep 18 23:10:16 ghost kernel: [892945.102535] [sysenter_past_esp+105/169] sysenter_past_esp+0x69/0xa9
Sep 18 23:10:16 ghost kernel: [892945.102642] =======================
Sep 18 23:10:16 ghost kernel: [892945.102695] Code: Bad EIP value.
Sep 18 23:10:16 ghost kernel: [892945.102828] EIP: [_proxy_pda+1/1024] 0x1 SS:ESP 0068:de489b8c

Let me know if there is any other information that would be useful. It is very easy for me to reproduce the errors. Thanks in advance.
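
Both traces above reach the bad EIP through the same NFS write path (nfs_writepage -> nfs_writepage_locked -> nfs_page_mark_flush). A minimal sketch for pulling the [nfs] frames out of a saved copy of the log, so the two oopses can be compared side by side. The helper name nfs_frames and the file name oops.log are hypothetical, chosen only for illustration:

```shell
# nfs_frames: hypothetical helper, not part of any kernel or Ubuntu tool.
# Prints the symbol name of every call-trace frame tagged [nfs] in the
# given log file, one per line, in the order they appear.
nfs_frames() {
    # Match "symbol+0xOFF/0xSIZE [nfs]", then strip the offset and module tag.
    grep -oE '[A-Za-z0-9_]+\+0x[0-9a-f]+/0x[0-9a-f]+ \[nfs\]' "$1" |
        sed -E 's/\+0x.*//'
}
```

For example, `nfs_frames oops.log | sort | uniq -c` would count how often each NFS function shows up across all the oopses saved in that file.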

Tags: kernel-oops
Leann Ogasawara (leannogasawara) wrote:

Thank you for taking the time to report this bug and helping to make Ubuntu better. Now that the 7.10 Gutsy Gibbon release of Ubuntu is out, we were wondering whether you can still reproduce this issue. Could you please download and try the new version of Ubuntu from http://www.ubuntu.com/getubuntu/download and report back your results? If the issue is still present in the new release, please attach the following information:

* uname -a > uname-a.log
* cat /proc/version_signature > version.log
* dmesg > dmesg.log
* sudo lspci -vvnn > lspci-vvnn.log

Please be sure to attach each file as a separate attachment. For more information regarding the kernel team bug policy, please refer to https://wiki.ubuntu.com/KernelTeamBugPolicies . Thanks again and we appreciate your help and feedback.
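
The four commands above could be wrapped in a small script so that every file lands in one directory, ready to attach individually. This is only a sketch: collect_kernel_logs is a hypothetical helper name, and the guards around /proc/version_signature, dmesg, and lspci are assumptions added so the script keeps going on systems where one of them is unavailable.

```shell
#!/bin/sh
# collect_kernel_logs: hypothetical helper, not an official Ubuntu tool.
# Writes the requested log files into the directory given as $1
# (default: the current directory).
collect_kernel_logs() {
    outdir="${1:-.}"
    mkdir -p "$outdir" || return 1

    uname -a > "$outdir/uname-a.log"

    # /proc/version_signature is Ubuntu-specific; skip it if it is absent.
    if [ -r /proc/version_signature ]; then
        cat /proc/version_signature > "$outdir/version.log"
    fi

    # Keep going if dmesg is missing or not permitted for this user.
    dmesg > "$outdir/dmesg.log" 2>/dev/null || echo "dmesg failed" >&2

    # The request above runs lspci under sudo; skip sudo when already root.
    if command -v lspci >/dev/null 2>&1; then
        if [ "$(id -u)" -eq 0 ]; then
            lspci -vvnn > "$outdir/lspci-vvnn.log" || echo "lspci failed" >&2
        else
            sudo lspci -vvnn > "$outdir/lspci-vvnn.log" || echo "lspci failed" >&2
        fi
    fi
}
```

Calling `collect_kernel_logs ./bug-140865-logs` would then leave up to four .log files in that directory, each to be uploaded as its own attachment.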

Changed in linux-source-2.6.20:
status: New → Incomplete
wolfger (wolfger) wrote:

5 months with no reply. We are closing this bug report because it lacks the information we need to investigate the problem, as described in the previous comments. Please reopen it if you can give us the missing information, and don't hesitate to submit bug reports in the future. To reopen the bug report you can click on the current status, under the Status column, and change the Status back to "New". Thanks again!

Changed in linux-source-2.6.20:
status: Incomplete → Invalid