[Regression] iwl3945 module always crashes on load and hangs boot process

Bug #275227 reported by Marcin Feder
10
Affects Status Importance Assigned to Milestone
Linux
Fix Released
Unknown
linux (Fedora)
Fix Released
Medium
linux (Ubuntu)
Fix Released
High
Amit Kucheria

Bug Description

Kubuntu Intrepid updated daily + Asus V6J

The boot process always permanently hangs on "Configuring network interfaces" when started with kernels from 2.6.27-3,4,6,7 but It works with my hardy kernel 2.6.24-19-generic.

The system boots with 2.6.27 kernels only when the wireless card is previously disabled in BIOS.

I have also achieved to boot with the iwl3945and mac80211 modules blacklisted. Than I tried to load iwl3945 using modprobe and that produced the following error trace.

[ 161.964119] ------------[ cut here ]------------
[ 161.964196] kernel BUG at /build/buildd/linux-2.6.27/net/mac80211/ieee80211_i.h:764!
[ 161.964284] invalid opcode: 0000 [#1] SMP
[ 161.964454] Modules linked in: arc4 ecb iwl3945 rfkill mac80211 cfg80211 binfmt_misc bridge stp bnep rfcomm l2cap uinput ppdev acpi_cpufreq cpufreq_userspace cpufreq_powersave cpufreq_conservat
ive cpufreq_ondemand cpufreq_stats freq_table wmi pci_slot sbs container sbshc af_packet iptable_filter ip_tables x_tables dm_crypt crypto_blkcipher dm_mod sbp2 parport_pc lp parport snd_hda_intel
 snd_pcm_oss snd_mixer_oss snd_pcm snd_seq_dummy snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer sdhci_pci iTCO_wdt sdhci btusb psmouse iTCO_vendor_support bluetooth snd_
seq_device serio_raw nvidia(P) snd mmc_core video asus_laptop i2c_core output ac joydev soundcore battery led_class intel_agp button snd_page_alloc shpchp pci_hotplug agpgart evdev ext3 jbd mbcach
e sg sd_mod sr_mod crc_t10dif cdrom ata_generic pata_acpi usbhid hid ohci1394 ata_piix ieee1394 ehci_hcd libata scsi_mod dock uhci_hcd r8169 usbcore thermal processor fan fbcon tileblit font bitbl
it softcursor uvesafb cn fuse
[ 161.968013]
[ 161.968013] Pid: 7252, comm: udevd Tainted: P (2.6.27-4-generic #1)
[ 161.968013] EIP: 0060:[<f8f0c938>] EFLAGS: 00010246 CPU: 0
[ 161.968013] EIP is at netdev_notify+0x88/0x90 [mac80211]
[ 161.968013] EAX: f8f25bd8 EBX: f76c6800 ECX: f76c6800 EDX: f3ca8040
[ 161.968013] ESI: fffffff2 EDI: 00000000 EBP: f3ca3e38 ESP: f3ca3e08
[ 161.968013] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
[ 161.968013] Process udevd (pid: 7252, ti=f3ca2000 task=f3c4bed0 task.ti=f3ca2000)
[ 161.968013] Stack: 00000000 00000000 0000000a f3ca3e38 c0356b3d f3ca3e38 f8dfe3ee 0000000a
[ 161.968013] 0000000a f8eab8d8 f8f25c20 fffffff2 f3ca3e54 c03970b5 f76c6800 0000000a
[ 161.968013] 00000000 f76c6810 f3ca3e98 f3ca3e64 c014c0bf ffffffff 00000000 f3ca3ea4
[ 161.968013] Call Trace:
[ 161.968013] [<c0356b3d>] ? inetdev_event+0xfd/0x240
[ 161.968013] [<f8dfe3ee>] ? packet_notifier+0x1e/0x1b0 [af_packet]
[ 161.968013] [<c03970b5>] ? notifier_call_chain+0x35/0x70
[ 161.968013] [<c014c0bf>] ? raw_notifier_call_chain+0x1f/0x30
[ 161.968013] [<c0308ec6>] ? dev_change_name+0x146/0x1d0
[ 161.968013] [<c017fa5e>] ? ftrace_record_ip+0xee/0x1f0
[ 161.968013] [<c030a797>] ? dev_ifsioc+0x267/0x300
[ 161.968013] [<c03934fc>] ? mutex_lock+0x1c/0x20
[ 161.968013] [<c030acb1>] ? dev_ioctl+0x481/0x520
[ 161.968013] [<c03946ed>] ? _spin_lock+0xd/0x10
[ 161.968013] [<c01e3525>] ? inotify_d_instantiate+0x45/0x60
[ 161.968013] [<c0215378>] ? security_d_instantiate+0x18/0x30
[ 161.968013] [<c01c72a7>] ? d_instantiate+0x57/0x60
[ 161.968013] [<c02fb1d4>] ? sock_ioctl+0x84/0x250
[ 161.968013] [<c02fb150>] ? sock_ioctl+0x0/0x250
[ 161.968013] [<c01c048d>] ? vfs_ioctl+0x2d/0x90
[ 161.968013] [<c01c0676>] ? do_vfs_ioctl+0x66/0x1f0
[ 161.968013] [<c01b14de>] ? filp_close+0x4e/0x80
[ 161.968013] [<c01c086b>] ? sys_ioctl+0x6b/0x70
[ 161.968013] [<c0103f6b>] ? sysenter_do_call+0x12/0x2f
[ 161.968013] [<c0390000>] ? native_cpu_up+0x56/0x180
[ 161.968013] =======================
[ 161.968013] Code: 07 00 00 8b 42 18 89 34 24 89 c1 e8 a3 9c 2f c7 85 c0 75 a3 89 74 24 04 c7 04 24 14 8b f1 f8 e8 78 5b 48 c7 eb 91 90 8d 74 26 00 <0f> 0b eb fe 8d 74 26 00 55 89 e5 83 ec 04 90 8d 74 26 00 b8 20
[ 161.968013] EIP: [<f8f0c938>] netdev_notify+0x88/0x90 [mac80211] SS:ESP 0068:f3ca3e08
[ 161.979850] ---[ end trace bc190790f56df6e5 ]---

Revision history for this message
Marcin Feder (marfed) wrote :
Changed in linux:
assignee: nobody → ubuntu-kernel-team
importance: Undecided → High
status: New → Triaged
Revision history for this message
In , Steve (steve-redhat-bugs) wrote :

Description of problem:
Trying to boot the 2.6.27-0.398.rc9.fc10.x86_64 kernel produces a bug output.

How reproducible:
Everytime.

Steps to Reproduce:
1. Boot
2. Wait for udev to start probing things

Actual results:
------------[ cut here ]------------
kernel BUG at net/mac80211/ieee80211_i.h:764!
invalid opcode: 0000 [1] SMP
CPU 0
Modules linked in: b43 rfkill input_polldev snd_atiixp_modem snd_atiixp snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss radeon snd_pcm drm snd_timer i2c_algo_bit video output snd tifm_7xx1 sdhci_pci sdhci tifm_core firewire_ohci battery ac yenta_socket mmc_core 8139cp rsrc_nonstatic shpchp soundcore wmi ssb 8139too firewire_core pcspkr i2c_piix4 mii crc_itu_t joydev i2c_core snd_page_alloc k8temp hwmon pata_atiixp pata_acpi ata_generic
Pid: 1378, comm: udevd Not tainted 2.6.27-0.398.rc9.fc10.x86_64 #1
RIP: 0010:[<ffffffff8134e874>] [<ffffffff8134e874>] netdev_notify+0x43/0x94
RSP: 0018:ffff8800331b5c98 EFLAGS: 00010246
RAX: ffff880033210060 RBX: ffff8800354ea800 RCX: ffffffff81537f50
RDX: ffffffff81537f00 RSI: 000000000000000a RDI: ffffffff81537f50
RBP: ffff8800331b5cc8 R08: ffff8800331b5bc8 R09: 0000000000000292
R10: ffff8800331dc280 R11: 0000000300000000 R12: 00000000fffffffb
R13: ffffffff815382e0 R14: ffff8800354ea800 R15: 000000000000000a
FS: 00007f567e10b780(0000) GS:ffffffff81677700(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00007fff8612ebc0 CR3: 0000000035968000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process udevd (pid: 1378, threadinfo ffff8800331b4000, task ffff880035f25d00)
Stack: ffff8800331b5cc8 ffffffff812ca295 0000000000000000 00000000fffffffc
 0000000000000000 00000000fffffffb ffff8800331b5d08 ffffffff813708df
 ffff8800331b5cf8 ffff8800354ea800 0000000000000000 ffff8800354ea810
Call Trace:
 [<ffffffff812ca295>] ? fib_rules_event+0x1b/0xfe
 [<ffffffff813708df>] notifier_call_chain+0x38/0x60
 [<ffffffff8105c9c4>] raw_notifier_call_chain+0x14/0x16
 [<ffffffff812bdcbf>] dev_change_name+0x1a2/0x1ce
 [<ffffffff812bdf36>] dev_ifsioc+0x24b/0x308
 [<ffffffff8136c494>] ? mutex_lock+0x27/0x38
 [<ffffffff812be507>] dev_ioctl+0x514/0x626
 [<ffffffff810111c4>] ? mcount_call+0x5/0x31
 [<ffffffff813048c3>] ? udp_ioctl+0x12/0x8b
 [<ffffffff812af681>] sock_ioctl+0x202/0x211
 [<ffffffff810d4a2f>] vfs_ioctl+0x2f/0x7d
 [<ffffffff810d4ccf>] do_vfs_ioctl+0x252/0x26f
 [<ffffffff810d4d46>] sys_ioctl+0x5a/0x7c
 [<ffffffff810113aa>] system_call_fastpath+0x16/0x1b

Code: 70 48 8b 82 00 02 00 00 48 85 c0 74 64 48 8b 00 48 85 c0 74 5c 48 8b 15 9b 96 1e 00 48 39 50 08 75 4f 48 39 98 10 03 00 00 75 04 <0f> 0b eb fe 4c 8d 65 d0 48 89 da 48 c7 c6 7d 12 4c 81 31 c0 4c
RIP [<ffffffff8134e874>] netdev_notify+0x43/0x94
 RSP <ffff8800331b5c98>
---[ end trace 789c1e6982921e6b ]---

Revision history for this message
In , John (john-redhat-bugs) wrote :

This bug has been triaged

Revision history for this message
In , Chuck (chuck-redhat-bugs) wrote :

static inline struct ieee80211_sub_if_data *
IEEE80211_DEV_TO_SUB_IF(struct net_device *dev)
{
        struct ieee80211_local *local = wdev_priv(dev->ieee80211_ptr);

        BUG_ON(!local || local->mdev == dev);

        return netdev_priv(dev);
}

Revision history for this message
In , Johannes (johannes-redhat-bugs) wrote :

I think this is an issue that was inadvertedly fixed by "mac80211: make master iface not wireless"; the code in netdev_notify() should, in 2.6.27, check if it's the master interface and if not refuse to work. I'll check out the code for 2.6.27 and post a patch for -stable.

Revision history for this message
ivro (ivro) wrote :

Hi

I have a problem that seems similar running kernel from 2.6.27.4 (when I start using Intrepid beta) to 2.6.27.6 (my current kernel). I run an ASUS A8J - is something important here, i dont know ?

My problem is :
When I start my computer for the first time a day (or after a kernel upgrade) the boot hangs after loading iwl3945. Generaly two or three more lines are written to the screen and the boot process stops here. I have to restart a second time for my computer to boot.

Second problem :
After booting I'm unable to use my wireless cards but I think this is another bug because I see :
Starting NetworkManager...
[ ... ] pan0: Droping NETIF_F_UFO since no NETIF_F_HW_CSUM feature
               [OK]
Starting NetworkManager... [fail]

I don't have log for the first problem as my computer hangs before starting the kernel log daemon.
Tell me if there is way to save the early message.
I add the dmesg when my computer success to boot but with no wireless.

Marcin Feder (marfed)
description: updated
description: updated
Revision history for this message
ivro (ivro) wrote :

Itss still the same with kernel 2.6.27-7.

I update and restart. I had to restart 3 times to boot properly. The process stop after writing on the screen some message about iwl3945.

For the second problem I see this "killswitch_getpower_reply(): Error getting killswitch power: hal-ipw-killswitch-linux returned 255" on the syslog.

Revision history for this message
ivro (ivro) wrote :

I attach my complet syslog. It will be difficult to read as I try some workaround found on #230844 and elsewhere.

Revision history for this message
Lee Lists (lists-jave) wrote :

same problem on a dell inspiron 9400 the system hang on loading iwl3945 driver
a was able to boot using kernel 2.6.24-19

Revision history for this message
Loïc Minier (lool) wrote :

I get this as well with 2.6.27-7; I have a dual boot with Debian where 2.6.27-rc9 works fine.

Revision history for this message
In , John (john-redhat-bugs) wrote :

Johannes' patch has been checked-in to the rawhide kernels...

Revision history for this message
Marcin Feder (marfed) wrote :

I have reported the issue at kernel.org - http://bugzilla.kernel.org/show_bug.cgi?id=11746 .

It seems that patch is already prepared. Unfortunately, it is not incorporated into the intrepid kernel. I can confirm that the issue persist in the 2.6.27-7.

description: updated
Revision history for this message
Loïc Minier (lool) wrote :

Would someone be so kind to merge the patch at http://marc.info/?l=linux-wireless&m=122365398314624&w=2 in the ubuntu-intrepid tree?

Amit Kucheria (amitk)
Changed in linux:
assignee: ubuntu-kernel-team → amitk
Changed in linux:
status: Unknown → Fix Committed
Revision history for this message
Amit Kucheria (amitk) wrote :

Since this is already queued for the stable 2.6.27 tree, I have pulled in this patch.

Changed in linux:
status: Triaged → Fix Committed
Revision history for this message
Loïc Minier (lool) wrote :

I pushed a fixed linux kernel to my ppa at https://launchpad.net/~lool/+archive but since I couldn't reproduce the problem 100% of the time, it'd be nice if someone experiencing this bug could confirm it goes away with these packages.

Thanks,

Revision history for this message
Loïc Minier (lool) wrote :

I got a hang in one boot out of 6; not sure whether the patches improve the situation for you folks or not.

Revision history for this message
Marcin Feder (marfed) wrote :

Loïc : It seems that your package failed to build. Could you check it?

Revision history for this message
Marcin Feder (marfed) wrote :

... sorry, I didn't notice that i386 and AMD64 are OK. Only "lpia" failed to build - whatever it is.

Revision history for this message
Marcin Feder (marfed) wrote :

Loïc : I have installed the kernel from your ppa and it solved my issue. I can modprobe iwl3945. It works and doesn't produce an error message.

Unfortunately, now I experience https://bugs.launchpad.net/intellinuxwireless/+bug/263059 . I did several reboots and I got a hang 2 times.

Revision history for this message
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package linux - 2.6.27-7.11

---------------
linux (2.6.27-7.11) intrepid; urgency=low

  [ Amit Kucheria ]

  * STABLE queue: mac80211: fix two issues in debugfs
    - LP: #275227
  * SAUCE: Adds support for COMPAL JHL90 webcam

  [ Ben Collins ]

  * SAUCE: (no-up) x86: Quiet "Kernel alive" messages
    - LP: #39985
  * SAUCE: (no-up) Modularize vesafb
  * build/config: Enable vesafb module
  * build: Switch to vesafb as preferred.

  [ Leann Ogasawara ]

  * Add Dell Dimension 9200 reboot quirk
    - LP: #271370

  [ Michael Haas ]

  * SAUCE: Revert aufs changes from AppArmor merge

  [ Tim Gardner ]

  * fix virtio udeb layout
    - LP: #257739
  * Enabled CONFIG_EXT4DEV_FS=m
  * Changed default TCP congestion algorithm to 'cubic'
    - LP: #278801
  * SAUCE: ipw2200: change default policy for auto-associate
    - LP: #264104

  [ Upstream Kernel Changes ]

  * x86, early_ioremap: fix fencepost error
    - LP: #263543

 -- Tim Gardner <email address hidden> Sat, 11 Oct 2008 08:07:42 -0600

Changed in linux:
status: Fix Committed → Fix Released
Changed in linux:
status: Unknown → Fix Released
Revision history for this message
covox (covox) wrote :

Still experiencing hangs on boot with 2.6.27-7.11.

Revision history for this message
In , Steve (steve-redhat-bugs) wrote :

Kernel -13 is the first 2.6.27 kernel to work for me. Thanks everyone. Closing.

Changed in linux:
status: Fix Committed → Fix Released
Revision history for this message
Marcin Feder (marfed) wrote :

I can confirm that installation of kernel 2.6.27-7.11 solved the bug #275227 in my case.

covox: Do you experience hangs each time you trying to boot? Do you get the same error message as reported here:
...
[ 161.964196] kernel BUG at /build/buildd/linux-2.6.27/net/mac80211/ieee80211_i.h:764!
[ 161.964284] invalid opcode: 0000 [#1] SMP
...

If not than it is probable Bug #263059 not this one.

Revision history for this message
covox (covox) wrote :

Yeah, I think you're right. Thanks for the new link :)

Revision history for this message
taiebot65 (taiebot65) wrote :

I add similar probem in ubuntu i ve just deleted the wifi configuration in /etc/network/interfaces

and set up like this

auto lo
iface lo inet loopback

and now i boot without any hang. And networkmanager work

Changed in linux (Fedora):
importance: Unknown → Medium
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.