lxc

restarting container with a vlan interface results in kernel stack trace

Bug #1410852 reported by Lars Heer
18
This bug affects 2 people
Affects Status Importance Assigned to Milestone
lxc
Invalid
Undecided
Unassigned
linux (Ubuntu)
Fix Released
Undecided
Unassigned
Trusty
Fix Released
Medium
Chris J Arges

Bug Description

[Impact]
Users of VLANs with LXC containers will see kernel BUGs when starting and stopping containers.

[Test Case]
- In trusty, install lxc and vlan.
- Edit /etc/lxc/default.conf as follows:
    lxc.network.type=vlan
    lxc.network.vlan.id=10
    lxc.network.flags=up
    lxc.network.link=eth0
    lxc.aa_profile = unconfined
- Setup LXC container
    sudo lxc-create -t download -n u1 -- --dist ubuntu --release trusty --arch amd64
- Run the following until you see the failure:
    lxc-start --name u1 -d
    lxc-stop --name u1

[Fix]
commit 5bb025fae53889cc99a21058c5dd369bf8cce820
support patches:
commit 0a59f3a9fd7e2801a445682465ea0522ea497183
commit 3ee32707560955e92d30f7f6e5138cb92a3b1a7e

--

Stopping a container and starting it again results in kernel stack trace with a vlan interface.

---8<---
lxc.network.type = vlan
lxc.network.flags = up
lxc.network.link = p3p1
lxc.network.vlan.id = 2
---8<---

dpkg -l | grep lxc
ii liblxc1 1.0.6-0ubuntu0.1 amd64 Linux Containers userspace tools (library)
ii lxc 1.0.6-0ubuntu0.1 amd64 Linux Containers userspace tools
ii lxc-templates 1.0.6-0ubuntu0.1 amd64 Linux Containers userspace tools (templates)
ii python3-lxc 1.0.6-0ubuntu0.1 amd64 Linux Containers userspace tools (Python 3.x bindings)

cat /proc/cmdline
BOOT_IMAGE=/boot/vmlinuz-3.13.0-44-generic root=UUID=6d1fc9aa-735c-4e61-ae1c-0a9d2587e77b ro acpi_osi=Linux acpi=force acpi_enforce_resources=lax i915.i915_enable_rc6=1 i915.i915_enable_fbc=1 i915.lvds_downclock=1 i915.semaphores=1

Snipplet from stack trace:
---8<---
[35188.374621] WARNING: CPU: 4 PID: 3933 at /build/buildd/linux-3.13.0/fs/sysfs/dir.c:486 sysfs_warn_dup+0x86/0xa0()
[35188.374623] sysfs: cannot create duplicate filename '/devices/pci0000:00/0000:00:1c.3/0000:02:00.0/net/p3p1/upper_vlan2'
---8<---

Is deconfiguration on stop of an lxc instance not working anymore?

Revision history for this message
Lars Heer (l-heer) wrote :
Revision history for this message
Gunnlaugur Máni Hrólfsson (mani-tungli) wrote :

I have the same problem.

Network config:
--------------
lxc.network.type=vlan
lxc.network.link=eth0
lxc.network.name=eth1
lxc.network.vlan.id = 3575
lxc.network.flags=up
--------------

From stack trace:
----------------
[63371.593308] WARNING: CPU: 6 PID: 10623 at /build/buildd/linux-lts-trusty-3.13.0/fs/sysfs/dir.c:486 sysfs_warn_dup+0x91/0xb0()
[63371.593310] sysfs: cannot create duplicate filename '/devices/pci0000:00/0000:00:06.0/0000:03:00.0/0000:04:00.0/net/eth0/upper_vlan3575'
---------------

It seems to be a problem with the kernel. This is the kernel version on this host:
# uname -r
3.13.0-32-generic

I have another host with the same lxc version. But with a different kernel that doesn't have this problem:
# uname -r
3.5.0-39-generic

Revision history for this message
Lars Heer (l-heer) wrote :

Installation of kernel 3.16 (of course with boot option intel_pstate=disable) fixed the problem.

Revision history for this message
Brad Figg (brad-figg) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:

apport-collect 1410852

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
tags: added: trusty
Revision history for this message
Chris J Arges (arges) wrote :

I can easily reproduce this issue.
Tested with 3.16 mainline kernel and this did not occur. I can begin a bisect tomorrow, or test any proposals.

tags: added: bot-stop-nagging
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Chris J Arges (arges)
Changed in linux (Ubuntu):
status: Confirmed → Fix Released
Changed in linux (Ubuntu Trusty):
status: New → In Progress
importance: Undecided → Medium
assignee: nobody → Chris J Arges (arges)
Changed in lxc:
status: New → Invalid
Chris J Arges (arges)
description: updated
Stefan Bader (smb)
Changed in linux (Ubuntu Trusty):
status: In Progress → Fix Committed
Revision history for this message
Brad Figg (brad-figg) wrote :

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-trusty' to 'verification-done-trusty'.

If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you!

tags: added: verification-needed-trusty
Revision history for this message
Seth Forshee (sforshee) wrote :

Bug is easily reproducible with 3.13.0-48, cannot reproduce in 3.13.0-49. Marking as verified for trusty.

tags: added: verification-done-trusty
removed: verification-needed-trusty
Revision history for this message
Launchpad Janitor (janitor) wrote :
Download full text (5.9 KiB)

This bug was fixed in the package linux - 3.13.0-49.81

---------------
linux (3.13.0-49.81) trusty; urgency=low

  [ Kamal Mostafa ]

  * Release Tracking Bug
    - LP: #1436016

  [ Alex Hung ]

  * SAUCE: ACPI / blacklist: blacklist Win8 OSI for HP Pavilion dv6
    - LP: #1416940

  [ Andy Whitcroft ]

  * [Packaging] generate live watchdog blacklists
    - LP: #1432837

  [ Ben Widawsky ]

  * SAUCE: i915_bdw: drm/i915/bdw: enable eDRAM.
    - LP: #1430855

  [ Chris J Arges ]

  * [Config] Add ibmvfc to d-i
    - LP: #1416001

  [ Seth Forshee ]

  * [Config] updateconfigs - enable X86_UP_APIC_MSI

  [ Upstream Kernel Changes ]

  * net: add sysfs helpers for netdev_adjacent logic
    - LP: #1410852
  * net: Mark functions as static in core/dev.c
    - LP: #1410852
  * net: rename sysfs symlinks on device name change
    - LP: #1410852
  * btrfs: fix null pointer dereference in clone_fs_devices when name is
    null
    - LP: #1429804
  * cdc-acm: add sanity checks
    - LP: #1413992
  * x86: thinkpad_acpi.c: fixed spacing coding style issue
    - LP: #1417915
  * thinkpad_acpi: support new BIOS version string pattern
    - LP: #1417915
  * net: sctp: fix slab corruption from use after free on INIT collisions
    - LP: #1416506
    - CVE-2015-1421
  * ipv4: try to cache dst_entries which would cause a redirect
    - LP: #1420027
    - CVE-2015-1465
  * x86, mm/ASLR: Fix stack randomization on 64-bit systems
    - LP: #1423757
    - CVE-2015-1593
  * net: llc: use correct size for sysctl timeout entries
    - LP: #1425271
    - CVE-2015-2041
  * net: rds: use correct size for max unacked packets and bytes
    - LP: #1425274
    - CVE-2015-2042
  * Btrfs: clear compress-force when remounting with compress option
    - LP: #1434183
  * ext4: merge uninitialized extents
    - LP: #1430184
  * btrfs: filter invalid arg for btrfs resize
    - LP: #1435441
  * Bluetooth: Add firmware update for Atheros 0cf3:311f
  * Bluetooth: btusb: Add IMC Networks (Broadcom based)
  * Bluetooth: sort the list of IDs in the source code
  * Bluetooth: append new supported device to the list [0b05:17d0]
  * Bluetooth: Add support for Intel bootloader devices
  * Bluetooth: Ignore isochronous endpoints for Intel USB bootloader
  * Bluetooth: Add support for Acer [13D3:3432]
  * Bluetooth: Add support for Broadcom device of Asus Z97-DELUXE
    motherboard
  * Add a new PID/VID 0227/0930 for AR3012.
  * Bluetooth: Add support for Acer [0489:e078]
  * Bluetooth: Add USB device 04ca:3010 as Atheros AR3012
  * x86: mm: move mmap_sem unlock from mm_fault_error() to caller
  * vm: add VM_FAULT_SIGSEGV handling support
  * vm: make stack guard page errors return VM_FAULT_SIGSEGV rather than
    SIGBUS
  * spi/pxa2xx: Clear cur_chip pointer before starting next message
  * spi: dw: Fix detecting FIFO depth
  * spi: dw-mid: fix FIFO size
  * ASoC: wm8960: Fix capture sample rate from 11250 to 11025
  * regulator: core: fix race condition in regulator_put()
  * ASoC: omap-mcbsp: Correct CBM_CFS dai format configuration
  * can: c_can: end pending transmission on network stop (ifdown)
  * nfs: fix dio deadlock when O_DIRECT flag is flipped
  * NFSv4.1: Fix an Oops in nfs41_...

Read more...

Changed in linux (Ubuntu Trusty):
status: Fix Committed → Fix Released
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.