Race with deleting HA routers

Bug #1605546 reported by Ann Taraday
12
This bug affects 2 people
Affects Status Importance Assigned to Milestone
neutron
In Progress
Medium
LIU Yulong

Bug Description

Scale environment 3 controllers, 45 computes Mitaka, L3 HA
After running rally test create_and_delete_routers agent logs are full of errors http://paste.openstack.org/show/525987/ and on server side http://paste.openstack.org/show/525175/.

The root cause of this issue seems that deletion of router is not isolated properly, HA ports are deleted concurrently, so we have HA router proceed, although HA interfaces are deleted.

In logs we can see "Got router deleted notification for <id> router_deleted" and after that still some actions with that router, warning "Info for router <id> was not found. Performing router cleanup" and after that "Got routers updated notification :[u'<id>'] routers_updated", all this caused infinitive loop for deleting routers until l3-agent restarted.

Logs - https://drive.google.com/a/mirantis.com/folderview?id=0By7EBG7Kg4gHXzJpRUNiS1MzU2M&usp=sharing#

Tags: l3-ha
Revision history for this message
John Schwarz (jschwarz) wrote :

The trace that Ann posted describes the agent-side logs (which are dealt with in https://bugs.launchpad.net/neutron/+bug/1606844). For this reason, this bug is supposed to focus on the server-side cause.

Even though these 2 bugs describe a similar issue, we should use both to track the server-side and client-side bugs (the client-side bugs aren't limited to l3-ha).

Miguel Lavalle (minsel)
Changed in neutron:
importance: Undecided → Medium
Revision history for this message
LIU Yulong (dragon889) wrote :

OK, let's use this as the new tracker for the "infinite loop".
This bug is a successor of this: https://bugs.launchpad.net/neutron/+bug/1533441

Some related patch:
https://review.openstack.org/#/c/265672/
https://review.openstack.org/#/c/265680/

Changed in neutron:
assignee: Ann Taraday (akamyshnikova) → LIU Yulong (dragon889)
status: New → In Progress
Revision history for this message
LIU Yulong (dragon889) wrote :
Revision history for this message
LIU Yulong (dragon889) wrote :
Revision history for this message
John Schwarz (jschwarz) wrote :

A summary of the last few comments: this bug seems like a duplicate of https://bugs.launchpad.net/neutron/+bug/1533457 (the traces reported on both are identical), so this bug report can be marked as such and closed.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to neutron (master)

Related fix proposed to branch: master
Review: https://review.openstack.org/348215

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Change abandoned on neutron (master)

Change abandoned by Kevin Benton (<email address hidden>) on branch: master
Review: https://review.openstack.org/348215
Reason: This isn't necessary any more

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.