Bug #1952730 “Segment updates may cause unnecessary overload” : Bugs : neutron

OpenStack Infra (hudson-openstack) wrote on 2021-11-30: Fix proposed to neutron (master)

#1

Fix proposed to branch: master
Review: https://review.opendev.org/c/openstack/neutron/+/819777

Changed in neutron:
status:	New → In Progress

Lajos Katona (lajos-katona) on 2021-11-30

tags:

added: performance segments

Oleg Bondarev (obondarev) on 2021-12-08

tags:

added: loadimpact

OpenStack Infra (hudson-openstack) wrote on 2021-12-08: Fix merged to neutron (master)

#2

Reviewed: https://review.opendev.org/c/openstack/neutron/+/819777
Committed: https://opendev.org/openstack/neutron/commit/176503e610aee16cb5799a77466579bc55129450
Submitter: "Zuul (22348)"
Branch: master

commit 176503e610aee16cb5799a77466579bc55129450
Author: Bence Romsics <email address hidden>
Date: Mon Nov 29 09:40:42 2021 +0100

Avoid writing segments to the DB repeatedly

    When:
    * the segments service plugin is enabled and
    * we have multiple rpc worker processes (as in the sum of rpc_workers
      and rpc_state_report_workers, since both kinds process agent
      state_reports) and
    * many ovs-agents report physnets,
    then rabbitmq dispatches the state_report messages between the workers
    in a round robin fashion, therefore eventually the state_reports of the
    same agent will hit all rpc workers.

    Unfortunately each worker process has its own 'reported_hosts' set to
    remember which hosts it has already seen agent reports from. But right
    after a server start, when that set is still empty, each worker will
    unconditionally write the received physnet-segment information into
    the db. This means we multiply the load on the db and rpc workers by
    a factor of the rpc worker count.

    This patch tries to reduce the load on the db by adding another early
    return before the unconditional db write.

    Change-Id: I935186b6ee95f0cae8dc05869d9742c8fb3353c3
    Closes-Bug: #1952730
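
The multiplication effect described in the commit message can be illustrated with a toy simulation. This is only a sketch and not neutron code: the names Worker, FakeDb and simulate are hypothetical, and the early-return condition used here (skip the write when the stored mapping already matches the report) is an assumption made for illustration, not necessarily the check the patch adds.

```python
from itertools import cycle


class FakeDb:
    """Hypothetical stand-in for the database; counts writes."""

    def __init__(self):
        self.mappings = {}
        self.writes = 0

    def write(self, host, physnets):
        self.mappings[host] = physnets
        self.writes += 1


class Worker:
    """Hypothetical rpc worker with its own per-process reported_hosts set."""

    def __init__(self, db, early_return):
        self.db = db
        self.early_return = early_return
        self.reported_hosts = set()  # empty right after a server start

    def handle_state_report(self, host, physnets):
        if host in self.reported_hosts:
            return  # this worker has already handled this host
        self.reported_hosts.add(host)
        # Assumed extra early return: nothing to do if the stored
        # mapping already matches what the agent reported.
        if self.early_return and self.db.mappings.get(host) == physnets:
            return
        self.db.write(host, physnets)


def simulate(worker_count, hosts, rounds, early_return):
    """Dispatch state reports round robin and return the number of db writes."""
    db = FakeDb()
    workers = [Worker(db, early_return) for _ in range(worker_count)]
    dispatch = cycle(workers)  # round robin delivery between workers
    physnets = frozenset({"physnet0"})
    for _ in range(rounds):
        for host in hosts:
            next(dispatch).handle_state_report(host, physnets)
    return db.writes


hosts = [f"host{i}" for i in range(5)]
print(simulate(4, hosts, rounds=4, early_return=False))  # 20 writes
print(simulate(4, hosts, rounds=4, early_return=True))   # 5 writes
```

With 5 hosts and 4 workers, four reporting rounds are enough for every host to reach every worker, so the write count grows from one per host to one per host and worker. In this sketch the extra check replaces the redundant writes with a dictionary lookup; in a real deployment such a check would still cost a read.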

Changed in neutron:
status:	In Progress → Fix Released

OpenStack Infra (hudson-openstack) wrote on 2021-12-08: Fix proposed to neutron (stable/xena)

#3

Fix proposed to branch: stable/xena
Review: https://review.opendev.org/c/openstack/neutron/+/821072

OpenStack Infra (hudson-openstack) wrote on 2021-12-08: Fix proposed to neutron (stable/wallaby)

#4

Fix proposed to branch: stable/wallaby
Review: https://review.opendev.org/c/openstack/neutron/+/821073

OpenStack Infra (hudson-openstack) wrote on 2021-12-08: Fix proposed to neutron (stable/victoria)

#5

Fix proposed to branch: stable/victoria
Review: https://review.opendev.org/c/openstack/neutron/+/821074

OpenStack Infra (hudson-openstack) wrote on 2021-12-17: Fix merged to neutron (stable/wallaby)

#6

Reviewed: https://review.opendev.org/c/openstack/neutron/+/821073
Committed: https://opendev.org/openstack/neutron/commit/0c909e3b55c0f4d38647fa54882f8cbfd85f662a
Submitter: "Zuul (22348)"
Branch: stable/wallaby

commit 0c909e3b55c0f4d38647fa54882f8cbfd85f662a
Author: Bence Romsics <email address hidden>
Date: Mon Nov 29 09:40:42 2021 +0100

Avoid writing segments to the DB repeatedly

    Change-Id: I935186b6ee95f0cae8dc05869d9742c8fb3353c3
    Closes-Bug: #1952730
    (cherry picked from commit 176503e610aee16cb5799a77466579bc55129450)
    (cherry picked from commit dcb372b041a97121027706ca18c616adfc07d243)

tags:

added: in-stable-wallaby

OpenStack Infra (hudson-openstack) wrote on 2021-12-20: Fix merged to neutron (stable/xena)

#7

Reviewed: https://review.opendev.org/c/openstack/neutron/+/821072
Committed: https://opendev.org/openstack/neutron/commit/dcb372b041a97121027706ca18c616adfc07d243
Submitter: "Zuul (22348)"
Branch: stable/xena

commit dcb372b041a97121027706ca18c616adfc07d243
Author: Bence Romsics <email address hidden>
Date: Mon Nov 29 09:40:42 2021 +0100

Avoid writing segments to the DB repeatedly

    Change-Id: I935186b6ee95f0cae8dc05869d9742c8fb3353c3
    Closes-Bug: #1952730
    (cherry picked from commit 176503e610aee16cb5799a77466579bc55129450)

tags:

added: in-stable-xena

OpenStack Infra (hudson-openstack) wrote on 2021-12-20: Fix merged to neutron (stable/victoria)

#8

Reviewed: https://review.opendev.org/c/openstack/neutron/+/821074
Committed: https://opendev.org/openstack/neutron/commit/ed7bfa11ded509466a8d07ebe952c21621a22c2d
Submitter: "Zuul (22348)"
Branch: stable/victoria

commit ed7bfa11ded509466a8d07ebe952c21621a22c2d
Author: Bence Romsics <email address hidden>
Date: Mon Nov 29 09:40:42 2021 +0100

Avoid writing segments to the DB repeatedly

    Change-Id: I935186b6ee95f0cae8dc05869d9742c8fb3353c3
    Closes-Bug: #1952730
    (cherry picked from commit 176503e610aee16cb5799a77466579bc55129450)
    (cherry picked from commit dcb372b041a97121027706ca18c616adfc07d243)
    (cherry picked from commit 0c909e3b55c0f4d38647fa54882f8cbfd85f662a)

tags:

added: in-stable-victoria

OpenStack Infra (hudson-openstack) wrote on 2022-01-10: Fix included in openstack/neutron 19.1.0

#9

This issue was fixed in the openstack/neutron 19.1.0 release.

OpenStack Infra (hudson-openstack) wrote on 2022-01-13: Fix included in openstack/neutron 17.3.0

#10

This issue was fixed in the openstack/neutron 17.3.0 release.

OpenStack Infra (hudson-openstack) wrote on 2022-01-13: Fix included in openstack/neutron 18.2.0

#11

This issue was fixed in the openstack/neutron 18.2.0 release.

Bernard Cafarelli (bcafarel) on 2022-01-14

tags:

added: neutron-proactive-backport-potential

OpenStack Infra (hudson-openstack) wrote on 2022-01-21: Fix proposed to neutron (stable/ussuri)

#12

Fix proposed to branch: stable/ussuri
Review: https://review.opendev.org/c/openstack/neutron/+/825734

OpenStack Infra (hudson-openstack) wrote on 2022-01-21: Fix proposed to neutron (stable/train)

#13

Fix proposed to branch: stable/train
Review: https://review.opendev.org/c/openstack/neutron/+/825735

Slawek Kaplonski (slaweq) on 2022-01-21

tags:

removed: neutron-proactive-backport-potential

OpenStack Infra (hudson-openstack) wrote on 2022-01-26: Fix merged to neutron (stable/ussuri)

#14

Reviewed: https://review.opendev.org/c/openstack/neutron/+/825734
Committed: https://opendev.org/openstack/neutron/commit/67190040113018fc955113640c2e7654a5a9cd5b
Submitter: "Zuul (22348)"
Branch: stable/ussuri

commit 67190040113018fc955113640c2e7654a5a9cd5b
Author: Bence Romsics <email address hidden>
Date: Mon Nov 29 09:40:42 2021 +0100

Avoid writing segments to the DB repeatedly

    Change-Id: I935186b6ee95f0cae8dc05869d9742c8fb3353c3
    Closes-Bug: #1952730
    (cherry picked from commit 176503e610aee16cb5799a77466579bc55129450)
    (cherry picked from commit dcb372b041a97121027706ca18c616adfc07d243)
    (cherry picked from commit 0c909e3b55c0f4d38647fa54882f8cbfd85f662a)
    (cherry picked from commit ed7bfa11ded509466a8d07ebe952c21621a22c2d)

tags:

added: in-stable-ussuri

OpenStack Infra (hudson-openstack) wrote on 2022-02-11: Fix merged to neutron (stable/train)

#15

Reviewed: https://review.opendev.org/c/openstack/neutron/+/825735
Committed: https://opendev.org/openstack/neutron/commit/7816ae3750b79a55d5e80daec9a93579d43ae94b
Submitter: "Zuul (22348)"
Branch: stable/train

commit 7816ae3750b79a55d5e80daec9a93579d43ae94b
Author: Bence Romsics <email address hidden>
Date: Mon Nov 29 09:40:42 2021 +0100

Avoid writing segments to the DB repeatedly

    Depends-On: https://review.opendev.org/c/openstack/devstack/+/828769
    Change-Id: I935186b6ee95f0cae8dc05869d9742c8fb3353c3
    Closes-Bug: #1952730
    (cherry picked from commit 176503e610aee16cb5799a77466579bc55129450)
    (cherry picked from commit dcb372b041a97121027706ca18c616adfc07d243)
    (cherry picked from commit 0c909e3b55c0f4d38647fa54882f8cbfd85f662a)
    (cherry picked from commit ed7bfa11ded509466a8d07ebe952c21621a22c2d)

tags:

added: in-stable-train

OpenStack Infra (hudson-openstack) wrote on 2022-03-10: Fix included in openstack/neutron 20.0.0.0rc1

#16

This issue was fixed in the openstack/neutron 20.0.0.0rc1 release candidate.

OpenStack Infra (hudson-openstack) wrote on 2023-10-10: Fix included in openstack/neutron train-eol

#17

This issue was fixed in the openstack/neutron train-eol release.

OpenStack Infra (hudson-openstack) wrote on 2024-01-17: Fix included in openstack/neutron ussuri-eol

#18

This issue was fixed in the openstack/neutron ussuri-eol release.
