git.baikalelectronics.ru Git - kernel.git/commit

author	Seth Forshee <sforshee@digitalocean.com>
	Tue, 26 Oct 2021 13:06:59 +0000 (08:06 -0500)
committer	Jakub Kicinski <kuba@kernel.org>
	Thu, 28 Oct 2021 00:09:26 +0000 (17:09 -0700)
commit	dbc122fd279beb7beade608b58ef5e81cea52f23
tree	459b93fe654d9db80fb62e78d2e2ac54be4204a9	tree \| snapshot
parent	9be6ad2f11c5cb68a6a7a2f7b7fb9dd7596249eb	commit \| diff

net: sch: eliminate unnecessary RCU waits in mini_qdisc_pair_swap()

Currently rcu_barrier() is used to ensure that no readers of the
inactive mini_Qdisc buffer remain before it is reused. This waits for
any pending RCU callbacks to complete, when all that is actually
required is to wait for one RCU grace period to elapse after the buffer
was made inactive. This means that using rcu_barrier() may result in
unnecessary waits.

To improve this, store the current RCU state when a buffer is made
inactive and use poll_state_synchronize_rcu() to check whether a full
grace period has elapsed before reusing it. If a full grace period has
not elapsed, wait for a grace period to elapse, and in the non-RT case
use synchronize_rcu_expedited() to hasten it.

Since this approach eliminates the RCU callback it is no longer
necessary to synchronize_rcu() in the tp_head==NULL case. However, the
RCU state should still be saved for the previously active buffer.

Before this change I would typically see mini_qdisc_pair_swap() take
tens of milliseconds to complete. After this change it typcially
finishes in less than 1 ms, and often it takes just a few microseconds.

Thanks to Paul for walking me through the options for improving this.

Cc: "Paul E. McKenney" <paulmck@kernel.org>
Signed-off-by: Seth Forshee <sforshee@digitalocean.com>
Link: https://lore.kernel.org/r/20211026130700.121189-1-seth@forshee.me
Signed-off-by: Jakub Kicinski <kuba@kernel.org>

include/net/sch_generic.h		diff \| blob \| history
net/sched/sch_generic.c		diff \| blob \| history