Two synchronize_net() calls are currently done while holding RTNL.
This is source of RTNL contention in workloads adding and deleting
many network namespaces per second, because synchronize_rcu()
and synchronize_rcu_expedited() can use 60+ ms in some cases.
For cleanup_net() use, temporarily release RTNL
while calling the last synchronize_net().
This should be safe, because devices are no longer visible
to other threads at this point.
In any case, the new netdev_lock() / netdev_unlock()
infrastructure that we are adding should allow
to fix potential issues, with a combination
of a per-device mutex and dev->reg_state awareness.
Signed-off-by: Eric Dumazet <edumazet@google.com>
Reviewed-by: Jesse Brandeburg <jbrandeburg@cloudflare.com>
Link: https://patch.msgid.link/20250114205531.967841-5-edumazet@google.com
Signed-off-by: Jakub Kicinski <kuba@kernel.org>
 
        rtnl_drop_if_cleanup_net();
        flush_all_backlogs();
        rtnl_acquire_if_cleanup_net();
+       /* TODO: move this before the prior rtnl_acquire_if_cleanup_net() */
        synchronize_net();
 
        list_for_each_entry(dev, head, unreg_list) {
 #endif
        }
 
+       rtnl_drop_if_cleanup_net();
        synchronize_net();
+       rtnl_acquire_if_cleanup_net();
 
        list_for_each_entry(dev, head, unreg_list) {
                netdev_put(dev, &dev->dev_registered_tracker);