This patch is under WIP. The main goal is to eliminate rwlock from fast path.
The problem with rwlock can be observed on multi-core systems, where multi-queues NIC handles packets rate about several Mpps.
Just for example: 28 cores Xeon, Mellanox 100G mlx5en, 6Mpps inbound traffic. Executing of `tcpdump -npi mce3 host 198.18.0.1` immediately produces packets drop around 500kpps-1.5Mpps due to rwlock contention.