author: Haiyang Zhang <haiyangz@microsoft.com> 2023-05-26 08:38:57 -0700
committer: Paolo Abeni <pabeni@redhat.com> 2023-05-30 12:05:22 +0200
commit: 1919b39fc6eabb9a6f9a51706ff6d03865f5df29
tree: 95a808f091f1e15ebc76760b171ea1814353358c /tools/testing
parent: Merge branch 'two-fixes-for-smcrv2'
net: mana: Fix perf regression: remove rx_cqes, tx_cqes counters
The apc->eth_stats.rx_cqes counter is one per NIC (vport), and it is on the frequent and parallel code path of all queues. So r/w into this single shared variable by many threads on different CPUs creates a lot of caching and memory overhead, hence the perf regression. It is also not accurate, due to the high volume of concurrent r/w.

For example, with a workload of iperf with 128 threads and RPS enabled, we saw a perf regression of 25% with the previous patch adding the counters. This patch eliminates the regression.

Since the error path of mana_poll_rx_cq() already has warnings, keeping the counter and converting it to a per-queue variable is not necessary. So, just remove this counter from this high-frequency code path.

Also, remove the tx_cqes counter for the same reason. We have warnings & other counters for errors on that path, and don't need to count every normal cqe processing.

Cc: stable@vger.kernel.org
Fixes: bd7fc6e1957c ("net: mana: Add new MANA VF performance counters for easier troubleshooting")
Signed-off-by: Haiyang Zhang <haiyangz@microsoft.com>
Reviewed-by: Horatiu Vultur <horatiu.vultur@microchip.com>
Reviewed-by: Jiri Pirko <jiri@nvidia.com>
Link: https://lore.kernel.org/r/1685115537-31675-1-git-send-email-haiyangz@microsoft.com
Signed-off-by: Paolo Abeni <pabeni@redhat.com>
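For illustration only, here is a minimal userspace sketch of the per-queue alternative that the message mentions and then rejects in favor of removing the counters outright. It is not code from the MANA driver: the struct and function names, the queue count, and the 64-byte cache line size are all assumptions. The point is that each queue increments a counter on its own cache line, so the hot path never writes to a line shared with another CPU, and the total is computed only when someone reads it.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define CACHE_LINE 64  /* assumption: 64-byte cache lines */
#define NUM_QUEUES 4   /* hypothetical queue count, for illustration */

/* The pattern the commit removes: one counter per vport, written by every
 * queue, so all CPUs contend for the same cache line. */
struct shared_stats {
    uint64_t rx_cqes;
};

/* Hypothetical alternative: one counter per queue, each aligned to its own
 * cache line so that queues on different CPUs do not share a line. */
struct queue_stats {
    _Alignas(CACHE_LINE) uint64_t rx_cqes;
};

struct vport_stats {
    struct queue_stats q[NUM_QUEUES];
};

/* Hot path: touches only the calling queue's own counter. */
static inline void queue_count_cqe(struct vport_stats *vs, size_t qid)
{
    assert(qid < NUM_QUEUES);
    vs->q[qid].rx_cqes++;
}

/* Slow path (e.g. a stats query): sum the per-queue counters on demand. */
static uint64_t vport_total_cqes(const struct vport_stats *vs)
{
    uint64_t total = 0;

    for (size_t i = 0; i < NUM_QUEUES; i++)
        total += vs->q[i].rx_cqes;
    return total;
}
```

Even this layout adds a store per completion on the fast path. That is consistent with the decision above to drop the counters, given that the error paths already emit warnings.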
Diffstat (limited to 'tools/testing')
0 files changed, 0 insertions, 0 deletions