From: Song Liu Date: Mon, 19 Sep 2022 18:42:07 +0000 (-0700) Subject: Merge branch 'md-next-raid10-optimize' into md-next X-Git-Tag: dma-mapping-6.2-2022-12-13~219^2~34^2~2 X-Git-Url: https://www.infradead.org/git/?a=commitdiff_plain;h=74173ff458de5841d7fa4c6fcc85011daa21adf5;p=users%2Fhch%2Fdma-mapping.git Merge branch 'md-next-raid10-optimize' into md-next This patchset tries to avoid that two locks are held unconditionally in hot path. Test environment: Architecture: aarch64 Huawei KUNPENG 920 x86 Intel(R) Xeon(R) Platinum 8380 Raid10 initialize: mdadm --create /dev/md0 --level 10 --bitmap none --raid-devices 4 \ /dev/nvme0n1 /dev/nvme1n1 /dev/nvme2n1 /dev/nvme3n1 Test cmd: (task set -c 0-15) fio -name=0 -ioengine=libaio -direct=1 -\ group_reporting=1 -randseed=2022 -rwmixread=70 -refill_buffers \ -filename=/dev/md0 -numjobs=16 -runtime=60s -bs=4k -iodepth=256 \ -rw=randread Test result: aarch64: before this patchset: 3.2 GiB/s bind node before this patchset: 6.9 Gib/s after this patchset: 7.9 Gib/s bind node after this patchset: 8.0 Gib/s x86:(bind node is not tested yet) before this patchset: 7.0 GiB/s after this patchset : 9.3 GiB/s Please noted that in the test machine, memory access latency is very bad across nodes compare to local node in aarch64, which is why bandwidth while bind node is much better. --- 74173ff458de5841d7fa4c6fcc85011daa21adf5