aboutsummaryrefslogtreecommitdiffstats
path: root/drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c
diff options
context:
space:
mode:
authorThomas Zimmermann <[email protected]>2022-02-23 19:38:01 +0000
committerThomas Zimmermann <[email protected]>2022-03-02 19:20:46 +0000
commit6f29e04938bf509fccfad490a74284cf158891ce (patch)
tree29ee3fde14a288d51e34cf9bdbd03702bee641fa /drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c
parentfbdev: Improve performance of sys_fillrect() (diff)
downloadkernel-6f29e04938bf509fccfad490a74284cf158891ce.tar.gz
kernel-6f29e04938bf509fccfad490a74284cf158891ce.zip
fbdev: Improve performance of sys_imageblit()
Improve the performance of sys_imageblit() by manually unrolling the inner blitting loop and moving some invariants out. The compiler failed to do this automatically. The resulting binary code was even slower than the cfb_imageblit() helper, which uses the same algorithm, but operates on I/O memory. A microbenchmark measures the average number of CPU cycles for sys_imageblit() after a stabilizing period of a few minutes (i7-4790, FullHD, simpledrm, kernel with debugging). The value for CFB is given as a reference. sys_imageblit(), new: 25934 cycles sys_imageblit(), old: 35944 cycles cfb_imageblit(): 30566 cycles In the optimized case, sys_imageblit() is now ~30% faster than before and ~20% faster than cfb_imageblit(). v2: * move switch out of inner loop (Gerd) * remove test for alignment of dst1 (Sam) Signed-off-by: Thomas Zimmermann <[email protected]> Reviewed-by: Javier Martinez Canillas <[email protected]> Acked-by: Sam Ravnborg <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]
Diffstat (limited to 'drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c')
0 files changed, 0 insertions, 0 deletions