mesa/src/amd
Georg Lehmann 497f607c8e radv/nir/lower_cmat: vectorize GFX11 B -> ACC conversion
Foz-DB Navi31:
Totals from 7 out of 14 FSR4 shaders:
MaxWaves: 50 -> 52 (+4.00%)
Instrs: 44951 -> 44516 (-0.97%); split: -1.00%, +0.03%
CodeSize: 309176 -> 305500 (-1.19%); split: -1.23%, +0.04%
VGPRs: 1464 -> 1416 (-3.28%)
SpillVGPRs: 188 -> 92 (-51.06%)
Scratch: 24064 -> 11776 (-51.06%)
Latency: 171318 -> 163663 (-4.47%); split: -4.51%, +0.04%
InvThroughput: 178796 -> 178956 (+0.09%); split: -0.04%, +0.13%
VClause: 769 -> 730 (-5.07%); split: -6.50%, +1.43%
Copies: 3149 -> 3261 (+3.56%); split: -1.21%, +4.76%
PreVGPRs: 1607 -> 1467 (-8.71%)
VALU: 37715 -> 37744 (+0.08%); split: -0.11%, +0.18%
SALU: 754 -> 753 (-0.13%)
VMEM: 2813 -> 2621 (-6.83%)
VOPD: 1674 -> 1685 (+0.66%); split: +1.55%, -0.90%

Acked-by: Daniel Schürmann <daniel@schuermann.dev>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36115>
2025-07-16 11:46:52 +00:00
Name         Last commit                                                                 Date
addrlib      amd/addrlib: remove the DCC page fault workaround                           2025-04-01 03:23:22 -04:00
ci           radv: implement DGC IB chaining when the number of sequences is too high    2025-07-16 10:30:41 +00:00
common       ac/surface: add a flag to forbid some swizzles for surface<->memory copies  2025-07-15 09:12:13 +00:00
compiler     nir,aco: add byte_perm_amd                                                  2025-07-16 11:46:52 +00:00
drm-shim     amd: Add user queue HQD count to hw_ip info                                 2025-07-08 10:17:51 +00:00
gmlib        amd/gmlib: remove the executable bit                                        2025-06-12 07:44:27 +00:00
lanczoslib   radeonsi/vpe: enhance scaling quality                                       2025-06-12 07:44:26 +00:00
llvm         ac/llvm: rename misnamed get_memory_ptr -> get_shared_mem_ptr               2025-07-12 10:28:21 +00:00
registers    amd: Rename GFX1103_R1/R2 to PHOENIX/2                                      2024-11-20 02:14:40 +00:00
vpelib       amd/vpelib: Fix CodeQL issues Pt1                                           2025-05-16 11:33:08 +08:00
vulkan       radv/nir/lower_cmat: vectorize GFX11 B -> ACC conversion                    2025-07-16 11:46:52 +00:00
meson.build  radeonsi/vpe: enhance scaling quality                                       2025-06-12 07:44:26 +00:00