mesa/src/amd
Georg Lehmann b2172467d1 aco/gfx10_3: work around NSA hazard
4+ dword NSA can hang if exec becomes non-zero again directly before
the instruction.

Foz-DB Navi21:
Totals from 608 (0.74% of 82161) affected shaders:
Instrs: 945138 -> 946431 (+0.14%)
CodeSize: 5171580 -> 5176864 (+0.10%)
Latency: 13356895 -> 13357113 (+0.00%)
InvThroughput: 3043234 -> 3043236 (+0.00%); split: -0.00%, +0.00%
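
The percentages in the totals above follow directly from the raw before/after counts. A minimal sketch that recomputes them is shown here; the helper name `pct_change` is hypothetical and not part of any Mesa or Foz-DB tooling, and the numbers are copied from the report as-is:

```python
def pct_change(before: int, after: int) -> float:
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


# Values copied from the Foz-DB Navi21 totals in the commit message.
totals = {
    "Instrs": (945138, 946431),
    "CodeSize": (5171580, 5176864),
    "Latency": (13356895, 13357113),
    "InvThroughput": (3043234, 3043236),
}

for name, (before, after) in totals.items():
    print(f"{name}: {before} -> {after} ({pct_change(before, after):+.2f}%)")

# Share of shaders touched by the workaround (608 of 82161).
affected_share = 608 / 82161 * 100.0
print(f"affected: {affected_share:.2f}%")
```

Rounded to two decimals, the Latency and InvThroughput deltas are too small to register, which is why the report shows them as +0.00%.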

Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9852
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13981
Cc: mesa-stable

Reviewed-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38215>
2025-11-05 10:06:04 +00:00
addrlib      addrlib: __debugbreak only present on Windows and from intrin.h  2025-08-07 07:47:42 +00:00
ci           radv/ci: Enable video tests on navi21 and navi31  2025-11-03 08:39:59 +00:00
common       radeonsi/vcn: Fix AV1 bidir compound encode with order_hint disabled  2025-11-05 09:44:04 +00:00
compiler     aco/gfx10_3: work around NSA hazard  2025-11-05 10:06:04 +00:00
drm-shim     Revert ABI breakage "amd: Add user queue HQD count to hw_ip info"  2025-11-02 02:54:59 +00:00
gmlib        meson: Relax -Wmaybe-uninitialized errors  2025-09-16 06:16:20 +00:00
lanczoslib   meson: Relax -Wmaybe-uninitialized errors  2025-09-16 06:16:20 +00:00
llvm         nir: remove load_smem_amd  2025-10-08 08:54:11 +00:00
registers    amd: Rename GFX1103_R1/R2 to PHOENIX/2  2024-11-20 02:14:40 +00:00
vpelib       amd/vpelib: add FL capabilitie and lut container size  2025-09-22 10:37:22 +00:00
vulkan       radv/video: Fix AV1 bidir compound encode with order_hint disabled  2025-11-05 09:44:04 +00:00
meson.build  radeonsi/vpe: enhance scaling quality  2025-06-12 07:44:26 +00:00