mesa/src/amd
Timur Kristóf 81620fc7b0 aco: Enable constant exec mask based optimization on compute shaders.
We know for sure exec is initially -1 when the shader always has full subgroups.

Fossil DB stats on GFX11:
Totals from 3884 (2.88% of 134913) affected shaders:
SpillSGPRs: 1673 -> 1697 (+1.43%); split: -1.67%, +3.11%
SpillVGPRs: 2316 -> 2310 (-0.26%); split: -0.65%, +0.39%
CodeSize: 19584436 -> 19567156 (-0.09%); split: -0.13%, +0.04%
Scratch: 217088 -> 216832 (-0.12%)
Instrs: 3784596 -> 3780303 (-0.11%); split: -0.15%, +0.03%
Latency: 39971204 -> 39794967 (-0.44%); split: -0.47%, +0.03%
InvThroughput: 7885552 -> 7801247 (-1.07%); split: -1.14%, +0.07%
VClause: 74654 -> 74611 (-0.06%); split: -0.07%, +0.01%
SClause: 103139 -> 103043 (-0.09%); split: -0.13%, +0.04%
Copies: 279864 -> 281995 (+0.76%); split: -0.72%, +1.48%
Branches: 92082 -> 92084 (+0.00%); split: -0.03%, +0.03%
PreSGPRs: 155637 -> 149491 (-3.95%)

Signed-off-by: Timur Kristóf <timur.kristof@gmail.com>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20670>
2023-01-26 01:59:26 +00:00
..
addrlib meson: Enable initialized-but-unused warning for MSVC 2022-11-17 21:20:38 +00:00
ci ci: Add manual rules variations to disable irrelevant driver jobs. 2023-01-26 00:48:19 +00:00
common ac,radeonsi: move shadow regs create ib preamble function to amd common 2023-01-25 04:53:34 +00:00
compiler aco: Enable constant exec mask based optimization on compute shaders. 2023-01-26 01:59:26 +00:00
drm-shim r300: use drm_shim_override 2022-11-16 14:37:47 +00:00
llvm ac/llvm: add support for fp32 addition atomics 2023-01-17 17:39:15 +00:00
registers
vulkan radv, aco: Add uses_full_subgroups to compute shader info. 2023-01-26 01:59:26 +00:00
.clang-format radv: Add nir_foreach_variable_with_modes to .clang-format 2022-12-09 07:07:10 +00:00
meson.build meson: build radeon drm-shim also for r300 and r600 2022-11-16 14:37:47 +00:00