mesa/src/amd
Samuel Pitoiset 2e79234f9d ac: allow to use 64K of LDS for tessellation on GFX9+
64 KiB is the hardware limit and it is expected to work. GFX7-8 also
support 64 KiB, but Stoney used to hang with it and limiting LDS to
32 KiB was the only known workaround.
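
As a rough illustration of the policy described above, the per-generation LDS budget for tessellation could be sketched as follows. The enum, its values, and the function name are hypothetical and are not Mesa's actual identifiers. The sketch also assumes that GFX7-8 stay at 32 KiB, which the message implies but does not state outright.

```c
#include <stdint.h>

/* Hypothetical generation enum for illustration only;
 * Mesa's real enum and its values are different. */
enum sketch_gfx_level {
   SKETCH_GFX7 = 7,
   SKETCH_GFX8 = 8,
   SKETCH_GFX9 = 9,
   SKETCH_GFX10 = 10,
};

/* Returns the LDS budget, in bytes, that tessellation may use. */
static uint32_t
sketch_tess_lds_limit(enum sketch_gfx_level level)
{
   /* GFX9+: use the full 64 KiB hardware limit. */
   if (level >= SKETCH_GFX9)
      return 64 * 1024;

   /* GFX7-8: the hardware also supports 64 KiB, but 32 KiB is kept
    * here as the workaround for the historical Stoney hangs
    * (assumption: the older limit is left unchanged). */
   return 32 * 1024;
}
```

With these assumptions, `sketch_tess_lds_limit(SKETCH_GFX9)` yields 65536 and `sketch_tess_lds_limit(SKETCH_GFX8)` yields 32768.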

fossils-db (NAVI21):
Totals from 326 (0.41% of 79395) affected shaders:
MaxWaves: 6352 -> 6378 (+0.41%); split: +0.50%, -0.09%
Instrs: 232575 -> 232827 (+0.11%); split: -0.04%, +0.15%
CodeSize: 1256940 -> 1258744 (+0.14%); split: -0.04%, +0.18%
VGPRs: 17552 -> 17384 (-0.96%); split: -1.09%, +0.14%
LDS: 2828800 -> 3899392 (+37.85%)
Latency: 2937650 -> 2934667 (-0.10%); split: -0.30%, +0.20%
InvThroughput: 704214 -> 700854 (-0.48%); split: -0.51%, +0.04%
VClause: 4398 -> 4442 (+1.00%); split: -0.20%, +1.21%
SClause: 5297 -> 5292 (-0.09%); split: -0.32%, +0.23%
Copies: 14892 -> 14921 (+0.19%); split: -0.44%, +0.63%
PreVGPRs: 13294 -> 13293 (-0.01%); split: -0.06%, +0.05%
VALU: 156536 -> 156793 (+0.16%); split: -0.03%, +0.20%
SALU: 21806 -> 21795 (-0.05%)
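
The percentages in parentheses above are consistent with a plain relative change between the before and after totals. A minimal sketch of that arithmetic, assuming this is how the report derives them (the function name is hypothetical and not part of any fossils-db tooling):

```c
/* Relative change from `before` to `after`, in percent.
 * Assumes `before` is non-zero. */
static double
sketch_percent_change(double before, double after)
{
   return (after - before) * 100.0 / before;
}
```

For the LDS line, `sketch_percent_change(2828800, 3899392)` is about 37.85, and for the VGPRs line, `sketch_percent_change(17552, 17384)` is about -0.96, matching the reported figures.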

Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28015>
2024-04-23 17:20:40 +00:00
Name         Last commit                                                   Date
addrlib      amd: fix addrlib regression                                   2024-03-22 08:25:21 +00:00
ci           radeonsi/ci: update vangogh expectations after piglit uprev   2024-04-22 12:14:44 +02:00
common       ac: allow to use 64K of LDS for tessellation on GFX9+         2024-04-23 17:20:40 +00:00
compiler     aco: split vop3p results                                      2024-04-23 12:31:59 +00:00
drm-shim     amd: Use align64 instead of ALIGN for 64 bit value parameter  2024-01-03 22:02:17 +00:00
llvm         nir: add nir_intrinsic_optimization_barrier_sgpr_amd          2024-04-13 16:45:08 +00:00
registers    amd/registers: add correct gfx11.x enums for BINNING_MODE     2024-03-11 23:36:55 +00:00
vpelib       radeonsi/vpe: support vpe 1.1                                 2024-03-25 00:59:02 +00:00
vulkan       radv: rework the number of tess patches computation           2024-04-23 17:20:40 +00:00
meson.build  amd,radeonsi: add libvpe                                      2023-12-01 00:23:38 +00:00