mesa/src/amd
Rhys Perry 6dbf44ad9c ac/nir: allow less than one register of overfetch
This is to allow vectorization of 8/16-bit loads, which can later be
cheaply lowered to a 32-bit load.
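
As an illustration only (this is not the code from the commit), the rule described above can be sketched in C. The helper names `round_up_to_dword` and `overfetch_is_acceptable` are hypothetical, and the sketch assumes that "one register" means one 32-bit (4-byte) register: a merged load is accepted when the bytes fetched beyond what is actually used amount to less than 4.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical helper, not taken from Mesa: round a byte count up to
 * a whole number of 32-bit (4-byte) registers. */
static unsigned round_up_to_dword(unsigned bytes)
{
   return (bytes + 3u) & ~3u;
}

/* Assumed policy for this sketch: a vectorized load may fetch more bytes
 * than are used, as long as the unused part is smaller than one 32-bit
 * register. */
static bool overfetch_is_acceptable(unsigned used_bytes, unsigned fetched_bytes)
{
   assert(used_bytes > 0 && fetched_bytes >= used_bytes);
   unsigned overfetch = fetched_bytes - used_bytes;
   return overfetch < 4u;
}
```

Under these assumptions, three adjacent 8-bit loads (3 bytes used) could be merged and fetched as a single 32-bit load, since only 1 byte is unused, while growing a 4-byte load to 8 bytes would be rejected because it wastes a full register.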

fossil-db (gfx1201):
Totals from 178 (0.22% of 79377) affected shaders:
MaxWaves: 4138 -> 4102 (-0.87%)
Instrs: 619714 -> 617917 (-0.29%); split: -0.32%, +0.03%
CodeSize: 3364396 -> 3352724 (-0.35%); split: -0.38%, +0.03%
VGPRs: 12896 -> 12980 (+0.65%); split: -0.19%, +0.84%
SpillSGPRs: 546 -> 545 (-0.18%)
Latency: 7589585 -> 7406076 (-2.42%); split: -2.45%, +0.04%
InvThroughput: 1926356 -> 1879866 (-2.41%); split: -2.42%, +0.00%
VClause: 12301 -> 11750 (-4.48%)
SClause: 13614 -> 13583 (-0.23%); split: -0.45%, +0.22%
Copies: 82207 -> 82265 (+0.07%); split: -0.10%, +0.17%
Branches: 19284 -> 19266 (-0.09%)
PreSGPRs: 9525 -> 9457 (-0.71%)
PreVGPRs: 12366 -> 12421 (+0.44%)
VALU: 347928 -> 348020 (+0.03%); split: -0.01%, +0.04%
SALU: 82620 -> 82519 (-0.12%); split: -0.19%, +0.07%
VMEM: 22248 -> 21430 (-3.68%)
SMEM: 17951 -> 17843 (-0.60%)

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Georg Lehmann <dadschoorse@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34162>
2025-05-08 13:30:50 +00:00
addrlib      amd/addrlib: remove the DCC page fault workaround  2025-04-01 03:23:22 -04:00
ci           radv/ci: rename .test-radv to .ci-tron-test-radv  2025-05-07 21:02:16 +00:00
common       ac/nir: allow less than one register of overfetch  2025-05-08 13:30:50 +00:00
compiler     aco/gfx12: allow 8/16-bit smem loads  2025-05-08 13:30:50 +00:00
drm-shim     amd/drm-shim: add gfx1201  2025-03-10 11:21:36 +00:00
gmlib        amd/gmlib: add gmlib for radeonsi  2025-02-27 03:15:16 +00:00
llvm         ac/llvm: use mul24 intrinsics  2025-04-23 01:11:48 +00:00
registers    amd: Rename GFX1103_R1/R2 to PHOENIX/2  2024-11-20 02:14:40 +00:00
vpelib       amd/vpelib: More parameters to the segmentation process and introduce validation hook  2025-03-06 02:11:53 +00:00
vulkan       radv/gfx12: use dword3 smem loads for push constants  2025-05-08 13:30:50 +00:00
meson.build  amd/gmlib: add gmlib for radeonsi  2025-02-27 03:15:16 +00:00