fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 20:28:05 +02:00

Author	SHA1	Message	Date
Iago Toral Quiroga	0a04468704	nir/nir_opt_move: allow to move uniform loads Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15056>	2022-02-24 11:36:00 +00:00
Emma Anholt	d199d65c3a	nir/nir_opt_move,sink: Include load_ubo_vec4 as a load_ubo instr. We weren't doing much motion in nir-to-tgsi because we considered all our lowered load-ubos as unmovable. softpipe shader-db: total temps in shared programs: 563942 -> 563136 (-0.14%) temps in affected programs: 9833 -> 9027 (-8.20%) r300 shader-db: instructions in affected programs: 22858 -> 23575 (3.14%) temps in affected programs: 2039 -> 1813 (-11.08%) (NIR had given r300 -19% instrs for +40% temps, so this feels like a worthwhile trade back). Reivewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14138>	2021-12-11 02:12:27 +00:00
Rhys Perry	a6d92eaf4f	nir/sink,nir/move: sink/move reorderable load_ssbo Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6490>	2021-01-21 18:07:03 +00:00
Rhys Perry	870724d43b	nir/opt_sink: use common instruction removal/insertion helpers Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7880>	2020-12-03 11:00:16 +00:00
Daniel Schürmann	e60fcf0a87	nir/opt_sink: return early when trying to sink unused instructions Fixes: `5f6c5e5b86` ('nir: don't sink instructions into loops') Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7874>	2020-12-02 17:12:44 +00:00
Daniel Schürmann	5f6c5e5b86	nir: don't sink instructions into loops Repeatedly loading constants or evaluating ALU operations in loops doesn't seem beneficial. This might increase the register pressure, but the tradeoff seems worth it. Totals from 13629 (9.77% of 139517) affected shaders (RAVEN): SGPRs: 1179481 -> 1184697 (+0.44%); split: -0.03%, +0.47% VGPRs: 978776 -> 978732 (-0.00%); split: -0.02%, +0.02% SpillSGPRs: 51036 -> 50943 (-0.18%); split: -1.35%, +1.17% CodeSize: 113775020 -> 113428812 (-0.30%); split: -0.34%, +0.04% MaxWaves: 49877 -> 49881 (+0.01%); split: +0.02%, -0.01% Instrs: 22295979 -> 22204936 (-0.41%); split: -0.42%, +0.02% Cycles: 1637198832 -> 1626916048 (-0.63%); split: -0.64%, +0.01% VMEM: 2403434 -> 2507645 (+4.34%); split: +4.76%, -0.42% SMEM: 849676 -> 834576 (-1.78%); split: +0.60%, -2.38% VClause: 412396 -> 398139 (-3.46%); split: -3.46%, +0.01% SClause: 810480 -> 817349 (+0.85%); split: -0.19%, +1.04% Copies: 2188260 -> 2166716 (-0.98%); split: -1.18%, +0.19% Branches: 761204 -> 760475 (-0.10%); split: -0.15%, +0.05% PreSGPRs: 972892 -> 981054 (+0.84%); split: -0.05%, +0.89% PreVGPRs: 925390 -> 925420 (+0.00%); split: -0.02%, +0.02% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7694>	2020-12-01 17:31:06 +01:00
Daniel Schürmann	e4281dbecc	nir: also move b2i in case of nir_move_copies Booleans are often more efficient with register usage. This also allows to move comparisons further. Totals from affected shaders: (VEGA) SGPRS: 451608 -> 450320 (-0.29 %) VGPRS: 351448 -> 351256 (-0.05 %) Spilled SGPRs: 105 -> 105 (0.00 %) Spilled VGPRs: 0 -> 0 (0.00 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 1008 -> 1008 (0.00 %) dwords per thread Code Size: 26555596 -> 26551080 (-0.02 %) bytes LDS: 10323 -> 10323 (0.00 %) blocks Max Waves: 42850 -> 42934 (0.20 %) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>	2020-07-20 15:56:45 +00:00
Daniel Schürmann	9300a14ffb	nir: refactor nir_can_move_instr Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5622>	2020-07-07 19:24:28 +02:00
Daniel Schürmann	09d0e06c5c	nir: also move vecN in case of nir_move_copies Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5622>	2020-07-07 19:24:28 +02:00
Rhys Perry	d8e05edbd9	nir/sink,nir/move: move/sink nir_op_mov Can uncover opportunities to move other instructions. This can increase register usage, but that doesn't seem to actually happen. This optimizes a pattern of a load_per_vertex_input followed by several moves and then a store_output in a different block. v2: add nir_move_copies to make it optional Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> (v1) Acked-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2420> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2420>	2020-01-14 13:56:45 +00:00
Rhys Perry	04fac72ec7	nir/sink,nir/move: move/sink load_per_vertex_input Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2420>	2020-01-14 13:56:45 +00:00
Connor Abbott	5ac32b2954	nir/sink: Don't sink load_ubo to outside of its defining loop Previously, this could have made the resource divergent in code like that which is genereated by nir_lower_non_uniform_access. Fixes: `da8ed68a` ('nir: replace nir_move_load_const() with nir_opt_sink()') Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-10-09 17:55:25 +00:00
Connor Abbott	af9296b8c0	nir/sink: Rewrite loop handling logic Previously, for code like: loop { loop { a = load_ubo() } use(a) } adjust_block_for_loops() would return the block before the first loop. Now we compute the range of allowed blocks and then walk the dominance tree directly, guaranteeing directly that we always choose a block that dominates all the uses and is dominated by the definition. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-10-09 17:55:25 +00:00
Rhys Perry	da8ed68aca	nir: replace nir_move_load_const() with nir_opt_sink() This is mostly the same as nir_move_load_const() but can also move undef instructions, comparisons and some intrinsics (being careful with loops). v2: actually delete nir_move_load_const.c v3: fix nir_opt_sink() usage in freedreno v3: update Makefile.sources v4: replace get_move_def with nir_can_move_instr and nir_instr_ssa_def v4: handle if uses v4: fix handling of nested loops v5: re-write adjust_block_for_loops v5: re-write setting of use_block for if uses Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Co-authored-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-12 22:01:30 +00:00

14 commits