Commit graph

10 commits

Author SHA1 Message Date
Daniel Schürmann
e60fcf0a87 nir/opt_sink: return early when trying to sink unused instructions
Fixes: 5f6c5e5b86 ('nir: don't sink instructions into loops')

Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7874>
2020-12-02 17:12:44 +00:00
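For illustration, a minimal sketch of the early return described in e60fcf0a87 above, assuming the pass looks up the instruction's SSA value with nir_instr_ssa_def(); the helper name and surrounding flow are assumptions, not the pass's actual code.

#include "nir.h"

/* Hypothetical helper: bail out when the instruction's SSA def has no uses
 * at all, since there is then no use block to sink it towards. */
static bool
try_sink_instr(nir_instr *instr)
{
   nir_ssa_def *def = nir_instr_ssa_def(instr);

   /* Early return for unused instructions; at this point in Mesa's history
    * a def keeps separate lists for regular uses and if-condition uses. */
   if (list_is_empty(&def->uses) && list_is_empty(&def->if_uses))
      return false;

   /* ... otherwise pick the best block and move the instruction there ... */
   return true;
}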
Daniel Schürmann
5f6c5e5b86 nir: don't sink instructions into loops
Repeatedly loading constants or evaluating ALU operations inside loops
doesn't seem beneficial. Keeping them outside the loop might increase
register pressure, but the tradeoff seems worth it.

Totals from 13629 (9.77% of 139517) affected shaders (RAVEN):
SGPRs: 1179481 -> 1184697 (+0.44%); split: -0.03%, +0.47%
VGPRs: 978776 -> 978732 (-0.00%); split: -0.02%, +0.02%
SpillSGPRs: 51036 -> 50943 (-0.18%); split: -1.35%, +1.17%
CodeSize: 113775020 -> 113428812 (-0.30%); split: -0.34%, +0.04%
MaxWaves: 49877 -> 49881 (+0.01%); split: +0.02%, -0.01%
Instrs: 22295979 -> 22204936 (-0.41%); split: -0.42%, +0.02%
Cycles: 1637198832 -> 1626916048 (-0.63%); split: -0.64%, +0.01%
VMEM: 2403434 -> 2507645 (+4.34%); split: +4.76%, -0.42%
SMEM: 849676 -> 834576 (-1.78%); split: +0.60%, -2.38%
VClause: 412396 -> 398139 (-3.46%); split: -3.46%, +0.01%
SClause: 810480 -> 817349 (+0.85%); split: -0.19%, +1.04%
Copies: 2188260 -> 2166716 (-0.98%); split: -1.18%, +0.19%
Branches: 761204 -> 760475 (-0.10%); split: -0.15%, +0.05%
PreSGPRs: 972892 -> 981054 (+0.84%); split: -0.05%, +0.89%
PreVGPRs: 925390 -> 925420 (+0.00%); split: -0.02%, +0.02%

Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7694>
2020-12-01 17:31:06 +01:00
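A rough sketch of the policy 5f6c5e5b86 describes, with made-up names (this is not the upstream code): the sinking decision can be gated on instruction type so that constants and ALU results are never moved into a deeper loop.

#include "nir.h"

/* Hypothetical predicate: constants and ALU results are cheap to keep live
 * but would be re-evaluated on every iteration if sunk into a loop, so only
 * other instruction types may be sunk past a loop boundary. */
static bool
may_sink_into_loop(const nir_instr *instr)
{
   return instr->type != nir_instr_type_load_const &&
          instr->type != nir_instr_type_alu;
}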
Daniel Schürmann
e4281dbecc nir: also move b2i in case of nir_move_copies
Keeping values as booleans is often more efficient in terms of register usage.
This also allows comparisons to be moved further.

Totals from affected shaders (VEGA):
SGPRS: 451608 -> 450320 (-0.29 %)
VGPRS: 351448 -> 351256 (-0.05 %)
Spilled SGPRs: 105 -> 105 (0.00 %)
Spilled VGPRs: 0 -> 0 (0.00 %)
Private memory VGPRs: 0 -> 0 (0.00 %)
Scratch size: 1008 -> 1008 (0.00 %) dwords per thread
Code Size: 26555596 -> 26551080 (-0.02 %) bytes
LDS: 10323 -> 10323 (0.00 %) blocks
Max Waves: 42850 -> 42934 (0.20 %)

Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4830>
2020-07-20 15:56:45 +00:00
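A sketch of the kind of check e4281dbecc adds (the shape is assumed, not copied from upstream): b2i conversions join mov and vecN as "copies" that the nir_move_copies option may move, while comparisons stay gated on nir_move_comparisons.

#include "nir.h"

/* Hypothetical stand-in for the ALU case of the movability check. */
static bool
can_move_alu_sketch(nir_alu_instr *alu, nir_move_options options)
{
   switch (alu->op) {
   case nir_op_mov:
   case nir_op_vec2:
   case nir_op_vec3:
   case nir_op_vec4:
   case nir_op_b2i8:
   case nir_op_b2i16:
   case nir_op_b2i32:
   case nir_op_b2i64:
      return options & nir_move_copies;
   default:
      return nir_alu_instr_is_comparison(alu) &&
             (options & nir_move_comparisons);
   }
}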
Daniel Schürmann
9300a14ffb nir: refactor nir_can_move_instr
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5622>
2020-07-07 19:24:28 +02:00
Daniel Schürmann
09d0e06c5c nir: also move vecN in case of nir_move_copies
Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5622>
2020-07-07 19:24:28 +02:00
Rhys Perry
d8e05edbd9 nir/sink,nir/move: move/sink nir_op_mov
This can uncover opportunities to move other instructions. It can increase
register usage, but that doesn't seem to happen in practice.

This optimizes a pattern of a load_per_vertex_input followed by several
moves and then a store_output in a different block.

v2: add nir_move_copies to make it optional

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Acked-by: Jason Ekstrand <jason@jlekstrand.net> (v1)
Acked-by: Rob Clark <robdclark@chromium.org>
Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2420>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2420>
2020-01-14 13:56:45 +00:00
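In the same pseudocode style used elsewhere in this log, the pattern d8e05edbd9 targets looks roughly like this (block and value names are illustrative):

block A:
    a = load_per_vertex_input()
    b = mov a
block B:
    store_output(b)

With nir_move_copies set, the mov can be sunk into block B next to the store_output, which in turn lets the load_per_vertex_input (handled by the companion commit in this series) follow it.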
Rhys Perry
04fac72ec7 nir/sink,nir/move: move/sink load_per_vertex_input
Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Acked-by: Jason Ekstrand <jason@jlekstrand.net>
Acked-by: Rob Clark <robdclark@chromium.org>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/2420>
2020-01-14 13:56:45 +00:00
Connor Abbott
5ac32b2954 nir/sink: Don't sink load_ubo to outside of its defining loop
Previously, this could have made the resource divergent in code like that
generated by nir_lower_non_uniform_access.

Fixes: da8ed68a ('nir: replace nir_move_load_const() with nir_opt_sink()')
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>
2019-10-09 17:55:25 +00:00
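The divergence hazard 5ac32b2954 fixes is easiest to see in the log's own pseudocode style; nir_lower_non_uniform_access emits roughly a waterfall loop of this shape (simplified, names illustrative):

loop {
    index = read_first_invocation(divergent_index)
    if (index == divergent_index) {
        a = load_ubo(index)
        break
    }
}
use(a)

Sinking the load_ubo() below the loop, towards use(a), would detach it from the per-iteration uniform index and make the resource access divergent again, so the pass now refuses to sink a load out of its defining loop.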
Connor Abbott
af9296b8c0 nir/sink: Rewrite loop handling logic
Previously, for code like:
loop {
    loop {
        a = load_ubo()
    }
    use(a)
}
adjust_block_for_loops() would return the block before the first loop.
Now we compute the range of allowed blocks and then walk the dominance
tree directly, guaranteeing that we always choose a block that dominates
all the uses and is dominated by the definition.

Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>
2019-10-09 17:55:25 +00:00
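A very rough sketch of the approach af9296b8c0 describes, using loop nesting depth as a simplification of the real pass's loop checks (all names here are assumptions): walk up the dominance tree from the block that dominates all uses, and stop at the first block that is not nested in more loops than the definition. Every block on that walk dominates the uses, and the walk cannot pass the defining block, so the result is also dominated by the definition.

#include "nir.h"

/* Count how many loops enclose a block by walking up the CF-node tree. */
static unsigned
block_loop_depth(nir_block *block)
{
   unsigned depth = 0;
   for (nir_cf_node *node = block->cf_node.parent; node != NULL;
        node = node->parent) {
      if (node->type == nir_cf_node_loop)
         depth++;
   }
   return depth;
}

/* Hypothetical stand-in for the rewritten loop-handling logic. */
static nir_block *
adjust_block_for_loops_sketch(nir_block *use_block, nir_block *def_block)
{
   unsigned def_depth = block_loop_depth(def_block);
   nir_block *block = use_block;

   /* def_block dominates use_block, so it lies on this imm_dom chain and
    * ends the walk at the latest. */
   while (block != def_block && block_loop_depth(block) > def_depth)
      block = block->imm_dom;

   return block;
}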
Rhys Perry
da8ed68aca nir: replace nir_move_load_const() with nir_opt_sink()
This is mostly the same as nir_move_load_const() but can also move
undef instructions, comparisons and some intrinsics (being careful with
loops).

v2: actually delete nir_move_load_const.c
v3: fix nir_opt_sink() usage in freedreno
v3: update Makefile.sources
v4: replace get_move_def with nir_can_move_instr and nir_instr_ssa_def
v4: handle if uses
v4: fix handling of nested loops
v5: re-write adjust_block_for_loops
v5: re-write setting of use_block for if uses

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Co-authored-by: Daniel Schürmann <daniel@schuermann.dev>
Reviewed-by: Eric Anholt <eric@anholt.net>
2019-08-12 22:01:30 +00:00
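For context, a hypothetical driver call site for the new pass (the option set is chosen for illustration and not taken from any particular driver):

/* Sink constants/undefs, comparisons and UBO loads towards their uses. */
NIR_PASS(progress, shader, nir_opt_sink,
         nir_move_const_undef | nir_move_comparisons | nir_move_load_ubo);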