fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 22:18:18 +02:00

Author	SHA1	Message	Date
Christian Gmeiner	0158075b22	nir/opt_peephole_select: handle speculative ubo loads Some platforms may be able to speculate ubo loads safely. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8299>	2024-01-03 20:02:25 +00:00
Emma Anholt	c5712410ec	nir: Flatten ifs with discards in nir_opt_peephole_select for HW without CF. i915g and r300-r400 don't have if statements, and discards are all nir_intrinsic_discard_if. We can flatten those discards here, saving a separate GLSL pass to try to do so. i915g: GAINED: shaders/closed/xcom-enemy-unknown/413.shader_test FS rv370: GAINED: shaders/closed/xcom-enemy-unknown/12.shader_test FS GAINED: shaders/closed/xcom-enemy-unknown/122.shader_test FS GAINED: shaders/closed/xcom-enemy-unknown/132.shader_test FS GAINED: shaders/closed/xcom-enemy-unknown/145.shader_test FS GAINED: shaders/closed/xcom-enemy-unknown/146.shader_test FS GAINED: shaders/closed/xcom-enemy-unknown/19.shader_test FS GAINED: shaders/closed/xcom-enemy-unknown/413.shader_test FS GAINED: shaders/closed/xcom-enemy-unknown/415.shader_test FS Closes: #9918 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24763>	2023-10-18 01:27:04 +00:00
Alyssa Rosenzweig	c39896b17b	nir: Use getters for nir_src::parent_* First, we need to give the parent_instr field a unique name to be able to replace with a helper. We have parent_instr fields for both nir_src and nir_def, so let's rename nir_src::parent_instr in preparation for rework. This was done with a combination of sed and manual fix-ups. Then we use semantic patches plus manual fixups: @@ expression s; @@ -s->renamed_parent_instr +nir_src_parent_instr(s) @@ expression s; @@ -s.renamed_parent_instr +nir_src_parent_instr(&s) @@ expression s; @@ -s->parent_if +nir_src_parent_if(s) @@ expression s; @@ -s.renamed_parent_if +nir_src_parent_if(&s) @@ expression s; @@ -s->is_if +nir_src_is_if(s) @@ expression s; @@ -s.is_if +nir_src_is_if(&s) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24671>	2023-10-10 04:58:05 -04:00
Timothy Arceri	af1528cc15	nir: replace use of nir_src_copy() Since `03b2c34793` nir_src_copy() no longer does anything useful, it will be removed in the following patch. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24986>	2023-09-08 03:01:39 +00:00
Faith Ekstrand	b5d6b7c402	nir: Drop most uses of nir_instr_rewrite_src() Generated by the following semantic patch: @@ expression I, S, D; @@ -nir_instr_rewrite_src(I, S, nir_src_for_ssa(D)); +nir_src_rewrite(S, D); Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24729>	2023-08-18 01:00:15 +00:00
Faith Ekstrand	964c73e13e	nir: Drop nir_if_rewrite_condition() Use nir_src_rewrite() instead. In a couple of cases, we can even drop a switch on whether or not it's an if source. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24729>	2023-08-18 01:00:15 +00:00
Faith Ekstrand	65b6ac8aa4	nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef We already renamed the type, we just need to rename the enum and the casting helper functions. Generated with sed: sed -i -e 's/nir_instr_type_ssa_undef/nir_instr_type_undef/g' src/*/.h src/*/.c src/*/.cpp sed -i -e 's/nir_instr_as_ssa_undef/nir_instr_as_undef/g' src/*/.h src/*/.c src/*/.cpp and two tiny whitespace fixups in lima. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24703>	2023-08-15 17:44:27 +00:00
Faith Ekstrand	4695bebc79	nir: Drop nir_dest Instead, we replace every use of it with nir_def. Most of this commit was generated by sed: sed -i -e 's/dest.ssa/def/g' src/*/.h src/*/.c src/*/.cpp A few manual fixups were required in lima and the nir_legacy code. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24674>	2023-08-14 21:22:53 +00:00
Faith Ekstrand	6c1d32581a	nir: Drop nir_alu_dest Instead, we replace it directly with nir_def. We could replace it with nir_dest but the next commit gets rid of that so this avoids unnecessary churn. Most of this commit was generated by sed: sed -i -e 's/dest.dest.ssa/def/g' src/*/.h src/*/.c src/*/.cpp There were a few manual fixups required in the nir_legacy.c and nir_from_ssa.c as nir_legacy_reg and nir_parallel_copy_entry both have a similar pattern. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24674>	2023-08-14 21:22:53 +00:00
Faith Ekstrand	ed9affa02f	nir: Drop most instances of nir_ssa_dest_init() Generated using the following two semantic patches: @@ expression I, J, NC, BS; @@ -nir_ssa_dest_init(I, &J->dest, NC, BS); +nir_def_init(I, &J->dest.ssa, NC, BS); @@ expression I, J, NC, BS; @@ -nir_ssa_dest_init(I, &J->dest.dest, NC, BS); +nir_def_init(I, &J->dest.dest.ssa, NC, BS); Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24658>	2023-08-13 17:12:52 +00:00
Alyssa Rosenzweig	09d31922de	nir: Drop "SSA" from NIR language Everything is SSA now. sed -e 's/nir_ssa_def/nir_def/g' \ -e 's/nir_ssa_undef/nir_undef/g' \ -e 's/nir_ssa_scalar/nir_scalar/g' \ -e 's/nir_src_rewrite_ssa/nir_src_rewrite/g' \ -e 's/nir_gather_ssa_types/nir_gather_types/g' \ -i $(git grep -l nir \| grep -v relnotes) git mv src/compiler/nir/nir_gather_ssa_types.c \ src/compiler/nir/nir_gather_types.c ninja -C build/ clang-format cd src/compiler/nir && find .c .h -type f -exec clang-format -i \{} \; Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24585>	2023-08-12 16:44:41 -04:00
Faith Ekstrand	777d336b1f	nir: clang-format src/compiler/nir/*.[ch] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24382>	2023-08-12 19:27:28 +00:00
Alyssa Rosenzweig	42ee8a55dd	nir: Remove nir_alu_dest::write_mask Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:30 +00:00
Alyssa Rosenzweig	579bc1e72e	treewide: Drop some is_ssa if's Via Coccinelle patch: @@ expression x; @@ -if (!x.is_ssa) { -... -} and likewise with x->is_ssa, with invalid hunks manually filtered out. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:29 +00:00
Alyssa Rosenzweig	5fead24365	treewide: Drop is_ssa asserts We only see SSA now. Via Coccinelle patch: @@ expression x; @@ -assert(x.is_ssa); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:28 +00:00
Alyssa Rosenzweig	d559764e7c	nir: Remove nir_alu_dest::saturate Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:28 +00:00
Konstantin Seurer	574079e354	nir: Use nir_builder_at Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23883>	2023-07-03 15:21:37 +00:00
Alyssa Rosenzweig	190b1fdc64	nir: Convert to nir_foreach_function_impl Done by hand at each call site but going very quickly with funny Vim motions and common regexes. This is a very common idiom in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23807>	2023-06-27 22:44:04 +00:00
Rhys Perry	48674a1799	nir/peephole_select: allow some invocation broadcast intrinsics fossil-db (navi21): Totals from 3 (0.00% of 133428) affected shaders: Instrs: 2074 -> 2083 (+0.43%) CodeSize: 10596 -> 10692 (+0.91%) Latency: 75754 -> 75946 (+0.25%) InvThroughput: 16900 -> 16975 (+0.44%) Copies: 312 -> 309 (-0.96%) Branches: 150 -> 132 (-12.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23621>	2023-06-27 18:53:49 +00:00
Alyssa Rosenzweig	815efcdf7e	nir: Use nir_builder_create perl -p0e 's/nir_builder ([^;]*);\s*nir_builder_init\(&\1, /nir_builder \1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	01e9ee79f7	nir: Drop unused name from nir_ssa_dest_init Since `624e799cc3` ("nir: Drop nir_ssa_def::name and nir_register::name"), SSA defs don't have names, making the name argument unused. Drop it from the signature and fix the call sites. This was done with the help of the following Coccinelle semantic patch: @@ expression A, B, C, D, E; @@ -nir_ssa_dest_init(A, B, C, D, E); +nir_ssa_dest_init(A, B, C, D); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23078>	2023-05-17 23:46:16 +00:00
Alyssa Rosenzweig	aa6bdbd54a	nir: Use nir_foreach_phi(_safe) The pattern shows up all the time open-coded. Use the macro instead. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22967>	2023-05-12 14:02:23 +00:00
Alyssa Rosenzweig	7f6491b76d	nir: Combine if_uses with instruction uses Every nir_ssa_def is part of a chain of uses, implemented with doubly linked lists. That means each requires 2 * 64-bit = 16 bytes per def, which is memory intensive. Together they require 32 bytes per def. Not cool. To cut that memory use in half, we can combine the two linked lists into a single use list that contains both regular instruction uses and if-uses. To do this, we augment the nir_src with a boolean "is_if", and reimplement the abstract if-uses operations on top of that list. That boolean should fit into the padding already in nir_src so should not actually affect memory use, and in the future we sneak it into the bottom bit of a pointer. However, this creates a new inefficiency: now iterating over regular uses separate from if-uses is (nominally) more expensive. It turns out virtually every caller of nir_foreach_if_use(_safe) also calls nir_foreach_use(_safe) immediately before, so we rewrite most of the callers to instead call a new single `nir_foreach_use_including_if(_safe)` which predicates the logic based on `src->is_if`. This should mitigate the performance difference. There's a bit of churn, but this is largely a mechanical set of changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Faith Ekstrand	01275a1a95	nir: Drop a bunch of Authors tags This is what git blame is for. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22120>	2023-03-26 00:16:25 +00:00
Alyssa Rosenzweig	f4b3201244	nir/peephole_select: Allow load_preamble load_preamble is intended to be almost free (costing at most a move), and it does not have special bounds checking requirement, so it's ok to select with it. With this, drivers that use nir_opt_preamble together with a late call to peephole_select can optimize sequences like: if (x) { <uniform-on-uniform calculation> } else { <different uniform-on-uniform calculation> } to simply bcsel(x, <uniform register 0>, <uniform register 1>) rather than emitting needless control flow / branching over some moves. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20597>	2023-01-13 00:43:04 +00:00
Rhys Perry	69ba1c4d59	nir: adjust nir_src_copy signature to take a nir_instr * This is almost always a nir_instr and updating the src of a nir_if will have to work slightly differently in the future. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12910>	2022-08-30 18:21:44 +00:00
Rhys Perry	aa2d6e020b	Revert "nir: Drop the unused instr arg for src/dest copy functions." This reverts commit `c3a0184118`. Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12910>	2022-08-30 18:21:44 +00:00
Jason Ekstrand	e8acc5a7ea	nir: Add a new sample_pos_or_center system value Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14198>	2021-12-17 16:02:16 +00:00
Jason Ekstrand	956199e870	nir: s/nir_var_mem_image/nir_var_image/g We typically use nir_var_mem_* for stuff that has an explicit byte-based memory layout. Images are opaque. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13386>	2021-10-16 03:47:10 +00:00
Caio Marcelo de Oliveira Filho	de3705edb0	nir: Add nir_var_mem_image Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:55 +00:00
Emma Anholt	aed4c0b5a9	nir: Drop the unused instr arg for src/dest copy functions. Now that we don't use ralloc, we don't need this arg to get at the right ralloc ctx. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11776>	2021-09-14 17:53:06 +00:00
Lionel Landwerlin	a13e79843e	nir: prevent peephole from generating invalid NIR We can't append instructions following a return/halt instruction because the control flow helpers will modify the successor of the block containing the return/halt. And the NIR validator enforces that the return/halt must have the end of the function as successor. This tends to happen following lower_shader_calls lowering which inserts halts. This probably doesn't prevent the optimization, it'll just happen in one of the return shaders after the halt has been removed. v2: Move prev block ending check earlier in the function (Daniel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12506>	2021-08-25 11:38:21 +00:00
Jason Ekstrand	624e799cc3	nir: Drop nir_ssa_def::name and nir_register::name We say that they're for debug only but we don't really have a good policy around when to set them and when not to. In particular, nir_lower_system_values and nir_lower_vars_to_ssa which are the chief producers of SSA values which might reasonably have a name do not bother to set one. We have some names set from things like BLORP and RADV's meta shaders but AFAICT, they're setting a name more because it's there than because they actually care. Also, most things other than nir_clone and nir_serialize don't bother to try and preserve them. You can see in the diffstat of this commit exactly what passes attempt to preserve names. Notably missing from the list is opt_algebraic which is the single largest source of SSA def churn and it happily throws names away. These observations lead me to question whether or not names are actually useful at all or if they're just taking up space (8B per instruction) and wasting CPU cycles (to ralloc_strdup on the off chance we do have one). I don't think I can think of a single time in recent history where I've been debugging a shader issue and a SSA value name has been there and been useful. If anything, the few times they are there, they just throw me off because they mess up the indentation in nir_print. iris shader-db on my system gets runtime -2.07734% +/- 1.26933% (n=5) Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5439>	2021-07-08 17:34:41 +00:00
Eric Anholt	47804f53f9	nir: Do peephole select on other instructions if the limit is ~0. limit==0 is the signal for "don't peephole anything but a move that will be optimized away." limit > 0 is "up to N alu instructions may be moved out." nir-to-tgsi uses ~0 as the indicator of "No, we really need to eliminate all if instructions" on hardware like i915 that doesn't have control flow. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11329>	2021-06-18 04:30:43 +00:00
Caio Marcelo de Oliveira Filho	c8a7bd0dc8	nir: Rename WORK_GROUP (and similar) to WORKGROUP Be consistent with other usages in Vulkan and SPIR-V, and the recently added workgroup_size field. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Jason Ekstrand	117668b811	nir: Make nir_ssa_def_rewrite_uses take an SSA value This commit replaces the new_src parameter of nir_ssa_def_rewrite_uses() with an SSA def, removes nir_ssa_def_rewrite_uses_ssa(), and rewrites all the users as needed. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9383>	2021-03-08 16:59:55 +00:00
Rhys Perry	2d2decc905	nir: add sparse_residency_code_and Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	4cbdf9ec4d	nir,spirv: implement SpvOpImageSparseTexelsResident Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	95819663b7	nir: allow 5 component vectors These will be useful for sparse texture instructions and image load intrinsics. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Lionel Landwerlin	1c9488e0d1	nir: wire shading rate variables v2: Fixup comment about bits in nir_intrinsics.py v3: Use varying for primitive shading rate builtin (samuel) v4: Reorder switch alphabetically Make divergence of frag_shading_rate an option v5: Remove stage check for frag_shading_rate in divergence (Samuel) v6: s/frag_shading_rate_per_subgroup/single_frag_shading_rate_per_subgroup/ (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7795>	2020-12-01 08:20:38 +00:00
Daniel Schürmann	1c17223c02	nir/opt_peephole_select: respect selection_control when collapsing ifs Totals from 34 (0.02% of 138013) affected shaders (RAVEN): CodeSize: 625888 -> 626336 (+0.07%); split: -0.00%, +0.08% Instrs: 124121 -> 124229 (+0.09%); split: -0.00%, +0.09% Cycles: 1403072 -> 1403588 (+0.04%); split: -0.01%, +0.04% VMEM: 5308 -> 5364 (+1.06%); split: +1.07%, -0.02% Copies: 12773 -> 12838 (+0.51%); split: -0.08%, +0.59% Branches: 5758 -> 5801 (+0.75%); split: -0.21%, +0.96% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7478>	2020-11-24 08:39:35 +00:00
Daniel Schürmann	28395407eb	nir/opt_peephole_select: collapse nested IFs if applicable Single-sided nested IFs can sometimes be collapsed even if they cannot be flattened. This optimization re-uses block_check_for_allowed_instrs() to determine if it is beneficial to collapse the IFs. Additionally, it is required that the phis of the outer IF become trivial after this optimization, so that no additional bcsel instructions are added. This optimization turns if (cond1) { <allowed instruction> if (cond2) { <any code> } else { } } else { } into <allowed instruction> if (cond1 && cond2) { <any code> } else { } Totals from 17044 (12.35% of 138013) affected shaders (RAVEN): SGPRs: 1246416 -> 1246256 (-0.01%); split: -0.01%, +0.00% VGPRs: 802752 -> 802736 (-0.00%); split: -0.01%, +0.01% SpillSGPRs: 45857 -> 45850 (-0.02%); split: -0.07%, +0.05% CodeSize: 85318240 -> 85208592 (-0.13%); split: -0.15%, +0.02% Instrs: 16769049 -> 16738195 (-0.18%); split: -0.20%, +0.02% Cycles: 947328732 -> 947145796 (-0.02%); split: -0.03%, +0.01% VMEM: 7271539 -> 7274090 (+0.04%); split: +0.05%, -0.01% SMEM: 925983 -> 927374 (+0.15%); split: +0.19%, -0.04% VClause: 294334 -> 294340 (+0.00%); split: -0.00%, +0.00% SClause: 633600 -> 634048 (+0.07%); split: -0.01%, +0.08% Copies: 1589650 -> 1580573 (-0.57%); split: -0.66%, +0.09% Branches: 540830 -> 525767 (-2.79%); split: -2.79%, +0.00% PreSGPRs: 902500 -> 902415 (-0.01%); split: -0.02%, +0.01% PreVGPRs: 759992 -> 760019 (+0.00%); split: -0.00%, +0.01% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7478>	2020-11-24 08:39:35 +00:00
Daniel Schürmann	8d477baa4f	nir: allow for cheap intrinsics in nir_opt_peephole_select() Also added nir_instr_type_ssa_undef for convenience. Out of the added intrinsics, it seems that only load_helper_invocation has an effect on tested games. Totals from 446 (0.32% of 138013) affected shaders (RAVEN): SGPRs: 17600 -> 17688 (+0.50%); split: -0.09%, +0.59% VGPRs: 14140 -> 14312 (+1.22%); split: -0.03%, +1.24% CodeSize: 1157696 -> 1131208 (-2.29%) MaxWaves: 3430 -> 3427 (-0.09%) Instrs: 220402 -> 214200 (-2.81%) Cycles: 900776 -> 875752 (-2.78%) VMEM: 160894 -> 180439 (+12.15%); split: +12.19%, -0.04% SMEM: 19854 -> 20169 (+1.59%); split: +1.74%, -0.16% VClause: 3597 -> 3604 (+0.19%) SClause: 7258 -> 7248 (-0.14%); split: -0.15%, +0.01% Copies: 17060 -> 16336 (-4.24%); split: -4.44%, +0.20% Branches: 3995 -> 2518 (-36.97%) PreSGPRs: 11972 -> 12148 (+1.47%); split: -0.13%, +1.60% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2804>	2020-11-20 13:46:41 +01:00
Jason Ekstrand	9d377c01d0	nir: Make nir_deref_instr::mode a bitfield We rename it to "modes" to make it clear that it may contain more than one mode and adjust all the uses of nir_deref_instr::modes to attempt to handle multiple modes. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	5e1c42d85f	nir: Call nir_metadata_preserve on !progress Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5171>	2020-06-11 05:08:12 +00:00
Jason Ekstrand	99540edfde	nir: Treat vec8/16 as select in opt_peephole_select Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4365>	2020-03-31 00:18:05 +00:00
Alejandro Piñeiro	2865d79a33	nir/opt_peephole_select: remove unused variables To avoid "unused variable" warnings. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-12-13 17:14:58 +01:00
Ian Romanick	e342d6970b	nir/opt_peephole_select: Don't count some unary operations In many cases, fsat, fneg, fabs, ineg, and iabs will get folded into another instruction as either source or destination modifiers. Counting them as instructions means that some if-statements won't get converted to selects. For example, vec1 32 ssa_25 = flt32 ssa_0, ssa_23.x /* succs: block_1 block_2 */ if ssa_25 { block block_1: /* preds: block_0 */ vec1 32 ssa_26 = fabs ssa_24 vec1 32 ssa_27 = fneg ssa_26 vec1 32 ssa_28 = fabs ssa_20 vec1 32 ssa_29 = fneg ssa_28 vec1 32 ssa_30 = fmul ssa_27, ssa_29 vec1 32 ssa_31 = fsat ssa_30 /* succs: block_3 */ } else { block block_2: /* preds: block_0 */ /* succs: block_3 */ } block block_3: /* preds: block_1 block_2 */ block_1 isn't really 6 instructions, but it will be counted that way. Most callers of the peephole_select pass use either 1 or 8. It's very easy to blow way past either of these limits with things that are really only one or two actual instructions. I also tried some fancier things like making sure the fsat was of another SSA def from the same block, but the simple test was actually better. The i965 back-end SEL peephole pass still helps ~700 shaders in shader-db with this change. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Matt Turner <mattst88@gmail.com> All Gen6+ platforms had similar results. (Ice Lake shown) total instructions in shared programs: 14743694 -> 14738910 (-0.03%) instructions in affected programs: 156575 -> 151791 (-3.06%) helped: 1204 HURT: 0 helped stats (abs) min: 1 max: 27 x̄: 3.97 x̃: 3 helped stats (rel) min: 0.15% max: 19.57% x̄: 5.15% x̃: 4.55% 95% mean confidence interval for instructions value: -4.12 -3.82 95% mean confidence interval for instructions %-change: -5.35% -4.95% Instructions are helped. 
total cycles in shared programs: 231749141 -> 231602916 (-0.06%) cycles in affected programs: 2818975 -> 2672750 (-5.19%) helped: 876 HURT: 322 helped stats (abs) min: 2 max: 788 x̄: 180.99 x̃: 220 helped stats (rel) min: <.01% max: 43.82% x̄: 20.75% x̃: 19.44% HURT stats (abs) min: 1 max: 1188 x̄: 38.27 x̃: 20 HURT stats (rel) min: 0.09% max: 102.67% x̄: 5.17% x̃: 1.70% 95% mean confidence interval for cycles value: -130.47 -113.64 95% mean confidence interval for cycles %-change: -14.85% -12.72% Cycles are helped. total sends in shared programs: 730495 -> 730491 (<.01%) sends in affected programs: 46 -> 42 (-8.70%) helped: 2 HURT: 0 Iron Lake and GM45 had similar results. (Iron Lake shown) total instructions in shared programs: 8122757 -> 8122617 (<.01%) instructions in affected programs: 14716 -> 14576 (-0.95%) helped: 46 HURT: 1 helped stats (abs) min: 1 max: 8 x̄: 3.07 x̃: 3 helped stats (rel) min: 0.36% max: 10.00% x̄: 2.54% x̃: 1.06% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 1.59% max: 1.59% x̄: 1.59% x̃: 1.59% 95% mean confidence interval for instructions value: -3.42 -2.54 95% mean confidence interval for instructions %-change: -3.28% -1.62% Instructions are helped. total cycles in shared programs: 188510100 -> 188509780 (<.01%) cycles in affected programs: 58994 -> 58674 (-0.54%) helped: 32 HURT: 1 helped stats (abs) min: 2 max: 96 x̄: 10.06 x̃: 6 helped stats (rel) min: 0.05% max: 15.29% x̄: 1.37% x̃: 0.31% HURT stats (abs) min: 2 max: 2 x̄: 2.00 x̃: 2 HURT stats (rel) min: 0.68% max: 0.68% x̄: 0.68% x̃: 0.68% 95% mean confidence interval for cycles value: -16.34 -3.06 95% mean confidence interval for cycles %-change: -2.46% -0.15% Cycles are helped.	2019-12-02 16:46:19 -08:00
Timothy Arceri	7f106a2b5d	util: rename list_empty() to list_is_empty() This makes it clear that it's a boolean test and not an action (eg. "empty the list"). Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Jason Ekstrand	f2dc0f2872	nir: Drop imov/fmov in favor of one mov instruction The difference between imov and fmov has been a constant source of confusion in NIR for years. No one really knows why we have two or when to use one vs. the other. The real reason is that they do different things in the presence of source and destination modifiers. However, without modifiers (which many back-ends don't have), they are identical. Now that we've reworked nir_lower_to_source_mods to leave one abs/neg instruction in place rather than replacing them with imov or fmov instructions, we don't need two different instructions at all anymore. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Acked-by: Rob Clark <robdclark@chromium.org>	2019-05-24 08:38:11 -05:00
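
The nested-IF collapse described in commit 28395407eb can be illustrated in plain C. This is only a sketch of the control-flow shape, not Mesa code: `nested_form` and `collapsed_form` are hypothetical names, `x * 2u` stands in for the "<allowed instruction>" (assumed cheap and free of side effects), and `t + 1u` stands in for "<any code>".

```c
#include <stdbool.h>

/* Shape before the optimization: a single-sided IF nested inside
 * another single-sided IF. */
static unsigned nested_form(bool cond1, bool cond2, unsigned x)
{
   unsigned result = 0u;
   if (cond1) {
      unsigned t = x * 2u;   /* "<allowed instruction>" */
      if (cond2) {
         result = t + 1u;    /* "<any code>" */
      } else {
      }
   } else {
   }
   return result;
}

/* Shape after the optimization: the allowed instruction is hoisted and
 * now runs unconditionally; the two conditions are combined. */
static unsigned collapsed_form(bool cond1, bool cond2, unsigned x)
{
   unsigned result = 0u;
   unsigned t = x * 2u;      /* hoisted "<allowed instruction>" */
   if (cond1 && cond2) {
      result = t + 1u;       /* "<any code>" */
   } else {
   }
   return result;
}
```

Both functions return the same value for every combination of `cond1` and `cond2`. The hoist is only safe because the hoisted statement has no observable effect when its result is unused, which is the role the commit assigns to block_check_for_allowed_instrs().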


75 commits
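
The limit convention described in commit 47804f53f9 can be sketched as a small decision function. This is a simplified illustration under stated assumptions, not the pass's actual logic: `block_kind`, `may_flatten`, and the three block categories are hypothetical, and the real pass applies further checks that are not modelled here.

```c
#include <stdbool.h>

/* Hypothetical classification of the block that would be moved out of
 * an if statement. */
enum block_kind {
   BLOCK_ONLY_FREE_MOVES,   /* only moves expected to be optimized away */
   BLOCK_ONLY_ALU,          /* only ALU instructions */
   BLOCK_HAS_OTHER_INSTRS   /* contains something other than ALU */
};

/* Hypothetical helper mirroring the three cases in the commit message:
 *   limit == 0   only free moves may be peepholed
 *   limit  > 0   up to `limit` ALU instructions may be moved out
 *   limit == ~0  flatten regardless, for hardware without control flow
 */
static bool may_flatten(unsigned limit, enum block_kind kind,
                        unsigned alu_count)
{
   if (limit == ~0u)
      return true;
   if (kind == BLOCK_HAS_OTHER_INSTRS)
      return false;
   if (kind == BLOCK_ONLY_FREE_MOVES)
      return true;
   return alu_count <= limit;   /* with limit == 0, any counted ALU op fails */
}
```

For example, `may_flatten(8u, BLOCK_ONLY_ALU, 9u)` is false, while `may_flatten(~0u, BLOCK_HAS_OTHER_INSTRS, 100u)` is true, matching the "we really need to eliminate all if instructions" case.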