fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 15:50:11 +01:00

Author	SHA1	Message	Date
Rhys Perry	e465ac2561	nir/lower_shader_calls: remove CF before nir_opt_if Otherwise, opt_if_simplification() can attempt to insert an inot after a jump. Fixes RADV compilation of a Cyberpunk 2077 pipeline with PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27193>	2024-01-23 19:02:03 +00:00
Rhys Perry	1afd0878e9	nir/lower_shader_calls: skip zero-sized qsort Fixes UBSan: src/compiler/nir/nir_lower_shader_calls.c:1681:7: runtime error: null pointer passed as argument 1, which is declared to never be null Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25853>	2023-10-25 17:27:47 +00:00
Alyssa Rosenzweig	c39896b17b	nir: Use getters for nir_src::parent_* First, we need to give the parent_instr field a unique name to be able to replace with a helper. We have parent_instr fields for both nir_src and nir_def, so let's rename nir_src::parent_instr in preparation for rework. This was done with a combination of sed and manual fix-ups. Then we use semantic patches plus manual fixups: @@ expression s; @@ -s->renamed_parent_instr +nir_src_parent_instr(s) @@ expression s; @@ -s.renamed_parent_instr +nir_src_parent_instr(&s) @@ expression s; @@ -s->parent_if +nir_src_parent_if(s) @@ expression s; @@ -s.renamed_parent_if +nir_src_parent_if(&s) @@ expression s; @@ -s->is_if +nir_src_is_if(s) @@ expression s; @@ -s.is_if +nir_src_is_if(&s) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24671>	2023-10-10 04:58:05 -04:00
Alyssa Rosenzweig	25cc04c59b	treewide: Use nir_before/after_impl in easy cases These open-code the same idiom as the helper. Via Coccinelle patch: @@ expression func_impl; @@ -nir_before_cf_list(&func_impl->body) +nir_before_impl(func_impl) @@ expression func_impl; @@ -nir_after_cf_list(&func_impl->body) +nir_after_impl(func_impl) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24910>	2023-08-30 19:30:58 +00:00
Alyssa Rosenzweig	011f0b0d7d	nir/lower_shader_calls: Fix warning with clang Implicit conversion warning. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24864>	2023-08-29 14:06:14 +00:00
Konstantin Seurer	a209d76722	nir/lower_shader_calls: Limit the remat chain length There is no way we will rematerialize a 40k instruction long chain and it also won't be beneficial. This improves the replay time if our CP2077 fossil by 350% when compiling only ray tracing pipelines. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24864>	2023-08-29 14:06:14 +00:00
Alyssa Rosenzweig	cda1961835	treewide: Also handle struct nir_builder form Via Coccinelle patch: @def@ typedef bool; typedef nir_builder; typedef nir_instr; typedef nir_def; identifier fn, instr, intr, x, builder, data; @@ static fn(struct nir_builder* builder, -nir_instr instr, +nir_intrinsic_instr intr, ...) { ( - if (instr->type != nir_instr_type_intrinsic) - return false; - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); \| - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); - if (instr->type != nir_instr_type_intrinsic) - return false; ) <... ( -instr->x +intr->instr.x \| -instr +&intr->instr ) ...> } @pass depends on def@ identifier def.fn; expression shader, progress; @@ ( -nir_shader_instructions_pass(shader, fn, +nir_shader_intrinsics_pass(shader, fn, ...) \| -NIR_PASS_V(shader, nir_shader_instructions_pass, fn, +NIR_PASS_V(shader, nir_shader_intrinsics_pass, fn, ...) \| -NIR_PASS(progress, shader, nir_shader_instructions_pass, fn, +NIR_PASS(progress, shader, nir_shader_intrinsics_pass, fn, ...) ) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24852>	2023-08-24 15:48:02 +00:00
Faith Ekstrand	b5d6b7c402	nir: Drop most uses if nir_instr_rewrite_src() Generated by the following semantic patch: @@ expression I, S, D; @@ -nir_instr_rewrite_src(I, S, nir_src_for_ssa(D)); +nir_src_rewrite(S, D); Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24729>	2023-08-18 01:00:15 +00:00
Faith Ekstrand	de063a1481	nir: Drop most uses of nir_instr_rewrite_src_ssa() Generated with the following semantic patch: @@ expression I, S, D; @@ -nir_instr_rewrite_src_ssa(I, S, D); +nir_src_rewrite(S, D); Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24729>	2023-08-18 01:00:15 +00:00
Faith Ekstrand	964c73e13e	nir: Drop nir_if_rewrite_condition() Use nir_src_rewrite() instead. In a couple of cases, we can even drop a switch on whether or not it's an if source. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24729>	2023-08-18 01:00:15 +00:00
Rhys Perry	afb465013f	nir/lower_shader_calls: fix align_offset I don't think this does anything at the moment, because all accesses are scalar aligned. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24350>	2023-08-16 19:11:26 +00:00
Faith Ekstrand	b64da56b1a	nir: s/nir_instr_ssa_def/nir_instr_def/ Generated by sed: sed -i -e 's/nir_instr_ssa_def/nir_instr_def/g' src/*/.h src/*/.c src/*/.cpp Suggested-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24703>	2023-08-15 17:44:27 +00:00
Faith Ekstrand	43be4129d2	nir: s/live_ssa_def/live_def/ Generated mostly with sed: sed -i -e 's/live_ssa_def/live_def/g' src/compiler/nir/nir.h src/compiler/nir/*.c Plus three fixups in various Intel drivers. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24703>	2023-08-15 17:44:27 +00:00
Faith Ekstrand	65b6ac8aa4	nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef We already renamed the type, we just need to rename the enum and the casting helper functions. Generated with sed: sed -i -e 's/nir_instr_type_ssa_undef/nir_instr_type_undef/g' src/*/.h src/*/.c src/*/.cpp sed -i -e 's/nir_instr_as_ssa_undef/nir_instr_as_undef/g' src/*/.h src/*/.c src/*/.cpp and two tiny whitespace fixups in lima. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24703>	2023-08-15 17:44:27 +00:00
Faith Ekstrand	4695bebc79	nir: Drop nir_dest Instead, we replace every use of it with nir_def. Most of this commit was generated by sed: sed -i -e 's/dest.ssa/def/g' src/*/.h src/*/.c src/*/.cpp A few manual fixups were required in lima and the nir_legacy code. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24674>	2023-08-14 21:22:53 +00:00
Faith Ekstrand	6c1d32581a	nir: Drop nir_alu_dest Instead, we replace it directly with nir_def. We could replace it with nir_dest but the next commit gets rid of that so this avoids unnecessary churn. Most of this commit was generated by sed: sed -i -e 's/dest.dest.ssa/def/g' src/*/.h src/*/.c src/*/.cpp There were a few manual fixups required in the nir_legacy.c and nir_from_ssa.c as nir_legacy_reg and nir_parallel_copy_entry both have a similar pattern. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24674>	2023-08-14 21:22:53 +00:00
Faith Ekstrand	9d81f13a75	nir: Get rid of nir_dest_num_components() We could add a nir_def_num_components() helper but we use ssa.num_components about 3x as often as nir_dest_num_components() today so that's a major Coccinelle refactor anyway and this doesn't make it much worse. Most of this commit was generated byt the following semantic patch: @@ expression D; @@ <... -nir_dest_num_components(D) +D.ssa.num_components ... Some manual fixup was needed, especially in cpp files where Coccinelle tends to give up the moment it sees any interesting C++. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24674>	2023-08-14 21:22:53 +00:00
Alyssa Rosenzweig	09d31922de	nir: Drop "SSA" from NIR language Everything is SSA now. sed -e 's/nir_ssa_def/nir_def/g' \ -e 's/nir_ssa_undef/nir_undef/g' \ -e 's/nir_ssa_scalar/nir_scalar/g' \ -e 's/nir_src_rewrite_ssa/nir_src_rewrite/g' \ -e 's/nir_gather_ssa_types/nir_gather_types/g' \ -i $(git grep -l nir \| grep -v relnotes) git mv src/compiler/nir/nir_gather_ssa_types.c \ src/compiler/nir/nir_gather_types.c ninja -C build/ clang-format cd src/compiler/nir && find .c .h -type f -exec clang-format -i \{} \; Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24585>	2023-08-12 16:44:41 -04:00
Faith Ekstrand	777d336b1f	nir: clang-format src/compiler/nir/*.[ch] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24382>	2023-08-12 19:27:28 +00:00
Alyssa Rosenzweig	42ee8a55dd	nir: Remove nir_alu_dest::write_mask Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:30 +00:00
Alyssa Rosenzweig	95e3df39c0	treewide: sed out more is_ssa Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:28 +00:00
Alyssa Rosenzweig	5fead24365	treewide: Drop is_ssa asserts We only see SSA now. Via Coccinelle patch: @@ expression x; @@ -assert(x.is_ssa); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:28 +00:00
Rhys Perry	59f24c7df8	nir/lower_shader_calls: vectorize stack access for all shaders fossil-db (gfx1100): Totals from 9 (0.01% of 133461) affected shaders: MaxWaves: 156 -> 158 (+1.28%) Instrs: 37193 -> 37324 (+0.35%) CodeSize: 191008 -> 191968 (+0.50%) VGPRs: 816 -> 804 (-1.47%) Latency: 75789 -> 75641 (-0.20%); split: -0.35%, +0.15% InvThroughput: 10475 -> 10441 (-0.32%); split: -0.40%, +0.08% VClause: 666 -> 663 (-0.45%); split: -0.75%, +0.30% SClause: 1077 -> 1076 (-0.09%) Copies: 3425 -> 3407 (-0.53%); split: -0.73%, +0.20% PreVGPRs: 770 -> 745 (-3.25%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24334>	2023-07-27 12:38:01 +00:00
Alyssa Rosenzweig	9eab1e7521	nir/lower_shader_calls: Convert to register intrinsics Yet another internal use of nir_register that gets lowered back to SSA after the pass. Easy enough to replace with intrinsic-based registers instead. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23089>	2023-07-12 01:34:27 +00:00
Alyssa Rosenzweig	36b29201fa	nir: Produce intrinsics in lower_{phis,ssa_defs}_to_regs A number of passes lower SSA partially to registers, do work that would be invalid in SSA, and then go back into SSA with nir_lower_regs_to_ssa. As a step towards replacing nir_register with intrinsics, the nir_lower_{phis,ssa_defs}_to_regs passes are changed to produce intrinsics instead of nir_registers, and their callers are updated to call nir_lower_reg_intrinsics_to_ssa instead of nir_lower_regs_to_ssa to compensate. Jointly authored with Faith. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23089>	2023-07-12 01:34:27 +00:00
Konstantin Seurer	6532751e4d	nir/lower_shader_calls: Remat derefs after shader calls This avoids spilling deref instructions by wrapping shader calls inside dummy blocks, rematerializing derefs in their use blocks and removing the dummy blocks. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22064>	2023-07-11 17:32:55 +00:00
Konstantin Seurer	574079e354	nir: Use nir_builder_at Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23883>	2023-07-03 15:21:37 +00:00
Alyssa Rosenzweig	bed2f3f8e6	nir: Rename load/store_reg -> load/store_register This frees up the shorter names for the new register-based intrinsics. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23956>	2023-06-30 18:19:51 -04:00
Alyssa Rosenzweig	190b1fdc64	nir: Convert to nir_foreach_function_impl Done by hand at each call site but going very quickly with funny Vim motions and common regexes. This is a very common idiom in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23807>	2023-06-27 22:44:04 +00:00
Alyssa Rosenzweig	069cca9d66	treewide: Remove unused builders -Wunused-variables kicks in now that it can see through the init. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	173b9ee69a	treewide: Use nir_builder_create more perl -p0e 's/nir_builder_init\(&([^,]*), /\1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	815efcdf7e	nir: Use nir_builder_create perl -p0e 's/nir_builder ([^;]);\snir_builder_init\(&\1, /nir_builder \1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Sviatoslav Peleshko	08e95f8f8e	nir/lower_shader_calls: Fix cursor if broken after nir_cf_extract() call Fixes: `e2dadda3` ("Revert "nir/lower_shader_calls: put inserted instructions into a dummy block") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8978 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22884>	2023-06-11 00:29:49 +00:00
Lionel Landwerlin	54dfc08b89	nir: add a new intrinsic to describe resources accessed on intel Intel HW has multiple ways to access resources like UBO/SSBO/images : - binding tables : a small ~240 heap of surfaces - bindless surfaces : a 64Mb heap of surfaces up to Gfx12+, 4Gb on Gfx12.5+ - surfaces : a 4Gb heap on Gfx12.5+ (mostly unused at the moment, only available through the LSC) For samplers, we have 2 options since Gfx11+ : - samplers indexed from the Dynamic State Heap (4Gb) - samplers indexed from the Bindless Sampler Heap (4Gb) Additionally our whole push constant promotion mechanism is based around binding table indices. This is problematic if you want to also promote to push constants things that would be accessed through the bindless heap. To solve this issue, we introduce a new intrinsic that will cary a block index that is not based off the binding table index nor the bindless table offset. We will also use this intrinsic to identify whether the buffer/surface index in load_ubo/load_ssbo/store_ssbo/etc... is relative to the binding table or the bindless heap. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21645>	2023-05-30 06:36:37 +00:00
Lionel Landwerlin	b8790e9808	nir/lower_shader_calls: add ability to force remat of instructions Some instruction we would like to keep around because they carry additional information in their indices. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21645>	2023-05-30 06:36:36 +00:00
Konstantin Seurer	40653f0783	nir/lower_shader_calls: Remat derefs earlier spill_ssa_defs_and_lower_shader_calls can insert phis as well which can make nir_opt_shrink_stores crash. Fixes: `200e551c` ("nir/lower_shader_calls: Remat derefs before lowering resumes") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9003 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23007>	2023-05-16 18:24:17 +00:00
Alyssa Rosenzweig	aa6bdbd54a	nir: Use nir_foreach_phi(_safe) The pattern shows up all the time open-coded. Use the macro instead. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22967>	2023-05-12 14:02:23 +00:00
Konstantin Seurer	200e551cbb	nir/lower_shader_calls: Remat derefs before lowering resumes Closes: #7923 cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20399>	2023-03-24 14:55:37 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Friedrich Vock	e20564cfdb	nir/lower_shader_calls: Remove phis after dead control flow This potentially gets rid of some more phis without sources. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19960>	2022-12-11 22:13:32 +00:00
Lionel Landwerlin	9d0560fe87	nir/lower_shader_calls: enable vectorizer We cannot fully use the vectorizer outside of this pass because once stack load/store operations have been lower to global load/store, the robustness rule applies to those as they would to application load/store. But this is all internal and we know it doesn't require out of bound checking. So doing the vectorizing here is the best solution. We just have to teach the vectorizer about our intrinsics. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20058>	2022-11-30 07:23:30 +00:00
Lionel Landwerlin	9c76cda7f0	nir/lower_shader_calls: add a pass to split load/store into scalars We'll run this pass prior to opt_load_store_vectorize to maximize the effect of the optimization. At the moment opt_load_store_vectorize is unable to pack this : store vec3 store vec3 store vec2 into this : store vec4 store vec3 If your backend can only do vec4 stores max. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20058>	2022-11-30 07:23:30 +00:00
Lionel Landwerlin	e84eab42c4	nir/lower_shader_calls: avoid moving loads into loops This is similar to what opt_gcm is doing. Moving a load inside a loop will increase memory bandwidth. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20058>	2022-11-30 07:23:30 +00:00
Lionel Landwerlin	e2dadda35f	Revert "nir/lower_shader_calls: put inserted instructions into a dummy block" This reverts commit `35d82ecf1e`. Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820>	2022-11-19 10:53:18 +00:00
Lionel Landwerlin	3686d5a312	nir/lower_shader_calls: wrap only jumps rather than entire code blocks Moving entire chunks of code into a dummy if block is causing issues in some situations. To work around the issue that we tried to fix in `35d82ecf1e` ("nir/lower_shader_calls: put inserted instructions into a dummy block") which is that we cannot cut and past a block of instruction that ends with a jump if there are more instruction behind where we're going to past. We can instead just wraps the jumps into dummy if blocks. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820>	2022-11-19 10:53:18 +00:00
Lionel Landwerlin	96d84e2a77	nir/lower_shader_calls: update metadata before validation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19820>	2022-11-19 10:53:18 +00:00
Konstantin Seurer	bdd2abe334	nir/lower_shader_calls: Get rid of any brw occurences Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19749>	2022-11-18 12:28:14 +00:00
Lionel Landwerlin	29da1c8253	nir/lower_shader_calls: run opt_cse after lower stack intrinsics In particular when using scratch_base_ptr Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16556>	2022-10-26 12:53:25 +00:00
Lionel Landwerlin	3c242e551d	nir/lower_shader_calls: move scratch loads closer to where they're needed The intel backend compiler is not dealing with the scratch loads emitted by this pass very well. There are 2 reasons for this : - all loads are at the top of the shader - the loads are global load intrinsics (cannot be differentiated from ssbo loads for example) This leads the backend to generate ridiculous amount of spills. To help a bit (actually quite a lot), we can move the scratch loads in the blocks where they're needed, using the dominance information. Quite often that also ends up moving loads in a block that might not be reached by all the lanes, so we're potentially avoiding some loads. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16556>	2022-10-26 12:53:25 +00:00
Lionel Landwerlin	5717f13dff	nir/lower_shader_calls: add a pass to sort/pack values on the stack The previous pass shrinking values stored on the stack might have left some gaps on the stack (a vec4 turned into a vec3 for instance). This pass reorders variables on the stack, by component bit size and by ssa value number. The component size is useful to pack smaller values together. The ssa value number is also important because if we have 2 calls spilling the same values, then we can avoid reemiting the spillings if the values are stored in the same location. v2: Remove unused sorting function (Konstantin) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16556>	2022-10-26 12:53:25 +00:00

1 2

73 commits