fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 04:38:09 +02:00

Author	SHA1	Message	Date
Timur Kristóf	96d11d0f56	nir/opt_varyings: Fix assertion when deduplicating TCS outputs. When deduplicating TCS outputs, we may find outputs that aren't loaded by the shader itself. This previously hit a bad assertion. Fixes: `c66967b5cb` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12410 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34317>	2025-04-03 19:54:51 +00:00
Matt Turner	7534559f2f	nir: Return NULL, not false, from functions returning pointers Reported by clang's `-Wbool-conversion`. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34014>	2025-03-13 20:11:09 +00:00
Georg Lehmann	f595bcfe78	nir/opt_varyings: clean up nir_progress usage Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33770>	2025-02-28 14:38:14 +00:00
Alyssa Rosenzweig	9a58a8257e	treewide: Switch to nir_progress Via the Coccinelle patch at the end of the commit message, followed by sed -ie 's/progress = progress | /progress |=/g' $(git grep -l 'progress = prog') ninja -C ~/mesa/build clang-format cd ~/mesa/src/compiler/nir && clang-format -i *.c agxfmt @@ identifier prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} -return prog; +return nir_progress(prog, impl, metadata); @@ expression prog_expr, impl, metadata; @@ -if (prog_expr) { -nir_metadata_preserve(impl, metadata); -return true; -} else { -nir_metadata_preserve(impl, nir_metadata_all); -return false; -} +bool progress = prog_expr; +return nir_progress(progress, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -nir_metadata_preserve(impl, prog ? (metadata) : nir_metadata_all); -return prog; +return nir_progress(prog, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -nir_metadata_preserve(impl, prog ? (metadata) : nir_metadata_all); +nir_progress(prog, impl, metadata); @@ expression impl, metadata; @@ -nir_metadata_preserve(impl, metadata); -return true; +return nir_progress(true, impl, metadata); @@ expression impl; @@ -nir_metadata_preserve(impl, nir_metadata_all); -return false; +return nir_no_progress(impl); @@ identifier other_prog, prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} -other_prog |= prog; +other_prog = other_prog | nir_progress(prog, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +nir_progress(prog, impl, metadata); @@ identifier other_prog, prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -other_prog = true; -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +other_prog = other_prog | nir_progress(prog, impl, metadata); @@ expression prog_expr, impl, metadata; identifier prog; @@ -if (prog_expr) { -nir_metadata_preserve(impl, metadata); -prog = true; -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +bool impl_progress = prog_expr; +prog = prog | nir_progress(impl_progress, impl, metadata); @@ identifier other_prog, prog; expression impl, metadata; @@ -if (prog) { -other_prog = true; -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +other_prog = other_prog | nir_progress(prog, impl, metadata); @@ expression prog_expr, impl, metadata; identifier prog; @@ -if (prog_expr) { -prog = true; -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +bool impl_progress = prog_expr; +prog = prog | nir_progress(impl_progress, impl, metadata); @@ expression prog_expr, impl, metadata; @@ -if (prog_expr) { -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +bool impl_progress = prog_expr; +nir_progress(impl_progress, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -nir_metadata_preserve(impl, metadata); -prog = true; +prog = nir_progress(true, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} -return prog; +return nir_progress(prog, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} +nir_progress(prog, impl, metadata); @@ expression impl; @@ -nir_metadata_preserve(impl, nir_metadata_all); +nir_no_progress(impl); @@ expression impl, metadata; @@ -nir_metadata_preserve(impl, metadata); +nir_progress(true, impl, metadata); squashme! sed -ie 's/progress = progress | /progress |=/g' $(git grep -l 'progress = prog') Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33722>	2025-02-26 15:19:53 +00:00
Alyssa Rosenzweig	91872c9c51	nir: clang-format Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33722>	2025-02-26 15:19:53 +00:00
Rhys Perry	e04c0025ef	nir: add NIR_DEBUG=extended_validation This runs validation even if the pass makes no progress. It also requires all kinds of metadata before the pass to test whether it correctly preserves or invalidates them. It's disabled by default because it can be extremely slow. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33354>	2025-02-10 15:01:37 +00:00
Marek Olšák	61e289d0ca	nir/opt_varyings: handle user barycentrics This failed an assertion because the barycentric src wasn't an intrinsic. v2: also do it in backward_inter_shader_code_motion Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Daniel Schürmann	1feb733cd4	Revert "nir: add nir_clear_divergence_info, use it in nir_opt_varyings" This reverts commit `9d043e138d`. It is no longer needed. nir_convert_from_ssa() is now able to ignore divergence information. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33009>	2025-01-23 01:31:24 +00:00
Timur Kristóf	ec548fd37b	Revert "nir/opt_varyings: Add workaround for RADV mesh shader multiview." The workaround is not needed anymore, because RADV now implements the FS layer ID input as a sysval. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32641>	2025-01-02 14:07:51 +00:00
Marek Olšák	a50d069d1c	nir/opt_varyings: clear info->clip/cull_distance_array_size if relocated svga breaks if shader_info declares these, but the shader is missing the outputs. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32684>	2024-12-20 02:32:08 +00:00
Marek Olšák	9d129505b5	nir/opt_varyings: set all IO types to float to facilitate full vectorization If types differ between components of a vec4 slot, IO vectorization can't be done. This also helps drivers like d3d12 that require matching types between shaders. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32684>	2024-12-20 02:32:08 +00:00
Benjamin Lee	becb014d27	nir: treat per-view outputs as arrayed IO This is needed for implementing multiview in panvk, where the address calculation for multiview outputs is not well-represented by lowering to nir_intrinsic_store_output with a single offset. The case where a variable is both per-view and per-{vertex,primitive} is now unsupported. This would come up with drivers implementing NV_mesh_shader or using nir_lower_multiview on geometry, tessellation, or mesh shaders. No drivers currently do either of these. There was some code that attempted to handle the nested per-view case by unwrapping per-view/arrayed types twice, but it's unclear to what extent this actually worked. ANV and Turnip both rely on per-view outputs being assigned a unique driver location for each view, so I've added an option to configure that behavior rather than removing it. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31704>	2024-12-09 20:31:49 +00:00
Marek Olšák	f5a0cde125	nir/opt_varyings: fix compile failures in the disabled PRINT code linkage is a pointer, but it was used as a structure. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	dd788d0a7f	nir/opt_varyings: remove rare dead output stores after inter-shader code motion Backward inter-shader code motion left dead output stores in the producer in rare cases. Those dead stores would then make their way into drivers and hw. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	f0c4e71d58	nir/opt_varyings: fix getting deref variables for sysvals This might fix array system values. Noticed by luck. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	dcc679ab3a	nir/opt_varyings: add inter-shader code motion for uniform/UBO indexing If input_value, index, index1 or index2 is an input, here are examples of code that this commit moves from consumers to producers: * input_value * uniform_array[index] * uniform_array[index] * ubo[0].array[index] * ubo[index].var * ubo[index1].array[index2] If the array index is computed from an input, it must be flat or convergent within a primitive to be moved. If the array index is not an input, it must be a uniform expression. dEQP-GLES31.functional.shaders.opaque_type_indexing.ubo.dynamically_uniform_fragment has UBO indexing that is moved to the producer by this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	f52ae35d73	nir/opt_varyings: propagate indirect uniform/UBO loads into the next shader Uniform and UBO loads with non-constant indices are now propagated. The majority of this code implements cloning deref chains. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	c0de78f120	nir/opt_varyings: change try_move_postdominator param to nir_instr type We want more instructions to be movable, like load_deref(var, index = load_input). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	8e39e8ed4d	nir/opt_varyings: make top-level compaction code for TES, TCS, GS separate Add a separate "if" block for each and use a helper for repeated code. So much more code will be added here that keeping the TES, TCS, and GS compaction code unified would be a mess. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	d20e07dbad	nir/opt_varyings: fix max_slot for color varying compaction It should be in units of slots. This was unlikely to break anything. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	69b1853ecf	nir/opt_varyings: count the number of unused components for compaction correctly Holes due to indirectly-indexed inputs were ignored, making the compaction worse when such inputs were present alongside convergent inputs. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	1aa9fec542	nir/opt_varyings: fix compaction with sparse indirect FS inputs Without this, compaction can put inputs into vec4 slots already occupied by indirectly-accessed inputs while ignoring their interpolation qualifier, which is incorrect. All input components sharing the same vec4 slot must use interpolation qualifiers that are compatible with each other. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	b01f3cea7a	nir/opt_varyings: remove redundant conditions from a while loop Most of these conditions are repeated below with a continue statement. This just puts break at the end where all of them are false. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	c26da94b4c	nir/opt_varyings: replace options::lower_varying_from_uniform with a cost number This is a simple way for drivers to enable uniform expression propagation without having to set any callbacks for it. It replaces the old option. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32390>	2024-11-28 15:39:46 +00:00
Marek Olšák	428613b690	nir/opt_varyings: add a default callback for varying_estimate_instr_cost used when the driver doesn't set it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32390>	2024-11-28 15:39:46 +00:00
Marek Olšák	1f238f0a2e	nir/opt_varyings: always call remove_dead_varyings in init_linkage so that we don't have to do it after every init_linkage call. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32390>	2024-11-28 15:39:46 +00:00
Marek Olšák	6f0333920b	nir/opt_varyings: use a hash table to make cloning SSA faster Cloning recursively can have an exponential time complexity if we don't skip already cloned nodes. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32337>	2024-11-25 22:24:22 +00:00
Marek Olšák	899bee4af8	nir/opt_varyings: don't count the cost of the same instruction multiple times Use pass_flags to indicate whether the instruction has already been added to the total cost of the expression. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	405e9d9b74	nir/opt_varyings: implement compaction without flexible interpolation We have to honor drivers when they say that different interpolation qualifiers can't be mixed in the same vec4, indicated by nir_io_has_flexible_input_interpolation_except_flat not being set. This is a prerequisite for enabling nir_opt_varyings for all drivers. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	a7c671efc6	nir/opt_varyings: fix packing color varyings BITSET_TEST_RANGE_INSIDE_WORD uses first_bit .. last_bit, same as BITSET_RANGE, not first_bit .. size like BITFIELD_RANGE. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	f9b03cf405	nir/opt_varyings: add nir_io_compaction_rotates_color_channels This was enabled by default in nir_opt_varyings, but vc4 can't handle when shader outputs write Y but not X. Add an option for it and enable it only for the driver that benefits from it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	8518e1cfd7	nir/opt_varyings: add nir_io_always_interpolate_convergent_fs_inputs for Asahi Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	9d043e138d	nir: add nir_clear_divergence_info, use it in nir_opt_varyings nir_opt_varyings computes vertex divergence, which isn't exactly expected by any other passes. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31968>	2024-11-05 14:13:40 +00:00
Daniel Schürmann	87cb42f953	treewide: don't lower to LCSSA before calling nir_divergence_analysis() Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30787>	2024-10-24 10:06:17 +00:00
Daniel Schürmann	8d1abd4996	treewide: use nir_src_is_divergent() rather than checking the divergence of the SSA Without LCSSA, divergence between src and def might differ. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30787>	2024-10-24 10:06:17 +00:00
Georg Lehmann	dbf63a0788	nir: remove nir_op_is_derivative Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Marek Olšák	948f94b8c5	nir/opt_varyings: pack TCS inputs with cross-invocation access together Unigine Heaven has a TCS that reads pos.xyz and tescoord.w from all invocations in every invocation. By putting those two in the same vec4, AMD hw can reduce the amount of shared memory that is allocated for those inputs from 2 vec4s to 1 vec4. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31670>	2024-10-17 03:30:07 +00:00
Marek Olšák	8e93907b7c	nir/opt_varyings: assign locations of no_varying IO for TCS outputs only Skip the code for other shader stages because it doesn't do anything there. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31670>	2024-10-17 03:30:07 +00:00
Marek Olšák	9bfea3183a	nir/opt_varyings: improve convergent input handling to fix data corruption Backward inter-shader code motion can move any code into the previous shader if it only uses convergent inputs. The problem is the final input type can end up being integer or FP64, which is incompatible with the assumption that convergent inputs can always be interpolated. If such a case occurs and the type is integer or FP64, either don't do any code motion, or if the driver exposes the new flag, rewrite convergent loads to use load_input. If the new flag is supported, all convergent loads are rewritten to use load_input, and flat varyings are allowed to be classified as convergent, which means they are packed into interpolated vec4 slots if there are unused components. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Marek Olšák	b2d32ae246	nir: add nir_intrinsic_load_per_primitive_input, split from io_semantics flag Instead of having 1 bit in nir_io_semantics indicating a per-primitive FS input, add a dedicated intrinsic for it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Alyssa Rosenzweig	da752ed7c1	treewide: use nir_def_replace sometimes Two Coccinelle patches here. Didn't catch nearly as much as I would've liked but it's a start. Coccinelle patch: @@ expression intr, repl; @@ -nir_def_rewrite_uses(&intr->def, repl); -nir_instr_remove(&intr->instr); +nir_def_replace(&intr->def, repl); Coccinelle patch: @@ identifier intr; expression instr, repl; @@ nir_intrinsic_instr *intr = nir_instr_as_intrinsic(instr); ... -nir_def_rewrite_uses(&intr->def, repl); -nir_instr_remove(instr); +nir_def_replace(&intr->def, repl); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com> [broadcom] Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> [lima] Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> [etna] Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com> [r300] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29817>	2024-06-21 15:36:56 +00:00
Alyssa Rosenzweig	15257b65c6	treewide: use nir_metadata_control_flow Via Coccinelle patch: @@ @@ -nir_metadata_block_index | nir_metadata_dominance +nir_metadata_control_flow ...plus some manual fixups for call sites missed by coccinelle. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Juan A. Suarez Romero <jasuarez@igalia.com> [broadcom] Acked-by: Vasily Khoruzhick <anarsoul@gmail.com> [lima] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29745>	2024-06-17 16:28:14 -04:00
Natanael Copa	0274518615	nir/opt_varyings: reduce stack usage Avoid putting a huge struct on the stack to fix a stack overflow on musl libc. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10988 Fixes: `c66967b5cb` (nir: add nir_opt_varyings, new pass optimizing and compacting varyings) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29375>	2024-05-24 13:15:33 +00:00
Timur Kristóf	c23c5c0a07	nir/opt_varyings: Don't promote flat inputs when moving post-dominator. Promoting flat inputs should only happen while assigning FS input slot groups. Otherwise we risk adding extra input slots, which is undesirable. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29208>	2024-05-23 13:14:46 +00:00
Timur Kristóf	9dad0ced52	nir/opt_varyings: Print FS VEC4 type when debugging relocate_slot. Useful when debugging this pass. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29208>	2024-05-23 13:14:46 +00:00
Timur Kristóf	2b1031ec10	nir/opt_varyings: Add workaround for RADV mesh shader multiview. The layer output is added in ac_nir_lower_ngg which is called later than this pass; prevent deleting layer input from FS here. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28685>	2024-04-14 19:51:12 +00:00
Timur Kristóf	91dd9c35be	nir/opt_varyings: Fix relocate_slot so it doesn't mix up 32-bit and 16-bit I/O. Previously, nir_opt_varyings was unable to distinguish between a fully occupied 32-bit flat input and the low part of a 16-bit flat input, and would assign them the same slot, thereby messing up both I/O slots in the process. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28685>	2024-04-14 19:51:12 +00:00
Timur Kristóf	7e43c2d08f	nir/opt_varyings: Debug print during relocate_slot. VERY useful when debugging issues with this pass. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28685>	2024-04-14 19:51:11 +00:00
Timur Kristóf	bf2227d0d0	nir/opt_varyings: Only propagate constant MS outputs, not other uniforms. Due to how mesh shaders work, we'll need a workgroup divergence pass in order to really prove that an output is uniform. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28685>	2024-04-14 19:51:11 +00:00
Timur Kristóf	5dd1461ca4	nir/opt_varyings: Add early return when producer stage is task. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28685>	2024-04-14 19:51:11 +00:00


58 commits