fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-27 07:58:14 +02:00

Author	SHA1	Message	Date
Rhys Perry	aa32dc704f	nir/range_analysis: fix vectorized phis and intrinsics Found by inspection. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-By: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21288>	2023-03-04 12:58:38 +00:00
Marek Olšák	b80bd58265	nir: skip nir_op_unpack_32_4x8 in nir_lower_alu_width The pass can't handle it just like the other unpack opcodes and generates invalid NIR. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19399>	2023-03-03 03:27:40 +00:00
Marek Olšák	ec38758e86	nir: return progress from nir_lower_io_to_scalar oversight? Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19399>	2023-03-03 03:27:40 +00:00
Faith Ekstrand	c11ac5e446	nir: Handle wider unaligned loads in lower_mem_access_bit_size Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	7e8a10be67	nir: Make chunk_align_offset const in lower_mem_load() This should make things more clear than changing the value from earlier in the loop. Also, rename chunk_offset to load_offset so they match. Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	eb9a56b6ca	nir: Rename nir_mem_access_size_align::align_mul to align It's a simple alignment so calling it align_mul is a bit misleading. Suggested-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	802bf1d9a6	nir: Rename align to whole_align in lower_mem_load Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	ca4d73ba36	nir: Add a combined alignment helper Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@colllabora.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	e433a7c4fa	nir: Add UBO support to nir_lower_mem_access_bit_sizes Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	116a851264	nir: Add mode filtering to lower_mem_access_bit_sizes Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	4b06b1a7c5	nir: Check against combined alignment in nir_lower_mem_access_bit_sizes Checking against align_mul is insufficient if align_offset > 0. We need to check against the combined alignment instead. Fixes: `2e2d7803c7` ("nir: Add a load/store bit size lowering pass") Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Georg Lehmann	0a3387a190	nir/lower_mediump: don't use fp16 for constants if the result is denormal Image stores are not required to preserve denorms. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21622>	2023-03-02 11:42:10 +00:00
Emma Anholt	106019a5d8	nir/split_64bit_vec3_and_vec4: Handle 64-bit matrix types. The offset handling should already work for flattening to our split vars, just need to make sure we have enough (or any!) array elements. Fixes: #7154 Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13288>	2023-03-01 16:09:25 +00:00
Georg Lehmann	aeb68c29b4	nir/opt_algebraic: add patterns for iand/ior of feq/fneu with 0 Foz-DB Navi21: Totals from 1245 (0.92% of 134913) affected shaders: VGPRs: 66232 -> 66248 (+0.02%); split: -0.01%, +0.04% CodeSize: 5874976 -> 5868168 (-0.12%); split: -0.17%, +0.05% MaxWaves: 25278 -> 25274 (-0.02%); split: +0.01%, -0.02% Instrs: 1087502 -> 1085267 (-0.21%); split: -0.21%, +0.00% Latency: 6531489 -> 6531672 (+0.00%); split: -0.04%, +0.05% InvThroughput: 1531774 -> 1532327 (+0.04%); split: -0.02%, +0.05% VClause: 22218 -> 22202 (-0.07%); split: -0.08%, +0.00% SClause: 45906 -> 45873 (-0.07%); split: -0.08%, +0.01% Copies: 64004 -> 64102 (+0.15%); split: -0.24%, +0.39% Branches: 21529 -> 21534 (+0.02%); split: -0.00%, +0.03% PreSGPRs: 51936 -> 51850 (-0.17%) PreVGPRs: 55393 -> 55398 (+0.01%); split: -0.02%, +0.03% Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21576>	2023-03-01 11:24:43 +00:00
Emma Anholt	6d52e6fd2c	nir: Port a floor->truncate algebraic opt pattern from GLSL. Prevents regression when dropping code from the GLSL optimizer. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21475>	2023-02-28 03:36:09 +00:00
Emma Anholt	ef02581590	nir: Add optimization for fdot(x, 0) -> 0. We had all these nice fdot opts to drop individual channels that were 0, but nothing handling it being entirely 0! Avoids r300g regression when dropping them from GLSL. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21475>	2023-02-28 03:36:08 +00:00
Caio Oliveira	1db7e6a261	nir: Support use_scoped_barrier in nir_lower_atomics_to_ssbo Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3339>	2023-02-27 20:24:01 +00:00
Alyssa Rosenzweig	4eabd6586b	nir/lower_blend: Don't dereference null If a dual source blend colour is never written, src1 will be null and it will be invalid to dereference it. src1 is dereferenced both for the f2fN instruction but also if a dual blend factor is used... even if the latter isn't strictly valid, segfaulting in the NIR pass seems a lot meaner than blending with zero. The referenced commit hosed Asahi, causing anything that used blending to crash. Panfrost is unaffected since it always supplies a dual colour due to our crude construction of blend shaders. Fixes: `8313016543` ("nir/lower_blend: Consume dual stores") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21544>	2023-02-27 15:47:33 +00:00
Georg Lehmann	a00b50d820	nir: change 16bit image dest folding option to per type Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21404>	2023-02-27 09:55:34 +00:00
Alyssa Rosenzweig	8058d31a25	nir: Add nir_texop_lod_bias_agx Add a new texture opcode that returns the LOD bias of the sampler. This will be used on AGX to lower sampler LOD bias to txb and friends. This needs to be a texture op (and not a new intrinsic) to handle both bindless and bindful samplers across GL and Vulkan in a uniform way. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21276>	2023-02-27 02:35:41 +00:00
Marek Olšák	0c8e7ad47e	nir: lower to fragment_mask_fetch/load_amd with EQAA correctly Fixes: `194add2c23` ("nir: lower image add lower_to_fragment_mask_load_amd option") Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21436>	2023-02-27 09:39:41 +08:00
Alyssa Rosenzweig	8313016543	nir/lower_blend: Consume dual stores Now that we're working on lowered I/O, passing in the dual source blend colour via a sideband doesn't make any sense. The primary source blend colours are implicitly passed in as the sources of store_output intrinsics; likewise, we should get dual source blend colours from their respective stores. And since dual colours are only needed by blending, we can delete the stores as we go. That means nir_lower_blend now provides an all-in-one software lowering of dual source blending with no driver support needed! It even works for 8 dual-src render targets, but I don't have a use case for that. The only tricky bit here is making sure we are robust against different orders of store_output within the exit block. In particular, if we naively lower x = ... primary color = x y = ... dual color = y we end up emitting uses of y before it has been defined, something like x = ... primary color = blend(x, y) y = ... Instead, we remove dual stores and sink blend stores to the bottom of the block, so we end up with the correct x = ... y = ... primary color = blend(x, y) lower_io_to_temporaries ensures that the stores will be in the same (exit) block, so we don't need to sink further than that ourselves. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21426>	2023-02-26 17:35:08 -05:00
Konstantin Seurer	8ae5a42990	nir: Add cull_mask_and_flags_amd intrinsic Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21470>	2023-02-25 12:07:46 +00:00
Marek Olšák	9f1e6d8f70	nir,amd: add and use nir_intrinsic_load_esgs_vertex_stride_amd This will emulate VGT_ESGS_RING_ITEMSIZE, which does the multiplication for us. It's beneficial to stop setting VGT_ESGS_RING_ITEMSIZE to reduce context rolls, and also the register will be removed in the future. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21525>	2023-02-24 21:27:24 +00:00
Faith Ekstrand	e41753cf17	nir/lower_io: Handle buffer_array_length for more address modes Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21446>	2023-02-24 20:37:10 +00:00
Roland Scheidegger	a4fa489002	lavapipe, nir: Fix wrong array index scaling in nir_collect_src_uniforms The scaling needs to be ubo * MAX_INLINABLE_UNIFORMS, not ubo * PIPE_MAX_CONSTANT_BUFFERS, otherwise accesses beyond buffer size will result for ubo >= 4 (and we'd also access the wrong values later for other non-zero ubo indices). Fixes: `a7696a4d98` ("lavapipe: Fix bad array index scale factor in lvp_inline_uniforms pass") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21506>	2023-02-24 16:13:55 +00:00
Caio Oliveira	3328714295	nir/lower_subgroups: Add option lower_rotate_to_shuffle Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19797>	2023-02-24 06:33:51 +00:00
Caio Oliveira	e40b1df432	nir: Add nir_intrinsic_rotate Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19797>	2023-02-24 06:33:51 +00:00
Karol Herbst	56a9aad401	nir/deref: don't replace casts with deref_struct if we'd lose the stride The result might be used in a deref_ptr_as_array, which requires a proper stride within lower_explicit_io. If we'd lose that information or end up with a different stride don't execute this optimization. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8289 Fixes: `b779baa9bf` ("nir/deref: fix struct wrapper casts. (v3)") Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21458>	2023-02-23 01:44:25 +00:00
Georg Lehmann	ee47cc8256	amd,nir: remove byte_permute_amd intrinsic It's unused and if we ever want to use it again we should make it an alu opcode instead. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21445>	2023-02-22 20:13:52 +00:00
Karol Herbst	6e666c6303	nir: Skip samplers and textures in lower_explicit_io We have specialized lowering passes dealing with most of that already: 1. gl_nir_lower_samplers_as_deref 2. nir_lower_samplers 3. nir_lower_cl_images If we need more than that, those passes can deal with following deref chains as well. We _might_ need to improve nir_lower_cl_images a bit for more complex kernels, but CL also doesn't allow indirect images, so we are always able to optimize the entire deref chain away. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20161>	2023-02-22 14:20:21 +00:00
Daniel Schürmann	93a47bab04	nir: simplify nir_block_cf_tree_{next\|prev} Removes some case distinction by first checking if this is the first/last block of a cf_node. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	2e394b5cc1	nir/lower_continue_targets: only repair SSA when necessary Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	7fba5abfd7	nir/lower_continue_constructs: special-case Continue Constructs with zero or one predecessors If a loop has only a single continue, the control flow is already converged and we can inline the continue construct. If a loop has no continue statement at all, the Continue Construct is unreachable and can simply be deleted. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	c20751d61d	nir: add lowering for Loop Continue Constructs This pass lowers Loop Continue Constructs to the previous solution by inserting it at the beginning of the loop: loop { if (i != 0) { continue construct } loop body } Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	312510448f	nir: create nir_push_continue() and related helpers nir_control_flow.h: void nir_loop_add_continue_construct(nir_loop loop); void nir_loop_remove_continue_construct(nir_loop loop); Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	d4b97bf3fa	nir: add Continue Construct to nir_loop The added continue_list corresponds to the SPIR-V Continue Construct and serves as a converged control-flow construct and is executed after each continue statement and before the next iteration of the loop body. Also adds validation rules for loops with Continue Construct Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Alyssa Rosenzweig	50b82ca818	nir/lower_blend,agx,panfrost: Use lowered I/O This is one step towards lowering I/O during shader preprocess rather than at variant create time, which helps mitigate shader variant jank. It's also a lot simpler. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> [v1] Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20836>	2023-02-17 23:09:19 +00:00
Alyssa Rosenzweig	acfda67b4f	nir/lower_blend: Don't handle gl_FragColor In OpenGL, FRAG_RESULT_COLOR implicitly broadcasts to every render target. Our existing lower_blend code (somewhat arbitrarily) aliases to the the first render target's format and blend settings. That said, I don't think that works if different render targets have different settings -- or blend with their different destinations -- though I don't have relevant spec text right now. The actual reason this works is that all users of this pass either call nir_lower_fragcolor first (panfrost, asahi) or don't have FRAG_RESULT_COLOR as part of their API (panvk, soon agxv). Unless/until we actually have a use case for nir_lower_blend with gl_FragColor, assert that gl_FragColor is lowered first so we don't need to worry about this imaginary case. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20836>	2023-02-17 23:09:19 +00:00
Alyssa Rosenzweig	b3f229c510	nir/lower_blend: Don't touch store->dest Stores don't have destinations, and if they did, it would be invalid to change their ssa_def's num_components without also changing the SSA def. Remove the nonsensical (but harmless) assignment. This fixes `25249e8be2` ("nir/lower_blend: Expand or shrink output variables as needed"), but as the bug is harmless in practice, it does not need to be backported. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20836>	2023-02-17 23:09:19 +00:00
Alyssa Rosenzweig	1b6607fa13	nir: Augment raw_output_pan with IO_SEMANTICS+BASE This is a form of lowered I/O, it needs I/O semantics so we can know the location to store to instead of passing via a sideband. Over in !20906, we will use the BASE to lower blend shader with multisampling in NIR instead of passing the number of samples and framebuffer format along a sideband to the Midgard compiler. That's not needed for this series (this patch was cherry-picked to avoid regressions in the lower_blend changes) but it's good to model the full form of the I/O lowered intrinsic here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20836>	2023-02-17 23:09:19 +00:00
Ian Romanick	862b5b7d01	nir/loop_analyze: Simplify some logic in compute_induction_information This part now looks more like it did before `0b9639c35d`. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	9461cc4424	nir/loop_analyze: Track induction variables with uniform initializer Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	4edf1cdd3d	nir/loop_analyze: Eliminate nir_basic_induction_var No longer used. All of the information that was previously track here is tracked directly in nir_loop_variable... and, technically speaking, has been tracked there ever since `0b9639c35d`. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	e444ed9210	nir/loop_analyze: Use nir_loop_variable::init_src instead of nir_basic_induction_var::def_outside_loop These track the same information in a slightly different way. Since nir_loop_variable::init_src is visible outside this module, it cannot be eliminated. As an intentional side effect, induction variables with constant initializers will now have their nir_loop_induction_variable::init_src field point to the load_const source. Previously this pointer would be NULL. v2: Update unit tests and commit message. Remove the now unused ind_var variable in find_trip_count. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	72e763650c	nir/loop_analyze: Use nir_loop_variable::update_src instead of nir_basic_induction_var::alu These track the same information in a slightly different way. Since nir_loop_variable::update_src is visible outside this module, it cannot be eliminated. This leads to some nice simplification in find_trip_count. Previously this code only had access to the ALU instruction that performs the increment. It had to "search" the parameters to determine which (if any) was the constant. With this change, this code has access to the nir_alu_src of the ALU instruction that performs the increment. It no longer needs to search the parameters for the constant. It's either the supplied nir_alu_src or nothing. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	1bc43c0778	nir/loop_analyze: Track induction variables with uniform increments As an intentional side effect, induction variables with constant increments will now have their nir_loop_induction_variable::update_src field point to the load_const source. Previously this pointer would be NULL. v2: Update unit tests and commit message. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	c26d356dd5	nir/tests: Add tests for nir_loop_info::induction_vars tracking Later commits in this MR will change the way some data is track, and these tests will verify this behavior change. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	168e54f7e3	nir/tests: Add tests for "inverted" loops A couple basic tests for loops with the exit condition after the increment. In compiler literature, the optimization that moves the exit condition from the top to the bottom is called "loop inversion." v2: Pass parameters to loop_builder_invert using a struct. Add a comment describing the loop being constructed to loop_builder_invert. Both suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00

... 34 35 36 37 38 ...

5961 commits