fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 11:48:05 +02:00

Author	SHA1	Message	Date
Ian Romanick	862b5b7d01	nir/loop_analyze: Simplify some logic in compute_induction_information This part now looks more like it did before `0b9639c35d`. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	9461cc4424	nir/loop_analyze: Track induction variables with uniform initializer Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	4edf1cdd3d	nir/loop_analyze: Eliminate nir_basic_induction_var No longer used. All of the information that was previously track here is tracked directly in nir_loop_variable... and, technically speaking, has been tracked there ever since `0b9639c35d`. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	e444ed9210	nir/loop_analyze: Use nir_loop_variable::init_src instead of nir_basic_induction_var::def_outside_loop These track the same information in a slightly different way. Since nir_loop_variable::init_src is visible outside this module, it cannot be eliminated. As an intentional side effect, induction variables with constant initializers will now have their nir_loop_induction_variable::init_src field point to the load_const source. Previously this pointer would be NULL. v2: Update unit tests and commit message. Remove the now unused ind_var variable in find_trip_count. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	72e763650c	nir/loop_analyze: Use nir_loop_variable::update_src instead of nir_basic_induction_var::alu These track the same information in a slightly different way. Since nir_loop_variable::update_src is visible outside this module, it cannot be eliminated. This leads to some nice simplification in find_trip_count. Previously this code only had access to the ALU instruction that performs the increment. It had to "search" the parameters to determine which (if any) was the constant. With this change, this code has access to the nir_alu_src of the ALU instruction that performs the increment. It no longer needs to search the parameters for the constant. It's either the supplied nir_alu_src or nothing. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	1bc43c0778	nir/loop_analyze: Track induction variables with uniform increments As an intentional side effect, induction variables with constant increments will now have their nir_loop_induction_variable::update_src field point to the load_const source. Previously this pointer would be NULL. v2: Update unit tests and commit message. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	c26d356dd5	nir/tests: Add tests for nir_loop_info::induction_vars tracking Later commits in this MR will change the way some data is track, and these tests will verify this behavior change. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	168e54f7e3	nir/tests: Add tests for "inverted" loops A couple basic tests for loops with the exit condition after the increment. In compiler literature, the optimization that moves the exit condition from the top to the bottom is called "loop inversion." v2: Pass parameters to loop_builder_invert using a struct. Add a comment describing the loop being constructed to loop_builder_invert. Both suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	ffe0db099c	nir/tests: Refactor creation of loops for loop_analyze test cases Inspired heavily by the work by Yevhenii Kolesnikov in the original versions of !3445. v2: Pass parameters to loop_builder using a struct. Add a comment describing the loop being constructed to loop_builder. Both suggested by Caio. v3: mscv C++ designated initializer lolz. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Ian Romanick	7384ea7978	nir/tests: Don't unconditionally log shaders from this one CF test All of the other tests only log the shader when validation fails, so having that shader scroll by in the output is very distracting. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21289>	2023-02-17 22:12:05 +00:00
Faith Ekstrand	2e2d7803c7	nir: Add a load/store bit size lowering pass This is based on brw_nir_lower_mem_access_bit_sizes() but ended up being substantially different. While the core concepts are all the same, the brw_* version made a lot of Intel-specific assumptions. The new version takes a callback which takes a number of bytes of data and an alignment pair and returns a bit size and number of components to load/store. Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21232>	2023-02-17 00:55:54 +00:00
Timothy Arceri	cb58d75224	nir/nir_opt_copy_prop_vars: don't call memset when cloning This makes the pass significantly faster cutting execution time by around 30% in the cts test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 This 30% improvement is in addition to all the improvements from the proceeding patches. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	d1a41d9c64	nir/nir_opt_copy_prop_vars: reorder clone calls This helps with the reuse of dynamic arrays. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	2a2d85e254	nir/nir_opt_copy_prop_vars: reuse dynamic arrays As per the previous commit if we don't reuse these dynamic arrays we end up needlessly thrashing the memory handling functions. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	ffe0f3fda1	nir/nir_opt_copy_prop_vars: reuse hash tables Due to how this pass works we can end up thrashing memory if we do not reuse these hash tables rather than reusing them. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	731e9fd535	nir/nir_opt_copy_prop_vars: avoid comparison explosion Previously the pass was comparing every deref to every load/store causing the pass to slow down more the larger the shader is. Here we use a hash table so we can simple store everything needed for comparision of a var separately. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Timothy Arceri	8f6f5730f6	nir/nir_opt_copy_prop_vars: remove extra loop The fix in `947f7b452a` introduced an extra loop over the copies array to find the correct entry in the case it had been moved. The problem is these loops can be iterated over millions of times so lets simply update the entry pointer in the case we change its location in the array. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20381>	2023-02-16 23:31:59 +00:00
Faith Ekstrand	4e09d37f3b	nir/from_ssa: Move the loop bounds check in resolve_parallel_copy We loop, effectively, over two stacks: ready and to_do and finish only when both are empty. In the case where ready is empty, we pull one off of to_do, add a copy to a temporary, and push it onto the ready stack. Previously, we assumed that we would never get to the temporary copy case if to_do has exactly one entry because that would imply that there was only one copy left which means there can't possibly be a cycle to break. This was true until `c7fc44f9eb` ("nir/from_ssa: Respect and populate divergence information") which changed things such that temporary copies sometimes get added in the case where a convergent value is copied both to convergent and divergent destinations. This patch adjusts our loop iteration to always attempt to clear the ready stack before checking if there's anything left on the to_do stack. I also added an assert to make the exit condition more clear. Fixes: `c7fc44f9eb` ("nir/from_ssa: Respect and populate divergence information") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8037 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21315>	2023-02-16 20:23:42 +00:00
Faith Ekstrand	5afba073c6	nir/from_ssa: Only re-locate values that are destinations There is an optimization in the parallel copy algorithm where, after a copy has been performed, we can treat the destination as the new source for future copies of the same source. In particular, consider the following parallel copy: A -> B, C -> A, A -> C. In this case, after we have done the A -> B copy, we can make note that the value in A is now in B and emit the sequence: A -> B, C -> A, B -> C. This allows us to resolve the swap cycle between A anc C without allocating a temporary register because we know B is also a copy of A. When one of the registers involved is convergent and the other is divergent, this optimization is problematic because, while convergent to divergent copies are fine, we can't re-use the divergent copy in later copies if any of those copies are to a convergent variable. We could, but it would require a read_first_invocation which would get messy. In In `c7fc44f9eb` ("nir/from_ssa: Respect and populate divergence information"), we attempted to deal with this by limiting the rename optimization to the case where the divergence matched. The problem is that we did the re-name part whenever the divergence matched but only marked it as ready if the thing being copied was a destination. (We actually left two instances of loc[a] = b, one which always happened and one which only happened if we also wanted to flag the source as being ready to use as a destination.) While this technically doesn't cause any problems, it may result in more inter-mov dependencies which hurts instruction scheduling. For example, if we had the parallel copy A -> B, A -> C, A -> D, we now end up emitting the sequence A -> B, B -> C, C -> D which has many more data hazards between instructions caused by the constant shuffling. This commit restores the original logic in which we only perform the rename optimization if the rename would free up a register we will later use as a destination. This isn't entirely optimal as it still doesn't prove that there is a cycle involved first, but it should lead to a reduction in unnecessary dependencies. No shader-db changes on SKL or DG2 Fixes: `c7fc44f9eb` ("nir/from_ssa: Respect and populate divergence information") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21315>	2023-02-16 20:23:42 +00:00
Timur Kristóf	2e9f5aadd0	nir: Clarify comment above load_buffer_amd. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21358>	2023-02-16 15:29:36 +00:00
Faith Ekstrand	41b0407d5c	nir/from_ssa: Use more helpers in resolve_parallel_copies Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21299>	2023-02-14 17:54:12 +00:00
Kenneth Graunke	3e09a636db	nir: Fix typos in the from-SSA pass comments Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21299>	2023-02-14 17:54:12 +00:00
Kenneth Graunke	b1ebd9978c	nir: Fix merge_set_dump() to compile again This #if 0'd debug code has been broken since -Werror=vla was added. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21299>	2023-02-14 17:54:11 +00:00
Kenneth Graunke	8343d7fd2a	nir: Print divergence information for registers as well as SSA defs This patch causes us to print "con" and "div" for registers as well as SSA defs. We print it on both register declarations, and destinations. The latter isn't strictly necessary, but it is handy to be able to see e.g. a convergent value being assigned to a divergent register without having to constantly refer back to definitions that might be much earlier in the program. I originally printed it for sources as well, but that got to be a bit wordy, so I dropped that. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21299>	2023-02-14 17:54:11 +00:00
Giancarlo Devich	f9a827d61e	nir: Check sampler_binding is valid when lowering tex shadow Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21247>	2023-02-13 22:57:03 +00:00
Bas Nieuwenhuizen	0a17c3afc5	nir: Apply a maximum stack depth to avoid stack overflows. A stackless (or at least using allocated memory for stack) version might be nice but for now this works around some games compiling large shaders and hitting stack overflows. CC: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21231>	2023-02-11 15:01:42 +01:00
Jesse Natalie	25ee07373c	nir_lower_fp16_casts: Allow opting out of lowering certain rounding modes Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21029>	2023-02-11 06:12:23 +00:00
Jesse Natalie	c0c2b60f1d	nir: Add alignment to load_push_constant Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21029>	2023-02-11 06:12:23 +00:00
Faith Ekstrand	af9212dd82	nir/deref: Preserve alignments in opt_remove_cast_cast() This also removes the loop so opt_remove_cast_cast() will only optimize cast(cast(x)) and not cast(cast(cast(x))). However, since nir_opt_deref walks instructions top-down, there will almost never be a tripple cast because the parent cast will have opt_remove_cast_cast() run on it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21252>	2023-02-10 23:08:19 +00:00
Pavel Ondračka	94eff7ccd8	nir: shrink phi nodes in nir_opt_shrink_vectors While this change helps with few shaders, the main benefit is that it allows to unroll loops comming from nine+ttn on vec4 backends. D3D9 REP ... ENDREP type loops are unrolled now already, LOOP ... ENDLOOP need some nine changes that will come later. r300 RV530 shader-db: total instructions in shared programs: 132481 -> 132344 (-0.10%) instructions in affected programs: 3532 -> 3395 (-3.88%) helped: 13 HURT: 0 total temps in shared programs: 16961 -> 16957 (-0.02%) temps in affected programs: 88 -> 84 (-4.55%) helped: 4 HURT: 0 Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Partial fix for: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8102 Partial fix for: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7222 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21038>	2023-02-10 09:06:25 +00:00
Ian Romanick	18fc4daaf6	nir/inline_uniforms: Add inot condition support From the `96c19d23c9` commit message: Ever since `4246c2869c` and `7d85dc4f35` loop unrolling can no longer depend on inot being eliminated from the loop terminator condition so we need to be able to handle it. Support these conditions here too. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Ian Romanick	682e83f012	nir/inline_uniforms: Make add_inlinable_uniforms public This is step 5 in an attempt to unify a bunch of nir_inline_uniforms.c and lvp_inline_uniforms.c code. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Ian Romanick	cdd23b1efa	nir/inline_uniforms: Make src_only_uses_uniforms public, change name While making the function public, rename it to nir_collect_src_uniforms. The old name makes it sound like it's just a query that doesn't have side effects. That is, however, not the case. This is step 4 in an attempt to unify a bunch of nir_inline_uniforms.c and lvp_inline_uniforms.c code. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Ian Romanick	edb89b71c5	nir/inline_uniforms: Allow possibility of uni_offsets and num_offsets being NULL This is step 3 in an attempt to unify a bunch of nir_inline_uniforms.c and lvp_inline_uniforms.c code. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Ian Romanick	0c0fb216dd	nir/inline_uniforms: Allow possibility of more than one UBO Only caller in this file still only passes 1. This is step 2 in an attempt to unify a bunch of nir_inline_uniforms.c and lvp_inline_uniforms.c code. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Ian Romanick	23b4266f9e	nir/inline_uniforms: Pass max_num_bo and max_offset around as parameters max_num_bo is currently limited to 1. That will change in the next commit. This is step 1 in an attempt to unify a bunch of nir_inline_uniforms.c and lvp_inline_uniforms.c code. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Ian Romanick	1d5033823e	nir/inline_uniforms: Change num_offsets type to uint8_t This is step 0 in an attempt to unify a bunch of nir_inline_uniforms.c and lvp_inline_uniforms.c code. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Alejandro Piñeiro	3685528c1e	nir: track if var copies lowering was called In general we should only call it once, and then we should avoid to call any lowering that introduce back copies. So far we were tracking that manually out of the nir shader on several places. Ideally we would like to add a nir_validate rule, but right now there are some exceptions to this rule. For example right now the Intel compiler calls nir_lower_io_to_temporaries as part of linking tess_ctrl/mesh/task sahders. One option would be to allow drivers to reset the value, but for now let's not add that validation rule. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19338>	2023-02-06 22:11:34 +00:00
Konstantin Seurer	9104dafb6f	vulkan,nir: Refactor ycbcr conversion state into a struct This will be useful for RADV since it hashes the state. v3dv changes: Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20731>	2023-02-06 18:36:29 +00:00
Jason Ekstrand	9c62e0c77d	nir: Remove nir_lower_io_force_sample_interpolation It's no longer used. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:17 +00:00
Alyssa Rosenzweig	6b97f396e6	nir/lower_clip: Only emit 1 discard If we have multiple clip planes, rather than emit multiple discards we can just OR together the discard criteria. Then a nir_opt_algebraic rule kicks in to optimize out the flt/.../flt/ior/.../ior into fmin/.../fmin/flt, generating much less code at the end. Written while debugging an unrelated issue with the clip lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21103>	2023-02-06 02:50:20 +00:00
Alyssa Rosenzweig	93db6094a1	nir/print: Pretty-print color0/1_interp These are an enum. Furthermore, their 0 state is INTERP_MODE_NONE which we shouldn't bother printing at all. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21091>	2023-02-04 17:26:30 +00:00
Alyssa Rosenzweig	b235be1fd4	nir/print: Pretty-print I/O semantic locations Instead of printing the raw location number, which is pretty hard to interpret, let's print the name of the location. Example output: vec4 16 ssa_2 = intrinsic load_interpolated_input (ssa_0, ssa_1) (base=0, component=0, dest_type=float16 /144/, io location=VARYING_SLOT_VAR0 slots=1 mediump /8388768/) One of the "regressions" from moving to purely lowered I/O with all variables removed is a lack of debuggability, since otherwise these location strings don't show up anywhere in the printed shader! By contrast this should make the lowered I/O nice to read like the early I/O. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21091>	2023-02-04 17:26:30 +00:00
Alyssa Rosenzweig	435e7f5e6d	nir/print: Extract get_location_str Locations show up in two places: variables and lowered I/O semantics. We want to reuse the logic in both places, so extract it out. The extracted logic is IMO easier to read, too. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21091>	2023-02-04 17:26:30 +00:00
Hampus Linander	4ffc7c3ff4	nir: Add extr_agx opcode The AGX extr instruction extracts a bitfield from two 32bit registers. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:24 -05:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Ian Romanick	024122c069	nir/builder: Handle f2b conversions specially in nir_type_convert No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Ian Romanick	b265020b82	nir/builder: Eliminate nir_f2b helper (and use of nir_f2b32 helper) There were only two users. Replace each with nir_fneu instead. This is now a squash of what was two separate commits. nir_lower_pstipple_block is called after nir_lower_bool_to_int32, so nir_fneu32 has to be used here or there will be regresssions in stipple tests on llvmpipe. v2: Rebase on !20869. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Suggested-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Alyssa Rosenzweig	071ac59960	nir: Add a late texcoord replacement pass Add a second NIR pass for lowering point/texture coordinate replacement (i.e. point sprites). Why a second one? The current pass works on derefs/variables, which is good for drivers that don't lower I/O at all (like Zink, where the pass originates). However, it is problematic for hardware drivers: the inputs to this pass depend on the shader key, so we want to run the pass as late as possible to minimize the cost of building/compiling the associated shader variants. In particular, we need to be able to lower point sprites after lowering I/O if we would like to lower I/O when preprocessing NIR. The logic for early lowering and late lowering is considerably different (the late lowering is a lot simpler), so I've split this out into a second pass rather than trying to weld them together into one. This pass will be used on Asahi, which currently uses the early pass. It may be useful for other drivers as well. (Actually, it's been shipping on Asahi for a little while now, just hasn't been sent upstream yet.) Tested with Neverball. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Asahi Lina <lina@asahilina.net> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21065>	2023-02-03 15:03:06 +00:00
Qiang Yu	f6b194b648	nir,ac/llvm,aco,radv,radeonsi: remove nir_export_vertex_amd Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:44 +00:00

1 2 3 4 5 ...

4169 commits