fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 03:18:06 +02:00

Author	SHA1	Message	Date
Connor Abbott	b63782da16	isaspec: Add "custom" field type Add support for a field which is decoded by a user callback. This will be used for decoding control registers in cread/cwrite by afuc. In order for this to interact well with the align feature, we need to pull print() out of the decode implementation so that the callback can call it and keep track of the line column. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23949>	2023-07-28 18:41:58 +00:00
Connor Abbott	dc874e4654	isaspec: Add support for function and entrypoint labels Functions (i.e. labels reached from call instructions) should be printed differently from normal labels. In addition we also need to add support for entrypoints with user-defined names in order to show packet names in afuc. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23949>	2023-07-28 18:41:58 +00:00
Connor Abbott	569d3ac5a1	isaspec: Add support for "absolute" branches afuc has branches which use an absolute offset in instructions from the microcode base. Add another type to support them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23949>	2023-07-28 18:41:57 +00:00
Connor Abbott	86b17d96b3	isaspec: Add "displayname" for altering {NAME} when decoding In afuc, we have the situation where there are a number of ALU instructions with two (almost) completely different encodings, including a different opcode location, etc. These need to be different leaf bitsets with different names for the encoder to work, because otherwise the encoder has no way of descriminating between them, but when displaying them we want to use the same name. This adds a small facility to make the name used for {NAME} when displaying and for the opcode when encoding different, so that e.g. OPC_ADDI can display as "add" instead of "addi". Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23949>	2023-07-28 18:41:57 +00:00
Mike Blumenkrantz	e68e612826	nir: add a helper for calculating variable slots this will maybe avoid future bugs, but probably not Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24163>	2023-07-28 13:14:35 +00:00
Mike Blumenkrantz	59396eefe6	nir: fix slot calculations for compact variables with location_frac a variable with a component offset may span multiple slots, and this cannot be inferred from its type alone (e.g., compacted clip+cull distances) cc: mesa-stable Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24163>	2023-07-28 13:14:35 +00:00
Caio Oliveira	e693fd815a	nir: Let nir_fixup_deref_modes() fix deref_casts when possible If a deref_cast parent is also a deref, and the parent mode is not generic, we can safely propagate the parent mode down to the deref_cast. This will allow the fixup to work when the deref_cast is used just to change the glsl_type when accessing a variable/deref-chain. Reviewed-by: Faith Ekstrand <faith@gfxstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23734>	2023-07-27 16:20:36 +00:00
Rhys Perry	59f24c7df8	nir/lower_shader_calls: vectorize stack access for all shaders fossil-db (gfx1100): Totals from 9 (0.01% of 133461) affected shaders: MaxWaves: 156 -> 158 (+1.28%) Instrs: 37193 -> 37324 (+0.35%) CodeSize: 191008 -> 191968 (+0.50%) VGPRs: 816 -> 804 (-1.47%) Latency: 75789 -> 75641 (-0.20%); split: -0.35%, +0.15% InvThroughput: 10475 -> 10441 (-0.32%); split: -0.40%, +0.08% VClause: 666 -> 663 (-0.45%); split: -0.75%, +0.30% SClause: 1077 -> 1076 (-0.09%) Copies: 3425 -> 3407 (-0.53%); split: -0.73%, +0.20% PreVGPRs: 770 -> 745 (-3.25%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24334>	2023-07-27 12:38:01 +00:00
Faith Ekstrand	29c4417fb8	nir: Add a backend_flags field to nir_tex_instr In `9ffd00bcf1` ("nir_to_tgsi: Pack our tex coords into vec4 nir_tex_src_backend[12]"), Emma added a pair of back-end sources to nir_tex_instr to allow complex lowering to be done in NIR. This adds a tiny bit more hw-specific back-end information that a NIR lowering pass can communicate to the back-end compiler. While the opcode contains most of the information needed, some thing such as the presence of offsets is currently only communicated via the presence of specific source types in the source list. This information is gone when the texture instruction is lowered to back-end sources. Adding a backend_flags field fixes this by allowing the lowering pass to communicate a small amount of side-band information if needed. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22303>	2023-07-26 20:12:49 +00:00
Alyssa Rosenzweig	6619317172	nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP Just drop the store. Written while debugging dEQP-VK.pipeline.monolithic.logic_op.r8_uint.no_op. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24252>	2023-07-25 18:03:57 +00:00
Alyssa Rosenzweig	9c0740211d	nir/lower_blend: Fix 32-bit logicops nir_const_value_for_int asserts signed bounds on the input, but we pass in an unsigned value that would be out-of-bounds for 32-bit channels, causing the assert to fail for 32-bit channel formats. Fixes dEQP-VK.pipeline.monolithic.logic_op.r32_uint.* on AGXV (and probably PanVK). Fixes: `dbd0615e7a` ("nir/lower_blend: Avoid useless iand with logic ops") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24252>	2023-07-25 18:03:57 +00:00
Faith Ekstrand	355afc92d1	nir/schedule: Support load/store_reg These are tracked the same way as register reads and writes, allowing them to be re-arranged as long as they respect dependencies within the same reg. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Iago Toral Quiroga	dff85b6163	nir/trivialize: Move decl_reg to the start of the block This makes it so we never find a reg_decl in between a reg_store and the def for its value, which helps avid inserting copy movs. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Alyssa Rosenzweig	0655bada4b	nir/trivialize: Handle more RaW hazards Consider the snippet of NIR: div 32 %447 = @load_reg (%442) (base=0, legacy_fabs=0, legacy_fneg=0) div 32 %463 = @load_reg (%442) (base=0, legacy_fabs=0, legacy_fneg=0) con 32 %409 = iadd %17 (0x3), %447 @store_output (%182 (0x601), %463) (base=0, wrmask=x, component=0, src_type=invalid... @store_reg (%409, %442) (base=0, wrmask=x, legacy_fsat=0) The load_reg's are trivial, so the %442 read will get folded into store_output. But under the old definition, the store_reg is also trivial so it gets folded into the iadd... causing a read-after-write hazard and invalid code generation. The fix is to amend our definition of store_reg triviality to account for loads getting folded in. It's not good enough that there's no intervening load_reg, there can also be no intervening source that gets chased to a load_reg. Handle that case as well. Identified in dEQP-VK.geometry.input.basic_primitive.triangles_adjacency on V3DV. Fixes: `d313eba94e` ("nir: Add pass for trivializing register access") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reported-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Faith Ekstrand	f8b69abbd4	nir/trivialize: Trivialize cross-block loads In order for a register load to be trivial, it cannot be used in any block other than the one in which it is loaded. We're not currently explicitly doing anything to ensure this invariant holds. It may be that it holds regardless but I couldn't find any documented reason why it should so let's explicitly handle that case. Worst case, the newly added code does nothing. Fixes: `d313eba94e` ("nir: Add pass for trivializing register access") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Faith Ekstrand	f1f05cc7cf	nir/trivialize: Maintain divergence information Because this pass is intended to be run after out-of-SSA and directly before injesting the NIR into the back-end, it may come after divergence analysis and needs to preserve the divergence information. Fortunately, since all we ever do is insert nir_op_mov, this is easy. Fixes: `d313eba94e` ("nir: Add pass for trivializing register access") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Faith Ekstrand	4fd257d20f	nir: Properly handle divergence for load_reg This commit makes three changes: 1. Default all newly created registers divergent because this is the safer default. 2. Make divergence analysis do something sane with register divergence. It's not perfect because divergence analysis isn't able to prove registers divergent based on stores but at least if someone uses registers a bit they'll end up with safe defaults. This matches what they'd get with nir_ssa_def_init(). 3. Make the load_reg() helper automatically propagate divergence from the register. Because the defaults for both nir_ssa_def_init() and nir_decl_reg() are to mark everything divergent, this only means that nir_load_reg() of a uniform reg is now uniform. Putting all these together, nir_from_ssa should now be producing load_reg intrinsics with the proper uniform information. Fixes: `7229bffcb1` ("nir: Add intrinsics for register access") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Rhys Perry	a53d3ff0b3	nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24235>	2023-07-24 14:06:16 +01:00
Rhys Perry	21f0aca948	nir/opt_dead_cf: remove nodes after a jump earlier In the case of: halt // succs: b9 if %618 { block b3:// preds: break // succs: b6 } else { block b4: // preds: , succs: b5 } block b5: // preds: b4 32 %556 = iadd %617, %2 (0x1) opt_constant_if() doesn't work because stitch_blocks() can't join blocks if the before ends in a jump and the after isn't empty. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24235>	2023-07-24 14:06:16 +01:00
Konstantin Seurer	1c8577b493	nir/tests: Use a single binary Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24249>	2023-07-24 11:44:46 +00:00
Konstantin Seurer	6eb0a3a5b7	nir/tests: Refactor boilerplate into a common header Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24249>	2023-07-24 11:44:46 +00:00
Timothy Arceri	2cf8c8cba4	nir/opt_copy_prop_vars: drop reuse of dynamic arrays After the previous commit there are so few to reuse that this is no longer worth doing and actually causes compilation to slow down. The Blender shader compile time in issue #9326 improves as folows: 21.11 seconds -> 9.90 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 0.92 seconds -> 0.68 seconds Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9326 Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	d56e739417	nir/opt_copy_prop_vars: skip cloning of copies arrays until needed Most of the variables in the hash table will never actually be looked up for any given block so cloning every possible value just creates a bunch of unrequired memcpy calls. Here we change the code to only clone the copies array once it is actually looked up for the first time. The Blender shader compile time in issue #9326 improves as folows: 151.09 seconds -> 21.11 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 1.67 seconds -> 0.92 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	869b5a562e	nir/opt_copy_prop_vars: remove var hash entry on kill alias If kill alias results in the hash table entry holding an empty copies array then remove the hash entry and return the dynamic array to the unused pool. This helps avoid hash table size getting out of control in very large shaders. 151.09 seconds -> 118.60 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	9b4c7cc611	nir/opt_copy_prop_vars: speedup cloning of copy tables Here we change things to simply clone the entire hash table. This is much faster than trying to rebuild it and is needed to avoid slow compilation of very large shaders. The Blender shader compile time in issue #9326 improves as folows: 251.29 seconds -> 151.09 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 2.38 seconds -> 1.67 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	e9804bdc4c	nir/opt_copy_prop_vars: don't clone copies if branch empty There is no point doing an expensive clone of the copies if the if-branch is empty. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Bas Nieuwenhuizen	c2e3986326	nir: Fix 16-component nir_replicate. Fixes: `f534c2c539` ("nir/builder: Add nir_replicate helper") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24286>	2023-07-22 22:11:15 +00:00
Alyssa Rosenzweig	03b2c34793	nir: Remove register arrays Nothing produces them any more, so remove them from NIR. This massively reduces the size of nir_src, which should improve performance all over. nir_src size reduced from 56 bytes -> 40 bytes (pahole results on arm64, x86_64 should be similar.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24253>	2023-07-21 11:25:49 +00:00
Alyssa Rosenzweig	1466014184	nir: Rename lower_locals_to_reg_intrinsics back The short name is freed up. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24253>	2023-07-21 11:25:49 +00:00
Alyssa Rosenzweig	d2c94f9e71	nir: Remove nir_lower_locals_to_regs No more users, all switched to the intrinsic version. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24253>	2023-07-21 11:25:49 +00:00
Christian Gmeiner	fb48d3d1da	nir: add enta specific intrinsic used for txs lowering Non of the know etnaviv GPUs support this feature in hardware and the binary blob provides sizes via uniforms too. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24217>	2023-07-21 08:52:03 +00:00
Christian Gmeiner	019e5cbd39	nir/print: print instr pass_flags From time to time it can be helpful to "see" the pass_flags. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24234>	2023-07-20 18:03:47 +00:00
Caio Oliveira	97c79cdf19	nir: Use instructions_pass() for nir_fixup_deref_modes() Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24220>	2023-07-19 20:15:12 +00:00
Timothy Arceri	c64ad299e4	glsl: fix validation of ES vertex attribs From OpenGL ES 3.0 spec, page 56: "Binding more than one attribute name to the same location is referred to as aliasing, and is not permitted in OpenGL ES Shading Language 3.00 vertex shaders. LinkProgram will fail when this condition exists. However, aliasing is possible in OpenGL ES Shading Language 1.00 vertex shaders. This will only work if only one of the aliased attributes is active in the executable program, or if no path through the shader consumes more than one attribute of a set of attributes aliased to the same location. A link error can occur if the linker determines that every path through the shader consumes multiple aliased attributes, but implemen- tations are not required to generate an error in this case." So here we make sure to allow the optimisations before validation for earlier ES shader versions. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Fixes: `80c001013c` ("glsl: do vs attribute validation in NIR linker") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9342 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24205>	2023-07-19 00:43:26 +00:00
Alyssa Rosenzweig	d0f0afc6a4	nir: Initialize workgroup_size in builder_init_simple_shader It can't be 0 in Vulkan. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24158>	2023-07-17 19:53:49 +00:00
Alyssa Rosenzweig	4f0f76346e	nir: Add nir_lower_tess_coord_z pass Lowers tess_coord to tess_coord_xy and math. Based on ir3's version. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24159>	2023-07-17 17:31:52 +00:00
Alyssa Rosenzweig	9109830bb0	nir: Promote tess_coord_r600 to tess_coord_xy This intrinsic (vec2 tess_coord) is generally useful for non-r600 backends. Promote it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24159>	2023-07-17 17:31:52 +00:00
Rhys Perry	5e4029bfe5	nir/tests: add test for unsigned_upper_bound with loop header phis Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23990>	2023-07-17 14:45:21 +00:00
Rhys Perry	1139d870f3	nir/unsigned_upper_bound: fix phi(bcsel) This was looking at the wrong sources. src0 is the condition. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Fixes: `72ac3f6026` ("nir: add nir_unsigned_upper_bound and nir_addition_might_overflow") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23990>	2023-07-17 14:45:21 +00:00
Alyssa Rosenzweig	9bcdc45ee7	nir: Devendor load_sample_mask AGX will use this too for its MSAA lowerings. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24148>	2023-07-15 19:48:30 +00:00
Alyssa Rosenzweig	56d61d9a64	nir: Add fence_{pbe,mem}_to_tex(_pixel)_agx intrinsics Read-after-write hazards require special handling on AGX, since image loads are implemented with texturing. Add intrinsics to handle these hazards. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24148>	2023-07-15 19:48:30 +00:00
Alyssa Rosenzweig	d54aa28b97	nir/legacy: Fix handling of fsat(fabs) Consider code like: 32x4 %2 = @load_interpolated_input (%1, %0 (0x0)) (base=0, component=0, dest_type=float32, io location=VARYING_SLOT_VAR0 slots=1 mediump) // Color 32x4 %3 = fabs %2 32x4 %4 = fsat %3 32x4 %5 = fsin %4 The existing logic would incorrectly tell the backend that both fabs and fsat could be folded, and then half the shader disappears. Whoops. Fix by stopping the folding in this case. I choose to do this check in the fsat rather than the fabs because it's more straightforward (1 source vs N uses) but it's somewhat arbitrary. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24116>	2023-07-13 22:43:36 +00:00
Alyssa Rosenzweig	34fcf6d479	nir/legacy: Fix fneg(load_reg) case Consider the IR: %0 = load_reg %1 = fneg %0 %2 = ffloor %1 %3 = bcsel .., .., %1 Because the fneg has both foldable and non-foldable users, nir/legacy does not fold the fneg into the load_reg. This ensures that the backend correctly emits a dedicated fneg instruction (with the load_reg folded in) for the bcsel to use. However, because the chasing helpers did not previously take other uses of a modifier into account, the helpers would fuse in the fneg to the ffloor. Except that doesn't work, because the load_reg instruction is supposed to be eliminated. So we end up with broken chased IR: 1 = fneg r0 2 = ffloor -NULL 3 = bcsel, ..., 1 The fix is easy: only fold modifiers into ALU instructions if the modifiers can be folded away. If we can't eliminate the modifier instruction altogether, it's not necessarily beneficial to fold it anyway from a register pressure perspective. So this is probably ok. With that check in place we get correct IR 1 = fneg r0 2 = ffloor 1 3 = bcsel, ..., 1 Fixes carchase/230.shader_test under softpipe. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24116>	2023-07-13 22:43:36 +00:00
Alyssa Rosenzweig	a608f5804c	compiler: Remove blend enums duplicating util Now unused. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24076>	2023-07-13 21:03:32 +00:00
Alyssa Rosenzweig	a2d56c4c73	nir/lower_blend: Use util enums This avoids the silly compiler versions. Some bits are slightly more complicated, because they have to account for inverted enum values (rather than a separate invert bit), but this is a LOT friendlier to drivers using the pass and it makes the pass itself more readable. The conversion functions in panfrost/panvk will go away momentarily. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24076>	2023-07-13 21:03:32 +00:00
Christian Gmeiner	f831883af6	nir/lower_tex: optimize offset lowering for has_texture_scaling Generates much better code and even helps to beat a blob driver. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24054>	2023-07-12 10:03:06 +00:00
Christian Gmeiner	9383009809	nir: rename has_txs to has_texture_scaling Convert it to an opt-in for backends to prefer and use nir_load_texture_scale instead of txs for nir lowerings. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Suggested-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24054>	2023-07-12 10:03:06 +00:00
Christian Gmeiner	9ddedf4554	nir: rename intrinsic to have a more generic nameing Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24054>	2023-07-12 10:03:06 +00:00
Alyssa Rosenzweig	fded7e7b66	nir: Remove nir_register-based unit tests Non-SSA functionality will become obsolete after nir_register is removed, so there's no need to keep the tests around, and they will interfere with the nir_register de-clawing. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23089>	2023-07-12 01:34:27 +00:00
Alyssa Rosenzweig	e96a9a1b71	nir: Remove nir_lower_regs_to_ssa It is now unused, as all internal producers of registers have been switched over to intrinsics and no drivers call it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23089>	2023-07-12 01:34:27 +00:00

1 2 3 4 5 ...

8308 commits