fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-03 00:40:09 +01:00

Author	SHA1	Message	Date
Dave Airlie	fc137ecaca	radv/video: handle inline queries for vulkan video encode. This handles the new inlines queries for VK_KHR_video_maintenance1 (Decode doesn't do queries on AMD at the moment). Reviewed-by: Lynne <dev@lynne.ee> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30671>	2024-08-16 13:35:22 +10:00
Dave Airlie	1b4ae3d7af	vulkan/video: handle KHR_video_maintenance1 lack of profile list. VK_KHR_video_maintenance1 allows no profile in which cases drivers should just be pessimisitic. Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lynne <dev@lynne.ee> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30671>	2024-08-16 13:31:41 +10:00
David Rosca	b48bc87783	radv/video: Add support for 12-bit AV1 decode Reviewed-by: Lynne <dev@lynne.ee> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30551>	2024-08-16 02:41:30 +00:00
David Rosca	4dbb9f5355	vulkan/format: Add VK_FORMAT_G12X4_B12X4R12X4_2PLANE_420_UNORM_3PACK16 Used for 12-bit AV1. Reviewed-by: Lynne <dev@lynne.ee> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30551>	2024-08-16 02:41:30 +00:00
Timothy Arceri	08b93c841a	nir: make static assert more flexible The static assert used in encode deref modes used the fact there was less than 16 modes that we wanted to compress as an opportunity to reuse MODE_ENC_GENERIC_BIT as it just happened to represent 16. However if we add more than 16 modes i.e need to compress to 6 bits not 5 bits then MODE_ENC_GENERIC_BIT becomes 32 and the logic in the assert breaks. Instead we more precisely make sure MODE_ENC_GENERIC_BIT is large enough to fit all but the last 4 generic modes and that the last 4 modes defined in the enum are in fact the 4 generic modes. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30654>	2024-08-15 23:02:20 +00:00
Faith Ekstrand	85a70bbc05	nvk: Enable shader bounds checking when nullDescriptor is enabled Fixes: `c9eac89da8` ("nvk: Advertise VK_EXT_robustness2") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30663>	2024-08-15 22:25:13 +00:00
Faith Ekstrand	8445190663	nvk: Plumb the whole vk_pipeline_robustness_state through to nvk_ubo/ssbo_addr_format Fixes: `c9eac89da8` ("nvk: Advertise VK_EXT_robustness2") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30663>	2024-08-15 22:25:13 +00:00
Faith Ekstrand	6ae401aa86	vulkan: Add null descriptor bits to vk_pipeline_robustness_state Fixes: `c9eac89da8` ("nvk: Advertise VK_EXT_robustness2") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30663>	2024-08-15 22:25:13 +00:00
Matt Turner	c437f2e79c	nir/tests: Add tests for opt_if_merge Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30629>	2024-08-15 20:34:54 +00:00
Matt Turner	d2e6be94ae	nir: Skip opt_if_merge when next_if has block ending in a jump Similar to commit `6cef804067` ("nir/opt_if: fix opt_if_merge when destination branch has a jump"), we shouldn't combine if statements when the second if-then-else has a block that ends in a jump. This fixes a case where opt_if_merge combines if (cond) { [then-block-1] } else { [else-block-1] } if (cond) { [then-block-2] } else { [else-block-2] } where `then-block-2` or `else-block-2` ends in a jump. The phi nodes following the control flow will be incorrectly updated to have an input from a block that is not a predecessor. Fixes: `4d3f6cb973` ("nir: merge some basic consecutive ifs") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30629>	2024-08-15 20:34:54 +00:00
Ruijing Dong	b9c1fcc59b	radeonsi/vcn: qp map IB package sent by default This is to support QP map enabled and disabled mixed case. When qp map disabled, it still needs the IB package to tell VCN engine qp map is not needed. Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30665>	2024-08-15 18:16:06 +00:00
Ruijing Dong	ac45948136	frontends/va: reset roi number reason: roi number is an indication to do qp_map in vcn encoder. if not resetting this number, the previous roi style will be used if not changed, or not used. In the case non roi case mixed with roi, the behavior will not be expected. reset roi_num at the beginning of each frame, if application doesn't send roi map, then roi will be stopped. Reviewed-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30665>	2024-08-15 18:16:06 +00:00
Gurchetan Singh	8485067541	tu: use os_get_total_physical_memory(..) This is more OS-agnostic. On Linux, os_get_total_physical_memory(..) is based on __SC_PHYS_PAGES, which under the hood seems to call sysinfo: https://github.com/bminor/glibc/blob/master/sysdeps/unix/sysv/linux/getsysstats.c#L273 As an optimization, perhaps os_get_total_physical_memory(..) should use sysinfo directly too. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30678>	2024-08-15 17:46:37 +00:00
Pavel Ondračka	a1eced7819	mesa: implement GL_FRAMEBUFFER_BLEND query Right now mesa always returns GL_FULL_SUPPORT, so use the is_format_supported hook to get the actual info from drivers instead. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30612>	2024-08-15 16:36:17 +00:00
Rhys Perry	aafb49f56b	aco: set prefer_remove for gfx9- too This is a hint that the branch is worth removing. Assume that's the case, regardless of the gfx level. fossil-db (vega10): Totals from 22 (0.03% of 63053) affected shaders: Instrs: 23927 -> 23856 (-0.30%) CodeSize: 125096 -> 124812 (-0.23%) Latency: 138258 -> 137765 (-0.36%) InvThroughput: 55900 -> 55884 (-0.03%) Branches: 391 -> 320 (-18.16%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30321>	2024-08-15 16:00:19 +00:00
Rhys Perry	9f1a5645cf	aco: completely skip branches if they're never taken fossil-db (navi21): Totals from 196 (0.25% of 79395) affected shaders: Instrs: 101902 -> 101706 (-0.19%) CodeSize: 576988 -> 576232 (-0.13%) Latency: 750344 -> 750280 (-0.01%); split: -0.01%, +0.00% InvThroughput: 119170 -> 119161 (-0.01%) Branches: 3933 -> 3737 (-4.98%) fossil-db (vega10): Totals from 585 (0.93% of 63053) affected shaders: Instrs: 346877 -> 346292 (-0.17%) CodeSize: 1810600 -> 1808260 (-0.13%) Latency: 1817743 -> 1814233 (-0.19%) InvThroughput: 652142 -> 651944 (-0.03%) Branches: 5087 -> 4502 (-11.50%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30321>	2024-08-15 16:00:19 +00:00
Rhys Perry	c29d9f1184	aco: only remove branch jumping over SMEM/barrier if it's never taken SMEM might be an invalid access, and barriers are probably expensive. fossil-db (navi21): Totals from 126 (0.16% of 79395) affected shaders: Instrs: 2764965 -> 2765377 (+0.01%) CodeSize: 15155348 -> 15156788 (+0.01%) Latency: 17604293 -> 17604296 (+0.00%) Branches: 105211 -> 105623 (+0.39%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Backport-to: 24.1 Backport-to: 24.2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30321>	2024-08-15 16:00:19 +00:00
Rhys Perry	b934255510	aco: split selection_control_remove into rarely_taken and never_taken No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Backport-to: 24.1 Backport-to: 24.2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30321>	2024-08-15 16:00:18 +00:00
Connor Abbott	c59be8516b	Revert "tu/a750: Disable HW binning when there is GS" This reverts commit `7eb6123e98`. The root cause was actually the bug fixed by the previous commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30675>	2024-08-15 15:27:08 +00:00
Connor Abbott	850f2aab03	ir3, tu: Use a UBO for VS primitive params on a750+ Before we were using direct CP_LOAD_STATE, which is broken with multiple back-to-back draws. This caused regressions in some DX11 traces when enabling early preamble. We still need to use indirect CP_LOAD_STATE for VS params, which are sometimes written by the CP, however for everything else we should use the new UBO path instead. Fixes: `76e417ca59` ("turnip,ir3/a750: Implement consts loading via preamble") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30675>	2024-08-15 15:27:08 +00:00
Connor Abbott	4f2b5442a6	tu: Fix off-by-one in UBO CP_LOAD_STATE size It's one header dword and 5 payload dwords. This was papered over by us not actually using the UBO path for one of the loads, but that's changed in the next commit. Fixes: `76e417ca59` ("turnip,ir3/a750: Implement consts loading via preamble") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30675>	2024-08-15 15:27:08 +00:00
Eric Engestrom	4811632bfe	nvk/ci: add vkd3d job on the ga106 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30430>	2024-08-15 15:07:54 +00:00
Job Noorman	f448cf90c8	zink/ci: add a618 flake Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	72bb4d79dc	ir3/legalize: handle scalar ALU WAR hazards for a0.x It turns out that mova executes on the normal pipeline, which means that users of a0.x on the scalar pipeline might cause a WAR hazard with mova. Fixes: `876c5396a7` ("ir3: Add support for "scalar ALU"") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	dead168200	ir3: make fullsync sync after shared writes fullsync would only sync after cat4/5/6 instructions. However, since the introduction of scalar ALU, we also need to sync after writes to shared registers. This commit fixes this by using the is_ss/sy_producer helpers. This should also catch all cases where (ss) is need for WAR hazards. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	2e40dda3cd	ir3/ci: remove fixed tests from a307-fails Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	83b55a7d7c	ir3: use correct bit size for bools in emit_alu The special case for 32b bools on pre-a5xx gens was not taken into account everywhere. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	cf395d1437	ir3: use rpt instructions for frag coord Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	6e6b338f33	ir3: add support for rpt bary.f/flat.b These can be repeated like other instructions with one interesting wrinkle: their immediate input location can also be repeated and its value gets incremented by one for every repeat. They seem to be the only instructions to support this. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	4a6d48cf4c	ir3: enable load/store_const_ir3 vectorization Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	9998b65695	nir/load_store_vectorize: add load/store_const_ir3 Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	db2859cb7f	nir/load_store_vectorize: support stores without wrmask Some store intrinsics (e.g., store_const_ir3) don't have a wrmask so don't assume it always exists. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	97aefc4405	nir/load_store_vectorize: support non-byte offset Some load/store intrinsics (e.g., load/store_const_ir3) use offsets in units other than bytes. Currently, byte offsets were assumed in multiple places. This patch adds a new offset_scale field to intrinsic_info and uses it were needed. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	fbd2c80671	ir3: rename @store_uniform_ir3 to @store_const_ir3 Uniforms are a legacy thing and this intrinsic was only used to store to the const file so the new naming is less confusing. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	e0bad1dd20	ir3: replace @load_uniform by new @load_const_ir3 intrinsic Uniforms are a legacy thing and this intrinsic was only used to load from const registers so the new naming is less confusing. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	94c49b2cc3	ir3: add support for vectorized NIR phi nodes Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	6b611dbe79	nir/opt_vectorize: add support for phi nodes Phi nodes are mostly handled the same way as ALU instructions: if all sources point to the same def (which happens if they are scalar or have been previously vectorized), combine them into a single vectorized phi node. There is one case where this doesn't work, however: sources that come from a loop back-edge. Since their defs haven't been processed yet, they are generally not the same. We could simply refuse to vectorize such phi nodes but this could leave many values used in loops unnecessarily scalarized. Instead, this patch implements a simple heuristic: if all defs coming from a back-edge have the same instructions type and, in case of ALU, the same operation, assume they will be vectorized later. Since we require that normal edges are vectorized already, chances are that the back-edge can also be vectorized. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	79eb57de93	nir/opt_vectorize: process blocks in source-code order To handle phi nodes, it's important that all sources have been processed before processing the phi node itself. The current traversal order (depth-first on dom_children) does not guarantee this. This patch rewrites the pass to visit blocks in source-code order. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	b451575989	nir/opt_vectorize: prepare for multiple try_combine functions Dispatch to different functions inside instr_try_combine. To prepare for upcoming support for phi nodes. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	e2cb646148	nir/opt_vectorize: move rewriting of uses to a function Will be shared with upcoming support for phi nodes. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	58d18bc7a8	ir3: lower vectorized NIR instructions Use the new repeat group builders to lower vectorized NIR instructions. Add NIR pass to vectorize NIR before lowering. Support for repeated instruction is added over a number of different commits. Here's how they all tie together: ir3 is a scalar architecture and as such most instructions cannot be vectorized. However, many instructions support the (rptN) modifier that allows us to mimic vector instructions. Whenever an instruction has the (rptN) modifier set it will execute N more times, incrementing its destination register for each repetition. Additionally, source registers with the (r) flag set will also be incremented. For example: (rpt1)add.f r0.x, (r)r1.x, r2.x is the same as: add.f r0.x, r1.x, r2.x add.f r0.y, r1.y, r2.x The main benefit of using repeated instructions is a reduction in code size. Since every iteration is still executed as a scalar instruction, there's no direct benefit in terms of runtime. The only exception seems to be for 3-source instructions pre-a7xx: if one of the sources is constant (i.e., without the (r) flag), a repeated instruction executes faster than the equivalent expanded sequence. Presumably, this is because the ALU only has 2 register read ports. I have not been able to measure this difference on a7xx though. Support for repeated instructions consists of two parts. First, we need to make sure NIR is (mostly) vectorized when translating to ir3. I have not been able to find a way to keep NIR vectorized all the way and still generate decent code. Therefore, I have taken the approach of vectorizing the (scalarized) NIR right before translating it to ir3. Secondly, ir3 needs to be adapted to ingest vectorized NIR and translate it to repeated instructions. To this end, I have introduced the concept of "repeat groups" to ir3. A repeat group is a group of instructions that were produced from a vectorized NIR operation and linked together. They are, however, still separate scalar instructions until quite late. More concretely: 1. Instruction emission: for every vectorized NIR operation, emit separate scalar instructions for its components and link them together in a repeat group. For every instruction builder ir3_X, a new repeat builder ir3_X_rpt has been added to facilitate this. 2. Optimization passes: for now, repeat groups are completely ignored by optimizations. 3. Pre-RA: clean up repeat groups that can never be merged into an actual rptN instruction (e.g., because their instructions are not consecutive anymore). This ensures no useless merge sets will be created in the next step. 4. RA: create merge sets for the sources and defs of the instructions in repeat groups. This way, RA will try to allocate consecutive registers for them. This will not be forced though because we prefer to split-up repeat groups over creating movs to reorder registers. 5. Post-RA: create actual rptN instructions for repeat groups where the allocated registers allow it. The idea for step 2 is that we prefer that any potential optimizations take precedence over creating rptN instructions as the latter will only yield a code size benefit. However, it might be interesting to investigate if we could make some optimizations repeat aware. For example, the scheduler could try to schedule instructions of a repeat group together. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	4c4366179b	ir3: add post-RA pass to merge repeat groups into rptN instructions For repeat groups where the register assignment allows it, merge them into a single rptN instruction. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	c510b83a4d	ir3: add pre-RA pass to clean up repeat groups Clean up repeat groups that can never be merged into an actual rptN instruction (e.g., because their instructions are not consecutive anymore). This ensures no useless merge sets will be created for RA. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	4fcee235a6	ir3: make RA aware of repeat groups Create merge sets for the sources and defs of the instructions in repeat groups. This way, RA will try to allocate consecutive registers for them. This will not be forced though because we prefer to split-up repeat groups over creating movs to reorder registers. When choosing a register for a repeat group's merge set, if its merge set is unique (i.e., only used for these repeated instructions), try to first allocate one of their sources (for the same reason as for ALU/SFU instructions). This also prevents us from allocating a new register range for this merge set when the one from a source could be reused. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	a5b03fc316	ir3: add builders for repeated instructions For every instruction builder ir3_X, this patch adds new repeat builder ir3_X_rpt to create a repeated version of an instruction. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	6aea957948	ir3: add backend support for repeated instructions In order to represent repeated instructions (rptN) in ir3, this patch introduces the concept of "repeat groups". A repeat group is a group of instructions that were produced from a vectorized NIR operation and linked together. They are, however, still separate scalar instructions. Repeat groups are created by linking together multiple instructions using a new rpt_node list. This patch adds this list as well as a number of helper functions the can be used to create and manipulate repeat groups. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	849005a471	ir3: print (sat) modifier of instructions Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	cd171964a6	ir3: add debug option to expand rpt instructions Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	ef162f9a6f	ir3: correctly count vectorized instructions for tex prefetch The tex prefetch heuristic simply counts the number of NIR instructions. Since a vectorized NIR instruction expands to an ir3 instruction per component, we have to take this into account while counting them. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	fe09ea29b9	ir3: fix counting of repeated registers (r) registers also have their wrmask set so the instruction's rpt field should not be taken into account. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00

1 2 3 4 5 ...

179300 commits