fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 00:28:08 +02:00

Author	SHA1	Message	Date
Marek Olšák	0fdd6de65f	nir/lower_io: validate locations more accurately Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36091>	2025-07-15 13:38:29 +00:00
Marek Olšák	2ba2a61101	nir: switch indirect IO load lowering to nir_lower_io_indirect_loads for GLSL This reduces GLSL compile times with the gallium noop driver by 0.6%. This might decrease register usage and do less code reordering because nir_lower_io_vars_to_temporaries is no longer called for inputs, which moved most input loads to the top. radeonsi+ACO shader-db results are noise. More uniforms are identified as inlinable. TOTALS FROM ALL SHADERS (58138): VGPRs: 2152680 -> 2158032 (0.25 %) Code Size: 71008908 -> 71064812 (0.08 %) bytes Max Waves: 916943 -> 916924 (-0.00 %) Inline Uniforms: 6395 -> 6414 (0.30 %) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36018>	2025-07-10 16:37:45 +00:00
Marek Olšák	3dd9a9782b	nir: add new pass nir_lower_io_indirect_loads This is a partial replacement for nir_lower_io_vars_to_temporaries. It supports all input and output loads. It doesn't handle stores. The motivation is to improve compile times. The main differences compared to nir_lower_io_vars_to_temporaries are: - it only lowers indirect loads to temps and doesn't touch direct loads which improves compile times and removes the need for nir_lower_vars_to_ssa afterward because indirect temp access can't be lowered to SSA - it doesn't move all input loads to the top; it only moves those input loads to the top whose indirect loads are lowered (which improves register usage because direct loads are not moved) - it doesn't have to deal with complexities of variables Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36018>	2025-07-10 16:37:44 +00:00
Marek Olšák	070aaa1c9f	nir/lower_io: validate that location and num_slots fit in the bitfields Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35999>	2025-07-08 14:01:56 +00:00
Marek Olšák	80ed5653a7	nir: invert the meaning of has_indirect_* flags in nir_lower_io_passes Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35945>	2025-07-08 06:11:44 +00:00
Marek Olšák	a065a09d22	glsl: don't lower outputs to temps unconditionally It's done later in nir_lower_io_passes only for shader stages not supporting indirect access. Unfortunately we have to add a hack into nir_lower_io_passes to get rid of output loads. A later commit will remove it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35945>	2025-07-08 06:11:44 +00:00
Marek Olšák	1754507d49	nir: rename nir_lower_io_to_temporaries -> nir_lower_io_vars_to_temporaries Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	aefea49dad	nir: move lots of code from nir_lower_io.c into new nir_lower_explicit_io.c nir_lower_io is just for regular inputs/outputs. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:52 +00:00
Marek Olšák	5bd3e0c08c	nir: move nir_assign_var_locations to freedreno (its only use) Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:52 +00:00
Marek Olšák	c8cda0dc1a	nir: move nir_io_add_const_offset_to_base into its own file Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:51 +00:00
Marek Olšák	d78070ded5	nir: move nir_io_add_intrinsic_xfb_info into its own file Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:51 +00:00
Alyssa Rosenzweig	caa0854da8	nir: plumb load_global_bounded this lets the backend implement bounded loads (i.e. robust SSBOs) in a way that's more clever than a full branch. similar idea to load_global_constant_bound which should eventually be merged into this. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Job Noorman <job@noorman.info> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35720>	2025-06-26 16:41:53 +00:00
Lionel Landwerlin	16fca611d7	nir: add new intel ssbo intrinsics Similar to ir3 ones, to optimize offsets in the backend. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35252>	2025-06-22 10:55:23 +00:00
Rohan Garg	909ec6ff1f	nir/lower_io: add io_offset support for more intrinsics This will be used by upcoming changes in the intel compiler. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35252>	2025-06-22 10:55:22 +00:00
Marek Olšák	bf2ed20eb9	nir: remove unused nir_io_semantics::invariant Acked-by: Alyssa on IRC Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	deda05e2b7	nir: move nir_lower_color_inputs into radeonsi it's the only user Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34492>	2025-05-14 20:19:17 +00:00
Marek Olšák	a1ee6d6730	nir: fix gathering color interp modes in nir_lower_color_inputs Fixes: `709ebd82` ("amd: expose nir_io_mix_convergent_flat_with_interpolated") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12800 Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34942>	2025-05-13 00:05:37 -04:00
Caio Oliveira	33295b2249	spirv, nir: Allow non-Aliased workgroup memory blocks Allocate space for the aliased region first, then allocate the non-Aliased blocks in sequence after that. SPV_KHR_workgroup_memory_explicit_layout extension added support for having Blocks of workgroup (shared) memory, which include layout decoration. For that extension all such blocks must be decorated with Aliased. SPV_KHR_untyped_pointers extension lifts that requirement, allowing blocks that don't alias in workgroup memory. They are still explicitly laid out. The motivation is that untyped pointers provide a different mechanism to obtain the same effect as the Aliased blocks. Instead of having two Aliased variables with different types, have a single variable and use an untyped pointer with a different type to access it. This patch is a preparation for supporting untyped pointers. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34139>	2025-04-17 19:13:18 +00:00
Caio Oliveira	fd0a7efb5a	spirv, nir: Delay calculation of shared_size when using explicit layout Move the calculation to nir_lower_vars_to_explicit_types(). This consolidates the check of shader_info::shared_memory_explicit_layout in a single place instead of in all drivers. This is motivated by SPV_KHR_untyped_pointers. Before that extension we had essentially two modes for shared memory variables - No layout decorations in the SPIR-V, and both internal layout and driver location was _given by the driver_. - Explicitly laid out, i.e. they are blocks, and decorated with Aliased. Because they all alias, we could assign them driver location directly to the start of the shared memory. With the untyped pointers extension, there's a third option, to be added by a later commit - Explicitly laid out, i.e. they are blocks, and NOT decorated with Aliased. Driver location is _given by the driver_. Blocks with and without Aliased can be mixed. The driver location of multiple blocks that don't alias depend on alignment that is driver-specific, which we can more easily do from the nir_lower_vars_to_explicit_types() that already has access to a function to obtain such value. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> (hk) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> (v3dv) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (anv/hasvk) Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (panvk) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (radv) Reviewed-by: Rob Clark <robdclark@gmail.com> (tu) Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34139>	2025-04-17 19:13:17 +00:00
Faith Ekstrand	7ac6ec2ceb	nir: Add a get_io_index_src() helper Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33402>	2025-03-01 08:44:15 +00:00
Alyssa Rosenzweig	9a58a8257e	treewide: Switch to nir_progress Via the Coccinelle patch at the end of the commit message, followed by sed -ie 's/progress = progress \| /progress \|=/g' $(git grep -l 'progress = prog') ninja -C ~/mesa/build clang-format cd ~/mesa/src/compiler/nir && clang-format -i *.c agxfmt @@ identifier prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} -return prog; +return nir_progress(prog, impl, metadata); @@ expression prog_expr, impl, metadata; @@ -if (prog_expr) { -nir_metadata_preserve(impl, metadata); -return true; -} else { -nir_metadata_preserve(impl, nir_metadata_all); -return false; -} +bool progress = prog_expr; +return nir_progress(progress, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -nir_metadata_preserve(impl, prog ? (metadata) : nir_metadata_all); -return prog; +return nir_progress(prog, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -nir_metadata_preserve(impl, prog ? 
(metadata) : nir_metadata_all); +nir_progress(prog, impl, metadata); @@ expression impl, metadata; @@ -nir_metadata_preserve(impl, metadata); -return true; +return nir_progress(true, impl, metadata); @@ expression impl; @@ -nir_metadata_preserve(impl, nir_metadata_all); -return false; +return nir_no_progress(impl); @@ identifier other_prog, prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} -other_prog \|= prog; +other_prog = other_prog \| nir_progress(prog, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +nir_progress(prog, impl, metadata); @@ identifier other_prog, prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -other_prog = true; -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +other_prog = other_prog \| nir_progress(prog, impl, metadata); @@ expression prog_expr, impl, metadata; identifier prog; @@ -if (prog_expr) { -nir_metadata_preserve(impl, metadata); -prog = true; -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +bool impl_progress = prog_expr; +prog = prog \| nir_progress(impl_progress, impl, metadata); @@ identifier other_prog, prog; expression impl, metadata; @@ -if (prog) { -other_prog = true; -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +other_prog = other_prog \| nir_progress(prog, impl, metadata); @@ expression prog_expr, impl, metadata; identifier prog; @@ -if (prog_expr) { -prog = true; -nir_metadata_preserve(impl, metadata); -} else { -nir_metadata_preserve(impl, nir_metadata_all); -} +bool impl_progress = prog_expr; +prog = prog \| nir_progress(impl_progress, impl, metadata); @@ expression prog_expr, impl, metadata; @@ -if (prog_expr) { -nir_metadata_preserve(impl, metadata); -} else { 
-nir_metadata_preserve(impl, nir_metadata_all); -} +bool impl_progress = prog_expr; +nir_progress(impl_progress, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -nir_metadata_preserve(impl, metadata); -prog = true; +prog = nir_progress(true, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} -return prog; +return nir_progress(prog, impl, metadata); @@ identifier prog; expression impl, metadata; @@ -if (prog) { -nir_metadata_preserve(impl, metadata); -} +nir_progress(prog, impl, metadata); @@ expression impl; @@ -nir_metadata_preserve(impl, nir_metadata_all); +nir_no_progress(impl); @@ expression impl, metadata; @@ -nir_metadata_preserve(impl, metadata); +nir_progress(true, impl, metadata); squashme! sed -ie 's/progress = progress \| /progress \|=/g' $(git grep -l 'progress = prog') Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33722>	2025-02-26 15:19:53 +00:00
Alyssa Rosenzweig	91872c9c51	nir: clang-format Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33722>	2025-02-26 15:19:53 +00:00
Timur Kristóf	65139305e2	nir: Don't use deprecated NIR_PASS_V macro anymore. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:16 +01:00
Marek Olšák	7b55ee999d	nir: don't set num_slots/src/dest_type/write_mask when they're set automatically to those values Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32779>	2025-01-06 19:09:17 +00:00
Caterina Shablia	f4fcfa8016	pan,nir: introduce load_attribute_pan load_attribute_pan is a panfrost-specific intrinsic for loading vertex attributes. Takes explicit vertex and instance IDs which we need in order to implement vertex attribute divisor with non-zero base instance on v9+. Passes which are used by panvk are modified to be aware of load_attribute_pan. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32039>	2024-12-18 08:33:16 +00:00
Benjamin Lee	b01afd06cd	nir: update docs for nir_get_io_arrayed_index_src Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31704>	2024-12-09 20:31:49 +00:00
Benjamin Lee	74ccf6cbdc	nir: add option to use compact view indices In panvk we pass absolute view indices to the hardware, so we need to do the conversion from compacted to absolute at some point. Emitting absolute indices from nir_lower_multiview initially looks like the simplest option, but nir_lower_io_to_temporaries will emit a write for every element of array varyings. This results in unnecessary writes to disabled views. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31704>	2024-12-09 20:31:49 +00:00
Benjamin Lee	becb014d27	nir: treat per-view outputs as arrayed IO This is needed for implementing multiview in panvk, where the address calculation for multiview outputs is not well-represented by lowering to nir_intrinsic_store_output with a single offset. The case where a variable is both per-view and per-{vertex,primitive} is now unsupported. This would come up with drivers implementing NV_mesh_shader or using nir_lower_multiview on geometry, tessellation, or mesh shaders. No drivers currently do either of these. There was some code that attempted to handle the nested per-view case by unwrapping per-view/arrayed types twice, but it's unclear to what extent this actually worked. ANV and Turnip both rely on per-view outputs being assigned a unique driver location for each view, so I've added on option to configure that behavior rather than removing it. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31704>	2024-12-09 20:31:49 +00:00
Marek Olšák	3effa3d53b	nir/lower_io_passes: lower indirect IO for TCS nir_lower_io_to_temporaries doesn't do anything and gives up when it gets TCS. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Marek Olšák	25d4943481	nir: make use_interpolated_input_intrinsics a nir_lower_io parameter This will need to be set to true when the GLSL linker lowers IO, which can later be unlowered by st/mesa, and then drivers can lower it again without load_interpolated_input. Therefore, it can't be a global immutable option. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32229>	2024-11-20 02:45:37 +00:00
Marek Olšák	dacae272bf	nir: add nir_io_semantics::fb_fetch_output_coherent Lowering IO should preserve this. Freedreno needs it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Kenneth Graunke	95bc42af74	nir: Use load_global_constant for reorderable nir_var_mem_global access The main difference between load_global and load_global_constant is that the latter can be reordered arbitrarily. If the access being lowered is already tagged as being reorderable, then we can preserve that by using the load_global_constant intrinsics instead of load_global. This gives us more flexibility. On Intel, this lets us use the load_global_constant_uniform_block_intel intrinsic for doing convergent block loads in more cases. This nets us significant reductions in spill/fills: Borderlands 3 on Lunarlake sees spills/fills reduced by 53%. Alchemist sees a 13% reduction. Improves performance of Borderlands 3 DX12 on Intel Battlemage by around 44%. Improves Hogwarts Legacy by around 14%. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31995>	2024-11-18 12:55:47 +00:00
Marek Olšák	b71edce77a	nir/lower_io: change INTERP_MODE_NONE to SMOOTH when NONE means SMOOTH to improve CSE of load_barycentric_* and IO vectorization. This is only for load_interpolated_input, which can never be FLAT. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31968>	2024-11-05 14:13:40 +00:00
Pierre-Eric Pelloux-Prayer	60578df33a	nir: skip offset=0 in nir_io_add_const_offset_to_base When offset=0, the pass was a no-op but was setting the progress flag which could cause infinite loops when this pass is going to be added to gl_nir_opts. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31684>	2024-10-25 13:36:54 +00:00
Marek Olšák	09e64e3682	nir/opt_shrink_vectors: shrink memory loads, not just IO The problem with radeonsi+ACO is that UBO loads from vec4 uniforms using only 1 component always load all 4 components. This fixes that. We are only interested in shrinking UBO and SSBO loads, but I added more intrinsics because why not. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29384>	2024-09-26 03:01:38 +00:00
Marek Olšák	b2d32ae246	nir: add nir_intrinsic_load_per_primitive_input, split from io_semantics flag Instead of having 1 bit in nir_io_semantics indicating a per-primitive FS input, add a dedicated intrinsic for it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Alyssa Rosenzweig	da752ed7c1	treewide: use nir_def_replace sometimes Two Coccinelle patches here. Didn't catch nearly as much as I would've liked but it's a start. Coccinelle patch: @@ expression intr, repl; @@ -nir_def_rewrite_uses(&intr->def, repl); -nir_instr_remove(&intr->instr); +nir_def_replace(&intr->def, repl); Coccinelle patch: @@ identifier intr; expression instr, repl; @@ nir_intrinsic_instr *intr = nir_instr_as_intrinsic(instr); ... -nir_def_rewrite_uses(&intr->def, repl); -nir_instr_remove(instr); +nir_def_replace(&intr->def, repl); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com> [broadcom] Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> [lima] Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> [etna] Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com> [r300] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29817>	2024-06-21 15:36:56 +00:00
Alyssa Rosenzweig	15257b65c6	treewide: use nir_metadata_control_flow Via Coccinelle patch: @@ @@ -nir_metadata_block_index \| nir_metadata_dominance +nir_metadata_control_flow ...plus some manual fixups for call sites missed by coccinelle. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Juan A. Suarez Romero <jasuarez@igalia.com> [broadcom] Acked-by: Vasily Khoruzhick <anarsoul@gmail.com> [lima] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29745>	2024-06-17 16:28:14 -04:00
Karol Herbst	358e09f9ff	nir: add global_atomic_2x32 variants to nir_get_io_offset_src_number Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29711>	2024-06-17 10:07:56 +00:00
Faith Ekstrand	b107240474	nir: Add some new _nv intrinsics The ldc_nv and ldcx_nv intrinsics correspond to the index and bindless forms of NVIDIA's LDC instruction, respectively. ldc_nv is pretty much load_ubo without some of the unnecessary constant bits while ldcx_nv takes a 64-bit bindless handle instead of an index. The other two give us a little control over register allocation at the NIR level to ensure that LDCX handles are placed in uniform registers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29591>	2024-06-13 20:43:45 +00:00
Timur Kristóf	0ea2bad74d	nir/lower_io: Add option to implement mediump as 32-bit. For drivers that don't lower mediump shader inputs / outputs to 16-bit, it's better to ignore the mediump flag completely, letting mediump inputs / outputs work like normal 32-bit IO. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29435>	2024-05-30 12:57:20 +00:00
Italo Nicola	62c8e58f39	nir: add {load,store}_global_etna intrinsics Acked-by: David Heidelberg <david@ixit.cz> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Signed-off-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29402>	2024-05-27 17:58:51 +00:00
Mike Blumenkrantz	3541ed8502	nir: store variable names to io instrs during io lowering this creates a reference between variables and their access instrs before the variables are deleted, which improves debugging Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28814>	2024-04-24 12:35:59 +00:00
Timur Kristóf	ecbf3464f6	nir: Record per-primitive inputs without variables. Previously, this information would have been lost when the shader has no I/O variables. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28489>	2024-04-02 23:00:01 +00:00
Alyssa Rosenzweig	1773eb329c	nir: add offset to load_coefficients_agx for indirect varyings Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:19 +00:00
Boris Brezillon	544f76dd13	nir: Extend nir_get_io_offset_src_number() to support load_push_constant Will be needed to support push constants in nir_lower_mem_access_bit_sizes(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28175>	2024-03-26 11:09:37 +01:00
Marek Olšák	2034cf87c5	nir/lower_io: add nir_io_semantics::interp_explicit_strict This preserves the misnamed "per_vertex" flag in lowered IO. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28312>	2024-03-22 22:39:50 +00:00
Marek Olšák	9b819adbd8	glsl/linker,st/mesa: enable nir_opt_varyings and lower IO in the linker The varying linker isn't changed. The passes are executed after linking varyings and before linking uniforms if nir->options->lower_io_variables is true. nir_opt_varyings can move uniforms between shaders and cause them to be DCE'd. It requires moving IO deref lowering from st/mesa into the GLSL linker and nir_opt_varyings should be added at the same time because IO deref lowering alone would disable IO optimizations in st/mesa such as compaction. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26819>	2024-03-15 19:55:46 +00:00
Mike Blumenkrantz	9e2c7314f2	nir/lower_io: fix handling for compact arrays with indirect derefs this logic relies on constant indexing for compact arrays, but this is frequently not the case for compact array builtins (e.g., gl_TessLevelOuter). the usual strategy of lowering to temps isn't viable in TCS, which means io lowering has to be able to handle indirect access to these builtins without crashing cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27534>	2024-02-13 16:13:13 +00:00
Marek Olšák	5ffa4d879c	nir: add a lower_mediump_io callback into options This will be called by the GLSL linker before nir_opt_varyings. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26918>	2024-02-02 16:45:51 -05:00

309 commits