fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-30 18:18:14 +02:00

Author	SHA1	Message	Date
Georg Lehmann	15d754fefa	nir: add load_front_face_fsign Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32791>	2024-12-30 22:31:34 +00:00
Caterina Shablia	f4fcfa8016	pan,nir: introduce load_attribute_pan load_attribute_pan is a panfrost-specific intrinsic for loading vertex attributes. Takes explicit vertex and instance IDs which we need in order to implement vertex attribute divisor with non-zero base instance on v9+. Passes which are used by panvk are modified to be aware of load_attribute_pan. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32039>	2024-12-18 08:33:16 +00:00
Rhys Perry	26790e90d3	nir: make ballot ALU and mbcnt_amd operations reorderable These can be lowered to ALU and load_subgroup_invocation, all of which are reorderable. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32512>	2024-12-11 14:47:12 +00:00
Rhys Perry	5368569d06	nir: make load_helper_invocation non-reorderable This can't be moved to after demote, so it's not reorderable. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32512>	2024-12-11 14:47:12 +00:00
Alyssa Rosenzweig	816c14d33d	nir: add printf_abort intrinsic abort() for the gpu, implemented with the printf infrastructure since they go together. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32564>	2024-12-10 19:13:07 +00:00
Benjamin Lee	74ccf6cbdc	nir: add option to use compact view indices In panvk we pass absolute view indices to the hardware, so we need to do the conversion from compacted to absolute at some point. Emitting absolute indices from nir_lower_multiview initially looks like the simplest option, but nir_lower_io_to_temporaries will emit a write for every element of array varyings. This results in unnecessary writes to disabled views. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31704>	2024-12-09 20:31:49 +00:00
Benjamin Lee	becb014d27	nir: treat per-view outputs as arrayed IO This is needed for implementing multiview in panvk, where the address calculation for multiview outputs is not well-represented by lowering to nir_intrinsic_store_output with a single offset. The case where a variable is both per-view and per-{vertex,primitive} is now unsupported. This would come up with drivers implementing NV_mesh_shader or using nir_lower_multiview on geometry, tessellation, or mesh shaders. No drivers currently do either of these. There was some code that attempted to handle the nested per-view case by unwrapping per-view/arrayed types twice, but it's unclear to what extent this actually worked. ANV and Turnip both rely on per-view outputs being assigned a unique driver location for each view, so I've added on option to configure that behavior rather than removing it. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31704>	2024-12-09 20:31:49 +00:00
Job Noorman	e6c63a88fb	nir: add read_getlast_ir3 intrinsic Like read_first_invocation but using getlast. Note that I intentionally used the name of the ir3 instruction in the name as its semantics are tricky to exactly describe otherwise. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31731>	2024-11-29 16:22:47 +00:00
Caterina Shablia	7ca8c19246	Revert "nir: introduce instance_index system value" This reverts commit `b9be1f1f20`. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32332>	2024-11-28 07:53:01 +00:00
Caterina Shablia	b9be1f1f20	nir: introduce instance_index system value The semantics of this newly introduced system value match Vulkan's InstanceIndex exactly, and are equivalent to instance_id + base_instance. Some hardware, such as Mali Valhall or later, only provides instance id offset by base_instance. Introducing a new system value to represent this, rather than handling the mismatch when lowering to BIR lets us use NIR to eliminate redundant arithmetic that would follow from mismatched semantics, e.g. instance_id could be lowered to instance_index - base_instance, so expressions such as instance_id + base_instance would be optimized to a simple instance_index. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32158>	2024-11-19 09:18:47 +00:00
Rhys Perry	d3ae1842a2	aco,ac/nir: flag loads to use smem in NIR This pass will be re-used later. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Samuel Pitoiset	a85f0143e0	nir: add nir_intrinsic_debug_break instruction This instruction can be used as a breakpoint in shaders to enter a trap if supported by the driver. It will be used to handle NonSemantic.DebugBreak in SPIR-V. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32061>	2024-11-12 16:05:17 +00:00
Marek Olšák	ee452129c6	nir: add cull_triangles_, cull_lines_ prefixes to viewport_xy_scale_and_offset for radeonsi Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31865>	2024-10-29 16:47:44 +00:00
Marek Olšák	2227f5be9d	nir: rename load_cull_small_primitive_precision -> triangle, add line_precision for radeonsi Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31865>	2024-10-29 16:47:44 +00:00
Marek Olšák	0914e0d02f	nir: rename load_cull_small_primitives -> triangles, add load_cull_small_lines for radeonsi Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31865>	2024-10-29 16:47:44 +00:00
Lionel Landwerlin	97b17aa0b1	brw/nir: rework inline_data_intel to work with compute This intrinsic was initially dedicated to mesh/task shaders, but the mechanism it exposes also exists in the compute shaders on Gfx12.5+. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31508>	2024-10-17 19:35:59 +00:00
Job Noorman	4556b18f51	nir: add shuffle_{xor,up,down}_uniform_ir3 intrinsics These are like shuffle_{xor,up,down} except they expect a dynamically uniform index. This is necessary since the ir3 shfl instruction does not work with a divergent index. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31501>	2024-10-16 22:05:10 +00:00
Georg Lehmann	bcfc5c09fa	amd: add offset to is_subgroup_invocation_lt_amd Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31184>	2024-09-26 14:29:13 +00:00
Timothy Arceri	6bb6b0e5ad	nir: add nir_intrinsic_deref_implicit_array_length intrinsic This will be used to handle .length() calls on unsized arrays Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:44 +00:00
Georg Lehmann	e0bcab953d	nir: add amd shared append/consume Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31075>	2024-09-19 16:21:47 +00:00
Boris Brezillon	eeb3512498	nir/lower_ssbo: Extend the load_ssbo_address intrinsic to pass an offset On Mali(Valhall), the bounds checking can be done when in hardware, but for this to work properly, we need to pass the offset to the nir_load_ssbo_address() intrinsic. Add an offset source to the intrinsic, and adjust the lowering pass to conditionally lower the offset addition. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31164>	2024-09-18 13:45:57 +00:00
Alyssa Rosenzweig	f977c52b84	ail: swallow up formats ail is a more sensible place for the format tables to live. this does create a bit of dependency soup but hey. nfc Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30981>	2024-09-02 23:27:14 +00:00
Alyssa Rosenzweig	afc7557cb6	nir,agx: make block image store an image() intrinsic so we can do a bindless version Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30981>	2024-09-02 23:27:14 +00:00
Karol Herbst	fc88f04ba1	vtn, nir: handle OpImageQueryLevels on images This is needed for cl_khr_mipmap_image, specifically the OpenCL C function get_image_num_mip_levels. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30834>	2024-08-27 15:06:17 +00:00
Lionel Landwerlin	cf986dd589	nir: remove unused intel intrinsics Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30713>	2024-08-22 19:44:40 +00:00
Lionel Landwerlin	fbafa9cabd	intel/nir: remove load_global_const_block_intel intrinsic load_global_constant_uniform_block_intel is equivalent in terms of loading, then for the predicate we just do a bcsel afterward in places where that is required. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30659>	2024-08-16 11:12:39 +00:00
Job Noorman	fbd2c80671	ir3: rename @store_uniform_ir3 to @store_const_ir3 Uniforms are a legacy thing and this intrinsic was only used to store to the const file so the new naming is less confusing. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Job Noorman	e0bad1dd20	ir3: replace @load_uniform by new @load_const_ir3 intrinsic Uniforms are a legacy thing and this intrinsic was only used to load from const registers so the new naming is less confusing. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341>	2024-08-15 12:07:27 +00:00
Alyssa Rosenzweig	e754e54f88	nir: model AGX explicit coordinate intrinsics I don't know what Apple calls these, so we're using the name "explicit coordinates". AGX has instructions for loading/stores register <---> tilebuffer ---> storage images. Usually these are used in the fragment shader and end-of-tile shader to implement colour attachments, with implicitly specified coordinates based on the shader stage. However they can also be used in compute shaders with explicitly specified coordinates ("imageblocks" in Apple parlance). Model this in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30633>	2024-08-12 18:46:31 -04:00
Alyssa Rosenzweig	f04ae930d9	nir,agx: add "active threads in subgroup" intrinsic Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30633>	2024-08-12 18:45:58 -04:00
Alyssa Rosenzweig	24b722a692	nir: add derivative intrinsics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30565>	2024-08-08 15:26:07 +00:00
Karol Herbst	14ea102175	nir: add load_global_size intrinsic There is no need to compute it in the shader as the result is known at runtime already. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Tested-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30467>	2024-08-01 17:43:42 +00:00
Timothy Arceri	017770ff14	nir: add nir_tex_src_{sampler,texture}_deref_intrinsic To be used as a placeholder until after function inlining so we can replace function params with bindless handles if needed. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30315>	2024-07-29 00:06:10 +00:00
Marek Olšák	b2d32ae246	nir: add nir_intrinsic_load_per_primitive_input, split from io_semantics flag Instead of having 1 bit in nir_io_semantics indicating a per-primitive FS input, add a dedicated intrinsic for it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Georg Lehmann	2d3f536174	aco,nir: add dpp16_shift_amd intrinsic Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24650>	2024-07-17 15:04:38 +00:00
Marek Olšák	1b2cd628b8	nir: rename ordered_xfb_counter_add_gfx12_amd -> ordered_add_loop_gfx12_amd because it can also be used by compute. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30063>	2024-07-13 01:32:48 +00:00
Connor Abbott	45a57fa735	ir3: Plumb through descriptor prefetch intrinsics Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	ec37e65a2d	ir3: Introduce elect_any_ir3 For preambles, we don't actually care which invocation we get, so we don't have to enable helper invocations when the preamble uses "getone." Introduce a new intrinsic with the right semantics and plumb it through. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29914>	2024-06-26 17:40:15 +00:00
Ian Romanick	6b678d32cb	nir: dpas_intel second source can have different number of components The number of components for the second source is -1 to avoid validation of its value. Some supported configurations will have the component count of that matrix different than the others. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28834>	2024-06-25 14:17:47 -07:00
Faith Ekstrand	9d3b144018	nir: Add a nir_intrinsic_use for unit tests Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28793>	2024-06-19 01:56:22 +00:00
Alyssa Rosenzweig	ae3af4c73a	nir: document restriction on load_smem_amd constantness This came up while reviewing https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29398 ... Possibly this intrinsic should be renamed to load_smem_constant_amd for consistency with load_global_constant. But if we're not going to convey constantness in the intrinsic name, let's at least document the restriction, because NIR's optimizer relies on it. (I didn't inspect every call site, but it looks like load_smem_amd is just used for descriptor loads so there's no bug to fix.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29743>	2024-06-17 21:17:09 +00:00
Daniel Schürmann	9b1a748b5e	nir: remove nir_intrinsic_discard The semantics of discard differ between GLSL and HLSL and their various implementations. Subsequently, numerous application bugs occurred and SPV_EXT_demote_to_helper_invocation was written in order to clarify the behavior. In NIR, we now have 3 different intrinsics for 2 things, and while demote and terminate have clear semantics, discard still doesn't and can mean either of the two. This patch entirely removes nir_intrinsic_discard and nir_intrinsic_discard_if and replaces all occurences either with nir_intrinsic_terminate{_if} or nir_intrinsic_demote{_if} in the case that the NIR option 'discard_is_demote' is being set. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27617>	2024-06-17 19:37:16 +00:00
Faith Ekstrand	7e3d157bee	nak,nir: Drop r2ur_nv in favor of as_uniform Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29737>	2024-06-15 06:14:27 +00:00
Job Noorman	d3f8de791d	ir3: lower SSBO access imm offsets Add the BASE index to the load/store_ssbo_ir3 intrinsic to store an immediate offset. This offset is encoded in the corresponding fields of isam.v/ldib.b/stib.b. One extra optimization is implemented: whenever the regular offset is also a constant, the total offset (regular plus immediate) is aligned down to a multiple of the max immediate offset and this is used as the regular offset while the immediate is set to the remainder. This ensures that the register used for the regular offset can often be reused among multiple contiguous accesses. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28664>	2024-06-14 17:12:59 +00:00
Faith Ekstrand	b107240474	nir: Add some new _nv intrinsics The ldc_nv and ldcx_nv intrinsics correspond to the index and bindless forms of NVIDIA's LDC instruction, respectively. ldc_nv is pretty much load_ubo without some of the unnecessary constant bits while ldcx_nv takes a 64-bit bindless handle instead of an index. The other two give us a little control over register allocation at the NIR level to ensure that LDCX handles are placed in uniform registers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29591>	2024-06-13 20:43:45 +00:00
Alyssa Rosenzweig	5f72234745	asahi: split param structs for GS internal kernel this simplifies state management consdierably Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29607>	2024-06-07 16:57:03 +00:00
Konstantin Seurer	a93f95c69c	radv/rt: Remove load_rt_dynamic_callable_stack_base_amd Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28619>	2024-05-28 12:23:45 +00:00
Italo Nicola	62c8e58f39	nir: add {load,store}_global_etna intrinsics Acked-by: David Heidelberg <david@ixit.cz> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Signed-off-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29402>	2024-05-27 17:58:51 +00:00
Lionel Landwerlin	ecbec25e84	intel/nir: add reloc delta to load_reloc_const_intel intrinsic We'll use the delta for an upcoming internal printf mechanism, where the PARAM_IDX will be the base printf reloc identifier and the BASE will be the string id. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:38 +00:00
Lionel Landwerlin	2be28ee58a	nir: add a base offset for printf indexing This will allow a driver to use a single table of printf strings across all shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25814>	2024-05-15 13:13:37 +00:00

1 2 3 4 5 ...

497 commits