fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-30 05:18:16 +02:00

Author	SHA1	Message	Date
Karol Herbst	9d90cbc314	nak: add input predicate to load_global_nv and OpLd This is new in SM75 (Turing). Let's use it because it allows us to get rid of the if/else around bound checked global loads. There are some changes in fossils, but it seems that's mostly due to CFG optimizations doing things a bit differently? Totals: CodeSize: 9442152688 -> 9442133184 (-0.00%); split: -0.00%, +0.00% Static cycle count: 6120910991 -> 6120907718 (-0.00%); split: -0.00%, +0.00% Spills to reg: 184789 -> 184810 (+0.01%) Fills from reg: 223831 -> 223860 (+0.01%); split: -0.00%, +0.01% Totals from 334 (0.03% of 1163204) affected shaders: CodeSize: 22020752 -> 22001248 (-0.09%); split: -0.10%, +0.01% Static cycle count: 26582978 -> 26579705 (-0.01%); split: -0.01%, +0.00% Spills to reg: 3110 -> 3131 (+0.68%) Fills from reg: 3401 -> 3430 (+0.85%); split: -0.03%, +0.88% Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40272>	2026-03-10 00:10:05 +00:00
Lionel Landwerlin	e14d6b535c	brw/nir: add new intrinsics to load data from the indirect address This address is delivered on Gfx12.5+ in compute/mesh/task shaders from the command stream instruction. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40174>	2026-03-06 06:34:43 +00:00
Lionel Landwerlin	7b1533414a	brw/nir: enable constant offsets for global_constant_uniform_block_intel Will be useful to retain the base offset added in `0e9453291c` ("brw: improve push constant loading using base offsets") once we move push constant data loading into NIR. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40174>	2026-03-06 06:34:43 +00:00
Lionel Landwerlin	7f19814414	brw/nir: handle inline_data_intel more like push_data_intel It's pretty much the same mechanism, except it's a different register location. With this change we gain indirect loading support. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39405>	2026-02-25 10:44:09 +00:00
Faith Ekstrand	e3dc3dccd6	pan/fb: Add a common FB load shader builder One of the advantages to this new FB load shader, apart from it being common, is that it's able to properly handle partial tile loads. Instead of doing the force_preload/clear dance that PanVK is currently doing, these shaders are clever enough to detect whether or not they're inside the Vulkan render area and clear the inside while loading the border pixels. In order for this to work, there are two new intrinsics which provide the framebuffer bounding box and the clear values. We need this in order to handle partial loads correctly. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39759>	2026-02-23 21:00:01 +00:00
Marek Olšák	a9df891bc6	nir: allow get_ssbo_size to return a 64-bit result to match get_ubo_size, and to support HW where SSBOs can have a 64-bit size. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39743>	2026-02-16 12:59:36 +00:00
Marek Olšák	c151402f35	nir: add ACCESS to get_ubo_size so that we can set NON_UNIFORM Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39743>	2026-02-16 12:59:36 +00:00
Marek Olšák	0a9bdcac79	ac: lower load_workgroup_ids for ACO in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39638>	2026-02-13 15:33:19 +00:00
Karol Herbst	18bf6fb96d	nir: add nvidias shared memory non unform address shift Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39709>	2026-02-11 03:41:23 +01:00
Kenneth Graunke	beb4b78fe7	intel: Rename intel_msaa_flags to intel_fs_config This started out as dynamic configuration for MSAA related state, but has since expanded to cover many dynamic fragment shader options. We rename it to intel_fs_config, similar to intel_tess_config, to better indicate its purpose. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39748>	2026-02-06 20:51:43 -08:00
Karol Herbst	4add3959e9	nir: add BASE to nvidia memory intrinsics Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39525>	2026-02-03 22:23:50 +00:00
Karol Herbst	e779538ad2	nir: add nvidia IO intrinsics Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39525>	2026-02-03 22:23:50 +00:00
Marek Olšák	44bc1e6bf4	nir: add dest_type to load_buffer_amd for lowering the result to 16 bits Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39474>	2026-02-02 17:56:52 +00:00
Iván Briano	5b48805b42	brw: fix local_invocation_index with quad derivaties on mesh/task shaders For mesh/task shaders, the thread payload provides a local invocation index, but it's always linear so it doesn't give the correct value when quad derivatives are in use. The lowering pass where all of this is done correctly for compute shaders assumes load_local_invocation_index will be lowered in the backend for mesh/task, calculates the values for the quads correctly but then avoid replacing the original intrinsic and we remain with the wrong results. Add an intel specific intrinsic and always lower the generic one to that (or whatever else was calculated) to avoid ambiguities and fix the value for quad derivatives. Fixes future CTS tests using mesh/task shaders under: dEQP-VK.spirv_assembly.instruction.compute.compute_shader_derivatives.* Fixes: `d89bfb1ff7` ("intel/brw: Reorganize lowering of LocalID/Index to handle Mesh/Task") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39276>	2026-01-27 22:28:19 +00:00
Kenneth Graunke	c2f03ba12f	nir: Add memory modes to URB load intrinsics This makes it easier for NIR passes to distinguish between inputs and outputs without having to reason about which URB handle source was passed to the intrinsic. It probably also makes it a bit easier for humans to read the NIR too. v2: Don't add memory mode to store intrinsics. It's always output. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39250>	2026-01-27 16:08:36 +00:00
Lionel Landwerlin	a19e949824	brw: move coarse_z computation to NIR So that we can print it easily with debug printfs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	98194dfa0b	nir: add intrinsics for Z calculation in shaders with FSR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Natalie Vock	30f6eacfad	radv/rt: Call ahit/isec shaders Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39314>	2026-01-20 21:49:55 +00:00
Faith Ekstrand	d6556a580f	nir,pan: Add and implement a new store_tile_pan intrinsic Like we just did with load_tile_pan, this maps directly to ST_TILE in the hardware. This is more versatile and lets us do more of our lowering in NIR. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:13 +00:00
Faith Ekstrand	11b6cd2f2c	nir,pan: Rework the pafrost tile load intrinsic Instead of making it explicitly about outputs, this switchies it to being a NIR version of LD_TILE. It means we have to do a bit of work in NIR and add a builder helper but the end result is something much more versatile. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:13 +00:00
Konstantin Seurer	38d0bd7dd3	nir: Add an assert_eq intrinsic for testing nir_opt_algebraic During the test this will compares both sources and fails the test if they are not equal. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39076>	2026-01-15 19:09:37 +00:00
Emma Anholt	ed8676dc28	nir: Rename the unit_test_*_amd intrinics to be un-vendored. We'll reuse these from the nir_opt_algebraic_pattern_test. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39076>	2026-01-15 19:09:37 +00:00
Emma Anholt	5bd669868f	nir: Add a note on how load_sample_pos_from_id works. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38911>	2026-01-15 07:52:14 +00:00
Natalie Vock	c5d796c902	radv/rt: Use function call structure in NIR lowering Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29580>	2026-01-14 14:19:06 +00:00
Natalie Vock	9d2c3c3db2	nir/intrinsics: Add incoming/outgoing payload load/store instructions With RT function calls, these are going to get lowered to: - load/store_param (incoming payload) - load/store_var (outgoing payload) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29580>	2026-01-14 14:19:05 +00:00
John Anthony	50682ec22c	pan: Use correct architecture name for v12+ The official name for the architecture after Valhall is 'Arm 5th Gen'. In code we can use 'FIFTHGEN' or 'fifthgen', while in documentation and printed output we should use 'Arm 5th Gen' or '5th Gen'. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39267>	2026-01-13 13:28:34 +01:00
Lars-Ivar Hesselberg Simonsen	ce3e13774a	nir: Add channels to pan texel_buf intrinsics Rather than loading a single 64bit channel with load_texel_buf_index_address_pan, load three channels of 32bit each. The last channel is required by the next commit. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38490>	2026-01-13 10:00:58 +01:00
Faith Ekstrand	6fc1030e4f	nir: Add some new panfrost fragment shader intrinsics Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39244>	2026-01-12 18:14:43 +00:00
Lionel Landwerlin	6d19b898e7	anv/brw: prep work for SIMD32 ray queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	26e4632f64	nir: add a new push_data_intel intrinsic We're finally moving on from misusing various intrinsics : - load_uniform - load_push_constant - load_ubo* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:46 +00:00
Lionel Landwerlin	799258fdde	nir: use load() helper for inline_data_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:45 +00:00
Juan A. Suarez Romero	a6330ed4d0	nir: add ACCESS to load_uniforms v3d/v3dv drivers require this information. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38759>	2026-01-08 12:59:44 +00:00
Marek Olšák	99a42bdd4b	nir,radeonsi: simplify load_color0 & load_color1 intrinsics and shader_info We don't need the shader_info fields anymore. sample and centroid fields are unused. The interp field is already available from si_shader_info::color_interpolate. The loads don't need to be sysvals. Add also the _amd suffix. Don't handle it in st_nir_lower_drawpixels either because the intrinsics are created much later in compilation now. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38802>	2026-01-01 18:30:28 +00:00
Arcady Goldmints-Orlov	68bb5d9e49	kk: enable shaderClipDistance Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Since Metal doesn't pass clip distance into the fragment shader, we have to do it ourselves. The CLIP_DIST0/1 varying slots are used to represent the user-defined varyings we use to pass them from vertex to fragment and a new intrinsic is added to represent the write to the built-in clip_distance variable. Since the CLIP_DIST0/1 varying slots are not affected by opt_varyings, there can be potential interface mismatches so the machinery in msl_iomap.c is refactored to allow them to be output as a series of scalars rather than vectors. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38839>	2025-12-08 23:09:53 -05:00
Connor Abbott	ad84ae2719	tu: Implement VK_QCOM_subpass_shader_resolve Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38451>	2025-12-08 20:44:46 +00:00
Connor Abbott	bd821b9a17	nir, tu: Add and use load_frag_coord_gmem_ir3 We used load_frag_coord_unscaled_ir3 for loading the fragment coord for input attachments in GMEM, where the normal scaling for gl_FragCoord shouldn't be used. However with custom resolve a different scaling will apply to attachments in GMEM. Separate "unscaled" from "gmem" and rename the NIR options, in preparation for this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38451>	2025-12-08 20:44:45 +00:00
Karol Herbst	a255e2ca56	nir: add ACCESS to shared_uniform_block_intel intel_nir_blockify_uniform_loads simply overwrites the intrinsic for load_shared, which leads to messed up indicies, e.g: "base=0, access=volatile, align_mul=4, align_offset=0 became: "base=0, align_mul=4, align_offset=4" Fixes: `0dd09a292b` ("nir: add ACCESS_ATOMIC") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38801>	2025-12-04 10:01:52 +00:00
Karol Herbst	626c6b35f0	nak: add Movm Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37998>	2025-11-26 14:09:37 +00:00
Karol Herbst	c4f07f3d79	nir: mark cmat_load_shared_nv as CAN_ELIMINATE It's just a special load shared and has no side effects. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37998>	2025-11-26 14:09:35 +00:00
Aitor Camacho	bdaff0b457	kk: Handle memory coherency for textures and buffers Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details M1 chips are more restrictive than M2 and above. We need to enforce memory coherency when needed through "coherent" for buffer memory and "memory_coherence_device" for textures. Without these the memory operations are not visible to other threads. Reviewed-by: Arcady Goldmints-Orlov <arcady@lunarg.com> Signed-off-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38595>	2025-11-26 02:26:21 +00:00
Faith Ekstrand	fcb107accb	poly: Fetch the index size from a sysval On asahi, we can still specialize based on the shader key and get everything folded. But this gives drivers the option to make it dynamic if they wish. Co-authored-by: Mary Guillemard <mary.guillemard@collabora.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:23 +00:00
Faith Ekstrand	05aaa7df65	nir: Improve comments for a couple poly intrinsics Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:22 +00:00
Faith Ekstrand	05723bfa35	poly,asahi: Fetch directly from poly_vertex_state::output_buffer in GS We have access to the poly_vertex_state from the GS so we might as well use it. Asahi uses a single poly_vertex_state for VS and TCS and just assumes the tessellator stalls before we update it for TCS. If a driver wants to use two separate poly_vertex_state buffers, it will be the driver's responsibility to make the system values return the right one. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:19 +00:00
Faith Ekstrand	89fbb9cf84	poly,asahi: Move vertex_output_buffer to poly_vertex_param Instead of having the vertex output buffer be a system value and something the driver needs to manage, put it in poly_vertex_param. We already need to have it somewhere GPU-writable so we can write it from indirect setup kernels. Instead of manually allocating 8B all over the place just to hold this one pointer, stick it in poly_vertex_param. This also lets us get rid of a NIR intrinsic. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:18 +00:00
Faith Ekstrand	f36465d574	poly,asahi: Rename poly_ia_state to poly_vertex_params We're about to put more than just input assembly data in there so the name will make a lot more sense. Also, add a comment to make it more clear that this buffer applys to both VS and TES. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:16 +00:00
Kenneth Graunke	96d331766a	brw: Generalize read_attribute_payload_intel to handle more cases We were using this for indirect loads of the shader input thread payload, but there's no reason we can't use it for constant access too. In this case we can just MOV from the ATTR file directly without a special opcode that turns into MOV_INDIRECT later. We also allow it to load multiple components now. This is useful for say, returning vec4 pushed inputs. And, we allow it in more stages than just the fragment stage. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:59 +00:00
Kenneth Graunke	792762617a	brw: Rename read_attribute_payload_intel to load_attribute_payload_intel We're going to change the intrinsic to a load(...) which puts "load" in the name. Also, it's just more consistent with our usual terminology. We also rename the corresponding backend opcode so they remain matched. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:58 +00:00
Kenneth Graunke	f1ab64ad74	nir: add new intrinsics to load/store from URB on intel We add several new intrinsics for accessing URB handles: - load_urb_output_handle_intel - load_urb_input_handle_intel - load_urb_input_handle_intel_indexed The latter is used by stages like TCS and GS where each input control point has a unique handle. The index is which ICP to read from. The others are for most stages, where all inputs or outputs are accessed via a single handle. Then we have URB load and store operations, split for Xe2+ (URB via LSC) and earlier (HDC OWord messages): - load_urb_vec4_intel - load_urb_lsc_intel - store_urb_vec4_intel - store_urb_lsc_intel The legacy vec4 variants take a handle and a 128-bit OWord offset as sources. Additionally, stores take a set of channel enables to mask off and avoid writing vec4 components. We don't use the WRITE_MASK const-index as our channel enables are not required to be constant. The Xe2+ LSC variants are simpler. Handles are byte offsets into the URB memory region, and offsets are expressed in bytes. So we simply add them into a single "address" source. We don't support writemasks here, as they aren't really necessary with the better addressability. (Plus, the store_cmask operations work significantly differently than the previous HDC OWord messages). We will lower disjoint writemasks to multiple stores. Based on earlier code by Lionel Landwerlin. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:54 +00:00
Eric R. Smith	ab867cc3cd	nir: add intrinsics for pixel local storage The pixel local storage load and store instructions keep track of the format of the pixel local storage variables. This allows drivers to insert the appropriate conversions on load/store. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37110>	2025-11-18 20:25:42 +00:00
Georg Lehmann	3a175b54a4	aco,nir: support subdword v_permlane_b16 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38389>	2025-11-17 23:33:59 +00:00

1 2 3 4 5 ...

667 commits