fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-02-04 10:40:36 +01:00

Author	SHA1	Message	Date
Sviatoslav Peleshko	8d22eb960b	brw/disasm: Fix Gfx11 3src-instructions dst register disassembly The conversion from bit value to register file type is already done by the brw_eu_inst_3src_a1_dst_reg_file in the FFC macro now, so doing it again produced incorrect results. Fixes: `e7179232` ("intel/brw: Move encoding of Gfx11 3-src inside the inst helpers") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13141 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35960>	2025-07-08 19:49:09 +00:00
Daniel Schürmann	2c51a8870d	nir: add nir_vectorize_cb callback parameter to nir_lower_phis_to_scalar() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Similar to nir_lower_alu_width(), the callback can return the desired number of components for a phi, or 0 for no lowering. The previous behavior of nir_lower_phis_to_scalar() with lower_all=true can be elicited via nir_lower_all_phis_to_scalar() while the previous behavior with lower_all=false now corresponds to nir_lower_phis_to_scalar() with NULL callback. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35783>	2025-07-08 15:33:59 +00:00
Marek Olšák	8def3f865d	agx,freedreno,intel,lima,panfrost,svga,virgl,zink: fix supports_indirect_inputs The GLSL compiler always lowers inputs to temps for VS and GS, so exclude them from driver support because the GLSL compiler will no longer do that unconditionally. Thus, indirect VS and GS inputs are completely untested and broken in a lot of drivers. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35945>	2025-07-08 06:11:42 +00:00
Lionel Landwerlin	67e452669e	anv: do not rely on sampler objects for pipeline compilation Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Descriptor set layout lifetime can be shorter than what the implementation requires. One example is : * create descriptor set layout * create graphics pipeline library * destroy descriptor set layout * link optimize library in a final pipeline The last step might need the descriptor set layout information again. We've so far worked around this by taking a reference on the descriptor set layout in the pipelines. But we forgot that descriptor set layouts have pointers to samplers (for immutable & embedded samplers). We could take a reference to samplers but that sucks for various reasons : - it consumes dynamic state heap space - it could cause issues with capture-replay placement So instead we copy the information from the samplers that might be needed in cases like link optimization. This includes : - ycbcr conversion state (used for NIR lowering) - embedded sampler data (to recreate the sampler) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35955>	2025-07-07 18:53:53 +00:00
Lionel Landwerlin	98bc185376	anv: rework embedded sampler hashing Create a hashing key on all samplers so we can just copy that anywhere we need it. That key already contains the needed parameters for embedded samplers, so the sha1 stuff can go away. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35955>	2025-07-07 18:53:53 +00:00
Sushma Venkatesh Reddy	fa0232d961	intel/executor: Add missing dependency to fix intermittent build failures The executor build was failing randomly due to a missing dependency on `idev_intel_dev`. This patch adds the required dependency to the `meson.build` file to ensure consistent and reliable builds across different configurations. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35928>	2025-07-07 18:35:56 +00:00
Sushma Venkatesh Reddy	29fc96cb80	anv: Add GPU breakpoint before/after specific compute dispatch call Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13089 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35353>	2025-07-07 17:43:41 +00:00
Sushma Venkatesh Reddy	172e475705	intel: Add env variable to add break point on/before compute dispatch Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13089 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35353>	2025-07-07 17:43:40 +00:00
Alyssa Rosenzweig	d31cb824df	treewide: use VARYING_BIT_* Some checks failed macOS-CI / macOS-CI (dri) (push) Has been cancelled Details macOS-CI / macOS-CI (xlib) (push) Has been cancelled Details Via Coccinelle patch generated by the following Python: varys = [ "POS", "COL0", "COL1", "FOGC", "TEX0", "TEX1", "TEX2", "TEX3", "TEX4", "TEX5", "TEX6", "TEX7", "PSIZ", "BFC0", "BFC1", "EDGE", "CLIP_VERTEX", "CLIP_DIST0", "CLIP_DIST1", "CULL_DIST0", "CULL_DIST1", "PRIMITIVE_ID", "PRIMITIVE_COUNT", "LAYER", "VIEWPORT", "FACE", "PRIMITIVE_SHADING_RATE", "PNTC", "TESS_LEVEL_OUTER", "TESS_LEVEL_INNER", "PRIMITIVE_INDICES", "BOUNDING_BOX0", "BOUNDING_BOX1", "VIEWPORT_MASK", "CULL_PRIMITIVE" ] t = """ @@ @@ -(1 << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -(1ull << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD64_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} """ for v in varys: from mako.template import Template print(Template(t).render(V = v)) Closes: #13453 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> [panfrost, common] Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> [broadcom] Reviewed-by: Corentin Noël <corentin.noel@collabora.com> [virgl] Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> [zink] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35917>	2025-07-04 19:01:04 +00:00
Mike Blumenkrantz	956d3f1562	mesa/st: handle renderbuffer with null zsbuf this matches cbuf handling Fixes: `2eb45daa9c` ("gallium: de-pointerize pipe_surface") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35941>	2025-07-04 17:36:40 +00:00
Yiwei Zhang	b21e62b71a	anv: avoid leaking private binding for aliased wsi image Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Aliased wsi image has to share the same private binding with the original wsi image for memory consistency. If the private binding exists, it needs to be released before being overridden. Fixes: `d85a9d658f` ("anv/image: Call into WSI to create swapchain images") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35893>	2025-07-03 17:40:31 +00:00
José Roberto de Souza	4830aec8ad	anv: Reduce compiled code for Wa_16018063123 Wa_16018063123 is not a workaround that depends on stepping, so we can use the INTEL_WA_16018063123_GFX_VER macro to reduce code generate for non affected platforms. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:13 +00:00
José Roberto de Souza	926e6a94ad	anv: Do not emit batch_emit_fast_color_dummy_blit() for video engine Wa_16018063123 don't apply to video engine also video engine don't support XY_FAST_COLOR_BLT. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Fixes: `ec43c20182` ("anv: implement dummy blit for Wa_16018063123") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00
José Roberto de Souza	4618a99a4c	anv: Flush before invalidate aux map in copy and video engines BSpec: 43904 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `46f5359238` ("anv: Invalidate aux map for copy/video engine") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00
José Roberto de Souza	e68f81eaf6	anv: Read the correct register for aux table invalidation when in GPGPU mode in render engine For 3D or GPGPU modes the same render engine should be used, CCS register should only be used when using compute engine. Fixes: `46f5359238` ("anv: Invalidate aux map for copy/video engine") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00
Matt Turner	7da88c76db	intel: Add support for BFloat16 as cooperative matrix accumulator The number of passing tests in ./deqp-vk -n 'cooperative_matrix.khr' on PTL increases from 914 -> 1030. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35320>	2025-07-02 20:06:59 +00:00
Matt Turner	e6242fb958	brw: Handle bfloat16 dest and src0 operands for DPAS Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35320>	2025-07-02 20:06:59 +00:00
Caio Oliveira	c006bee22d	brw: Don't use simd_select for BS shaders Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Since there's only one possible SIMD, don't need to use the helpers to decide which one to compile. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35799>	2025-07-02 19:48:04 +00:00
Caio Oliveira	c733f07378	brw: Use the right width in brw_nir_apply_key for BS shaders Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fixes: `23c7142cd6` ("anv: disable SIMD16 for RT shaders") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35798>	2025-07-02 15:32:23 +00:00
Lionel Landwerlin	343f3dd3c1	brw: fix non constant BTI accesses with offsets Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e103afe7be` ("brw: run the nir_opt_offsets pass and set the maximum offset size") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35822>	2025-07-02 01:04:06 +03:00
Iván Briano	5b58b838fe	anv: move view_usage check to before setting the protected bit on it Otherwise the comparison will always be false for protected content. Also remove extra setting of the protected bit that was happening later. Fixes: `8d9cc6aa23` ("anv: properly flag image/imageviews for ISL protection") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35870>	2025-07-01 21:40:44 +00:00
Sagar Ghuge	5f31e6b286	anv: Drop unused anv_rt_bvh_build_method enum Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35848>	2025-07-01 20:00:35 +00:00
Lionel Landwerlin	89f3ee4cb2	brw: remove debug printf Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fixes: `fcf4401824` ("brw: handle wa_18019110168 with independent shader compilation") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35815>	2025-06-29 12:39:03 +03:00
Calder Young	646977348b	anv: Fix typo when checking format's extended usage flag Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fixes: `f4c1753c1a` ("anv: report color/storage features on YCbCr images with EXTENDED_USAGE") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35703>	2025-06-28 20:39:18 +00:00
Lionel Landwerlin	a742b859bd	anv: add support for handling wa_18019110168 with gfx-libs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:35 +00:00
Lionel Landwerlin	fcf4401824	brw: handle wa_18019110168 with independent shader compilation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:35 +00:00
Lionel Landwerlin	bc8d18aee2	brw: make a helper for vertex attribute offset computation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:34 +00:00
Lionel Landwerlin	8fabcd754f	brw: move primitive_id_index field in fs_msaa Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:34 +00:00
Lionel Landwerlin	6336cf0ea2	brw: store the remapping table for wa_18019110168 in constant data That way it can be accessed at runtime. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:33 +00:00
Lionel Landwerlin	e1a7eb1718	brw: extract out attribute register remapping Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:33 +00:00
Lionel Landwerlin	5cc66e2c8d	anv/brw: move Wa_18019110168 handling to backend We simplify the implementation by assuming the worse case, copying entire per-vertex regions if necessary. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:32 +00:00
Lionel Landwerlin	8e7e0ef75a	anv: make Wa_18019110168 deal with dynamic provoking vertex Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:32 +00:00
Lionel Landwerlin	f0f4f9c566	brw: fix vertex attribute offset computation The formula uses scalar indices (4bytes), not slots (16bytes). We also incorrectly passed a scalar (vertex case) & slot (mesh case) offset in the push constants. Use slots instead so that the value is smaller and we can pack more stuff into fs_msaa_flags. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `18bbcf9a63` ("intel: introduce new VUE layout for separate compiled shader with mesh") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:31 +00:00
Lionel Landwerlin	4b5539a0cb	brw: fix set_range on load_per_primitive_output load intrinsics don't have range Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `18bbcf9a63` ("intel: introduce new VUE layout for separate compiled shader with mesh") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:31 +00:00
Matt Turner	6842a8179f	intel: Add support for float16 as cooperative matrix accumulator Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The number of passing tests in ./deqp-vk -n 'cooperative_matrix.khr' increases - on PTL from 787 -> 914 - on RPL from 799 -> 926 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13304 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:22 +00:00
Matt Turner	6d786a0e4b	brw: Use convert_cmat_intel intrinsic Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:22 +00:00
Matt Turner	41cd196886	brw: Implement convert_cmat_intel intrinsic Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:22 +00:00
Matt Turner	1215845b5b	intel: Increase size of cooperative_matrix_configurations[] to 16 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:21 +00:00
Marek Olšák	1754507d49	nir: rename nir_lower_io_to_temporaries -> nir_lower_io_vars_to_temporaries Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	1e03827c77	nir: rename nir_lower_io_arrays_to_elements -> nir_lower_io_array_vars_to_elements same for *_no_indirects Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	12df9b3def	nir: rename nir_vectorize_tess_levels -> nir_lower_tess_level_array_vars_to_vec Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:50 +00:00
Marek Olšák	2aa94caf82	nir: rename nir_lower_io_to_vector -> nir_opt_vectorize_io_vars Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:50 +00:00
Marek Olšák	439d805291	nir: rename nir_lower_io_to_scalar_early -> nir_lower_io_vars_to_scalar Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:49 +00:00
Ian Romanick	b83f618fb2	brw: Fully write temporary destinations Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Consider an innocuous instruction like: and(1) v250:UD, g0.3<0,1,0>:UD, 4294967264u NoMask group0 If register allocation decides to spill v250, it will see this instruction and say, "Oh no! The other components of v250 aren't set, so I'd better add a fill before that instruction!" But it gets even worse than that... if register coalesce decided to merge two of these, the live range gets massively extended because the writes don't fully initialize the value. This causes the need to spill these registers in the first place. Changing that instruction to SIMD16 on Xe2 or SIMD8 on other platforms alleviates these issues. shader-db: Lunar Lake total instructions in shared programs: 17118324 -> 17113191 (-0.03%) instructions in affected programs: 93701 -> 88568 (-5.48%) helped: 42 / HURT: 6 total cycles in shared programs: 895422566 -> 895079488 (-0.04%) cycles in affected programs: 30111338 -> 29768260 (-1.14%) helped: 35 / HURT: 40 total spills in shared programs: 3588 -> 3304 (-7.92%) spills in affected programs: 285 -> 1 (-99.65%) helped: 10 / HURT: 0 total fills in shared programs: 2218 -> 1663 (-25.02%) fills in affected programs: 556 -> 1 (-99.82%) helped: 10 / HURT: 0 Meteor Lake, DG2, Tiger Lake, and Ice Lake had similar results. (Meteor Lake shown) total instructions in shared programs: 20059218 -> 20053563 (-0.03%) instructions in affected programs: 96938 -> 91283 (-5.83%) helped: 43 / HURT: 6 total cycles in shared programs: 884174588 -> 883536475 (-0.07%) cycles in affected programs: 22105268 -> 21467155 (-2.89%) helped: 35 / HURT: 27 total spills in shared programs: 5032 -> 4679 (-7.02%) spills in affected programs: 355 -> 2 (-99.44%) helped: 12 / HURT: 0 total fills in shared programs: 4782 -> 4113 (-13.99%) fills in affected programs: 671 -> 2 (-99.70%) helped: 12 / HURT: 0 Skylake total instructions in shared programs: 19097658 -> 19097665 (<.01%) instructions in affected programs: 14202 -> 14209 (0.05%) helped: 0 / HURT: 5 total cycles in shared programs: 862058109 -> 862058267 (<.01%) cycles in affected programs: 3450244 -> 3450402 (<.01%) helped: 7 / HURT: 11 fossil-db: Lunar Lake Totals: Cycle count: 31439652246 -> 31439652272 (+0.00%) Totals from 2 (0.00% of 707091) affected shaders: Cycle count: 2602 -> 2628 (+1.00%) No other Intel platforms had any fossil-db changes. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35721>	2025-06-26 17:59:47 +00:00
Caio Oliveira	30490de24a	intel/executor: allow single line comments in macro lines Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Assembler supports them, so allow them on @-macro lines. For now we don't bother with multiline comments, if becomes a thing we can add them later. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35699>	2025-06-26 00:58:02 +00:00
Caio Oliveira	d14fa6683b	intel/executor: update SFID names in macros to match recent changes After commit `88309a9818`, SFID names were renamed - "dp data 1" became "hdc1" - "thread_spawner" became "ts/btd" Update macros in executor to use the new SFID names so the generated assembly can be parsed correctly. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35701>	2025-06-25 17:31:00 -07:00
Iván Briano	d964b8d5fa	anv: don't report custom sample locations for sample count 1 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We can't actually enable MSAA for images with sample count 1, and without MSAA active, the sample location machinery does not get used. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35504>	2025-06-24 19:44:34 +00:00
Matt Turner	6a47531440	intel: Generate files with newline at end This generator scripts uses the `write` function that, unlike `print`, doesn't print a trailing newline. So let's add one to the template. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35697>	2025-06-24 14:01:04 +00:00
Dave Airlie	29c599ffea	anv: only expose VK_KHR_cooperative_matrix on devices with hw instructions. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Currently anv exposes this on lots of devices, with the intent to be better than apps can give, but I think this is wrong for a couple of reasons. Apps want to know if hw exposes the fast path, Vulkan is meant to be explicit, and telling llama.cpp if the fast path exists lets it make smarter decisions. It seems unless someone heavily optimises the slow path, that CPU is usually faster than GPU with llama-bench unless the hw path exists. v2: added INTEL_LOWER_DPAS support Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35564>	2025-06-23 21:06:51 +00:00
Konstantin Seurer	4cbbdc0a50	vulkan: Pass a structure to most BVH build callbacks It is annoying to change all function signatures when a driver needs more information. There are also some callbacks that have a lot of parameters and there have already been bugs related to that. This patch tries to clean the interface by adding a struct that contains all information that might be relevant for the driver and passing that to most callbacks. radv changes are: Reviewed-by: Natalie Vock <natalie.vock@gmx.de> anv changes are: Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> turnip changes are: Reviewed-by: Connor Abbott <cwabbott0@gmail.com> vulkan runtime changes are: Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35385>	2025-06-23 20:43:43 +00:00

1 2 3 4 5 ...

14225 commits