fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-03 01:18:06 +02:00

Author	SHA1	Message	Date
Sushma Venkatesh Reddy	29fc96cb80	anv: Add GPU breakpoint before/after specific compute dispatch call Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13089 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35353>	2025-07-07 17:43:41 +00:00
Sushma Venkatesh Reddy	172e475705	intel: Add env variable to add break point on/before compute dispatch Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13089 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35353>	2025-07-07 17:43:40 +00:00
Alyssa Rosenzweig	d31cb824df	treewide: use VARYING_BIT_* Via Coccinelle patch generated by the following Python: varys = [ "POS", "COL0", "COL1", "FOGC", "TEX0", "TEX1", "TEX2", "TEX3", "TEX4", "TEX5", "TEX6", "TEX7", "PSIZ", "BFC0", "BFC1", "EDGE", "CLIP_VERTEX", "CLIP_DIST0", "CLIP_DIST1", "CULL_DIST0", "CULL_DIST1", "PRIMITIVE_ID", "PRIMITIVE_COUNT", "LAYER", "VIEWPORT", "FACE", "PRIMITIVE_SHADING_RATE", "PNTC", "TESS_LEVEL_OUTER", "TESS_LEVEL_INNER", "PRIMITIVE_INDICES", "BOUNDING_BOX0", "BOUNDING_BOX1", "VIEWPORT_MASK", "CULL_PRIMITIVE" ] t = """ @@ @@ -(1 << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -(1ull << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD64_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} """ for v in varys: from mako.template import Template print(Template(t).render(V = v)) Closes: #13453 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> [panfrost, common] Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> [broadcom] Reviewed-by: Corentin Noël <corentin.noel@collabora.com> [virgl] Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> [zink] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35917>	2025-07-04 19:01:04 +00:00
Mike Blumenkrantz	956d3f1562	mesa/st: handle renderbuffer with null zsbuf this matches cbuf handling Fixes: `2eb45daa9c` ("gallium: de-pointerize pipe_surface") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35941>	2025-07-04 17:36:40 +00:00
Yiwei Zhang	b21e62b71a	anv: avoid leaking private binding for aliased wsi image Aliased wsi image has to share the same private binding with the original wsi image for memory consistency. If the private binding exists, it needs to be released before being overridden. Fixes: `d85a9d658f` ("anv/image: Call into WSI to create swapchain images") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35893>	2025-07-03 17:40:31 +00:00
José Roberto de Souza	4830aec8ad	anv: Reduce compiled code for Wa_16018063123 Wa_16018063123 is not a workaround that depends on stepping, so we can use the INTEL_WA_16018063123_GFX_VER macro to reduce code generate for non affected platforms. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:13 +00:00
José Roberto de Souza	926e6a94ad	anv: Do not emit batch_emit_fast_color_dummy_blit() for video engine Wa_16018063123 don't apply to video engine also video engine don't support XY_FAST_COLOR_BLT. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Fixes: `ec43c20182` ("anv: implement dummy blit for Wa_16018063123") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00
José Roberto de Souza	4618a99a4c	anv: Flush before invalidate aux map in copy and video engines BSpec: 43904 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `46f5359238` ("anv: Invalidate aux map for copy/video engine") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00
José Roberto de Souza	e68f81eaf6	anv: Read the correct register for aux table invalidation when in GPGPU mode in render engine For 3D or GPGPU modes the same render engine should be used, CCS register should only be used when using compute engine. Fixes: `46f5359238` ("anv: Invalidate aux map for copy/video engine") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00
Matt Turner	7da88c76db	intel: Add support for BFloat16 as cooperative matrix accumulator The number of passing tests in ./deqp-vk -n 'cooperative_matrix.khr' on PTL increases from 914 -> 1030. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35320>	2025-07-02 20:06:59 +00:00
Matt Turner	e6242fb958	brw: Handle bfloat16 dest and src0 operands for DPAS Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35320>	2025-07-02 20:06:59 +00:00
Caio Oliveira	c006bee22d	brw: Don't use simd_select for BS shaders Since there's only one possible SIMD, don't need to use the helpers to decide which one to compile. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35799>	2025-07-02 19:48:04 +00:00
Caio Oliveira	c733f07378	brw: Use the right width in brw_nir_apply_key for BS shaders Fixes: `23c7142cd6` ("anv: disable SIMD16 for RT shaders") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35798>	2025-07-02 15:32:23 +00:00
Lionel Landwerlin	343f3dd3c1	brw: fix non constant BTI accesses with offsets Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e103afe7be` ("brw: run the nir_opt_offsets pass and set the maximum offset size") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35822>	2025-07-02 01:04:06 +03:00
Iván Briano	5b58b838fe	anv: move view_usage check to before setting the protected bit on it Otherwise the comparison will always be false for protected content. Also remove extra setting of the protected bit that was happening later. Fixes: `8d9cc6aa23` ("anv: properly flag image/imageviews for ISL protection") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35870>	2025-07-01 21:40:44 +00:00
Sagar Ghuge	5f31e6b286	anv: Drop unused anv_rt_bvh_build_method enum Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35848>	2025-07-01 20:00:35 +00:00
Lionel Landwerlin	89f3ee4cb2	brw: remove debug printf Fixes: `fcf4401824` ("brw: handle wa_18019110168 with independent shader compilation") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35815>	2025-06-29 12:39:03 +03:00
Calder Young	646977348b	anv: Fix typo when checking format's extended usage flag Fixes: `f4c1753c1a` ("anv: report color/storage features on YCbCr images with EXTENDED_USAGE") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35703>	2025-06-28 20:39:18 +00:00
Lionel Landwerlin	a742b859bd	anv: add support for handling wa_18019110168 with gfx-libs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:35 +00:00
Lionel Landwerlin	fcf4401824	brw: handle wa_18019110168 with independent shader compilation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:35 +00:00
Lionel Landwerlin	bc8d18aee2	brw: make a helper for vertex attribute offset computation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:34 +00:00
Lionel Landwerlin	8fabcd754f	brw: move primitive_id_index field in fs_msaa Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:34 +00:00
Lionel Landwerlin	6336cf0ea2	brw: store the remapping table for wa_18019110168 in constant data That way it can be accessed at runtime. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:33 +00:00
Lionel Landwerlin	e1a7eb1718	brw: extract out attribute register remapping Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:33 +00:00
Lionel Landwerlin	5cc66e2c8d	anv/brw: move Wa_18019110168 handling to backend We simplify the implementation by assuming the worse case, copying entire per-vertex regions if necessary. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:32 +00:00
Lionel Landwerlin	8e7e0ef75a	anv: make Wa_18019110168 deal with dynamic provoking vertex Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:32 +00:00
Lionel Landwerlin	f0f4f9c566	brw: fix vertex attribute offset computation The formula uses scalar indices (4bytes), not slots (16bytes). We also incorrectly passed a scalar (vertex case) & slot (mesh case) offset in the push constants. Use slots instead so that the value is smaller and we can pack more stuff into fs_msaa_flags. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `18bbcf9a63` ("intel: introduce new VUE layout for separate compiled shader with mesh") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:31 +00:00
Lionel Landwerlin	4b5539a0cb	brw: fix set_range on load_per_primitive_output load intrinsics don't have range Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `18bbcf9a63` ("intel: introduce new VUE layout for separate compiled shader with mesh") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35103>	2025-06-28 05:55:31 +00:00
Matt Turner	6842a8179f	intel: Add support for float16 as cooperative matrix accumulator The number of passing tests in ./deqp-vk -n 'cooperative_matrix.khr' increases - on PTL from 787 -> 914 - on RPL from 799 -> 926 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13304 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:22 +00:00
Matt Turner	6d786a0e4b	brw: Use convert_cmat_intel intrinsic Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:22 +00:00
Matt Turner	41cd196886	brw: Implement convert_cmat_intel intrinsic Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:22 +00:00
Matt Turner	1215845b5b	intel: Increase size of cooperative_matrix_configurations[] to 16 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35616>	2025-06-27 01:26:21 +00:00
Marek Olšák	1754507d49	nir: rename nir_lower_io_to_temporaries -> nir_lower_io_vars_to_temporaries Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	1e03827c77	nir: rename nir_lower_io_arrays_to_elements -> nir_lower_io_array_vars_to_elements same for *_no_indirects Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	12df9b3def	nir: rename nir_vectorize_tess_levels -> nir_lower_tess_level_array_vars_to_vec Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:50 +00:00
Marek Olšák	2aa94caf82	nir: rename nir_lower_io_to_vector -> nir_opt_vectorize_io_vars Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:50 +00:00
Marek Olšák	439d805291	nir: rename nir_lower_io_to_scalar_early -> nir_lower_io_vars_to_scalar Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:49 +00:00
Ian Romanick	b83f618fb2	brw: Fully write temporary destinations Consider an innocuous instruction like: and(1) v250:UD, g0.3<0,1,0>:UD, 4294967264u NoMask group0 If register allocation decides to spill v250, it will see this instruction and say, "Oh no! The other components of v250 aren't set, so I'd better add a fill before that instruction!" But it gets even worse than that... if register coalesce decided to merge two of these, the live range gets massively extended because the writes don't fully initialize the value. This causes the need to spill these registers in the first place. Changing that instruction to SIMD16 on Xe2 or SIMD8 on other platforms alleviates these issues. shader-db: Lunar Lake total instructions in shared programs: 17118324 -> 17113191 (-0.03%) instructions in affected programs: 93701 -> 88568 (-5.48%) helped: 42 / HURT: 6 total cycles in shared programs: 895422566 -> 895079488 (-0.04%) cycles in affected programs: 30111338 -> 29768260 (-1.14%) helped: 35 / HURT: 40 total spills in shared programs: 3588 -> 3304 (-7.92%) spills in affected programs: 285 -> 1 (-99.65%) helped: 10 / HURT: 0 total fills in shared programs: 2218 -> 1663 (-25.02%) fills in affected programs: 556 -> 1 (-99.82%) helped: 10 / HURT: 0 Meteor Lake, DG2, Tiger Lake, and Ice Lake had similar results. (Meteor Lake shown) total instructions in shared programs: 20059218 -> 20053563 (-0.03%) instructions in affected programs: 96938 -> 91283 (-5.83%) helped: 43 / HURT: 6 total cycles in shared programs: 884174588 -> 883536475 (-0.07%) cycles in affected programs: 22105268 -> 21467155 (-2.89%) helped: 35 / HURT: 27 total spills in shared programs: 5032 -> 4679 (-7.02%) spills in affected programs: 355 -> 2 (-99.44%) helped: 12 / HURT: 0 total fills in shared programs: 4782 -> 4113 (-13.99%) fills in affected programs: 671 -> 2 (-99.70%) helped: 12 / HURT: 0 Skylake total instructions in shared programs: 19097658 -> 19097665 (<.01%) instructions in affected programs: 14202 -> 14209 (0.05%) helped: 0 / HURT: 5 total cycles in shared programs: 862058109 -> 862058267 (<.01%) cycles in affected programs: 3450244 -> 3450402 (<.01%) helped: 7 / HURT: 11 fossil-db: Lunar Lake Totals: Cycle count: 31439652246 -> 31439652272 (+0.00%) Totals from 2 (0.00% of 707091) affected shaders: Cycle count: 2602 -> 2628 (+1.00%) No other Intel platforms had any fossil-db changes. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35721>	2025-06-26 17:59:47 +00:00
Caio Oliveira	30490de24a	intel/executor: allow single line comments in macro lines Assembler supports them, so allow them on @-macro lines. For now we don't bother with multiline comments, if becomes a thing we can add them later. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35699>	2025-06-26 00:58:02 +00:00
Caio Oliveira	d14fa6683b	intel/executor: update SFID names in macros to match recent changes After commit `88309a9818`, SFID names were renamed - "dp data 1" became "hdc1" - "thread_spawner" became "ts/btd" Update macros in executor to use the new SFID names so the generated assembly can be parsed correctly. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35701>	2025-06-25 17:31:00 -07:00
Iván Briano	d964b8d5fa	anv: don't report custom sample locations for sample count 1 We can't actually enable MSAA for images with sample count 1, and without MSAA active, the sample location machinery does not get used. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35504>	2025-06-24 19:44:34 +00:00
Matt Turner	6a47531440	intel: Generate files with newline at end This generator scripts uses the `write` function that, unlike `print`, doesn't print a trailing newline. So let's add one to the template. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35697>	2025-06-24 14:01:04 +00:00
Dave Airlie	29c599ffea	anv: only expose VK_KHR_cooperative_matrix on devices with hw instructions. Currently anv exposes this on lots of devices, with the intent to be better than apps can give, but I think this is wrong for a couple of reasons. Apps want to know if hw exposes the fast path, Vulkan is meant to be explicit, and telling llama.cpp if the fast path exists lets it make smarter decisions. It seems unless someone heavily optimises the slow path, that CPU is usually faster than GPU with llama-bench unless the hw path exists. v2: added INTEL_LOWER_DPAS support Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35564>	2025-06-23 21:06:51 +00:00
Konstantin Seurer	4cbbdc0a50	vulkan: Pass a structure to most BVH build callbacks It is annoying to change all function signatures when a driver needs more information. There are also some callbacks that have a lot of parameters and there have already been bugs related to that. This patch tries to clean the interface by adding a struct that contains all information that might be relevant for the driver and passing that to most callbacks. radv changes are: Reviewed-by: Natalie Vock <natalie.vock@gmx.de> anv changes are: Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> turnip changes are: Reviewed-by: Connor Abbott <cwabbott0@gmail.com> vulkan runtime changes are: Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35385>	2025-06-23 20:43:43 +00:00
Konstantin Seurer	28713789ad	vulkan: Replace get_*_key with get_build_config It is a cleaner API and gives more control about the build to the driver. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35385>	2025-06-23 20:43:43 +00:00
José Roberto de Souza	bdd20457ed	anv: Emit STATE_COMPUTE_MODE before COMPUTE_WALKER when new async compute limits are needed Cc: stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35563>	2025-06-23 18:57:25 +00:00
José Roberto de Souza	b37747ce68	blorp: Emit STATE_COMPUTE_MODE before COMPUTE_WALKER Cc: stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35563>	2025-06-23 18:57:25 +00:00
José Roberto de Souza	59d361043e	intel/common: Use as much as possible spec recommended values for compute engine async thread limits Spec recommended values should give us a good balance between progress in render and compute engines, also with less possibility of values it will reduce the number of times that we need to emit STATE_COMPUTE_MODE reducing the number of stalls in the compute engine. Cc: stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35563>	2025-06-23 18:57:25 +00:00
José Roberto de Souza	080b9a165c	intel/common: Add function to compute optimal compute engine async thread limits Spec has several restrictions to the values we program to compute engine async thread limits. Without those we risk hit deadlocks, so here adding a function to return the optimal value based on those restrictions. Cc: stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35563>	2025-06-23 18:57:24 +00:00
Eric Engestrom	99e8d804bf	intel/compiler tests: fix variable type for getopt_long() return value `getopt_long()` returns an `int`, not a `char`; putting the value in a `char` before comparing it to `-1` was making the comparison always fail, resulting in the invalid codepath taken that then fails with: option `-' is invalid: ignored cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34756>	2025-06-23 08:26:29 +00:00
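
Commit `99e8d804bf` in the table above fixes a `getopt_long()` return value that was being stored in a `char`. The following is a minimal sketch of why that can break, assuming a platform where plain `char` is unsigned (the commit message does not say which platform was affected). `unsigned char` is used to model that case portably, and all function names are hypothetical rather than taken from the Mesa tests:

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical helper: models storing the getopt_long() return value in a
 * plain char on a platform where char is unsigned. */
bool sees_end_of_options_narrow(int getopt_ret)
{
    /* -1 wraps to UCHAR_MAX (255 with 8-bit chars). */
    unsigned char c = (unsigned char)getopt_ret;
    int end_marker = -1;

    /* c is promoted to a non-negative int, so it can never equal -1. */
    return c == end_marker;
}

/* Hypothetical helper: keeps the value in an int, the type that
 * getopt_long() actually returns. */
bool sees_end_of_options_int(int getopt_ret)
{
    int c = getopt_ret;
    return c == -1;
}

/* Small self-check contrasting the two behaviours. */
void check_getopt_return_types(void)
{
    assert(!sees_end_of_options_narrow(-1)); /* end marker is missed */
    assert(sees_end_of_options_int(-1));     /* end marker is seen */
    assert(!sees_end_of_options_int('h'));   /* a real option is not the end */
}
```

Under this assumption, a loop of the form `while ((c = getopt_long(...)) != -1)` would not stop once the options run out when `c` is the narrow type, which is consistent with the invalid code path the commit describes. Declaring `c` as `int` avoids the problem regardless of the signedness of `char`.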


14219 commits