fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 11:28:05 +02:00

Author	SHA1	Message	Date
Nick Hamilton	a965c71ec6	pvr: Rename pvr_render_input_attachment The struct will also be used for preserve attachments in the next commit. Backport-to: 26.0 Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> (cherry picked from commit `e18670347a`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Jarred Davies	fb1ba13c57	pvr: Fix allocating the required scratch buffer space for tile buffers When calculating the dwords per pixel the output registers should always be taken into account in addition to the number of tile buffers. Fixes incorrect scratch buffer space calculation when both output registers and tile buffers are emitted by a render. Partial fix for: dEQP-VK.renderpass.*.attachment_allocation.input_output.71 Fixes: `3457f8083a` ("pvr: Acquire scratch buffer on framebuffer creation.") Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> (cherry picked from commit `df445dc9b9`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Nick Hamilton	9ad2e48819	pvr: Fix incorrect subpass merging optimisation The subpass merging optimisation check for when subpasses are using tile buffers was in the incorrect location. The current check is in a function called from two places but only the first of these should have been doing the optimisation check. This was incorrectly affecting the number of renders that subpass merging could avoid. Partial fix for: dEQP-VK.renderpass.*.attachment_allocation.input_output.71 Fixes: `10b6a0d567` ("pvr: Add support for generating render pass hw setup data.") Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> (cherry picked from commit `0640ac7e3b`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Danylo Piliaiev	ac49313d06	ir3: Align TCS per-patch output to 64 bytes to prevent stale reads Empirically, TCS outputs have to be aligned to 64 bytes, otherwise stale data may be read in rare cases. The exact reason is not clear, but tests and proprietary driver behavior strongly point at the need for 64 byte alignment. Fixes tesselation issues in at least "Conan Exiles" but likely in many more cases. CC: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> (cherry picked from commit `47251b2e2d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Rhys Perry	ba82a16761	aco: resolve hazards before calls Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 26.0 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> (cherry picked from commit `613b4fe407`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Rhys Perry	697fbaddb5	aco: reset all vgpr_used_by_vmem_ in resolve_all_gfx11 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 26.0 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> (cherry picked from commit `dfda890ae8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Benjamin Otte	d7607b6a4e	lavapipe: Fix features for nonsubsampled ycbcr formats The Vulkan spec says about VkFormatFeatureFlagBits: If a format does not incorporate chroma downsampling (it is not a “422” or “420” format) but the implementation supports sampler Y′CBCR conversion for this format, the implementation must set VK_FORMAT_FEATURE_MIDPOINT_CHROMA_SAMPLES_BIT. Fixes: `af062126ae` Signed-off-by: Benjamin Otte <otte@redhat.com> (cherry picked from commit `0b6dd167ac`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Robert Mader	a163dec3ff	lavapipe: enable dmabuf import for planar drm formats Like e.g. NV12. This just requires some minor fixes around offset handling. (cherry picked from commit `0b6340fd94`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Mike Blumenkrantz	499a74569f	zink: only do pre-sync transfer barrier after a renderpass this is otherwise pointless and (for swapchain images) broken (because they may never have acquired an image) discovered by @valentine cc: mesa-stable (cherry picked from commit `d47ba92d42`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Samuel Pitoiset	545509553a	radv/meta: fix depth/stencil resolves with different regions This is possible since VK_KHR_maintenance10. This fixes new VKCTS coverage in dEQP-VK.pipeline..multisample.m10_resolve.. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `ab6147e8ef`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Tapani Pälli	befb9af14b	util: bring back fix to avoid strict aliasing bugs in xxhash This is commit `b9e163fa67` that got lost in xxhash upgrade `070bf8986c`. Fixes graphics artifacts seen in multiple workloads with Intel driver when using clang compiler. Fixes also CTS tests: dEQP-GLES31.functional.geometry_shading.layered.fragment_layer_cubemap dEQP-GLES31.functional.geometry_shading.layered.fragment_layer_3d dEQP-GLES31.functional.geometry_shading.layered.fragment_layer_2d_array dEQP-GLES31.functional.geometry_shading.layered.fragment_layer_2d_multisample_array v2: pass arguments from meson.build instead of hardcoding (Eric Engestrom) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14684 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14107 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13895 Fixes: `070bf8986c` ("util: Upgrade xxhash.h to v0.8.3") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `d2351b3d04`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Faith Ekstrand	a457021d67	panvk: Also load output attachments with LOAD_OP_NONE+STORE_OP_NONE We already had this for LOAD_OP_DONT_CARE but we also need it for LOAD_OP_NONE. Cc: mesa-stable Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> (cherry picked from commit `44ff0c4707`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Faith Ekstrand	262e7feab9	panvk/jm: Refactor BeginRendering() The old code was all out of order and made no sense. There's a reason it made no sense. It was wrong. Cleaning this up fixes a solid 1/3 of the remaining Bifrost CTS fails in CI. Cc: mesa-stable Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> (cherry picked from commit `962d1f33e1`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Faith Ekstrand	e29de2865e	panvk/preload: Stop assuming 32 registers cc: mesa-stable Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> (cherry picked from commit `3bb7d929f4`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Faith Ekstrand	37191db342	panvk: Create both Z/S descriptors, even for separate Z/S The Vulkan spec says that aspects are ignored for Z/S attachments so we shouldn't consider that as a factor when deciding whether or not to create other aspect descriptors. This will be irrelevant in a couple of commits but we need it for the backport anyway. Cc: mesa-stable Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> (cherry picked from commit `19ad26a8de`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Faith Ekstrand	3a92074d8c	nir/gather_info: Add support for panfrost tile load/store intrinsics Fixes: `6fc1030e4f` ("nir: Add some new panfrost fragment shader intrinsics") Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> (cherry picked from commit `88ad8bc75d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Faith Ekstrand	897f5814ed	pan/clear: Stop packing undefined bits in colors The util code doesn't actually fill things with zeros so the high bits are undefined. If we really want things replicated, we need to mask off just the bits we care about. Cc: mesa-stable Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> (cherry picked from commit `4d8551552e`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Emma Anholt	61f09295f3	ir3/ra: Fix DOUBLE_ONLY limit pressure computation. As the comment says, we want to limit our pressure based on underlying HW reg file size, not max it out to HW reg file size. This caused us to not spill when we should when the HW reg size was bigger than the ISA reg file size, leading to OOB writes in RA when it tried to allocate to the limit pressure we spilled to. Fixes segfaults in llama.cpp's test-backend-ops. Fixes: `e6e34883a9` ("ir3: Add wavesize control") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14846 (cherry picked from commit `0c6da326f8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
José Roberto de Souza	b7752ddbc3	intel/perf: Add HSW verx10 to intel_perf_query_result_write_mdapi() HSW is verx10 75 and when we switched from ver to verx10 I forgot to add the case 75. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `a097a3d214` ("intel/perf: Change mdapi switch cases from ver to verx") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14902 Signed-off-by: José Roberto de Souza <jose.souza@intel.com> (cherry picked from commit `48c685ee39`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Natalie Vock	71145cb846	radv/nir: Correctly handle workgroup sizes not aligned to 32 Since the stride is always 32 dwords, we need to treat the workgroup size as multiples of that value. Using MAX2() only works for cases where the workgroup size is less than 32, which was hit by some CTS with 1x1 workgroups. Cc: mesa-stable (cherry picked from commit `b08f9f192c`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Samuel Pitoiset	54293d4fdd	radv: fix potential corruption after FMASK decompression on GFX6-8 While reworking image resolves completely in RADV, I found a very weird bug where the only fix was to emit caches immediately after decompressing the source resolve image (after FMASK_DECOMPRESS). I have been struggling this for few hours and figured that it was something related to context rolls (ie. as long the context was rolled out, emitting the flushes immediately was required). It turns out this was a known hardware bug on GFX6 that was implemented in PAL. Though PAL only applies on GFX6 but GFX7-8 are also affected based on my testing. Note that RadeonSI flushes CB_META too. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `837078b8d5`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Lionel Landwerlin	6f75431e98	anv: disable ccs modifier reporting when ccs modifiers are disabled Reporting the modifiers when we're going to disable it in the back hits various asserts in anv_image.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2418c91537` ("anv/drirc: disable Xe2 CCS drm modifiers for GTK engine") Helps: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14853 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `4f38b5c888`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Lionel Landwerlin	5fa6c15b36	anv: apply the same ccs disabling for Xe3 than Xe2 The new compression scheme introduced in Xe2 also applies to Xe3, so we're liable for the same bugs. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2418c91537` ("anv/drirc: disable Xe2 CCS drm modifiers for GTK engine") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `4ac47f8dde`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Rhys Perry	849cdbcf72	aco: fix gfx6-8 store_scratch() with function calls Might happen with radv_emulate_rt=true. Fixes the_great_circle/a6079328b8df7712 with polaris10. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `e006f68b11` ("aco/isel: Don't add scratch offset as gfx8- soffset if no offsets exist") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> (cherry picked from commit `75722da909`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Ian Romanick	bfeb230f9b	elk/cmod: Don't propagate from CMP to ADD if there is a write between If either source of the CMP is modified before an appropriate ADD is found, the ADD and the CMP will not have the same result. No shader-db changes on any ELK platform. I suspect the problematic cases only occur after scheduling has rearranged instructions. This is likely the reason BRW didn't experience this problem until `09450faf`. Fixes: `020b0055e7` ("i965/fs: Propagate conditional modifiers from compares to adds") Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `da1fd9786b`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Ian Romanick	024c5de569	elk/cmod: Don't propagate from CMP to possible Inf + (-Inf) This is a backport of BRW `e26270249b`. shader-db: All Intel platforms had similar results. (Broadwell shown) total instructions in shared programs: 18623918 -> 18624594 (<.01%) instructions in affected programs: 125179 -> 125855 (0.54%) helped: 0 / HURT: 139 total cycles in shared programs: 957073100 -> 957072484 (<.01%) cycles in affected programs: 16534168 -> 16533552 (<.01%) helped: 42 / HURT: 68 Fixes: `020b0055e7` ("i965/fs: Propagate conditional modifiers from compares to adds") Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `bdbfe8de4d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Ian Romanick	d68b3091b2	brw/cmod: Don't propagate from CMP to ADD if there is a write between If either source of the CMP is modified before an appropriate ADD is found, the ADD and the CMP will not have the same result. shader-db: Lunar Lake total instructions in shared programs: 17098815 -> 17098818 (<.01%) instructions in affected programs: 1187 -> 1190 (0.25%) helped: 0 / HURT: 3 total cycles in shared programs: 876858960 -> 876858968 (<.01%) cycles in affected programs: 6878 -> 6886 (0.12%) helped: 0 / HURT: 1 Meteor Lake, DG2, Tiger Lake, Ice Lake, and Skylake had similar results. (Meteor Lake shown) total instructions in shared programs: 20034973 -> 20034984 (<.01%) instructions in affected programs: 4599 -> 4610 (0.24%) helped: 0 / HURT: 11 total cycles in shared programs: 881033088 -> 881033108 (<.01%) cycles in affected programs: 57872 -> 57892 (0.03%) helped: 0 / HURT: 5 fossil-db: All Intel platforms had similar results. (Lunar Lake shown) Totals: Instrs: 918873064 -> 918873269 (+0.00%) CodeSize: 14747338416 -> 14747339360 (+0.00%); split: -0.00%, +0.00% Cycle count: 104141836677 -> 104141840371 (+0.00%); split: -0.00%, +0.00% Totals from 205 (0.01% of 2011421) affected shaders: Instrs: 290415 -> 290620 (+0.07%) CodeSize: 4280704 -> 4281648 (+0.02%); split: -0.01%, +0.03% Cycle count: 18166526 -> 18170220 (+0.02%); split: -0.00%, +0.02% Closes: #14874 Fixes: `020b0055e7` ("i965/fs: Propagate conditional modifiers from compares to adds") Reviewed-by: Matt Turner <mattst88@gmail.com> Tested-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (cherry picked from commit `d1614cd6db`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Frank Binns	e1ae66262f	pvr: Fix alloc callbacks usage when freeing frame buffers When creating frame buffers the alloc callbacks are used in the host allocations, those same alloc callbacks need to be used when freeing those allocations but are missing in some places causing the CTS to report memory leaks in certain test cases. Fixes: `146364ab9f` ("pvr: add support for VK_KHR_dynamic_rendering") fix: dEQP-VK.api.object_management.alloc_callback_fail.framebuffer dEQP-VK.api.object_management.single_alloc_callbacks.framebuffer Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> (cherry picked from commit `05ef9f01a7`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Frank Binns	dea37352ba	pvr/ci: move some timing out tests from fails to skips Some of these test cases where already in the skip list. Signed-off-by: Frank Binns <frank.binns@imgtec.com> (cherry picked from commit `74fd985c6c`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Yiwei Zhang	22c27bd3ea	venus: sync protocol for strict aliasing compliance See https://gcc.gnu.org/bugzilla/show_bug.cgi?id=124148 for details. Backport log: headers are generated from the protocol used by 26.0 branch with the strict aliasing fix (cherry picked from commit `6411ee0c2d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Aitor Camacho	40cf87c35a	kk: Fix graphics pipeline serialization Bundles all graphics pipeline creation information required by Metal into the vertex shader so we can later rebuild the pipeline. This allows us to correctly create pipelines from caches that were loaded from files. Signed-off-by: Aitor Camacho <aitor@lunarg.com> (cherry picked from commit `cdbf7242f3`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Aitor Camacho	358c8f257a	kk: Move gfx pipeline data to the info struct within kk_shader Makes it easier to serialize and add data specific to the gfx pipeline. Signed-off-by: Aitor Camacho <aitor@lunarg.com> (cherry picked from commit `99d8246d1c`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:22 +01:00
Aitor Camacho	6152bf1cfb	kk: Fix compute pipeline cache When deserializing the compute shader from a blob, we need to recreate the pipeline because the blob may have been loaded from file and therefore the reference to the Metal resource will be invalid. Signed-off-by: Aitor Camacho <aitor@lunarg.com> (cherry picked from commit `75f6f46c0f`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Aitor Camacho	024143cca4	kk: Correctly release pipeline handles at shader destroy The condition to release Metal pipelines incorrectly checks which shader stage we are destroying leading to leads when graphics pipelines had to be released. Signed-off-by: Aitor Camacho <aitor@lunarg.com> (cherry picked from commit `622ebba476`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Aitor Camacho	9a63c20469	kk: Fix shader uint32_t value serialization We need to write with blob_write_uint32 if we are using blob_read_uint32 Signed-off-by: Aitor Camacho <aitor@lunarg.com> (cherry picked from commit `15c0dd39fc`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Aitor Camacho	a3f872630b	kk: Fill pipelineUUID Signed-off-by: Aitor Camacho <aitor@lunarg.com> (cherry picked from commit `b350f059f5`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Natalie Vock	6f88b07e5d	radv: Initialize nir_lower_io_to_scalar progress variable The NIR_PASS macro only overwrites this when the pass actually makes progress. If the pass doesn't make progress, the variable stays uninitialized. Clang correctly spots this and warns about it. Cc: mesa-stable (cherry picked from commit `47e4a68a83`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Mike Blumenkrantz	641a3ea0d9	zink: fix broken compiler assert cc: mesa-stable (cherry picked from commit `44f2c40830`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Natalie Vock	c4bb652871	radv/rt: Only use ds_bvh_stack_rtn if the stack base is possible to encode The hardware only provides 13 bits for encoding the stack base (in dwords). That translates to the stack base being required to be below 8192 dwords, or 32kB. It's possible to exceed this - LDS is 64kB after all. Add an explicit check to make sure we don't end up with offsets that overflow the hw's address fields. This fixes Metro Exodus Enhanced Edition, which was using ray queries in a 1024-thread sized workgroup, resulting in exactly 64kB of LDS being required for the stack. This check isn't required for RT pipelines as we always use 32 or 64 wide workgroups with no other LDS used, so it's impossible to reach this stack base limit. Cc: mesa-stable (cherry picked from commit `59a397793e`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Olivia Lee	47caf527e3	hk: fix passthrough GS key invalidation Just seeing that a passthrough GS was already bound is not sufficient to know that it is a matching passthrough GS. If the application binds a new VS that requires a different passthrough GS key than the previous VS, then we need to bind a different passthrough GS. Fixes: `5bc8284816` ("hk: add Vulkan driver for Apple GPUs") Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Mary Guillemard <mary@mary.zone> (cherry picked from commit `e10f29399f`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Janne Grunau	3397d3995f	hk: Use aligned vector fill in hk_CmdFillBuffer if possible 30% faster with 16KB buffers, more than twice as fast with 8MB and larger buffers. (cherry picked from commit `651a321ee2`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Janne Grunau	1ce5b5b361	asahi: Implement clear_buffer using libagx_fill* Use either libagx_fill_uint4 or libagx_fill based of size and object alignment for clear_sizes which are a power of two up to 16. Reported fill rate for 256MB buffers on a M1 Ultra (G13D) in gpu-ratemeter is 355 GB/s for 16 byte aligned buffers and 155 GB/s for 4 byte aligned buffers. Signed-off-by: Janne Grunau <janne-fdr@jannau.net> (cherry picked from commit `5c2d62c030`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Janne Grunau	37a269e303	asahi: Use GPU for buffer copies in resource_copy_region() Use a compute shader to copy PIPE_BUFFERs. Based on hk's hk_cmd_copy(). For large copy sizes (>= 128MB) it achieves 3/4 of the available memory bandwidth on a M1 Ultra (G13D). `gpu-ratemeter gl.bufbw` reports ~625 GB/s for 256MB buffer size. Apple specifies the memory bandwidth of the M1 Ultra with 819.2 GB/s. Signed-off-by: Janne Grunau <j@jannau.net> (cherry picked from commit `3f5497ded8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Pavel Ondračka	0f21dc1bd4	mesa: implement FRAMEBUFFER_RENDERABLE internalformat query Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Erik Faye-Lund <erik-faye-lund@collabora.com> Cc: mesa-stable (cherry picked from commit `2b76f2e4a7`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Jianxun Zhang	372c7545e6	anv: Limit modifier disabling workaround to specific GTK versions The issue caused us to put a switch to disable (Xe2) drm modifers in `2418c91537` is fixed in GTK 4.20.3, so we can enable the modifiers with this and newer GTK releases. GTK https://gitlab.gnome.org/GNOME/gtk/-/merge_requests/9164: b2a42d5a6e Revert "vulkan: Wait for device to be idle before create/recreating swapchain" 270735a151 vulkan: Rework swapchain present implementation The hex values represent the GTK version range: [4.0.0, 4.20.2] for VK_MAKE_VERSION(), refer to: `f493f5c88d` Cc: mesa-stable Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `df7d333656`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Wei Hao	f60b93b454	radeonsi: fix threaded shader compilation finishing after context is destroyed Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `ec6d077351`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Ryan Zhang	96ee7156af	panvk: guard against NULL pointers to avoid crash Vkcts simulate_oom caselist try to alloc fail manual which caused the panvk crash. We should guard driver cannot access null pointor. Fixes: `598a8d9d11` ("panvk: Collect allocated push sets at the command level") Fixed: dEQP-VK.wsi.wayland.swapchain.simulate_oom.* Signed-off-by: Ryan Zhang <ryan.zhang@nxp.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> (cherry picked from commit `418e6c4ed9`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Lars-Ivar Hesselberg Simonsen	11db64a7d3	pan/genxml/v13: Fix HSR Prepass typo Fixes: `ece01443e1` ("pan/genxml: Add v13 definition") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `71500a32fa`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Lars-Ivar Hesselberg Simonsen	43b9a2ea5e	panvk: Fix dcd_flags1 dirty bit dcd_flags1 was not counted as dirty in case the color attachment map was updated. This could lead to an outdated value for render_target_mask. Fixes: `a4670a67e0` ("panvk/csf: Set the correct DCD_FLAGS_1.render_rarget_mask") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `75242b1862`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Pavel Ondračka	98e2234eb4	r300: align macro-tiled stride-addressed textures in X Odd macro-tile counts in X trigger flaky rendering/readback in parallel stress runs with macro-tiled NPOT textures (for example piglit draw-pixel-with-texture -auto -fbo). When a texture is macro-tiled and uses stride addressing, align the width to two macro tiles. This keeps the stride at an even number of macro tiles in X and avoids the corruption without disabling macrotiling. I was not able to find anything about this in the docs. Cc: mesa-stable (cherry picked from commit `0763fb947a`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00

1 2 3 4 5 ...

217678 commits