fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 04:58:05 +02:00

Author	SHA1	Message	Date
Eric Engestrom	3646899ffd	docs: add release notes for 26.0.2	2026-03-12 12:56:33 +01:00
Mike Blumenkrantz	5cf88188bd	egl/device: fix the fix for explicit sw rejection in non-sw EGL_PLATFORM=device Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details "explicit sw" means llvmpipe, which cannot be a real drm device. this requires also returning only a single device so as to avoid leaking non-sw drivers should fix LIBGL_ALWAYS_SOFTWARE=1 eglinfo Fixes: `8a339cdebc` ("egl: fix sw fallback rejection in non-sw EGL_PLATFORM=device") (cherry picked from commit `c9b2986607`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Job Noorman	1f89a0fb96	ir3: don't predicate vote_all/vote_any These get lowered to control flow which isn't allowed inside predicated blocks. Signed-off-by: Job Noorman <jnoorman@igalia.com> Fixes: `39088571f0` ("ir3: add support for predication") (cherry picked from commit `5e4a7d01fe`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Job Noorman	d78e309e4d	ir3: update context builder after ir3_get_predicate If we are currently inserting instructions after the src of the predicate conversion, uses of the predicate will be inserted before its def (the conversion). Fix this by updating the context builder to point to after the conversion. Signed-off-by: Job Noorman <jnoorman@igalia.com> Fixes: `fda91b49d7` ("ir3: refactor builders to use ir3_builder API") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/15043 (cherry picked from commit `f88e8b778d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Samuel Pitoiset	0e31cb83ce	radv: fix missing L2 cache invalidation with streamout on GFX12 COPY_DATA emitted from the CP isn't coherent with L2, in case the buffer filled size needs to be copied. This fixes rare and random flickering with Mafia 3 Definitive Edition on RDNA4. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14697 Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `d9420eed9e`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Sagar Ghuge	ada713b32f	anv: Fix Wa_14021821874, Wa_14018813551, Wa_14026600921 WA states that we need to allocate maximum number of stackIDs per DSS from RT_DISPATCH_GLOBALS to 2048. We can still throttle/control the CFE_STATE::StackID to be in range specified by the field. This does impact performance having CFE_STATE::stackIDs capped to 2K by default. More the outstanding ray queries, larger the working set and have more impact on cache hit rate. This affect performance on Xe2+ onwards: * Boundary Benchmark: 36.2% * Solar Bay extreme: 9.8% * Hitman world of assassination: 3.9% Fixes: `c1a44e8d43` ("anv: force StackIDControl value for Wa_14021821874") Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `cb423ee636`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Tapani Pälli	446fab4a4a	anv: add handling for Wa_14026600921 This is the Xe3 version of the earlier workaround. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `840e6e855b`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Tapani Pälli	77add2d8f2	intel/dev: update mesa_defs.json from workaround database Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `c75309c8f1`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Faith Ekstrand	7054ea6d45	pan/bi: Be more careful about bit sizes in b2f lowering Fixes: `21bdee7bcc` ("pan/bi: Switch to lower_bool_to_bitsize") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> (cherry picked from commit `08c437f644`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Faith Ekstrand	9c2b19219a	nir/lower_bool_to_bitsize: Make all bN_csel sources match Previously, we assumed that the selector for bcsel could be whatever, regardless of the bit sizes of the data and we'd just fix it in the back-end. This works okay for scalars but falls over the moment we vectorize because all our vector handling assumes bit sizes match. Since matching bit sizes is what the hardware wants anyway, it's better to do the right thing in NIR and hope copy-propagation can fold in conversions if needed. Unfortunately, copy prop isn't that smart yet so this does hurt a bit: Instrs: 1193679 -> 1198086 (+0.37%); split: -0.06%, +0.43% CodeSize: 11915136 -> 11950592 (+0.30%); split: -0.05%, +0.34% Full: 160985 -> 160941 (-0.03%); split: -0.04%, +0.01% Estimated normalized CVT cycles: 4456.938557000181 -> 4480.876069000186 (+0.54%); split: -0.13%, +0.67% Estimated normalized SFU cycles: 6350.9375 -> 6392.21875 (+0.65%) Estimated normalized Load/Store cycles: 205773.0 -> 205795.0 (+0.01%) Maximum number of threads: 12864 -> 12863 (-0.01%) Number of spill instructions: 22487 -> 22489 (+0.01%) Number of fill instructions: 52179 -> 52219 (+0.08%) Hurt shaders: google-meet-clvk/BgBlur google-meet-clvk/Relight parallel-rdp/small_subgroup parallel-rdp/small_uber_subgroup The proper solution here is to teach copy-prop about this stuff so that it can propagate swizzles into ALU ops when they're supported: https://gitlab.freedesktop.org/panfrost/mesa/-/issues/265 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14945 Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> (cherry picked from commit `3fd471dca5`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Faith Ekstrand	740734ac72	etnaviv: Call lower_bool_to_int32 not to_bitsize It calls both for some reason but never handles any other booleans than 32-bit. This was probably a mistake. Fixes: `e63a7882a0` ("etnaviv: call nir_lower_bool_to_bitsize") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> (cherry picked from commit `6fb3995659`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Mary Guillemard	f550eb1903	vulkan: Do not override the shader_flags in case of no task shader This should be doing a or and not an assign. This fixes issues on NVK with mesh stages on DGC. Signed-off-by: Mary Guillemard <mary@mary.zone> Fixes: `9308e8d90d` ("vulkan: Add generic graphics and compute VkPipeline implementations") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `8f2eeee7ba`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Antonino Maniscalco	7b2af9e15a	zink: don't care about generated gs output primitive Zink uses the output primitive of the last vertex stage when deciding the raster primitive. When we generate the gs the output primitive depends on the raster primitive. Not only does the generated gs output primitive have no value in chosing the raster primitive, it can also get us stuck with the last raster primitve which is of course incorrect. Ignore it for generated shaders. Cc: mesa-stable (cherry picked from commit `d526bbc29b`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Timothy Arceri	b5304ffef7	glx: guard glx_screen frontend_screen member Guards workaround code with the same conditions as glx_screen`s frontend_screen member. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Fixes: `67eeee43e0` ("driconf: add a way to override GLX_CONTEXT_RESET_ISOLATION_BIT_ARB") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/15021 (cherry picked from commit `bd42f62b0f`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Iván Briano	836a22d1a2	anv: don't try to fast clear D/S with multiview If multiview is enabled on the render pass, baseLayer and layerCount will be 0 and 1 respectively and throw us off. We can still fast clear if view_mask == 1, but anything else hits the BLORP_BATCH_NO_EMIT_DEPTH_STENCIL restriction. Fixes: `e488773b29` ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments") Signed-off-by: Iván Briano <ivan.briano@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> (cherry picked from commit `5d22f307d5`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Ian Romanick	2a2dba1bc7	elk/algebraic: Don't optimize SEL.L.SAT or SEL.G.SAT shader-db: Broadwell total instructions in shared programs: 18607516 -> 18607530 (<.01%) instructions in affected programs: 2095 -> 2109 (0.67%) helped: 0 / HURT: 8 total cycles in shared programs: 955704436 -> 955702925 (<.01%) cycles in affected programs: 34299 -> 32788 (-4.41%) helped: 2 / HURT: 6 All Haswell and older platforms had similar results. (Haswell shown) total instructions in shared programs: 16989200 -> 16989201 (<.01%) instructions in affected programs: 461 -> 462 (0.22%) helped: 0 / HURT: 1 total cycles in shared programs: 946537070 -> 946537035 (<.01%) cycles in affected programs: 16378 -> 16343 (-0.21%) helped: 1 / HURT: 0 Test: piglit!1100 Reported-by: Georg Lehmann Fixes: `ca675b73d3` ("i965/fs: Optimize saturating SEL.L(E) with imm val >= 1.0.") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> (cherry picked from commit `64c60582b5`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Ian Romanick	829e5ccc84	brw/algebraic: Don't optimize SEL.L.SAT or SEL.G.SAT This optimization was added in October 2013, and the error was only just now discovered. Removing the SEL.G.SAT optimization affected zero shader-db shaders, and it affected 9 fossil-db shaders for instruction size only. I haven't checked to see if any of the hurt shaders are helped by !39987. shader-db: All Intel platforms had similar results. (Lunar Lake shown) total instructions in shared programs: 17093041 -> 17093055 (<.01%) instructions in affected programs: 2072 -> 2086 (0.68%) helped: 0 / HURT: 8 total cycles in shared programs: 876739578 -> 876739154 (<.01%) cycles in affected programs: 18946 -> 18522 (-2.24%) helped: 2 / HURT: 6 fossil-db: Lunar Lake Totals: Instrs: 906230557 -> 906240487 (+0.00%); split: -0.00%, +0.00% CodeSize: 14498856128 -> 14499003168 (+0.00%); split: -0.00%, +0.00% Send messages: 40667184 -> 40667205 (+0.00%); split: -0.00%, +0.00% Cycle count: 104068494103 -> 104068561943 (+0.00%); split: -0.00%, +0.00% Max live registers: 189570192 -> 189570204 (+0.00%); split: -0.00%, +0.00% Max dispatch width: 48157648 -> 48157552 (-0.00%) Non SSA regs after NIR: 139823587 -> 139823016 (-0.00%); split: -0.00%, +0.00% Totals from 9172 (0.46% of 1985212) affected shaders: Instrs: 10774709 -> 10784639 (+0.09%); split: -0.00%, +0.09% CodeSize: 177868384 -> 178015424 (+0.08%); split: -0.08%, +0.17% Send messages: 311154 -> 311175 (+0.01%); split: -0.00%, +0.01% Cycle count: 232471392 -> 232539232 (+0.03%); split: -0.15%, +0.18% Max live registers: 1243549 -> 1243561 (+0.00%); split: -0.00%, +0.01% Max dispatch width: 196672 -> 196576 (-0.05%) Non SSA regs after NIR: 509663 -> 509092 (-0.11%); split: -0.19%, +0.08% Test: piglit!1100 Reported-by: Georg Lehmann Fixes: `ca675b73d3` ("i965/fs: Optimize saturating SEL.L(E) with imm val >= 1.0.") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> (cherry picked from commit `6c6c6ce054`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Eric R. Smith	63a6e0ffc9	pco: fix a typo in the check for optimization looping The count isn't incremented anywhere else. Signed-off-by: Eric R. Smith <eric.smith@collabora.com> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Fixes: `f1b24267d2` ("pco: rework nir processing and passes") (cherry picked from commit `8521051cfa`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Pavel Ondračka	eea697b179	r300: disable clip-discard watermark for triangles Commit `0d4aa5f55f` introduced the watermark to optimize the guardband state changes and always computed new_distance as MAX2(distance, watermark). That is correct for point/line paths where distance > 0, but it keeps a non-zero discard distance alive when the next draw sets distance = 0 (triangles). This leaks wide point/line clip-discard state into later triangle draws and can clip away large parts of geometry (as observed in Sauerbraten). Only apply the watermark when distance > 0 and reset it to zero otherwise so triangle draws disable clip-discard as intended. Fixes: `0d4aa5f55f` ("r300: pop-free clipping") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14959 (cherry picked from commit `ce33f82f83`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Samuel Pitoiset	ecb7bf7b68	radv: fix local invocation index for mesh/task and quad derivatives on GFX12 It must be lowered. This fixes dEQP-VK.spirv_assembly.instruction.compute.compute_shader_derivatives.{mesh,task}.*. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `3c4cb16159`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Samuel Pitoiset	f858d2238e	radv: fix a GPU hang with PS epilogs and secondary command buffers If the secondary changes the fragment output state and if the same PS epilog used before ExecuteCommands() is re-bind immediately after that call, the PS epilog state wouldn't be re-emitted. Apply the same change for VS prologs, although the logic is slightly different and the bug shouldn't occur. The whole logic of secondaries should be completely rewritten because it's definitely not robust. This fixes a GPU hang in Where Winds Meet, see https://github.com/doitsujin/dxvk/issues/5436. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `1a00587c44`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Yiwei Zhang	2b6e7f0be2	lvp: avoid advertising dmabuf support for kms_swrast Lavapipe relies on true udmabuf support for dmabuf export allocation. This changes aligns the behavior with both llvmpipe_allocate_memory_fd and llvmpipe_import_memory_fd. Fixes: `7d0a631f20` ("llvmpipe: export dmabuf caps for kms_swrast") Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> (cherry picked from commit `5ab8c8a439`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Mel Henning	60e29a07c0	driconf: force_vk_vendor on No Man's Sky + NVK Cc: mesa-stable Reviewed-by: Mary Guillemard <mary@mary.zone> (cherry picked from commit `bfde63e4d8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Georg Lehmann	8f6c3dcc90	nir/opt_algebraic: fix frsq clamp pattern This is not NaN correct. And also make the pattern 32bit only because the constant is hard coded FLT_MAX. Fixes: `780b5c1037` ("nir/algebraic: Simplify some Inf and NaN avoidance code") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `ab773fc5d4`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Danylo Piliaiev	4a4a86390b	tu: Don't read .patch_input_gmem of unused attachment There was duplicated code to set unscaled_input_fragcoord and a read from VK_ATTACHMENT_UNUSED attachment, which incorrectly updated builder->unscaled_input_fragcoord. ubsan: tu_pipeline.cc:4734:44: runtime error: load of value 127, which is not a valid value for type 'bool' Seen in: dEQP-VK.renderpasses.renderpass1.custom_resolve.monolithic.stencil_only_s8 Fixes: `97da0a7734` ("tu: Rewrite to use common Vulkan dynamic state") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> (cherry picked from commit `81a76be861`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Danylo Piliaiev	ace5f6c88d	tu: Store gmem attachments after custom resolve in dyn RP For dynamic renderpass we created a fake second subpass, which would is used by CmdBeginCustomResolveEXT, however CmdBeginCustomResolveEXT doesn't trigger tile stores, but attachments didn't know they should be stored after fake custom resolve subpass. Fixes: `520e3f3a47` ("tu: Implement VK_EXT_custom_resolve") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> (cherry picked from commit `67c54c4465`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Caio Oliveira	8355670805	nir: Fix constant folding for iadd_sat Use INT_MIN instead of INT_MAX for underflow. Fixes: `cc4b50b023` ("nir/opcodes: use u_overflow to fix incorrect checks") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pelloux@gmail.com> (cherry picked from commit `da57fbfb07`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Connor Abbott	725626858d	tu: Fix setting will_be_resolved with MSRTSS We were setting it on the user's attachments, which become resolve/unresolve attachments, but it should be set on the color and depth/stencil attachments. Cc: mesa-stable (cherry picked from commit `d0be4ab2ab`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Connor Abbott	9a361c3801	tu: Set polygon mode when blitting Noticed by inspection. Cc: mesa-stable (cherry picked from commit `1d167ffe77`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Yiwei Zhang	b88c8f37e4	pan: fix to not clear out of bitset range Fixes: `617f0562bb` ("pan: Use bitset instead of bool array in bi_find_loop_blocks") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> (cherry picked from commit `ec24d1afb6`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Lucas Fryzek	d7ee1e68df	vulkan/wsi: Check that xshm can be attached Cc: mesa-stable Co-authored-by: Carlos Lopez <clopez@igalia.com> (cherry picked from commit `4933e60bc2`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Lucas Fryzek	5f4eccf1fb	glx: Check that xshm can be attached Cc: mesa-stable Co-authored-by: Carlos Lopez <clopez@igalia.com> (cherry picked from commit `a67af81944`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Lucas Fryzek	2c4c7fbfa9	egl/dri: Check that xshm can be attached Cc: mesa-stable Co-authored-by: Carlos Lopez <clopez@igalia.com> (cherry picked from commit `5f481dd89d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Lucas Fryzek	23b88ba221	x11: Add helper util to check for xshm support Cc: mesa-stable (cherry picked from commit `9e1671dea9`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Lucas Fryzek	8d313e5d1c	drisw: Properly mark shmid as -1 when alloc fails Cc: mesa-stable (cherry picked from commit `b93bf19d94`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Timothy Arceri	681de5a641	st/glsl_to_nir: update state var locations earlier We need to update the state var locations before the st_serialize_base_nir() calls otherwise _mesa_optimize_state_parameters() can alter params such that variants wont be able to find the correct match when calling _mesa_lookup_state_param_idx(). Prior to `891d46f5` this worked because after failing to match we would end up adding additional params back in that we had just attempted to optimise. Fixes: `a6fcc2835e` (" st/glsl_to_nir: make sure the variant has the correct locations set") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14837 Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> (cherry picked from commit `6c60f423b3`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Timothy Arceri	0edb7039cb	mesa/st: use same path for setting state ref locations After the fix in `a6fcc2835e` we can now take the same path whether allow_st_finalize_nir_twice is set or not. Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `b59c3ac82a`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Caio Oliveira	b2a34da82f	spirv: Fix spec constant to handle Select for non-native floats There was an assumption that if the instruction had non-native float as a source, the first source would have such type. This doesn't hold for Select, and the code failed in two ways - The boolean source of Select was being converted to the non-native float type. - The loop that resolves the bit-size for unsized operands would trip at `assert(i == 0)` because Select has more than one source. Re-organize the code to track the types of the sources independently, and fix both issues above. Fixes: `90e1b12890` ("spirv: Add bfloat16 support to SpecConstantOp") Fixes: `51d3c4c889` ("spirv: support float8 spec constant op") Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> (cherry picked from commit `6affcb43a7`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Caio Oliveira	4588b025c8	spirv: Pull constant source fixup to the existing loop Backport-to: 26.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> (cherry picked from commit `b0c3b20bff`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Caio Oliveira	0775d0f1b5	spirv: Refactor ALU opcode translation to take bit sizes Only used by Convert operations, so just pass 0 from callers that are not Convert and clarify that in the code. Backport-to: 26.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> (cherry picked from commit `1c3c987d5c`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Timothy Arceri	a66a9280fb	glsl: add workaround for MDK2 HD Allows a shader to compile that uses an embedded struct declaration which are not allowed in glsl 1.20+ Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14986 (cherry picked from commit `f109bfc3f1`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Rhys Perry	1d66a995ce	nir/range_analysis: set deleted key If (uintptr_t)&deleted_key is small enough, inserting entries into the hash table might not work correctly. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 26.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `c0079e09ca`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Ian Romanick	0d52c7941e	brw: Also check for ADDRESS file in update_for_reads Like accumulators and ARF address registers, the virtual address registers are not tracked in a way the defs analysis can know about. This could actually be fixed, but that is future work. Fixes: `b110b06447` ("brw: introduce a new register type for the address register") Suggested-by: Lionel Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `8624da56ee`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Ian Romanick	815691378b	brw: Use brw_reg_is_arf in update_for_reads brw_reg::nr encodes both which ARF it is and which instance of that ARF. In other words, nr for acc0 and acc2 have some bits that say BRW_ARF_ACCUMULATOR and some bits that say 0 vs 2. The previous test would only detect acc0. Fixes: `0d144821f0` ("intel/brw: Add a new def analysis pass") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `366410e913`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Ian Romanick	f21bc439a1	brw: Don't mark_invalid in update_for_reads for non-VGRF destination This can occur if NULL or an accumulator is an explicit destination. update_for_reads still needs to process the sources. v2: Pass a brw_reg to ::mark_invalid, and do the VGRF check in that one place. Fixes: `0d144821f0` ("intel/brw: Add a new def analysis pass") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `a548466186`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Jose Maria Casanova Crespo	31ea1923de	v3d: reject fast TLB blit when RT formats don't match v3d_tlb_blit_fast includes the blit onto a pending job that writes to the source resource. The TLB data is already unpacked according to the job's RT format, so storing it with a different RT format performs a channel reinterpretation rather than a raw byte copy, corrupting the data. So when copying from RGB10_A2UI to RG16UI with glCopyImageSubData, the copy_image path remaps both formats to R16G16_UNORM for a raw 32-bit copy. The fast TLB blit found the pending clear job (RGB10_A2UI, 4 channels: 10-10-10-2) and stored its TLB data as RG16UI (2 channels: 16-16), writing the unpacked 10-bit R and G channel values into 16-bit fields instead of preserving the raw packed bits. Previous internal_type/bpp check was insufficient: both RGB10_A2UI and RG16UI share internal_type=16UI and the source bpp (64) exceeds the destination bpp (32), but their channel layouts are different. Add a check that the job's source surface RT format matches the blit destination RT format before allowing the fast path. Fixes: `66de8b4b5c` ("v3d: add a faster TLB blit path") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> (cherry picked from commit `5454221cfb`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Marek Olšák	f7d391f851	ac: set the correct number of Z planes for ALLOW_EXPCLEAR This is an old driver bug that could cause Z corruption on gfx8-11.5. v2: handle allow_expclear differently Cc: mesa-stable Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> (v1) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (v2) (cherry picked from commit `4cfe08e583`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Karol Herbst	d29063d4f2	nir: fix nir_round_int_to_float for fp16 fp16 has quite the limited value range and with bigger integers nir_round_int_to_float might return Inf where it shouldn't depending on the rounding mode. Fixes conversions half_rt[npz]_(u)?(int\|long) CL CTS tests. Cc: mesa-stable Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Rob Clark <rob.clark@oss.qualcomm.com> (cherry picked from commit `e1ed7de274`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Karol Herbst	3d8ff40d58	nir: fix nir_alu_type_range_contains_type_range for fp16 to int The special value "Inf" doesn't fit into an int and therefore we have to clamp regardless of whether all the other values would fit. And because f2u32 and f2u64 define out-of-range conversions as UB in nir, we need to clamp. This change should have no effect for non saturating conversions. Fixes "conversions long_sat_*half" CL CTS tests Cc: mesa-stable Suggested-by: Rob Clark <rob.clark@oss.qualcomm.com> Reviewed-by: Rob Clark <rob.clark@oss.qualcomm.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `8e8fb2ebaa`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Boris Brezillon	7ee55d3a5f	pan/kmod: Allow mmap() on foreign buffers If the BO comes from a different subsystem (args.extra_flags & DRM_PANTHOR_BO_IS_IMPORTED), we should normally add extra DMA_BUF_IOCTL_SYNC calls around CPU accesses to ensure the CPU mapping consistency, but this is something we never worried about (we've always assumed exporters were exposing uncached mappings with NOP {begin,end}_cpu_access() implementations), and it worked fine until now. The long term plan is to hook up DMA_BUF_IOCTL_SYNC, but this requires more work, and we need a quick fix that can be backported easily, hence this revert+FIXME. Fixes: `b5e47ba894` ("pan/kmod: Add new helpers to sync BO CPU mappings") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14963 Closes: https://gitlab.freedesktop.org/panfrost/mesa/-/issues/282 Closes: https://gitlab.freedesktop.org/wayland/weston/-/issues/1101 Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> (cherry picked from commit `30f1d5bab9`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00

... 2 3 4 5 6 ...

217920 commits