fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 07:08:04 +02:00

Author	SHA1	Message	Date
Olivia Lee	47caf527e3	hk: fix passthrough GS key invalidation Just seeing that a passthrough GS was already bound is not sufficient to know that it is a matching passthrough GS. If the application binds a new VS that requires a different passthrough GS key than the previous VS, then we need to bind a different passthrough GS. Fixes: `5bc8284816` ("hk: add Vulkan driver for Apple GPUs") Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Mary Guillemard <mary@mary.zone> (cherry picked from commit `e10f29399f`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Janne Grunau	3397d3995f	hk: Use aligned vector fill in hk_CmdFillBuffer if possible 30% faster with 16KB buffers, more than twice as fast with 8MB and larger buffers. (cherry picked from commit `651a321ee2`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Janne Grunau	1ce5b5b361	asahi: Implement clear_buffer using libagx_fill* Use either libagx_fill_uint4 or libagx_fill based of size and object alignment for clear_sizes which are a power of two up to 16. Reported fill rate for 256MB buffers on a M1 Ultra (G13D) in gpu-ratemeter is 355 GB/s for 16 byte aligned buffers and 155 GB/s for 4 byte aligned buffers. Signed-off-by: Janne Grunau <janne-fdr@jannau.net> (cherry picked from commit `5c2d62c030`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Janne Grunau	37a269e303	asahi: Use GPU for buffer copies in resource_copy_region() Use a compute shader to copy PIPE_BUFFERs. Based on hk's hk_cmd_copy(). For large copy sizes (>= 128MB) it achieves 3/4 of the available memory bandwidth on a M1 Ultra (G13D). `gpu-ratemeter gl.bufbw` reports ~625 GB/s for 256MB buffer size. Apple specifies the memory bandwidth of the M1 Ultra with 819.2 GB/s. Signed-off-by: Janne Grunau <j@jannau.net> (cherry picked from commit `3f5497ded8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Pavel Ondračka	0f21dc1bd4	mesa: implement FRAMEBUFFER_RENDERABLE internalformat query Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Erik Faye-Lund <erik-faye-lund@collabora.com> Cc: mesa-stable (cherry picked from commit `2b76f2e4a7`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Jianxun Zhang	372c7545e6	anv: Limit modifier disabling workaround to specific GTK versions The issue caused us to put a switch to disable (Xe2) drm modifers in `2418c91537` is fixed in GTK 4.20.3, so we can enable the modifiers with this and newer GTK releases. GTK https://gitlab.gnome.org/GNOME/gtk/-/merge_requests/9164: b2a42d5a6e Revert "vulkan: Wait for device to be idle before create/recreating swapchain" 270735a151 vulkan: Rework swapchain present implementation The hex values represent the GTK version range: [4.0.0, 4.20.2] for VK_MAKE_VERSION(), refer to: `f493f5c88d` Cc: mesa-stable Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (cherry picked from commit `df7d333656`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Wei Hao	f60b93b454	radeonsi: fix threaded shader compilation finishing after context is destroyed Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `ec6d077351`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Ryan Zhang	96ee7156af	panvk: guard against NULL pointers to avoid crash Vkcts simulate_oom caselist try to alloc fail manual which caused the panvk crash. We should guard driver cannot access null pointor. Fixes: `598a8d9d11` ("panvk: Collect allocated push sets at the command level") Fixed: dEQP-VK.wsi.wayland.swapchain.simulate_oom.* Signed-off-by: Ryan Zhang <ryan.zhang@nxp.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> (cherry picked from commit `418e6c4ed9`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Lars-Ivar Hesselberg Simonsen	11db64a7d3	pan/genxml/v13: Fix HSR Prepass typo Fixes: `ece01443e1` ("pan/genxml: Add v13 definition") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `71500a32fa`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Lars-Ivar Hesselberg Simonsen	43b9a2ea5e	panvk: Fix dcd_flags1 dirty bit dcd_flags1 was not counted as dirty in case the color attachment map was updated. This could lead to an outdated value for render_target_mask. Fixes: `a4670a67e0` ("panvk/csf: Set the correct DCD_FLAGS_1.render_rarget_mask") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `75242b1862`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Pavel Ondračka	98e2234eb4	r300: align macro-tiled stride-addressed textures in X Odd macro-tile counts in X trigger flaky rendering/readback in parallel stress runs with macro-tiled NPOT textures (for example piglit draw-pixel-with-texture -auto -fbo). When a texture is macro-tiled and uses stride addressing, align the width to two macro tiles. This keeps the stride at an even number of macro tiles in X and avoids the corruption without disabling macrotiling. I was not able to find anything about this in the docs. Cc: mesa-stable (cherry picked from commit `0763fb947a`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Yiwei Zhang	7c0b97be73	venus: workaround a gcc-15 dead store elimination (DSE) bug No issue with clang or gcc-14.x (or earlier versions). The issue only shows up since gcc-15.1. The compiler somehow fails to consider those cs helpers dereferencing the pointer from the pNext chain for reads, and thus has falsely optimized away the pNext store. This change works around this with a no-op memory clobber. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13242 Cc: mesa-stable (cherry picked from commit `b0397b967d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Timothy Arceri	6fb7a07c79	st/glsl_to_nir: make sure the variant has the correct locations set For drivers that set allow_st_finalize_nir_twice locations are set when the variable is created. But for variants here we update the locations in case parameter opt pass or something else changed the location. Fixes: `891d46f517` ("st/glsl_to_nir: dont add duplicate state tokens") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14837 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> (cherry picked from commit `a6fcc2835e`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Timothy Arceri	d7fa6a4deb	mesa: add _mesa_lookup_state_param_idx() helper This will be used in the following patch. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> (cherry picked from commit `c3aae0714c`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:21 +01:00
Ian Romanick	0710d042db	elk: Call nir_opt_algebraic_late in elk_postprocess_nir Make sure that lowering undone in elk_nir_optimize are reapplied. No shader-db or fossil-db changes on any Intel platform. This is most likely to impact either Gfx8 on ANV or Gfx7.5 on HASVK. I don't fossil-db test either of those platforms. I tried doing a similar thing here as is done in BRW (previous commit), but that caused a couple Haswell shaders to fall off a performance cliff: total spills in shared programs: 8247 -> 8311 (0.78%) spills in affected programs: 6 -> 70 (1066.67%) helped: 0 / HURT: 2 total fills in shared programs: 8558 -> 8910 (4.11%) fills in affected programs: 6 -> 358 (5866.67%) helped: 0 / HURT: 2 Fixes: `442daeb54a` ("nir/opt_algebraic: use fcanonicalize") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `df704bd38e`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Ian Romanick	1f65b768a1	brw: Call nir_opt_algebraic_late later in brw_postprocess_nir_opts Move the call to nir_opt_algebraic_late after the last time brw_nir_optimize might be called. nir_opt_algebraic_distribute_src_mods works together with the late algebraic optimizations, so move it also. shader-db: Lunar Lake total instructions in shared programs: 17081222 -> 17080842 (<.01%) instructions in affected programs: 419931 -> 419551 (-0.09%) helped: 545 / HURT: 826 total cycles in shared programs: 878437752 -> 879236226 (0.09%) cycles in affected programs: 506003142 -> 506801616 (0.16%) helped: 3091 / HURT: 3189 LOST: 18 GAINED: 16 Meteor Lake and DG2 had similar results. (Meteor Lake shown) total instructions in shared programs: 19994270 -> 19993231 (<.01%) instructions in affected programs: 490499 -> 489460 (-0.21%) helped: 660 / HURT: 800 total cycles in shared programs: 882498776 -> 882834186 (0.04%) cycles in affected programs: 477858602 -> 478194012 (0.07%) helped: 3458 / HURT: 3564 total fills in shared programs: 4371 -> 4370 (-0.02%) fills in affected programs: 7 -> 6 (-14.29%) helped: 1 / HURT: 0 LOST: 28 GAINED: 10 Tiger Lake, Ice Lake, and Skylake had similar results. (Tiger Lake shown) total instructions in shared programs: 19943849 -> 19942782 (<.01%) instructions in affected programs: 467384 -> 466317 (-0.23%) helped: 655 / HURT: 796 total cycles in shared programs: 860085674 -> 861410289 (0.15%) cycles in affected programs: 426900998 -> 428225613 (0.31%) helped: 3250 / HURT: 3441 LOST: 19 GAINED: 14 fossil-db: Lunar Lake Totals: Instrs: 926472091 -> 926204838 (-0.03%); split: -0.04%, +0.01% CodeSize: 14845921056 -> 14842776112 (-0.02%); split: -0.10%, +0.08% Send messages: 41459570 -> 41459574 (+0.00%); split: -0.00%, +0.00% Cycle count: 104481085069 -> 104583692712 (+0.10%); split: -0.14%, +0.24% Spill count: 3454651 -> 3457340 (+0.08%); split: -0.15%, +0.23% Fill count: 4958779 -> 4958487 (-0.01%); split: -0.46%, +0.45% Max live registers: 193805970 -> 193839002 (+0.02%); split: -0.00%, +0.02% Max dispatch width: 49114416 -> 49113776 (-0.00%); split: +0.01%, -0.01% Non SSA regs after NIR: 142953905 -> 142800740 (-0.11%); split: -0.12%, +0.01% Totals from 420256 (20.80% of 2020128) affected shaders: Instrs: 448571327 -> 448304074 (-0.06%); split: -0.09%, +0.03% CodeSize: 7312002800 -> 7308857856 (-0.04%); split: -0.21%, +0.17% Send messages: 17716494 -> 17716498 (+0.00%); split: -0.00%, +0.00% Cycle count: 52178854998 -> 52281462641 (+0.20%); split: -0.28%, +0.48% Spill count: 2945654 -> 2948343 (+0.09%); split: -0.17%, +0.26% Fill count: 4404768 -> 4404476 (-0.01%); split: -0.51%, +0.51% Max live registers: 60875448 -> 60908480 (+0.05%); split: -0.01%, +0.06% Max dispatch width: 9455280 -> 9454640 (-0.01%); split: +0.04%, -0.04% Non SSA regs after NIR: 60542740 -> 60389575 (-0.25%); split: -0.28%, +0.02% Meteor Lake and DG2 had similar results. (Meteor Lake shown) Totals: Instrs: 1000081384 -> 999726726 (-0.04%); split: -0.05%, +0.01% CodeSize: 16764458080 -> 16761624256 (-0.02%); split: -0.09%, +0.07% Subgroup size: 27599528 -> 27599544 (+0.00%) Send messages: 45538933 -> 45538951 (+0.00%); split: -0.00%, +0.00% Cycle count: 93303830912 -> 93370118192 (+0.07%); split: -0.19%, +0.26% Spill count: 3739306 -> 3739719 (+0.01%); split: -0.22%, +0.23% Fill count: 5089719 -> 5083626 (-0.12%); split: -0.56%, +0.44% Max live registers: 122041364 -> 122055848 (+0.01%); split: -0.00%, +0.01% Max dispatch width: 38117296 -> 38127200 (+0.03%); split: +0.06%, -0.03% Non SSA regs after NIR: 164296197 -> 164299306 (+0.00%); split: -0.01%, +0.01% Totals from 338754 (14.82% of 2285730) affected shaders: Instrs: 452723479 -> 452368821 (-0.08%); split: -0.10%, +0.03% CodeSize: 7861878032 -> 7859044208 (-0.04%); split: -0.19%, +0.16% Subgroup size: 16 -> 32 (+100.00%) Send messages: 17050010 -> 17050028 (+0.00%); split: -0.00%, +0.00% Cycle count: 52881801997 -> 52948089277 (+0.13%); split: -0.33%, +0.46% Spill count: 3271458 -> 3271871 (+0.01%); split: -0.25%, +0.26% Fill count: 4628422 -> 4622329 (-0.13%); split: -0.61%, +0.48% Max live registers: 30738902 -> 30753386 (+0.05%); split: -0.01%, +0.06% Max dispatch width: 4787264 -> 4797168 (+0.21%); split: +0.47%, -0.26% Non SSA regs after NIR: 61748026 -> 61751135 (+0.01%); split: -0.03%, +0.03% Tiger Lake Totals: Instrs: 1011068379 -> 1010977290 (-0.01%); split: -0.03%, +0.02% CodeSize: 14197751744 -> 14197683040 (-0.00%); split: -0.07%, +0.07% Send messages: 46431228 -> 46431220 (-0.00%); split: -0.00%, +0.00% Cycle count: 85066526419 -> 85085088071 (+0.02%); split: -0.16%, +0.18% Spill count: 3853750 -> 3855185 (+0.04%); split: -0.15%, +0.19% Fill count: 6716746 -> 6719594 (+0.04%); split: -0.25%, +0.29% Max live registers: 122307387 -> 122326083 (+0.02%); split: -0.00%, +0.02% Max dispatch width: 38009632 -> 38003280 (-0.02%); split: +0.03%, -0.05% Non SSA regs after NIR: 158403572 -> 158415390 (+0.01%); split: -0.01%, +0.02% Totals from 277728 (12.17% of 2281577) affected shaders: Instrs: 349206856 -> 349115767 (-0.03%); split: -0.07%, +0.05% CodeSize: 5042621104 -> 5042552400 (-0.00%); split: -0.20%, +0.20% Send messages: 13132243 -> 13132235 (-0.00%); split: -0.00%, +0.00% Cycle count: 36183327716 -> 36201889368 (+0.05%); split: -0.38%, +0.43% Spill count: 2210072 -> 2211507 (+0.06%); split: -0.26%, +0.33% Fill count: 4188439 -> 4191287 (+0.07%); split: -0.39%, +0.46% Max live registers: 24956695 -> 24975391 (+0.07%); split: -0.02%, +0.09% Max dispatch width: 3948832 -> 3942480 (-0.16%); split: +0.32%, -0.48% Non SSA regs after NIR: 45616425 -> 45628243 (+0.03%); split: -0.04%, +0.06% Ice Lake Totals: Instrs: 1009584306 -> 1009411757 (-0.02%); split: -0.02%, +0.01% CodeSize: 12593466880 -> 12592958096 (-0.00%); split: -0.01%, +0.01% Send messages: 47274203 -> 47274171 (-0.00%); split: -0.00%, +0.00% Cycle count: 84920281455 -> 84914027301 (-0.01%); split: -0.05%, +0.04% Spill count: 2988523 -> 2986191 (-0.08%); split: -0.14%, +0.07% Fill count: 5296078 -> 5288737 (-0.14%); split: -0.21%, +0.07% Max live registers: 125429384 -> 125444786 (+0.01%); split: -0.00%, +0.02% Max dispatch width: 41269072 -> 41267312 (-0.00%); split: +0.03%, -0.03% Non SSA regs after NIR: 163223895 -> 163236623 (+0.01%); split: -0.01%, +0.02% Totals from 243818 (10.45% of 2334244) affected shaders: Instrs: 296953759 -> 296781210 (-0.06%); split: -0.08%, +0.02% CodeSize: 3643224480 -> 3642715696 (-0.01%); split: -0.04%, +0.03% Send messages: 11518671 -> 11518639 (-0.00%); split: -0.00%, +0.00% Cycle count: 33065548412 -> 33059294258 (-0.02%); split: -0.13%, +0.11% Spill count: 1346515 -> 1344183 (-0.17%); split: -0.32%, +0.15% Fill count: 2537906 -> 2530565 (-0.29%); split: -0.43%, +0.14% Max live registers: 21476776 -> 21492178 (+0.07%); split: -0.02%, +0.09% Max dispatch width: 3727288 -> 3725528 (-0.05%); split: +0.31%, -0.35% Non SSA regs after NIR: 41050474 -> 41063202 (+0.03%); split: -0.04%, +0.07% Skylake Totals: Instrs: 513573157 -> 513462971 (-0.02%); split: -0.02%, +0.00% CodeSize: 5950280672 -> 5950001392 (-0.00%); split: -0.01%, +0.00% Send messages: 24909757 -> 24909758 (+0.00%); split: -0.00%, +0.00% Cycle count: 57636102242 -> 57634726342 (-0.00%); split: -0.03%, +0.03% Spill count: 627286 -> 627241 (-0.01%); split: -0.01%, +0.00% Fill count: 837888 -> 837804 (-0.01%); split: -0.01%, +0.00% Max live registers: 87272271 -> 87284192 (+0.01%); split: -0.00%, +0.02% Max dispatch width: 32278832 -> 32271800 (-0.02%); split: +0.02%, -0.04% Non SSA regs after NIR: 87387713 -> 87387614 (-0.00%); split: -0.00%, +0.00% Totals from 177432 (10.30% of 1722906) affected shaders: Instrs: 127170648 -> 127060462 (-0.09%); split: -0.10%, +0.01% CodeSize: 1443406368 -> 1443127088 (-0.02%); split: -0.03%, +0.01% Send messages: 5444220 -> 5444221 (+0.00%); split: -0.00%, +0.00% Cycle count: 15423028495 -> 15421652595 (-0.01%); split: -0.10%, +0.10% Spill count: 235844 -> 235799 (-0.02%); split: -0.03%, +0.01% Fill count: 333783 -> 333699 (-0.03%); split: -0.03%, +0.01% Max live registers: 13765573 -> 13777494 (+0.09%); split: -0.01%, +0.10% Max dispatch width: 3086880 -> 3079848 (-0.23%); split: +0.24%, -0.47% Non SSA regs after NIR: 17623772 -> 17623673 (-0.00%); split: -0.00%, +0.00% Fixes: `442daeb54a` ("nir/opt_algebraic: use fcanonicalize") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `11b96a84b0`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Ian Romanick	2874160ce2	brw: Call nir_opt_algebraic_late in brw_nir_create_raygen_trampoline Make sure that lowering undone in brw_nir_optimize are reapplied. No shader-db changes on any Intel platform. Why are there fossil-db changes on platforms that don't support ray tracing? Lunar Lake Totals: Instrs: 926636441 -> 926636313 (-0.00%); split: -0.00%, +0.00% Send messages: 41510729 -> 41510723 (-0.00%); split: -0.00%, +0.00% Cycle count: 104509492613 -> 104509490569 (-0.00%); split: -0.00%, +0.00% Max live registers: 193792922 -> 193792890 (-0.00%); split: -0.00%, +0.00% Non SSA regs after NIR: 150091934 -> 150092170 (+0.00%); split: -0.00%, +0.00% Totals from 10 (0.00% of 2020428) affected shaders: Instrs: 8142 -> 8014 (-1.57%); split: -3.14%, +1.57% Send messages: 192 -> 186 (-3.12%); split: -7.29%, +4.17% Cycle count: 131892 -> 129848 (-1.55%); split: -6.93%, +5.38% Max live registers: 1442 -> 1410 (-2.22%); split: -3.05%, +0.83% Non SSA regs after NIR: 950 -> 1186 (+24.84%); split: -26.95%, +51.79% Meteor Lake Totals: Instrs: 1000805547 -> 1000805543 (-0.00%); split: -0.00%, +0.00% Cycle count: 93131592265 -> 93131619619 (+0.00%); split: -0.00%, +0.00% Max live registers: 122081268 -> 122081244 (-0.00%); split: -0.00%, +0.00% Totals from 16 (0.00% of 2286241) affected shaders: Instrs: 18652 -> 18648 (-0.02%); split: -1.39%, +1.37% Cycle count: 369520 -> 396874 (+7.40%); split: -2.94%, +10.34% Max live registers: 1350 -> 1326 (-1.78%); split: -4.15%, +2.37% DG2 Totals: Instrs: 999834626 -> 999834651 (+0.00%); split: -0.00%, +0.00% Send messages: 45719398 -> 45719403 (+0.00%); split: -0.00%, +0.00% Cycle count: 93118238139 -> 93118269557 (+0.00%); split: -0.00%, +0.00% Max live registers: 122098944 -> 122098936 (-0.00%); split: -0.00%, +0.00% Non SSA regs after NIR: 169413734 -> 169413661 (-0.00%); split: -0.00%, +0.00% Totals from 13 (0.00% of 2286795) affected shaders: Instrs: 18799 -> 18824 (+0.13%); split: -1.04%, +1.18% Send messages: 492 -> 497 (+1.02%); split: -2.44%, +3.46% Cycle count: 352838 -> 384256 (+8.90%); split: -1.08%, +9.98% Max live registers: 1237 -> 1229 (-0.65%); split: -2.91%, +2.26% Non SSA regs after NIR: 2191 -> 2118 (-3.33%); split: -20.86%, +17.53% Tiger Lake Totals: Instrs: 1011816778 -> 1011816714 (-0.00%); split: -0.00%, +0.00% Send messages: 46515289 -> 46515285 (-0.00%); split: -0.00%, +0.00% Cycle count: 85148902406 -> 85148894668 (-0.00%); split: -0.00%, +0.00% Max live registers: 122362180 -> 122362172 (-0.00%); split: -0.00%, +0.00% Max dispatch width: 38036160 -> 38036176 (+0.00%) Non SSA regs after NIR: 160317521 -> 160317649 (+0.00%); split: -0.00%, +0.00% Totals from 6 (0.00% of 2282318) affected shaders: Instrs: 9204 -> 9140 (-0.70%); split: -1.43%, +0.74% Send messages: 258 -> 254 (-1.55%); split: -3.10%, +1.55% Cycle count: 287652 -> 279914 (-2.69%); split: -3.29%, +0.60% Max live registers: 552 -> 544 (-1.45%); split: -2.90%, +1.45% Max dispatch width: 48 -> 64 (+33.33%) Non SSA regs after NIR: 914 -> 1042 (+14.00%); split: -14.00%, +28.01% Ice Lake Totals: Instrs: 1012203285 -> 1012203249 (-0.00%); split: -0.00%, +0.00% Send messages: 47358859 -> 47358858 (-0.00%); split: -0.00%, +0.00% Cycle count: 85112165276 -> 85112171905 (+0.00%); split: -0.00%, +0.00% Max live registers: 125545002 -> 125544992 (-0.00%); split: -0.00%, +0.00% Max dispatch width: 41335696 -> 41335656 (-0.00%) Non SSA regs after NIR: 166448597 -> 166448602 (+0.00%); split: -0.00%, +0.00% Totals from 13 (0.00% of 2335519) affected shaders: Instrs: 16486 -> 16450 (-0.22%); split: -1.67%, +1.46% Send messages: 368 -> 367 (-0.27%); split: -4.89%, +4.62% Cycle count: 347643 -> 354272 (+1.91%); split: -1.34%, +3.25% Max live registers: 1104 -> 1094 (-0.91%); split: -3.80%, +2.90% Max dispatch width: 192 -> 152 (-20.83%) Non SSA regs after NIR: 2100 -> 2105 (+0.24%); split: -21.76%, +22.00% Skylake Totals: Instrs: 504548665 -> 504548057 (-0.00%); split: -0.00%, +0.00% Send messages: 24479148 -> 24479118 (-0.00%); split: -0.00%, +0.00% Cycle count: 57575198140 -> 57575179256 (-0.00%); split: -0.00%, +0.00% Max live registers: 85570671 -> 85570575 (-0.00%); split: -0.00%, +0.00% Non SSA regs after NIR: 85097646 -> 85098486 (+0.00%); split: -0.00%, +0.00% Totals from 22 (0.00% of 1703671) affected shaders: Instrs: 19866 -> 19258 (-3.06%); split: -3.72%, +0.66% Send messages: 464 -> 434 (-6.47%); split: -8.19%, +1.72% Cycle count: 250854 -> 231970 (-7.53%); split: -9.23%, +1.70% Max live registers: 2024 -> 1928 (-4.74%); split: -5.53%, +0.79% Non SSA regs after NIR: 2498 -> 3338 (+33.63%); split: -8.33%, +41.95% Fixes: `442daeb54a` ("nir/opt_algebraic: use fcanonicalize") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `5af0b8bd09`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Konstantin Seurer	3ffe4b257b	vulkan/cmd_queue: Fixup stride for multi draws Copying the draw infos packs them so the stride needs to be set to the struct size. cc: mesa-stable Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> (cherry picked from commit `be5ab80de1`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Ian Romanick	45ce75f3bc	nir: Use STACK_ARRAY instead of NIR_VLA The number of fields comes from the shader, so it could be a value large enough that using alloca would be problematic. Fixes: `c11833ab24` ("nir,spirv: Rework function calls") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `9017d37e84`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Ian Romanick	978fd42b4b	spirv: Use STACK_ARRAY instead of NIR_VLA The number of fields comes from the shader, so it could be a value large enough that using alloca would be problematic. Fixes: `2a023f30a6` ("nir/spirv: Add basic support for types") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `3da828d2dd`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Jesse Natalie	5048a2ed1c	meson: Include DirectX-Headers dependency for all VK Windows builds Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14839 Cc: mesa-stable Reviewed-by: Eric Engestrom <eric@igalia.com> (cherry picked from commit `f0066a3150`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Alyssa Rosenzweig	806f0a35a4	brw: drop buggy SLM optimization This was incorrect for OpenCL due to the possibility of variable shared memory existing despite shared_size == 0. Fortunately the optimization it was trying to do should be done in NIR via nir_opt_barrier_modes so we can just drop the brw code and move on with our merry lives. Fixes OpenCL tests on Iris: non_uniform_work_group non_uniform_3d_barriers basic async_strided_copy_local_to_global Cc: mesa-stable Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `bd5ebbb2f8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Anna Maniscalco	6278aa107a	freedreno/common: set has_astc_hdr true for a7xx targets Fixes: `dc07473524` ("freedreno/fdl: add astc hdr formats") Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> (cherry picked from commit `e959dd0dd7`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Daniel Schürmann	7fda785505	nir/clone: Fix cloning indirect call instructions Fixes: `bb40284f76` ('nir: Add indirect calls') (cherry picked from commit `88b4221519`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Samuel Pitoiset	a2ad1789fa	ac,radv,radeonsi: use correct swizzle/pitch for depth-only images with SDMA This fixes new VKCTS coverage dEQP-VK.api.copy_and_blit.core.use_after_copy.*. is_stencil isn't set for RadeonSI because it doesn't do SDMA copies with Z/S. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `1be4ffdff9`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Eric Engestrom	88e238de07	.pick_status.json: Mark `7dd7731ac7` as denominated Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Aitor Camacho	4229b57783	wsi/metal: Expose additional color spaces if instance extension enabled Caught through VVL test NegativeWsi.SwapchainImageFormatList. The test would try to create a swapchain with a color space from VK_EXT_swapchain_colorspace without enabling the extension. This is because wsi would expose those color spaces even when the extension was not enabled. Fixes: `fd045ac99c` ("wsi/metal: add support for color spaces") Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Signed-off-by: Aitor Camacho <aitor@lunarg.com> (cherry picked from commit `e6f118f12b`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Lionel Landwerlin	1994d93542	isl: fix 32bit math with 4GB buffer size Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `d956957153`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Lionel Landwerlin	af97f7fe38	anv: add missing constant cache invalidation for descriptor buffers A descriptor buffer promoted to push constants requires a constant cache invalidation if it is modified on the device. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `42b70cf05a`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Lionel Landwerlin	12da136c07	anv: fix nested command buffer relocations When executing 3 command buffers : vkCmdExecuteCommands(CB_B, CB_C); vkCmdExecuteCommands(CB_A, CB_B); vkQueueSubmit(CB_A); We're not transfering correctly the relocations of CB_C from CB_B to CB_A. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `e64889635c`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Konstantin Seurer	f8ce75c40c	radv: Fix setting the viewport for depth stencil FS resolves Fixes: `704fbbb` ("radv/meta: rework depth/stencil resolves using graphics") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `f574de2249`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Lionel Landwerlin	2abdb028dd	anv: flush render caches on first pipeline select Given a situation like this : - CB_A: begin, renderDepthA, end - CB_B: begin, computeA, barrier (depth), computeB, end The depth cache is not being flushed between renderDepthA & computeB because : - it's not flushed at the end of CB_A (it's not required) - when CB_B starts, we're still on GFX pipeline mode but do not flush render caches because pipeline mode is unknown - when barrier is CB_B is executed, we're already in compute pipeline mode and HW cannot flush depth. The fix is to flush RT/depth cached when switching from unknown pipeline mode any pipeline mode. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e6dae6ef5f` ("vulkan: Optimize implicit end_subpass barrier") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14816 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Tested-by: David Gow <david@davidgow.net> (cherry picked from commit `888ac904a3`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Juston Li	6dbd6ee94b	anv: set missing protected bit for protected depth/stencil surfaces This bit is set in mocs for other protected attachment types by anv_image_fill_surface_state() however was ommited for depth/stencil attachments here. Without the protected bit set, it causes heavy black artifacting when attaching a protected depth attachment image to a framebuffer. Fixes: `794b0496e9` ("anv: enable protected memory") Signed-off-by: Juston Li <justonli@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `f84ed620c2`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Matt Turner	2c250e6235	elk/cse: use copies in `operands_match` instead of in-place modification `operands_match` was modifying instruction source operands in-place (through the `elk_fs_reg *src` pointer member) and relying on a save/restore pattern to undo the modifications. Work on local copies instead, which is simpler and avoids mutating shared state in a comparison function. Fixes: `47c4b38540` ("i965/fs: Allow CSE to handle MULs with negated arguments.") (cherry picked from commit `14c65322e8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Matt Turner	03e6f285e5	elk/cse: fix `operands_match` corrupting non-IMM register data The MUL case in `operands_match` was reading and writing the `.f` union member unconditionally, even when the register's `.file != IMM`. In that case `.f` aliases the struct containing `.nr`/`.swizzle`/etc, so the `fabsf()` call could corrupt the `.nr` by clearing bit 31. Guard all `.f` accesses with `.file == IMM` checks. Fixes: `47c4b38540` ("i965/fs: Allow CSE to handle MULs with negated arguments.") (cherry picked from commit `93f39f87c4`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Matt Turner	2b221e5a1a	brw/cse: use copies in `operands_match` instead of in-place modification `operands_match` was modifying instruction source operands in-place (through the `brw_reg *src` pointer member) and relying on a save/restore pattern to undo the modifications. Work on local copies instead, which is simpler and avoids mutating shared state in a comparison function. Fixes: `47c4b38540` ("i965/fs: Allow CSE to handle MULs with negated arguments.") (cherry picked from commit `b302faad8b`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Matt Turner	14c7d820cd	brw/cse: fix `operands_match` corrupting non-IMM register data The MUL case in `operands_match` was reading and writing the `.f` union member unconditionally, even when the register's `.file != IMM`. In that case `.f` aliases the struct containing `.nr`/`.swizzle`/etc, so the `fabsf()` call could corrupt the `.nr` by clearing bit 31. Guard all `.f` accesses with `.file == IMM` checks. Fixes: `47c4b38540` ("i965/fs: Allow CSE to handle MULs with negated arguments.") (cherry picked from commit `f5e0f63216`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:19 +01:00
Eric Engestrom	547fd52a66	pick-ui: add `Backport-to: *` as a synonym to `Cc: mesa-stable` (cherry picked from commit `b2d99b9378`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:19 +01:00
Eric Engestrom	9a0d13be9a	bin/gen_release_notes: fix support for python 3.14 There is no default even loop anymore, we need to make one if we want one now. Cc: mesa-stable (cherry picked from commit `c7603a11de`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:19 +01:00
Eric Engestrom	e5fb4a0682	.pick_status.json: Update to `03d2cc2b2a` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:19 +01:00
Eric Engestrom	8794fced82	docs: add sha sum for 26.0.0 Some checks failed macOS-CI / macOS-CI (dri) (push) Has been cancelled Details macOS-CI / macOS-CI (xlib) (push) Has been cancelled Details	2026-02-11 19:19:12 +01:00
Eric Engestrom	c10cba7efa	VERSION: bump for 26.0.0	2026-02-11 19:07:29 +01:00
Eric Engestrom	e0f7bc0024	docs: add release notes for 26.0.0	2026-02-11 19:07:29 +01:00
Georg Lehmann	3062621cf6	aco/opt_postRA: don't optimize across calls Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Could do better by checking which registers are clobbered/preserved, but that's unlikely to be useful anyway. Backport-to: 26.0 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> (cherry picked from commit `fc7b5d7eed`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Georg Lehmann	33ca80ea38	aco: handle all SALU that modifies PC in needs_exec_mask Calls use swappc. Backport-to: 26.0 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> (cherry picked from commit `10b12a6ee2`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Georg Lehmann	d8acb10c56	aco/lower_branches: consider jump target of conditional branches based on vcc Cc: mesa-stable Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> (cherry picked from commit `421a4dacf0`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Karol Herbst	acdbdcc53b	vtn: set default fp_math_ctrl values for kernels The kernel capabilty has the `FPFastMathMode` decoration, but not the `FPFastMathDefault` execution mode, so a SPIR-V module not using `SPV_KHR_float_controls2` has no way of setting any defaults. Fixes: `9da2d21804` ("vtn: implement default fp_math_ctrl without using execution mode") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Tested-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `faf3a93e8f`) [Eric: adjusted commit because of missing `46a617884e`, as suggested by the author at https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39790#note_3325830] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Dave Airlie	b53dbb573a	gallivm: handle u16 correct on const loads. I somehow screwed this up on my previous attempt at fixing this bug, This should fix the loop limiter bug on big endian properly. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Cc: mesa-stable Fixes: `e28cfb2bad` ("gallivm: handle u8/u16 const loads properly on big-endian.") (cherry picked from commit `c016346b50`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Eric R. Smith	160efe917e	mesa: do not unbind general point when different indexed points are deleted When a buffer is deleted, we have to remove it from all binding points. We were re-using the code for BindBufferRange for this; however, this caused the general binding point to be unbound (bound to NULL) unconditionally, even if a different buffer is bound there. Fix this by inlining the various bind calls into the delete buffers code. cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14755 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `fa418f1e73`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Samuel Pitoiset	5f6d1e4b44	radv/meta: fix CmdCopyBufferToImage2() on compute queue with compressed HTILE Only for partial copies because image stores don't decompress on writes (ie. HTILE isn't updated by image stores). Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `9f5a20abde`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00

1 2 3 4 5 ...

217739 commits