fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 04:58:05 +02:00

Author	SHA1	Message	Date
Lars-Ivar Hesselberg Simonsen	9f049032be	pan/genxml: Print shader hex in trace for Valhall Enable verbose disassembly for Valhall in traces, which adds hex values to shader printing. Useful for debugging. For example: Shader 0xffffbe3ec000 (GPU VA ffdd3000) sz 16384 LD_ATTR_IMM.v4.f32.slot0.wait0 @r0:r1:r2:r3, r60^, r61^, index:0x0, table:0x0 FRCP.f32 r3, r3^ FMAX.f32 r3, r3^, u6 vs Shader 0xffffa8bf7000 (GPU VA ffdd3000) sz 16384 7c 7d 00 32 08 80 66 08 LD_ATTR_IMM.v4.f32.slot0.wait0 @r0:r1:r2:r3, r60^, r61^, index:0x0, table:0x0 43 00 00 00 00 c3 9c 00 FRCP.f32 r3, r3^ 43 86 03 00 00 c3 a4 00 FMAX.f32 r3, r3^, u6 Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41062>	2026-04-22 08:31:00 +00:00
Samuel Pitoiset	a73fc90bcd	radv: fix GPU hangs with PS epilogs and secondaries properly The previous fix was incomplete because if the same graphics pipeline and the same PS epilog are rebind after vkCmdExecuteCommands(), the PS epilog state wouldn't be re-emitted, and it will use a wrong VA (in case both fragment shader user SGPRs aren't similar either). Resetting the PS epilog to NULL in the primary should prevent any issues, but this tracking still need to be improved because it caused two issues recently. Fixes: `1a00587c44` ("radv: fix a GPU hang with PS epilogs and secondary command buffers") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/15176 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41056>	2026-04-22 08:03:35 +00:00
Samuel Pitoiset	9d17a7bdb4	spirv,treewide: rework specialization constant Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details With SPV_KHR_constant_data, it's allowed to specialize array of constants. RustiCL changes are from Karol Herbst <kherbst@redhat.com>. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41046>	2026-04-22 06:57:55 +00:00
squidbus	7d023db5b2	kk: Enable VK_AMD_shader_image_load_store_lod This extension is already supported by the shader compiler, it just needs to be enabled. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41082>	2026-04-22 06:42:08 +00:00
Sagar Ghuge	12f81eaa88	anv: Enable dynamic stack ID control on Xe3+ Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This patch enables dynamic stack ID control on Xe3+. Programmed values are the recommended settings from the Bspec. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41066>	2026-04-22 01:48:19 +00:00
Sagar Ghuge	acecc0f1b3	intel/genxml: Update xml for dynamic stack ID control fields Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41066>	2026-04-22 01:48:18 +00:00
Timothy Arceri	5f37490855	glcpp: fix paste within macro function expansion Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Note the tests added in `89cd6df034` were wrong (confirmed in gcc) I've updated them to the expected outcome and enabled the paste test from `475222b022`. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13863 Fixes: `d5cd40343f` ("Expand macro arguments before performing argument substitution.") Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40062>	2026-04-21 23:53:19 +00:00
Timothy Arceri	35eda3f3e2	glcpp: update out of date comment Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40062>	2026-04-21 23:53:19 +00:00
Karol Herbst	4cd64165a3	nak/lower_cf: remove single src phis When lowering cf we go out of SSA which translates phis into reg intrinsics. However when converting them back to SSA, initially single source phis now have an undef source leading to increased register pressure on the NAK side. This also hinders copy propagation as it's not designed to handle sources through phis yet. Totals from 50621 (4.17% of 1212873) affected shaders: CodeSize: 1605273744 -> 1621029728 (+0.98%); split: -0.34%, +1.32% Number of GPRs: 4673586 -> 4067935 (-12.96%); split: -12.97%, +0.01% SLM Size: 263428 -> 258176 (-1.99%) Static cycle count: 2599838439 -> 2586392435 (-0.52%); split: -1.11%, +0.59% Spills to memory: 23512 -> 15527 (-33.96%) Fills from memory: 23512 -> 15527 (-33.96%) Spills to reg: 64590 -> 57328 (-11.24%); split: -13.83%, +2.58% Fills from reg: 55559 -> 44319 (-20.23%); split: -22.66%, +2.42% Max warps/SM: 1189396 -> 1347600 (+13.30%) Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41042>	2026-04-21 23:37:55 +00:00
Sagar Ghuge	620835926d	brw: Pass write back register for ray query messages For DG2 (Bspec 47937) has the same programming note as of Xe2+, "When this bit is set in the header, Trace Ray Message behaves like a Ray Query. This message requires a write-back message indicating RayQuery for all valid Rays (SIMD lanes) have completed." So this patch is just passing a write back destination register when we have ray query message. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41039>	2026-04-21 23:16:09 +00:00
José Roberto de Souza	64bc538f5e	intel/brw: Explicitly upcast UB to UW for SHR with vector immediates HW does not allow instructions with vector immediates to cross a GRF boundary if it has a stride. Under register pressure, the register allocator may place a temporary register across such a boundary. To resolve this, we now explicitly emit a MOV to upcast the UB payload into a UW VGRF. This ensures the SHR instruction operates on a dense, well-aligned region that satisfies hardware alignment constraints. Below is the portion of the shader exhibiting this issue: Native code for unnamed fragment shader GLSL6 (src_hash 0x9c84a007) (sha1 48745e7dae90d08f8a9bbe4dbf837de23440c841f0344e669cb8af9df79bce58) SIMD32 shader: 44 instructions. 0 loops. 354 cycles. 0:0 spills:fills, 2 sends, scheduled with mode latency-sensitive. Promoted 0 constants. GRF registers: 22. Non-SSA regs (after NIR): 11. Compacted 800 to 800 bytes (0%) mov(1) f1<1>UW g0.30<0,1,0>UW { align1 WE_all 1N }; mov(1) f1.1<1>UW g1.30<0,1,0>UW { align1 WE_all 1N I@1 }; mov(32) g2<2>UW g0.20<2,8,0>UW { align1 WE_all }; mov(32) g4<2>UW g0.21<2,8,0>UW { align1 WE_all }; mov(32) g8<2>UW g1.20<2,8,0>UW { align1 WE_all }; mov(32) g10<2>UW g1.21<2,8,0>UW { align1 WE_all }; mov(16) g12<4>UB g0.60<1,8,0>UB { align1 1H }; mov(16) g13<4>UB g1.60<1,8,0>UB { align1 2H }; add(32) g0<1>UW g2<16,8,2>UW 0x01000100V { align1 WE_all I@6 }; add(32) g1<1>UW g4<16,8,2>UW 0x01010000V { align1 WE_all I@6 }; add(32) g2<1>UW g8<16,8,2>UW 0x01000100V { align1 WE_all I@6 }; add(32) g3<1>UW g10<16,8,2>UW 0x01010000V { align1 WE_all I@6 }; shr(16) g4<1>UW g12<32,8,4>UB 0x76543210V { align1 1H I@6 }; mov(16) g14.32<4>UB g13<32,8,4>UB { align1 2H I@6 }; sync nop(1) null<0,1,0>UB { align1 WE_all 1N I@6 }; mov(16) g5<1>UW g0<16,8,2>UW { align1 1H }; sync nop(1) null<0,1,0>UB { align1 WE_all 1N I@6 }; mov(16) g0<1>UW g1<16,8,2>UW { align1 1H }; sync nop(1) null<0,1,0>UB { align1 WE_all 5N I@6 }; mov(16) g5.16<1>UW g2<16,8,2>UW { align1 2H }; sync nop(1) null<0,1,0>UB { align1 WE_all 5N I@6 }; mov(16) g0.16<1>UW g3<16,8,2>UW { align1 2H }; shr(16) g4.16<1>UW g14.32<32,8,4>UB 0x76543210V { align1 2H I@5 }; ERROR: Invalid register region for source 0. See special restrictions section. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40856>	2026-04-21 22:51:45 +00:00
Eric R. Smith	4ae192a3d9	glsl, spirv: Improve accuracy of asin() and acos() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The polynomial used for asin_expr() was suboptimal (and its source was not documented). A better approximation is found in the _Handbook_of_Mathematical_Functions_ by Abramowitz and Stegun, which is used in Nvidia's Cg toolkit. However, while this approximation gives a good absolute error bound, its relative error exceeds the 4096 ulp allowed by the Vulkan spec. Taking a page from the spirv implementation of asin(), we implement a piecewise approximation where a Taylor series is used for small values of \|x\|. This patch also harmonizes the GLSL and Vulkan implementations by moving the implementation to common code (nir_builder). Running tests on asin() with a grid of 64000 samples between 0.0 and +1.0, the original asin() at 32 bits has: ``` glsl spirv RMSE: 1.756451e-04 1.609091e-04 worst abs error: 3.904104e-04 at 0.937001 3.904104e-04 at 0.937001 worst ulp error: 11800 at 6.2499e-05 3826 at 0.841331 ``` whereas the new implementation has for both: ``` RMSE: 2.528056e-05 worst abs error: 4.962087e-05 at 0.451149 worst ulp error: 2379 at 0.215106 ``` Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40862>	2026-04-21 21:10:22 +00:00
Jordan Justen	fa784fffd0	brw: Don't set header_size at init since it will be re-set in later code Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Ref: `efcba73b49` ("brw: switch to new sampler payload description scheme") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41035>	2026-04-21 19:23:41 +00:00
José Roberto de Souza	26525ac7ae	anv: Move code to load color border to memory to a function Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41035>	2026-04-21 19:23:41 +00:00
José Roberto de Souza	83d75a0384	anv: Move init and finish of state pools to its own functions Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41035>	2026-04-21 19:23:41 +00:00
José Roberto de Souza	a4c22baeb4	anv: Move VMA heaps init and finish of vma heaps to anv_va.c Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41035>	2026-04-21 19:23:40 +00:00
José Roberto de Souza	32f3d6486c	anv: Change fill_inline_params() first parameter from struct GENX(COMPUTE_WALKER_BODY) to uint32_t * This will make this function more generic allowing us to use it for COMPUTE_WALKER_2. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41035>	2026-04-21 19:23:40 +00:00
Jesse Natalie	6f8656ec64	microsoft/compiler: Back-propagate interpolator modes from FS Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41067>	2026-04-21 18:31:31 +00:00
Erik Faye-Lund	c4287eaa04	gallium: delete leftovers of post-processing infrastructure Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This was removed, but driconfs and docs were left behind. Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41048>	2026-04-21 18:04:11 +00:00
Erik Faye-Lund	8259e06645	haiku: remove unfinished post-processing support This doesnt' work, because pp_init_fbos and pp_run aren't wired up and no filters ever gets enabled. But the post processing infrastructure has been removed, so let's just delete this code. This gives the code a chance of compiling! Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41048>	2026-04-21 18:04:11 +00:00
Eric Engestrom	4731fc588e	docs: add stub of vk_struct_type_cast.h for vk_util.h Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41029>	2026-04-21 17:29:04 +00:00
Samuel Pitoiset	ebf2797da2	vulkan,treewide: stop passing vk_device to vk_pipeline_robustness_state_fill() This will be helpful for RADV. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41029>	2026-04-21 17:29:04 +00:00
Samuel Pitoiset	b7a8b09b21	vulkan: pre-compute the default robustness state in the device Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41029>	2026-04-21 17:29:04 +00:00
Samuel Pitoiset	5828ebeb70	vulkan: refactor vk_pipeline_robustness_state_fill() slightly Overwrite the default device state only when requested. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41029>	2026-04-21 17:29:04 +00:00
Lionel Landwerlin	b0c17357db	intel/ci: update expectation for RPL This fails everywhere but CI only run this test on RPL. A CTS fix has been merged in main. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39451>	2026-04-21 16:29:14 +00:00
Lionel Landwerlin	eda83bc2b6	anv: add a pass to realign global loads on DX CBV resources CBV resources are supposed to be 256B aligned (D3D12_CONSTANT_BUFFER_DATA_PLACEMENT_ALIGNMENT). vkd3d-proton will puts CBV addresses in the push constant data and do global loads on them. Unfortunately those loads don't have a 256B alignment value on them. So when looking at what we can promote to HW push buffers, we can't consider them. This change introduces a detection pass for CBV resources (according to vkd3d-proton devs those are 64KiB in size) and realign the loads to be 256B aligned. This is only enabled on DX emulation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39451>	2026-04-21 16:29:14 +00:00
Lionel Landwerlin	bba428ce3f	anv: promote push constant pointers to push buffers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39451>	2026-04-21 16:29:14 +00:00
Lionel Landwerlin	0539f26065	brw: track push constants shader stats Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39451>	2026-04-21 16:29:14 +00:00
squidbus	f59734d5d3	kk: Use device limits for buffers and compute shared memory. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Metal provides these limits as properties of MTLDevice, which can be used instead of hardcoding them. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41077>	2026-04-21 14:11:08 +00:00
Valentine Burley	17d03d98c7	ci/zink/intel: Disable flaky TGL canvas_moire-v2 trace Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41083>	2026-04-21 15:37:19 +02:00
Lionel Landwerlin	b10be13434	ci/zink/intel: disable TGL demo-v2 trace Flaky trace, renders at the wrong resolution (32x32). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41083>	2026-04-21 15:41:28 +03:00
Rhys Perry	bddd8b36a6	aco: use RegisterDemand::operator[] more Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39690>	2026-04-21 11:16:26 +00:00
Rhys Perry	176b075129	aco: prefer spilling smaller temporaries if it finishes spilling fossil-db stats seemed less positive when updating process_block() too. fossil-db (navi31): Totals from 41 (0.05% of 84369) affected shaders: Instrs: 294758 -> 294694 (-0.02%); split: -0.11%, +0.09% CodeSize: 1566136 -> 1564392 (-0.11%); split: -0.21%, +0.10% SpillSGPRs: 2306 -> 2143 (-7.07%); split: -8.37%, +1.30% Latency: 3877251 -> 3868194 (-0.23%); split: -0.29%, +0.05% InvThroughput: 881747 -> 882352 (+0.07%); split: -0.01%, +0.08% SClause: 6498 -> 6494 (-0.06%); split: -0.09%, +0.03% Copies: 33582 -> 33900 (+0.95%); split: -0.23%, +1.18% Branches: 6799 -> 6801 (+0.03%) VALU: 192977 -> 192646 (-0.17%); split: -0.21%, +0.04% SALU: 28082 -> 28395 (+1.11%); split: -0.27%, +1.39% VOPD: 1939 -> 1959 (+1.03%); split: +1.19%, -0.15% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39690>	2026-04-21 11:16:26 +00:00
Rhys Perry	0ffbc30d7f	aco: refactor spiller to use spills_needed variable Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39690>	2026-04-21 11:16:26 +00:00
Samuel Pitoiset	e60b49a3f6	radv/ci: document more HIC regressions on NAVI10 addrlib support for HIC needs more bugfixes and AMD is aware. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40996>	2026-04-21 10:14:43 +00:00
Samuel Pitoiset	87e95c5e50	radv: advertise VK_EXT_host_image_copy by default on GFX10.3+ Latest addrlib supports SIMD (AVX2) and it's definitely fast enough to be used in production now. GFX10 is still not enabled by default due to some regressions from the addrlib bump, also still missing AVX for some formats. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40996>	2026-04-21 10:14:43 +00:00
Samuel Pitoiset	aea04d11b7	amd: allow addrlib to enable SIMD if possible The SIMD variants are way faster, the order of magnitude seems x10. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40996>	2026-04-21 10:14:42 +00:00
Caius-Moldovan-img	daeb52d38d	pco: Replace nir_shader_lower_instructions with nir_shader_*_pass Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Caius Moldovan <caius.moldovan@imgtec.com> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40390>	2026-04-21 09:17:28 +00:00
David Rosca	27dbe82800	ac/parse_ib: Fix printing enc recon VAs on VCN5 Fixes: `f8f80c3700` ("ac/parse_ib: Fix VCN address parsing") Reviewed-by: Benjamin Cheng <benjamin.cheng@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41025>	2026-04-21 08:09:09 +00:00
Samuel Pitoiset	1fc8683281	radv: allow depth+stencil formats with host image copy Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Not super useful but it's supported. The NAVI10 crashes are expected and they are due to a bug in addrlib. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41000>	2026-04-21 08:57:31 +02:00
Samuel Pitoiset	4de652c78b	radv: add depth+stencil formats support with host image copy Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41000>	2026-04-21 08:57:31 +02:00
Samuel Pitoiset	fd95195f45	ac/surface: add stencil-only support for host mem->surf copies It's needed to tweak the surface info and to adjust the base pointer. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41000>	2026-04-21 08:57:31 +02:00
Brandon Jones	d1dd65d425	nir/opt_algebraic: fix fabs optimization Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This fixes a regression found in blender's unit testing, which called fabs(-0.0) and invoked an NIR optimization that is was not valid for the parameter -0.0. IEEE 754 requires that abs clear the sign bit for the value -0.0. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41060>	2026-04-21 04:10:29 +00:00
jinmiliu	e5392e3d5f	mesa/st: Set protected content context flag based on pipe context attributes If the PIPE_CONTEXT_PROTECTED flag is set in the context attributes, propagate this by enabling GL_CONTEXT_FLAG_PROTECTED_CONTENT_BIT_EXT on the corresponding Mesa GL context. Signed-off-by: jinmiliu <jinming.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40998>	2026-04-21 03:14:35 +00:00
Sagar Ghuge	7a627fa8f3	anv: Fix Wa_14021821874, Wa_14018813551, Wa_14026600921 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details StackSizePerRay is the RTDispatchGlobals::AsyncStackSize and DisableRTGlobalsKnownValues is to interpret how many Max BVH levels we need to use. It's not relevant to Vulkan, since we have just 2 fixed BVH levels. Fixes: `cb423ee6` ("anv: Fix Wa_14021821874, Wa_14018813551, Wa_14026600921") Fixes: `c1a44e8d` ("anv: force StackIDControl value for Wa_14021821874") Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41012>	2026-04-21 01:38:34 +00:00
Alyssa Rosenzweig	6397ddd15d	gallium: Drop post-processing filters Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.co Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5448>	2026-04-20 22:58:39 +00:00
Alyssa Rosenzweig	168141fbac	gallium: Drop users of post-processing filters Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5448>	2026-04-20 22:58:39 +00:00
Alyssa Rosenzweig	fd46a48ccc	jay/ra: only use stride=4 temps Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details SIMD16: Totals from 56 (2.12% of 2647) affected shaders: Instrs: 541831 -> 542004 (+0.03%); split: -0.40%, +0.44% CodeSize: 8597680 -> 8597248 (-0.01%); split: -0.45%, +0.44% SIMD32: Totals: Instrs: 4858179 -> 4734713 (-2.54%); split: -2.78%, +0.24% CodeSize: 78651424 -> 76667440 (-2.52%); split: -2.76%, +0.24% Totals from 1108 (41.86% of 2647) affected shaders: Instrs: 4241312 -> 4117846 (-2.91%); split: -3.18%, +0.27% CodeSize: 68753152 -> 66769168 (-2.89%); split: -3.16%, +0.27% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:12 +00:00
Alyssa Rosenzweig	1f62da938b	jay/ra: drop memory copy reordering No shader-db changes, and no longer required for correctness. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:12 +00:00
Alyssa Rosenzweig	45845ea7f2	jay/ra: use accumulator for stride=4 swaps SIMD16: Totals: Instrs: 2767930 -> 2767190 (-0.03%) CodeSize: 44327408 -> 44312304 (-0.03%); split: -0.04%, +0.00% Totals from 142 (5.36% of 2647) affected shaders: Instrs: 658928 -> 658188 (-0.11%) CodeSize: 10514512 -> 10499408 (-0.14%); split: -0.16%, +0.01% SIMD32: Totals: Instrs: 4884039 -> 4858179 (-0.53%) CodeSize: 79079008 -> 78651424 (-0.54%); split: -0.54%, +0.00% Totals from 761 (28.75% of 2647) affected shaders: Instrs: 3803274 -> 3777414 (-0.68%) CodeSize: 61707728 -> 61280144 (-0.69%); split: -0.70%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:12 +00:00

1 2 3 4 5 ...

221484 commits