fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 16:08:04 +02:00

Author	SHA1	Message	Date
Tapani Pälli	e84938a428	iris: make sure to not mix compressed vs non-compressed This commit implements the following requirement: "Keep any UMD-recycling of compression-enabled/disabled memory separate." As additional info there are 2 related wa's for the issue: Wa_14018443005 Wa_18038669374 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34499> (cherry picked from commit `6d70ec449f`)	2025-04-22 19:40:49 +02:00
Tapani Pälli	940c2cbbb6	iris: force reallocate on eglCreateImage with GFX >= 20 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34499> (cherry picked from commit `c2a4657862`)	2025-04-22 19:40:48 +02:00
Ian Romanick	8e3cae7c78	elk/algebraic: Don't optimize float SEL.CMOD to MOV Floating point SEL.CMOD may flush denorms to zero. We don't have enough information at this point in compilation to know whether or not it is safe to remove that. Integer SEL or SEL without a conditional modifier is just a fancy MOV. Those are always safe to eliminate. See also `3f782cdd25`. Fixes: `fab92fa1cb` ("i965/fs: Optimize SEL with the same sources into a MOV.") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34192> (cherry picked from commit `e783930b10`)	2025-04-22 19:39:44 +02:00
Ian Romanick	9af068c5e0	elk/algebraic: Clear condition modifier on optimized SEL instruction The condition modifier on SEL means something completely different than it means on MOV. On MOV it means to modify the flags based on the value written to the destination. On SEL it means to compare the sources using that mode and pick the result (i.e., as min() or max()) without modifying the flags. The resulting MOV should not have a condition modifier for the same reason it (already) doesn't have a predicate. This bug was found by inspection, so I added a unit test. Fixes: `fab92fa1cb` ("i965/fs: Optimize SEL with the same sources into a MOV.") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34192> (cherry picked from commit `f4ede9c10a`)	2025-04-22 19:39:22 +02:00
Ian Romanick	ce96dcf1a6	brw/algebraic: Don't optimize float SEL.CMOD to MOV Floating point SEL.CMOD may flush denorms to zero. We don't have enough information at this point in compilation to know whether or not it is safe to remove that. Integer SEL or SEL without a conditional modifier is just a fancy MOV. Those are always safe to eliminate. See also `3f782cdd25`. Fixes: `fab92fa1cb` ("i965/fs: Optimize SEL with the same sources into a MOV.") No shader-db changes on any Intel platform. fossil-db: All Intel platforms had similar results. (Lunar Lake shown) Totals: Instrs: 209903490 -> 209903492 (+0.00%) Cycle count: 30546025224 -> 30546021980 (-0.00%); split: -0.00%, +0.00% Max live registers: 65516231 -> 65516235 (+0.00%) Totals from 2 (0.00% of 706657) affected shaders: Instrs: 3197 -> 3199 (+0.06%) Cycle count: 361650 -> 358406 (-0.90%); split: -10.05%, +9.15% Max live registers: 300 -> 304 (+1.33%) Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34192> (cherry picked from commit `6a19d8915f`)	2025-04-22 19:39:21 +02:00
Ian Romanick	055cbf9836	brw/algebraic: Clear condition modifier on optimized SEL instruction The condition modifier on SEL means something completely different than it means on MOV. On MOV it means to modify the flags based on the value written to the destination. On SEL it means to compare the sources using that mode and pick the result (i.e., as min() or max()) without modifying the flags. The resulting MOV should not have a condition modifier for the same reason it (already) doesn't have a predicate. This bug was found by inspection, so I added a unit test. No shader-db or shader-db changes on any Intel platform. Fixes: `fab92fa1cb` ("i965/fs: Optimize SEL with the same sources into a MOV.") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34192> (cherry picked from commit `07dc1d4043`)	2025-04-22 19:38:28 +02:00
Mel Henning	006af589ee	nvk: Override render enable for blits and resolves Fixes cts tests: dEQP-VK.conditional_rendering.conditional_ignore.blit_image dEQP-VK.conditional_rendering.conditional_ignore.blit_image_inverted dEQP-VK.conditional_rendering.conditional_ignore.resolve_image dEQP-VK.conditional_rendering.conditional_ignore.resolve_image_inverted which were introduced in vk-gl-cts commit 4aa277c300 Fixes: `32f2317223` ("nvk: Use meta for doing blits with the 3D hardware") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34644> (cherry picked from commit `2fc4c98aaf`)	2025-04-22 19:37:33 +02:00
Mel Henning	af61891fed	nvk: SET_STATISTICS_COUNTER at start of meta_begin Ideally, begin/end should be roughly symmetric - the initialization order should be the reverse of the teardown order. Fixes: `6f85e6b06b` ("nvk: Disable statistics around meta ops") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34644> (cherry picked from commit `52085f2a0e`)	2025-04-22 19:37:32 +02:00
Faith Ekstrand	5f36e5961e	nak/sm70: Fix the bit74_75_ar_mod assert It's used for src2, not src0. Fixes: `40422927dc` ("nak: Pass has_mod to all form of src2 requiring it") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33107> (cherry picked from commit `47fc468944`)	2025-04-22 19:37:11 +02:00
Faith Ekstrand	61b44913f5	nak/legalize: Take a RegFile in copy_alu_src_and_lower_fmod Otherwise, we'll screw up uniform GPRs. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33107> (cherry picked from commit `22a30bfa4f`)	2025-04-22 19:36:01 +02:00
Tomeu Vizoso	70ad887eda	etnaviv: Release screen->dummy_desc_reloc.bo We are currently trying to release twice the same dummy BO, while leaking the other one. Fixes: `bca5ef70a4` ("etnaviv: split dummy RT backing store from reloc") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34627> (cherry picked from commit `63251d43ae`)	2025-04-22 18:47:28 +02:00
Georg Lehmann	e6134c388d	nir/opt_algebraic: disable fsat(a + 1.0) opt if a can be NaN Foz-DB Navi21: Totals from 9 (0.01% of 79789) affected shaders: Instrs: 6782 -> 6796 (+0.21%); split: -0.03%, +0.24% CodeSize: 40020 -> 40108 (+0.22%); split: -0.04%, +0.26% Latency: 23764 -> 23758 (-0.03%) InvThroughput: 6424 -> 6431 (+0.11%); split: -0.08%, +0.19% SClause: 273 -> 275 (+0.73%) Copies: 338 -> 339 (+0.30%) VALU: 5138 -> 5147 (+0.18%); split: -0.06%, +0.23% SALU: 349 -> 350 (+0.29%) SMEM: 498 -> 500 (+0.40%) Fixes: `a4a3487aae` ("nir/opt_algebraic: optimize patterns from Skia") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34125> (cherry picked from commit `3e26fc4498`)	2025-04-22 18:47:27 +02:00
Yinjie Yao	c72a9e2795	gallium/pipe: Increase hevc max slice to 600 According to the spec, increase max supported slices of hevc to 600. Cc: mesa-stable Signed-off-by: Yinjie Yao <yinjie.yao@amd.com> Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34632> (cherry picked from commit `2b5ca87927`)	2025-04-22 18:47:26 +02:00
Eric Engestrom	cdd4f62e89	aco: help clang 20 do some additions and subtractions clang 20 complains: ../src/amd/compiler/aco_assembler.cpp:837:28: error: writing 1 byte into a region of size 0 [-Werror=stringop-overflow=] 837 \| vaddr[num_vaddr + i] = reg(ctx, instr->operands.back(), 8) + i + 1; \| ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/amd/compiler/aco_assembler.cpp:832:12: note: at offset 5 into destination object ‘vaddr’ of size 5 832 \| uint8_t vaddr[5] = {0, 0, 0, 0, 0}; \| ^~~~~ ../src/amd/compiler/aco_assembler.cpp:837:28: error: writing 1 byte into a region of size 0 [-Werror=stringop-overflow=] 837 \| vaddr[num_vaddr + i] = reg(ctx, instr->operands.back(), 8) + i + 1; \| ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/amd/compiler/aco_assembler.cpp:832:12: note: at offset 6 into destination object ‘vaddr’ of size 5 832 \| uint8_t vaddr[5] = {0, 0, 0, 0, 0}; \| ^~~~~ ../src/amd/compiler/aco_assembler.cpp:837:28: error: writing 1 byte into a region of size 0 [-Werror=stringop-overflow=] 837 \| vaddr[num_vaddr + i] = reg(ctx, instr->operands.back(), 8) + i + 1; \| ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/amd/compiler/aco_assembler.cpp:832:12: note: at offset 7 into destination object ‘vaddr’ of size 5 832 \| uint8_t vaddr[5] = {0, 0, 0, 0, 0}; \| ^~~~~ But `i < MIN2(instr->operands.back().size() - 1, 5 - num_vaddr)` means `i` is at most `5 - num_vaddr - 1`, which means `vaddr[num_vaddr + i]` => `vaddr[num_vaddr + 5 - num_vaddr - 1]` => `vaddr[5 - 1]` => `vaddr[4]` which is within the valid indices. For some reason, using signed `int` instead allows clang to figure this out, so let's do that since we don't need the extra range. While at it, use ARRAY_SIZE(vaddr) instead of hard-coding the same `5` in several places. Backport-to: 25.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34625> (cherry picked from commit `2bcb55f3f6`)	2025-04-22 18:47:18 +02:00
Marek Olšák	fec9695e67	radv: fix incorrect patch_outputs_read for TCS with dynamic state Fixes: `8c2f9f0665` - radv: switch to the new TCS LDS/offchip size computation Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544> (cherry picked from commit `4a51089f30`)	2025-04-22 18:47:17 +02:00
Rhys Perry	65a50ce376	aco: combine VALU lanemask hazard into VALUMaskWriteHazard This is now basically the same as the original VALUMaskWriteHazard, except it now considers both VALU and SALU writes. Now that it's a part of VALUMaskWriteHazard, differences from the original VALU lanemask workaround are: - it includes SALU reads after the write - it includes VALU writes and SALU/VALU reads after the write which are not lanemasks - it combines s_waitcnt_depctr instructions when it's a read after both a SALU write and a VALU write - non-exec VALU SGPR reads reset the SGPRs read by VALU as a lanemask - exec SGPRs are ignored resolve_all_gfx11() is also finished. fossil-db (navi31): Totals from 21538 (27.13% of 79377) affected shaders: Instrs: 27628855 -> 27552972 (-0.27%); split: -0.30%, +0.03% CodeSize: 145968448 -> 145667616 (-0.21%); split: -0.23%, +0.02% Latency: 209537805 -> 209509519 (-0.01%); split: -0.02%, +0.00% InvThroughput: 36304270 -> 36301624 (-0.01%); split: -0.01%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12623 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11480 Backport-to: 25.0 Backport-to: 25.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34529> (cherry picked from commit `ce2be5ab8e`)	2025-04-22 18:47:10 +02:00
Rhys Perry	2ff09ffbda	aco/gfx12: don't use second VALU for VOPD's OPX if there is a WaR fossil-db (gfx1201): Totals from 38908 (49.02% of 79377) affected shaders: Instrs: 30268107 -> 30268131 (+0.00%); split: -0.00%, +0.00% CodeSize: 180843648 -> 180843640 (-0.00%); split: -0.00%, +0.00% Latency: 224905962 -> 224906072 (+0.00%); split: -0.00%, +0.00% InvThroughput: 44322988 -> 44323004 (+0.00%) VALU: 15124145 -> 15124167 (+0.00%) VOPD: 4018504 -> 4018482 (-0.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 25.0 Backport-to: 25.1 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246> (cherry picked from commit `408fa33c09`)	2025-04-22 18:47:09 +02:00
Patrick Lerda	a153a481cc	mesa_interface: fix legacy dri2 compatibility These values are shared with xcb/dri2.h, and can't be changed without breaking the legacy dri2 compatibility. This change reverses partially the update done by `3b603d1646`. For instance this issue is triggered on dri2 i915 with "piglit/bin/glx-copy-sub-buffer -auto" or "piglit/bin/hiz-depth-read-window-stencil0 -auto". Fixes: `3b603d1646` ("mesa_interface: remove unused stuff") Signed-off-by: Patrick Lerda <patrick9876@free.fr> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34561> (cherry picked from commit `60a31156b0`)	2025-04-22 18:47:02 +02:00
Mike Blumenkrantz	8a4f7476d7	zink: verify that surface exists when adding implicit feedback loop this can be null if multiple contexts are in use cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34557> (cherry picked from commit `de6efc01c1`)	2025-04-22 18:47:00 +02:00
Eric Engestrom	45aa964eb8	pick-ui: make `Backport-to: 25.0` backport to 25.0 and more recent release branches It is what developers expect, so make the code match it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34580> (cherry picked from commit `c37a468a8a`)	2025-04-22 18:46:38 +02:00
Eric Engestrom	35d5005925	.pick_status.json: Update to `5f3a3740dc`	2025-04-22 18:46:36 +02:00
Eric Engestrom	310da5f30b	docs: add sha sum for 25.0.4 Some checks failed macOS-CI / macOS-CI (dri) (push) Has been cancelled Details macOS-CI / macOS-CI (xlib) (push) Has been cancelled Details	2025-04-17 02:22:01 +02:00
Eric Engestrom	d0f8720019	VERSION: bump for 25.0.4	2025-04-17 02:04:03 +02:00
Eric Engestrom	bd6a277901	docs: add release notes for 25.0.4	2025-04-17 02:04:03 +02:00
Pierre-Eric Pelloux-Prayer	4437cdabf0	winsys/amdgpu: disable VM_ALWAYS_VALID The referenced commit has been identified as the root cause of graphic artifacts / hangs on some APUs. For now disable AMDGPU_GEM_CREATE_VM_ALWAYS_VALID on all chips except when user queues are used. See https://gitlab.freedesktop.org/mesa/mesa/-/issues/12809. Fixes: `8c91624614` ("winsys/amdgpu: use VM_ALWAYS_VALID for all VRAM and GTT allocations") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34547> (cherry picked from commit `555821ff93`)	2025-04-17 01:24:17 +02:00
David Rosca	0e9f94576f	radeonsi/vpe: Use float division to get scaling ratio Fixes: `e85a6b6a63` ("radeonsi/vpe: check reduction ratio") Reviewed-by: Peyton Lee <peytolee@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34519> (cherry picked from commit `bd6f9e8aee`)	2025-04-17 01:24:17 +02:00
Marek Olšák	ba2a1ba2e5	ac/surface: select 3D tile mode without overallocating too much for gfx6-8 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12466 Fixes: `c87ce78d` - ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432> (cherry picked from commit `78cacfd9ce`)	2025-04-17 01:24:17 +02:00
Marek Olšák	48bfe6dbfd	ac/surface: make gfx12_estimate_size reusable by gfx6 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12466 Fixes: `c87ce78d` - ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432> (cherry picked from commit `195e7b4f75`)	2025-04-17 01:24:16 +02:00
Ryan Mckeever	651c53fc1f	pan/format: Update format flags to follow HW spec Fixes: `861e7dca` ("panfrost: Switch formats to table") Signed-off-by: Ryan Mckeever <ryan.mckeever@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33787> (cherry picked from commit `b9a9798c46`)	2025-04-16 15:52:03 +02:00
Eric Engestrom	9cbca28609	.pick_status.json: Update to `555821ff93`	2025-04-16 15:50:33 +02:00
Kenneth Graunke	bb83fd7ac0	brw: Don't assert about MAX_VGRF_SIZE in brw_opt_split_virtual_grfs() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This allows us to create temporary VGRFs that are larger than MAX_VGRF_SIZE(devinfo), which will be split eventually. They may not be split on the initial pass, because we may need LOAD_PAYLOAD lowering, copy propagation, and so on to occur first. So we allow registers to exceed that size initially. The "Register allocation relies on split_virtual_grfs()" assertion in brw_reg_allocate.cpp still asserts that all VGRFs which reach the register allocator have been properly split. One case where this is useful is for vectorizing convergent block loads. We create temporaries to splat the SIMD1 values out to SIMD(N), which can lead to some very large temporaries. However, copy propagation and so on ultimately eliminate these and they'll get split down to proper sizes or elided entirely in the end. (Note: both this and the prior commits from this merge request are needed to close the linked issue.) Cc: mesa-stable Reviewed-by: Matt Turner <mattst88@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12324 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34461> (cherry picked from commit `eb1ec9cf8e`)	2025-04-16 15:37:06 +02:00
Kenneth Graunke	7a588a5a8e	brw: Use live->max_vgrf_size in pre-RA scheduling Post-RA scheduling doesn't use liveness analysis, so we continue using MAX_VGRF_SIZE(devinfo). But for pre-RA scheduling, we now use live->max_vgrf_size. This helps get us to a place where we can emit arbitrarily large VGRFs early on in compilation, but which will be split and cleaned up prior to register allocation. It may also allocate smaller arrays in practice since MAX_VGRF_SIZE(devinfo) assumes the worst case scenario for things we actually could need to allocate. Cc: mesa-stable Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34461> (cherry picked from commit `a45583f078`)	2025-04-16 15:37:06 +02:00
Kenneth Graunke	0d1e83ca6a	brw: Use live->max_vgrf_size in register coalescing We already require liveness, so just use the actual maximum size we saw instead of a hardcoded pessimal size. Cc: mesa-stable Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34461> (cherry picked from commit `4b27b5895c`)	2025-04-16 15:37:05 +02:00
Kenneth Graunke	c906f565b6	brw: Track the largest VGRF size in liveness analysis We're already looking at this data to calculate the per-component vars_from_vgrf[] and vgrf_from_vars[] mappings, so just record the largest VGRF size while we're here. This will allow passes to size arrays based on the actual size needed, rather than hardcoding some fixed size. In many cases, MAX_VGRF_SIZE(devinfo) is larger than necessary, because e.g. vec5 sparse sampling results aren't used. Not hardcoding this means we can also temporarily handle very large VGRFs which we know will be split eventually, without having to increase the maximum which is ultimately used for RA classes. Cc: mesa-stable Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34461> (cherry picked from commit `ea468412f6`)	2025-04-16 15:37:05 +02:00
Erik Faye-Lund	6c6c6873c4	panvk: claim official conformance on v10 It's official, PanVK is Vulkan 1.1 conformant on v10. Let's make this clear. Backport-to: 25.0 Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34500> (cherry picked from commit `65b7d2e865`)	2025-04-16 15:37:05 +02:00
Erik Faye-Lund	238399e93a	panvk: set shared_addr_format We need to set this, otherwise we end up failing tests. Fixes: `4e111c259c` ("panvk: Lower shared memory") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34514> (cherry picked from commit `e77a815299`)	2025-04-16 15:37:05 +02:00
Marek Olšák	1fe9f5d3ac	radeonsi: add ACO-specific main shader parts We can't have merged shaders where the first part is compiled using ACO and the second part is compiled using LLVM. Add ACO-specific main shader parts to fix that. This happens when ACO is enabled for gfx12 streamout where GS can be paired with a previous shader compiled by LLVM. Fixes: `8ba718fb7d` - radeonsi/gfx12: use ACO for streamout because it's faster Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34491> (cherry picked from commit `7f7d6deb18`)	2025-04-16 15:37:05 +02:00
Marek Olšák	15ea052c20	radeonsi: make si_shader_selector::main_shader_part_* an iterable union for the next commit Fixes: `8ba718fb7d` - radeonsi/gfx12: use ACO for streamout because it's faster Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34491> (cherry picked from commit `4865ac57cc`)	2025-04-16 15:37:05 +02:00
Jose Maria Casanova Crespo	9babb23138	v3dv: avoid TFU reading unmapped pages beyond the end of the buffers TFU units is doing a readahead of 64 bytes. This is causing invalid read MMU errors that can be observed at the nightly full Vulkan runs on Broadcom devices. 04:13:59.969: [ 85.623205] v3d 1002000000.v3d: MMU error from client TLB (3) at 0x4869000, pte invalid 04:14:05.408: [ 91.019321] v3d 1002000000.v3d: MMU error from client TLB (3) at 0x5209000, pte invalid 04:14:05.413: [ 91.031662] v3d 1002000000.v3d: MMU error from client TLB (3) at 0x7521000, pte invalid Although the log reports the TLB the real culprit is the TFU. A fix to the kernel was submitted to fix AXI ID on V3D 4.2 and 7.1 So doing an over-allocation of 64-bytes at v3dv_AllocateMemory is the simplest method to make these MMU errors itp disapear. Running ./deqp-vk for an hour, we can see that ~%40 of allocations would need an extra page (4096 bytes) to accomodate this 64 bytes padding. Fixes: `ca330f7f04` ("v3dv: implement VK_EXT_memory_budget") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34475> (cherry picked from commit `0bcb82048c`)	2025-04-16 15:37:04 +02:00
Mike Blumenkrantz	31e9893f64	zink: stop setting ArrayStride on image arrays this is illegal cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33651> (cherry picked from commit `b4e3535650`)	2025-04-16 15:37:04 +02:00
Mike Blumenkrantz	0f3b6ba7ad	zink: don't set shared block stride without KHR_workgroup_memory_explicit_layout this is illegal cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33651> (cherry picked from commit `1c0de360bc`)	2025-04-16 15:37:04 +02:00
Eric R. Smith	5a685929d3	panfrost: fix transaction elimination crc valid calculation The setting of the clean_pixel_write_enable flag in pan_prepare_rt was not consistent with the crc valid calculations in pan_emit_fbd. This caused the crc_valid flag to not be accurate, causing transaction elimination to fail. Fixes: `eac8f1d460` ("Revert "panfrost: Disable CRC by default"") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34408> (cherry picked from commit `69a6db4b2b`)	2025-04-16 15:37:04 +02:00
Erik Faye-Lund	27342a5532	nir/lower_tex: use texture_mask instead of shifting on use In commit `292ac71a4a` ("nir/lower_tex: handle deref casts"), we avoided using texture_index when a texture instruction contained a variable deref. There's no good reason why this should be done to some of the lowering, but not all. So let's fix up code-paths that were added after this change to do the same. The first two patches here crossed paths with the commit that introduced texture_mask, so it's not strange that the change was missed. The last one seems to have just copied what was done around it, propagating the issue. Fixes: `880b00dc59` ("nir/lower_tex: Add support for lowering YUYV formats") Fixes: `1358d93650` ("nir/lower_tex: Add support for lowering Y41x formats") Fixes: `65d6f5aed2` ("nir: add options to lower y_vu, yv_yu, yx_xvxu and xy_vxux") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34365> (cherry picked from commit `41b136f674`)	2025-04-16 15:37:04 +02:00
Faith Ekstrand	5d6c82000c	nil: Multiply by array_stride_B instead of adding Fixes: `5577128c83` ("nil: Rewrite the TIC code in Rust") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34495> (cherry picked from commit `fadac25b0c`)	2025-04-16 15:37:04 +02:00
Faith Ekstrand	ea963009f0	nvk/nvkmd: Check the correct flag for the Kepler GART workaround Fixes: `1db57bb414` ("nvk/nvkmd: Rework memory placement flags") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34495> (cherry picked from commit `5c81b3546f`)	2025-04-16 15:37:04 +02:00
Caio Oliveira	aedb7eb700	nir/load_store_vectorize: Skip new bit-sizes that are unaligned with high_offset Otherwise this would require combining two values to produce a single (new bit-size) channel, which vectorize_stores() don't handle. The pass can still keep trying smaller bit-sizes. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12946 Fixes: `ce9205c03b` ("nir: add a load/store vectorization pass") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34414> (cherry picked from commit `2ed79f80ba`)	2025-04-16 15:37:03 +02:00
David Rosca	8ffedebf1c	radv/video: Fix encode session info for VCN3+ Last dword should be 0. Cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34449> (cherry picked from commit `7249d9548e`)	2025-04-16 15:37:03 +02:00
David Rosca	15b2a440da	radv/video: Fix msg header total size It needs to include also codec msg size. Cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34449> (cherry picked from commit `34031531fc`)	2025-04-16 15:37:03 +02:00
Erik Faye-Lund	b839ea42bf	panfrost: fixup typo in 16x sample-pattern This is an n-queen pattern, where no two values should be on the same row or column. But this and the second to last element has the same y component, and neither has the negative one. Let's fix this up by setting the first value to the negative value. This matches the D3D 16x sample pattern. Fixes: `a61fb62966` ("panfrost: Upload sample positions on device init") Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925> (cherry picked from commit `b4ebffa1aa`)	2025-04-16 15:37:03 +02:00
Lionel Landwerlin	f018626745	brw: fix Wa_22013689345 emission 2 problems : - not detecting null destination correctly - applied too late using SHADER_OPCODE_MEMORY_FENCE, when lowering already happened Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34319> (cherry picked from commit `06ad9a25e5`)	2025-04-16 15:37:03 +02:00

1 2 3 4 5 ...

201608 commits