fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 04:08:10 +02:00

Author	SHA1	Message	Date
Rhys Perry	799685659d	aco/gfx115: consider point sample acceleration Like 15428e0d786939a5c7629a9978947c8a9112ce96 in LLVM. fossil-db (gfx1150): Totals from 909 (1.14% of 79653) affected shaders: Instrs: 5840489 -> 5840705 (+0.00%); split: -0.00%, +0.00% CodeSize: 31133460 -> 31134296 (+0.00%); split: -0.00%, +0.00% Latency: 52982280 -> 53438577 (+0.86%); split: -0.00%, +0.86% InvThroughput: 10841454 -> 10942682 (+0.93%); split: -0.00%, +0.93% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 25.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34935> (cherry picked from commit `171920ceed`)	2025-05-20 20:18:07 +02:00
Samuel Pitoiset	e81572403c	radv: remove the optimization for equal immutable samplers This optimization used to optimize the allocated space for descriptors when immutable samplers are equal. Though, this was basically broken : - descriptor copies were broken for combiner image sampler (or sampler) with equal immutable samplers because 96 bytes were copied instead of 64 bytes (cf. the linked ticket). This could be fixed but it's not worth it. - the value returned by vkGetDescriptorLayoutSupport() was broken, it should have been 96 with no immutable samplers (or when they aren't equal) This optimization was also not applied for descriptor buffers which is the default for vkd3d-proton and Zink. DXVK doesn't use db but it doesn't use immutable samplers, so basically only native vulkan games would be concerned. Note that immutable samplers would still be inlined in shaders if no indirect access which should be 99.9% of the usecase. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11165 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34928> (cherry picked from commit `69ff204422`)	2025-05-20 20:18:06 +02:00
Samuel Pitoiset	1512a1cdd7	radv: fix emitting dynamic viewports/scissors when the count is static In a scenario where the viewports/scissors are a dynamic state but the count is static (ie. updated when a graphics pipeline is bound), the driver wasn't considering that and it was re-emitting the previous number of viewports/scissors. This fixes rendering issue with Blender. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13127 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34921> (cherry picked from commit `9a07ccbc89`)	2025-05-20 20:18:06 +02:00
David Rosca	22826ec621	radv/video: Use ac_uvd_alloc_stream_handle ac_uvd_alloc_stream_handle tries to avoid collisions in the case when PID is not unique (eg. in sandboxes like Flatpak). Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12607 Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34807> (cherry picked from commit `5fee04bcae`)	2025-05-20 20:18:06 +02:00
David Rosca	4cfaede767	ac/uvd: Add ac_uvd_alloc_stream_handle Cc: mesa-stable Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34807> (cherry picked from commit `69455e8208`)	2025-05-20 20:18:06 +02:00
Natalie Vock	42519ff23a	radv,driconf: Add radv_force_64k_sparse_alignment config Needed by DOOM: The Dark Ages. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34944> (cherry picked from commit `e32a90b57c`)	2025-05-20 20:18:06 +02:00
Samuel Pitoiset	6a1a256578	radv: fix SDMA copies for linear 96-bits formats The hardware requires a power of two bpe. To do that, the driver needs to adjust the pitch/offset/extent based on a texel scale factor which only applies to 96-bits formats. This fixes new VKCTS coverage. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34927> (cherry picked from commit `4b73d7e817`)	2025-05-20 20:18:06 +02:00
Rhys Perry	55241615c6	ac/llvm: correctly set alignment of vector global load/store For coherent/volatile access, this would be too high for vector access. Even when we didn't set the alignment, LLVM seemed to assume too high of an alignment for 8/16-bit vector access. Fixes generated_tests/cl/vload/vload-char-constant.cl Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Michel Dänzer <mdaenzer@redhat.com> Backport-to: 25.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34903> (cherry picked from commit `d0a09b6ff7`)	2025-05-20 20:18:05 +02:00
Rhys Perry	98f96feda8	ac/llvm: correctly split vector 8/16-bit stores This assumes that the start of the load is 32-bit aligned. For example, a vec3 16-bit store with align_offset=2 should split off the first component, not the last. This probably also fixed splitting with 8-bit stores. Fixes arb_copy_buffer-overlap Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Michel Dänzer <mdaenzer@redhat.com> Backport-to: 25.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34903> (cherry picked from commit `c1ecad2b11`)	2025-05-20 20:18:05 +02:00
Samuel Pitoiset	25c188a743	radv: ignore conditional rendering with vkCmdTraceRays* CmdTraceRays is neither a dispatch or a draw command which means it shouldn't be affected by conditional rendering. Fixes recent VKCTS coverage. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34868> (cherry picked from commit `4b76d04f7f`)	2025-05-20 20:18:05 +02:00
Samuel Pitoiset	b1c7064a68	radv: ignore radv_disable_dcc_stores on GFX12 It's not necessary because DCC is completely transparent to the userspace driver. Also it's causing issues with scanout. This fixes rendering issues with scanout in Indiana Jones. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12924 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34859> (cherry picked from commit `b7d2cdd2b4`)	2025-05-20 20:18:04 +02:00
Konstantin Seurer	8b8ca028a0	radv: Return VK_ERROR_INCOMPATIBLE_DRIVER for unsupported devices VK_ERROR_INITIALIZATION_FAILED will fail physical device enumeration. Returning VK_ERROR_INCOMPATIBLE_DRIVER means that the driver can still be used on supported GPUs when multiple GPUs are installed. cc: mesa-stable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34783> (cherry picked from commit `84b9c281fe`)	2025-05-07 09:04:49 +02:00
Rhys Perry	77e7fd0dee	aco: swap the correct v_mov_b32 if there are two of them Previously, this function tried to swap the instruction which is not v_mov_b32, so that it doesn't introduce any new OPY-only instructions. If both were v_mov_b32, it swapped Y. Since this makes Y opy-only, this can't be done if X is also opy-only. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `408fa33c09` ("aco/gfx12: don't use second VALU for VOPD's OPX if there is a WaR") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13101 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34841> (cherry picked from commit `9ca71b52aa`)	2025-05-07 09:04:39 +02:00
Samuel Pitoiset	867fb6756b	radv: fix GPU hangs with image copies for ASTC/ETC2 formats on transfer queue Emitting compute dispatches on SDMA just hangs. It might be needed to switch to gang submit for these to work but fixing the GPU hang is more important for now. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34805> (cherry picked from commit `0684dc5fa8`)	2025-05-06 17:24:03 +02:00
Samuel Pitoiset	963b9fc2f3	radv: disable SINGLE clear codes to workaround a hw bug with DCC on GFX11 This fixes a very weird cache-related corruption with DCC on GFX11 due to a hw bug according to PAL. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12932 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34790> (cherry picked from commit `1356d20042`)	2025-05-06 17:24:02 +02:00
Samuel Pitoiset	49d96917d5	radv: do not clear unwritten color attachments with dual-source blending This is incorrect because the color format at slot 0 needs to be replicated to the slot 1. But with dual-source blending the colors written mask is only 0xf and this was clearing the color format at slot 1. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13082 Fixes: `e1483d022b` ("radv: clear unwritten color attachments for monolithic PS earlier") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34773> (cherry picked from commit `55ad0fd35c`)	2025-05-06 17:24:01 +02:00
Paul Gofman	234d66a1e2	radv/amdgpu: Fix hash key in radv_amdgpu_winsys_destroy(). Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34774> (cherry picked from commit `96765935e8`)	2025-05-03 12:48:02 +02:00
Samuel Pitoiset	03dc23baa2	radv: fix re-emitting VRS state when rendering begins This state also depends on whether a VRS attachment is used. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11693 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34735> (cherry picked from commit `1fccc09abe`)	2025-04-30 14:15:54 +02:00
Rhys Perry	87902dca71	aco: fix get_temp_reg_changes with clobbered operands The spiller might have tried to spill a live-through first or second s_fmac_f32 operand, but this wouldn't have reduced the SGPRs if the third operand wasn't killed Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13038 Fixes: `d6cb45dbb0` ("aco/spill: Allow spilling live-through operands") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34699> (cherry picked from commit `7fe84024cb`)	2025-04-30 14:15:47 +02:00
Rhys Perry	e1f06788f5	aco/gfx11: create waitcnt for workgroup vmem barriers It seems this is necessary on GFX11. Similar to `576a2e798c` Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Backport-to: 25.0 Backport-to: 25.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34634> (cherry picked from commit `b03e071583`)	2025-04-27 11:45:27 +02:00
Timur Kristóf	5c9733618d	radv: Clear dirty flag for clip rects state after emitting it. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Tested-by: Marcus Seyfarth <m.seyfarth@gmail.com> Fixes: `0ba3a8b3cc` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34686> (cherry picked from commit `3ad385b9cc`)	2025-04-27 11:45:24 +02:00
Timur Kristóf	d18a3d5f09	radv: Clear dirty flag for MSAA state after emitting it. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Tested-by: Marcus Seyfarth <m.seyfarth@gmail.com> Fixes: `08918f0880` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13022 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34686> (cherry picked from commit `3a05477ac6`)	2025-04-27 11:45:23 +02:00
Georg Lehmann	3d9ac270e2	aco/insert_exec: reset temporary when recreating wqm mask from exact mask Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The old, now incorrect temporary was still used for invert blocks and loop masks. Foz-DB Navi31: Totals from 379 (0.48% of 79789) affected shaders: Instrs: 399471 -> 399897 (+0.11%); split: -0.00%, +0.11% CodeSize: 2197292 -> 2198908 (+0.07%); split: -0.00%, +0.08% Latency: 2500636 -> 2500895 (+0.01%); split: -0.00%, +0.01% SClause: 7912 -> 7918 (+0.08%); split: -0.04%, +0.11% Copies: 25687 -> 26068 (+1.48%); split: -0.04%, +1.53% PreSGPRs: 15648 -> 15562 (-0.55%) SALU: 35125 -> 35517 (+1.12%) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12901 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13019 Fixes: `b872ff6ef2` ("aco/insert_exec_mask: if applicable, use s_wqm to restore exec after divergent CF") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34659> (cherry picked from commit `dd3e1190a2`)	2025-04-23 12:21:56 +02:00
Georg Lehmann	4fb4880183	aco/insert_exec: only restore wqm mask after control flow if necessary The next commit will make this not free, so we should avoid it if possible. Foz-DB Navi31: Totals from 3933 (4.93% of 79789) affected shaders: Instrs: 5726914 -> 5727295 (+0.01%); split: -0.00%, +0.01% CodeSize: 31307100 -> 31308884 (+0.01%); split: -0.00%, +0.01% SpillSGPRs: 1797 -> 1793 (-0.22%); split: -0.33%, +0.11% Latency: 58973929 -> 58974343 (+0.00%); split: -0.00%, +0.00% InvThroughput: 8591893 -> 8591911 (+0.00%); split: -0.00%, +0.00% SClause: 209074 -> 209115 (+0.02%); split: -0.00%, +0.02% Copies: 423965 -> 432420 (+1.99%) Branches: 149976 -> 149979 (+0.00%); split: -0.00%, +0.00% PreSGPRs: 200175 -> 200663 (+0.24%) VALU: 3440165 -> 3440156 (-0.00%); split: -0.00%, +0.00% SALU: 555727 -> 556143 (+0.07%); split: -0.00%, +0.08% Fixes: `b872ff6ef2` ("aco/insert_exec_mask: if applicable, use s_wqm to restore exec after divergent CF") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34659> (cherry picked from commit `13f6be262a`)	2025-04-23 12:21:56 +02:00
Georg Lehmann	d3285fe971	aco: set opsel_hi to 1 for WMMA This is ignored by the hardware but LLVM requires it to disassemble GFX12 WMMA. Cc: mesa-stable Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34396> (cherry picked from commit `b0c8f31600`)	2025-04-23 12:21:56 +02:00
Marek Olšák	39e4fe7ab4	radv: fix incorrect patch_outputs_read for TCS with dynamic state Fixes: `8c2f9f0665` - radv: switch to the new TCS LDS/offchip size computation Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544> (cherry picked from commit `4a51089f30`)	2025-04-22 01:25:00 +02:00
Rhys Perry	76db8496a9	aco: combine VALU lanemask hazard into VALUMaskWriteHazard This is now basically the same as the original VALUMaskWriteHazard, except it now considers both VALU and SALU writes. Now that it's a part of VALUMaskWriteHazard, differences from the original VALU lanemask workaround are: - it includes SALU reads after the write - it includes VALU writes and SALU/VALU reads after the write which are not lanemasks - it combines s_waitcnt_depctr instructions when it's a read after both a SALU write and a VALU write - non-exec VALU SGPR reads reset the SGPRs read by VALU as a lanemask - exec SGPRs are ignored resolve_all_gfx11() is also finished. fossil-db (navi31): Totals from 21538 (27.13% of 79377) affected shaders: Instrs: 27628855 -> 27552972 (-0.27%); split: -0.30%, +0.03% CodeSize: 145968448 -> 145667616 (-0.21%); split: -0.23%, +0.02% Latency: 209537805 -> 209509519 (-0.01%); split: -0.02%, +0.00% InvThroughput: 36304270 -> 36301624 (-0.01%); split: -0.01%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12623 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11480 Backport-to: 25.0 Backport-to: 25.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34529> (cherry picked from commit `ce2be5ab8e`)	2025-04-22 01:24:39 +02:00
Rhys Perry	dd304bfd80	aco/gfx12: don't use second VALU for VOPD's OPX if there is a WaR fossil-db (gfx1201): Totals from 38908 (49.02% of 79377) affected shaders: Instrs: 30268107 -> 30268131 (+0.00%); split: -0.00%, +0.00% CodeSize: 180843648 -> 180843640 (-0.00%); split: -0.00%, +0.00% Latency: 224905962 -> 224906072 (+0.00%); split: -0.00%, +0.00% InvThroughput: 44322988 -> 44323004 (+0.00%) VALU: 15124145 -> 15124167 (+0.00%) VOPD: 4018504 -> 4018482 (-0.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 25.0 Backport-to: 25.1 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246> (cherry picked from commit `408fa33c09`)	2025-04-22 01:24:31 +02:00
Samuel Pitoiset	b4940255ed	radv/sdma: add support for compression on GFX12 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Similar to previous generations that support compression, except that the driver don't need to configure a meta VA because DCC is completely transparent to the userspace. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	efa0b16bb2	radv/sdma: add a new flag to know if the surface is compressed On GFX12, DCC is transparent to the driver and there is no meta VA. Adding a new flag to know if the SDMA surface is compressed is needed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	03671ccf9e	radv/sdma: use the correct helper to get the number type field This wasn't technically incorrect because V_028C70_BU_NUM_xxx values are similar to V_028C70_NUMBER_xxx but it's better to use the corect helper. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	b44dc98cde	radv/sdma: remove redundant check for compression when getting metadata It's already checked by the caller. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	d3d5d2fe86	radv/sdma: use SDMA5_DCC_xxx bitfields It's cleaner. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	f44342199a	radv/sdma: simplify configuring the number of uncompressed DCC blocks SDMA doesn't support MSAA, so the value can be V_028C78_MAX_BLOCK_SIZE_256B. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	13db408e59	ac/perfcounter: add support for GFX12 Sourced from PAL to add SPM support. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34524>	2025-04-16 06:35:33 +00:00
Samuel Pitoiset	c42d43e8eb	radv: print more error messages during SPM initialization Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34524>	2025-04-16 06:35:33 +00:00
Marek Olšák	78cacfd9ce	ac/surface: select 3D tile mode without overallocating too much for gfx6-8 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12466 Fixes: `c87ce78d` - ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	195e7b4f75	ac/surface: make gfx12_estimate_size reusable by gfx6 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12466 Fixes: `c87ce78d` - ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	2c122d478b	ac/nir: set X=0 for task->mesh shader dispatch when Y or Z is 0 The code set X=0 when Y and Z is 0, not "or". Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	963147d7fd	ac/gpu_info: add 256 to payload_entry_size to increase future task shader perf It has no effect because num_entries is 1K, but the table shows a lot of potential. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	d7c903f258	ac/gpu_info: add payload_entry_size into ac_task_info to stop causing full RADV recompiles when it's changed. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	0dafd04695	ac/gpu_info: remove has_tmz_support function It's not needed since: `8b3056343f` - ac/gpu_info: bump required DRM minor version to 3.42.0 (kernel 5.15+) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	0be5a3559a	ac/gpu_info: increase the attribute ring size for gfx12 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Eric Engestrom	54bcfb4c1f	ci/deqp: fix vulkan video build Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34532>	2025-04-15 17:23:05 +00:00
Samuel Pitoiset	e86e0fc525	radv: allocate the SPM BO in GTT for faster readback Reading VRAM from CPU is very slow. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34467>	2025-04-15 06:30:38 +00:00
Samuel Pitoiset	8ea46b14fa	ci: update VKCTS main to 76c1572eaba42d7ddd9bb8eb5788e52dd932068e RADV is the only driver using VKCTS main. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34299>	2025-04-14 08:24:14 +00:00
Samuel Pitoiset	410f7f9f6e	radv: only enable DCC for invisible VRAM on GFX12 DCC should only be allowed on invisible VRAM, otherwise the CPU could read the data and it will read garbage if it's compressed. This also caused GPU hangs after suspend/resume probably because some buffers were compressed when moved back from GTT to VRAM. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12962 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12922 Fixes: `9af11bf306` ("radv: add initial DCC support on GFX12") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34347>	2025-04-14 07:39:33 +00:00
Samuel Pitoiset	75be860eec	radv: use paired context regs when optimal on GFX12 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details CP is very slow on GFX12 and parsing the packet header is the main bottleneck. Using paired context regs reduce the number of packet headers and it should be more optimal. It doesn't seem worth when only one context reg is emitted (one packet header and same number of DWORDS) or when consecutive context regs are emitted (would increase the number of DWORDS). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34421>	2025-04-14 06:18:13 +00:00
Samuel Pitoiset	f92f50c58a	radv: add macros for paired context registers on GFX12 Imported from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34421>	2025-04-14 06:18:13 +00:00
Konstantin Seurer	676e26aed5	radv: Fix rayTracingPositionFetch with multiple geometies Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The fix adds more indirections to avoid increasing register pressure by tracking the primitive address. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34460>	2025-04-11 22:26:08 +00:00

1 2 3 4 5 ...

17356 commits