fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 18:18:06 +02:00

Author	SHA1	Message	Date
Georg Lehmann	4fa3fb87c7	aco/insert_NOPs: allow WMMA with constant C matrix Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34396>	2025-04-22 16:08:56 +00:00
Georg Lehmann	c3964e87f8	radv: apply fneg/fabs modifiers to wmma Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34396>	2025-04-22 16:08:55 +00:00
Georg Lehmann	6d7e67d986	nir,amd: add neg_lo/hi modifiers to cmat_matmul_amd Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34396>	2025-04-22 16:08:55 +00:00
Georg Lehmann	b0c8f31600	aco: set opsel_hi to 1 for WMMA This is ignored by the hardware but LLVM requires it to disassemble GFX12 WMMA. Cc: mesa-stable Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34396>	2025-04-22 16:08:54 +00:00
Eric Engestrom	2bcb55f3f6	aco: help clang 20 do some additions and subtractions clang 20 complains: ../src/amd/compiler/aco_assembler.cpp:837:28: error: writing 1 byte into a region of size 0 [-Werror=stringop-overflow=] 837 \| vaddr[num_vaddr + i] = reg(ctx, instr->operands.back(), 8) + i + 1; \| ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/amd/compiler/aco_assembler.cpp:832:12: note: at offset 5 into destination object ‘vaddr’ of size 5 832 \| uint8_t vaddr[5] = {0, 0, 0, 0, 0}; \| ^~~~~ ../src/amd/compiler/aco_assembler.cpp:837:28: error: writing 1 byte into a region of size 0 [-Werror=stringop-overflow=] 837 \| vaddr[num_vaddr + i] = reg(ctx, instr->operands.back(), 8) + i + 1; \| ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/amd/compiler/aco_assembler.cpp:832:12: note: at offset 6 into destination object ‘vaddr’ of size 5 832 \| uint8_t vaddr[5] = {0, 0, 0, 0, 0}; \| ^~~~~ ../src/amd/compiler/aco_assembler.cpp:837:28: error: writing 1 byte into a region of size 0 [-Werror=stringop-overflow=] 837 \| vaddr[num_vaddr + i] = reg(ctx, instr->operands.back(), 8) + i + 1; \| ~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/amd/compiler/aco_assembler.cpp:832:12: note: at offset 7 into destination object ‘vaddr’ of size 5 832 \| uint8_t vaddr[5] = {0, 0, 0, 0, 0}; \| ^~~~~ But `i < MIN2(instr->operands.back().size() - 1, 5 - num_vaddr)` means `i` is at most `5 - num_vaddr - 1`, which means `vaddr[num_vaddr + i]` => `vaddr[num_vaddr + 5 - num_vaddr - 1]` => `vaddr[5 - 1]` => `vaddr[4]` which is within the valid indices. For some reason, using signed `int` instead allows clang to figure this out, so let's do that since we don't need the extra range. While at it, use ARRAY_SIZE(vaddr) instead of hard-coding the same `5` in several places. Backport-to: 25.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34625>	2025-04-21 15:16:02 +00:00
Marek Olšák	4a51089f30	radv: fix incorrect patch_outputs_read for TCS with dynamic state Fixes: `8c2f9f0665` - radv: switch to the new TCS LDS/offchip size computation Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	2948f7ce96	ac/gpu_info: rename tess ring variables, fold double_offchip_wg Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	d2e016c37d	ac/nir: don't store tess levels for TES in TCS if no_varying is set Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	be8977811b	ac/nir: remove shader_info parameter from ac_nir_compute_tess_wg_info Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	6d9e708642	ac/gpu_info: reduce the tess offchip ring size and compute it proportionately .. to the CU count. We allocated too much. This reduces the tess offchip ring size as follows (examples): - GFX11-12: - Navi31, Navi33, and Navi48 get 75% decrease. - Navi32 gets 68.75% decrease. - Phoenix gets 81.25% decrease. - Phoenix2 gets 93.75% decrease. - GFX10.3: - Navi21 and Navi22 get 37.5% decrease. - Navi23 and Navi24 get 50% decrease. - Rembrandt gets 62.5% decrease. - VanGogh gets 75% decrease. - Raphael gets 93.75% decrease. - GFX8-9: - Vega10 gets 0% decrease. - Vega20 gets 49.6% decrease. - Raven gets 65.3% decrease. - Raven2 gets 93.7% decrease. - Stoney gets 81% decrease. No difference in performance was measured. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	9333c0a1ed	ac/gpu_info: compute the tess factor ring size proportionately to the CU count No change in the size on GPUs with 16 CUs per SE such as Navi31 and Navi48. Navi21 and Navi32 get 25% increase. (20 CUs per SE) APUs get a significant decrease. For example: - Phoenix gets 25% decrease - Vangogh gets 50% decrease - Phoenix2 gets 75% decrease - Raphael and Stoney get 87.5% decrease Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	5fb2de9454	ac/nir: don't include TCS offchip size in LDS_SIZE This drastically reduces LDS usage for TCS. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	b8f2fb81f6	ac/gpu_info: print tessellation ring info Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	b8d15fee3d	ac: minor cleanup of ac_compute_num_tess_patches No change in behavior. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	a905a17f39	ac: use HS offchip wg size from radeon_info in ac_compute_num_tess_patches Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	d82eda72a1	ac/gpu_info: move HS info into radeon_info Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	ea294349bd	radv: move the tess factor ring after the tess offchip ring to match radeonsi Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:54:59 -04:00
Marek Olšák	c057d9105f	ac/gpu_info: add total_tess_ring_size Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:54:59 -04:00
Marek Olšák	97119d980c	ac/gpu_info: clean up ac_get_hs_info, use standard terms like workgroup Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:54:59 -04:00
Samuel Pitoiset	792c30dd32	radv/meta: remove redundant parameter to blit_surf_for_image_level_layer() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34558>	2025-04-18 17:21:24 +02:00
Samuel Pitoiset	a3f2c5f05e	radv/meta: remove unnecessary radv_meta_blit2d_buffer::bs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34558>	2025-04-18 17:21:24 +02:00
Samuel Pitoiset	78c2feed00	radv/meta: rename more buffer->memory for fill/copy/update operations Recently, I renamed most of the helpers for future work but I forgot few things like meta keys, etc. This is for consistency. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34558>	2025-04-18 17:21:24 +02:00
Samuel Pitoiset	43c8cb1ae2	radv/meta: remove unused functions/prototypes Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34558>	2025-04-18 17:21:24 +02:00
Samuel Pitoiset	78f03dcf70	radv/meta: simplify dealing with image layouts for blits/resolves This doesn't do anything useful. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34558>	2025-04-18 17:21:24 +02:00
Yogesh Mohan Marimuthu	e63b24bee8	ac,radeonsi: clear_state is not supported in user queue Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34370>	2025-04-18 07:45:33 +00:00
Yogesh Mohan Marimuthu	61fd80a42e	ac,winsys/amdgpu: get userq_ip_mask supported from kernel info ioctl Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34370>	2025-04-18 07:45:33 +00:00
Konstantin Seurer	76031ba53d	radv: Optimize the gfx12 encode shader Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	97f6287827	radv: Use the BVH8 format on GFX12 Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	95e7343a7d	radv/bvh: Add helpers for encoding The build and update paths can use the same code. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	3af19f336c	radv/bvh: Document GFX12 BVH encoding Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	2942e3affb	radv/rra: Set rra_accel_struct_header::rtip_level Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	fa99eeb2b4	radv/rra: Move gfx10_3 specific code to a new file gfx12 needs completely different code and having them in different files is cleaner. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	9d157173b2	radv: Refactor create_bvh_descriptor Make it a bit more extendable since GFX12 introduced more fields. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	978e9b670e	aco,nir: Add support for new GFX12 ray tracing instructions Adds image_bvh_dual_intersect_ray and image_bvh8_intersect_ray which can handle the new BVH format. Both instructions write up to 10 VGPRs so they need to use a vec16 definition in nir. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Natalie Vock	ee0f784858	aco/ra: Don't consider precolored ops/defs in get_reg_impl Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Natalie Vock	b9e506afd4	aco: Add support for multiple definitions in emit_mimg Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Natalie Vock	f309d76aab	aco: Add support for multiple ops fixed to defs Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	fe739a2da2	ac: Add rt_version rt_version describes which generation of RT capabilities a chip has. This matches what PAL does. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	2dee1117b7	vulkan: Add a vk_device parameter to get_encode_key Useful for selecting different encoding options based on hardware generation. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Caio Oliveira	fd0a7efb5a	spirv, nir: Delay calculation of shared_size when using explicit layout Move the calculation to nir_lower_vars_to_explicit_types(). This consolidates the check of shader_info::shared_memory_explicit_layout in a single place instead of in all drivers. This is motivated by SPV_KHR_untyped_pointers. Before that extension we had essentially two modes for shared memory variables - No layout decorations in the SPIR-V, and both internal layout and driver location was _given by the driver_. - Explicitly laid out, i.e. they are blocks, and decorated with Aliased. Because they all alias, we could assign them driver location directly to the start of the shared memory. With the untyped pointers extension, there's a third option, to be added by a later commit - Explicitly laid out, i.e. they are blocks, and NOT decorated with Aliased. Driver location is _given by the driver_. Blocks with and without Aliased can be mixed. The driver location of multiple blocks that don't alias depend on alignment that is driver-specific, which we can more easily do from the nir_lower_vars_to_explicit_types() that already has access to a function to obtain such value. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> (hk) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> (v3dv) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (anv/hasvk) Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (panvk) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (radv) Reviewed-by: Rob Clark <robdclark@gmail.com> (tu) Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34139>	2025-04-17 19:13:17 +00:00
Rhys Perry	427479c040	aco: remove va_vdst/vm_vsrc/sa_sdst variables Use the "wait" variable instead. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34529>	2025-04-17 17:28:22 +00:00
Rhys Perry	3d6fa6996c	aco: init vm_vsrc/sa_sdst from depctr_wait fossil-db (navi31): Totals from 5805 (7.31% of 79377) affected shaders: Instrs: 14229621 -> 14207115 (-0.16%); split: -0.16%, +0.00% CodeSize: 75358724 -> 75268624 (-0.12%); split: -0.12%, +0.00% Latency: 133637034 -> 133624262 (-0.01%); split: -0.01%, +0.00% InvThroughput: 22067819 -> 22066213 (-0.01%); split: -0.01%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34529>	2025-04-17 17:28:22 +00:00
Rhys Perry	ce2be5ab8e	aco: combine VALU lanemask hazard into VALUMaskWriteHazard This is now basically the same as the original VALUMaskWriteHazard, except it now considers both VALU and SALU writes. Now that it's a part of VALUMaskWriteHazard, differences from the original VALU lanemask workaround are: - it includes SALU reads after the write - it includes VALU writes and SALU/VALU reads after the write which are not lanemasks - it combines s_waitcnt_depctr instructions when it's a read after both a SALU write and a VALU write - non-exec VALU SGPR reads reset the SGPRs read by VALU as a lanemask - exec SGPRs are ignored resolve_all_gfx11() is also finished. fossil-db (navi31): Totals from 21538 (27.13% of 79377) affected shaders: Instrs: 27628855 -> 27552972 (-0.27%); split: -0.30%, +0.03% CodeSize: 145968448 -> 145667616 (-0.21%); split: -0.23%, +0.02% Latency: 209537805 -> 209509519 (-0.01%); split: -0.02%, +0.00% InvThroughput: 36304270 -> 36301624 (-0.01%); split: -0.01%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12623 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11480 Backport-to: 25.0 Backport-to: 25.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34529>	2025-04-17 17:28:22 +00:00
Rhys Perry	4fcf2eb1d7	aco/gfx12: VOPD src0/1 are src bank compatible if they are the same vgpr fossil-db (gfx1201): Totals from 66518 (83.80% of 79377) affected shaders: Instrs: 36939667 -> 36656685 (-0.77%); split: -0.79%, +0.02% CodeSize: 220575208 -> 220201764 (-0.17%); split: -0.21%, +0.04% Latency: 258919732 -> 258137974 (-0.30%); split: -0.35%, +0.05% InvThroughput: 49911351 -> 49643836 (-0.54%); split: -0.55%, +0.02% VClause: 788661 -> 788430 (-0.03%); split: -0.04%, +0.01% SClause: 1176416 -> 1176263 (-0.01%); split: -0.02%, +0.01% VALU: 18014058 -> 17818119 (-1.09%); split: -1.10%, +0.01% VOPD: 4926983 -> 5122922 (+3.98%); split: +4.01%, -0.04% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246>	2025-04-17 14:00:29 +00:00
Rhys Perry	3446f2059d	aco/gfx12: assume VOPD with two v_mov_b32 are src bank compatible fossil-db (gfx1201): Totals from 10576 (13.32% of 79377) affected shaders: (no stats changed) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246>	2025-04-17 14:00:29 +00:00
Rhys Perry	1bd5ae7b14	aco: refactor can_use_vopd so that it returns flags Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246>	2025-04-17 14:00:29 +00:00
Rhys Perry	d4b418bbb9	aco: add are_src_banks_compatible helper for VOPD creation Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246>	2025-04-17 14:00:29 +00:00
Rhys Perry	4b0da5b51f	aco: rename is_opy_only to can_be_opx Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246>	2025-04-17 14:00:29 +00:00
Rhys Perry	408fa33c09	aco/gfx12: don't use second VALU for VOPD's OPX if there is a WaR fossil-db (gfx1201): Totals from 38908 (49.02% of 79377) affected shaders: Instrs: 30268107 -> 30268131 (+0.00%); split: -0.00%, +0.00% CodeSize: 180843648 -> 180843640 (-0.00%); split: -0.00%, +0.00% Latency: 224905962 -> 224906072 (+0.00%); split: -0.00%, +0.00% InvThroughput: 44322988 -> 44323004 (+0.00%) VALU: 15124145 -> 15124167 (+0.00%) VOPD: 4018504 -> 4018482 (-0.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 25.0 Backport-to: 25.1 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246>	2025-04-17 14:00:29 +00:00
Samuel Pitoiset	209a0ede98	radv: add a function to emit meshlet registers on GFX11+ Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34518>	2025-04-17 12:49:47 +00:00
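
Note on 2bcb55f3f6: the bounds argument in that commit message can be sketched in isolation. The snippet below is not the code from aco_assembler.cpp; `fill_vaddr` and its parameters are hypothetical stand-ins, assuming only what the message states (a 5-entry `vaddr` array, a loop bounded by the smaller of "operand size minus one" and the remaining capacity, and a signed `int` index).

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the loop described in the commit message.
// num_vaddr:          slots already filled (0..5)
// last_operand_size:  size of the final operand, in registers
// base_reg:           register number of the final operand
std::array<uint8_t, 5> fill_vaddr(int num_vaddr, int last_operand_size, uint8_t base_reg)
{
   std::array<uint8_t, 5> vaddr = {0, 0, 0, 0, 0};

   // Derive the capacity from the array instead of repeating the literal 5.
   const int capacity = static_cast<int>(vaddr.size());

   // Signed arithmetic: i is at most (capacity - num_vaddr - 1), so
   // num_vaddr + i is at most capacity - 1, the last valid index.
   const int limit = std::min(last_operand_size - 1, capacity - num_vaddr);
   for (int i = 0; i < limit; i++) {
      assert(num_vaddr + i < capacity);
      vaddr[num_vaddr + i] = static_cast<uint8_t>(base_reg + i + 1);
   }
   return vaddr;
}
```

With `num_vaddr = 3` and a large final operand, only indices 3 and 4 are written, which matches the `vaddr[4]` upper bound worked out in the message.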

17394 commits