fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-04 01:08:03 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	b3c983b8dd	amd,radv,radeonsi: add a new function to update windowed perf counters Some checks failed macOS-CI / macOS-CI (dri) (push) Has been cancelled Details macOS-CI / macOS-CI (xlib) (push) Has been cancelled Details Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39065>	2025-12-24 07:20:01 +00:00
Samuel Pitoiset	47366527ce	radv: fix capturing performance counters with SPM Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14333 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39065>	2025-12-24 07:20:01 +00:00
Samuel Pitoiset	e03461f3bd	radv: change the default value of RADV_TRACE_CACHE_COUNTERS on < GFX10 To not print a warning about missing SPM by default on < GFX10. Also move the function to radv_physical_device.c and make it non-static. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39065>	2025-12-24 07:20:01 +00:00
Timur Kristóf	450a6189de	radv: Initialize transfer queue gang when needed Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Initialize gang CS on unsupported transfer operations. Add a wait when: - SDMA needs to wait for previous transfer operations on ACE - ACE needs to wait for previous transfer operations on SDMA Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39057>	2025-12-23 12:14:59 +00:00
Timur Kristóf	cc5190829f	radv: Declare some gang submit functions in radv private header. They will be called from the transfer copy functions. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39057>	2025-12-23 12:14:59 +00:00
Timur Kristóf	b1938901d0	radv: Use SDMA fence packet when flushing gang semaphores Add back the SDMA fence packet to radv_flush_gang_semaphore. This was regressed by `9666bd1245`. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39057>	2025-12-23 12:14:59 +00:00
Timur Kristóf	d71a05dffa	radv: Implement gang semaphores for transfer queues. We need to use gang semaphores in the following two scenarios: 1. Leader to follower semaphore: Increment the leader to follower semaphore when the leader wants to block the follower: a transfer operation on ACE needs to wait for a previous operation on SDMA. 2. Follower to leader semaphore: Increment the follower to leader semaphore when the follower wants to block the leader: a transfer operation on SDMA needs to wait for a previous operation on ACE. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39057>	2025-12-23 12:14:58 +00:00
Timur Kristóf	4d0975dc83	radv: Update comments for gang semaphores Change the explanation to use "leader" and "follower" terminology. Explain better how it is used with GFX/ACE and SDMA/ACE. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39057>	2025-12-23 12:14:58 +00:00
Timur Kristóf	65bf4e7dcd	radv: Require gang submit and compute for transfer queues RADV's transfer queue implementation will use compute for the transfer operations that aren't supported by the SDMA, so we'll need gang submissions for that. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39057>	2025-12-23 12:14:58 +00:00
Timur Kristóf	f481a5f887	radv: Add function to determine if SDMA supports an image. The following are not supported by SDMA: - Sparse images (aka. PRT) on older GPUs - Multisampled images Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39057>	2025-12-23 12:14:58 +00:00
Timur Kristóf	f55771a17d	radv: Bypass L2 for gang semaphore BO with SDMA/ACE When the "gang leader" is SDMA, we need to ensure that the gang semaphores BO is coherent between SDMA and CP. To achieve this, we need bypass the L2 cache when either SDMA or CP are connected to L2. Suggested-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39057>	2025-12-23 12:14:58 +00:00
Timur Kristóf	fc57fa4589	radv, radeonsi: Don't pass task ring info to mesh/task payload lowering The pass now uses the ring descriptors to figure these out. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39032>	2025-12-22 15:17:59 +00:00
Samuel Pitoiset	044e7f6017	radv/nir: fix front_face opts for points/lines and unknown prim Fixes new VKCTS coverage dEQP-VK.glsl.builtin_var.frontfacing.*. Fixes: `af375c6756` ("radv: Optimize fs builtins using static gfx state") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39041>	2025-12-22 07:59:30 +00:00
Daniel Schürmann	1e8d367537	amd: add and use ac_cu_info::has_vtx_format_alpha_adjust_bug Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38701>	2025-12-22 07:34:48 +00:00
Daniel Schürmann	f7c4aa48a0	ac/gpu_info: add some more flags to ac_cu_info Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38701>	2025-12-22 07:34:46 +00:00
Daniel Schürmann	f791e46c47	aco: add ac_cu_info to aco_compiler_options Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38701>	2025-12-22 07:34:46 +00:00
Daniel Schürmann	553b431aca	ac/gpu_info: move some CU information into separate struct ac_cu_info Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38701>	2025-12-22 07:34:44 +00:00
Samuel Pitoiset	045b778ed6	radv: add the SQTT relocated shaders BO to the cmdbuf list Found this while debugging another thing with amdgpu.debug_mask=0x1 (VM). Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39002>	2025-12-22 07:13:06 +00:00
Benjamin Cheng	fa8b0b6bbb	radv/video: Enable write combine for decode Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: David Rosca <david.rosca@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39025>	2025-12-18 15:25:57 -05:00
Marek Olšák	3c5c96fedb	radv: double pixel throughput in certain cases of PS without interpolated inputs Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This reduces the number of initialized VGPRs by 1 when no barycentric coordinates are used. I have verified with zink that this indeed increases performance for cases where sysvals like frag_coord and front_face are used without interpolated PS inputs. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38936>	2025-12-18 03:37:58 +00:00
Samuel Pitoiset	f8feed17e1	ac,radv,radeonsi: add tracked register macros to common code Because the tracked registers are really driver dependant, the driver is expected to handle the tracked_registers struct itself. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38740>	2025-12-17 15:09:26 +00:00
Samuel Pitoiset	c580fc667f	ac,radv: add ac_cmdbuf::context_roll and use it Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38740>	2025-12-17 15:09:26 +00:00
Samuel Pitoiset	f3b385859a	ac,radv: add more cmdbuf emit helpers Some can't be shared with RadeonSI because it uses templates in some places. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38740>	2025-12-17 15:09:25 +00:00
Samuel Pitoiset	b444dc145a	radv: remove redundant assertions in radeon_emit_{array}() The common helpers already have assertions. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38740>	2025-12-17 15:09:25 +00:00
Samuel Pitoiset	262fc80e45	ac,radv,radeonsi: add functions to initialize tracked regs Also initialize the new slots for RADV. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38740>	2025-12-17 15:09:25 +00:00
Samuel Pitoiset	44314e1ea6	ac,radv,radeonsi: add ac_tracked_regs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38740>	2025-12-17 15:09:24 +00:00
Samuel Pitoiset	c97bd17d4d	radv: switch to AC_TRACKED_xxx Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38740>	2025-12-17 15:09:23 +00:00
Samuel Pitoiset	5d76202b6d	radv: create descriptors for color/depth-stencil surfaces earlier For less CPU overhead when rendering begins and also because it's easy to pre-compute those descriptors. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38714>	2025-12-17 11:11:18 +00:00
Samuel Pitoiset	c8729cdd3c	radv/meta: stop passing a stencil attachment for depth decompress It should only be the depth aspect. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38714>	2025-12-17 11:11:18 +00:00
Samuel Pitoiset	43d7d97b13	radv/meta: inject image view usage info This will be used to initialize color/depth-stencil descriptors earlier when the image view is created. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38714>	2025-12-17 11:11:18 +00:00
Samuel Pitoiset	ce69cabb60	radv: constify radv_{cb,ds}_buffer_info parameters Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38714>	2025-12-17 11:11:18 +00:00
Daniel Schürmann	125ac1626d	radv: remove precomputed registers from radv_shader_binary Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It is enough to compute them after upload. This saves some disk space and eliminates an unlikely bug where the shader cache is shared between two GPUs with the same chip but a different number of enabled CUs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38970>	2025-12-17 08:16:06 +00:00
Samuel Pitoiset	6193483c4f	radv: rename RADEON_FLAG_VA_UNCACHED to RADEON_FLAG_GL2_BYPASS Easier to understand. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38907>	2025-12-16 07:17:08 +00:00
Samuel Pitoiset	0beb83b0eb	radv: add RADV_DEBUG=vm option Useful for debugging page faults because this adds a gap between every VA allocation. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38907>	2025-12-16 07:17:08 +00:00
Mauro Rossi	e8134e6eaf	radv/rt: Fix gnu-empty-initializer error Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fixes the following building error happening with clang: FAILED: src/amd/vulkan/libvulkan_radeon.so.p/nir_radv_nir_rt_traversal_shader.c.o ... ../src/amd/vulkan/nir/radv_nir_rt_traversal_shader.c:1159:49: error: use of GNU empty initializer extension [-Werror,-Wgnu-empty-initializer] struct radv_nir_rt_traversal_params params = {}; ^ 1 error generated. Fixes: `f692ac76` ("radv/rt: Use traversal vars for object origin/direction in ahit/isec") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38954>	2025-12-15 22:27:29 +01:00
Timur Kristóf	0324700c03	radv: Use zero-filled BO for GFX6 and GFX10 null index buffer bug GFX10 hangs when drawing from a 0-sized index buffer. GFX6 has a HW bug when the index buffer address is 0. Looking at VK CTS runs, GFX6 still triggers VM faults despite the current mitigation, and it also tries to access memory when the index buffer is zero sized. So it looks like GFX6 and GFX10 really have the same bug. Let's share the mitigation between the two. Use a zero-filled BO instead of the upload buffer. This fixes VM faults on GFX6, and should speed up GFX10 a bit. Note that the zero-filled BO is also going to be used for other bug mitigations on GFX6-7. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38958>	2025-12-15 21:03:19 +00:00
Benjamin Cheng	59f821218c	radv/video: Move probability table filling to bind Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We should not manipulate the session buffers at command recording time. It shouldn't cause any problems as these initialized probability tables are not modified by firmware, but moving these to bind time should be safer and also faster if an application frequently RESETs. Reviewed-by: David Rosca <david.rosca@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38926>	2025-12-15 18:49:28 +00:00
Georg Lehmann	ec81337d8d	radv: use nir_opt_uniform_subgroup Foz-DB Navi21: Totals from 665 (0.68% of 97581) affected shaders: MaxWaves: 12856 -> 12822 (-0.26%) Instrs: 2073376 -> 2068645 (-0.23%); split: -0.23%, +0.00% CodeSize: 11116904 -> 11098376 (-0.17%); split: -0.18%, +0.01% VGPRs: 39584 -> 39568 (-0.04%); split: -0.20%, +0.16% SpillSGPRs: 160 -> 155 (-3.12%) SpillVGPRs: 2995 -> 2968 (-0.90%) Latency: 15432093 -> 15503462 (+0.46%); split: -0.13%, +0.59% InvThroughput: 3344411 -> 3351185 (+0.20%); split: -0.08%, +0.28% VClause: 50278 -> 50225 (-0.11%); split: -0.15%, +0.04% SClause: 57537 -> 57505 (-0.06%); split: -0.18%, +0.13% Copies: 189642 -> 188175 (-0.77%); split: -0.86%, +0.08% Branches: 68800 -> 68502 (-0.43%); split: -0.45%, +0.02% PreSGPRs: 37646 -> 37068 (-1.54%) PreVGPRs: 35891 -> 35943 (+0.14%) VALU: 1386943 -> 1385881 (-0.08%); split: -0.09%, +0.01% SALU: 287322 -> 284165 (-1.10%); split: -1.11%, +0.01% VMEM: 90874 -> 90820 (-0.06%) Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38902>	2025-12-15 12:22:32 +00:00
Georg Lehmann	fee87679bf	radv/nir: fix front_face_fsign opt Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details If front facing primitives are culled, there are only back facing fragments left. Fixes: `0fe8250bf4` ("radv: optimize known front_face_fsign too") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38937>	2025-12-13 10:22:21 +01:00
Georg Lehmann	17e597093d	radv: eliminate unused FS output channels For formats that don't have all color channels, there is no reason to output all of them. Games often write to R only or RGB formats with non trivial remaining channels. Foz-DB Navi21: Totals from 10270 (10.55% of 97347) affected shaders: MaxWaves: 249166 -> 250950 (+0.72%); split: +0.73%, -0.01% Instrs: 8442016 -> 8354715 (-1.03%); split: -1.05%, +0.01% CodeSize: 45939644 -> 45487156 (-0.98%); split: -1.01%, +0.02% VGPRs: 472584 -> 463784 (-1.86%); split: -1.98%, +0.12% SpillSGPRs: 1502 -> 1448 (-3.60%) LDS: 6024192 -> 6011904 (-0.20%) Inputs: 42463 -> 41773 (-1.62%) Outputs: 24601 -> 23955 (-2.63%) Latency: 78011745 -> 77653907 (-0.46%); split: -0.56%, +0.10% InvThroughput: 19767826 -> 19274046 (-2.50%); split: -2.53%, +0.03% VClause: 177891 -> 176681 (-0.68%); split: -0.80%, +0.12% SClause: 236784 -> 235324 (-0.62%); split: -0.72%, +0.10% Copies: 621048 -> 616096 (-0.80%); split: -1.03%, +0.23% Branches: 202608 -> 201811 (-0.39%); split: -0.44%, +0.05% PreSGPRs: 441032 -> 437698 (-0.76%); split: -0.77%, +0.01% PreVGPRs: 378067 -> 369564 (-2.25%); split: -2.26%, +0.01% VALU: 5906415 -> 5833179 (-1.24%); split: -1.25%, +0.01% SALU: 973428 -> 968088 (-0.55%); split: -0.61%, +0.06% VMEM: 298277 -> 296504 (-0.59%); split: -0.61%, +0.01% SMEM: 402244 -> 399612 (-0.65%); split: -0.71%, +0.06% Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38853>	2025-12-12 17:00:51 +00:00
Georg Lehmann	5d2f3065fd	radv: gather color0_written with scalar io correctly Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38853>	2025-12-12 17:00:51 +00:00
Georg Lehmann	18013e3281	radv: consider dual src blend for when epilog needs alpha Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38853>	2025-12-12 17:00:51 +00:00
Georg Lehmann	a1fbf91ff2	radv/nir: fix radv_nir_remap_color_attachment progress And switch to SPDX header. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38853>	2025-12-12 17:00:51 +00:00
Georg Lehmann	0fe8250bf4	radv: optimize known front_face_fsign too Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Foz-DB Navi21: Totals from 1941 (1.99% of 97581) affected shaders: MaxWaves: 44196 -> 44612 (+0.94%); split: +0.97%, -0.03% Instrs: 1553182 -> 1548823 (-0.28%); split: -0.36%, +0.08% CodeSize: 8261308 -> 8261496 (+0.00%); split: -0.17%, +0.18% VGPRs: 98488 -> 97968 (-0.53%); split: -0.56%, +0.03% SpillSGPRs: 1288 -> 1347 (+4.58%) Latency: 19136399 -> 19094748 (-0.22%); split: -0.38%, +0.16% InvThroughput: 5424693 -> 5409469 (-0.28%); split: -0.32%, +0.04% VClause: 29941 -> 29943 (+0.01%); split: -0.26%, +0.27% SClause: 39922 -> 39972 (+0.13%); split: -1.02%, +1.14% Copies: 109736 -> 109684 (-0.05%); split: -1.45%, +1.40% Branches: 24523 -> 24499 (-0.10%); split: -0.12%, +0.02% PreSGPRs: 99206 -> 99191 (-0.02%); split: -0.02%, +0.00% PreVGPRs: 79019 -> 78240 (-0.99%); split: -1.00%, +0.02% VALU: 1145088 -> 1140731 (-0.38%); split: -0.44%, +0.06% SALU: 164035 -> 164077 (+0.03%); split: -0.48%, +0.51% SMEM: 80668 -> 80658 (-0.01%) We used to call this pass before front_face_fsign is created but that has changed. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38906>	2025-12-12 08:24:38 +00:00
Marek Olšák	9bd2c6dcb2	ac/nir: allow smaller workgroups for GS Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It's not good for performance, but it's possible to use for debugging. Running single-wave GS workgroups could work around any LDS race conditions. Setting the workgroup size to 64 reliably works around GLCTS primitive_counterline failures, indicating streamout data corruption with multi-wave GS workgroups. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38328>	2025-12-12 04:27:32 +00:00
Christian Gmeiner	b393518bdf	treewide: Use wsi_common_is_swapchain_image() helper Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Replace the duplicated swapchain image detection pattern across all Vulkan drivers with the new wsi_common_is_swapchain_image() helper. Since the swapchain handle can be extracted from VkImageCreateInfo's pNext chain inside wsi_common_create_swapchain_image(), remove the now-redundant VkSwapchainKHR parameter from that function. This removes the #ifdef guards for Android/WSI platforms from each driver, as the helper now handles this uniformly. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38541>	2025-12-11 20:20:39 +00:00
Konstantin Seurer	85e8f815e0	radv/nir: Use fmt_idx correctly Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38462>	2025-12-11 16:26:01 +00:00
Konstantin Seurer	c14eb415a2	radv/bvh: Avoid a slow case when compressing triangles Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38462>	2025-12-11 16:26:01 +00:00
Konstantin Seurer	2749b5b713	radv/bvh: Fix calculating the vertex payload/prefix sizes This calculation needs to happen in the same loop as the geometry/triangle id calculations in case the selected invocation is before all invocations that were already selected. Totals from 1269 (15.10% of 8406) affected BVHs: compacted_size: 137581888 -> 137606464 (+0.02%); split: -0.08%, +0.10% sah: 6496048424 -> 6496048450 (+0.00%); split: -0.00%, +0.00% primitive_node_count: 604384 -> 604656 (+0.05%); split: -0.14%, +0.19% Fixes: `c18a7d0` ("radv: Emit compressed primitive nodes on GFX12") Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38462>	2025-12-11 16:26:00 +00:00
Konstantin Seurer	3a3810647e	radv/bvh: Assert that indices_midpoint is valid Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38462>	2025-12-11 16:26:00 +00:00

1 2 3 4 5 ...

11251 commits