fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 06:18:10 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	76dcac9d47	radv: advertise VK_KHR_cooperative_matrix on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33378>	2025-02-07 12:06:10 +00:00
Samuel Pitoiset	b05a112d92	radv/nir: add cooperative matrix lowering for GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33378>	2025-02-07 12:06:10 +00:00
Samuel Pitoiset	ad611adeb7	radv/nir: add a struct for parameters to cooperative matrix lowering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33378>	2025-02-07 12:06:10 +00:00
Samuel Pitoiset	dbb7e3cf88	radv: do not keep track of the streamout binding buffer More like BDA style. For future work. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33404>	2025-02-07 10:53:37 +01:00
Samuel Pitoiset	03cacc1406	radv: rework passing draw info via radv_draw_info More like BDA style. For future work. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33404>	2025-02-07 10:53:37 +01:00
Samuel Pitoiset	6f34be88d9	radv: rework passing dispatch info via radv_dispatch_info More like BDA style. For future work. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33404>	2025-02-07 09:30:22 +01:00
Samuel Pitoiset	b5740d5819	radv: use radv_indirect_dispatch() more Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33404>	2025-02-07 09:30:22 +01:00
Samuel Pitoiset	ef7e28e7a8	radv: remove redundant drawCount == 0 for indirect mesh/task draws This is already handled in radv_before_taskmesh_draw(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33404>	2025-02-07 09:30:22 +01:00
Samuel Pitoiset	8625decbcc	radv: fix fetching draw vertex data from counter buffers with transform feedback counterOffset was just ignored and nobody noticed (missing VKCTS coverage). VGT_STRMOUT_DRAW_OPAQUE_OFFSET will do the computation in hw for us. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33407>	2025-02-07 07:59:39 +00:00
Hans-Kristian Arntzen	1fcb494054	radv: Repurpose radv_legacy_sparse_binding drirc Rename the drirc and call it radv_disable_dedicated_sparse_queue instead, since normal queues support sparse now anyway. Keep the workaround for existing known games, since they might not expect a separate SPARSE queue to pop up. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33166>	2025-02-06 14:07:20 +00:00
Hans-Kristian Arntzen	f58630f07c	radv: Always allow sparse on normal GFX/COMPUTE/DMA queues. Forcing a dedicated sparse queue is problematic in real-world scenarios. In the current implicit sync world for sparse updates, we can rely on submission order. For use cases where an application can take advantage of the separate sparse queue to do "async" updates, the existing implementation works well, but problems arise when trying to implement D3D-style submission ordering. E.g., when a game does sparse on a graphics or compute queue, we need to guarantee that previous submissions, sparse update and future submissions are properly ordered. The Vulkan way of implementing this is to: - Signal graphics queue to timeline N (i.e. last submission made) - Wait on timeline N on the sparse queue - Do sparse updates - Signal timeline N + 1 on sparse queue - Wait for timeline N + 1 on graphics queue (can be deferred until next graphics submit) This causes an unavoidable bubble in GPU execution, since the existing sparse queue ends up doing: - Wait pending signal. The implication here is that all previous GPU work must have been submitted. - Do VM operations on CPU timeline - Wait for semaphores to signal (this is required for signal ordering) - ... GPU is meanwhile stalling in a bubble due to GPU -> CPU -> GPU roundtrip. - Signal semaphore on CPU (unblocks GPU work) Letting the GPU go idle here is not great, and we can be screwed over by bad thread scheduling. Another knock-on effect is that the graphics queue is now forced into using a thread for submissions. This is because when the graphics queue wants to wait for timeline N + 1, the sparse queue may not have signalled the timeline yet on CPU, so effectively, we have created a wait-before-signal situation internally in RADV. Throwing another thread under the bus is not great either. 
Just letting the queue in question support sparse binding solves all these issues and I don't see a path forward where the D3D use case can be solved in a separate queue world. It is also friendlier to the ecosystem at large. RADV is the only driver I know of that insists on separate sparse queues and multiple games assume that graphics queue can support sparse. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33166>	2025-02-06 14:07:20 +00:00
Samuel Pitoiset	9b827556f5	radv: fix adding the BO to cmdbuf list when starting conditional rendering Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33403>	2025-02-06 07:13:29 +00:00
Mike Blumenkrantz	30b616244c	radv: print stringname for VkExternalMemoryHandleTypeFlagBits error Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33323>	2025-02-06 01:48:25 +00:00
Mike Blumenkrantz	20013a1774	radv: stop blocking non-2D import/export ops these work fine Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33323>	2025-02-06 01:48:25 +00:00
Mike Blumenkrantz	ca8a740e3b	radv: fix error reporting for VkExternalMemoryTypeFlagBitsKHR wrong type name is confusing cc: mesa-stable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33323>	2025-02-06 01:48:25 +00:00
Samuel Pitoiset	4fc856af98	radv: fix caching on-demand meta shaders This switches to disk_cache instead of our own mechanism which only stored meta shaders when the logical device was destroyed. Meta shaders are still stored separately from the application shaders because they are common to all applications on a given GPU/Mesa version. The default cache is 32MiB which should be large enough. This fixes massive stuttering in FF7 Rebirth but all apps are technically affected. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33370>	2025-02-05 16:30:27 +00:00
Georg Lehmann	ff225dee67	radv: inline radv_nir_lower_poly_line_smooth Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33340>	2025-02-05 11:23:35 +00:00
Georg Lehmann	b588b56078	radv: remove radv_should_lower_poly_line_smooth I think this was broken as there might be a store_output with less than 4 components to a location that shouldn't be smoothed anyway (i.e. not the first one). nir_lower_poly_line_smooth now handles the case where the first location doesn't have 4 components. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33340>	2025-02-05 11:23:35 +00:00
Samuel Pitoiset	f095aaf819	radv/meta: stop using string keys also for DGC and query objects Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33379>	2025-02-05 08:25:00 +00:00
Dave Airlie	44b88c1034	radv/video: add h264 b frame encoding support. This is supported on VCN 3 and newer. Acked-by: David Rosca <nowrep@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31104>	2025-02-05 04:11:38 +00:00
Dave Airlie	717c85d08a	radv/video: calculate colloc buffer size for h264 B frames. This adds the overheads for the colloc buffer needed when B frames are enabled. Acked-by: David Rosca <nowrep@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31104>	2025-02-05 04:11:38 +00:00
Dave Airlie	19b27c77bd	radv/video: move encoder to using a buffer instead of an image For the encoder DPB just allocate a buffer of storage, this should align memory usage more with what radeonsi does. Acked-by: David Rosca <nowrep@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31104>	2025-02-05 04:11:38 +00:00
Samuel Pitoiset	5b856a741d	radv: advertise computeDerivativeGroupQuads on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33303>	2025-02-04 08:11:16 +00:00
Samuel Pitoiset	bd8575ebd3	radv: implement derivative group quads on GFX12 It's natively supported by the hw. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33303>	2025-02-04 08:11:16 +00:00
Samuel Pitoiset	5fb23f29fe	radv/nir: update radv_nir_opt_tid for derivative group quads Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33303>	2025-02-04 08:11:16 +00:00
Samuel Pitoiset	7d3062470f	radv/meta: add missing pipeline lookups Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33360>	2025-02-04 07:52:01 +00:00
Samuel Pitoiset	9993f3dd6a	ac,radv,radeonsi: add new GFX12_DCC_WRITE_COMPRESS_DISABLE tiling flag Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33301>	2025-02-03 21:12:07 +00:00
Konstantin Seurer	3ab55b3c51	radv/meta: Stop using strings for meta keys Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32881>	2025-02-03 16:03:49 +01:00
Marek Olšák	82047fa82f	amd: drop support for LLVM 15, 16, 17 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33211>	2025-02-01 04:22:30 +00:00
Konstantin Seurer	60a20bcf3d	nir: Stop using instructions for debug info Annotating ssa defs without affecting compilation is impossible with debug info instructions since referencing a nir_def from the debug info instr will add uses. The old approach also stops working if passes reorder instructions. This patch proposes a solution which should not regress performance just like the old approach. The difference is that this one allocates a bit more space for debug info instead of adding a new instruction for it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33141>	2025-01-30 20:14:01 +00:00
Timur Kristóf	a9e9ec30a5	radv, radeonsi: Disable early prim export on GFX11+. We suspect that it has no perf benefits on GFX11+. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:46 +00:00
Timur Kristóf	f7305f776e	ac/nir/ngg: Pass radeon_info to mesh shader lowering. Same idea as the VS/TES and GS lowering: Make shader compilation decisions based on the features of the current GPU instead of ad-hoc deciding according to GFX level. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:46 +00:00
Timur Kristóf	b8204c8df9	ac/nir/ngg: Remove gfx_level and family from NGG lowering options. They can be read from radeon_info. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:46 +00:00
Timur Kristóf	e1be943f10	ac/nir/ngg: Add and use a has_ngg_passthru_no_msg field to ac_gpu_info. Instead of using the chip family field. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	a40000b85b	ac/nir/ngg: Add and use a has_ngg_fully_culled_bug field to ac_gpu_info. Better than applying the workaround ad-hoc based on GFX level. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	cad0d26dbf	ac/nir/ngg: Add and use a has_attr_ring field to ac_gpu_info. While theoretically all GFX11+ GPUs have an attribute ring, it is nicer to have this property instead of deciding ad-hoc based on the GFX level. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	b163ce51b1	ac/nir/ngg: Add and use a has_attr_ring_wait_bug field to ac_gpu_info. And apply the attribute ring wait workaround based on the new field. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	e76361d626	ac/nir/ngg: Add radeon_info to NGG lowering options. The intention is to have all the HW features affecting shader compilation in one place, instead of ad-hoc decisions in the code based on the GFX level and chip class. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Timur Kristóf	e9069eec8a	aco: Move NGG pos export scheduling determination to drivers. And don't schedule them on GFX11+ at all. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Samuel Pitoiset	d6f9c19755	radv/amdgpu: add support for AMDGPU_GEM_CREATE_GFX12_DCC This flag will be used to set PTE.DCC for VRAM allocations (i.e. compression). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33284>	2025-01-30 08:18:22 +00:00
Timur Kristóf	8b263555ee	radv: Lower array derefs of vectors outside of shader linking. This fixes depth-only rendering with mesh shaders, as well as array derefs in unlinked shaders in general. Lowering array derefs of vectors is necessary for correctness. Without this, nir_lower_io will incorrectly add the array index to the IO intrinsic base instead of to the component offset. This was previously only done during shader linking, which leaves some problems with unlinked shaders and depth-only rendering. Whether these calls can be safely removed from shader linking will be investigated in a future commit. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12516 Cc: mesa-stable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33264>	2025-01-29 20:05:25 +00:00
Samuel Pitoiset	4425d8556f	radv: use stage instead of entrypoint to determine valid gfx stages Otherwise if the function name is stripped during NIR serialization, importing libraries would break because entrypoint is NULL. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33273>	2025-01-29 14:37:41 +00:00
Marek Olšák	8e8eda4089	radeonsi: fix PS prolog not counting used fragcoord VGPRs correctly Using the used component count is not enough. We need to consider the component mask because any component can be disabled. This might fix tests. This removes the component counting from ac_get_fs_input_vgpr_cnt and determines the component mask where it's needed. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32910>	2025-01-29 07:19:40 +00:00
Samuel Pitoiset	18c7eafcdc	radv: fix programming mip level for TILED_SUB_WINDOWS on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33252>	2025-01-28 20:32:11 +00:00
Samuel Pitoiset	7c949f1760	radv: fix programming pitches for LINEAR_SUB_WINDOW on GFX12 GFX12 supports up to 64k images. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33252>	2025-01-28 20:32:11 +00:00
Eric Engestrom	dd2629b8b8	radv,lvp: fix url to VkAabbPositionsKHR docs The current URL redirects to a page that does not contain any information about this struct, so let's fix that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33159>	2025-01-28 14:28:59 +00:00
Samuel Pitoiset	50a0d1fd65	radv: disable VK_KHR_cooperative_matrix on GFX12 I have it mostly but it won't be ready in time for 25.0 and the changes are probably too large for a backport. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33257>	2025-01-28 13:27:55 +00:00
Samuel Pitoiset	9d528b9966	radv: disable video support on GFX12 VCN 5.0+ isn't yet implemented. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33253>	2025-01-28 12:09:43 +00:00
Samuel Pitoiset	c172f6ef01	radv: fix disabling logic op for srgb/float formats when blending is enabled The Vulkan spec says: "If logicOpEnable is VK_TRUE, then a logical operation selected by logicOp is applied between each color attachment and the fragment’s corresponding output value, and blending of all attachments is treated as if it were disabled. Any attachments using color formats for which logical operations are not supported simply pass through the color values unmodified." When logic op and blending are both enabled, logic op takes precedence and values should be passed through unmodified. Also RB+ shouldn't have any effects when blending is disabled. Fixes new VKCTS coverage dEQP-VK.pipeline.*.logic_op_na_formats.* Fixes: 03b037a0e3 ("radv: disable logic op for float/srgb formats") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33235>	2025-01-28 08:19:15 +00:00
Samuel Pitoiset	ce3a137892	radv: fix the number of drm modifier planes for DCC on GFX12 It's always 1 plane because DCC isn't allocated from the userspace driver. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33213>	2025-01-27 08:44:48 +00:00


9791 commits