fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 22:18:18 +02:00

Author	SHA1	Message	Date
Daniel Schürmann	13ad3db43f	aco/lower_branches: implement try_remove_simple_block() in lower_branches() This is mostly the same as in jump_threading, but can handle multiple predecessors. Totals from 3523 (4.44% of 79395) affected shaders: (Navi31) Instrs: 10244892 -> 10244753 (-0.00%); split: -0.00%, +0.00% CodeSize: 54171500 -> 54168540 (-0.01%); split: -0.01%, +0.00% Latency: 75070425 -> 75059570 (-0.01%); split: -0.02%, +0.00% InvThroughput: 11606911 -> 11605767 (-0.01%); split: -0.01%, +0.00% Branches: 331778 -> 331675 (-0.03%); split: -0.05%, +0.02% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32477>	2025-01-23 00:11:06 +00:00
Daniel Schürmann	2b5a893e29	aco/lower_branches: do eliminate_useless_exec_writes_in_block() during branch lowering. Totals from 728 (0.92% of 79395) affected shaders: (Navi31) Instrs: 452926 -> 452161 (-0.17%) CodeSize: 2255536 -> 2252504 (-0.13%) Latency: 1683404 -> 1683470 (+0.00%); split: -0.01%, +0.01% InvThroughput: 210887 -> 210888 (+0.00%); split: -0.00%, +0.00% SALU: 77865 -> 77106 (-0.97%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32477>	2025-01-23 00:11:06 +00:00
Daniel Schürmann	eecdb45d61	aco: consider s_cbranch_exec* instructions in needs_exec_mask() Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32477>	2025-01-23 00:11:06 +00:00
Daniel Schürmann	de1e38e214	aco/assembler: Find loop exits using the successor's loop nest depth Previously, we just used the next block after a loop that has a back-edge. This assumes that loop-exit blocks can only be removed when falling through to the next block, when in fact it can also be a jump to somewhere else, in future even to some block before the actual loop. 12 (0.02% of 79395) affected shaders. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32477>	2025-01-23 00:11:06 +00:00
Daniel Schürmann	29c63de062	aco/jump_threading: don't remove loop preheaders They might be needed as convergence point in order to insert code (e.g. for loop alignment, wait states, etc.). Totals from 1 (0.00% of 79395) affected shaders: CodeSize: 12672 -> 12716 (+0.35%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32477>	2025-01-23 00:11:06 +00:00
Samuel Pitoiset	b4085df31c	radv: re-emit streamout state for GFX12 when the user SGPR changes This is more for consistency than a real fix. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33164>	2025-01-22 22:54:23 +00:00
Martin Roukala (né Peres)	6a4c99adf1	radeonsi/ci: update the vangogh expectations Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33128>	2025-01-22 16:10:55 +00:00
Pierre-Eric Pelloux-Prayer	6cee989915	amd: add ac_drm_device_get_cookie This returns the underlying device pointer but as an opaque uintptr_t. This will be required because libdrm_amdgpu will return the same device when called multiple times from the same process. radeonsi relies on the pointer value to identify whether the devices are the same and adjust the synchronisation logic based on that. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33081>	2025-01-22 14:55:56 +00:00
Pierre-Eric Pelloux-Prayer	910c18df6c	dri: use _checked variants of xcb requests Requests with no reply will report errors by default to the event loop, which then usually causes a not very useful log like this to be printed: X Error of failed request: BadAlloc (insufficient resources for operation) Major opcode of failed request: 149 () Minor opcode of failed request: 2 Serial number of failed request: 33 Current serial number in output stream: 34 This commit introduces some helpers to handle the xcb errors in Mesa, and be able to report errors properly. For instance the same error will now log: MESA: error: dri3_alloc_render_buffer:1634 xcb_dri3_pixmap_from_buffer[s] failed MESA: error: X error: 11 It's not fixing the underlying issue, but at least now tests like "glx-visuals-stencil -pixmap" and "glx-visuals-depth pixmap" fail properly. cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33036>	2025-01-21 14:33:13 +00:00
Pierre-Eric Pelloux-Prayer	b307951648	glx: fix glx-create-context-invalid-es-version * GLES3.x is only valid for x <= 2 * The expected error is GLXBadProfileARB, not BadValue cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33036>	2025-01-21 14:33:13 +00:00
Marek Olšák	de790c3c5f	Revert "ac/llvm: enable wqm for ac_build_quad_swizzle from ac_build_fs_interp_mov" This reverts commit `9d4d9e6150`. It breaks on Navi31: * KHR-GL46.shaders.uniform_block.instance_array_basic_type.shared.bvec3,Fail * KHR-GL46.shaders.uniform_block.instance_array_basic_type.std140.bvec3,Fail * KHR-GL46.shaders.uniform_block.random.all_per_block_buffers.13,Fail * KHR-GL46.shaders.uniform_block.random.all_per_block_buffers.3,Fail * KHR-GL46.shaders.uniform_block.single_basic_array.shared.bvec3,Fail * KHR-GL46.shaders.uniform_block.single_basic_array.std140.bvec3,Fail * KHR-GLES3.shaders.uniform_block.instance_array_basic_type.shared.bvec3,Fail * KHR-GLES3.shaders.uniform_block.instance_array_basic_type.std140.bvec3,Fail * KHR-GLES3.shaders.uniform_block.random.all_per_block_buffers.13,Fail * KHR-GLES3.shaders.uniform_block.random.all_per_block_buffers.3,Fail * KHR-GLES3.shaders.uniform_block.single_basic_array.shared.bvec3,Fail * KHR-GLES3.shaders.uniform_block.single_basic_array.std140.bvec3,Fail * dEQP-GLES3.functional.ubo.instance_array_basic_type.shared.bvec3_both,Fail * dEQP-GLES3.functional.ubo.instance_array_basic_type.std140.bvec3_both,Fail * dEQP-GLES3.functional.ubo.random.vector_types.24,Fail * dEQP-GLES3.functional.ubo.single_basic_array.shared.bvec3_both,Fail * dEQP-GLES3.functional.ubo.single_basic_array.std140.bvec3_both,Fail Fixes: `9d4d9e6150` Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33118>	2025-01-21 11:58:37 +00:00
Samuel Pitoiset	f4cd2d1c3f	radv: use global atomics for generated/written primitives query on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33041>	2025-01-21 08:42:32 +00:00
Samuel Pitoiset	0901f8fc25	radv: emit the shader buffer query VA on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33041>	2025-01-21 08:42:32 +00:00
Samuel Pitoiset	2f86338ba3	radv: allocate memory for the shader query buffer on GFX12 The allocation is done on-demand. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33041>	2025-01-21 08:42:32 +00:00
Samuel Pitoiset	15a69991fe	radv: lower emulated queries with global atomics on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33041>	2025-01-21 08:42:32 +00:00
Samuel Pitoiset	a2069b1b26	radv: declare a new user SGPR for emulating queries on GFX12 GDS is gone on GFX12 and generated/written primitives queries need to be emulated using global atomics. This new user SGPR will be used to pass the 32-bit VA of the shader query buffer. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33041>	2025-01-21 08:42:32 +00:00
Samuel Pitoiset	b942e285c3	radv: fix transform feedback on GFX12 The original implementation based on RadeonSI was broken for pause/resume and for indirect draws with a counter buffer basically. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33017>	2025-01-21 08:05:20 +00:00
Samuel Pitoiset	1f253700bc	radv: do not overallocate the number of exports for streamout on GFX12 This shouldn't be needed because GE_GS_OREDERD_ID is always reset to 0 when streamout is started. Thus it's technically impossible that the ordered ID is more than 12-bit. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33017>	2025-01-21 08:05:19 +00:00
Samuel Pitoiset	d4ff011b12	radv: advertise VK_KHR_maintenance8 There is nothing to do for VK_PIPELINE_CACHE_CREATE_INTERNALLY_SYNCHRONIZED_MERGE_BIT_KHR because the vulkan/runtime code already locks the dstCache unconditionally. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33091>	2025-01-21 07:28:14 +00:00
Samuel Pitoiset	40131ddadc	radv: adjust the source aspect for color to depth/stencil image copies The opposite is already supported. Note that only one aspect (depth or stencil) is supported when it's a color<->depth/stencil copy, and multiplanar images aren't supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33091>	2025-01-21 07:28:14 +00:00
Samuel Pitoiset	3be1e9ee4d	radv: add support for VkMemoryBarrierAccessFlags3KHR There are no flags yet. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33091>	2025-01-21 07:28:14 +00:00
Samuel Pitoiset	efab1885b7	ac/sqtt: update programming SQTT on GFX12 This is pure guess but I think GFX12 now uses 48-bits VAs for configuring the SQTT buffer. This isn't yet enough to generate a capture because it's missing some info I don't know, but it's a start. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33068>	2025-01-20 23:50:10 +00:00
Samuel Pitoiset	05bfa317a0	radv: remove duplicate definition of SQTT_BUFFER_ALIGN_SHIFT It's already defined in ac_sqtt.h. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33068>	2025-01-20 23:50:10 +00:00
Samuel Pitoiset	de9d8a23d2	radv: add a helper to report if cooperative matrix is enabled To avoid duplicating checks. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33120>	2025-01-20 13:58:13 +00:00
Ivan Avdeev	14e3231b56	radv: add a flag to indicate ray tracing support Determine whether the device has hardware raytracing support early, and then use this result where needed, instead of checking for `gfx_level` every time. This is a prerequisite for CYAN_SKILLFISH chip enablement. This chip is still GFX10, not GFX10_3, but has hardware support for accelerated `image_bvh{,64}_intersect_ray` instructions. Just checking for `gfx_level` is insufficient for it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33109>	2025-01-20 08:27:11 +00:00
Samuel Pitoiset	0a4584a684	radv: bump maxViewportDimensions to 32K on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33064>	2025-01-17 21:10:23 +00:00
Samuel Pitoiset	2ba91d1deb	radv: promote VK_EXT_depth_clamp_zero_one to KHR Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33090>	2025-01-17 19:02:02 +00:00
Samuel Pitoiset	c84b1dda0b	ac/nir: fix skipping streamout when no buffers are bound on GFX12 RadeonSI compiles shader variants with streamout disabled but RADV doesn't do that. The alternative solution is to set the streamout buffer size to 0 to indicate that streamout isn't bound. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33058>	2025-01-17 11:24:55 +00:00
Samuel Pitoiset	ede0d534ef	radv: add GFX12 support to the null winsys Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33063>	2025-01-17 10:51:49 +00:00
Pierre-Eric Pelloux-Prayer	9d4d9e6150	ac/llvm: enable wqm for ac_build_quad_swizzle from ac_build_fs_interp_mov Without this, WQM is only used for the lds_param_load like this: s_wqm_b64 exec, exec lds_param_load v5, attr0.x wait_vdst:15 s_mov_b64 exec, s[0:1] v_mov_b32_dpp v5, v5 quad_perm:[0,0,0,0] row_mask:0xf bank_mask:0xf With this change we get: s_wqm_b64 exec, exec lds_param_load v5, attr0.x wait_vdst:15 s_mov_b64 exec, s[0:1] ... s_wqm_b64 exec, exec v_mov_b32_dpp v5, v5 quad_perm:[0,0,0,0] row_mask:0xf bank_mask:0xf s_mov_b64 exec, s[0:1] This fixes KHR-GL46.shaders.uniform_block.random.nested_structs_instance_arrays.0 and other similar tests with LLVM. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32959>	2025-01-17 09:55:45 +00:00
Pierre-Eric Pelloux-Prayer	182d662ccf	ac/llvm: add wqm param to ac_build_quad_swizzle And to ac_build_dpp because it's used from quad_swizzle. No functional changes but will be used in the next commit. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32959>	2025-01-17 09:55:45 +00:00
Georg Lehmann	71cb394b02	aco: implement some more std::vector functions for small_vec Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33043>	2025-01-17 09:25:48 +00:00
Georg Lehmann	31de188bc2	aco: support less trivial component types in small_vec Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33043>	2025-01-17 09:25:48 +00:00
Georg Lehmann	15cba08db0	aco: guard small_vector move/copy operator against self assignment Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33043>	2025-01-17 09:25:48 +00:00
David Rosca	ccb450b91c	radeonsi/uvd: Set decode target swizzle mode on GFX9 Acked-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32829>	2025-01-17 08:53:05 +00:00
Samuel Pitoiset	4526f2692e	radv: mark AMD CDNA as unsupported No access to the hw and likely broken. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33031>	2025-01-17 01:30:16 +00:00
Samuel Pitoiset	c942d957b0	radv: fail to initialize when the AMD GPU generation is unsupported Better to be conservative than allowing something that isn't supposed to be working. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33031>	2025-01-17 01:30:16 +00:00
Sonny Jiang	5b2de9e593	radeonsi/vcn: Add vcn_5_0_1 support Add support for AMD vcn_5_0_1 Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33004>	2025-01-16 23:41:28 +00:00
Vignesh Raman	9e7ca3b86a	ci: update expectation files Update expectation files for the test runs with kernel 6.13-rc4. Signed-off-by: Vignesh Raman <vignesh.raman@collabora.com> Reviewed-by: David Heidelberg <None> Reviewed-by: Sergi Blanch Torné <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32788>	2025-01-16 22:57:52 +00:00
Marek Olšák	ff6e3e9f76	nir: add next_stage param to nir_slot_is_varying & nir_remove_sysval_output The result of nir_slot_is_varying depends on what the next shader stage is, and nir_remove_sysval_output uses it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32855>	2025-01-16 16:28:15 +00:00
Pierre-Eric Pelloux-Prayer	1278d5286c	radeonsi, radv, virtio: use AMDGPU_GEM_CREATE_VIRTIO_SHARED Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:33 +00:00
Pierre-Eric Pelloux-Prayer	6c0e4a0ece	ac/virtio: add virtio-only AMDGPU_GEM_CREATE flag On the host side virglrenderer creates dmabuf on demand when: * cpu mapping is requested * setting up scan out * sharing buffers between guest processes On-demand dmabuf creation only works if the ctx that created the BO still exists and knows about this BO. This assumption works ok for the first 2 cases, but can break with the last one (and it does cause issues on Android). eg: * process A allocates BO and exports it as a guest dmabuf * process A closes its handle to the BO (-> detach_resource) * process B imports the guest dmabuf -> this triggers the attach_resource function in virglrenderer. If the given resource isn't a VIRGL_RESOURCE_FD_DMABUF it'll try to get one... But for this to work, process A needs to be used -> this fails because this resource was detached from it. The reason we create dmabuf on demand is to avoid hitting the number of open file descriptor limit. So to cover the 3rd case, we'll use the VIRTGPU_BLOB_FLAG_USE_SHAREABLE flag, but try to limit to as few possible buffers as possible. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:33 +00:00
Pierre-Eric Pelloux-Prayer	6b6340fea1	radv/virtio: disable syncobj timeline support The virtio-gpu guest kmd reports timelines as supported, so querying DRM_CAP_SYNCOBJ_TIMELINE as vk_drm_syncobj_get_type() does will return true. The native context code on the other hand doesn't support timelines, and support is disabled in the "ac/virtio: disable timeline_syncobj support" commit. Fix the inconsistency by manually disabling timeline support when info.has_timeline_syncobj is false. Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:33 +00:00
Pierre-Eric Pelloux-Prayer	9de728c4d0	radv: enable virtio native context support No big code changes are needed to support virtio. The main caveat with radv is about buffer allocations. Allocating a cpu visible buffer requires the host process (eg qemu) to create a dmabuf, mmap it, then map the host CPU address into the guest application CPU address space. The first issue is about the number of dmabuf created because we might hit the number of open file limit. The host process limit can be raised but we would hit the second issue - at least on qemu: there's a limit on how many sections can be mapped and ultimately we hit this assert: assert(map->sections_nb < TARGET_PAGE_SIZE); (the third issue is a performance one: these operations have a cost, and this increases some Vulkan app loading times) radeonsi is not really affected because it's using pb_slab to suballocate small buffers from larger ones. radv on the other hand doesn't, so if an app decides to allocate lots of cpu visible small buffers, we're likely to fail. Earlier versions of the amdgpu nctx code had a suballocator, but it was removed to simplify the code. It could be restored later; or radv could be modified to use a suballocator (like anv does AFAICT). Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00
Pierre-Eric Pelloux-Prayer	2269ea7e2f	ac/virtio: disable timeline syncobj support Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00
Pierre-Eric Pelloux-Prayer	dc83195175	ac/virtio: disable userptr and local buffers They're not supported yet so let's not pretend they are. In particular use_local_buffers can cause VM_ALWAYS_VALID to be used, which then prevents the creation of a dmabuf. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00
Pierre-Eric Pelloux-Prayer	22263616ed	amd: amdgpu-virtio implementation Native context support is implemented by diverting the libdrm_amdgpu functions into new functions that use virtio-gpu. VA allocations are done directly in the guest, using newly exposed libdrm_amdgpu helpers (retrieved through dlopen/dlsym). Guest <-> Host roundtrips can be expensive so we try to avoid them as much as possible. When possible we also don't wait for the host reply in cases where it's not needed to get a correct result. Implicit sync works because virtio-gpu commands are submitted in order to the host (there is a single queue per device, shared by all the guest processes). virtio-gpu also only supports one context per file description (but multiple file descriptions per process) while amdgpu only allows one fd per process, but multiple contexts per fd. This causes synchronization problems, because virtio-gpu drops all sync primitives if they belong to the same fd/context/ring: ie the amdgpu_ctx can't be expressed in virtio-gpu terms. For now the solution is to only allocate a single amdgpu_ctx per application. Contrary to radeonsi/radv, amdgpu_virtio can use libdrm_amdgpu directly: the ones that don't rely on ioctl() are safe to use here. Tested-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00
Pierre-Eric Pelloux-Prayer	a565f2994f	amd: move all uses of libdrm_amdgpu to ac_linux_drm This is required to implement virtio native-context. In a virtualized environment, most of the functions provided by libdrm_amdgpu will be implemented using virtio. This allows to implement efficient virtualization, by forwarding the kernel API to the host, instead of the GL/VK calls. Similarly, the raw 'fd' or 'gem_handle' arguments are replaced by opaque types. This allows to encapsulate all the needed state in the handle, and use unmodified API between baremetal and virtualized contexts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21658>	2025-01-16 12:24:32 +00:00
Samuel Pitoiset	874d34cf1b	radv: fix emitting SPI_SHADER_GS_OUT_CONFIG_PS with NULL FS on GFX12 This register wasn't emitted at all if the fragment shader was NULL and this was causing random GPU hangs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33030>	2025-01-16 10:33:46 +00:00
Samuel Pitoiset	079f55d405	radv: advertise VK_MESA_image_alignment_control on GFX12 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33033>	2025-01-16 08:49:38 +00:00
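
Note on 15cba08db0 (guarding small_vector move/copy operators against self assignment): the sketch below illustrates the general technique only. It is not ACO's small_vec; the TinyVec class, its members, and the survives_self_assignment() helper are hypothetical names invented for this example. It shows why the guard matters: an assignment operator that releases its own storage before copying from the source would wipe the container when source and destination are the same object.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>

// Hypothetical minimal vector, used only to illustrate the self-assignment guard.
class TinyVec {
public:
   TinyVec() = default;
   ~TinyVec() { release(); }

   TinyVec(const TinyVec& other) { copy_from(other); }
   TinyVec(TinyVec&& other) noexcept { steal_from(other); }

   TinyVec& operator=(const TinyVec& other)
   {
      if (this == &other) // guard: release() below would destroy the source
         return *this;
      release();
      copy_from(other);
      return *this;
   }

   TinyVec& operator=(TinyVec&& other) noexcept
   {
      if (this == &other) // guard: the same hazard applies to self move
         return *this;
      release();
      steal_from(other);
      return *this;
   }

   void push_back(uint32_t v)
   {
      if (len == cap)
         grow();
      heap[len++] = v;
   }

   size_t size() const { return len; }

   uint32_t operator[](size_t i) const
   {
      assert(i < len);
      return heap[i];
   }

private:
   uint32_t* heap = nullptr;
   size_t len = 0;
   size_t cap = 0;

   void release()
   {
      delete[] heap;
      heap = nullptr;
      len = cap = 0;
   }

   void copy_from(const TinyVec& other)
   {
      heap = other.cap ? new uint32_t[other.cap] : nullptr;
      std::copy_n(other.heap, other.len, heap);
      len = other.len;
      cap = other.cap;
   }

   void steal_from(TinyVec& other)
   {
      heap = other.heap;
      len = other.len;
      cap = other.cap;
      other.heap = nullptr;
      other.len = other.cap = 0;
   }

   void grow()
   {
      size_t new_cap = cap ? cap * 2 : 4;
      uint32_t* bigger = new uint32_t[new_cap];
      std::copy_n(heap, len, bigger);
      delete[] heap;
      heap = bigger;
      cap = new_cap;
   }
};

// Returns true if the contents are intact after self copy and self move.
bool survives_self_assignment()
{
   TinyVec v;
   v.push_back(1);
   v.push_back(2);
   v.push_back(3);
   TinyVec& alias = v;
   v = alias;            // self copy assignment
   v = std::move(alias); // self move assignment
   return v.size() == 3 && v[0] == 1 && v[1] == 2 && v[2] == 3;
}
```

Removing either `this == &other` check makes the corresponding assignment in survives_self_assignment() empty the vector, since release() runs before the source is read.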


16704 commits