fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 11:28:15 +02:00

Author	SHA1	Message	Date
Paulo Zanoni	a8f7d26c2b	anv: change the vm_bind-related kmd_backend vfuncs to return VkResult All these vfuncs funnel down to either stubs or the xe_vm_bind_op() function. By returning int we're shifting VkResult generation to the callers, which are simply not doing the correct job. If they get VkResult they can simply throw the errors up the stack without having to erroneously try to figure out what really happened. Today the callers are returning either VK_ERROR_UNKNOWN or VK_ERROR_OUT_OF_DEVICE_MEMORY, but after the patch we're returning either VK_ERROR_OUT_OF_HOST_MEMORY or VK_ERROR_DEVICE_LOST. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27926>	2024-03-08 23:14:09 +00:00
Paulo Zanoni	8051919b3c	anv/sparse: leave the semaphore waits and signals to the vm_bind ioctl We can now finally leave the semaphore waits and signals to the vm_bind ioctl, making vm_bind operations truly asynchronous. This was previously done for TR-TT in `18bd00c024` ("anv/trtt: don't wait/signal syncobjs using the CPU anymore"). Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27926>	2024-03-08 23:14:09 +00:00
Paulo Zanoni	aa07d8a04c	anv/sparse: don't issue a single bind operation per vm_bind ioctl The xe.ko driver finally fixed bug 746, which means we can finally pass multiple bind operations in a single ioctl. There's a dEQP test that issues 960 bind operations in a single call, so our gains here have potential, although most real-world apps are not even remotely close to this. Link: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/746 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27926>	2024-03-08 23:14:09 +00:00
Paulo Zanoni	2526308dcd	anv/sparse: allow binding operations to match the resource size The resource size doesn't need to match the binding granularity. For example, if the user wants to create a 32kb buffer, Anv will require its memory to have 64kb, but the buffer size will still be the original 32kb. And the spec says: VUID-VkSparseMemoryBind-size-01100: "size must be less than or equal to the size of the resource minus resourceOffset" VUID-VkSparseMemoryBind-size-01102: "size must be less than or equal to the size of memory minus memoryOffset" So when binding such buffer, size should actually be the lesser of the two values: 32kb, and we have to accept that. Since our binding granularity is 64kb, we're safe to simply extend the requested size to match our binding granularity, since we already require the memory to be appropriately sized. None of this is exercised by dEQP. This was caught by piglit/arb_sparse_buffer-basic using Zink. Testcase: piglit/arb_sparse_buffer-basic Issue: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10220 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26410>	2024-02-21 22:58:42 +00:00
Paulo Zanoni	a501a840a3	anv/sparse: add an extra step before anv_sparse_bind_resource_memory() I need to add some sparse-related checks that require having the anv_buffer and anv_image, and putting them directly inside anv_queue_submit_sparse_bind_locked() doesn't feel like the right thing to do. Here we change the interface so now we have anv_sparse_bind_buffer() and anv_sparse_bind_image_opaque() as the main interface into anv_sparse.c, so they both can call the lower level anv_sparse_bind_resource_memory() function. In the next patch we'll be adding changing the code of the functions we just created, justifying their addition. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26410>	2024-02-21 22:58:42 +00:00
Rohan Garg	c69650a95e	isl,blorp,anv: introduce ISL_TILING_64_XE2 for Xe2+ platforms Xe2+ changed the msaa mapping for 2D/3D Tile64 surfaces, introduce a Xe2+ specific enum to handle this change. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27113>	2024-01-24 17:01:48 +01:00
Lionel Landwerlin	e1b9a6e4f3	anv: initial RMV support Launch with : $ MESA_VK_TRACE=rmv MESA_VK_TRACE_TRIGGER=/tmp/trig ./my_app In another terminal, trigger a capture : $ touch /tmp/trig The application with create a snapshot and print out : RMV capture saved to '/tmp/my_app_2024.01.19_10.56.33.rmv' Then just open it with RMV : ./RadeonMemoryVisualizer /tmp/my_app_2024.01.19_10.56.33.rmv Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26843>	2024-01-23 17:24:19 +00:00
Paulo Zanoni	bf0f261c1e	anv/sparse: document USAGE_2D_3D_COMPATIBLE as non-standard too The standard block shapes (and by extension, the tiling formats they require) are simply incompatible with getting a 2D view of a 3D image. I couldn't find in the Vulkan spec anything related to what are the expectations when trying to use both at the same time. So here we "document" that this case is known non-standard. Please notice that since we report residencyStandard3DBlockShape as true we were actually supposed to support this case, but I can't see how this would be possible, so set is_known_nonstandard_format to true so we can avoid the assert() that comes right after. Fixes the following when using Zink: KHR-GL46.sparse_texture_tests.SparseTextureAllocation Also "moves forward" the following test on Zink, so it now hits a different assertion: KHR-GL46.sparse_texture_tests.SparseTextureCommitment Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26454>	2023-12-06 00:29:58 +00:00
Paulo Zanoni	181aa83027	anv/tr-tt: assert the bind size is a multiple of the granularity If the size here is not a multiple of the granularity (64kb) then we'll miss our "pages" estimation by 1. We could fix this with DIV_ROUND_UP() or by simply putting a "+1" there, but the upper layers should now be preventing this case so let's just put the assertion here. Previously it was possible to hit this case with Zink by running under certain conditions piglit/arb_sparse_buffer-basic. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26454>	2023-12-06 00:29:58 +00:00
Paulo Zanoni	c87f7c13fa	anv/sparse: reject binds that are not a multiple of the granularity From the spec: "Resources can be bound at some defined (sparse block) granularity." "The sparse block size in bytes for sparse buffers and fully-resident images is reported as VkMemoryRequirements::alignment. alignment represents both the memory alignment requirement and the binding granularity (in bytes) for sparse resources." Not only the upper layer (the Spec) doesn't allow this, the lower layers (both the vm_bind ioctl and TR-TT) also work on a granularity. Just check for this case and return an error. Before this check, what would happen was: - for the vm_bind backend, the vm_bind ioctl would fail - for the TR-TT backend, we'd understimate l1_binds_capacity and fail an assertion, or we'd just silently bind 64kb instead of the original size Currently, some Zink tests such as piglit/arb_sparse_buffer-basic can trigger this behavior, but we're working to fix Zink for this case (and that commit may be merged before this one). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26454>	2023-12-06 00:29:58 +00:00
Paulo Zanoni	563678f310	anv/sparse: don't support YCBCR 2x1 compressed formats Regarding supporting these formats, the spec says: "A sparse image created using VK_IMAGE_CREATE_SPARSE_RESIDENCY_BIT supports all non-compressed color formats with power-of-two element size that non-sparse usage supports. Additional formats may also be supported and can be queried via vkGetPhysicalDeviceSparseImageFormatProperties. VK_IMAGE_TILING_LINEAR tiling is not supported." Regarding the formats themselves, the spec says: "VK_FORMAT_B8G8R8G8_422_UNORM specifies a four-component, 32-bit format containing a pair of G components, an R component, and a B component, collectively encoding a 2×1 rectangle of unsigned normalized RGB texel data. One G value is present at each i coordinate, with the B and R values shared across both G values and thus recorded at half the horizontal resolution of the image. This format has an 8-bit B component in byte 0, an 8-bit G component for the even i coordinate in byte 1, an 8-bit R component in byte 2, and an 8-bit G component for the odd i coordinate in byte 3. This format only supports images with a width that is a multiple of two. For the purposes of the constraints on copy extents, this format is treated as a compressed format with a 2×1 compressed texel block." Since these formats are to be considered compressed 2x1 blocks and we don't necessarily have to support non-compressed formats that non-sparse support, we can claim them as not supported with sparse. In addition to all of that, if you look at isl_gfx125_filter_tiling() you'll see that we don't even support Tile64 for these formats, so sparse residency (i.e., non-opaque image binds) doesn't really make sense for them yet. The Vulkan spec defines 4 other YCBCR "2x1 compressed" formats like the ones we have in this commit, but we don't support them even without sparse, so there's no reason to check them here. A recent change in VK-GL-CTS made tests that use these formats go from unsupported to failures: 7ecc7716a983 ("Do not use and check for STORAGE image support, when it is not used in the test") This commit "fixes" the following VK-GL-CTS failures (by making them return NotSupported): dEQP-VK.sparse_resources.image_block_shapes.2d.b8g8r8g8_422_unorm.samples_1 dEQP-VK.sparse_resources.image_block_shapes.2d.g8b8g8r8_422_unorm.samples_1 dEQP-VK.sparse_resources.image_block_shapes.2d_array.b8g8r8g8_422_unorm.samples_1 dEQP-VK.sparse_resources.image_block_shapes.2d_array.g8b8g8r8_422_unorm.samples_1 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:29 +00:00
Paulo Zanoni	fda5163f34	anv/trtt: properly handle the lifetime of TR-TT batch BOs We need to wait for the batches to complete before we return the BOs to the pool. We were previously doing this completely synchronously, which made the code unnecessarily wait. Now we have a timeline syncobj that signals completion of the previous BOs, so sometimes we check where we are in the timeline and then return the BOs that we know are unused. This, in addition to the previous patch that made us wait for the other syncobjs through the execbuf ioctl instead of through the CPU, makes TR-TT batches at least an order of magnitude faster. Still, I don't think we'll notice any changes in games's FPS as they don't bind sparse resources that often. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:29 +00:00
Paulo Zanoni	18bd00c024	anv/trtt: don't wait/signal syncobjs using the CPU anymore Pass them as part of the TR-TT batch. This is what a lot of the previous commits were building up to. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	040063c156	anv/sparse: move waiting/signaling syncobjs to the backends Move waiting/signaling to the backends so we can fix each backend separately. As I write this patch the vm_bind backend is back to using synchronous vm_binds so we can't pass syncobjs to the synchronous vm_bind ioctl anymore. We'll need more discussions and possibly some rework before we go back to asynchronous vm_binds. This commit should allow us to fix the TR-TT backend in the next commit and leave vm_bind for later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	f6d28bec6d	anv/sparse: add 'queue' to anv_sparse_submission If we're going to move syncobj waiting/signaling down to the backend we're going to need a queue to signal as lost in case those operations fail. In some places of the stack we don't have a queue available, such as when we're creating or destroying resources. For those, for vm_bind cases we don't use the queue for anything so passing it as NULL is fine. For TR-TT we are already using device->trtt.queue. For TR-TT specifically this also means we're going to start using the actual queues from the call stack instead of trtt->queue, but that shouldn't make any difference since we only ever have one queue. Still, this is more technigally correct. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	576275907a	anv/sparse: pass anv_sparse_submission to the backend functions Our ultimate goal is to have the backend functions deal with the wait and signal syncobjs instead of waiting for them on the CPU inside anv_queue_submit_sparse_bind_locked(). For that, we'll need waits and signals parameters to be passed all the way to the backend functions that actually make the submission, and this is what this patch does, through struct anv_sparse_submission. This patch just deals with passing the parameters to the functions, nothing is using the new variables yet. There should be no functional changes here. The goal here is to make code review easier. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	6c7753ee0b	anv/sparse: join all submissions into a single anv_sparse_bind() call Currently, a single vkQueueBindSparse() call may lead to multiple bind calls in the backend (either a vm_bind ioctl or a command submission that updates the TR-TT page tables). These operations can be quite slow so it's better for us if we try to emit as few of them as possible. On top of that, this gives our "just extend the last operation's size if possible" code a little more chance to act and save us real time. Our ultimate goal here is to also pass submit->waits and submit->signals to the backend so we can avoid doing CPU waits, so having a single call to the backend helps simplify things a little too, and we just created the structure to carry these extra pointers forward. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	11e9a700f6	anv/sparse: drop anv_sparse_binding_data from dump_anv_vm_bind() Having it helped us printing the resource offset, which made debugging some situations easier. The problem is that we want to rework the code a little bit and we won't have a 'sparse' struct anymore to pass around. Since it's all debug code drop it for now so it doesn't get in the way of the rework. If we need it later we can find a way to add it back, or we find another way to print the value. Drive-by drop the DEBUG_SPARSE check that's already in the caller. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	04bfe828db	anv/sparse: allow sparse resouces to use TR-TT as its backend TR-TT is a hardware feature supported by both i915.ko and xe.ko, which means we can now finally have Sparse Resources on i915.ko and we also have 2 options for xe.ko (and whatever is the best should be the default). In this patch we use batch commands to write the page tables and forever keep them in device memory. We maintain a mirror of both the L3 and and L2 tables because that helps us never having to read the tables that are in device memory. We still have some things to improve, but with this commit, workloads that didn't work at all due to the lack of sparse resources should at least run. This is still all disabled by default in i915.ko, you can turn it on by exporting ANV_SPARSE=1 before launching the applications. For xe.ko, switch the default with ANV_SPARSE_USE_TRTT=1. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	0a120edfb8	anv/sparse: extract anv_sparse_bind() This function will be able to transparently handle sparse binding regardless of the backend: vm_bind ioctls or TR-TT. For now we only support the vm_bind ioctls, but soon we'll have anv_sparse_bind_trtt() as an option. It is important to notice that even backends that support the vm_bind ioctl may choose to do Sparse binding via TR-TT, that's why we're adding the indirection at this specific point. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26036>	2023-11-04 02:06:53 +00:00
Paulo Zanoni	1af1426542	anv/sparse: also print bind->address at dump_anv_vm_bind This helped tracking down xe.ko bug #746. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26036>	2023-11-04 02:06:52 +00:00
Paulo Zanoni	b94d7dbe66	anv/sparse: join multiple NULL binds when possible When it's a NULL bind we always set the bo_offset (aka memory offset) to zero, so we have to avoid the "bind.offset == prev.offset + size" check. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26036>	2023-11-04 02:06:52 +00:00
Paulo Zanoni	2fc0bbe814	anv/sparse: join multiple bind operations when possible If the next bind is just an extension of the previous one, join both in the same bind operation. Due to how mip levels are laid in memory, this can only happen for mip level 0. As of today xe.ko doesn't try to join contiguous operations for us. Due to how rebinds happen each additional rebind operation may end up resulting in many extra things done, so these simple checks end up saving us a lot of cycles the Kernel would otherwise waste. This will be true even after we issue all binds in a single ioctl. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26036>	2023-11-04 02:06:52 +00:00
Chia-I Wu	b30d5a4794	anv: advertise emulated formats Advertise required features for emulated formats. v2: no sparse residency support Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25467>	2023-10-14 02:36:40 +00:00
Paulo Zanoni	7e2d8cced3	anv/sparse: add INTEL_DEBUG=sparse This pollutes stderr a lot, but I've used it countless times while developing this code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Paulo Zanoni	2bdd01187d	anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind Game testing shows it's common for this operation to result in multiple bind regions, so try to use a single ioctl when we can. Actual testing reveals 136 shader-related tests fail when we actually do this, so for now keep doing a single bind per ioctl while leaving a very easy way to the desired behavior when we figure this out. It should also be possible to go even higher-level and do this at the anv_queue_submit_sparse_bind_locked() layer, but that should happen in future commits. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Paulo Zanoni	6368c1445f	anv/sparse: add the initial code for Sparse Resources This giant patch implements a huge chunk of the Vulkan Sparse Resources API. I previously had this as a nice series of many smaller patches that evolved as the xe.ko added more features, but once I was asked to squash some of the major reworks I realized I wouldn't be able easily rewrite history, so I just squased basically the whole series into a giant patch. I may end up splitting this again later if I find a way to properly do it. If we want to support the DX12 API through vkd3d we need to support part of the the Sparse Resources API. If we don't, a bunch of Steam games won't work. For now we only support the xe.ko backend, but the vast majority of the code is KMD-independent and so an i915.ko implementation would use most of what's here, just extending the part that binds and unbinds memory. v2+: There's no way to sanely track the version history of this patch in this commit message. Please refer to Gitlab. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00

27 commits