fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 08:28:16 +02:00

Author	SHA1	Message	Date
Iago Toral Quiroga	c881e718d4	v3dv: expose separateDepthStencilLayouts This is actually required by Vulkan 1.2 and to expose the extension, so let's conform to this requirement, we don't really care since image layouts are not relevant to our current implementation. Fixes: `1442d77bc5` ('v3dv: trivially implement VK_KHR_separate_depth_stencil_layouts') Fixes: dEQP-VK.info.device_mandatory_features Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16398>	2022-05-09 13:43:35 +02:00
Iago Toral Quiroga	1442d77bc5	v3dv: trivially implement VK_KHR_separate_depth_stencil_layouts We don't currently benefit from seeing barriers and layout transitions that affect just the depth or stencil aspects, so we don't expose this feature. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16344>	2022-05-05 10:41:57 +00:00
Iago Toral Quiroga	32223ac67f	v3dv: implement VK_EXT_separate_stencil_usage We don't care about the depth/stencil usage other than to determine if an image format is supported for that usage. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16344>	2022-05-05 10:41:57 +00:00
Iago Toral Quiroga	44791ba825	v3dv: check input attachment usage as sampled usage Since we implement input attachments as textures we should check support for input attachment usage the same way we check support for sampled images. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16344>	2022-05-05 10:41:57 +00:00
Iago Toral Quiroga	184a48197f	v3dv: use wait stage for wait semaphores to decide about binning syncs If the wait stage for a semaphore doesn't involve geometry stages we don't need a binning sync. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16322>	2022-05-05 09:06:50 +00:00
Iago Toral Quiroga	a981ac0539	v3dv: skip binning sync if binning shaders don't access external resources Until now we have been enabling binning sync if we found a barrier involving geometry stages (a bcl barrier), however, if the actual binning shaders involved with the job don't access any external buffers or images there is no reason to sync at the binning stage. In this patch we don't immediately consume the bcl barrier flag from the command buffer state when we create a new job. Instead, we check this state when we are about to emit a draw call by checking if the shaders involved with binning may access external resources, such as vertex buffers, UBOs, or textures. If none of the draw calls in the job use binning shaders that access external resources then we never enable binning sync for the job. It is possible that a binning shader uses resources that are not synchronized through a barrier though, so we keep track of the access masks used with barriers for both buffers and images separately to better identify if the binning shader is affected by the barrier. If a serialized job never consumes the bcl barrier flag because none of its draw calls ever required a bcl sync, then the flag will be cleared when the job is finished. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16322>	2022-05-05 09:06:50 +00:00
Iago Toral Quiroga	46dd903cc6	v3dv: flag BCL barriers for all-graphics and all-commands stages Since these also include geometry stages in the pipeline. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16322>	2022-05-05 09:06:50 +00:00
Iago Toral Quiroga	1571954be9	v3dv: drop default sampler states if not used If a shader doesn't use any samplers (including default sampler states), make sure we drop them so other parts of the driver can recognize that the program doesn't actually use any samplers at all. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16322>	2022-05-05 09:06:50 +00:00
Alejandro Piñeiro	1ed2b5e253	v3dv/pipeline: include pipeline layout on the pipeline sha1 Fixes failures on tests like this when the on-disk-cache is enabled: dEQP-VK.binding_model.descriptor_copy.compute.uniform_buffer_0 We only found them when running full CTS runs. What happens is that we got a hit from the on-disk shader cache, for several tests using the same shaders. But some tests seems to be using a uniform buffer, and others a inline buffer. Right now inline buffers leads to some changes on the final nir shader, and generated assembly, compared with uniform buffers. So we got a wrong shader. Fortunately we only got an assert instead of weird behaviour. With this commit we include the pipeline layout on the pipeline sha1, so those two cases would get different sha1. FWIW, this is what other drivers are already doing. Surprisingly that didn't cause a problem before. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16313>	2022-05-04 09:21:20 +00:00
Alejandro Piñeiro	502fae57be	v3dv/pipeline_cache: add on disk cache hit stats Useful when debugging/testing on disk cache. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16313>	2022-05-04 09:21:20 +00:00
Alejandro Piñeiro	f57a01c5f9	v3dv/pipeline_cache: adds check to skip searching for a entry If we are calling pipeline_cache_upload_shared_data with from_disk_cache, that means that we had used the disk-cache to found that entry. And that should only happens if we didn't find the entry on the cache. So on that case we can skip to search for it. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16313>	2022-05-04 09:21:20 +00:00
Alejandro Piñeiro	080e14ff61	v3dv/pipeline: fix small comment typo Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16313>	2022-05-04 09:21:20 +00:00
Iago Toral Quiroga	c9f68361a8	v3dv: ignore barriers for image layout transitions from undefined layouts Layout transitions are not relevant to us, we only care about barriers that involve a sync point between read/write actions on the image across GPU jobs. Image transitions from undefined layout can only happen before the image is ever used by the GPU, which means they are never relevant to our implementation. This improves performance in vkQuake. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16235>	2022-05-02 08:04:13 +02:00
Iago Toral Quiroga	a44d498287	v3dv: document Vulkan requirements for signaling operations Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16235>	2022-05-02 08:04:13 +02:00
Iago Toral Quiroga	0273f661e6	v3dv: don't emit final noop job if we don't have anything to signal Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16235>	2022-05-02 08:04:13 +02:00
Iago Toral Quiroga	de0d4d671a	v3dv: drop obsolete comment Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16235>	2022-05-02 08:04:13 +02:00
Iago Toral Quiroga	c367bd0ee3	v3dv: ignore no-op barriers Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16235>	2022-05-02 08:04:13 +02:00
Juan A. Suarez Romero	031bcf686b	v3d/simulator: add support for AMD cards Dumb buffers do not work with AMD gpus. So use AMD ioctl to create proper buffers. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16187>	2022-04-27 14:42:23 +00:00
Juan A. Suarez Romero	9c3144d2ef	v3dv: store device_id on device init Instead of calling later an ioctl to get the device id, let's store it while initializing the physical device. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16187>	2022-04-27 14:42:23 +00:00
Juan A. Suarez Romero	697e98c66e	v3dv: duplicate key on hashtable insert The key is created on stack, so as soon as the function returns this key is lost, so the inserted key in the hashtable is invalid. Rather, insert a duplicated version on heap. This fixes a stack-buffer-overflow when running some Vulkan CTS tests. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16083>	2022-04-22 09:18:15 +00:00
Jason Ekstrand	1b8a43a0ba	util: Remove util_cpu_detect util_cpu_detect is an anti-pattern: it relies on callers high up in the call chain initializing a local implementation detail. As a real example, I added: ...a Mali compiler unit test ...that called bi_imm_f16() to construct an FP16 immediate ...that calls _mesa_float_to_half internally ...that calls util_get_cpu_caps internally, but only on x86_64! ...that relies on util_cpu_detect having been called before. As a consequence, this unit test: ...crashes on x86_64 with USE_X86_64_ASM set ...passes on every other architecture ...works on my local arm64 workstation and on my test board ...failed CI which runs on x86_64 ...needed to have a random util_cpu_detect() call sprinkled in. This is a bad design decision. It pollutes the tree with magic, it causes mysterious CI failures especially for non-x86_64 developers, and it is not justified by a micro-optimization. Instead, let's call util_cpu_detect directly from util_get_cpu_caps, avoiding the footgun where it fails to be called. This cleans up Mesa's design, simplifies the tree, and avoids a class of a (possibly platform-specific) failures. To mitigate the added overhead, wrap it all in a (fast) atomic load check and declare the whole thing as ATTRIBUTE_CONST so the compiler will CSE calls to util_cpu_detect. Co-authored-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15580>	2022-04-20 18:44:35 +00:00
Jason Ekstrand	93fbaae7d5	v3dv: Add emulated timeline semaphore support This is trivial thanks to the emulated timelines provided in common code. "Real" timeline semaphores which can be shared across processes will require kernel support. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	1cc917bc68	v3dv: Use the core version property helpers vulkaninfo is the same before and after. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	1973b2da9d	v3dv: Use the core version feature helpers vulkaninfo is the same before and after. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	316728a55b	v3dv: Switch to the common submit framework Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	321f0b85f2	v3dv: Always wait on last_job_syncs if job->serialize Even if we're the first job on some queue, there may be no wait semaphores but we still need to ensure things happen in-order. (See the "Implicit Synchronization Guarantees" section of the Vulkan spec.) The client can submit back-to-back command buffers with no semaphores between them and it needs to adt the same as if there were a semaphore. If job->serialize is set because of a barrier or something, we still need to synchronize across HW queues by waiting on last_job_syncs. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	00b84fae2d	v3dv: Add a condition variable for queries In order to properly wait for a query to be complete, we need to first wait for the end query job to flush through on the queue. Since query end is always handled on the CPU, we can do this with a condition variable. The 2s timeout is taken from ANV. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	e5a0e2122f	v3dv: Use util/os_time helpers Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	8bd7bd9577	v3dv: Switch to the common device lost tracking Vulkan requires that, once the device has been lost, you keep returning VK_ERROR_DEVICE_LOST. We've got tracking for this in common code; it just needs to be wired up. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	32527f3ccc	v3dv: Destroy the device mutex on the teardown path Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	30191fd9df	v3dv: Don't use pthread functions on c11 mutexes This only works because c11/threads.h is typedeffing the c11 stuff to ptrheads. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	25441b5e5c	v3dv: Put indirect compute CSD jobs in the job list Instead of having the CPU job execute the CSD job, put both jobs on the list with the CPU job first which modifies the GPU job which gets kicked off next. This gives the queue code more visibility into what types of jobs are actually in the list. In particular, if an indirect compute job is the last job in a batch buffer, it currently appears as if the batch ends with CPU work which isn't true because it kicks off GPU work. In that case, the last job on the list is now a GPU job, which better matches reality. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	0208bb2d58	v3dv: Stop directly setting vk_device::alloc vk_device_init() will do this. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Iago Toral Quiroga	40f0c616e8	v3dv: fix bogus VkDrmFormatModifierProperties2EXT usage The array is allocated for VkDrmFormatModifierPropertiesEXT, so writring entried with type VkDrmFormatModifierProperties2EXT is bogus. It seems this was a mistake added with a change intended to get rid of VK_OUTARRAY_MAKE, that changed the type of the write by mistake. Fixes: `56a2ccf058` ('v3dv: Stop using VK_OUTARRAY_MAKE()') Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15819>	2022-04-08 14:43:25 +00:00
Jason Ekstrand	292ceb297c	v3dv: Enable VK_EXT_debug_utils It's implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Iago Toral Quiroga	827ef5fba9	v3dv: fix limits for inline uniform blocks We don't support 'Update After Bind', however, the limits for this model also include the ones without it. See the with or without remark in the spec below: "maxPerStageDescriptorUpdateAfterBindInlineUniformBlocks is similar to maxPerStageDescriptorInlineUniformBlocks but counts descriptor bindings from descriptor sets created with or without the VK_DESCRIPTOR_SET_LAYOUT_CREATE_UPDATE_AFTER_BIND_POOL_BIT bit set." Fixes: dEQP-VK.api.info.vulkan1p2_limits_validation.ext_inline_uniform_block Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15732>	2022-04-04 09:28:55 +00:00
Jason Ekstrand	688d478045	v3dv/queue: Rework multisync_free Thix fixes two bugs. First, we stop leaking in/out fences with multisync. Because the in_syncs and out_syncs parameters to set_multisync were arrays and not pointers to arrays, the caller's in_syncs and out_syncs pointers never got set and remained NULL so multisync_free() always sees to NULL pointers and does nothing, leaking both arrays. Not sure how this isn't showing up in the dEQP leak check tests. Second, the struct drm_v3d_multi_sync was in the scope of the then clause of the `if (device->pdevice->caps.multisync)` so it goes out of scope before the ioctl. This is, effectively, a use-after-free and, depending on stack allocation details, may result in the multisync extension struct getting stompped before the ioctl. Fixes: `ff8586c345` ("v3dv: enable multiple semaphores on cl submission") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15512>	2022-03-29 14:38:41 +00:00
Iago Toral Quiroga	7f6ecb8667	v3dv: add reference counting for descriptor set layouts The spec states that descriptor set layouts can be destroyed almost at any time: "VkDescriptorSetLayout objects may be accessed by commands that operate on descriptor sets allocated using that layout, and those descriptor sets must not be updated with vkUpdateDescriptorSets after the descriptor set layout has been destroyed. Otherwise, descriptor set layouts can be destroyed any time they are not in use by an API command." Based on a similar fix for RADV. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5893 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15634>	2022-03-29 11:28:39 +00:00
Iago Toral Quiroga	ca861bd6f4	v3dv: drop unnecessary memset We are already zeroing when we allocate the descriptor set layout memory with vk_object_zalloc. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15634>	2022-03-29 11:28:39 +00:00
Iago Toral Quiroga	591eed30b2	v3dv: fix sampler array addressing in v3dv_descriptor_set_layout Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15634>	2022-03-29 11:28:39 +00:00
Iago Toral Quiroga	ea3223e7a4	v3dv: implement VK_EXT_inline_uniform_block Inline uniform blocks store their contents in pool memory rather than a separate buffer, and are intended to provide a way in which some platforms may provide more efficient access to the uniform data, similar to push constants but with more flexible size constraints. We implement these in a similar way as push constants: for constant access we copy the data in the uniform stream (using the new QUNIFORM_UNIFORM_UBO_*) enums to identify the inline buffer from which we need to copy and for indirect access we fallback to regular UBO access. Because at NIR level there is no distinction between inline and regular UBOs and the compiler isn't aware of Vulkan descriptor sets, we use the UBO index on UBO load intrinsics to identify inline UBOs, just like we do for push constants. Particularly, we reserve indices 1..MAX_INLINE_UNIFORM_BUFFERS for this, however, unlike push constants, inline buffers are accessed through descriptor sets, and therefore we need to make sure they are located in the first slots of the UBO descriptor map. This means we store them in the first MAX_INLINE_UNIFORM_BUFFERS slots of the map, with regular UBOs always coming after these slots. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15575>	2022-03-28 10:44:13 +00:00
Boris Brezillon	56a2ccf058	v3dv: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:02 +00:00
Jason Ekstrand	19f56e3fc4	v3dv: Drop GetPhysicalDeviceQueueFamilyProperties Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:14 -05:00
Iago Toral Quiroga	4f284254e4	v3dv: support importing external semaphores This was waiting for multisync support in our kernel interface so we can wait on the actual imported payload of a semaphore rather than the last job we submitted. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	fa1b10f36d	v3dv: lock around noop job submits Any thread we create may end up creating/submitting at least a noop job, which is a shared object. Before multisync, this was an issue only for the creation of the job itself, but with multisync we can also modify parameters of the noop job every time it is used (for signaling and serialization configuration). This change adds a noop mutex that all threads (main, wait and master) take before submitting a noop job to ensure concurrent access is not an issue. Fixes flakyness observed with multisync with the following test: dEQP-VK.api.command_buffers.secondary_execute_twice Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	daa865fb2c	v3dv: fix semaphore wait from CPU job If a CPU job comes first in a command buffer with a semaphore wait operation we need to wait on the CPU for the semaphore to be signaled before we process the job. We have been doing this with a WaitForIdle operation, but that only works if the semaphore has been submitted for signaling from the same instance of the driver. If we have an imported payload from another instance in our semaphore however, waitForIdle may return too early since the submission to signal the semaphore may have been submitted by a different instance of the driver as well, and our wait for idle checks only know about this instance submissions. To fix this, we always submit a noop job from our instance that waits on the semaphores on the GPU and follow up with WaitForIdle to wait for that to complete. Fixes test failures and/or assert crashes in: dEQP-VK.synchronization.cross_instance.* (when enabling support for semaphore imports) Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	3b8ab8a9ce	v3dv: don't signal semaphores/fences from a wait thread When we have a wait thread we can't ensure that the last job in the last command buffer will be the one to signal semaphores because in this case there is no gurantee that jobs from command buffers in the batch will be submitted to the GPU in order, as those put in a wait thread will be submitted later when the event wait operation is completed. Instead, we need to wait for all outstanding wait threads to complete and only then we should signal any semaphores or fences. This also fixes a bug where the wait for events was the last job in the command buffer. In this case, once the event wait is completed we have no additional jobs to submit and thus would never try to signal semaphores or fences. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	03840bfcd1	v3dv: fix temporary imports of semaphores and fences with multisync This is preparatory work to expose support for importing semaphores, which was waiting on kernel multisync support. When we implemented user-space multisync support we didn't handle temporary fence/semaphore payload imports at all, so we fix that here. Also, we add a has_temp boolean flag to identify the case where we have a temporary payload in a fence/sempahore instead of just checking if temp_sync is not 0. This is necessary to support semaphore imports (for which we are not exposing support yet) because these need to drop the temporary payload when they are used as wait semaphores in a submit, but we can't destroy the underlying temp_sync at that point because it needs to survive at least until the submit is finished, so instead we use a flag to tell if we have an active temporary payload or not, and we simply destroy any temp_sync on a semaphore destroy or any new import on the same semaphore. We only strictly need this flag for semaphores because fences drop the temporary payload when they are reset, which happens in the CPU and can only be done if the GPU is not using the fence, but we add the same flag for the fence for consistency. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	5a11a2fb6c	v3dv: don't expose image load/store features for linear images Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	0590ce1362	v3dv: return early on image to buffer blit copies if image is linear This path uses a shader blit to implement the copy which is only supported for tiled images (except 1D). While blit_shader() already checks for this, this path does a lot of heavy lifting to prepare for the blit_shader call so we rather avoid that if possible when we know blit_shader won't be able to implement the blit. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00

... 4 5 6 7 8 ...

1264 commits