fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-05 12:30:40 +02:00

Author	SHA1	Message	Date
Alejandro Piñeiro	e14f5252fa	v3dv/cmd_buffer: always bind pipeline static state Even if the pipeline is the same. The followin sequence, used on dEQP-VK.dynamic_state..double_static_bind tests, is valid: 1. Bind pipeline with some static state. 2. Set state command for that static state (to a bad value). 3. Bind the same pipeline again. 4. Draw. So on 3 we need to ensure to load again the pipeline static state. Fixes: dEQP-VK.dynamic_state..double_static_bind Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28897>	2024-04-26 09:52:09 +00:00
Samuel Pitoiset	e4f945cd4a	vulkan: pass cmdbuf level to vk_command_buffer_ops::create() RADV needs to know the command buffer level in the create() helper. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28861>	2024-04-23 06:33:31 +00:00
Iago Toral Quiroga	bdf2a470d3	v3dv: fix job suspend with command buffer simultaneous use flag With the simultaneous use flag we can reuse the same command buffer multiple times. That means, for example, that we can have an instance of a job running in the GPU while we are submitting another one for execution to a queue. This scenario is problematic with dynamic rendering and job suspension because suspended jobs need to be patched with the resume address at queue submit time, and thus, if we have another instance of the same job currently executing in the GPU we could stomp its resume address, which could be different. To fix this, at queue submission time, when we detect a suspending job in a command buffer with the simultaneous use flag, we clone the job and create its own copy of the BCL so we can patch the resume address into it safely without conflicting with any other instance of the job that may be running. We need to flag these clones as having their own BCL since we would have to free it when the job is destroyed, unlike other clones that don't own any resources of their own. Also, because this job is created at queue submit time, it won't be in the execution list of the command buffer, so it won't be automatically destroyed with it, so we need to add it to the command buffer as a private object. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28521>	2024-04-03 13:35:54 +02:00
Iago Toral Quiroga	ff8d72ba22	v3dv: store the offset of the BRANCH instruction in a CL This will be useful to know which is the actual executable size of a BO in a CL that branches into a another BO. We will need this soon to implement deep clones of the BCL for suspending jobs with the command buffer simultaneous use flag. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28521>	2024-04-03 12:57:56 +02:00
Iago Toral Quiroga	c874caf33d	v3dv: fix job pointers from cloned CLs We had these pointing to the original job instead of pointing to the cloned job. This can be confusing, particularly, if we then emit commands that include references to new BOs into the cloned jobs, since we would then try to insert these BOs in the original jobs instead of the clones, which was the situation we had when we implemented resume address patching with dynamic rendering. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28521>	2024-04-03 12:57:56 +02:00
Iago Toral Quiroga	e6efee3b40	v3dv: add a v3dv_job_clone helper This will clone the job but it won't automatically put it in the job list of a command buffer. This will come in handy to handle the required job cloning for suspending jobs with the command buffer reuse flag in a follow-up patch. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28521>	2024-04-03 12:57:56 +02:00
Iago Toral Quiroga	16c96b0e93	v3dv: drop single sync kernel interface Since we are now requiring a multisync kernel interface there is no reason to continue supporting the legacy interface. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28541>	2024-04-03 10:34:17 +00:00
Iago Toral Quiroga	25e45b85c2	v3dv: require multisync kernel Multisync has been available in kernel releases for a long time now and Raspberry Pi OS kernels have been supporting it for a while too. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28541>	2024-04-03 10:34:17 +00:00
Eric Engestrom	ff37f68740	meson: add VK_DRIVER_FILES to devenv, alongside the old VK_ICD_FILENAMES Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28516>	2024-04-02 18:08:52 +00:00
Iago Toral Quiroga	7992d44b24	v3dv: fix image creation when exceeding maxResourceSize Fixes crashes in tests like dEQP-VK.pipeline.monolithic.render_to_image.core.2d_array.huge.width_height_layers.r8g8b8a8_unorm with CTS main. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28364>	2024-03-26 07:23:56 +00:00
Yonggang Luo	1ac1c0843f	treewide: Replace usage of macro DEBUG with MESA_DEBUG when possible This is achieved by the following steps: #ifndef DEBUG => #if !MESA_DEBUG defined(DEBUG) => MESA_DEBUG #ifdef DEBUG => #if MESA_DEBUG This is done by replace in vscode excludes docs,.rs,addrlib,src/imgui,.sh,src/intel/vulkan/grl/gpu These are safe because those files should keep DEBUG macro is already excluded; and not directly replace DEBUG, as we have some symbols around it. Use debug or NDEBUG instead of DEBUG in comments when proper This for reduce the usage of DEBUG, so it's easier migrating to MESA_DEBUG These are found when migrating DEBUG to MESA_DEBUG, these are all comment update, so it's safe Replace comment /* DEBUG / and / !DEBUG / with proper / MESA_DEBUG / or / !MESA_DEBUG */ manually DEBUG \|\| !NDEBUG -> MESA_DEBUG \|\| !NDEBUG !DEBUG && NDEBUG -> !(MESA_DEBUG \|\| !NDEBUG) Replace the DEBUG present in comment with proper new MESA_DEBUG manually Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28092>	2024-03-22 18:22:34 +00:00
Joshua Ashton	fc263e0308	v3dv: Enable EXT_swapchain_colorspace No-op. Signed-off-by: Joshua Ashton <joshua@froggi.es> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28275>	2024-03-20 18:24:26 +00:00
Joshua Ashton	f977e4d4f5	v3dv: Enable EXT_swapchain_maintenance1 This was missing, this is implemented in common code. Signed-off-by: Joshua Ashton <joshua@froggi.es> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28275>	2024-03-20 18:24:25 +00:00
Iago Toral Quiroga	92172760e2	v3dv: enable VK_KHR_dynamic_rendering Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:21 +00:00
Iago Toral Quiroga	7a2b17235d	v3dv: also emit subpass clears with secondary command buffers With dynamic rendering secondary command buffers can start subpasses so we need this. Outside dynamic rendering secondary command buffers won't be calling here since they are restricted to record commands within a subpass. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:21 +00:00
Iago Toral Quiroga	e9b44a3bb5	v3dv: handle render pass continue flag with dynamic passes If a secondary command buffer recording a dynamic pass has the VK_COMMAND_BUFFER_USAGE_RENDER_PASS_CONTINUE_BIT flag then the rendering information for it should come from a VkCommandBufferInheritanceRenderingInfo struct in the pNext chain instead of the usual render pass information in the VkCommandBufferInheritanceInfo struct. We take the information from the new struct and build a render pass description from it assuming a setup without a framebuffer (which is optional for regular render passes too). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:21 +00:00
Iago Toral Quiroga	f4ec92084e	v3dv: fix resume address patching for secondary command buffers Because we are cloning these into primaries but the cloning is superficial the command lists in them still point to the original jobs and therefore paching new addresses would make the packing code add the BO of the resume address to the original job. This has two problems: 1. This is probably not what we want since the patching should only be affecting the clone. 2. The bo_count of the clone job will not be updated accordingly and we end up with a mismatch that will blow up when we submit. The solution used here is a big hack, but works for now: we just specify the address by its full offset rather than a relative offset from a BO. We already have to add all the BOS in the resume job manually which will include this the BO for the branch address too, so this is fine. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:21 +00:00
Iago Toral Quiroga	0bb04c019e	v3dv: rename SECONDARY job type to INCOMPLETE This was used only in secondary CL command buffers so it made sense but with dynamic rendering we are going to also have regular CLs also in secondaries (since secondaries can now record full dynamic rendering passes), so renaming this to INCOMPLETE makes more sense, since this is really what they refer to: parts of CLs that are intended to be merged into other primaries through branching. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:21 +00:00
Iago Toral Quiroga	2478939b69	v3dv: implement dynamic rendering resume/suspend Dynamic rendering allows the client to suspend recording of a render pass and have it continued in a different command buffer. When a suspended command buffer is submitted to a queue, the resuming command buffer must be te next one in submission order. This means we need to be able to "merge" or "stitch" together these command buffers at submit time. To accomplish this, when we suspend a command buffer we emit a BRANCH instruction to finish it. Then at submit time, when we know the resuming job, we patch the BRANCH address with the address of the resuming binning list (bcl). This is very similar to how we execute secondary command buffers inside a render pass. Also, only the last resuming job should flush the binning lists in the bcl since we won't have processed the full binning command list until we have execute the last linked job in the resume list. Since all jobs and command buffers in the suspend/resume chain must be part of the same dynamic render pass, we only need to produce and emit the render command list (rcl) once. Since the way we implement stitching is that we branch from the suspending job into the resuming one, the first job suspending will link into all the resuming jobs necessary to complete the chain, therefore, after the stitching is complete, we only want to submit the first job in the suspend/resume chain, and thus, we only produce and emit the rcl for this one job. Notice as well that suspending only affects the last job recording a dynamic rendering pass (the one that needs the branch so we can resume execution with another job in another command buffer). Resuming affects all jobs in the dynamic render pass, since we won't produce RCLs for them (as only the originating job on the suspend/resume chain will emit the RCL). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:21 +00:00
Iago Toral Quiroga	c15e0aac17	v3dv: implement vkCmdBeginRendering and vkCmdEndRendering With this we are able to run basic dynamic render passes, however, we are still missing a few things like support for secondary render passes, suspend/resume, etc that will be adding in follow-up patches. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:21 +00:00
Iago Toral Quiroga	78015a9da3	v3dv: don't assume that pipelines have a render pass This builds up on the previous patch and rewrites all the pipeline code that fetched information from the pipeline's render pass (which will be NULL for dynamic rendering) to instead fetch it through the new rendering_info field, which will be valid for both regular and dynamic render passes. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:21 +00:00
Iago Toral Quiroga	e22d843fa4	v3dv: add a vk_render_pass_state to pipelines With dynamic rendering the API formally eliminates render passes, so the pipeline create info can now have a NULL render pass, in which case rendering info must be provided via pNext struct VkPipelineRenderingCreateInfo, or if this is missing too then defaults to no multiview and no attachments. Since we don't want to have separate paths all over the place whenever we need to access render pass / rendering info for the pipeline, we will always produce a valid vk_render_pass_state struct with the relevant information even when we have a render pass, so we can rely on that always being available. A follow-up patch will rewrite all the places where we assumed the existence of a render pass in the pipeline to instead fetch the info it needs from this new field instead. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Iago Toral Quiroga	10df187540	v3dv: add a helper to setup a framebuffer for dynamic rendering Since the plan is to leverage our render pass infrastructure, we also need to setup a framebuffer from the rendering info provided with dynamic rendering. We allocate the framebuffer lazily, only once, if a dynamic render pass is used. To do this, we make it so it can hold the maximum number of attachments possible with our hardware. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Iago Toral Quiroga	6684aa09ff	v3dv: add helper to build a render pass for dynamic rendering The idea is to build a regular render pass from the rendering info provided with dynamic rendering. We will use this when recording dynamic render passes to leverage our existing implementation for render passes with dynamic rendering. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Iago Toral Quiroga	72c3769437	v3dv: add helper to check if we need to use a draw for a depth/stencil clear We will need this when setting up dynamic render passes too. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Iago Toral Quiroga	f1e6e58aef	v3dv: add a helper to constrain clip window to render area We will need to do the same when setting up dynamic render passes. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Iago Toral Quiroga	f285f69677	v3dv: refactor checking and adding pending jobs Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Iago Toral Quiroga	e1b52e3052	v3dv: fix copying v3dv_end_query_info into primaries from secondaries We had missed copying the count field. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Iago Toral Quiroga	93f9f2bcbb	v3dv: always set view index before drawing It is allowed for a shader to enable the multiview extension even if the draw call in which it is used doesn't use multidraw. This allows the shader to still use gl_ViewIndex, which will always be 0 in that scenario. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Iago Toral Quiroga	aeee18be1b	v3dv: fix subpass clear with draw call for multi-layered framebuffers Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27978>	2024-03-19 12:06:20 +00:00
Juan A. Suarez Romero	4f6f2cea6a	v3dv: enable smooth line rendering This is based on a lowering that we are already using in the OpenGL driver. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28171>	2024-03-18 12:09:12 +00:00
Juan A. Suarez Romero	f5d4242928	v3dv: assume that rasterization state can be NULL So far to check if rasterization discard is enabled or not we assumed that rasterization state struct was never NULL. However, as this will change with VK_EXT_extended_dynamic_state3, it can be a good idea just to assume it can be NULL, so adding the check too. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28171>	2024-03-18 12:09:11 +00:00
Yonggang Luo	680e707534	treewide: Replace the invalid usage #if DEBUG with #ifdef DEBUG This is done by find&replace and exclude the following folders in vscode docs,.rs,addrlib,src/imgui,.sh,src/intel/vulkan/grl/gpu This is a prepare step for re-working https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21946 These issues are found when to try switch DEBUG to MESA_DEBUG=0\|1 in MR https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28092 Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28083>	2024-03-15 16:08:18 +00:00
Juan A. Suarez Romero	d38ff02c03	v3dv: mark some promoted extensions as supported There are few EXT_ extensions that were promoted to KHR_, but we didn't enabled them as supported. This makes some CTS tests to be run as unsupported when they should be supported instead. For example, we were passing 16/108 line rasterization tests instead of 40/108 because we did not enabled KHR_line rasterization. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28090>	2024-03-11 12:17:43 +00:00
Juan A. Suarez Romero	08af5f2703	v3dv: disable Early Z for multisampled 16-bit depth buffers Besides disabling early-z when a frame is an odd width or height, we need to disable it if the buffer is 16-bit and multisampled. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28009>	2024-03-11 10:23:48 +00:00
Juan A. Suarez Romero	33e77c9041	v3d,v3d: use new simulator The new simulator provides a new API, so we need to adapt the code. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28009>	2024-03-11 10:23:48 +00:00
Yiwei Zhang	c9d3cc2615	vulkan: refactor the runtime header gen order dependency Summary: - ensure headers used outside runtime are included in dependency source - drop redundant idep_vulkan_common_entrypoints_h - drop redundant icd side tricks for the order of header gen Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28066>	2024-03-08 21:42:07 +00:00
Yiwei Zhang	90824e07a2	vulkan: properly ensure wsi_entrypoints header gen order Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28066>	2024-03-08 21:42:07 +00:00
Yonggang Luo	db103c56ab	treewide: Remove vulkan/runtime vulkan/util prefix in include path This is for unify the include style of shared vulkan headers Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27526>	2024-03-05 19:05:00 +00:00
Iago Toral Quiroga	1880e7cfed	v3d,v3dv: fix BO allocation for shared vars We need to allocate "shared size" bytes for each workgroup but we were incorrectly multiplying by the number of workgroups in each supergroup instead, which would typically cause us to allocate less memory than actually required. The reason this issue was not visible until now is that the kernel driver is using a large page alignment on all BO allocations and this causes us to "waste" a lot of memory after each allocation. Incidentally, this wasted memory ensured that out of bounds accesses would not cause issues since they would typically land in unused memory regions in between aligned allocations, however, experimenting with reduced memory aligments raised the issue, which manifested with the UE4 Shooter demo as a GPU hang caused by corrupted state from out of bounds memory writes to CS shared memory. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27675>	2024-02-21 06:17:55 +00:00
Eric Engestrom	11cf6965ea	v3dv: enable VK_EXT_headless_surface on all platforms except Windows Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27448>	2024-02-06 20:32:38 +00:00
Yiwei Zhang	f06d7f6942	v3dv: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Alejandro Piñeiro	16f6f50ce4	v3dv: expose VK_EXT_depth_clip_enable We already had the logic implemented, but it was never really tested (there was a comment about that) So the advantage of this is that we now test that code (in fact, there were a small typo on that code). There aren't too much CTS tests for this feature, but we gets tests like this working: dEQP-VK.clipping.clip_volume.depth_clip.* Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10527 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27386>	2024-02-01 11:33:38 +00:00
Iago Toral Quiroga	6c570f7a98	v3dv: allow subgroup operations in fragment shaders Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27211>	2024-01-31 10:06:06 +00:00
Iago Toral Quiroga	31e8740808	v3dv: expose more subgroup features on V3D 7.x The hardware included additional instructions to support more subgroup features, so let's put them to use. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27211>	2024-01-31 10:06:06 +00:00
Daniel Schürmann	26c8f13ff5	vulkan: enable VK_KHR_shader_expect_assume This implementation ignores the hints. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27265>	2024-01-30 19:09:42 +00:00
Alejandro Piñeiro	0a3784ae33	v3dv/bo: use mtx_lock/unlock on cache_init too To handle coverity warning: 4. thread2_modifies_field: Thread2 sets cache_size to a new value. Note that this write can be reordered at runtime to occur before instructions that do not access this field within this locked region. After Thread2 leaves the critical section, control is switched back to Thread1. CID 1559509 (#1 of 1): Check of thread-shared field evades lock acquisition (LOCK_EVASION)6. thread1_overwrites_value_in_field: Thread1 sets cache_size to a new value. Now the two threads have an inconsistent view of cache_size and updates to fields correlated with cache_size may be lost. 521 cache->cache_size += bo->size; Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26951>	2024-01-22 16:47:08 +01:00
Karol Herbst	f2b7c4ce29	nir: rework and fix rotate lowering No driver supports urol/uror on all bit sizes. Intel gen11+ only for 16 and 32 bit, Nvidia GV100+ only for 32 bit. Etnaviv can support it on 8, 16 and 32 bit. Also turn the `lower` into a `has` option as only two drivers actually support `uror` and `urol` at this momemt. Fixes crashes with CL integer_rotate on iris and nouveau since we emit urol for `rotate`. v2: always lower 64 bit Fixes: `fe0965afa6` ("spirv: Don't use libclc for rotate") Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by (Intel and nir): Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27090>	2024-01-22 10:27:44 +00:00
Iago Toral Quiroga	f37bb34d86	v3dv: expose VK_EXT_subgroup_size_control This is trivial for us since we don't support variable subgroup sizes. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26968>	2024-01-11 07:21:36 +00:00
Iago Toral Quiroga	5c42d6c62f	v3dv: implement VK_EXT_shader_demote_to_helper_invocation Demoting means that we don't execute any writes to memory but otherwise the invocation continues to execute. Particularly, subgroup operations and derivatives must work. Our implementation of discard does exactly this by using setmsf to prevent writes for the affected invocations, the only difference for us is that with discard/terminate we want to be more careful with emitting quad loads for tmu operations, since the invocations are not supposed to be running any more and load offsets may not be valid, but with demote the invocations are not terminated and thus we should emit memory reads for them to ensure quad operations and derivatives from invocations that have not been demoted still work. Since we use the sample mask to implement demotes we can't tell whether a particular helper invocation was originally such (gl_HelperInvocation in GLSL) or was later demoted (OpIsHelperInvocationEXT added with SPV_EXT_demote_to_helper_invocation), so we use nir_lower_is_helper_invocation to take care of this. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26949>	2024-01-09 13:22:37 +00:00

1 2 3 4 5 ...

1458 commits