fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 21:18:26 +02:00

Author	SHA1	Message	Date
Dave Airlie	4bcb48b831	radv: add initial copy descriptor support. (v2) It appears the latest dota2 vulkan uses this, and we get a hang in VR mode without it. v2: remove finishme I left in after finishing. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Andres Rodriguez <andresx7@gmail.com> Cc: "17.2 17.3" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-06 19:12:39 +00:00
Dave Airlie	60a9705e00	radv: move descriptor sets out of cmd_state. Instead of storing all the pointers and zeroing them all out, just store a valid bitmask in the state. This also moves the CmdBindPipeline path down the cpu usage path for the multithreading demo as it no longer has to traverse MAX_SETS to find the active descriptor sets. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-06 01:11:03 +00:00
Dave Airlie	3a0d098252	radv: add helper for setting a descriptor. This is just a simple refactor. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-06 01:11:00 +00:00
Dave Airlie	b48063a2f2	radv: move vertex binding out of cmd state. This isn't required to be cleared, since buffers are only linked by vertex elements, so if elements are clear then no buffers should be referenced. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-06 01:10:56 +00:00
Dave Airlie	7365626d78	radv: reorder cmd_state to remove a hole. This just removes a hole in the cmd_state and packs some bools together. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-06 01:10:53 +00:00
Dave Airlie	f0ae06a13c	radv: free attachments on end command buffer. If we allocate attachments in the begin command buffer due to the render pass continue bit, we were leaking them. Since renderpasses inside a cmd buffer malloc/free these properly, and set to NULL, we just need to call free at end. Fixes a memory leak with multithreading demo. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2 17.3" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-06 01:03:47 +00:00
Bas Nieuwenhuizen	608af05ffb	radv: Optimize calling radv_save_descriptors. uint32_t data[MAX_SETS * 2] = {}; was getting executed before the exit and took significant amounts of time. By having the check outside the function, we skip the execution of the clear. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-11-04 20:18:17 +01:00
Bas Nieuwenhuizen	cecbcf4b2d	radv: Use an array to store descriptor sets. The vram_list linked list resulted in lots of pointer chasing. Replacing this with an array instead improves descriptor set allocation CPU usage by 3x at least (when also considering the free), because it had to iterate through 300-400 sets on average. Not a huge improvement as the pre-improvement CPU usage was only about 2.3% in the busiest thread. Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-11-04 20:18:17 +01:00
Samuel Pitoiset	bad31f6a65	radv: use the optimal packets order for dispatch calls This should reduce the time where compute units are idle, mainly for meta operations because they use a bunch of compute shaders. This seems to have a really minor positive effect for Talos, at least. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-02 23:03:59 +01:00
Bas Nieuwenhuizen	806721429a	radv: Don't expose heaps with 0 memory. It confuses CTS. This pregenerates the heap info into the physical device, so we can use it for translating contiguous indices into our "standard" ones. This also makes the WSI a bit smarter in case the first preferred heap does not exist. Reviewed-by: Dave Airlie <airlied@redhat.com> CC: <mesa-stable@lists.freedesktop.org>	2017-11-02 20:28:19 +01:00
Samuel Pitoiset	c39f39106d	radv: make radv_bind_descriptor_set() static Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-11-02 09:36:14 +01:00
Dave Airlie	799ef80059	radv: make sure we set buffers as shareable properly. This should make sure we don't treat exports buffers as local bos. Fixes: `a639d40f13` (radv: add support for local bos. (v3)) Tested-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Andres Rodriguez <andresx7@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-11-02 01:01:29 +00:00
Samuel Pitoiset	5010436e09	radv: bail out when binding the same vertex buffers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-31 10:16:38 +01:00
Samuel Pitoiset	11fdc2cd34	radv: bail out when binding the same index buffer DOW3 appears to hit this path. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-31 10:16:35 +01:00
Timothy Arceri	e92405c55a	radv: use correct alloc function when loading from disk Fixes regression in: dEQP-VK.api.object_management.alloc_callback_fail.graphics_pipeline Fixes: `1e84e53712` "radv: add cache items to in memory cache when reading from disk" Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-31 14:51:55 +11:00
Alex Smith	134a40d2a6	radv: Fix -Wformat-security issue Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=103513 Fixes: `de88979413` ("radv: Implement VK_AMD_shader_info") Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-30 10:58:56 +01:00
Timothy Arceri	1e84e53712	radv: add cache items to in memory cache when reading from disk Otherwise we will leak them, load duplicates from disk rather than memory and never write items loaded from disk to the apps pipeline cache. Fixes: `fd24be134f` 'radv: make use of on-disk cache' Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-30 17:49:54 +11:00
Alex Smith	de88979413	radv: Implement VK_AMD_shader_info This allows an app to query shader statistics and get a disassembly of a shader. RenderDoc git has support for it, so this allows you to view shader disassembly from a capture. When this extension is enabled on a device (or when tracing), we now disable pipeline caching, since we don't get the shader debug info when we retrieve cached shaders. v2: Improvements to resource usage reporting v3: Disassembly string must be null terminated (string_buffer's length does not include the terminator) v4: Fixed LDS reporting. (Bas) Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-29 00:28:45 +02:00
Samuel Pitoiset	a41e2e9cf5	radv: allow to use a compute shader for resetting the query pool Serious Sam Fusion 2017 uses a huge number of occlusion queries, and the allocated query pool buffer is greater than 4096 bytes. This slightly improves performance (tested in Ultra) from 117.2 FPS to 119.7 FPS (~+2%) on my RX480. This also improves Talos, from 69 FPS to 72/73 FPS (~+5%). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-27 13:47:06 +02:00
Samuel Pitoiset	0d61109bb7	radv: make radv_fill_buffer() return the needed flush bits Only needed when the CS path is used. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-27 13:47:03 +02:00
Dave Airlie	a639d40f13	radv: add support for local bos. (v3) This uses the new kernel interfaces for reduced cs overhead, We only set the local flag for memory allocations that don't have a dedicated allocation and ones that aren't imports. v2: add to all the internal buffer creation paths. v3: missed some command submission paths, handle 0/empty bo lists. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-26 23:59:28 +01:00
Samuel Pitoiset	06a12f250f	radv: only copy the dynamic states that changed When binding a new pipeline, we applied all dynamic states without checking if they really need to be re-emitted. This doesn't seem to be useful for the meta operations because only the viewports/scissors are updated. This should reduce the number of commands added to the IB when a new graphics pipeline is bound. Also, rename radv_dynamic_state_copy() to radv_bind_dynamic_state() and set the dirty flags directly there. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-26 09:37:05 +02:00
Samuel Pitoiset	b1e31c1911	radv: store the dynamic state mask into radv_dynamic_state Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-26 09:37:03 +02:00
Samuel Pitoiset	672cf692fb	radv: only emit the depth bounds test values when set dynamically The depth bounds test values are either set at pipeline creation or dynamically using vkCmdSetDepthBounds(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-26 09:37:00 +02:00
Bas Nieuwenhuizen	61a9ef4ab1	radv: Compute ac keys from pipeline key. The beginning of the end for the shader keys. Not entirely sure what I'm going to replace them with for the compiler though, so this is the first step. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-26 00:28:40 +02:00
Bas Nieuwenhuizen	49d035122e	radv: Add single pipeline cache key. To decouple the key used for info gathering and the cache from whatever we pass to the compiler. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-26 00:28:40 +02:00
Bas Nieuwenhuizen	de38491a57	radv: Don't compute as_ls/as_es before hashing. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-26 00:28:40 +02:00
Samuel Pitoiset	9711979df0	radv: print NIR before LLVM IR and disassembly It's still printed after linking, but it makes more sense to have SPIRV->NIR->LLVM IR->ASM. Fixes: `f0a2bbd1a4` (radv: move nir print after linking is done) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-25 11:46:53 +02:00
Bas Nieuwenhuizen	5bfbab2fdc	radv: Fix truncation issue hexifying the cache uuid for the disk cache. Going from binary to hex has a 2x blowup. Fixes: `1421625292` 'radv: create on-disk shader cache' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-25 09:50:05 +02:00
Timothy Arceri	767ca5bdf1	radv: enable lower to scalar nir pass This will allow dead components of varyings to be removed. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-25 17:02:40 +11:00
Dave Airlie	d8cefaa197	radv: use device name in cache creation like radeonsi. Not sure how useful this is, but it makes it more consistent. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.3" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-25 02:26:01 +01:00
Dave Airlie	3cd3035ace	radv: use a define for the transition point between cp and compute shader For certain buffer meta ops we can use the CP or a compute shader, we should use a define to rather than hardcoding 4096, allows for easier testing and more consistency. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-25 10:01:13 +10:00
Dave Airlie	a5499b639c	radv: only emit dfsm packets if dfsm is allowed. radeonsi only emits these when dfsm is enabled, so for now just hinge them on a flag we never set. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-24 23:00:57 +01:00
Timothy Arceri	f0a2bbd1a4	radv: move nir print after linking is done We now have linking optimisations so we want to delay dumping the nir until after these are complete. Fixes: `06f05040eb` (radv: Link shaders) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-24 10:41:38 +11:00
Timothy Arceri	013313cf89	radv: clone meta shaders before linking The IR is reused in different pipeline combinations so we need to clone it to avoid link time optimistaions messing up the original copy. Fixes: `06f05040eb` (radv: Link shaders) Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-24 09:27:40 +11:00
Alex Smith	fee9d05e21	radv: Update code pointer correctly if a variant is already created This was the actual cause of GPU hangs fixed by `0fdd531457` ("radv: Fix pipeline cache locking issues"), since multiple threads would end up trying to create the variants for a single entry. Now that we're locking around the whole of this function, this isn't really necessary (we either create all or none of the variants), but fix this anyway in case things change later. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> CC: 17.3 <mesa-stable@lists.freedesktop.org>	2017-10-23 22:36:54 +02:00
Juan A. Suarez Romero	2665d012a8	radv: automake: include radv_extensions.py in the tarball Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-23 12:37:01 +02:00
Bas Nieuwenhuizen	c07d719e8b	radv: Disallow indirect outputs for GS on GFX9 as well. Since it also uses the output vector before writing to memory. Fixes: `e38685cc62` 'Revert "radv: disable support for VEGA for now."' Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-10-23 00:27:44 +02:00
Dave Airlie	da9c3cd3ee	radv/ac/nir: only emit tess factors to storage if tes reads them Otherwise we just need to write them to the tf ring. this seems to improve the tessellation demo on Bonarie ~2190->~2230 fps Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-10-23 07:10:29 +10:00
Bas Nieuwenhuizen	6ce550453f	radv: Don't use vgpr indexing for outputs on GFX9. Due to LLVM bugs. Fixes a bunch of dEQP-VK.glsl.indexing.* tests. Fixes: `e38685cc62` 'Revert "radv: disable support for VEGA for now."' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-22 02:36:37 +02:00
Bas Nieuwenhuizen	67648c0faa	radv: Don't compile shaders when they are cached already. When the gs_copy_shader is NULL (due to an incomplete cache), but the main shaders are found, we still do the nir, but we shouldn't compile the shaders again. For merged shaders we should also account for the missing shaders. Fixes: `ce03c119ce` 'radv: Add code to compile merged shaders.' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 22:29:34 +02:00
Bas Nieuwenhuizen	3bf954b28e	radv: Don't check for max GL GS invocations. We specify 127 instead of 32 as the limit in vulkan. Fixes: `6bc42855f9` 'radv: enable GS on GFX9' Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-10-21 22:29:09 +02:00
Bas Nieuwenhuizen	050f7e2df2	radv: Don't explicitly reference vertex shader for draw_id. With merged shaders the vertex shader may not exist. This got in because the offending patch was written before merged shaders were upstream, but committed after. Fixes: `75dfab24a2` 'radv: refactor indirect draws with radv_draw_info' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 20:00:22 +02:00
Bas Nieuwenhuizen	20fb15bfe4	radv: Don't reset cmd_buffer->state.dirty. Otherwise for non-indexed draws we set and immediately unset RADV_CMD_DIRTY_INDEX_BUFFER. As all the set functions should clear their own bit, this is unnecessary. Fixes: `341529dbee` 'radv: use optimal packet order for draws' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 20:00:16 +02:00
Bas Nieuwenhuizen	fb55477990	radv: Correctly detect changed shaders for vertex descriptors. As they were emitted after the new pipeline, the changed pipeline detection was not working anymore. Fixes: `341529dbee` 'radv: use optimal packet order for draws' Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2017-10-21 19:59:44 +02:00
Alex Smith	0fdd531457	radv: Fix pipeline cache locking issues Need to lock around the whole process of retrieving cached shaders, and around GetPipelineCacheData. This fixes GPU hangs observed when creating multiple pipelines in parallel, which appeared to be due to invalid shader code being pulled from the cache. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 03:52:43 +02:00
Andres Rodriguez	a2c6fbb3ee	radv: disable implicit sync for radv allocated bos v3 Implicit sync kicks in when a buffer is used by two different amdgpu contexts simultaneously. Jobs that use explicit synchronization mechanisms end up needlessly waiting to be scheduled for long periods of time in order to achieve serialized execution. This patch disables implicit synchronization for all radv allocations except for wsi bos. The only systems that require implicit synchronization are DRI2/3 and PRIME. v2: mark wsi bos as RADV_MEM_IMPLICIT_SYNC v3: Add drm version check (Bas) Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:15:54 +02:00
Andres Rodriguez	eff2bdbd82	radv: factor out radv_alloc_memory This allows us to pass extra parameters to the memory allocation operation that are not defined in the vulkan spec. This is useful for internal usage. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:15:49 +02:00
Andres Rodriguez	92724338ba	radv: Expose VK_EXT_global_priority Expose the extension string as supported Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:01:44 +02:00
Andres Rodriguez	9f7edf4d1f	radv: don't skip PS/VS partial flush This patch helps lower high priority compute latency. Found by bisecting a perf regression on computeparticles with high priority compute queues enabled. Reverting this micro-optimization doesn't seem to have any negative effect on performance on Dota2 or ssao. Signed-off-by: Andres Rodriguez <andresx7@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-10-21 01:01:44 +02:00

... 169 170 171 172 173 ...

9469 commits