fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 11:48:05 +02:00

Author	SHA1	Message	Date
Paulo Zanoni	f9477770d8	anv: use vk_realloc for the anv_execbuf arrays Three reasons for that: 0. The operation we're doing here is actually a reallocation. 1. The newer code is, IMHO, easier to read. 2. Realloc has this property where sometimes, when possible, it will expand your array without moving it somewhere else, so it doesn't need to copy the memory contents, returning the original pointer back to you. I did some analysis and while that case is not common, it does happen sometimes in real world applications (I could see it happening in Shootergame and Aztec Ruins, but not Dota 2), so we're able to save a few CPU cycles. v2: Rebase. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703>	2023-01-19 02:21:09 +00:00
Paulo Zanoni	6d4fc0e5bf	anv: rename anv_execbuf->array_length to bo_array_length Because this is counting the array length of the things related to the BOs, just like syncobj_array_length is counting the array length of the things related to syncobjs. v2: Rebase. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703>	2023-01-19 02:21:09 +00:00
Paulo Zanoni	e642cafdae	anv: run buf_finish() if add_bo() fails during execute_simple_batch() This is the only code path where we don't run anv_execbuf_finish() in case anv_execbuf_add_bo() fails. While there is not a bug in the current tree, I recently made an (uncommitted) modification that started leaking memory and made me realize the lack of cleanup here. If we had anv_execbuf_finish() being called upon error like we're going to have after this patch my modification wouldn't have caused the memory leak. I think it's much safer and future-proof if we're able to operate under the assumption that whatever is allocated and set to anv_execbuf will be dealt with upon failure of anything else related to it, so functions that fail should only be required to free pointers not yet assigned to anv_execbuf. The dEQP-VK 'alloc_callback_fail' tests should exercise this code path. The one I was specifically using here is: dEQP-VK.api.object_management.alloc_callback_fail.device_group v2: Rebase. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703>	2023-01-19 02:21:09 +00:00
Paulo Zanoni	3d37950fd9	anv: check the return value of anv_execbuf_add_bo_bitset() Because anv_execbuf_add_bo_bitset() calls anv_execbuf_add_bo(), which can fail if its memory allocations fail. I have seen dEQP tests exercising memory allocation failures during anv_execbuf_add_bo(), but I don't think the path coming from add_bo_biset() was specifically exercised. Anyway, add the error check just in case. v2: Rebase. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703>	2023-01-19 02:21:09 +00:00
Paulo Zanoni	ad6a036a68	anv: don't leave undefined values in exec->syncobj_values In anv_execbuf_add_syncobj(), we try to not create or use exec->syncobj_values if we don't need to. But when we figure we're going to need it (i.e., when timeline_value is not zero), then we create exec->syncobj_values with vk_zalloc, which means every previous value is set to zero, as it should be. This is all correct. The problem starts when we add a 16th element. In this case we double exec->syncobj_array_length and realloc the buffer by using vk_alloc and copying the old array to the new one. After that, we write the timeline_value to the array only if it's not zero, and that's the problem: since we just used vkalloc and memcpy, we don't have any guarantees that the new array will be zero after the 16th element, and if timeline_value is zero we write nothing to that position. Once we start using exec->syncobj_values we have to commit to using it, so the "if (timeline_value)" check near the end of the function has to be changed to "if (exec->syncobj_values)", so we actually set elements after the 16th to zero when they need to be zero. Another approach to fix this would be to memset the new elements once we double syncobj_array_length. In practice, I couldn't find any application or deqp test that used more than 3 elements in exec->syncobj_array_length, and we need more than 16 elements in order to be able to reproduce the bug, so I'm not aware of any real-world bug that goes away with this patch. This issue was found while reading code. If we craft a little Vulkan program that submits a ton of timeline and binary semaphores on vkQueueSubmit, then waits for them, we get the following error without this patch: MESA: error: ../../src/intel/vulkan/anv_batch_chain.c:1910: execbuf2 failed: Invalid argument (VK_ERROR_DEVICE_LOST) v2: Rebase. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703>	2023-01-19 02:21:09 +00:00
José Roberto de Souza	e879b28994	anv: Move anv_device_check_status() code to i915/anv_device.c Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20428>	2023-01-17 17:10:18 +00:00
José Roberto de Souza	94af444490	anv: Split i915 code from anv_batch_chain.c There is no change in behavior here. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20428>	2023-01-17 17:10:18 +00:00
José Roberto de Souza	94ca73b356	anv: Export anv_exec_batch_debug() and chain_command_buffers() This functions will be used by i915 and Xe KMD. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20428>	2023-01-17 17:10:18 +00:00
José Roberto de Souza	80c89c4606	anv: Start to move i915 specific code from anv_device to i915/anv_device More code re-organization to separate i915_drm.h specific code from the rest. No behavior changes here. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20428>	2023-01-17 17:10:18 +00:00
Illia Polishchuk	8491b1fd5e	ANV: Add extra memory types for ANV driver instead of a single one Some game engines can't handle single type well And Intel on Windows uses 3 types so it's better to add extra one here Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7360 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Illia Polishchuk <illia.a.polishchuk@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20693>	2023-01-17 07:41:52 +00:00
Jason Ekstrand	b39958a3a1	anv,nir: Move the ANV YCbCr lowering pass to common code Nir changes: Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Anv changes: Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19950>	2023-01-16 14:10:21 +00:00
Jason Ekstrand	2ac771973d	anv: Use the YCbCr format info from common code We still maintain our own table of formats but all of the conversion and sampling info we pull from common code. Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19950>	2023-01-16 14:10:21 +00:00
Jason Ekstrand	30a91d333d	anv: Use the common vk_ycbcr_conversion object Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19950>	2023-01-16 14:10:21 +00:00
Jason Ekstrand	18feb32df0	anv/android: Use VkFormat for externalFormat Using a pointer to an internal data structure works but it's a bit sketchy. Since every anv_format maps to a VkFormat, we may as well just use the VkFormat. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19950>	2023-01-16 14:10:21 +00:00
Jason Ekstrand	9fc046a87d	anv: Refactor Android externalFormat handling in CreateYcbcrConversion Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19950>	2023-01-16 14:10:21 +00:00
Lionel Landwerlin	28b15fa9e7	anv: add support for command buffer tagging in traces Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16655>	2023-01-13 01:22:15 +00:00
Lionel Landwerlin	9a16effeac	anv: record secondaries' traces into primaries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16655>	2023-01-13 01:22:15 +00:00
Emma Anholt	f67a0a7745	anv: Add a tracepoint for the fallback implicit sync wait path. If you're here, you'd really like to know. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20658>	2023-01-12 20:21:03 +00:00
Emma Anholt	1aa163ebb5	anv: Print the BO sizes in KB instead of hex bytes. We already show the address range, which is most of why I'd think you'd be looking at hex values. I find a more human-readable number nice for debugging, instead of counting zeroes to decide if it's 1.5MB or 96kb. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20540>	2023-01-11 00:35:34 +00:00
Emma Anholt	38e29fe712	anv: Fix the size/aperture space debug printouts to consider _ccs_size. It's added in at anv_bo_vma_alloc_or_close(), so count it here too. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20540>	2023-01-11 00:35:34 +00:00
Emma Anholt	e937c4b716	anv: Add an aperture space summary to INTEL_DEBUG=submit. Same as on iris, this is nice for tracking at a high level how much memory is being used. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20540>	2023-01-11 00:35:34 +00:00
Lionel Landwerlin	2d627f28c8	anv: use the null surface with unused push descriptor binding table entries Some binding table entries have been identify as unused in the shaders by the push constant analysis pass. We can just put the null entry in there. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b49b18f0b7` ("anv: reduce BT emissions & surface state writes with push descriptors") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20555>	2023-01-09 23:00:24 +00:00
Lionel Landwerlin	bbfca4eb92	anv: return properly typed value for no ubo promoted Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ff91c5ca42` ("anv: add analysis for push descriptor uses and store it in shader cache") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20555>	2023-01-09 23:00:24 +00:00
Lionel Landwerlin	e2b0086b78	anv: check that push range actually match binding considered We can't just check the load_ubo range is contained in the push entry, we also need to check that the push entry set/binding matches the load_ubo set/binding. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ff91c5ca42` ("anv: add analysis for push descriptor uses and store it in shader cache") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20555>	2023-01-09 23:00:24 +00:00
Lionel Landwerlin	48bb3df951	anv: don't nullify entries We'll use those to fill the push constant addresses, so we can't have them turned to null. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ff91c5ca42` ("anv: add analysis for push descriptor uses and store it in shader cache") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20555>	2023-01-09 23:00:24 +00:00
José Roberto de Souza	1067ec90a5	anv: Update PIPELINE_CONTROL flush when switching pipeline mode in TGL+ This 2 PIPELINE_CONTROL flushes are not necessary for TGL and newer and also it have different requirements of flush, so here doing this two changes at the same time. As no ANV_PIPE_INVALIDATE_BITS is set as parameter of anv_add_pending_pipe_bits(), genX(cmd_buffer_apply_pipe_flushes)(cmd_buffer) will only emit one PIPELINE_CONTROL. BSpec: 44505 Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20501>	2023-01-09 14:40:26 +00:00
Rohan Garg	85650297d2	anv,hasvk: move the null check into the function call and drop null check copies Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:16 +00:00
Rohan Garg	0ae23b81a4	anv: Drop useless FIXME Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:16 +00:00
Rohan Garg	00ffe8227f	anv,hasvk: drop unused function align_i32 is not used anywhere Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:16 +00:00
Rohan Garg	05dca17b57	anv,hasvk: migrate to ROUND_DOWN_TO from util Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:16 +00:00
Rohan Garg	818eed3d2f	anv,hasvk: migrate to u_minify from util Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:16 +00:00
Rohan Garg	9257b08f49	anv: migrate anv_minify to use u_minify Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:16 +00:00
Rohan Garg	4504188508	anv,hasvk: migrate to align64 from util Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:16 +00:00
Rohan Garg	a06f751ec8	anv,hasvk: migrate align32 to the right functions from util Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:16 +00:00
Rohan Garg	1e9fb7c696	anv,hasvk: Use the inbuilt macro from src/util for clamping int64_t Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:15 +00:00
Rohan Garg	0030d6d224	anv: constify variables and use early returns Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20104>	2023-01-06 17:22:15 +00:00
Felix DeGrood	7f6beb8537	anv: Emit CS stall on INTEL_MEASURE timestamp For INTEL_MEASURE, ensure all prior instructions completed before timestamp taken. Continue to support no CS flush case for Perfetto. CS stall was dropped from pipecontrol when adding u_trace support. Fixes: `cc5843a573` ("anv: implement u_trace support") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20502>	2023-01-04 22:43:36 +00:00
Tapani Pälli	97f2b60833	anv: implement Wa_14015814527 for task shaders After using task shader, we need to emit a zero URB state and a nullprim (empty pipe control) before rendering with primitives. After this, a normal URB state needs to be returned, this will happen when pipeline batch is emitted during pipeline switch. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20334>	2023-01-03 12:44:08 +00:00
Sviatoslav Peleshko	c2acd9f76a	anv: Add layer with work-around for Doom 64 texture corruption Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7817 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19502>	2023-01-02 15:05:06 +00:00
José Roberto de Souza	c6d1f76da2	anv: Add and use emit_pipeline_select() To avoid the replication of code to properly emit PIPELINE_SELECT. init_compute_queue_state() had a different emit of PIPELINE_SELECT but as there is no compute engine in GFX VER 11 we are safe with the differences. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20444>	2022-12-29 08:34:15 -08:00
Tapani Pälli	b9aa66d5d0	anv: disable preemption for 3DPRIMITIVE during streamout This is required by Wa_16013994831. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20438>	2022-12-27 15:53:42 +00:00
Lionel Landwerlin	afdbed9e9c	anv: fix potential integer overflow The loop going from 0 to max_draw_count multiplies the value which could potentially overflow. Fixes Coverity CID 1517852 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `3596a8ea7a` ("anv: factor out some indirect draw count entry points") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20436>	2022-12-27 14:21:44 +00:00
Lionel Landwerlin	c950fe97a0	anv: implement generated (indexed) indirect draws Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15642>	2022-12-23 22:52:50 +00:00
Lionel Landwerlin	3596a8ea7a	anv: factor out some indirect draw count entry points Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15642>	2022-12-23 22:52:50 +00:00
Lionel Landwerlin	61b730f1f4	anv: decouple util function from anv_cmd_buffer The issue we're addressing here is that we have 2 batches and the both grow at different rate. We want to keep doubling the main batch size as the application writes more and more commands to limit the number of GEM BOs. But we don't want to have the generation batch size to be linked to the main batch. v2: remove gfx7 code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15642>	2022-12-23 22:52:50 +00:00
José Roberto de Souza	3e28c5b9f9	anv: Pass anv_bo as parameter to anv_gem_mmap() anv_bo has information that will be needed by a future patch in anv_gem_mmap(), so here already preparing for that. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20423>	2022-12-23 18:22:29 +00:00
Lionel Landwerlin	739a08ad23	anv: handle null push descriptors in deferred optimization Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b49b18f0` ("anv: reduce BT emissions & surface state writes with push descriptors") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20410>	2022-12-22 14:07:21 +00:00
Rohan Garg	ad9c0e8cd9	anv: Ensure we clear ANV_PIPE_PSS_STALL_SYNC_BIT on flush Add the PSS stall bit to ANV_PIPE_STALL_BITS so that it get's cleared on flush. Fixes: `f3c62973` ("anv,iris: PSS Stall Sync around color fast clears") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20317>	2022-12-20 10:44:54 +00:00
Chad Versace	a5f9e59ce3	anv: Use vma_heap for descriptor pool host allocation Pre-patch, anv_descriptor_pool used a free list for host allocations that never merged adjacent free blocks. If the pool only allocated fixed-sized blocks, then this would not be a problem. But the pool allocations are variable-sized, and this caused over half of the pool's memory to be consumed by unusable free blocks in some workloads, causing unnecessary memory footprint. Replacing the free list with util_vma_heap, which does merge adjacent free blocks, fixes the memory explosion in the target workload. Disdavantges of util_vma_heap compared to the free list: - The heap calls malloc() when a new hole is created. - The heap calls free() when a hole disappears or is merged with an adjacent hole. - The Vulkan spec expects descriptor set creation/destruction to be thread-local lockless in the common case. For workloads that create/destroy with high frequency, malloc/free may cause overhead. Profiling is needed. Tested with a ChromeOS internal TensorFlow benchmark, provided by package 'tensorflow', running with its OpenCL backend on clvk. cmdline: benchmark_model --graph=mn2.tflite --use_gpu=true --min_secs=60 gpu: adl memory footprint from start of benchmark: before: init=132.691MB max=227.684MB after: init=134.988MB max=134.988MB Reported-by: Romaric Jodin <rjodin@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20289>	2022-12-16 07:18:38 +00:00
Iván Briano	766508f56a	Revert "anv: Refactor anv_pipeline to use the anv_pipeline_type" This reverts commit `b1126abb38`. This breaks all hell at least on DG2, as there are several cases left where current_pipeline gets checked against GPGPU to decide what to do, and the value doesn't match that of ANV_HW_PIPELINE_STATE_COMPUTE. On top of that, it also misses checking for ANV_HW_PIPELINE_STATE_RAYTRACING. Then there's the fact that in some cases, current_pipeline will be UINT32_MAX, because it's the original undefined state and also used after executing a secondary command buffer because we are not tracking on which pipeline did the secondary left us. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7910 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20349>	2022-12-16 06:39:32 +00:00

1 2 3 4 5 ...

4247 commits