fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 20:18:06 +02:00

Author	SHA1	Message	Date
Caio Oliveira	2811cb2923	intel: Add statistic for Non SSA registers after NIR to BRW This is going to be useful while we convert the NIR to BRW to produce SSA definitions. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Tapani Pälli	e4fcbe8d6f	anv: set StackIDControlOverride_RTGlobals for 2 workarounds GFX_VER block matches both workarounds and while these workarounds are almost about the same cause, other one applies only for LNL and other one for BMG, need to check for both. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31571>	2024-10-10 10:20:56 +00:00
Tapani Pälli	78b614b333	anv: add depth, DC and L3 fabric flush for aux map invalidation These should be included according to table in Bspec 43904. Patch removes PIPE_CONTROL_STATE_CACHE_INVALIDATE based on HSDES. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29764>	2024-10-08 08:45:40 +00:00
Tapani Pälli	e3814dee1a	anv: add plumbing/support for L3 fabric flush Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29764>	2024-10-08 08:45:40 +00:00
Mike Blumenkrantz	5ba00df1f9	anv: add VK_FORMAT_G10X6_B10X6R10X6_2PLANE_420_UNORM_3PACK16 to modifier exceptions this is implemented Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31516>	2024-10-04 13:22:08 +00:00
Nanley Chery	26692deefc	anv: Delete stale comment for BLORP clear color addr It looks like this comment attempted to describe all the reasons we need to pass the clear color address to BLORP. This comment actually isn't exhaustive and some bits are out of date (e.g., BLORP no longer updates the clear color address for us). Let's just delete it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31136>	2024-10-03 19:41:31 +00:00
Nanley Chery	10bcfb63d5	anv: Prevent clear color modifier corruption with views If a dmabuf is shared with a clear color, the raw clear color channels generally won't be interpreted correctly during format reinterpretation. So, prevent Vulkan apps from trying to use such dmabufs as mutable format render targets. Also, prevent such apps from using such dmabufs as blorp_copy() destinations if doing so would require format reinterpretation. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31136>	2024-10-03 19:41:31 +00:00
Nanley Chery	6721064939	anv: Use image formats when copying to/from buffers blorp_copy() will sometimes use a complex shader if the source and destination surface formats differ. For example, it will do this when both formats support CCS_E, but have differing numbers of bits-per-channel. To reduce the chance of using this complex shader during transfers between images and buffers, ensure the same format is used. We can't completely prevent the complex shader because a copy may happen between surface formats that have a different number of bits-per-pixel. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31136>	2024-10-03 19:41:31 +00:00
Tapani Pälli	ac00d97e31	anv: use mi_builder in CmdBeginTransformFeedbackEXT Patch converts MI_LOAD_REGISTER_MEM, MI_LOAD_REGISTER_IMM to use mi_builder in CmdBeginTransformFeedbackEXT. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31502>	2024-10-03 16:20:40 +00:00
Lionel Landwerlin	1f2ad64b63	anv: optimize WA 16011107343/22018402687 No need to emit the instruction twice. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Backport-to: 24.2 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31481>	2024-10-02 17:27:55 +00:00
Lionel Landwerlin	4cdb5de163	anv: consolidate pre/post draw workaround in helpers This avoids sprinkling those all over the code base. Debug breakpoints are put in there too. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Backport-to: 24.2 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31481>	2024-10-02 17:27:55 +00:00
Lionel Landwerlin	18e2c25dad	anv: limit 22018402687 to impacted platforms ARL is impacted, but LNL is not. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Backport-to: 24.2 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31481>	2024-10-02 17:27:55 +00:00
Lionel Landwerlin	17c3bd358e	anv: limit render target cache flushing due to color output remapping Fixes a performance regression of 1%/2% introduced in `badb3f6301` ("anv: Only flush render target cache when detecting RT changes") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31380>	2024-10-01 15:52:39 +00:00
Hyunjun Ko	f76781feb8	anv: enable KHR_video_maintenance1 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31436>	2024-10-01 10:45:14 +09:00
Hyunjun Ko	ac2fd8ae66	anv: support VK_IMAGE_CREATE_VIDEO_PROFILE_INDEPENDENT_BIT_KHR Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31436>	2024-10-01 10:45:14 +09:00
Hyunjun Ko	0981d20850	anv: support for inline query for vulkan video v1. Removed the unnecessary query begin code. (lionel.g.landwerlin@intel.com) Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31436>	2024-10-01 10:45:14 +09:00
Hyunjun Ko	1b06d4a8ea	anv: consider VK_VIDEO_CODEC_OPERATION_ENCODE_H264_BIT_KHR when allocating mv storgae. Fixes: `3ec8f7f99` ("anv/video: initial support for h264 encoding") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31436>	2024-10-01 10:45:14 +09:00
Hyunjun Ko	8a3f852119	anv/video: support VK_VIDEO_ENCODE_RATE_CONTROL_MODE_DISABLED_BIT_KHR. Which means to support CQP mode. Fixes: `3ec8f7f99` ("anv/video: initial support for h264 encoding") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31436>	2024-10-01 10:45:14 +09:00
Jules Blok	4994c5a243	anv: Add support for VK_EXT_depth_clamp_control Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31411>	2024-09-30 22:18:27 +00:00
Paulo Zanoni	bd33917509	anv: remove another copy of the texture cache pipe_control workaround The workaround is already implemented by batch_emit_pipe_control_write(), we don't need to do it here as well. This was spotted by Lionel Landwerlin. The credits go to him, I just wrote the patch. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31412>	2024-09-30 21:44:12 +00:00
Paulo Zanoni	fd4a44430c	anv: remove duplicate pipe_control workaround Commit `a603cc0633` ("anv: move some pc was to batch_emit_pipe_control_write") moved some WAs from emit_apply_pipe_flushes() to batch_emit_pipe_control_write(), but it turns out one of them was already there since `cf7e1f3817` ("anv, iris: add missing CS_STALL bit for GPGPU texture invalidation"). Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31412>	2024-09-30 21:44:12 +00:00
Tapani Pälli	c1a44e8d43	anv: force StackIDControl value for Wa_14021821874 This is also encouraged by another wa, Wa_14018813551. Both workarounds state that StackIDControlOverride_RTGlobals should always be set to 0 (i.e. 2k). Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30937>	2024-09-30 07:33:37 +03:00
Iván Briano	a4cbc903a8	anv: allocate sparse descriptor buffers from the correct heap When allocating a buffer normally, this flag gets to the allocator from the memory requirements, but when sparse bindings are created we were checking for them but never setting them. Fixes sparse descriptor buffers on Xe2. Makes the failure on TRTT more obvious. Fixes: `c6a91f1695` ("anv: add new heap/pool for descriptor buffers") Fixes: `692e1ab2c1` ("anv: get rid of the second dynamic state heap") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31372>	2024-09-27 04:49:22 +00:00
Paulo Zanoni	fe59044f47	anv/trtt: mark vk_sync_get_value()'s value as defined for Valgrind Valgrind doesn't seem to know that drmSyncobjQuery() writes to the variable that we pass as 'last_value'. This gets rid of: ==6275== Conditional jump or move depends on uninitialised value(s) ==6275== at 0x5308370: anv_sparse_trtt_garbage_collect_batches (anv_sparse.c:540) ==6275== by 0x53091E2: anv_sparse_bind_trtt (anv_sparse.c:825) ==6275== by 0x5309771: anv_sparse_bind (anv_sparse.c:953) ==6275== by 0x5309A3B: anv_free_sparse_bindings (anv_sparse.c:1041) ==6275== by 0x529FF21: anv_DestroyBuffer (anv_buffer.c:248) ==6275== by 0x932ADBD: ??? (in /usr/lib/x86_64-linux-gnu/libVkLayer_khronos_validation.so) ==6275== by 0x127AA2: MyVkBuffer::~MyVkBuffer() (sparse.cpp:364) ==6275== by 0x12B2D4: MyApp::test1_trivial_sparse() (sparse.cpp:1421) ==6275== by 0x13E01A: MyApp::run_test(int) (sparse.cpp:6594) ==6275== by 0x13E3B0: main (sparse.cpp:6656) ==6275== Uninitialised value was created by a stack allocation ==6275== at 0x53082D3: anv_sparse_trtt_garbage_collect_batches (anv_sparse.c:525) An alternative to these Valgrind macros would simply have been to zero-intialize last_value. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31332>	2024-09-27 04:10:12 +00:00
Paulo Zanoni	ab91106d4f	anv: fix compute engines when using ANV_QUEUE_OVERRIDE I just noticed that my custom sparse program was not working correctly when I used ANV_QUEUE_OVERRIDE (instead of enabling the compute queue by default or using INTEL_ENGINE_CLASS_COMPUTE, which was removed by commit `600d88ab3c` ("intel: Remove INTEL_ENGINE_CLASS_COMPUTE and INTEL_ENGINE_CLASS_COPY parameters"). It turns out we were not setting the same engine class type when using ANV_QUEUE_OVERRIDE vs the other cases. Move the code around so the behavior can stay the same. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31332>	2024-09-27 04:10:12 +00:00
Matt Turner	75f02ed4b5	anv: Set shader_spilling_rate=15 by default This avoids massively long shader compile times when there is lots of spilling, at a minor cost of a few more spills/fills. Choose 15 as it is already the default used by the Cyberpunk 2077 driconf workaround. Surprisingly the number of additional spills/fills are miniscule in fossil-db: Instructions in all programs: 152680595 -> 152681525 (+0.0%) SENDs in all programs: 7672789 -> 7672789 (+0.0%) Loops in all programs: 48469 -> 48469 (+0.0%) Cycles in all programs: 11981743456 -> 11984228708 (+0.0%) Spills in all programs: 42989 -> 42779 (-0.5%) Fills in all programs: 76380 -> 76776 (+0.5%) partly because of the chaotic unpredictability that the choice of registe to spill has on a shader. For example, this patch massively helps some shaders in terms of spills/fills: Spills helped fossils/fossil-db/steam-native/red_dead_redemption2.vk-g6.foz/4101ff9c9b83bf22/SIMD8 fragment: 3208 -> 2894 (-9.8%) Fills helped fossils/fossil-db/steam-native/red_dead_redemption2.vk-g6.foz/4101ff9c9b83bf22/SIMD8 fragment: 7258 -> 6795 (-6.4%) Spills helped fossils/q2rtx/q2rtx-rt-pipeline.976f4ab1c0fee975.1.foz/c496e8a549f6b4bf/compute: 109 -> 92 (-15.6%) Related: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31133 Related: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9241 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11709 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11844 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31269>	2024-09-27 03:43:52 +00:00
Sagar Ghuge	f39cd30f4f	anv: Track all the descriptor sets During compute state save/restore, let's track all the descriptor sets. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30798>	2024-09-26 06:56:21 +00:00
Dylan Baker	f8273555d3	anv: enable VK_EXT_ycbcr_2plane_444_formats Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31352>	2024-09-25 22:10:14 +00:00
Lionel Landwerlin	0b5408f9fc	anv: expose VK_EXT_pipeline_protected_access Intel's protection mechanism is descriptor based. There is nothing going on in the shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31339>	2024-09-25 16:45:49 +00:00
Lionel Landwerlin	d2f7b6d5a7	anv: implement VK_KHR_dynamic_rendering_local_read Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27270>	2024-09-25 12:51:07 +00:00
Lionel Landwerlin	15987f49bb	anv: avoid setting up a null RT unless needed Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27270>	2024-09-25 12:51:07 +00:00
Nanley Chery	730e83b525	anv: Require compression for fast-clears on gfx20+ In commit `44351d67f8`, I needed to change some variables in a check for compression in anv_can_fast_clear_color_view(). Instead of doing that, I dropped the check altogether because I thought the call to anv_layout_to_fast_clear_type() which followed right afterwards would return ANV_FAST_CLEAR_NONE if the aux usage was ISL_AUX_USAGE_NONE. That turned out not to be the case, due to special-casing of Xe2+. For now, make Xe2+ more like other platforms when it comes to enabling fast-clears. If there comes a reason to actually fast-clear with ISL_AUX_USAGE_NONE, we can revisit this. Fixes: `44351d67f8` ("anv: Change params of anv_can_fast_clear_color_view") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11920 Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31297>	2024-09-24 13:56:02 +00:00
Mike Blumenkrantz	04709e4f7d	anv: fix video profile lists these didn't include dmabuf layout or mutable formats despite both being supported Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31317>	2024-09-24 11:38:48 +00:00
Lionel Landwerlin	f81dc17e7d	anv: add missing pipeline instance multiplier Fix zink/anv tests : dEQP-GLES3.functional.fbo.multiview.samples_* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11911 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31341>	2024-09-24 10:36:17 +00:00
Iván Briano	2e1c278e3d	anv: skip rt pipeline compile if we found all shaders When no pipeline cache is provided by the application and we rely on the internal one, cache hits are not counted as such. This was causing us to return COMPILE_REQUIRED on some cases where all shaders had been found in the cache, as well as some unnecessary extra processing in the case that we did have to compile the pipeline. Fixes: `1dacea10f3` ("anv: implement caching for ray tracing pipelines") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31298>	2024-09-23 19:57:53 +00:00
Iván Briano	1a45c8827b	anv: free shaders on rt pipeline compile error We have not yet added the shaders to the pipeline->shaders array at this point. If we couldn't compile (or were asked not to) the pipeline, we were leaking references to any shaders found in the cache. This would manifest as an assert on device destruction: vk_pipeline_cache_destroy: Assertion `cache->object_cache->entries == 0' failed. Fixes: `58c9f817cb` ("anv: fix pipeline executable properties with graphics libraries") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31298>	2024-09-23 19:57:53 +00:00
Lionel Landwerlin	badb3f6301	anv: Only flush render target cache when detecting RT changes We setup an empty render target when there are no color attachments, which effectively makes it a different surface state. In most cases the compiler will insert a null-rt bit in the extended descriptor which means the RT isn't even accessed. But in some cases like alpha-to-coverage output + depth/stencil write, we will access the render target because using the null-rt will prevent alpha-to-coverage from happening. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2bd304bc8f` ("anv: Skip the RT flush when doing depth-only rendering.") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	fb3ae17d96	anv: fix missing tracking for alpha-to-coverage runtime changes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9926aedc96` ("anv: enable EDS3 AlphaToCoverageEnable & RasterizationSamples") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:01 +00:00
Nanley Chery	b3882c4488	intel: Avoid no-op calls to anv_image_clear_color Whenever we execute a fast-clear due to LOAD_OP_CLEAR, we decrease the number of layers to clear by one. We then enter the slow clear function and possibly exit without clearing if the layer count is zero. Unfortunately, we've already compiled the shader for slow clears by the time we exit. Skip the slow clear function if there are no layers to clear. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	1c7fe9ad1b	anv: Support fast clears in anv_CmdClearColorImage At least two game traces make use of this path: TWWH3 and Factorio. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	46d58583ff	anv: Move exec_ccs_op and exec_mcs_op higher up The next patch will use them in anv_CmdClearColorImage(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	03286117ef	anv: Move and rename anv_can_fast_clear_color_view It's no longer specific to image views. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:36 +00:00
Nanley Chery	44351d67f8	anv: Change params of anv_can_fast_clear_color_view Expand the scope to more than just image views. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:36 +00:00
José Roberto de Souza	7c01cbda6f	anv: Optimize vkQueueWaitIdle() on Xe KMD vk_common_QueueWaitIdle() creates a syncobj, does a submit with no batch buffers what translates to execute trivial_batch_bo and then waits for syncobj to be signaled when trivial_batch_bo finishes. On Xe KMD on other hand we can avoid the trivial_batch_bo submission and instead use the special DRM_IOCTL_XE_EXEC with num_batch_buffer == 0 to get a syncobj to be signaled when the last exec finish execution. This should free a bit GPU to execute more important workloads. This will also optimize vkDeviceWaitIdle() that calls QueueWaitIdle(). It have to fallback to vk_common_QueueWaitIdle() when queue is in VK_QUEUE_SUBMIT_MODE_THREADED mode because vkQueueWaitIdle() could return but there still stuff in VK/CPU submission queue. Also it could cause use after free when resources attached to submission are freed before it is processed, example: vkCreateFence() or vkCreateSemaphore() vkQueueSubmit() // with Fence or Semaphore created above vkQueueWaitIdle() // with the race it returns vkDestroyFence() or vkDestroySemaphore() // vk_queue_submit_thread_func() start to process submission above... Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30958>	2024-09-19 23:12:45 +00:00
José Roberto de Souza	2f7c9f906d	intel: Split anv_xe_wait_exec_queue_idle() and move part of it to common/ Split anv_xe_wait_exec_queue_idle() into 2 functions, the first function creates the syncobj and prepares it to be signaled when the last workload in queue is completed. And the second one that calls the first function, then waits for the syncobj to be signaled and destroy the syncobj. The main reason for that is that the first function can be reused in Iris and a future patch will add another user, so lets share it. No changes in behavior are expected here. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30958>	2024-09-19 23:12:44 +00:00
José Roberto de Souza	89c6fa1883	anv: Fix condition to clear query pool with blorp The comment above says it all, only when queue is not protected that it is possible to clear query pool with blorp but it was checking the opposite. Fixes: `d5b0526507` ("anv: propagate protected information for blorp operations") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31239>	2024-09-19 17:54:24 +00:00
José Roberto de Souza	0ced5663e2	anv: Improve readbility of khr_perf_query_availability_offset() and khr_perf_query_data_offset() No changes in behavior expected here. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31239>	2024-09-19 17:54:24 +00:00
José Roberto de Souza	3d09ffde46	anv/query: Fix batch end value This were not causing any issues but better set end to the correct value. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31239>	2024-09-19 17:54:24 +00:00
José Roberto de Souza	ac95745dc4	anv: Add documentation to some fields in anv_query_pool Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31239>	2024-09-19 17:54:24 +00:00
José Roberto de Souza	dec5a624e9	anv: Check if vkCreateQueryPool() is being created in a supported queue Turns out not even VK CTS was calling vkEnumeratePhysicalDeviceQueueFamilyPerformanceQueryCountersKHR() to check if queue supports query. So here adding a explicity check in our implementation of vkCreateQueryPool(). https://github.com/KhronosGroup/VK-GL-CTS/pull/482 Cc: 24.2 <mesa-stable> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30652>	2024-09-18 15:29:16 +00:00

5889 commits