fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 09:08:07 +02:00

Author	SHA1	Message	Date
Nanley Chery	569f80f2df	anv: Reduce accesses of isl_mod_info->aux_usage This field will be replaced in an upcoming patch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120>	2023-07-20 20:53:26 +00:00
Nanley Chery	f2dab434d8	anv: Handle explicit surface layout of DG2_RC_CCS We're going to enable the DG2 modifier. Account for the reduced plane count that exists with it. Also add an assert to make it clearer that the aux in use is CCS. Otherwise, it may not be obvious because of the generic compression names being used here. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120>	2023-07-20 20:53:26 +00:00
Nanley Chery	47565d31e1	intel: Add and use isl_drm_modifier_get_plane_count We're going to enable the DG2_RC_CCS modifier in anv. Add and use this function to prepare for the new plane count that comes with that modifier. iris is left alone for now because it supports more modifiers than isl_drm_modifier_get_score is aware of. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120>	2023-07-20 20:53:26 +00:00
Nanley Chery	e50af52e3d	anv: Don't support ASTC images with modifiers Before this change, anv_get_image_format_features2 reported support for ASTC formats with any modifier (even those not supported by anv). But, we didn't intend to support that compressed image format with modifiers. With this change, the format feature function reports no support for modifiers on ASTC-formatted images. This prevents the next patch from causing assertion failures due to unsupported modifiers. Fixes: `355f318843` ("anv: Allow transfer-only linear ASTC images") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120>	2023-07-20 20:53:26 +00:00
Rohan Garg	ba071ee81c	anv: use the correct GFX_VERx10 macro for WA Fixes: `60b0d2c2cb` ("add required invalidate/flush for Wa_14014427904") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23937>	2023-07-20 20:25:12 +00:00
Rohan Garg	097f3b4a98	anv: use the WA infrastructure where possible when generating state Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23937>	2023-07-20 20:25:12 +00:00
Felix DeGrood	d04be9770b	intel/compiler: use shader source hash in shader dump code Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Felix DeGrood	6ac8a9a030	intel: use shader source hash in INTEL_MEASURE Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Felix DeGrood	124973c635	anv: Add Source hash field to VkPipelineExecutableStatisticKHR Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Felix DeGrood	b145d05381	anv: save a shader source uint32_t hash in gfx/compute pipelines Save lowest dword of shader source sha1 in pipeline object for use later as hash for uniquely identifying shader in debug outputs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Lionel Landwerlin	3384f029be	intel/compiler: rework input parameters Use a struct for various common parameters rather than per stage structure or arguments to stage specific entrypoints. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
José Roberto de Souza	6f88e3befb	anv: Add support for userptr in Xe KMD Xe KMD only requires userptr to be bound to VM, so here reusing workaround_bo->gem_handle id to all userptr bos in Xe version of gem_create_userptr(). The Xe version of gem_close() will make sure that workaround_bo->gem_handle is not closed when userptr bos are closed. With the same gem_handle for all userptr bos, it was also necessary to skip the anv_device_lookup_bo() and manually allocate memory to store anv_bo in host heap memory, which led to some small changes in anv_device_release_bo() as well. The remaining changes are the support to VM bind userptr bos and the gem_vm_bind() call in anv_device_import_bo_from_host_ptr(). Fixes: dEQP-VK.memory.external_memory_host* Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23787>	2023-07-19 22:20:24 +00:00
José Roberto de Souza	5c729cb1b8	anv: Replace handle by anv_bo in the gem_close() struct anv_bo will be needed in the next patch to properly handle closure of userptr bos. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23787>	2023-07-19 22:20:24 +00:00
José Roberto de Souza	7e7ab39424	anv: Add gem_create_userptr() to KMD backend Xe support of userptr will be implemented in the next patch, this is just moving the i915 and stub functions to KMD backend. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23787>	2023-07-19 22:20:24 +00:00
José Roberto de Souza	2a6fc690c1	anv: Use workaround framework to Wa_14016118574 Wa_14016118574 is not the lineage number for this workaround so it was updated to Wa_22014412737. Wa_22014412737 is not applicable for MTL B0 steppings and newer so using the workaround framework eliminates this pipe_control instruction for not affected revisions. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24221>	2023-07-19 14:43:44 +00:00
Iván Briano	4ad19c8310	anv: implement Wa_14019750404 Cc: 23.2 <mesa-stable> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8931 Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24150>	2023-07-18 19:14:27 +00:00
Iván Briano	7b0ded0b23	anv: ensure mesh is disabled on context init It turns out the hardware doesn't save the whole state on a context switch, as the kernel expects when it creates the golden context. For some HW units, only the state that was explicitly programmed will be part of it, so we need to make sure mesh shading is disabled on context creation, or we risk being context switched with an application that uses mesh, and when ours gets to run again, the mesh state won't be reset, and submitting a legacy 3D pipeline while the HW thinks mesh is enabled causes us to hang. Cc: 23.2 <mesa-stable> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24150>	2023-07-18 19:14:27 +00:00
Iván Briano	75990e5564	anv: ensure CFE_STATE is emitted for ray tracing pipelines Fixes sporadic failures in dEQP-VK.robustness.robustness2.*.rgen Fixes: `ecb709c853` ("anv: only emit CFE_STATE when scratch space increases") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9382 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24206>	2023-07-17 22:19:12 -07:00
Marcin Ślusarz	87dd96bbbe	anv: drop support for VK_NV_mesh_shader Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24071>	2023-07-14 08:27:14 +00:00
Lionel Landwerlin	67a8b70c57	anv: hide exec_flags selection inside the i915 backend Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24073>	2023-07-13 17:12:26 +00:00
Jordan Justen	492b07625d	anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS Cc: 23.2 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23823>	2023-07-12 23:47:25 -07:00
Marcin Ślusarz	a762fa27db	anv: limit stack usage for anv_surface_state Each one is 136 bytes. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24109>	2023-07-12 12:00:10 +00:00
Marcin Ślusarz	deaf4f2d57	anv: pass anv_surface_state using a pointer It's 136 bytes, so passing it by stack is wasteful. CID: 1531860 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24109>	2023-07-12 12:00:09 +00:00
Marcin Ślusarz	fb070b1dfd	anv: fix how NULL buffer_view is handled in anv_descriptor_set_write_buffer_view CID: 1531855 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24109>	2023-07-12 12:00:09 +00:00
Hyunjun Ko	0c778ec3c8	anv: Adds a workaround for HEVC decoding on some old platforms. HEVC support on Gfx9 is only available on VCS0. So limit the number of video queues to the first VCS engine instance. We should be able to query HEVC support from the kernel using the engine query uAPI, but this appears to be broken : https://gitlab.freedesktop.org/drm/intel/-/issues/8832 When this bug is fixed we should be able to check HEVC support to determine the correct number of queues. Closes: mesa/mesa#9172, mesa/mesa#9314 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24065>	2023-07-12 15:34:28 +09:00
Lionel Landwerlin	a85b84ba1e	anv: fix utrace signaling with Xe utrace submits can either have a batch or not. When there is a batch, the utrace vk_sync is signaled by the utrace batch (because utrace does a timestamp buffer copy using its own batch). When there is no batch, the utrace vk_sync should be signaled by the application batch (no timestamp copy required, utrace can read the timestamps when the application batch has completed). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `fdea48df5e` ("anv: Implement Xe version of anv_queue_exec_locked() and queue_exec_trace()") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24085>	2023-07-11 16:27:06 +00:00
Yonggang Luo	48a25ef700	treewide: Remove all usage of nir_builder_init with nir_builder_create and nir_builder_at Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24038>	2023-07-10 19:20:17 +00:00
Sagar Ghuge	66a6f48747	anv: Drop depth cache flush requirement after depth clear/resolve From Bspec 46959, a programming note applicable to Gfx12+: "Since HZ_OP has to be sent twice (first time set the clear/resolve state and 2nd time to clear the state), and HW internally flushes the depth cache on HZ_OP, there is no need to explicitly send a Depth Cache flush after Clear or Resolve." Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24027>	2023-07-10 18:03:39 +00:00
Jordan Justen	c328638b3b	anv: Use correct CCS0 aux-map register offset in pipe flush According to Bspec, COMPCS0_CCS_AUX_INV register offset is 042C8h and COMPCS0_AUX_TABLE_BASE_ADDR is defined to 042C0h. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23958>	2023-07-07 18:05:47 +00:00
Jordan Justen	1fb9460913	anv: Program compute aux-map base address during queue init Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23958>	2023-07-07 18:05:47 +00:00
Yonggang Luo	7471bc2574	intel/vulkan: Convert to use nir_foreach_function_impl when possible Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24040>	2023-07-07 14:02:40 +00:00
Hyunjun Ko	d0e6809ee5	anv/video: fix to support HEVC 10bit on some of 9th gens. From Broxton and Kabylake, it started supporting HEVC 10-bit decoding. Fixes: `649e12c897` ("anv_video: reject decoding of unsupported profiles and formats") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23985>	2023-07-05 00:20:18 +00:00
José Roberto de Souza	59aa49494c	anv: Drop unnecessary intel_canonical_address() calls around bo->offset bo->offset is set as canonical address no need to do it over again. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23977>	2023-07-04 15:24:04 +00:00
José Roberto de Souza	27e20c8726	anv: Drop unnecessary intel_canonical_address() call around anv_address_physical() anv_address_physical() already returns a canonical address. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23977>	2023-07-04 15:24:04 +00:00
José Roberto de Souza	2fa4fe2c85	anv: Fix some mismatches of canonical and regular addresses around anv_bo_vma_alloc_or_close() anv_vma_alloc() returns a canonical address, but explicit_address is a regular address. This mismatch can potentially cause issues. So here making bo->offset as always canonical address by converting it in the explicit case and fixing the only caller that was calling anv_bo_vma_alloc_or_close() with a canonical address. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23977>	2023-07-04 15:24:04 +00:00
Marcin Ślusarz	1ac1d5d62e	anv,intel/compiler: enable shortcut in wg id to wg idx lowering on >= gfx12.5 This speeds up vk_meshlet_cadscene in "VK mesh ext" renderer by 1.4% Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22334>	2023-07-04 09:15:08 +00:00
Lynne	649e12c897	anv_video: reject decoding of unsupported profiles and formats Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23954>	2023-07-03 23:48:48 +00:00
José Roberto de Souza	c142736f52	anv: Fix compute maximum number of threads value There is no mention in spec about subtract one of the number of threads, also Iris and blorp code don't subtract. Alchemist PRMs: Volume 2a: Command Reference: Instructions: CFE_STATE: Maximum Number of Threads: Normally set to the maximum number of threads: (# EUs) * (# threads/EU) Cc: mesa-stable Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23973>	2023-07-03 22:53:49 +00:00
Konstantin Seurer	05269047d3	intel: Use nir_builder_at Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23883>	2023-07-03 15:21:38 +00:00
Rohan Garg	feea00a6c4	anv: retry batchbuffer submission with i915 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23950>	2023-06-30 19:51:33 +00:00
Iván Briano	bafbfc57ea	anv: flush data cache before emitting availability Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23814>	2023-06-29 22:11:35 +00:00
Rohan Garg	4f3890dd87	anv: move WA 1607854226 to use the WA infrastructure Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23929>	2023-06-29 16:22:59 +00:00
Lionel Landwerlin	2e8c0a33e7	anv: implement storage image depth query using descriptor buffer read The HW not returning the depth value we would like for VK_EXT_sliced_view_of_3d, we can pull that value by reading the RENDER_SURFACE_STATE struct directly. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23868>	2023-06-29 10:32:20 +00:00
Lionel Landwerlin	a1fda29bd1	anv: look into batch bo reloc list looking for BOs to decode On DG2 I ran into a case where the surface state was not being decoded with INTEL_DEBUG=bat. This is because the surface states are not part of a state pool there anymore. Instead BO are allocate manually and placed in vma heap. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `96c33fb027` ("anv: enable direct descriptors on platforms with extended bindless offset") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23891>	2023-06-29 09:24:07 +00:00
Erik Faye-Lund	6520b3e726	anv: use imm-helpers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23855>	2023-06-29 07:08:19 +00:00
Jordan Justen	463bf13411	anv: Use set PAT extension on BO creation for MTL Reworks: * Drop local pat_index var (suggested by José) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22878>	2023-06-27 22:06:19 +00:00
Francisco Jerez	fce905f613	anv: Swap ordering of memory types on non-LLC platforms to work around application bugs. The Vulkan specification indicates that if memory types have properties which are a strict subset of another type's, then they should appear before that memory type. Otherwise the specification does not require a specific ordering of memory types. But, it appears that Aztec Ruins and the Vulkan CTS make an assumption that the first host-accessible memory type is host-coherent and select it when they expect data written by the CPU to become visible without calling vkFlushMappedMemoryRanges(), even though flushing is required by the spec, which leads to misrendering and hangs on MTL platforms. We found that other drivers also put a host-coherent, but not cached memory type as the first host-accessible memory type, so let's do the same in order to match the expectations of such broken applications. Host-coherent uncached memory types are currently implemented with a WC CPU map on non-LLC platforms, so there shouldn't be a huge performance penalty from this: If an application intends to do heavy R/W CPU access on a memory range it's expected to loop over the available memory types and select one marked as host-cached -- If an application fails to do that and simply selects the first available type it seems more robust to stay on the safe side and give them a host-coherent type rather than a cached one. Rework: * Jordan: Add initial explanation to body of commit message. * Curro: Add additional comments to commit message. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22878>	2023-06-27 22:06:19 +00:00
Jordan Justen	a831ee51ae	anv: Flush untyped dataport cache when DC flush is requested on compute Although the following is based on observations for OpenGL, we probably need this for Vulkan as well. KHR-GL46.texture_buffer.texture_buffer_operations_ssbo_writes writes to an SSBO in a compute program, then issues a memory-barrier, which causes us to add a DC-flush. Then a second compute program samples from the SSBO written by the first compute program. Although we expected the DC-flush to make the writes available to the second compute program, on MTL this wasn't the case. Adding the "Untyped Data-Port Cache Flush" fixes this. The PRM indicates that compute programs must set "Untyped Data-Port Cache Flush" to flush some LSC writes when flushing HDC. Although we are setting DC-flush, and not HDC-flush, it does appear that the following reference might also apply to DC-flush. In the Intel(R) Arc(tm) A-Series Graphics and Intel Data Center GPU Flex Series Open-Source Programmer's Reference Manual, Vol 2a: Command Reference: Instructions, PIPE_CONTROL, HDC Pipeline Flush (DWord 0, Bit 9), there is a programming note: > When the "Pipeline Select" mode is set to "GPGPU", the LSC Untyped > L1 cache flush is controlled by "Untyped Data-Port Cache Flush" bit > in the PIPE_CONTROL command. Ref: `a8108f1d44` ("anv: Add missing untyped data port flush on PIPELINE_SELECT") Ref: `bd8e8d204d` ("iris: Add missing untyped data port flush on PIPELINE_SELECT") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23176>	2023-06-27 20:56:28 +00:00
Jordan Justen	215c6c6ce4	anv: Flush untyped dataport cache when HDC flush is requested on compute In the Intel(R) Arc(tm) A-Series Graphics and Intel Data Center GPU Flex Series Open-Source Programmer's Reference Manual, Vol 2a: Command Reference: Instructions, PIPE_CONTROL, HDC Pipeline Flush (DWord 0, Bit 9), there is a programming note: > When the "Pipeline Select" mode is set to "GPGPU", the LSC Untyped > L1 cache flush is controlled by "Untyped Data-Port Cache Flush" bit > in the PIPE_CONTROL command. Ref: `a8108f1d44` ("anv: Add missing untyped data port flush on PIPELINE_SELECT") Ref: `bd8e8d204d` ("iris: Add missing untyped data port flush on PIPELINE_SELECT") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23176>	2023-06-27 20:56:28 +00:00
Jordan Justen	c5ca2bed51	anv: Clear untyped dataport cache flush bit if not in GPGPU mode This should be equivalent, but refactoring the code will allow the next two patches to use an else block for this check. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23176>	2023-06-27 20:56:28 +00:00
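
The canonical-address commits listed above (2fa4fe2c85, 27e20c8726, 59aa49494c) turn on the difference between a regular GPU virtual address and its canonical form. The following is a minimal sketch of that conversion, not the Mesa implementation. It assumes 48-bit virtual addresses in which bit 47 is sign-extended into bits 63:48, and the helper names `sketch_canonical_address` and `sketch_regular_address` are hypothetical.

```c
#include <stdint.h>

/* Assumption: 48-bit virtual address space; bits 63:48 mirror bit 47. */
#define SKETCH_VA_BITS 48
#define SKETCH_VA_MASK ((UINT64_C(1) << SKETCH_VA_BITS) - 1)

/* Hypothetical helper: regular address -> canonical address.
 * If bit 47 is set, the upper 16 bits are filled with ones;
 * otherwise they are cleared. */
static inline uint64_t
sketch_canonical_address(uint64_t addr)
{
   uint64_t low = addr & SKETCH_VA_MASK;
   uint64_t sign = UINT64_C(1) << (SKETCH_VA_BITS - 1);
   return (low & sign) ? (low | ~SKETCH_VA_MASK) : low;
}

/* Hypothetical helper: canonical address -> regular address.
 * Simply drops the sign-extended upper bits. */
static inline uint64_t
sketch_regular_address(uint64_t addr)
{
   return addr & SKETCH_VA_MASK;
}
```

Under these assumptions the conversion is idempotent, which is consistent with the commit messages describing a second conversion of an already-canonical `bo->offset` as unnecessary. Mixing the two forms only matters for addresses in the upper half of the 48-bit range, where the canonical and regular values differ.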


4739 commits