fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 06:58:16 +02:00

Author	SHA1	Message	Date
José Roberto de Souza	2a6fc690c1	anv: Use workaround framework to Wa_14016118574 Wa_14016118574 is not the lineage number for this workaround so it was updated to Wa_22014412737. Wa_22014412737 is not applicable for MTL B0 steppings and newer so using the workaround framework eliminates this pipe_control instruction for not affected revisions. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24221>	2023-07-19 14:43:44 +00:00
Iván Briano	4ad19c8310	anv: implement Wa_14019750404 Cc: 23.2 <mesa-stable> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8931 Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24150>	2023-07-18 19:14:27 +00:00
Iván Briano	7b0ded0b23	anv: ensure mesh is disabled on context init It turns out the hardware doesn't save the whole state on a context switch, as the kernel expects when it creates the golden context. For some HW units, only the state that was explicitly programmed will be part of it, so we need to make sure mesh shading is disabled on context creation, or we risk being context switched with an application that uses mesh, and when ours gets to run again, the mesh state won't be reset, and submitting a legacy 3D pipeline while the HW thinks mesh is enabled causes us to hang. Cc: 23.2 <mesa-stable> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24150>	2023-07-18 19:14:27 +00:00
Iván Briano	75990e5564	anv: ensure CFE_STATE is emitted for ray tracing pipelines Fixes sporadic failures in dEQP-VK.robustness.robustness2.*.rgen Fixes: `ecb709c853` ("anv: only emit CFE_STATE when scratch space increases") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9382 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24206>	2023-07-17 22:19:12 -07:00
Marcin Ślusarz	87dd96bbbe	anv: drop support for VK_NV_mesh_shader Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24071>	2023-07-14 08:27:14 +00:00
Lionel Landwerlin	67a8b70c57	anv: hide exec_flags selection inside the i915 backend Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24073>	2023-07-13 17:12:26 +00:00
Jordan Justen	492b07625d	anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS Cc: 23.2 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23823>	2023-07-12 23:47:25 -07:00
Marcin Ślusarz	a762fa27db	anv: limit stack usage for anv_surface_state Each one is 136 bytes. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24109>	2023-07-12 12:00:10 +00:00
Marcin Ślusarz	deaf4f2d57	anv: pass anv_surface_state using a pointer It's 136 bytes, so passing it by stack is wasteful. CID: 1531860 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24109>	2023-07-12 12:00:09 +00:00
Marcin Ślusarz	fb070b1dfd	anv: fix how NULL buffer_view is handled in anv_descriptor_set_write_buffer_view CID: 1531855 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24109>	2023-07-12 12:00:09 +00:00
Hyunjun Ko	0c778ec3c8	anv: Adds a workaround for HEVC decoding on some old platforms. HEVC support on Gfx9 is only available on VCS0. So limit the number of video queues to the first VCS engine instance. We should be able to query HEVC support from the kernel using the engine query uAPI, but this appears to be broken : https://gitlab.freedesktop.org/drm/intel/-/issues/8832 When this bug is fixed we should be able to check HEVC support to determine the correct number of queues. Closes: mesa/mesa#9172, mesa/mesa#9314 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24065>	2023-07-12 15:34:28 +09:00
Lionel Landwerlin	a85b84ba1e	anv: fix utrace signaling with Xe utrace submits can either have a batch or not. When there is a batch, the utrace vk_sync is signaled by the utrace batch (because utrace does a timestamp buffer copy using its own batch). When there is no batch, the utrace vk_sync should be signaled by the application batch (no timestamp copy required, utrace can read the timestamps when the application batch has completed). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `fdea48df5e` ("anv: Implement Xe version of anv_queue_exec_locked() and queue_exec_trace()") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24085>	2023-07-11 16:27:06 +00:00
Yonggang Luo	48a25ef700	treewide: Remove all usage of nir_builder_init with nir_builder_create and nir_builder_at Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24038>	2023-07-10 19:20:17 +00:00
Sagar Ghuge	66a6f48747	anv: Drop depth cache flush requirement after depth clear/resolve From Bspec 46959, a programming note applicable to Gfx12+: "Since HZ_OP has to be sent twice (first time set the clear/resolve state and 2nd time to clear the state), and HW internally flushes the depth cache on HZ_OP, there is no need to explicitly send a Depth Cache flush after Clear or Resolve." Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24027>	2023-07-10 18:03:39 +00:00
Jordan Justen	c328638b3b	anv: Use correct CCS0 aux-map register offset in pipe flush According to Bspec, COMPCS0_CCS_AUX_INV register offset is 042C8h and COMPCS0_AUX_TABLE_BASE_ADDR is defined to 042C0h. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23958>	2023-07-07 18:05:47 +00:00
Jordan Justen	1fb9460913	anv: Program compute aux-map base address during queue init Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23958>	2023-07-07 18:05:47 +00:00
Yonggang Luo	7471bc2574	intel/vulkan: Convert to use nir_foreach_function_impl when possible Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24040>	2023-07-07 14:02:40 +00:00
Hyunjun Ko	d0e6809ee5	anv/video: fix to support HEVC 10bit on some of 9th gens. From Broxton and Kabylake, it started supporting HEVC 10-bit decoding. Fixes: `649e12c897` ("anv_video: reject decoding of unsupported profiles and formats") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23985>	2023-07-05 00:20:18 +00:00
José Roberto de Souza	59aa49494c	anv: Drop unnecessary intel_canonical_address() calls around bo->offset bo->offset is set as canonical address no need to do it over again. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23977>	2023-07-04 15:24:04 +00:00
José Roberto de Souza	27e20c8726	anv: Drop unnecessary intel_canonical_address() call around anv_address_physical() anv_address_physical() already returns a canonical address. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23977>	2023-07-04 15:24:04 +00:00
José Roberto de Souza	2fa4fe2c85	anv: Fix some mismatches of canonical and regular addresses around anv_bo_vma_alloc_or_close() anv_vma_alloc() returns a canonical address, but explicit_address is a regular address. This mismatch can potentially cause issues. So here making bo->offset as always canonical address by converting it in the explicit case and fixing the only caller that was caling anv_bo_vma_alloc_or_close() with a canonical address. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23977>	2023-07-04 15:24:04 +00:00
Marcin Ślusarz	1ac1d5d62e	anv,intel/compiler: enable shortcut in wg id to wg idx lowering on >= gfx12.5 This speeds up vk_meshlet_cadscene in "VK mesh ext" renderer by 1.4% Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22334>	2023-07-04 09:15:08 +00:00
Lynne	649e12c897	anv_video: reject decoding of unsupported profiles and formats Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23954>	2023-07-03 23:48:48 +00:00
José Roberto de Souza	c142736f52	anv: Fix compute maximum number of threads value There is no mention in spec about subtract one of the number of threads, also Iris and blorp code don't subtract. Alchemist PRMs: Volume 2a: Command Reference: Instructions: CFE_STATE: Maximum Number of Threads: Normally set to the maximum number of threads: (# EUs) * (# threads/EU) Cc: mesa-stable Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23973>	2023-07-03 22:53:49 +00:00
Konstantin Seurer	05269047d3	intel: Use nir_builder_at Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23883>	2023-07-03 15:21:38 +00:00
Rohan Garg	feea00a6c4	anv: retry batchbuffer submission with i915 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23950>	2023-06-30 19:51:33 +00:00
Iván Briano	bafbfc57ea	anv: flush data cache before emitting availability Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23814>	2023-06-29 22:11:35 +00:00
Rohan Garg	4f3890dd87	anv: move WA 1607854226 to use the WA infrastructure Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23929>	2023-06-29 16:22:59 +00:00
Lionel Landwerlin	2e8c0a33e7	anv: implement storage image depth query using descriptor buffer read The HW not returning the depth value we would like for VK_EXT_sliced_view_of_3d, we can pull that value by reading the RENDER_SURFACE_STATE struct directly. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23868>	2023-06-29 10:32:20 +00:00
Lionel Landwerlin	a1fda29bd1	anv: look into batch bo reloc list looking for BOs to decode On DG2 I ran into a case where the surface state was not being decoded with INTEL_DEBUG=bat. This is because the surface states are not part of a state pool there anymore. Instead BO are allocate manually and placed in vma heap. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `96c33fb027` ("anv: enable direct descriptors on platforms with extended bindless offset") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23891>	2023-06-29 09:24:07 +00:00
Erik Faye-Lund	6520b3e726	anv: use imm-helpers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23855>	2023-06-29 07:08:19 +00:00
Jordan Justen	463bf13411	anv: Use set PAT extension on BO creation for MTL Reworks: * Drop local pat_index var (suggested by José) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22878>	2023-06-27 22:06:19 +00:00
Francisco Jerez	fce905f613	anv: Swap ordering of memory types on non-LLC platforms to work around application bugs. The Vulkan specification indicates that if memory types have properties which are a strict subset of another type's, then they should appear before that memory type. Otherwise the specification does not require a specific ordering of memory types. But, it appears that Aztec Ruins and the Vulkan CTS make an assumption that the first host-accessible memory type is host-coherent and select it when they expect data written by the CPU to become visible without calling vkFlushMappedMemoryRanges(), even though flushing is required by the spec, which leads to misrendering and hangs on MTL platforms. We found that other drivers also put a host-coherent, but not cached memory type as the first host-accessible memory type, so let's do the same in order to match the expectations of such broken applications. Host-coherent uncached memory types are currently implemented with a WC CPU map on non-LLC platforms, so there shouldn't be a huge performance penalty from this: If an application intends to do heavy R/W CPU access on a memory range it's expected to loop over the available memory types and select one marked as host-cached -- If an application fails to do that and simply selects the first available type it seems more robust to stay on the safe side and give them a host-coherent type rather than a cached one. Rework: * Jordan: Add initial explanation to body of commmit message. * Curro: Add additional comments to commit message. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22878>	2023-06-27 22:06:19 +00:00
Jordan Justen	a831ee51ae	anv: Flush untyped dataport cache DC flush is requested on compute Although the following is based on this observations for OpenGL, we probably need this for Vulkan as well. KHR-GL46.texture_buffer.texture_buffer_operations_ssbo_writes writes to an SSBO in a compute program, then issues a memory-barrier, which causes us to add a DC-flush. Then a second compute program samples from the SSBO written by the first compute program. Although we expected the DC-flush to make the writes available to the second compute program, on MTL this wasn't the case. Adding the "Untyped Data-Port Cache Flush" fixes this. The PRM indicates that compute programs must set "Untyped Data-Port Cache Flush" to flush some LSC writes when flushing HDC. Although we are setting DC-flush, and not HDC-flush, it does appear that the following reference might also apply to DC-flush. In the Intel(R) Arc(tm) A-Series Graphics and Intel Data Center GPU Flex Series Open-Source Programmer's Reference Manual, Vol 2a: Command Reference: Instructions, PIPE_CONTROL, HDC Pipeline Flush (DWord 0, Bit 9), there is a programming note: > When the "Pipeline Select" mode is set to "GPGPU", the LSC Untyped > L1 cache flush is controlled by "Untyped Data-Port Cache Flush" bit > in the PIPE_CONTROL command. Ref: `a8108f1d44` ("anv: Add missing untyped data port flush on PIPELINE_SELECT") Ref: `bd8e8d204d` ("iris: Add missing untyped data port flush on PIPELINE_SELECT") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23176>	2023-06-27 20:56:28 +00:00
Jordan Justen	215c6c6ce4	anv: Flush untyped dataport cache when HDC flush is requested on compute In the Intel(R) Arc(tm) A-Series Graphics and Intel Data Center GPU Flex Series Open-Source Programmer's Reference Manual, Vol 2a: Command Reference: Instructions, PIPE_CONTROL, HDC Pipeline Flush (DWord 0, Bit 9), there is a programming note: > When the "Pipeline Select" mode is set to "GPGPU", the LSC Untyped > L1 cache flush is controlled by "Untyped Data-Port Cache Flush" bit > in the PIPE_CONTROL command. Ref: `a8108f1d44` ("anv: Add missing untyped data port flush on PIPELINE_SELECT") Ref: `bd8e8d204d` ("iris: Add missing untyped data port flush on PIPELINE_SELECT") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23176>	2023-06-27 20:56:28 +00:00
Jordan Justen	c5ca2bed51	anv: Clear untyped dataport cache flush bit if not in GPGPU mode This should be equivalent, but refactoring the code will allow the next two patches to use an else block for this check. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23176>	2023-06-27 20:56:28 +00:00
Alyssa Rosenzweig	173b9ee69a	treewide: Use nir_builder_create more perl -p0e 's/nir_builder_init\(&([^,]*), /\1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	815efcdf7e	nir: Use nir_builder_create perl -p0e 's/nir_builder ([^;]);\snir_builder_init\(&\1, /nir_builder \1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Konstantin Seurer	8f3db26d14	intel: Use nir_ instead of nir_build_ helpers Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23858>	2023-06-27 17:37:54 +00:00
Sagar Ghuge	957d7644aa	intel/ds: Track CCS cache flush bit Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23786>	2023-06-26 16:08:20 -07:00
Sagar Ghuge	e88eac5b6c	anv: implement recommended flush/wait of AUX-TT invalidation on compute This patch implements the recommended flush/wait of AUX-TT invalidation for compute/render command streamer. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23786>	2023-06-26 15:57:39 -07:00
Sagar Ghuge	012ff791fb	anv: Fix AUX-TT invalidation In order to make sure RCS engine is idle, we need to add DC flush + CS stall + Render target Cache flush + Depth Cache on Gfx 12 and additional CCS cache flush on Gfx12.5. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23786>	2023-06-26 15:57:39 -07:00
Sagar Ghuge	0b42a6c3b5	anv: Add CCS cache flush bits to anv_pipe_bits This will help us to flush the entries out of the CCS cache. v2: - Move enum value close to HW bits section (Lionel) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23786>	2023-06-26 15:57:39 -07:00
Hyunjun Ko	9f4299d6b2	anv: fix to set predicted weight tables correctly. Fixes: `8d519eb5f` ("anv: add initial video decode support for h265") Closes: mesa/mesa#9214 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23790>	2023-06-26 15:08:05 +09:00
Matt Turner	561cce32f1	anv: Only expose video decode bits with KHR_video_decode_queue This fixes dEQP-VK.api.info.format_properties.g8_b8r8_2plane_420_unorm in combination with the CTS fix from https://gerrit.khronos.org/c/vk-gl-cts/+/12191 Fixes: `9361481780` ("anv: add video format features for the one supported video output format") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8263 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23776>	2023-06-24 02:54:37 +00:00
Matt Turner	727335045d	anv: Pipe anv_physical_device to anv_get_image_format_features2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23776>	2023-06-24 02:54:37 +00:00
Kenneth Graunke	1b3669a1ed	intel: Initialize FF_MODE2 on all Gfx12 platforms On Alchemist, the FF_MODE2 documentation says that we must set the FF_MODE2 timer values for GS and HS to 224. The hardware performance tuning guide also recommends setting the TDS timer to 4. On Tigerlake, i915 applies workarounds to set the GS timer to 224 (failing to do so can cause HS/DS unit hangs), and the TDS timer to 4 (for performance). It doesn't currently apply a HS timer there, and I'm not sure if it's strictly necessary, but given that Alchemist needed it, and the other two settings matched, let's assume that it ought to match as well. Unfortunately, there has been a bug in the i915 workarounds infrastructure for non-masked context registers where writing one field of the register zeroes out all the others. So, I believe the Tigerlake TDS timer value of 4 isn't being applied correctly there, though the register is also not readable on that platform which makes it hard to verify. So, this may also speed up tessellation. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9233 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23839>	2023-06-24 01:20:36 +00:00
Francisco Jerez	427fee3507	intel/gfx12.5: Enable L3 partial write merging for compressible surfaces among other cases. This enables L3 partial write merging for a number of cases that seem to be getting accidentally disabled by the kernel, which was causing a serious performance bottleneck on DG2 and MTL platforms. The "Compressible Partial Write Merge Enable", "Coherent Partial Write Merge Enable" and "Cross-Tile Partial Write Merge Enable" bits in L3SQCREG5 were expected to be enabled by default (and confusingly, they even read off as enabled if you ran 'intel_reg read 0xb158' on an idle system), but they are getting clobbered during 3D context initialization by an i915 workaround. Enabling L3 partial write merging of compressible surfaces in particular seems to increase rendering fillrate by over 3x in some cases (e.g. the "VulkanFillRate/FillRateGPU/resolution:1[0-3]/format:*/blend:0" fillrate-bound microbenchmarks). Significant improvements can also be reproduced in most real-world workloads we've tested so far, e.g. Counter Strike GO improves by ~11%, Shadow Of the Tomb Raider improves by ~5.5%, and AztecRuins-VK improves by ~6.5% on DG2-512 -- Thanks a lot to Caleb Callaway for these figures. No regressions have been observed so far. Even though this patch might strike as surprisingly simple for such a large payoff, it's the result of Felix DeGrood and I trying to root-cause the rendering performance gap of DG2 on Linux vs Windows on and off during the last year, and some of the OA statistics captured by Felix early this month were greatly helpful for me to connect the last few dots, so Felix deserves a big chunk of the credit for this work. Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23783>	2023-06-23 21:24:27 +00:00
Caio Oliveira	dc93f205c1	meson: Explicitly add "check : false" to a couple instances of run_command In both cases there's code right after the execution to check the result and give a proper message. This gets rid of meson warning ``` WARNING: You should add the boolean check kwarg to the run_command call. It currently defaults to false, but it will default to true in future releases of meson. See also: https://github.com/mesonbuild/meson/issues/9300 ``` Reviewed-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23821>	2023-06-23 18:57:31 +00:00
Lionel Landwerlin	a13ac83f1b	anv: fix utrace batch allocation The introduction of a workaround adding lots of MI_NOOPs broke our computation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b9aa66d5d0` ("anv: disable preemption for 3DPRIMITIVE during streamout") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23792>	2023-06-23 11:26:27 +00:00

1 2 3 4 5 ...

4725 commits