fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 04:58:08 +02:00

Author	SHA1	Message	Date
Sviatoslav Peleshko	090dbbc995	anv: Add full subgroups workaround for the shaders that use shared memory This workaround is similar to anv_assume_full_subgroups, but it applies to the shaders that use shared memory. If they rely on the implicit synchronization, and we choose a smaller group size than the (broken) shader expects, it will produce incorrect results. Cc: mesa-stable Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23408> (cherry picked from commit `369aec5704`)	2025-03-15 09:49:03 +01:00
Hyunjun Ko	0ea91330c3	anv: Do not support the tiling of DRM modifier if DECODE_DST Fixes: `04709e4f` ("anv: fix video profile lists"); Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33784> (cherry picked from commit `f7ff9b240d`)	2025-03-03 17:25:22 +01:00
Kevin Chuang	f912436dc9	anv/bvh: Fix copy shader handling sparse buffer Fixes: `692b5fa9f2` ("anv: Add shader to copy acceleration structures") This commit fixes the future test "sparse_binding_structures" for "header_bottom_address" for ray tracing pipeline. Even on 48-bit ray tracing (Xe1/2), the software-defined part instance_leaf_part1.bvh_ptr has to be in canonical form for copy.comp to dereference a bvh, which means we have to preserve the upper 16 bits. This is especially relevant in cases where the acceleration structure buffer is located high, such as sparse buffer. Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33745> (cherry picked from commit `87ff7b061f`)	2025-03-03 17:25:16 +01:00
Kevin Chuang	614dd4999c	anv/bvh: Fix encoder handling sparse buffer Fixes: `2fe57947e3` ("anv: Implement encode shader to fit in ANV BVH") This commit resolves the failures in the future tests "sparse_binding_structures" for rayquery. Sparse buffers' heaps are located high, and since their addresses are in canonical form, the upper 16 bits are all set to 1. However, the existing encoder did not expect any non-zero values in the upper 16 bits. As a result, the instance flags got corrupted, causing most triangle tests to fail. Thanks to Paulo for providing insights about sparse buffer properties. Co-developed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33745> (cherry picked from commit `b9a980ea73`)	2025-03-03 17:25:14 +01:00
Lionel Landwerlin	3630721dc8	anv: fix missing 3DSTATE_PS:Kernel0MaximumPolysperThread programming Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `815d2e3e8b` ("anv: move 3DSTATE_PS to partial packing") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33712> (cherry picked from commit `91f36ba5b6`)	2025-02-28 22:17:35 +01:00
Tapani Pälli	3194cae6d0	anv: apply cache flushes on pipeline select with gfx20 This fixes rendering artifacts seen with Hogwarts Legacy and Black Myth Wukong. Assumption is that we can get rid of these flushes once RESOURCE_BARRIER work lands but until then we need them. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12540 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12489 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33397> (cherry picked from commit `765f3b78d5`)	2025-02-18 22:46:07 +01:00
Tapani Pälli	961a3fc760	anv: tighten condition for changing barrier layouts Assertion (or attempting the layout change) is causing crash when launching Steel Rats. Tighten the condition for change so that it should affect only when runtime has made changes. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12602 Fixes: `eed788213b` ("anv: ensure consistent layout transitions in render passes") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33523> (cherry picked from commit `d8381415a6`)	2025-02-18 22:46:01 +01:00
Lionel Landwerlin	e2232c0be4	anv: ensure Wa_16012775297 interacts correctly with Wa_18020335297 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `dddd765553` ("anv: implement VF_STATISTICS emit for Wa_16012775297") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32418> (cherry picked from commit `6b99bf76ca`)	2025-02-15 00:02:54 +01:00
Lionel Landwerlin	399de9dd00	anv: disable VF statistics for memcpy Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32418> (cherry picked from commit `462d8e3fab`)	2025-02-15 00:02:53 +01:00
Ernst Persson	26ad2f9149	intel/vulkan: Add bvh build dependency Fixes: `41baeb3810` ("anv: Implement acceleration structure API") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12558 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33333> (cherry picked from commit `c64871accc`)	2025-02-04 20:47:26 +01:00
Hyunjun Ko	cd4ffc319f	anv: Fix to set CDEF filter flag correctly for AV1 decoding and relevant tiny clean-up. Fixes: `8432b8b282` ("anv: add initial support for AV1 decoding") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33316> (cherry picked from commit `52d9edbf05`)	2025-02-04 20:47:26 +01:00
Francisco Jerez	d455d5d86c	anv/xe3+: Enable VRT. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	dd1712515b	anv/xe3+: Set RegistersPerThread for bindless shader dispatch. v2: Use MOV and wrap in conditional during BTD spawn header setup (Lionel). Remove references to SIMD8 (Tapani). v3: Update brw_bsr() to specify number of registers per thread, don't initialize Registers Per Thread on BTD spawn header (Lionel). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	b25d0f899b	anv/xe3+: Set RegistersPerThread during shader state setup based on prog_data. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	a736757275	anv/gfx12.5: Request subgroup size 8 for RT trampoline shader. The 16-wide variant of the trampoline shader doesn't appear to work and would be inadvertently enabled by this series on Gfx12.5. Set the required subgroup size to avoid changing current behavior. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Lionel Landwerlin	ff9cf7a222	anv: reduce alignment for small heaps Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33240>	2025-01-29 17:33:13 +00:00
Lionel Landwerlin	4434b0799b	anv: dirty pipeline & push constants after internal CS shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7ca5c84804` ("anv: add support for simple internal compute shaders") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33280>	2025-01-29 15:25:43 +00:00
Lionel Landwerlin	524dab2b10	anv: expose A4B4G4R4_UNORM_PACK16 support with CBCWF is disabled Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12511 Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33194>	2025-01-29 13:57:26 +00:00
Lionel Landwerlin	7fab8675a6	anv: add a drirc to disable border colors without format Disable it by default on Android. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33194>	2025-01-29 13:57:26 +00:00
Lionel Landwerlin	c2c3f19e88	anv: pass physical device to format helpers So that we can have special behavior based on drirc configuration. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33194>	2025-01-29 13:57:26 +00:00
Lionel Landwerlin	eb0c2d8f33	anv: use flags for format capabilities Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33194>	2025-01-29 13:57:26 +00:00
Iván Briano	c3dea47be8	anv: disable logic op for float/srgb formats Fixes new tests: dEQP-VK.pipeline..logic_op_na_formats. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33250>	2025-01-29 08:02:21 +00:00
Alyssa Rosenzweig	164a161279	meson: project-wide fs = import('fs') reduces a bit of boilerplate. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33242>	2025-01-28 23:01:32 +00:00
Eric Engestrom	fa67ab5525	anv,gfxstream,panvk,zink: update urls to vulkan docs This is simply following the redirects the same way the browser does. The new pages were manually verified to still contain the corresponding information. For URLs where this was not the case, see the next commits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33159>	2025-01-28 14:28:58 +00:00
Lionel Landwerlin	6768eb31e5	intel: rework CL pre-compile Stolen from asahi_clc :) We drop the nasty LLVM17+ workaround code (Thanks Alyssa!) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Dylan Baker <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33014>	2025-01-25 03:28:07 +00:00
Lionel Landwerlin	db11165c07	intel/cl: switch to SPIRV as shader storage Effectively making intel-clc not needed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Tested-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Dylan Baker <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33014>	2025-01-25 03:28:07 +00:00
Lionel Landwerlin	7ddb49653d	anv/brw: rework primitive count writing Instead of the complicated logic we currently have, do this: We start with this shader: int main() { ... if (...) { SetMeshOutputsEXT(0, 0); return; } else { SetMeshOutputsEXT(...); } ... } We turn it into this: int main() { uint __temp_prim_count = 0; ... if (...) { __temp_prim_count = 0; return; } else { __temp_prim_count = ...; } ... if (is_first_group_lane()) { SetMeshOutputsEXT(..., __temp_prim_count); } } This works because the SPIRV spec says this: "The arguments are taken from the first invocation in each workgroup. Any invocation must execute this instruction no more than once and under uniform control flow." Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12388 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33038>	2025-01-24 10:19:28 +00:00
Lionel Landwerlin	4cc847cfd4	anv/Wa_18019110168: copy the primitive count writes That way we don't have to lower the set_vertex_and_primitive_count intrinsic before applying this WA. Cc stable for the next patches that are fixing something. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33038>	2025-01-24 10:19:28 +00:00
Rhys Perry	0eb5f66660	nir/validate: validate ssa dominance by default This no longer modifies dominance metadata, so enable it by default. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32005>	2025-01-23 23:35:44 +00:00
José Roberto de Souza	e9f4458c37	anv: Allow WSI blit_src Image to be kept compressed when transitioning to VK_IMAGE_LAYOUT_PRESENT_SRC_KHR When WSI is working in prime/dma-buf mode, it has one additional VkBuffer or VkImage to which the main VkImage is copied without any compression or any tiling other than linear. The batch buffer to do this copy is created in wsi_finish_create_blit_context(). It performs a barrier transitioning the VkImage to VK_IMAGE_LAYOUT_TRANSFER_SRC_OPTIMAL, performs the copy, and then transitions it back to VK_IMAGE_LAYOUT_PRESENT_SRC_KHR. However, in this prime/dma-buf mode, no display modifiers are involved, which causes compression to be disabled when switching to VK_IMAGE_LAYOUT_PRESENT_SRC_KHR. This change adds an exception to allow the VkImage to remain compressed because we can handle the compressed-to-uncompressed copy. Doing so fixes an issue that was reported with BMG + integrated GPU and should also improve performance by keeping the VkImage compressed. Cc: mesa-stable Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12354 Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33044>	2025-01-23 18:27:31 +00:00
José Roberto de Souza	5a37467cfd	anv: Return scanout PAT entry for scanout and external buffers in discrete GPUs Without this, scanout and external buffers will be allocated as WB, which will fail allocation if DRM_XE_GEM_CREATE_FLAG_SCANOUT is set, or they will use WC but not the special PAT entry for scanout. Cc: mesa-stable Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33044>	2025-01-23 18:27:31 +00:00
Lionel Landwerlin	9ea04a1a53	anv: don't look at pipelines to figure out CPS values Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33170>	2025-01-23 17:13:54 +00:00
Tapani Pälli	e85646eace	anv: set dependency between SF_CLIP and CC_PTR states Fixes flickering seen in Cyberpunk 2077, Supraland and some other game workloads. cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12494 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12504 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12453 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33163>	2025-01-23 16:26:24 +00:00
Connor Abbott	987e499253	anv: Delete acceleration structure stubs These are now provided by common code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33153>	2025-01-23 05:16:58 +00:00
Lionel Landwerlin	f96e95fcc9	anv: remove print lowering This is handled by the back compiler. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:46 +00:00
Lionel Landwerlin	e1074f5bd4	anv: update debug printf example code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Lionel Landwerlin	58a3ef4160	anv: handle printf buffer size relocations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	e1368f0a30	nir,util: move printf serializing into util there's nothing NIR specific here and these routines will be useful otherwise. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Alyssa Rosenzweig	e7a1d704d0	intel: set max_buffer_size to nir_lower_printf instead of relying on an implicit value which doesn't make much sense. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33067>	2025-01-17 18:09:45 +00:00
Nanley Chery	15e23f3781	anv: Limit slow clear heuristic to ACM and prior It hasn't been tuned for Xe2. Fixes: `052d7e1a9c` ("anv: Slow clear if fast-clear cost is not mitigated") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33035>	2025-01-15 15:43:19 +00:00
Nanley Chery	caf007ff27	anv: Drop can_fast_clear_with_non_zero_color() This got dropped during a rebase. Fixes: `35f02d8f36` ("anv: Inline can_fast_clear_with_non_zero_color") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33035>	2025-01-15 15:43:19 +00:00
Matthew Brost	2a053b2e60	anv/xe: Bind queue per anv_queue The Xe uAPI is designed to use bind queues such that binds without input dependencies (sync objects) do not block on binds with input dependencies. For example: - Bind A (sparse) is submitted with a list of input dependencies. - Bind B (immediate) is subsequently submitted without a list of input dependencies. If Bind A and Bind B share a single bind queue, Bind B will not be scheduled until Bind A completes. Using individual bind queues decouples Bind A and Bind B, allowing Bind B to make immediate progress. This change creates a separate bind queue for each ANV queue, enabling support for sparse bindings that may have input dependencies. v2: - Bail on bind queue creation failure (Lionel) - Only create bind queue if VK_QUEUE_SPARSE_BINDING_BIT is set (Jose) v3: - Add comment around submit->queue usage (Jose) Signed-off-by: Matthew Brost <matthew.brost@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32873>	2025-01-14 14:39:53 +00:00
Nanley Chery	cd8e120b97	anv: Allow more single subresource fast-clears with FCV Format re-interpretation is no longer a problem with texture views. The clear color address now points to a clear color that is in the expected format. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31374>	2025-01-14 03:43:55 +00:00
Nanley Chery	35f02d8f36	anv: Inline can_fast_clear_with_non_zero_color Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31374>	2025-01-14 03:43:55 +00:00
Nanley Chery	5549cb921d	Revert "anv: turn off non zero fast clears for CCS_E" This reverts commit `25a232238f`. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11110 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11325 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31374>	2025-01-14 03:43:55 +00:00
Nanley Chery	3e62401df3	anv: Drop bpc check for non-zero fast clears Use the matching clear color address for an image view format to support any clear color. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31374>	2025-01-14 03:43:55 +00:00
Nanley Chery	83cd73385a	anv: Use L3 Fabric flush in fast-clear post-amble on TGL Replace the Tile Cache flush with an L3 Fabric flush. According to HSD 1604687438, this should be faster. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31600>	2025-01-14 03:14:00 +00:00
Nanley Chery	cec086a074	anv: Reduce fast-clear post-amble synchronization On gfx12+, the pre-amble and post-amble flushes contain the stalls necessary to ensure the prior operation is complete. Remove the extra uses of ANV_PIPE_END_OF_PIPE_SYNC_BIT in post-amble flushes. Also do this for the pre-amble flushes, but this doesn't have any impact. The flush application function will implicitly add the bit. For A750, this improves the TWWH3 trace in the performance CI by 0.52% (n=2). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31600>	2025-01-14 03:14:00 +00:00
Nanley Chery	052d7e1a9c	anv: Slow clear if fast-clear cost is not mitigated Fast-clears require expensive flushes beforehand and afterwards. The cost of flushes are decreased in a series of back-to-back fast-clears as no extra fast-clear flushes are required in between them. If the ratio of a command buffer's recorded back-to-back fast clears over independent fast-clears falls below 1/2, prevent that command buffer from recording any further fast-clears. Averaging two runs of our Factorio trace on an A750 shows a +14.37% improvement in FPS. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32984>	2025-01-13 20:42:31 +00:00
Hyunjun Ko	638fc5e472	anv: change bool to VkResult Fixes: `41caf3665c` ("anv/image: allocate some memory for mv storage after video images.") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32775>	2025-01-10 21:45:04 +00:00
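
The slow-clear rule described in commit `052d7e1a9c` can be pictured as a check on two counters. This is only a sketch of the rule as the commit message words it, not the code in anv. The struct `clear_stats`, its fields, and `fast_clear_still_allowed()` are hypothetical names. Allowing fast-clears while no independent fast-clear has been recorded is an assumption.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-command-buffer counters (illustrative names only). */
struct clear_stats {
   uint32_t back_to_back; /* fast-clears that directly followed another one */
   uint32_t independent;  /* fast-clears that needed their own flushes */
};

/* Returns false once back_to_back / independent has fallen below 1/2,
 * i.e. once the flush cost is no longer amortized well enough.
 */
static bool
fast_clear_still_allowed(const struct clear_stats *stats)
{
   assert(stats != NULL);

   /* Assumption: with no independent fast-clear recorded yet there is
    * nothing to amortize, so keep allowing fast-clears.
    */
   if (stats->independent == 0)
      return true;

   /* back_to_back / independent < 1/2  <=>  2 * back_to_back < independent.
    * The comparison is done in 64 bits to avoid overflow and division.
    */
   return 2 * (uint64_t)stats->back_to_back >= stats->independent;
}
```

With one back-to-back and three independent fast-clears the ratio is 1/3, so the function returns false. At exactly 1/2 it still returns true, since the commit message says the ratio must fall below 1/2.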

6107 commits
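
The two sparse-buffer fixes (`f912436dc9` and `614dd4999c`) both depend on the canonical form of a 48-bit address, where bits 63:48 repeat bit 47. The sketch below illustrates that idea only. The helper names and the packed layout (flags in bits 63:48 above a 48-bit pointer) are hypothetical and are not taken from the anv BVH encoder or from copy.comp.

```c
#include <assert.h>
#include <stdint.h>

#define ADDR_BITS 48
#define ADDR_MASK ((UINT64_C(1) << ADDR_BITS) - 1)

/* Sign-extend bit 47 into bits 63:48 to get the canonical 64-bit form. */
static uint64_t
canonical_address_48(uint64_t addr)
{
   uint64_t low = addr & ADDR_MASK;
   return (low & (UINT64_C(1) << (ADDR_BITS - 1))) ? (low | ~ADDR_MASK) : low;
}

/* Hypothetical packing: 16 bits of flags stored above a 48-bit pointer.
 * The pointer is masked first. OR-ing flags onto a canonical "high"
 * address without masking would leave the upper 16 bits all ones and
 * corrupt the flags.
 */
static uint64_t
pack_ptr_and_flags(uint64_t addr, uint32_t flags)
{
   assert(flags <= 0xFFFF);
   return (addr & ADDR_MASK) | ((uint64_t)flags << ADDR_BITS);
}

/* Recover a dereferenceable pointer from the packed value. */
static uint64_t
unpack_ptr(uint64_t packed)
{
   return canonical_address_48(packed);
}
```

A "high" address such as `0xFFFF800000001000` survives a round trip through `pack_ptr_and_flags()` and `unpack_ptr()`, because the upper 16 bits are rebuilt from bit 47 rather than assumed to be zero.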