fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 02:38:07 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	eef54f3175	intel/decoder: add options to decode surfaces/samplers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24632>	2023-08-12 13:49:32 +00:00
Lionel Landwerlin	cf5ee0a0f7	anv: emit 3DSTATE_GS only once per pipeline Following `71ebd9b9d7`, 3DSTATE_GS can be emitted as part of the pipeline batch and as a dynamic state. Just do the latter. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `71ebd9b9d7` ("anv,hasvk: respect provoking vertex setting on geometry shaders") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24632>	2023-08-12 13:49:31 +00:00
Tapani Pälli	92941ee84b	anv: implement required PSS sync for Wa_18019816803 According to WA description, we need to track DS write state and emit a PSS_STALL_SYNC whenever that state changes. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18411>	2023-08-11 07:15:48 +00:00
Konstantin Seurer	eaee792ea5	vulkan: Add a generated vk_properties struct Generates a physical device properties table to avoid dealing with pNext chains in the driver. Based on vk_features. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24575>	2023-08-11 02:53:47 +00:00
Lionel Landwerlin	c87d5c67d9	anv: implement VK_EXT_pipeline_robustness v2: - Use vk_pipeline_robustness_state Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17545>	2023-08-09 09:03:45 +03:00
Lionel Landwerlin	9934613c74	anv/hasvk: track robustness per pipeline stage And split them into UBO and SSBO v2 (Lionel): - Get rid of robustness fields in anv_shader_bin v3 (Lionel): - Do not pass unused parameters around Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17545>	2023-08-09 09:00:12 +03:00
Lionel Landwerlin	059e82a469	anv: remove descriptor array bounds checking We cannot find anything in the Vulkan spec requiring this. D3D12 [1] says it's undefined as long as it doesn't crash the OS : "Out of bounds indexing of any descriptor table from the shader results in a largely undefined memory access, including the possibility of reading arbitrary in-process memory as if it is a hardware state descriptor and living with the consequence of what the hardware does with that. This could produce a device reset, but will not crash Windows." [1] : https://learn.microsoft.com/en-us/windows/win32/direct3d12/advanced-use-of-descriptor-tables#out-of-bounds-indexing Found 2 titles affected by this change Some pretty good results on Cyberpunk 2077 : Totals from 10285 (100.00% of 10285) affected shaders: Instrs: 7638709 -> 7517360 (-1.59%); split: -1.64%, +0.05% Cycles: 148047414 -> 148470916 (+0.29%); split: -0.83%, +1.12% Subgroup size: 112544 -> 112576 (+0.03%); split: +0.04%, -0.01% Spill count: 98 -> 90 (-8.16%) Fill count: 90 -> 82 (-8.89%) Max live registers: 495274 -> 479502 (-3.18%); split: -3.21%, +0.03% Max dispatch width: 87824 -> 91168 (+3.81%); split: +4.10%, -0.29% Gaining 297 shaders in SIMD16/32, losing 16 SIMD32 shaders Some not so good results on Strange Brigade : Totals from 4027 (100.00% of 4027) affected shaders: Instrs: 2080355 -> 2013880 (-3.20%); split: -3.20%, +0.01% Cycles: 25405149 -> 25170579 (-0.92%); split: -1.37%, +0.45% Max live registers: 167303 -> 168958 (+0.99%) Max dispatch width: 33264 -> 32496 (-2.31%) Losing 96 SIMD16 shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17545>	2023-08-09 09:00:12 +03:00
Yonggang Luo	d130c96bda	util/treewide: Use alignas(x) instead __attribute__((aligned(x))) Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24571>	2023-08-09 05:15:09 +00:00
Sagar Ghuge	f575d4bc6f	blorp: Implement blorp hooks to emit breakpoint Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24308>	2023-08-08 17:36:19 +00:00
Sagar Ghuge	49eabb9ea6	anv: Add GPU breakpoint before/after specific draw call This change allows us to insert the MI_SEMAPHORE_WAIT before/after a specific draw call. With GTX tool, we can always update the memory address to unblock the spinning wait. v2: - Make sure draw_call_count is thread-safe (Lionel) - Add static inline helper (Lionel) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24308>	2023-08-08 17:36:19 +00:00
Benjamin Cheng	4755276baf	anv/video: copy from correct H264 scaling lists Vulkan defines the scaling lists according to the H264 ITU spec, which only defines ScalingList8x8[0] and ScalingList8x8[1] for non-444 formats. Reviewed-by: Lynne <dev@lynne.ee> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24474>	2023-08-08 03:21:39 +00:00
José Roberto de Souza	d686cadfbf	intel: Sync xe_drm.h and rename engine to exec_queue Sync with commit f16c04291100 ("drm/xe: Rename engine to exec_queue"). With that, Iris and ANV had some major renames that were done manually, as there are too many occurrences of "engine" in unrelated code. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24476>	2023-08-07 21:34:14 +00:00
Lionel Landwerlin	f5074adeb5	anv: enable INTEL_DEBUG=nofc Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24510>	2023-08-07 12:02:57 +03:00
Alyssa Rosenzweig	ab0d878932	treewide: Remove more is_ssa asserts Stuff Coccinelle missed. sed -i -e '/assert(.\.is_ssa)/d' $(git grep -l is_ssa) sed -i -e '/ASSERT.\.is_ssa)/d' $(git grep -l is_ssa) + a manual fixup to restore the assert for parallel copy lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:28 +00:00
Alyssa Rosenzweig	5fead24365	treewide: Drop is_ssa asserts We only see SSA now. Via Coccinelle patch: @@ expression x; @@ -assert(x.is_ssa); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:28 +00:00
Yonggang Luo	86bcc90c0e	intel/compiler,intel/blorp,intel/vulkan: decouple vulkan driver and compiler from gallium Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24438>	2023-08-03 22:00:15 +00:00
Felix DeGrood	6e7718dcea	anv: debug messaging for sparse texture usage Enable sparse debug messages with INTEL_DEBUG=sparse Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24046>	2023-08-03 19:57:19 +00:00
Felix DeGrood	7db2003209	anv: add fake sparse support Some DX12 games require sparse resource support but don't actually use sparse resources. Add a way to make these games work while we still don't have sparse resources fully working on every KMD backend. When fake_sparse=true, anv advertises sparse resource support despite lacking full support. Based-on-patch-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24046>	2023-08-03 19:57:18 +00:00
Alyssa Rosenzweig	51db19f7a2	nir: Rename scoped_barrier -> barrier sed + ninja clang-format + fix up spacing for common code. If you are unhappy that I did not manually change the whitespace of your driver, you need to enable clang-format for it so the formatting would happen automatically. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24428>	2023-08-01 23:18:29 +00:00
José Roberto de Souza	f49989148a	anv: Return earlier in anv_reloc_list functions Xe KMD doesn't need relocs, so call a nop function and avoid the CPU cycles and memory wasted on relocs. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24411>	2023-08-01 22:30:04 +00:00
José Roberto de Souza	d9d284d050	anv: Remove VkAllocationCallbacks parameter from reloc functions A mismatched allocator could cause bad things, so it is better to set the allocator in anv_reloc_list_init() and use it in every reloc function. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24411>	2023-08-01 22:30:04 +00:00
José Roberto de Souza	0584bb450e	anv: Nuke unused READ_ONCE() from anv_batch_chain.c Only genX_cmd_buffer.c makes use of READ_ONCE() but that file also defines it so it can be removed from anv_batch_chain.c. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24411>	2023-08-01 22:30:04 +00:00
José Roberto de Souza	a7ab31b96a	anv: Set MI_MATH MOCS field MOCS = 0 is an invalid MOCS index, so it is necessary to get a valid value and set it in MI_MATH instructions. So here the mocs index is set with mi_builder_set_mocs(); it can always be set, but it is required when mi_build will emit MI_MATH instructions. The mocs index will only be stored and used on gfx12.5+ platforms, so no changes are required in crocus or hasvk. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22508>	2023-08-01 19:49:45 +00:00
Faith Ekstrand	ae105ad5cd	anv: Use the common versions of vkBegin/EndQuery() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24409>	2023-08-01 19:17:05 +00:00
Faith Ekstrand	e4485bc062	anv: Use vk_query_pool Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24409>	2023-08-01 19:17:05 +00:00
Faith Ekstrand	1d6d775ffe	anv: Use vk_buffer_view Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24409>	2023-08-01 19:17:05 +00:00
Faith Ekstrand	92f996d0fa	anv: Use vk_sampler Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24409>	2023-08-01 19:17:05 +00:00
Rohan Garg	7f6e6eb8ec	anv: partially revert `2e8b1f6d` set_image_compressed_bit checks for the image aux usage whereas cmd_buffer_mark_image_written checks for the subresource's aux usage. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Fixes: `2e8b1f6d` ('anv: drop duplicate checks when setting the compressed bit') Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24363>	2023-07-31 15:06:39 +00:00
Lionel Landwerlin	c1c0311d42	anv: enable EDS3 ConservativeRasterizationMode Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24395>	2023-07-31 12:30:37 +00:00
Lionel Landwerlin	a0179c32b6	anv: fix 3DSTATE_RASTER::APIMode field setting The APIMode field is set in the dynamic part in gfx8_cmd_buffer.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `55951ac28e` ("anv: fix emitting dynamic primitive topology") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24395>	2023-07-31 12:30:37 +00:00
José Roberto de Souza	2b7599dc49	intel: Rename intel_gem_add_ext() to intel_i915_gem_add_ext() gem_add_ext() is i915 specific so adding it to the name. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23905>	2023-07-28 15:36:52 +00:00
José Roberto de Souza	4198a301b3	intel: Move i915_drm.h specific code from common/intel_gem.h to common/i915/intel_gem.h This allows us to remove one more i915_drm.h include from code shared by both backends. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23905>	2023-07-28 15:36:52 +00:00
Illia Polishchuk	56e0aff530	anv, drirc: Add workaround to speed up Cyberpunk 2077 reg allocation Calling the ra_allocate function after each register spill can take several minutes. This option speeds up shader compilation by spilling more registers after the ra_allocate failure. Required for Cyberpunk 2077, which uses a watchdog thread to terminate the process in case the render thread hasn't responded within 2 minutes. Execution time of my Cyberpunk2077 shader compilation test: https://gitlab.freedesktop.org/illia.a.polishchuk/cyberpunk-vulkan-compute-hang-test-anv Before the patch: real 1m28,738s user 1m28,329s sys 0m0,400s After the patch: real 0m33,245s user 0m32,835s sys 0m0,404s I think it's an acceptable patch because Cyberpunk benchmarks have the same FPS with and without the patch. (I ran it without the patch using a patched binary with the watchdog thread disabled.) Signed-off-by: Illia Polishchuk <illia.a.polishchuk@globallogic.com> Requires: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24228 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9241 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24299>	2023-07-28 14:51:42 +00:00
Lionel Landwerlin	87149cc545	blorp: update and move fast clear PIPE_CONTROLs to drivers Before this patch, when updating the indirect clear color, BLORP only invalidated the texture cache on gfx11. The hardware docs state that the texture cache invalidation is also needed on gfx12 however. Add this invalidation for gfx12 and move the fast-clear related cache invalidations to the drivers for clarity and performance. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5850 Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23588>	2023-07-28 00:07:15 +00:00
Iván Briano	71ebd9b9d7	anv,hasvk: respect provoking vertex setting on geometry shaders We need to set the right value on ReorderMode based on the provoking vertex mode, or the order in which the vertices for tristrip[_adj] are delivered to the geometry shader doesn't match what Vulkan expects. Fixes dEQP-VK.transform_feedback.primitives_generated_query.concurrent.triangle_strip_with_adjacency Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23243>	2023-07-27 18:52:49 +00:00
Lionel Landwerlin	365b14489d	anv: wire image sparse loads Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23882>	2023-07-27 02:03:02 +03:00
Lionel Landwerlin	50c29e1ffa	anv: simplify buffer address+size loads from descriptor buffer Only found a couple titles that have been helped by this : PERCENTAGE DELTAS Shaders Instrs Cycles cyberpunk_2077 10388 -0.00% -0.00% ----------------------------------------------- All affected 1 -2.24% -0.39% ----------------------------------------------- Total 10388 -0.00% -0.00% PERCENTAGE DELTAS Shaders Instrs Cycles red_dead_redemption2 5949 -0.10% -0.00% -------------------------------------------------- All affected 111 -0.74% -0.14% -------------------------------------------------- Total 5949 -0.10% -0.00% Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23318>	2023-07-26 09:41:23 +00:00
José Roberto de Souza	f59d272e93	anv: Request Xe KMD to place BOs to CPU visible VRAM when required This is required to support discrete GPUs placed in systems where large PCI BAR or resizable PCI BAR is not available or is disabled. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23781>	2023-07-25 19:33:16 +00:00
Illia Polishchuk	c2724b4d37	s/Intel: fix/anv: fix: potentially overflowing expression in genX CID 1528164 (#1 of 1): Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) overflow_before_widen: Potentially overflowing expression pool->n_passes * pool->khr_perf_preamble_stride with type unsigned int (32 bits, unsigned) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type uint64_t (64 bits, unsigned). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Illia Polishchuk <illia.a.polishchuk@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20893>	2023-07-25 08:55:56 +00:00
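
The Coverity finding quoted in the commit above (OVERFLOW_BEFORE_WIDEN) is a general C pitfall: a product of two 32-bit operands is computed in 32-bit arithmetic and only then widened to 64 bits. A minimal sketch follows, assuming a hypothetical `struct perf_pool` that only borrows the two field names from the report. It is not the anv code and not necessarily the fix that landed.

```c
#include <stdint.h>

/* Hypothetical stand-in for the query pool; only the two field names
 * come from the Coverity report quoted above. */
struct perf_pool {
   uint32_t n_passes;
   uint32_t khr_perf_preamble_stride;
};

/* Problem pattern: the multiplication happens in 32 bits (wrapping
 * modulo 2^32 on platforms with 32-bit unsigned int), and the already
 * truncated result is then converted to uint64_t. */
static uint64_t preamble_size_narrow(const struct perf_pool *pool)
{
   return pool->n_passes * pool->khr_perf_preamble_stride;
}

/* Usual remedy: widen one operand first so the whole multiplication
 * is carried out in 64 bits. */
static uint64_t preamble_size_wide(const struct perf_pool *pool)
{
   return (uint64_t)pool->n_passes * pool->khr_perf_preamble_stride;
}
```

With `n_passes = 0x10000` and `khr_perf_preamble_stride = 0x10000`, the first helper returns 0 on a platform with 32-bit `unsigned int`, while the second returns `0x100000000`.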
Faith Ekstrand	079e8a9674	anv,hasvk,iris: sampler_prog_key::swizzles is only used on crocus The field is no longer consumed by brw_compile_* and is instead handled directly by the crocus driver. Therefore, it's safe to leave it zero and not even bother setting it. This removes our reliance on the SWIZZLE_* macros in prog_instructions.h. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24288>	2023-07-24 15:40:40 +00:00
Marcin Ślusarz	c1685f08dd	intel/compiler,anv: put some vertex and primitive data in headers Both per-primitive and per-vertex space is allocated in MUE in 8 dword chunks and those 8-dword chunks (granularity of 3DSTATE_SBE_MESH.Per[Primitive\|Vertex]URBEntryOutputReadLength) are passed to fragment shaders as inputs (either non-interpolated for per-primitive and flat vertex attributes or interpolated for non-flat vertex attributes). Some attributes have a special meaning and must be placed in separate 8/16-dword slot called Primitive Header or Vertex Header. Primitive Header contains 4 such attributes (Cull Primitive, ViewportIndex, RTAIndex, CPS), leaving 4 dwords (the rest of 8-dword slot) potentially unused. Vertex Header is similar - it starts with 3 unused dwords, 1 dword for Point Size (but if we declare that shader doesn't produce Point Size then we can reuse it), followed by 4 dwords for Position and optionally 8 dwords for clip distances. This means we have an interesting optimization problem - we can put some user attributes into holes in Primitive and Vertex Headers, which may lead to smaller MUE size and potentially more mesh threads running in parallel, but we have to be careful to use those holes only when we need it, otherwise we could force HW to pass too much data to fragment shader. Example 1: Let's assume that Primitive Header is enabled and user defined 12 dwords of per-primitive attributes. Without packing we would consume 8 + ALIGN(12, 8) = 24 dwords of MUE space and pass ALIGN(12, 8) = 16 dwords to fragment shader. With packing, we'll consume 4 + 4 + ALIGN(12 - 4, 8) = 16 dwords of MUE space and pass ALIGN(4, 8) + ALIGN(12 - 4, 8) = 16 dwords to fragment shader. 16/16 is better than 24/16, so packing makes sense. Example 2: Now let's assume that Primitive Header is enabled and user defined 16 dwords of per-primitive attributes. 
Without packing we would consume 8 + ALIGN(16, 8) = 24 dwords of MUE space and pass ALIGN(16, 8) = 16 dwords to fragment shader. With packing, we'll consume 4 + 4 + ALIGN(16 - 4, 8) = 24 dwords of MUE space and pass ALIGN(4, 8) + ALIGN(16 - 4, 8) = 24 dwords to fragment shader. 24/24 is worse than 24/16, so packing doesn't make sense. This change doesn't affect vk_meshlet_cadscene in default configuration, but it speeds it up by up to 25% with "-extraattributes N", where N is some small value divisible by 2 (by default N == 1) and we are bound by URB size. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20407>	2023-07-24 07:55:29 +00:00
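
The arithmetic in Examples 1 and 2 of the commit message above can be reproduced with a short sketch. It models only what the message states: an 8-dword Primitive Header with 4 dwords free and 8-dword allocation granularity. The names `align_up`, `cost_unpacked`, `cost_packed` and `packing_helps` are hypothetical, and the decision rule in `packing_helps` is an assumption made for illustration, not the heuristic used in the Mesa compiler.

```c
#include <assert.h>
#include <stdbool.h>

/* Round v up to a multiple of a; a must be a power of two. */
static unsigned align_up(unsigned v, unsigned a)
{
   assert(a != 0 && (a & (a - 1)) == 0);
   return (v + a - 1) & ~(a - 1);
}

struct mue_cost {
   unsigned mue_dwords; /* MUE space consumed per primitive */
   unsigned fs_dwords;  /* dwords passed to the fragment shader */
};

/* Header occupies its own 8-dword slot; user data follows it. */
static struct mue_cost cost_unpacked(unsigned user_dwords)
{
   struct mue_cost c;
   c.mue_dwords = 8 + align_up(user_dwords, 8);
   c.fs_dwords = align_up(user_dwords, 8);
   return c;
}

/* Up to 4 user dwords fill the hole in the header; the rest follows.
 * The header slot must then be read by the fragment shader as well. */
static struct mue_cost cost_packed(unsigned user_dwords)
{
   unsigned in_header = user_dwords < 4 ? user_dwords : 4;
   unsigned rest = user_dwords - in_header;
   struct mue_cost c;
   c.mue_dwords = 4 + 4 + align_up(rest, 8);
   c.fs_dwords = align_up(in_header, 8) + align_up(rest, 8);
   return c;
}

/* Assumed rule: pack only if MUE space shrinks and the fragment
 * shader does not receive more data than before. */
static bool packing_helps(unsigned user_dwords)
{
   struct mue_cost u = cost_unpacked(user_dwords);
   struct mue_cost p = cost_packed(user_dwords);
   return p.mue_dwords < u.mue_dwords && p.fs_dwords <= u.fs_dwords;
}
```

For 12 user dwords this gives 16/16 packed against 24/16 unpacked, and for 16 user dwords it gives 24/24 against 24/16, matching the two examples in the message.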
Rohan Garg	01965a2fe9	anv: drop CFE state validation checks anv no longer needs to track if the CFE state is valid since we ensure that the state is valid at pipeline creation time. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23934>	2023-07-21 10:46:08 +00:00
Rohan Garg	e7e7042093	anv,iris: program the maximum number of threads on compute queue init Fixes: `90a39cac87` ("intel/blorp: Emit compute program based on BLORP_BATCH_USE_COMPUTE") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23934>	2023-07-21 10:46:08 +00:00
Marcin Ślusarz	06046a02f8	anv: merge cases leading to the same code Added in: `688968e888` ("anv: add support for direct descriptor in allocation/writes") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24260>	2023-07-21 07:22:22 +00:00
Marcin Ślusarz	0eb2679cdb	anv: drop unused function Added in: `02cecffe2b` ("anv: add a pass to partially lower resource_intel") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24260>	2023-07-21 07:22:22 +00:00
Hyunjun Ko	e3ecba3266	anv: use ycbcr_info for P010 format Since !24096 landed, we can just use ycbcr_info to get information of an image of the P010 format. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24265>	2023-07-21 06:15:30 +00:00
Nanley Chery	d9bdffa708	intel: Describe modifier compression with booleans Replace the aux_usage field with two booleans: one for render compression and one for media compression. This more accurately describes how CCS_E is used on gfx12. On those platforms, the FCV feature may be enabled or disabled, but ISL's modifier table has been using the FCV aux-usage for every gfx12 render compression modifier. Instead, set the newly-added render compression boolean to true. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120>	2023-07-20 20:53:27 +00:00
Nanley Chery	569f80f2df	anv: Reduce accesses of isl_mod_info->aux_usage This field will be replaced in an upcoming patch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120>	2023-07-20 20:53:26 +00:00
Nanley Chery	f2dab434d8	anv: Handle explicit surface layout of DG2_RC_CCS We're going to enable the DG2 modifier. Account for the reduced plane count that exists with it. Also add an assert to make it clearer that the aux in use is CCS. Otherwise, it may not be obvious because of the generic compression names being used here. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120>	2023-07-20 20:53:26 +00:00
Nanley Chery	47565d31e1	intel: Add and use isl_drm_modifier_get_plane_count We're going to enable the DG2_RC_CCS modifier in anv. Add and use this function to prepare for the new plane count that comes with that modifier. iris is left alone for now because it supports more modifiers than isl_drm_modifier_get_score is aware of. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120>	2023-07-20 20:53:26 +00:00

4786 commits