fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 22:28:06 +02:00

Author	SHA1	Message	Date
Guilherme Gallo	d801c1101d	ci/anv: Update xfiles Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31414>	2024-09-27 16:38:27 +00:00
Guilherme Gallo	a748d38ec9	ci/anv: Introduce missing farm var for ADL jobs Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31414>	2024-09-27 16:38:27 +00:00
Guilherme Gallo	a06102ca6c	ci/intel: Rebalance jobs via parallel Take advantage of 3 spare JSL in Collabora lab to load the balance of those jobs: job name avg duation (min) Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> --- --- anv-jsl 15 anv-jsl-angle 20 iris-jsl-deqp 18 Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31414>	2024-09-27 16:38:26 +00:00
Sviatoslav Peleshko	57344052b6	intel/brw: Don't apply discard_if condition opt if it can change results We can't just always negate the alu instruction's cmod, because negating it can produce different results when the argument is NaN float. We can still do that if the condition is == or !=. Fixes: `0ba9497e` ("intel/fs: Improve discard_if code generation") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11800 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31042>	2024-09-27 11:52:27 +00:00
Caio Oliveira	93c3780bc1	intel/brw: Skip per-primitive inputs when computing flat input mask The per-primitive have their own separate section in the FS thread payload, and are not considered when setting the mask in 3STATE_SBE's ConstantInterpolationEnable. This is also consistent with what is done for brw_interp_reg(). Fixes - dEQP-VK.mesh_shader.ext.misc.clip_geom_provoking_last - dEQP-VK.mesh_shader.ext.misc.clip_geom_and_task_shader_provoking_last Backport-to: 24.2 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11844 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31417>	2024-09-27 08:15:18 +00:00
Iván Briano	a4cbc903a8	anv: allocate sparse descriptor buffers from the correct heap When allocating a buffer normally, this flag gets to the allocator from the memory requirements, but when sparse bindings are created we were checking for them but never setting them. Fixes sparse descriptor buffers on Xe2. Makes the failure on TRTT more obvious. Fixes: `c6a91f1695` ("anv: add new heap/pool for descriptor buffers") Fixes: `692e1ab2c1` ("anv: get rid of the second dynamic state heap") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31372>	2024-09-27 04:49:22 +00:00
Paulo Zanoni	fe59044f47	anv/trtt: mark vk_sync_get_value()'s value as defined for Valgrind Valgrind doesn't seem to know that drmSyncobjQuery() writes to the variable that we pass as 'last_value'. This gets rid of: ==6275== Conditional jump or move depends on uninitialised value(s) ==6275== at 0x5308370: anv_sparse_trtt_garbage_collect_batches (anv_sparse.c:540) ==6275== by 0x53091E2: anv_sparse_bind_trtt (anv_sparse.c:825) ==6275== by 0x5309771: anv_sparse_bind (anv_sparse.c:953) ==6275== by 0x5309A3B: anv_free_sparse_bindings (anv_sparse.c:1041) ==6275== by 0x529FF21: anv_DestroyBuffer (anv_buffer.c:248) ==6275== by 0x932ADBD: ??? (in /usr/lib/x86_64-linux-gnu/libVkLayer_khronos_validation.so) ==6275== by 0x127AA2: MyVkBuffer::~MyVkBuffer() (sparse.cpp:364) ==6275== by 0x12B2D4: MyApp::test1_trivial_sparse() (sparse.cpp:1421) ==6275== by 0x13E01A: MyApp::run_test(int) (sparse.cpp:6594) ==6275== by 0x13E3B0: main (sparse.cpp:6656) ==6275== Uninitialised value was created by a stack allocation ==6275== at 0x53082D3: anv_sparse_trtt_garbage_collect_batches (anv_sparse.c:525) An alternative to these Valgrind macros would simply have been to zero-intialize last_value. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31332>	2024-09-27 04:10:12 +00:00
Paulo Zanoni	ab91106d4f	anv: fix compute engines when using ANV_QUEUE_OVERRIDE I just noticed that my custom sparse program was not working correctly when I used ANV_QUEUE_OVERRIDE (instead of enabling the compute queue by default or using INTEL_ENGINE_CLASS_COMPUTE, which was removed by commit `600d88ab3c` ("intel: Remove INTEL_ENGINE_CLASS_COMPUTE and INTEL_ENGINE_CLASS_COPY parameters"). It turns out we were not setting the same engine class type when using ANV_QUEUE_OVERRIDE vs the other cases. Move the code around so the behavior can stay the same. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31332>	2024-09-27 04:10:12 +00:00
Matt Turner	75f02ed4b5	anv: Set shader_spilling_rate=15 by default This avoids massively long shader compile times when there is lots of spilling, at a minor cost of a few more spills/fills. Choose 15 as it is already the default used by the Cyberpunk 2077 driconf workaround. Surprisingly the number of additional spills/fills are miniscule in fossil-db: Instructions in all programs: 152680595 -> 152681525 (+0.0%) SENDs in all programs: 7672789 -> 7672789 (+0.0%) Loops in all programs: 48469 -> 48469 (+0.0%) Cycles in all programs: 11981743456 -> 11984228708 (+0.0%) Spills in all programs: 42989 -> 42779 (-0.5%) Fills in all programs: 76380 -> 76776 (+0.5%) partly because of the chaotic unpredictability that the choice of registe to spill has on a shader. For example, this patch massively helps some shaders in terms of spills/fills: Spills helped fossils/fossil-db/steam-native/red_dead_redemption2.vk-g6.foz/4101ff9c9b83bf22/SIMD8 fragment: 3208 -> 2894 (-9.8%) Fills helped fossils/fossil-db/steam-native/red_dead_redemption2.vk-g6.foz/4101ff9c9b83bf22/SIMD8 fragment: 7258 -> 6795 (-6.4%) Spills helped fossils/q2rtx/q2rtx-rt-pipeline.976f4ab1c0fee975.1.foz/c496e8a549f6b4bf/compute: 109 -> 92 (-15.6%) Related: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31133 Related: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9241 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11709 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11844 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31269>	2024-09-27 03:43:52 +00:00
Caio Oliveira	4e559077e4	intel/executor: Dump both pre-processed source and assembly Having the actual generated assembly is helpful when trying to figure out if the code emission and disassembly are implemented correctly. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31305>	2024-09-27 02:46:28 +00:00
Caio Oliveira	2455e2765a	intel/brw: Add DUMP flag to brw_assemble Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31305>	2024-09-27 02:46:28 +00:00
Lionel Landwerlin	50cc738a6d	blorp: convert fast clear color for unsupported formats This tests is asserting on LNL like : dEQP-VK.pipeline.monolithic.sampler.border_swizzle.r8_srgb.gbar.custom.gather_1.no_swizzle_hint dEQP-VK.api.image_clearing.core.clear_color_image.2d.optimal.single_layer.e5b9g9r9_ufloat_pack32 Because blorp tries, for example, to setup a render target with L8_UNORM_SRGB (which is mapped to the R8_UNORM_SRGB of Vulkan) but is not supported for rendering. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `1c7fe9ad1b` ("anv: Support fast clears in anv_CmdClearColorImage") Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31357>	2024-09-27 00:37:25 +00:00
Caio Oliveira	28ef0de250	intel/brw: Add SWSB MATH pipe to assembler Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31336>	2024-09-26 20:40:28 +00:00
Sagar Ghuge	f39cd30f4f	anv: Track all the descriptor sets During compute state save/restore, let's track all the descriptor sets. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30798>	2024-09-26 06:56:21 +00:00
Dylan Baker	f8273555d3	anv: enable VK_EXT_ycbcr_2plane_444_formats Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31352>	2024-09-25 22:10:14 +00:00
Caio Oliveira	d12950539c	intel/brw: Consider pipe when comparing SWSB in tests When tests were added, there was a single pipe (float), so there wasn't a pipe to compare in `operator==`. Add it there now and adjust expectations accordingly. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31335>	2024-09-25 19:32:31 +00:00
Lionel Landwerlin	0b5408f9fc	anv: expose VK_EXT_pipeline_protected_access Intel's protection mechanism is descriptor based. There is nothing going on in the shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31339>	2024-09-25 16:45:49 +00:00
Lionel Landwerlin	d2f7b6d5a7	anv: implement VK_KHR_dynamic_rendering_local_read Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27270>	2024-09-25 12:51:07 +00:00
Lionel Landwerlin	15987f49bb	anv: avoid setting up a null RT unless needed Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27270>	2024-09-25 12:51:07 +00:00
Lionel Landwerlin	6f5d032c6f	intel/decoder: decode the 8 BLEND_STATEs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27270>	2024-09-25 12:51:07 +00:00
Lionel Landwerlin	d164fe839c	intel/decoder: split state tracking handlers from printing ones Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27270>	2024-09-25 12:51:07 +00:00
Lionel Landwerlin	b39980c616	intel/decoder: add filter feature Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27270>	2024-09-25 12:51:07 +00:00
Lionel Landwerlin	7bd4b537fe	intel/decoder: constify functions not modifying instructions/fields Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27270>	2024-09-25 12:51:07 +00:00
Lionel Landwerlin	2193d87277	brw: remove EOT handling from sampler messages Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31307>	2024-09-25 10:22:40 +00:00
Lionel Landwerlin	2ed4af057a	brw: fix mask componentation for 16-bit sampler returns We can't use register counts since 16-bit sampler loads in SIMD8 will only write back half a GRF. Signed-off-by: Lionel Landwerlin <llandwerlin@gmail.com> Fixes: `0116430d39` ("intel/brw: Handle 16-bit sampler return payloads") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31307>	2024-09-25 10:22:40 +00:00
Lionel Landwerlin	eeb5f6e8c8	brw: make sampler message emission more generic We can generalize the simd8-16bits case by just rounding to a physical register. We also take the opportunity to limit the register allocation to a single physical GRF for the residency data. Signed-off-by: Lionel Landwerlin <llandwerlin@gmail.com> Fixes: `0116430d39` ("intel/brw: Handle 16-bit sampler return payloads") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31307>	2024-09-25 10:22:40 +00:00
Sagar Ghuge	7e48cbb029	intel: uncached L1 to fix memory barrier issue in RT shader In the RT shader, if there's a executeCallableEXT() in between, even though the called shader does nothing, the instructions before and after the executeCallableEXT() is not properly synced. Patch fixes: - dEQP-VK.ray_tracing_pipeline.memguarantee.inside.rgen - dEQP-VK.ray_tracing_pipeline.memguarantee.inside.chit - dEQP-VK.ray_tracing_pipeline.memguarantee.inside.miss - dEQP-VK.ray_tracing_pipeline.memguarantee.inside.call Thank to Kevin for finding out there is a load/store issue. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31201>	2024-09-24 14:33:11 +00:00
Nanley Chery	730e83b525	anv: Require compression for fast-clears on gfx20+ In commit `44351d67f8`, I needed to change some variables in a check for compression in anv_can_fast_clear_color_view(). Instead of doing that, I dropped the check altogether because I thought the call to anv_layout_to_fast_clear_type() which followed right afterwards would return ANV_FAST_CLEAR_NONE if the aux usage was ISL_AUX_USAGE_NONE. That turned out not to be the case, due to special-casing of Xe2+. For now, make Xe2+ more like other platforms when it comes to enabling fast-clears. If there comes a reason to actually fast-clear with ISL_AUX_USAGE_NONE, we can revisit this. Fixes: `44351d67f8` ("anv: Change params of anv_can_fast_clear_color_view") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11920 Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31297>	2024-09-24 13:56:02 +00:00
Mike Blumenkrantz	04709e4f7d	anv: fix video profile lists these didn't include dmabuf layout or mutable formats despite both being supported Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31317>	2024-09-24 11:38:48 +00:00
Lionel Landwerlin	f81dc17e7d	anv: add missing pipeline instance multiplier Fix zink/anv tests : dEQP-GLES3.functional.fbo.multiview.samples_* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11911 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31341>	2024-09-24 10:36:17 +00:00
Rohan Garg	56adf42110	intel/brw: lower math op regions for Xe2+ This helps fix: - dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_3.tan_frag - dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_2.tan_frag Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31218>	2024-09-24 09:58:28 +00:00
Caio Oliveira	e1b74407bb	intel/brw: Only validate GRF boundary crossing restriction for GRFs Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31294>	2024-09-24 03:39:05 +00:00
Kenneth Graunke	878ae9708a	intel/brw: Don't include sync.nop in INTEL_DEBUG instruction counts In an earlier commit, I made us stop counting sync.nops in the shader statistics we use for shader-db (brw_debug_log_message) and fossil-db (stats->instructions = ...). However, I missed adjusting the printout for INTEL_DEBUG. Fixes: `1497f4e0c2` ("intel/fs: Don't include sync.nop in instruction count statistics") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31311>	2024-09-24 03:12:32 +00:00
Iván Briano	2e1c278e3d	anv: skip rt pipeline compile if we found all shaders When no pipeline cache is provided by the application and we rely on the internal one, cache hits are not counted as such. This was causing us to return COMPILE_REQUIRED on some cases where all shaders had been found in the cache, as well as some unnecessary extra processing in the case that we did have to compile the pipeline. Fixes: `1dacea10f3` ("anv: implement caching for ray tracing pipelines") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31298>	2024-09-23 19:57:53 +00:00
Iván Briano	1a45c8827b	anv: free shaders on rt pipeline compile error We have not yet added the shaders to the pipeline->shaders array at this point. If we couldn't compile (or were asked not to) the pipeline, we were leaking references to any shaders found in the cache. This would manifest as an assert on device destruction: vk_pipeline_cache_destroy: Assertion `cache->object_cache->entries == 0' failed. Fixes: `58c9f817cb` ("anv: fix pipeline executable properties with graphics libraries") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31298>	2024-09-23 19:57:53 +00:00
Lionel Landwerlin	35ea8b6cd2	brw: disable null_rt only if color output does not affect other outputs We found out that some HW changes on Xe2 make the HW avoid reading the blend state if we're using the null_rt bit in the extended descriptor. Since the alpha_to_coverage bit resides in the blend state, that state is ignored and writes are going through to the depth/stencil buffers. Disable null_rt in the color outputs if the color outputs can affect other outputs (through alpha_to_coverage & omask). Fixes tests in this pattern on Xe2 : dEQP-VK.pipeline..multisample.alpha_to_coverage_no_color_attachment. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Backport-to: 24.2 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	b45ce7d43e	brw: move null_rt control up a layer We'll want to tune this setting based on other parameters. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Backport-to: 24.2 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	9b42215e0d	iris: ensure null render target for specific cases Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	badb3f6301	anv: Only flush render target cache when detecting RT changes We setup an empty render target when there are no color attachments, which effectively makes it a different surface state. In most cases the compiler will insert a null-rt bit in the extended descriptor which means the RT isn't even accessed. But in some cases like alpha-to-coverage output + depth/stencil write, we will access the render target because using the null-rt will prevent alpha-to-coverage from happening. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2bd304bc8f` ("anv: Skip the RT flush when doing depth-only rendering.") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	fb3ae17d96	anv: fix missing tracking for alpha-to-coverage runtime changes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9926aedc96` ("anv: enable EDS3 AlphaToCoverageEnable & RasterizationSamples") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:01 +00:00
Nanley Chery	b3882c4488	intel: Avoid no-op calls to anv_image_clear_color Whenever we execute a fast-clear due to LOAD_OP_CLEAR, we decrease the number of layers to clear by one. We then enter the slow clear function and possibly exit without clearing if the layer count is zero. Unfortunately, we've already compiled the shader for slow clears by the time we exit. Skip the slow clear function if there are no layers to clear. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	1c7fe9ad1b	anv: Support fast clears in anv_CmdClearColorImage At least two game traces make use of this path: TWWH3 and Factorio. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	46d58583ff	anv: Move exec_ccs_op and exec_mcs_op higher up The next patch will use them in anv_CmdClearColorImage(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:37 +00:00
Nanley Chery	03286117ef	anv: Move and rename anv_can_fast_clear_color_view It's no longer specific to image views. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:36 +00:00
Nanley Chery	44351d67f8	anv: Change params of anv_can_fast_clear_color_view Expand the scope to more than just image views. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31167>	2024-09-20 16:34:36 +00:00
José Roberto de Souza	7c01cbda6f	anv: Optimize vkQueueWaitIdle() on Xe KMD vk_common_QueueWaitIdle() creates a syncobj, does a submit with no batch buffers what translates to execute trivial_batch_bo and then waits for syncobj to be signaled when trivial_batch_bo finishes. On Xe KMD on other hand we can avoid the trivial_batch_bo submission and instead use the special DRM_IOCTL_XE_EXEC with num_batch_buffer == 0 to get a syncobj to be signaled when the last exec finish execution. This should free a bit GPU to execute more important workloads. This will also optimize vkDeviceWaitIdle() that calls QueueWaitIdle(). It have to fallback to vk_common_QueueWaitIdle() when queue is in VK_QUEUE_SUBMIT_MODE_THREADED mode because vkQueueWaitIdle() could return but there still stuff in VK/CPU submission queue. Also it could cause use after free when resources attached to submission are freed before it is processed, example: vkCreateFence() or vkCreateSemaphore() vkQueueSubmit() // with Fence or Semaphore created above vkQueueWaitIdle() // with the race it returns vkDestroyFence() or vkDestroySemaphore() // vk_queue_submit_thread_func() start to process submission above... Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30958>	2024-09-19 23:12:45 +00:00
José Roberto de Souza	2f7c9f906d	intel: Split anv_xe_wait_exec_queue_idle() and move part of it to common/ Split anv_xe_wait_exec_queue_idle() into 2 functions, the first function creates the syncobj and prepares it to be signaled when the last workload in queue is completed. And the second one that calls the first function, then waits for the syncobj to be signaled and destroy the syncobj. The main reason for that is that the first function can be reused in Iris and a future patch will add another user, so lets share it. No changes in behavior are expected here. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30958>	2024-09-19 23:12:44 +00:00
Tapani Pälli	b01d76027d	blorp: assert that color depth is not 96 for Wa_16021021469 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31263>	2024-09-19 22:44:49 +00:00
Nanley Chery	290f3a9367	intel/isl: Disable 3D Ys/Yf miptails for CCS We currently disable CCS if a 3D Ys/Yf surface uses miptails. However, ISL generally configures surfaces to be compatible with compression. For consistency, disable miptails on 3D Ys/Yf surfaces in order to allow compression. If drivers prefer to have a more compact layout, they can pass the ISL_SURF_USAGE_DISABLE_AUX_BIT flag at surface creation time. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30081>	2024-09-19 20:39:59 +00:00
Nanley Chery	19ed0e1685	intel/isl: Reduce miptail slot usage to allow CCS We currently disable CCS if a surface uses more than 11 slots in a miptail. However, ISL generally configures surfaces to be compatible with compression. For consistency, reduce the number of slots used in miptails in order to allow compression. If drivers prefer to have a more compact layout, they can pass the ISL_SURF_USAGE_DISABLE_AUX_BIT flag at surface creation time. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30081>	2024-09-19 20:39:59 +00:00

1 2 3 4 5 ...

12759 commits