fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 06:48:09 +02:00

Author	SHA1	Message	Date
Tapani Pälli	7d4c23991a	intel/blorp: remove unused blorp batch flag Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28623>	2024-04-10 05:38:24 +00:00
Lionel Landwerlin	85dd83aa46	anv: only check patch_control_points changes in runtime flush Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28396>	2024-04-09 11:32:48 +00:00
Paulo Zanoni	cf7e1f3817	anv, iris: add missing CS_STALL bit for GPGPU texture invalidation The BSpec page "Flush Types" (46213) says the following about the Tex Invalidate bit: "Requires stall bit ([20] of DW) set for all GPGPU Workloads." For newer platforms, this is documented in the description of the texture invalidation bit in the PIPE_CONTROL page (56551): "CS Stall bit in PIPE_CONTROL command must be always set for GPGPU workloads when Texture Cache Invalidation Enable bit is set" Iris had it only for GFX_VER 9 and 11, while Anv had it missing for everything. Please notice that this patch includes a revert of `397e728ef4`. Fixes: `397e728ef4` ("iris: Drop GPGPU Tex Invalidate restriction for TGL+") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28608>	2024-04-08 22:57:22 +00:00
Lionel Landwerlin	2dd321963f	isl: set NullPageCoherencyEnable for depth/stencil sparse surfaces Not setting this bits, it seems we get incorrect depth values (i.e not zero) for null depth/stencil tiles. Fixes vkd3d-proton's test_sparse_depth_stencil_rendering CTS doesn´t seem to exercise any depth/stencil format. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28611>	2024-04-08 09:03:41 +00:00
Lionel Landwerlin	c3d30d9e65	anv: mark descriptors & pipeline dirty after blorp compute All of those are used by blorp, we need to reemit it when doing the next compute dispatch. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `37fca614b8` ("anv/blorp: Split blorp_exec into a render and compute") Fixes: `6823ffe70e` ("anv: try to keep the pipeline in GPGPU mode when buffer transfer ops") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10972 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28617>	2024-04-08 06:55:54 +00:00
Hyunjun Ko	2bd3674679	anv/video: Fix to set correct offset and size for parsing h265 slice header. Fixes: `8d519eb5` ("anv: add initial video decode support for h265") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28591>	2024-04-08 04:12:07 +00:00
Lionel Landwerlin	fe36cf6cad	anv: add missing data flush out of L3 for transform feedback writes Fixes zink's piglit.spec.arb_shader_image_load_store.host-mem-barrier on TGL Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28492>	2024-04-06 07:33:29 +00:00
Lionel Landwerlin	6a7e576017	intel/fs: fixup instruction scheduling last grf write tracking When I bumped the max size of VGRFs, I should have bumped the values in the scheduler too. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `d33aff783d` ("intel/fs: add support for sparse accesses") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28188>	2024-04-05 19:46:40 +00:00
Lionel Landwerlin	d59612f5e5	intel/fs: printout a couple of more late compile steps Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28188>	2024-04-05 19:46:40 +00:00
José Roberto de Souza	77c004f7ca	anv: Create protected engine context when i915 supports vm control When has_vm_control is supported it takes a different code path and creates one context per engine and in this code path we were not setting the protected context flag. The lack of this is not causing any test to fail in our CI but it is better do what we are supposed to do. Fixes: `fd40134487` ("anv: allow protected GEM context creation") Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28299>	2024-04-05 15:00:24 +00:00
Lionel Landwerlin	ea84b36592	anv: fix incorrect blorp dynamic state heap usage Found with valgrind : ==253563== Invalid free() / delete / delete[] / realloc() ==253563== at 0x6EEBB88: anv_state_pool_free (anv_allocator.c:962) ==253563== by 0x6EFB563: anv_device_finish_blorp (anv_blorp.c:143) ==253563== by 0x6F204F8: anv_DestroyDevice (anv_device.c:4063) ==253563== by 0x6DE1CD7: loader_layer_destroy_device (loader.c:4387) ==253563== by 0x6DF1D5E: vkDestroyDevice (trampoline.c:1025) ==253563== by 0x407C54: vk::refdetails::Deleter<vk::VkDevice_s>::operator()(vk::VkDevice_s) const (vkRef.hpp:131) ==253563== by 0x42C016: vk::refdetails::RefBase<vk::VkDevice_s>::reset() (vkRef.hpp:303) ==253563== by 0x40B385: vk::refdetails::RefBase<vk::VkDevice_s>::~RefBase() (vkRef.hpp:296) ==253563== by 0x40A95D: vk::refdetails::Unique<vk::VkDevice_s>::~Unique() (vkRef.hpp:376) ==253563== by 0x402501: vkt::DefaultDevice::~DefaultDevice() (vktTestCase.cpp:658) ==253563== by 0x444807: de::DefaultDeleter<vkt::DefaultDevice>::operator()(vkt::DefaultDevice) const (deDefs.hpp:112) ==253563== by 0x43922D: de::details::UniqueBase<vkt::DefaultDevice, de::DefaultDeleter<vkt::DefaultDevice> >::reset() (deUniquePtr.hpp:90) ==253563== Address 0xd3df000 is 0 bytes inside a block of size 272 client-defined ==253563== at 0x6EEBA0B: anv_state_pool_alloc (anv_allocator.c:940) ==253563== by 0x6EFA610: anv_state_pool_emit_data (anv_private.h:852) ==253563== by 0x6EFB206: upload_dynamic_state (anv_blorp.c:106) ==253563== by 0x6FC8C31: blorp_init_dynamic_states (blorp_genX_exec_brw.h:2211) ==253563== by 0x6FCD02D: gfx9_blorp_init_dynamic_states (genX_blorp_exec.c:507) ==253563== by 0x6EFB47E: anv_device_init_blorp (anv_blorp.c:129) ==253563== by 0x6F1FBBA: anv_CreateDevice (anv_device.c:3908) ==253563== by 0x840B669: vk_tramp_CreateDevice (vk_dispatch_trampolines.c:78) ==253563== by 0x6DE663A: terminator_CreateDevice (loader.c:5836) ==253563== by 0x6DE3E39: loader_create_device_chain (loader.c:4940) ==253563== by 0x6DE1AC5: loader_layer_create_device (loader.c:4320) ==253563== by 0x6DF1CBF: vkCreateDevice (trampoline.c:1005) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `fe1baa6481` ("anv: reduce blorp dynamic state emissions") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28578>	2024-04-05 09:50:41 +00:00
Lionel Landwerlin	9b0f028c7e	anv: update protection fault property Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `794b0496e9` ("anv: enable protected memory") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26540>	2024-04-05 09:07:21 +03:00
Lionel Landwerlin	d2e490dc4d	anv: disable generated draws in protected command buffers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `794b0496e9` ("anv: enable protected memory") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26540>	2024-04-05 09:07:21 +03:00
Lionel Landwerlin	034a1cdb58	anv: disable protected content around surface state copies Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `794b0496e9` ("anv: enable protected memory") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26540>	2024-04-05 09:07:21 +03:00
Lionel Landwerlin	27a3771227	anv: pull surface state copies for secondary in one loop It'll be easier for the next commit. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26540>	2024-04-05 09:07:21 +03:00
Lionel Landwerlin	07bf480856	anv: fix protected memory allocations Using the wrong flag field... Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5f2c77a10a` ("anv: handle protected memory allocation") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26540>	2024-04-05 09:07:21 +03:00
Ian Romanick	0e817ba548	intel/brw/xe2+: Implement Wa 22016140776 HF sources to math instructions cannot be scalar. This is very similar to an old Gfx6 restriction on POW, so let's fix it in a similar way. As an extra bit of saftey, lower any occurances that might slip through in brw_fs_lower_regioning. The primary change is to prevent copy propagation from violating the restriction. With that change, nothing should be able to generate these invalid source strides. The modification to fs_visitor::validate should detect potential problems sooner rather than later. Previous attempts to implement this Wa when emitting the math instruction (in brw_eu_emit.c gfx6_math) didn't work for several reasons. The lowering happens after the SWSB pass, so the scoreboarding was incorrect (thanks to Curro for finding that). In addition, the lowering happens after register allocation, so it's impossible to allocate a non-scalar register to expand the scalar value. Fixes 113 tests in the dEQP-VK.spirv_assembly.* group on LNL. v2: Add changes to brw_fs_lower_regioning. Suggested by Curro. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28480>	2024-04-04 21:04:09 -07:00
Jordan Justen	50c7d25a9e	intel/dev/mesa_defs.json: Add LNL WA entries Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28480>	2024-04-04 21:03:51 -07:00
Ian Romanick	0b67d3d909	intel/elk: Delete stray nir_opt_dce No shader-db changes on any Intel platform. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28136>	2024-04-04 23:42:28 +00:00
Ian Romanick	24cdbbdaa2	intel/brw: Delete stray nir_opt_dce No shader-db or fossil-db changes on any Intel platform. Fixes: `f76f4be301` ("intel/compiler: move gen5 final pass to actually be final pass") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28136>	2024-04-04 23:42:27 +00:00
Ian Romanick	44fb57b827	intel/elk: Don't call nir_opt_remove_phis before nir_convert_from_ssa shader-db: All platforms had similar results. (Ivy Bridge shown) total instructions in shared programs: 15831424 -> 15831637 (<.01%) instructions in affected programs: 38880 -> 39093 (0.55%) helped: 0 / HURT: 179 total cycles in shared programs: 432140353 -> 432170199 (<.01%) cycles in affected programs: 11798080 -> 11827926 (0.25%) helped: 77 / HURT: 123 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28136>	2024-04-04 23:42:27 +00:00
Ian Romanick	6377e8fd29	intel/brw: Don't call nir_opt_remove_phis before nir_convert_from_ssa Per discussion in #10727, removing phis breaks LCSSA form which in turn invalidates divergence analysis. shader-db: All Skylake and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20299612 -> 20299695 (<.01%) instructions in affected programs: 20829 -> 20912 (0.40%) helped: 6 / HURT: 13 total cycles in shared programs: 842149085 -> 842148399 (<.01%) cycles in affected programs: 15146222 -> 15145536 (<.01%) helped: 40 / HURT: 45 fossil-db: All Intel platforms had similar results. (Ice Lake shown) Totals: Instrs: 165505077 -> 165505603 (+0.00%); split: -0.00%, +0.00% Cycles: 15144183575 -> 15144235695 (+0.00%); split: -0.00%, +0.00% Spill count: 45213 -> 45220 (+0.02%) Fill count: 74166 -> 74184 (+0.02%) Totals from 94 (0.01% of 656116) affected shaders: Instrs: 263079 -> 263605 (+0.20%); split: -0.00%, +0.20% Cycles: 28411487 -> 28463607 (+0.18%); split: -0.18%, +0.37% Spill count: 3474 -> 3481 (+0.20%) Fill count: 6713 -> 6731 (+0.27%) Fixes: `6dbb5f1e07` ("intel/fs: rerun divergence analysis prior to convert_from_ssa") Closes: #10727 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28136>	2024-04-04 23:42:27 +00:00
Ian Romanick	87101e7d83	intel/compiler: Ensure load_barycentric_at_sample and load_interpolated_input remain together This previously worked by luck because we were incorrectly calling nir_opt_remove_phis before calling nir_convert_from_ssa. See also #10727. No shader-db or fossil-db changes on any Intel platform. v2: Handle the load_interpolated_input and load_barycentric_at_sample as separate passes. Based on discussion with Ken starting at https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28136#note_2330424. Fixes: `74a40cc4b6` ("intel/fs: move lower of non-uniform at_sample barycentric to NIR") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28136>	2024-04-04 23:42:27 +00:00
Nanley Chery	c6686fda28	intel/isl: Use Tile64 to align images for CCS WA See HSD 22015614752. We have issues when multiple engines access the same CCS cacheline in parallel. This can happen in a Vulkan application that uses different queues to operate on different subresources. To resolve this, this patch prefers Tile64 when an image has multiple subresources and disallows CCS if such an image lacks that tiling. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8614 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28284>	2024-04-04 15:17:50 +00:00
Nanley Chery	b092124186	intel/isl: Enable a 64KB alignment WA for flat-CCS WA 22015614752 applies to gfx125 platforms, but the alignment requirement was only enabled for the subset that has an aux-map. Adjust the condition to apply it where appropriate. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28284>	2024-04-04 15:17:50 +00:00
Nanley Chery	d7bfa8051e	intel/isl: Remove a CCS_D check from gfx12+ code This aux usage isn't used on gfx12+. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28284>	2024-04-04 15:17:50 +00:00
Nanley Chery	8845f1e439	intel/isl: Remove inconsistency when encoding Tile64 We guard surface state encoding of tilings by macros when the encoded value is not present on certain platforms. For gfx20 however, we added these macros even when the existing ones for gfx125 were sufficient. Remove the extra macros. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28284>	2024-04-04 15:17:50 +00:00
Nanley Chery	81d8c071ac	intel/isl: Remove inconsistency when choosing Tile64 We don't check the gfx version when choosing the tiling except when choosing Tile64. Drop the version check for consistency and to remove doubts about the order of operations occuring as expected within the CHOOSE macro. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28284>	2024-04-04 15:17:50 +00:00
Rohan Garg	57209a0c7a	isl: allow CCS on single sampled TILE64 surfaces Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23030>	2024-04-04 02:17:34 +00:00
Rohan Garg	afb63443a0	intel/blorp: add fast clear rectangle dimensions for single sampled TILE64 CCS surfaces Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23030>	2024-04-04 02:17:34 +00:00
José Roberto de Souza	a47a65c1c2	intel/genxml/xe2: Update definition of INTERFACE_DESCRIPTOR_DATA This maches specification and better matches the gfx 125 definition. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28505>	2024-04-03 20:21:04 +00:00
José Roberto de Souza	0f29b780e1	intel/genxml/gfx125: Fix definition of INTERFACE_DESCRIPTOR_DATA::Thread group dispatch size It was using the wrong platform definition that only had 1 bit, filtering by DG2/ACM it shows the correct definition. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28505>	2024-04-03 20:21:04 +00:00
José Roberto de Souza	c00c685f84	intel/genxml: Add more instdone registers Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28505>	2024-04-03 20:21:04 +00:00
José Roberto de Souza	2f3dc31876	anv: Set STATE_COMPUTE_MODE mask bit when zeroing compute mode Justing setting all zeroes to STATE_COMPUTE_MODE will do nothing, the mask of each register must be set for it to change. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28505>	2024-04-03 20:21:04 +00:00
Yonggang Luo	3114917986	util: Turn futex_wake parameter to int32_t for consistence across platforms Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28473>	2024-04-03 00:55:24 +00:00
Eric Engestrom	ff37f68740	meson: add VK_DRIVER_FILES to devenv, alongside the old VK_ICD_FILENAMES Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28516>	2024-04-02 18:08:52 +00:00
Eric Engestrom	96e8648b32	docs: replace references to the deprecated VK_INSTANCE_LAYERS with the new VK_LOADER_LAYERS_ENABLE Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28516>	2024-04-02 18:08:52 +00:00
Tapani Pälli	a87d888546	anv: disable fcv optimization on >= gfx125 Earlier strategy was to enable always on DG2 but there has been bunch of issues that indicate this feature is not working correctly. Disable until we figure out issues with it. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28184>	2024-04-02 09:28:18 +00:00
Sergi Blanch Torne	35a9e8577c	ci: Nightly run expectations update Reviewer the results from the last nightly run completed using ci-collate tool (gl.fd.o/gfx-ci/ci-collate) with the 'patch' feature and a bit of human intervention, these are the changes in the expectations. Signed-off-by: Sergi Blanch Torne <sergi.blanch.torne@collabora.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28350>	2024-04-02 07:52:42 +00:00
Kenneth Graunke	9e0d0190ea	intel/brw: Drop align16 support in brw_broadcast() align16 support is only used on Gen9 for 3-source instructions, quad swizzling, and dPdy calculations. We don't need it for broadcast. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28458>	2024-04-02 00:00:59 +00:00
Kenneth Graunke	a520c976a5	intel/brw: Drop dead CHV checks. This compiler no longer supports Cherryview. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28458>	2024-04-02 00:00:59 +00:00
Kenneth Graunke	e3d12cf72f	intel/brw: Don't mention gfx7 limitations in shuffle comments We don't support gfx7 here anymore, so we needn't consider it. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28458>	2024-04-02 00:00:59 +00:00
Kenneth Graunke	1d9e2b761a	intel/brw: Update comments for indirect MOV splitting brw_broadcast and generate_mov_indirect both had similar comments, both with typos ("insead"). One still referred to IVB bugs, while the other dropped that during the compiler split. The one that dropped the comment mentioned "both of these" issues, while citing only one issue; there was in fact a third issue (no-Q/UQ) that wasn't mentioned in either comment. One also had some bad grammar in the comments. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28458>	2024-04-02 00:00:59 +00:00
Kenneth Graunke	7a24f29fbb	intel/brw: Fix lower_regioning for BROADCAST, MOV_INDIRECT on Q types For BROADCAST and MOV_INDIRECT, required_exec_type was returning brw_int_type(type_sz(t), false), which is an unsigned type. However, get_exec_type(inst) returns the original type for either Q or UQ. This meant that has_invalid_exec_type would detect a mismatch and trigger lowering. That lowering would insert new 64-bit MOVs, which would need to be lowered on platforms which don't support Q/UQ. Except, we already ran that lowering pass earlier. So, the unlowered Q/UQ MOVs would reach the software scoreboarding pass, and trigger failures in the inferred_exec_pipe() function, as no pipe is available to handle 64-bit integer operations. It turns out that we don't need the region lowering pass to do anything for these opcodes. The generator code for both BROADCAST and MOV_INDIRECT already handle decomposing Q/UQ operations into 32-bit MOVs when they're not supported. And, it also implicitly converts to integer types, even for floating point sources. The inferred_exec_pipe function already special cases them to note that they'll always be handled on the integer pipe, so that matches. Just drop the region lowering code for these opcodes. Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28458>	2024-04-02 00:00:59 +00:00
Kenneth Graunke	a90edad9f7	intel/brw: Fix generate_mov_indirect to check has_64bit_int not float We are overriding the type to Q/UQ, so we need to split to two MOVs if 64-bit integer math is not supported. For reference, Meteorlake does support 64-bit floats but would still not work correctly here. See also brw_broadcast(), which does similar indirects but correctly checks has_64bit_int instead of has_64bit_float. Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28458>	2024-04-02 00:00:59 +00:00
Paulo Zanoni	817f74748f	anv/xe: don't overwrite the result from vk_sync_wait() The vk_sync_wait() function is already capable of returning some nice VkResult errors, don't lose information by replacing everything with vk_queue_set_lost. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28455>	2024-04-01 23:36:12 +00:00
Paulo Zanoni	38af7254e2	anv/xe: don't leak xe_syncs during trtt submission ==134077== 96 bytes in 1 blocks are definitely lost in loss record 1 of 3 ==134077== at 0x4840808: malloc (in /usr/libexec/valgrind/vgpreload_memcheck-amd64-linux.so) ==134077== by 0x6D6F690: vk_default_alloc (vk_alloc.c:26) ==134077== by 0x52EEEBE: vk_alloc (vk_alloc.h:48) ==134077== by 0x52EEEEE: vk_zalloc (vk_alloc.h:56) ==134077== by 0x52EF47E: xe_exec_process_syncs (anv_batch_chain.c:132) ==134077== by 0x52EF8F6: xe_execute_trtt_batch (anv_batch_chain.c:215) ==134077== by 0x5301670: anv_queue_submit_trtt_batch (anv_batch_chain.c:1697) ==134077== by 0x603D135: gfx125_write_trtt_entries (genX_cmd_buffer.c:6091) ==134077== by 0x5370B44: anv_sparse_bind_trtt (anv_sparse.c:595) ==134077== by 0x5370CFC: anv_sparse_bind (anv_sparse.c:629) ==134077== by 0x5370E6E: anv_init_sparse_bindings (anv_sparse.c:670) ==134077== by 0x5328037: anv_CreateBuffer (anv_device.c:5071) Note to backporters: this is only for when xe.ko is being used and ANV_SPARSE_USE_TRTT=1 is exported. This is not the regular code path. Fixes: `18bd00c024` ("anv/trtt: don't wait/signal syncobjs using the CPU anymore") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28455>	2024-04-01 23:36:12 +00:00
Eric Engestrom	51c589234d	isl: fix inline c identifier reference -> inline code Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28499>	2024-04-01 21:18:37 +00:00
Rohan Garg	3d68dd78d0	intel/eu/validate: Allow SIMD16 for mixed mode float operations on xe2+ Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28484>	2024-04-01 00:00:03 +00:00
Rohan Garg	a368d234c8	intel/brw: Lower DWORD scattered read writes to lsc Rework: * Francisco Jerez: Rebase on `07b9bfacc7` ("intel/compiler: Move logical-send lowering to a separate file") * Jordan: Move SHADER_OPCODE_DWORD_SCATTERED__LOGICAL from previous patch, as it seems to make more sense here. Jordan: Change `devinfo->has_lsc` ?: to if/else as suggested by idr Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28484>	2024-04-01 00:00:03 +00:00

1 2 3 4 5 ...

11735 commits