fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 22:28:06 +02:00

Author	SHA1	Message	Date
Sagar Ghuge	470bb614e0	blorp: Use the correct miptail start LOD for surfaces Use the correct miptail start LOD for the surfaces involved in the XY_BLOCK_COPY_BLT/XY_FAST_COLOR_BLT instructions. Thanks to Lionel for pointing out the issue. Fixes: `46f45d62d1` ("intel/isl: Start using miptails") Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25688>	2023-10-13 21:58:59 +00:00
Lionel Landwerlin	3f973a4f45	Revert "intel/fs: limit register flag interaction of FIND_LIVE_CHANNEL" This reverts commit `c9739e8912`. We don't have a full understanding of what is going on but reverting definitely fixes a hang. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c9739e8912` ("intel/fs: limit register flag interaction of FIND_LIVE_CHANNEL") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9868 Tested-By: Valentin Geyer <trayshar@t-online.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25563>	2023-10-13 08:37:28 +03:00
José Roberto de Souza	07eede0970	intel: Prepare implementation of Wa_18019816803 and Wa_16013994831 for future platforms Those workarounds are temporary for newer platforms so we can't use INTEL_NEEDS_WA_; luckily, those already had runtime checks. INTEL_NEEDS_WA_ was only used because it was accessing instructions or fields of the instructions that only exist in gfx12 or gfx125. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25685>	2023-10-13 03:54:50 +00:00
Jordan Justen	ee482ad660	anv/batch: Assert that extend_cb is non-NULL if the batch is out of space Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25672>	2023-10-13 01:46:58 +00:00
Jordan Justen	ef8dcb0aa4	anv/batch: Check if batch already has an error in anv_queue_submit_simple_batch() Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25672>	2023-10-13 01:46:58 +00:00
Lionel Landwerlin	fe05e6610b	anv: fixup spirv cap for ImageReadWithoutFormat on Gfx12.5 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2e2491b76c` ("anv: enable shaderStorageImageReadWithoutFormat on Gfx12.5+") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25573>	2023-10-12 18:57:55 +00:00
Sagar Ghuge	bed0542b2f	anv: Enable transfer queue only on ACM+ platforms On older platforms, we have the blitter engine, but it lacks compression handling and other features we need, unfortunately, so enable the transfer queue only on ACM+ platforms. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25667>	2023-10-12 17:24:06 +00:00
Lionel Landwerlin	ebb68d506d	anv: simplify push descriptors There can only be one push descriptor amongst all descriptor sets. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25598>	2023-10-11 20:45:03 +00:00
Rohan Garg	26c2c96d62	anv: enable FCV for Gen12.5 Now that we have proper handling of FCV_CCS_E everywhere, we can turn this on for Gen12.5. This helps fix a performance regression where enabling fast clears to non-zero values with CCS_E caused additional partial resolves, regressing performance on certain games. Performance is helped on the following games: - F1'22: +45% - RDR2: +6% Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25589>	2023-10-11 12:18:15 +00:00
Rohan Garg	8688a3b8f7	anv: ensure that FCV_CCS_E fast clears are properly tracked Surfaces with FCV_CCS_E aux usage should be marked as fast cleared when being rendered to, to ensure proper fast clear state tracking. We also need to ensure that we're not trying to partially resolve surfaces with level > 0 and layer > 0 since we don't track fast clear states for those. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25589>	2023-10-11 12:18:15 +00:00
Rohan Garg	300c98dbb2	intel/genxml: fix 3DSTATE_3D_MODE length to align with BSpec Closes: #8632 Fixes: `569afd37f1` ('intel/genxml: Copy gen12.xml to gen125.xml') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25589>	2023-10-11 12:18:15 +00:00
Lionel Landwerlin	29352b304b	anv: add support for VK_EXT_nested_command_buffer Our implementation of secondary command buffers already jumps into them and edits the end of the secondary command buffer to jump back into the primary. That implementation can work just the same with any levels of secondary. The only possible issue would happen with a secondary calling itself, but that's not possible. We also cannot support simultaneous execution with self-modifying command buffers. That's actually not a problem at the moment because we don't have multiple queues of the same family but we choose to reflect that in the feature bits. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25600>	2023-10-11 11:32:47 +00:00
Lionel Landwerlin	8a12286214	anv: rename primary in container in ExecuteCommands() Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25600>	2023-10-11 11:32:47 +00:00
David Heidelberg	5ab60581da	ci/traces: keep images for every job except the performance testing Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8354 Acked-by: Emma Anholt <emma@anholt.net> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25606>	2023-10-10 12:46:51 +00:00
Alyssa Rosenzweig	c39896b17b	nir: Use getters for nir_src::parent_* First, we need to give the parent_instr field a unique name to be able to replace with a helper. We have parent_instr fields for both nir_src and nir_def, so let's rename nir_src::parent_instr in preparation for rework. This was done with a combination of sed and manual fix-ups. Then we use semantic patches plus manual fixups: @@ expression s; @@ -s->renamed_parent_instr +nir_src_parent_instr(s) @@ expression s; @@ -s.renamed_parent_instr +nir_src_parent_instr(&s) @@ expression s; @@ -s->parent_if +nir_src_parent_if(s) @@ expression s; @@ -s.renamed_parent_if +nir_src_parent_if(&s) @@ expression s; @@ -s->is_if +nir_src_is_if(s) @@ expression s; @@ -s.is_if +nir_src_is_if(&s) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24671>	2023-10-10 04:58:05 -04:00
Iván Briano	54498937c5	intel/compiler: round f2f16 correctly for RTNE case v2: bcsel -> b2i32 (Ian) Fixes upcoming Vulkan CTS tests: dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_nostorage dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_vert dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_nostorage_vert dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_frag dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_nostorage_frag Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25281>	2023-10-09 23:37:52 +00:00
Iván Briano	919f5468eb	vulkan/runtime: add internal parameter to vk_spirv_to_nir If used to compile internal shaders, it will lack the flag while running through all the optimization passes it does. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25281>	2023-10-09 23:37:51 +00:00
Lionel Landwerlin	c8556a8f2e	anv: flag 3DSTATE_RASTER as dirty after simple shader primitive Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9899 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25618>	2023-10-09 16:09:48 +00:00
Lionel Landwerlin	d924b568ef	anv: fix a couple of missing input for 3DSTATE_RASTER programming Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25618>	2023-10-09 16:09:48 +00:00
Lionel Landwerlin	0985548204	anv: add missing workaround handling in simple shader It's not going to make any real difference because of the type of primitive used, but it feels safer to have this everywhere after a 3DPRIMITIVE. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25618>	2023-10-09 16:09:48 +00:00
Lionel Landwerlin	6bfa8850ab	anv: implement INTEL_DEBUG=reemit Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25618>	2023-10-09 16:09:48 +00:00
Vinson Lee	16fa4621b8	anv: Fix transfer type assert Fix defect reported by Coverity Scan. Constant expression result (CONSTANT_EXPRESSION_RESULT) always_true_or: The or condition type != ANV_TIMESTAMP_CAPTURE_AT_CS_STALL || type != ANV_TIMESTAMP_REWRITE_COMPUTE_WALKER will always be true because type cannot be equal to two different values at the same time, so it must be not equal to at least one of them. Fixes: `5112b42146` ("anv: Handle end of pipe with MI_FLUSH_DW on transfer queue") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25605>	2023-10-08 22:39:41 -07:00
Lionel Landwerlin	d091609d81	anv: fix index buffer size programming This is a merge issue due to 2 MRs touching the same code. Fixes a few maintenance5 tests like: dEQP-VK.robustness.bind_index_buffer2.offset_0.draw_indexed.oo_size Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25599>	2023-10-07 17:30:17 +00:00
Ian Romanick	bac10ef4aa	intel/fs: Add DP4A to get_lowered_simd_width While working on cooperative matrix support, I noticed some invalid DP4A instructions being generated. dp4a(32) g33<1>UD g21<8,8,1>UD g1.0<0,1,0>UD g9<1,1,1>UD This violates the constraint that the destination or a source can only access two consecutive GRFs. I'm a little surprised that validation didn't catch this. Perhaps because it's a 3 source instruction? Either way, it seems like a bigger project to fix that. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `0f809dbf40` ("intel/compiler: Basic support for DP4A instruction") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25554>	2023-10-07 02:27:53 +00:00
Eric Engestrom	82e342888f	ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone This test checks the driver's reported conformance version against the version of the CTS we're running. This check fails every few months and everyone has to go and bump the number in every driver. Running this check only makes sense while preparing a conformance submission, so skip it in the regular CI. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25519>	2023-10-06 17:37:20 +00:00
Lionel Landwerlin	596b438936	intel/ds: track acceleration RT commands Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25570>	2023-10-06 11:10:12 +00:00
Lionel Landwerlin	3e8d2617e1	anv: use buffer pools for BVH build buffers Private memory for BVH builds doesn't need to be mapped on the host; it's purely for use by the GPU. So it can be put into a different buffer pool that can be backed by VRAM-only buffers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25570>	2023-10-06 11:10:12 +00:00
Lionel Landwerlin	bab344645f	anv: move bo_pool allocation flags to init caller Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25570>	2023-10-06 11:10:12 +00:00
Lionel Landwerlin	787c29f2fc	anv: reduce working temporary memory for BVH builds Part of the memory allocated (private) is a temporary working buffer for the GRL kernels. Once the build operation is done, the buffer becomes unused. Rather than allocate a new buffer each time, reuse the current last allocated one if its size fits the next build operation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25570>	2023-10-06 11:10:11 +00:00
Caio Oliveira	81bc09bf97	intel/fs: Tweak default case of fs_inst::size_read() In the default case, there's a special case with a few conditions. Prefer the cheapest conditions first, so we can take advantage of short-circuiting. The effect is a small but still significant reduction in shader compilation times, as can be seen by: - Fossil replay for Rise of the Tomb Raider ``` Difference at 95.0% confidence -0.433333 +/- 0.028609 -1.42556% +/- 0.0941163% (Student's t, pooled s = 0.0337886) ``` - Fossil replay for Batman Arkham City ``` Difference at 95.0% confidence -8.84 +/- 0.146083 -1.65932% +/- 0.0274207% (Student's t, pooled s = 0.125423) ``` Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25549>	2023-10-06 09:16:56 +00:00
Emma Anholt	4b9c3c76d0	ci/hasvk: Add a bunch of new CTS border color fails. pretty sure this is from new coverage since the CTS uprev.. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25574>	2023-10-06 04:06:29 +00:00
Sviatoslav Peleshko	8361cd4c4c	intel/eu/validate: Validate "packed word exception" stricter Fixes: `75b7f5a2` ("i965: Validate "Region Alignment Rules"") Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25378>	2023-10-05 01:41:42 +00:00
Sviatoslav Peleshko	8f23b45252	intel/fs: Fix "packed word exception" condition for register regioning Fixes: `a6bf5f88` ("i965/fs: Enforce common regioning restrictions by SIMD splitting.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9432 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25378>	2023-10-05 01:41:42 +00:00
Emma Anholt	77b240a251	ci/anv: Drop the 16bit.scalar.13 skip. It's now at 1.5 sec on my ADL and CFL systems. Closes: #4641 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25509>	2023-10-03 19:38:39 +00:00
Emma Anholt	c742cba113	ci/anv: Drop incorrect xfail addition for TGL This xfails file is for deqp-vk. The xfail that was added was for angle-on-anv, which has its own expectations files and has this test recently listed as a flake already. Fixes: `a217c5c58c` ("ci: update to vulkan-cts-1.3.6.3") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25509>	2023-10-03 19:38:39 +00:00
Kenneth Graunke	17b8b2cffd	anv: Add support for a transfer queue on Alchemist Alchemist has an improved blitter that's sufficiently powerful to implement a transfer queue. Tigerlake's blitter lacks compression handling and other features we need, unfortunately. Rework (Sagar): - Check blitter command buffer in EndCommandBuffer v2: (Lionel) - Look at image, buffer and memory barriers as well - Flush cache if there is queue ownership transfer Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18325>	2023-10-03 18:02:52 +00:00
Sagar Ghuge	5112b42146	anv: Handle end of pipe with MI_FLUSH_DW on transfer queue Blitter command streamer supports MI_FLUSH_DW command so make sure we don't end up emitting pipe control with CS stall and also handle the end of pipe timestamp with MI_FLUSH_DW command. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18325>	2023-10-03 18:02:52 +00:00
Sagar Ghuge	9d7166dfc0	isl: Use 16-bit instead of 8-bits for surface format info fields Comparing uint8_t max value 255 with devinfo->verx10 will work fine for now but for future platforms, comparison will fail. To avoid this let's switch the field data type from 8-bits to 16-bits. v1: (Jordan) - Use 16 bits instead of 32 and add assertion. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25478>	2023-10-02 17:24:33 +00:00
Tapani Pälli	1c4d57568a	intel/genxml: remove HDC from gen11.xml, it is not available Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25399>	2023-10-02 12:05:54 +00:00
Tapani Pälli	99d3d76646	anv: HDC flush is available only for GFX_VER 12+ Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25399>	2023-10-02 12:05:53 +00:00
Tapani Pälli	524e8865ce	iris/anv: move Wa_14018912822 as a drirc workaround This should be toggled on only for applications that hit the issue. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9886 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25424>	2023-10-02 08:26:14 +00:00
Lionel Landwerlin	6ea2ea0bb0	anv: fix internal compute copy shader build Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9907 Fixes: `2cc5b3b1e0` ("anv: add a memcpy compute internal kernel") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25480>	2023-10-02 07:39:01 +00:00
Guilherme Gallo	6de10c3585	ci/anv: Catch some flakes Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25469>	2023-09-29 17:27:49 -03:00
Lionel Landwerlin	a25f96c00c	intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization SIMD1 instructions are problematic because they are considered partial writes. This increases the liveness of the destination register written by those instructions. To work around this we use UNDEF instructions to bound the liveness of the register. But this causes other issues like in this case : undef(1) vgrf2 mov(1) vgrf2, u4.0 add(1) vgrf3, vgrf2.0, 64UD In this case the copy propagation pass is unable to see that vgrf2 in the add() instruction can be replaced with the uniform u4.0. To fix this problem, we switch to NoMask SIMD8 instructions that cover the entire register. We can drop the UNDEF instructions and now copy propagation can do its job. Good results on 2 apps : Cyberpunk 2077 : Totals from 7258 (68.80% of 10549) affected shaders: Instrs: 6332210 -> 6073833 (-4.08%); split: -4.11%, +0.03% Cycles: 130667501 -> 127351268 (-2.54%); split: -3.12%, +0.58% Subgroup size: 90320 -> 90400 (+0.09%) Spill count: 90 -> 68 (-24.44%) Fill count: 82 -> 64 (-21.95%) Scratch Memory Size: 8192 -> 6144 (-25.00%) Max live registers: 385464 -> 375152 (-2.68%) Max dispatch width: 64336 -> 64424 (+0.14%); split: +0.96%, -0.82% Gaining 60 SIMD16/SIMD32 shaders, losing 33 Strange Brigade : Totals from 2137 (53.12% of 4023) affected shaders: Instrs: 1544031 -> 1457544 (-5.60%); split: -5.60%, +0.00% Cycles: 22292564 -> 21868978 (-1.90%); split: -2.43%, +0.53% Subgroup size: 25328 -> 25344 (+0.06%) Max live registers: 113716 -> 111214 (-2.20%) Max dispatch width: 17232 -> 18608 (+7.99%); split: +8.36%, -0.37% Gaining 138 SIMD16/SIMD32 shaders, losing 4 One app is slightly negatively affected : Dota2 : Totals from 232 (14.73% of 1575) affected shaders: Instrs: 30029 -> 28194 (-6.11%) Cycles: 385155 -> 371422 (-3.57%); split: -3.59%, +0.02% Max live registers: 6792 -> 6780 (-0.18%) Max dispatch width: 2256 -> 2160 (-4.26%) Losing 6 SIMD32 shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Lionel Landwerlin	d28f42f85d	intel/fs: handle add3 in surface/sampler rematerialization Some recent NIR changes started generating those instructions. We need to handle them to be able to rematerialize. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Lionel Landwerlin	05fd418e8b	intel/fs: handle ishl in surface/sampler rematerialization Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Sagar Ghuge	3d993e63bb	anv: Enable barrier handling on video engines v1: (Lionel) - Don't check for the layout transition Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9776 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25131>	2023-09-28 18:22:37 +00:00
Caio Oliveira	2d0f4f2c17	compiler/types: Add support for Cooperative Matrix types Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23825>	2023-09-28 07:35:02 +00:00
Paulo Zanoni	b75da97a1d	anv: enable sparse resources by default This of course only applies to xe.ko. There is no reason to keep it disabled by default. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Paulo Zanoni	7e2d8cced3	anv/sparse: add INTEL_DEBUG=sparse This pollutes stderr a lot, but I've used it countless times while developing this code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00

10396 commits