fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 13:48:06 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	c39896b17b	nir: Use getters for nir_src::parent_* First, we need to give the parent_instr field a unique name to be able to replace with a helper. We have parent_instr fields for both nir_src and nir_def, so let's rename nir_src::parent_instr in preparation for rework. This was done with a combination of sed and manual fix-ups. Then we use semantic patches plus manual fixups: @@ expression s; @@ -s->renamed_parent_instr +nir_src_parent_instr(s) @@ expression s; @@ -s.renamed_parent_instr +nir_src_parent_instr(&s) @@ expression s; @@ -s->parent_if +nir_src_parent_if(s) @@ expression s; @@ -s.renamed_parent_if +nir_src_parent_if(&s) @@ expression s; @@ -s->is_if +nir_src_is_if(s) @@ expression s; @@ -s.is_if +nir_src_is_if(&s) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24671>	2023-10-10 04:58:05 -04:00
Iván Briano	54498937c5	intel/compiler: round f2f16 correctly for RTNE case v2: bcsel -> b2i32 (Ian) Fixes upcoming Vulkan CTS tests: dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_nostorage dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_vert dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_nostorage_vert dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_frag dEQP-VK.spirv_assembly.instruction.graphics.float_controls.fp16.input_args.rounding_rte_conv_from_fp64_up_nostorage_frag Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25281>	2023-10-09 23:37:52 +00:00
Iván Briano	919f5468eb	vulkan/runtime: add internal parameter to vk_spirv_to_nir If used to compile internal shaders, it will lack the flag while running through all the optimization passes it does. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25281>	2023-10-09 23:37:51 +00:00
Lionel Landwerlin	c8556a8f2e	anv: flag 3DSTATE_RASTER as dirty after simple shader primitive Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9899 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25618>	2023-10-09 16:09:48 +00:00
Lionel Landwerlin	d924b568ef	anv: fix a couple of missing input for 3DSTATE_RASTER programming Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25618>	2023-10-09 16:09:48 +00:00
Lionel Landwerlin	0985548204	anv: add missing workaround handling in simple shader It's not going to make any real difference because of the type of primitive used, but it feels safer to have this everywhere after a 3DPRIMITIVE. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25618>	2023-10-09 16:09:48 +00:00
Lionel Landwerlin	6bfa8850ab	anv: implement INTEL_DEBUG=reemit Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25618>	2023-10-09 16:09:48 +00:00
Vinson Lee	16fa4621b8	anv: Fix transfer type assert Fix defect reported by Coverity Scan. Constant expression result (CONSTANT_EXPRESSION_RESULT) always_true_or: The or condition type != ANV_TIMESTAMP_CAPTURE_AT_CS_STALL \|\| type != ANV_TIMESTAMP_REWRITE_COMPUTE_WALKER will always be true because type cannot be equal to two different values at the same time, so it must be not equal to at least one of them. Fixes: `5112b42146` ("anv: Handle end of pipe with MI_FLUSH_DW on transfer queue") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25605>	2023-10-08 22:39:41 -07:00
Lionel Landwerlin	d091609d81	anv: fix index buffer size programming This is a merge issue due to 2 MRs touching the same code. Fixes a few maintence5 tests like : dEQP-VK.robustness.bind_index_buffer2.offset_0.draw_indexed.oo_size Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25599>	2023-10-07 17:30:17 +00:00
Ian Romanick	bac10ef4aa	intel/fs: Add DP4A to get_lowered_simd_width While working on cooperative matrix support, I noticed some invalid DP4A instructions being generated. dp4a(32) g33<1>UD g21<8,8,1>UD g1.0<0,1,0>UD g9<1,1,1>UD This violates the constraint that the destination or a source can only access two consecutive GRFs. I'm a little surprised that validation didn't catch this. Perhaps because it's a 3 source instruction? Either way, it seems like a bigger project to fix that. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `0f809dbf40` ("intel/compiler: Basic support for DP4A instruction") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25554>	2023-10-07 02:27:53 +00:00
Eric Engestrom	82e342888f	ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone This test checks the driver's reported conformance version against the version of the CTS we're running. This check fails every few months and everyone has to go and bump the number in every driver. Running this check only makes sense while preparing a conformance submission, so skip it in the regular CI. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25519>	2023-10-06 17:37:20 +00:00
Lionel Landwerlin	596b438936	intel/ds: track acceleration RT commands Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25570>	2023-10-06 11:10:12 +00:00
Lionel Landwerlin	3e8d2617e1	anv: use buffer pools for BVH build buffers Private memory for BVH builds doesn't need to be mapped on the host, it's purely for use by the GPU. So it can be put into a different buffer pool that can put into VRAM only buffers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25570>	2023-10-06 11:10:12 +00:00
Lionel Landwerlin	bab344645f	anv: move bo_pool allocation flags to init caller Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25570>	2023-10-06 11:10:12 +00:00
Lionel Landwerlin	787c29f2fc	anv: reduce working temporary memory for BVH builds Part of the memory allocated (private) is a temporary working buffer for the GRL kernels. Once the build operation is done, the buffer becomes unused. Rather than allocate a new buffer each time, reuse the current last allocated one if its size fits the next build operation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25570>	2023-10-06 11:10:11 +00:00
Caio Oliveira	81bc09bf97	intel/fs: Tweak default case of fs_inst::size_read() In the default case, there's a special case with a few conditions. Prefer the cheapest conditions first, so we can take advantage of short-circuiting. Effect is a small but still significant reduce in shader compilation times, as can be seen by: - Fossil replay for Rise of the Tomb Raider ``` Difference at 95.0% confidence -0.433333 +/- 0.028609 -1.42556% +/- 0.0941163% (Student's t, pooled s = 0.0337886) ``` - Fossil replay for Batman Arkham City ``` Difference at 95.0% confidence -8.84 +/- 0.146083 -1.65932% +/- 0.0274207% (Student's t, pooled s = 0.125423) ``` Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25549>	2023-10-06 09:16:56 +00:00
Emma Anholt	4b9c3c76d0	ci/hasvk: Add a bunch of new CTS border color fails. pretty sure this is from new coverage since the CTS uprev.. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25574>	2023-10-06 04:06:29 +00:00
Sviatoslav Peleshko	8361cd4c4c	intel/eu/validate: Validate "packed word exception" stricter Fixes: `75b7f5a2` ("i965: Validate "Region Alignment Rules"") Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25378>	2023-10-05 01:41:42 +00:00
Sviatoslav Peleshko	8f23b45252	intel/fs: Fix "packed word exception" condition for register regioning Fixes: `a6bf5f88` ("i965/fs: Enforce common regioning restrictions by SIMD splitting.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9432 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25378>	2023-10-05 01:41:42 +00:00
Emma Anholt	77b240a251	ci/anv: Drop the 16bit.scalar.13 skip. It's now at 1.5 sec on my ADL and CFL systems. Closes: #4641 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25509>	2023-10-03 19:38:39 +00:00
Emma Anholt	c742cba113	ci/anv: Drop incorrect xfail addition for TGL This xfails file is for deqp-vk. The xfail that was added was for angle-on-anv, which has its own expectations files and has this test recently listed as a flake already. Fixes: `a217c5c58c` ("ci: update to vulkan-cts-1.3.6.3") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25509>	2023-10-03 19:38:39 +00:00
Kenneth Graunke	17b8b2cffd	anv: Add support for a transfer queue on Alchemist Alchemist has an improved blitter that's sufficiently powerful to implement a transfer queue. Tigerlake's blitter lacks compression handling and other features we need, unfortunately. Rework (Sagar): - Check blitter command buffer in EndCommandBuffer v2: (Lionel) - Look at image, buffer and memory barriers as well - Flush cache if there is queue ownership transfer Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18325>	2023-10-03 18:02:52 +00:00
Sagar Ghuge	5112b42146	anv: Handle end of pipe with MI_FLUSH_DW on transfer queue Blitter command streamer supports MI_FLUSH_DW command so make sure we don't end up emitting pipe control with CS stall and also handle the end of pipe timestamp with MI_FLUSH_DW command. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18325>	2023-10-03 18:02:52 +00:00
Sagar Ghuge	9d7166dfc0	isl: Use 16-bit instead of 8-bits for surface format info fields Comparing uint8_t max value 255 with devinfo->verx10 will work fine for now but for future platforms, comparison will fail. To avoid this let's switch the field data type from 8-bits to 16-bits. v1: (Jordan) - Use 16 bits instead of 32 and add assertion. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25478>	2023-10-02 17:24:33 +00:00
Tapani Pälli	1c4d57568a	intel/genxml: remove HDC from gen11.xml, it is not available Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25399>	2023-10-02 12:05:54 +00:00
Tapani Pälli	99d3d76646	anv: HDC flush is available only for GFX_VER 12+ Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25399>	2023-10-02 12:05:53 +00:00
Tapani Pälli	524e8865ce	iris/anv: move Wa_14018912822 as a drirc workaround This should be toggled on only for applications that hit the issue. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9886 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25424>	2023-10-02 08:26:14 +00:00
Lionel Landwerlin	6ea2ea0bb0	anv: fix internal compute copy shader build Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9907 Fixes: `2cc5b3b1e0` ("anv: add a memcpy compute internal kernel") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25480>	2023-10-02 07:39:01 +00:00
Guilherme Gallo	6de10c3585	ci/anv: Catch some flakes Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25469>	2023-09-29 17:27:49 -03:00
Lionel Landwerlin	a25f96c00c	intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization SIMD1 instructions are problematic because they are considered partial writes. This increases the liveness of the destination register written by those instructions. To workaround this we use UNDEF instructions to bound the liveness of the register. But this causing other issues like in this case : undef(1) vgrf2 mov(1) vgrf2, u4.0 add(1) vgrf3, vgrf2.0, 64UD In this case the copy propagation pass in unable to see that vgrf2 in the add() instruction can be replaced with the uniform u4.0. To fix this problem, we switch NoMask SIMD8 instructions that cover the entire register. We can drop the UNDEF instructions and now copy propagation can do its job. Good results on 2 apps : Cyberpunk 2077 : Totals from 7258 (68.80% of 10549) affected shaders: Instrs: 6332210 -> 6073833 (-4.08%); split: -4.11%, +0.03% Cycles: 130667501 -> 127351268 (-2.54%); split: -3.12%, +0.58% Subgroup size: 90320 -> 90400 (+0.09%) Spill count: 90 -> 68 (-24.44%) Fill count: 82 -> 64 (-21.95%) Scratch Memory Size: 8192 -> 6144 (-25.00%) Max live registers: 385464 -> 375152 (-2.68%) Max dispatch width: 64336 -> 64424 (+0.14%); split: +0.96%, -0.82% Gaining 60 SIMD16/SIMD32 shaders, loosing 33 Strange Brigade : Totals from 2137 (53.12% of 4023) affected shaders: Instrs: 1544031 -> 1457544 (-5.60%); split: -5.60%, +0.00% Cycles: 22292564 -> 21868978 (-1.90%); split: -2.43%, +0.53% Subgroup size: 25328 -> 25344 (+0.06%) Max live registers: 113716 -> 111214 (-2.20%) Max dispatch width: 17232 -> 18608 (+7.99%); split: +8.36%, -0.37% Gaining 138 SIMD16/SIMD32 shaders, loosing 4 On app slightly negatively affected : Dota2 : Totals from 232 (14.73% of 1575) affected shaders: Instrs: 30029 -> 28194 (-6.11%) Cycles: 385155 -> 371422 (-3.57%); split: -3.59%, +0.02% Max live registers: 6792 -> 6780 (-0.18%) Max dispatch width: 2256 -> 2160 (-4.26%) Loosing 6 SIMD32 shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Lionel Landwerlin	d28f42f85d	intel/fs: handle add3 in surface/sampler rematerialization Some recent NIR changes started generated those instructions. We need to handle them to be able to rematerialize. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Lionel Landwerlin	05fd418e8b	intel/fs: handle ishl in surface/sampler rematerialization Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24554>	2023-09-29 10:46:47 +00:00
Sagar Ghuge	3d993e63bb	anv: Enable barrier handling on video engines v1: (Lionel) - Don't check for the layout transition Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9776 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25131>	2023-09-28 18:22:37 +00:00
Caio Oliveira	2d0f4f2c17	compiler/types: Add support for Cooperative Matrix types Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23825>	2023-09-28 07:35:02 +00:00
Paulo Zanoni	b75da97a1d	anv: enable sparse resources by default This of course only applies to xe.ko. There is no reason to keep it disabled by default. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Paulo Zanoni	7e2d8cced3	anv/sparse: add INTEL_DEBUG=sparse This pollutes stderr a lot, but I've used it countless times while developing this code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Paulo Zanoni	2bdd01187d	anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind Game testing shows it's common for this operation to result in multiple bind regions, so try to use a single ioctl when we can. Actual testing reveals 136 shader-related tests fail when we actually do this, so for now keep doing a single bind per ioctl while leaving a very easy way to the desired behavior when we figure this out. It should also be possible to go even higher-level and do this at the anv_queue_submit_sparse_bind_locked() layer, but that should happen in future commits. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Paulo Zanoni	6368c1445f	anv/sparse: add the initial code for Sparse Resources This giant patch implements a huge chunk of the Vulkan Sparse Resources API. I previously had this as a nice series of many smaller patches that evolved as the xe.ko added more features, but once I was asked to squash some of the major reworks I realized I wouldn't be able easily rewrite history, so I just squased basically the whole series into a giant patch. I may end up splitting this again later if I find a way to properly do it. If we want to support the DX12 API through vkd3d we need to support part of the the Sparse Resources API. If we don't, a bunch of Steam games won't work. For now we only support the xe.ko backend, but the vast majority of the code is KMD-independent and so an i915.ko implementation would use most of what's here, just extending the part that binds and unbinds memory. v2+: There's no way to sanely track the version history of this patch in this commit message. Please refer to Gitlab. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Paulo Zanoni	e4598f0eea	intel/isl: simplify the check for maximum surface size The only thing that changes between these 3 checks is the size. This entire patch was suggested by Kenneth Graunke, I just converted his gitlab comment to a git commit. Credits-to: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Paulo Zanoni	0de5d142e8	intel/isl: add ISL_SURF_USAGE_SPARSE_BIT Vulkan Sparse resources have their own set of rules, so here we try to make ISL aware of them through ISL_SURF_USAGE_SPARSE_BIT. The big deal here is when some image ends up not using Tile64 nor TileYs. Previously Ys was not supported on TGL at all, and Tile64 did not have support for 3D. Now we still have some formats that end up not being used with either Tile64 and Ys, but need to support Sparse on them (e.g., YUV on Tile64). In the future we may have new tiling formats or hardware restrictions that would force this case to happen again. So here we do some adjustments so we can make sparse work with other tiling formats, although with limited functionality (e.g., those formats may be restricted to opaque binds, and certainly don't support the standard block shapes). v2: before we had Ys support, we had defined TGL's block size as 4k. v3: move the size_B chunk to before nte notify_failure() checks (Ken). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23045>	2023-09-28 06:16:40 +00:00
Marcin Ślusarz	ea92bd8d44	intel/compiler: mask GS URB handles at thread payload construction Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Marcin Ślusarz	815eee10e0	intel/compiler/mesh: implement IO for xe2 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Marcin Ślusarz	ee4214de6e	intel/compiler/mesh: fix position of output URB handle for xe2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Francisco Jerez	7f3dc4505d	intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB reads. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Francisco Jerez	53d1d793cb	intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB writes. Rework: * Marcin: update emit_urb_indirect_vec4_write Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Francisco Jerez	34a2c9ce35	intel/fs: Specify number of data components of logical URB writes via control immediate. This is what most logical SEND messages do when they take a variable number of components. 'inst->mlen' is expected to be zero for logical SEND opcodes, which are expected to behave like plain arithmetic operations, so certain automated transformations (like SIMD lowering) can manipulate them without opcode-specific special-casing. Guessing the number of components from 'inst->mlen' has other disadvantages, because it requires duplicating the logic that infers the message payload size in every use of the instruction -- Instead we can just do the computation once during logical send lowering. In addition on LNL platform this causes the 'inst->mlen' field of URB writes to have units inconsistent with every other SEND instruction, which is likely to lead to confusion and bugs down the road. Rework: * Marcin: update emit_urb_indirect_vec4_write Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Francisco Jerez	74c9973c0b	intel/fs/xe2+: Fix URB writes with 0 data components. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Caio Oliveira	c89597085a	intel/compiler/xe2: Update TCS ICP handle code to support SIMD16 Rework: * Use ffs(grf_size_bytes) (s-b Ken) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Caio Oliveira	f0fcb778b4	intel/compiler/xe2: Fix URB writes in TCS Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00
Caio Oliveira	0c03018abf	intel/compiler/xe2: URB fence uses LSC now Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25195>	2023-09-27 23:57:25 +00:00

1 2 3 4 5 ...

10382 commits