fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-29 01:08:12 +02:00

Author	SHA1	Message	Date
Qiang Yu	3151f5ec47	nir: add filter parameter to nir_lower_array_deref_of_vec To be used by latter commits to limit the lowering to specific variables. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
Jianxun Zhang	870be63c7e	anv: Disable tracking of clear color on color attachment Xe2+ platforms don't need it because of its new fast-clear and compression design. Fixes: Vulkan CTS dEQP-VK.pipeline.pipeline_library.multisample. sample_locations_ext.draw.depth.samples_4. separate_subpass_clear_attachments src/intel/vulkan/anv_private.h:5439: anv_image_get_fast_clear_type_addr: Assertion `device->info->ver < 20' failed. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29966>	2024-07-03 00:55:13 +00:00
Jianxun Zhang	bd05ef9d91	anv: Support arbitrary fast-clear value on all layouts (xe2) Xe2+ platforms don't use fast-type buffer for its new design. We don't have to track different fast-clear types, so we just return the highest level of support. Fixes: Vulkan CTS dEQP-VK.api.copy_and_blit.core.resolve_image.whole_array_image _one_region.8_bit_not_all_remaining_layers src/intel/vulkan/anv_private.h:5439: anv_image_get_fast_clear_type_addr: Assertion `device->info->ver < 20' failed. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29966>	2024-07-03 00:55:13 +00:00
Jianxun Zhang	4034539c00	anv: Fix Vulkan CTS failure related to MCS (xe2) Fixes: Vulkan CTS dEQP-VK.pipeline.monolithic.multisample.sampled_image.79x31_1.r32_uint.samples_2 src/intel/vulkan/anv_private.h:5439: anv_image_get_fast_clear_type_addr: Assertion `device->info->ver < 20' failed. deqp-vk: ../src/intel/vulkan/genX_cmd_buffer.c:1263: transition_color_buffer: Assertion `must_init_fast_clear_state' failed. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29966>	2024-07-03 00:55:13 +00:00
Jianxun Zhang	beb0ea2469	anv: Disable tracking fast clear and aux state (xe2) Xe2+ doesn't use aux tracking buffers, and we should not have access to the fast-clear type and compression state. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29966>	2024-07-03 00:55:13 +00:00
Mauro Rossi	87ad3ca0ac	intel/common: fix building error in intel_common.c Fixes the following building error: ../out_src/src/intel/common/intel_common.c:29:4: error: implicit declaration of function 'free' is invalid in C99 [-Werror,-Wimplicit-function-declarat ion] free(engine_info); ^ 1 error generated. Fixes: `5b8b4f78` ("intel/dev: Add engine_class_supported_count to intel_device_info") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29975>	2024-07-02 23:35:26 +00:00
Iván Briano	9a68be59ca	anv: enable VK_KHR_maintenance7 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29968>	2024-07-02 20:24:43 +00:00
Jianxun Zhang	597c6cdf20	isl: Add some formats not covered in CMF table (xe2) The CMF values of these formats are not explicitly defined in the spec. Refer to the added comment for more details. Fixed Piglit tests: [ISL_FORMAT_L8A8_UNORM_SRGB] getteximage-formats -auto -fbo [ISL_FORMAT_L8_UNORM_SRGB] teximage-colors GL_SLUMINANCE8 -auto -fbo [ISL_FORMAT_R9G9B9E5_SHAREDEXP] fbo-generatemipmap-3d RGB9_E5 -auto -fbo src/intel/isl/isl_genX_helpers.h:322: isl_get_render_compression_format: Assertion `!"" "Unsupported render compression format!"' failed. Also bump up Bspec revision in comments. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28620>	2024-07-02 19:03:19 +00:00
Jianxun Zhang	77c83069ad	intel/dev: Select a compressed PAT entry (xe2) Fix glxgears (LNL) glxgears: xe/iris_kmd_backend.c:81: xe_gem_create: Assertion `!"" "missing"' failed. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28620>	2024-07-02 19:03:19 +00:00
Jianxun Zhang	c9ee484f21	blorp: Ensure MSAA fast clear in correct modes (xe2) Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28620>	2024-07-02 19:03:19 +00:00
Renato Pereyra	de0d237ab0	intel/perf: Move sysmacros.h include from header to implementation sysmacros.h defines macros `minor()` and `major()`. These macros conflict with a definition of `minor()` in the Perfetto SDK header. Move the sysmacros.h include to intel_perf.c because the Perfetto header is only included at the same time as intel_perf.h not *.c (in intel_driver_ds.cc). Unbeknown to anyone, the definition of `minor()` in the Perfetto header is being replaced with the macro. See the MR attachment for an example. Signed-off-by: Renato Pereyra <renatopereyra@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29974>	2024-07-01 22:02:49 +00:00
Paulo Zanoni	4aa3b2d3ad	anv: LNL+ doesn't need the special flush for sparse Newer hardware is smart enough to know that if something writes to a NULL tile and immediately reads back the value (from the cache), the value should read back as zero, not whatever was written to the cache but not the memory. Due to that, we don't need to flush the tile cache, which is quite expensive. Link: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11029 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29953>	2024-07-01 21:28:26 +00:00
Sagar Ghuge	99ce8b5a07	intel/compiler: Add indirect mov lowering pass Indirect addressing(vx1 and vxh) not supported with UB/B datatype for src0, so we need to change the data type for both dest and src0. This fixes following tests cases on Xe2+ - dEQP-VK.spirv_assembly.instruction.compute.8bit_storage.push_constant_8_to_16* - dEQP-VK.spirv_assembly.instruction.compute.8bit_storage.push_constant_8_to_32* Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29316>	2024-07-01 19:06:31 +00:00
Kenneth Graunke	1e69ec3b8d	intel/brw: Add a lower_csel pass and allow building it for all types We can do CSEL on F, HF, W, and D on Gfx11+. Gfx9 can only do F. We can lower unsupported types to CMP+CSEL, allowing us to use CSEL in the IR and not worry about the limitations. Rework: (Sagar) - Update validation pass for CSEL Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29316>	2024-07-01 19:06:31 +00:00
Dylan Baker	dc604f340a	anv/grl: add some validation that we're not going to overflow Coverity has spotted a place where we could in theory overflow. In reality it wont happen as the potential overflow is a bitfield with a maximum of two values. Add an `assume()` statement to help out the compiler and document our assumption. fixes: `dc1aedef2b` Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29825>	2024-07-01 18:11:38 +00:00
Lionel Landwerlin	884397b587	anv: workaround flaky xfb query results on Gfx11 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29836>	2024-07-01 09:04:12 +00:00
Lionel Landwerlin	b8f8926026	anv: emit the right shader instruction for protected mode Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29778>	2024-07-01 06:48:06 +00:00
Lionel Landwerlin	57e74d7b56	anv: allocate compute scratch using the right scratch pool Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29778>	2024-07-01 06:48:06 +00:00
Lionel Landwerlin	3ccf80f9b1	anv: prepare 2 variants of all shader instructions One variant uses a protected scratch surface the other not. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29778>	2024-07-01 06:48:06 +00:00
Lionel Landwerlin	08a4e0a2e3	anv: add a protected scratch pool Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29778>	2024-07-01 06:48:06 +00:00
David Heidelberg	68215332a8	build: pass licensing information in SPDX form Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Dylan Baker <dylan.c.baker@intel.com> Acked-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29972>	2024-06-29 12:42:49 -07:00
José Roberto de Souza	3b6e2475e4	intel/perf: Enable perf on Xe KMD Support was added in the previous patches, so this check can now be removed. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	936e87a7f9	anv: Implement Xe KMD query pools Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	3c1b545057	intel/perf: Implement Xe KMD perf stream read Xe KMD perf stream reads just returns the samples, there is no header. For error checking there is other uAPI that is not handled here yet. So to mantain compatibility here reading the perf stream, adding a header then copying the sample. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	da63c54db5	intel/perf: Remove i915_drm.h includes from common code Only place that still has i915_drm.h includes in common code is intel_perf_query.c. This are the last i915_drm.h includes in headers in common code \o/. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	c2fd848002	intel/perf: Refactor and add Xe KMD support to change stream metrics id Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	b22899b494	intel/perf: Refactor and add Xe KMD support to enable and disable perf stream Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	981090f173	intel/perf: Add Xe KMD perf stream open function Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	6258c84375	intel/perf: Refactor and add Xe KMD support to add and remove configs Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	0e68d7a735	intel/perf: Replace i915_perf_version and i915_query_supported by a feature bitmask Replacing the i915_perf_version that is i915 specific by a feature mask makes easier to support Xe KMD. Also this allow us to group a bool and a int into a single enum(int). No changes in behavior is expected here. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	a56b085661	intel/perf: Add function to check if OA/perf is supported by Xe KMD This is a uAPI added after initial Xe KMD upstreaming so not supported by every version, also by default it requires high privilege permissions so it check if current applications has it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
José Roberto de Souza	f0c62b6438	intel/perf: Implement function that returns OA format for Xe KMD Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29312>	2024-06-29 01:17:37 +00:00
Sushma Venkatesh Reddy	d52dd5a9e9	anv/drirc: add option to provide low latency hint GuC offers a mechanism for KMD/UMD to provide workload hints and one of that strategy is low latency hint. We can utilize this hint when the workload is more latency sensitive like compute usecases. Signed-off-by: Sushma Venkatesh Reddy <sushma.venkatesh.reddy@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28282>	2024-06-28 21:45:59 +00:00
Caio Oliveira	6dc7f65a39	anv: Use brw_nir_lower_cs_intrinsics for lowering Mesh/Task LocalID Stop using the option in the generic pass nir_lower_compute_system_values and use the same code as brw uses for compute instead. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29828>	2024-06-28 16:30:38 +00:00
Caio Oliveira	d89bfb1ff7	intel/brw: Reorganize lowering of LocalID/Index to handle Mesh/Task Reorganize the code to make clearer all the lowering cases: (a) Single invocation workgroup. Index and IDs are all zero. (b) Local ID provided by hardware. (c) Local Index provided by the hardware. Depending on the case this might not be the final local index, e.g. heuristics for tile. (d) Neither provided by the hardware. Case (c) is new and supported by Mesh/Task shaders. At the moment the nir_lower_compute_system_values handle lowering of LocalID for Task/Mesh, but a later patch will flip that on ANV. This will make the Task/Mesh use the same lowering as Compute shaders. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29828>	2024-06-28 16:30:38 +00:00
Sagar Ghuge	edcad250ed	intel/compiler: Don't use half float param for sample_b Looks like some of the tests uses the bias which does not fit into half float parameter, so it's better to use float param for sample_b. If we have cube arrays, we anyway combine BIAS and array index properly so we don't have to worry about the first parameter. This fixes: GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_clamp_m_g_M Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29533>	2024-06-28 03:33:18 +00:00
Dylan Baker	35298e84f1	intel/compiler: move predicated_break out of backend loop This has no impact on the generated shaders, but does have a small (positive) impact on the amount of time spent in shader compilation. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29126>	2024-06-27 15:20:19 -07:00
Jordan Justen	7b3149c99b	intel/brw: Retype some regs to BRW_TYPE_UD for Xe2 indirect accesses Following https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28957, some Xe2 code paths started triggering asserts. In the cases fixed by this patch, it was because of the assert added to brw_type_larger_of() in `cf8ed9925f` ("intel/brw: Make a helper for finding the largest of two types"), and then brw_type_larger_of() is used in `674e89953f`. (For example, the assert was triggering when the SHL types differed between D and UD.) Fixes: `674e89953f` ("intel/brw: Use new builder helpers that allocate a VGRF destination") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29925>	2024-06-27 21:51:07 +00:00
Paulo Zanoni	746f41e705	anv: properly store the engine_class_supported_count values Function anv_physical_device_try_create() creates the devinfo variable and then at some point it copies its contents to device->info: device->info = devinfo; Much much later we're calling: intel_common_update_device_info(fd, &devinfo); ... which is updating devinfo but not device->info. As a consequence, we're only creating one queue, as engine_class_supported_count[klass] is zero for everybody. Fixes: `5b8b4f7878` ("intel/dev: Add engine_class_supported_count to intel_device_info") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29927>	2024-06-27 20:19:39 +00:00
Lionel Landwerlin	cff6df7e11	anv: limit vertex fetch invalidation on indirect read Only used on Gfx9 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29810>	2024-06-27 19:01:50 +00:00
Ian Romanick	531461d576	intel/brw: Test corner case CSE of ADD3 instructions When the destination of both instructions is NULL and the conditional modifier matches, operands_match (by way of instructions_match) will only test the first two operands. This can result in bad CSE happening. This is a very, very narrow edge case. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29848>	2024-06-27 18:34:53 +00:00
Kenneth Graunke	7adccbd48d	intel/brw: Support CSE of ADD3 This one is a bit more complex in that we need to handle 3-source commutative opcodes. But it's also quite useful: fossil-db results on Alchemist (A770): Instrs: 151659750 -> 150164959 (-0.99%); split: -0.99%, +0.01% Cycles: 12822686329 -> 12574996669 (-1.93%); split: -2.05%, +0.12% Subgroup size: 7589608 -> 7589592 (-0.00%) Send messages: 7375047 -> 7375053 (+0.00%); split: -0.00%, +0.00% Loop count: 46313 -> 46315 (+0.00%); split: -0.01%, +0.01% Spill count: 110184 -> 54670 (-50.38%); split: -50.79%, +0.41% Fill count: 213724 -> 104802 (-50.96%); split: -51.43%, +0.47% Scratch Memory Size: 9406464 -> 3375104 (-64.12%); split: -64.35%, +0.23% Our older Shadow of the Tomb Raider fossil is particularly helped with over a 90% reduction in scratch access (spills, fills, and scratch size). However, benchmarking in the actual game shows no change in performance. We're thinking the game's shaders have been updated since our capture. Ian noted that there was a bug here where we'd accidentally CSE two ADD3 instructions with null destinations and different src[2] that couldn't be dead code eliminated due to conditional mods. However, this is only a bug in the new cse_defs pass so we don't need to nominate this for stable branches. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29848>	2024-06-27 18:34:53 +00:00
Tapani Pälli	a603cc0633	anv: move some pc was to batch_emit_pipe_control_write These were only applied in emit_apply_pipe_flushes but in theory could be required for some other individually shot pipe controls. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29897>	2024-06-27 09:02:03 +00:00
Georg Lehmann	3bfba9c565	iris/ci: update trace checksums There is a small difference, but it looks like a minor precision change to me. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Francisco Jerez	01118a3fbb	anv/xe2+: Align push constant ranges to GRF boundaries. This fixes corruption of push constants on Xe2 due to a mismatch in the uniform layout implemented by the compiler and assumed by the driver. To fix it we need to align the push constant ranges computed by the Vulkan driver to a multiple of the GRF size of the platform. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29926>	2024-06-27 07:39:17 +00:00
Francisco Jerez	039f4fe25e	intel/dev: Add GRF size information to the intel_device_info struct. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29926>	2024-06-27 07:39:17 +00:00
Francisco Jerez	79fa3eba11	intel/fs/xe2+: Add ALU-based implementation of barycentric interpolation at a per-channel sample. This implements a replacement for the previous implementation of nir_intrinsic_load_barycentric_at_sample that relied on the Pixel Interpolator shared function, since it's going to be removed from the hardware from Xe2 onwards. This implementation simply looks up the X/Y offsets of each sample index on the table provided in the PS thread payload by using indirect addressing, then does the actual interpolation by recursing into emit_pixel_interpolater_alu_at_offset() introduced in the previous commit. Note that even though this is only immediately useful on Xe2+ platforms there's no reason why it shouldn't work on earlier platforms, as long as we have the sample X/Y offsets available in the thread payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29847>	2024-06-27 00:18:00 +00:00
Francisco Jerez	95eec5a0dd	intel/fs/xe2+: Add ALU-based implementation of barycentric interpolation at a per-channel offset. This implements a replacement for the previous implementation of nir_intrinsic_load_barycentric_at_offset that relied on the Pixel Interpolator shared function, since it's going to be removed from the hardware from Xe2 onwards. That's okay since we can get all the primitive setup information needed for interpolation at an arbitrary coordinate: We use the X/Y offset relative to the "X/Y Start" coordinates from the thread payload order to evaluate the plane equations also provided in the thread payload for each barycentric coordinate of each polygon. The evaluation of the barycentric plane equations (and the RHW plane equation for perspective-correct interpolation) uses the accumulator and MAD/MAC for ALU efficiency, but that means we need to manually split instructions to fit the width of the accumulator. The division and scaling for perspective-correct interpolation is also now done in the shader if necessary. Note that even though this is only immediately useful on Xe2+, the thread payload numbers are filled out for older platforms, and the EU restrictions of previous Xe platforms are taken into account, mostly for the purposes of testing and performance evaluation. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29847>	2024-06-27 00:18:00 +00:00
Francisco Jerez	e8007c9325	intel/fs/xe2+: Don't lower barycentric load offsets to fixed-point format on Xe2+. Floating-point offsets work fine in combination with the floating-point arithmetic we're about to lower these intrinsics into, and they require less instructions than converting to fixed-point and then back. No reason to take the precision/range hit nor the extra instructions. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29847>	2024-06-27 00:18:00 +00:00
Francisco Jerez	04b5b8b9ec	anv/gfx11+: Request PS payload fields for ALU-based interpolation via 3DSTATE_PS_EXTRA. Plumb the prog_data bits recently introduced for ALU-based interpolation down to 3DSTATE_PS_EXTRA emission in the Vulkan driver. Even though this is only going to be used on Xe2+ for now there seems to be no reason not to plumb the bits on all platforms back to gfx11, since the 3DSTATE_PS_EXTRA enables already existed on ICL. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29847>	2024-06-27 00:18:00 +00:00

... 57 58 59 60 61 ...

15202 commits