fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-25 20:00:37 +02:00

Author	SHA1	Message	Date
Rhys Perry	a53d3ff0b3	nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24235>	2023-07-24 14:06:16 +01:00
Rhys Perry	21f0aca948	nir/opt_dead_cf: remove nodes after a jump earlier In the case of: halt // succs: b9 if %618 { block b3:// preds: break // succs: b6 } else { block b4: // preds: , succs: b5 } block b5: // preds: b4 32 %556 = iadd %617, %2 (0x1) opt_constant_if() doesn't work because stitch_blocks() can't join blocks if the before ends in a jump and the after isn't empty. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24235>	2023-07-24 14:06:16 +01:00
Konstantin Seurer	1c8577b493	nir/tests: Use a single binary Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24249>	2023-07-24 11:44:46 +00:00
Konstantin Seurer	6eb0a3a5b7	nir/tests: Refactor boilerplate into a common header Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24249>	2023-07-24 11:44:46 +00:00
Danylo Piliaiev	eeb1fd90fc	tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults Same cause as for other R8G8 formats - msaa resolve via blit event causes gpu fault. Fixes: dEQP-VK.api.image_clearing..clear_color_attachment..r8g8_srgb_* Fixes: `029919f3c8` ("tu: allow using resolve engine for SRGB MSAA resolves") Cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24277>	2023-07-24 10:13:49 +00:00
Charles Giessen	f3d948eb6c	panvk: Use 1.0 in ICD Manifest json PanVK downgraded from supporting Vulkan 1.1 to 1.0, but did not change their ICD Manifest api_version to reflect that. This causes the Vulkan-Loader to interpret the ICD as a 1.1 driver erroneously. Originally discussed in this issue https://github.com/KhronosGroup/Vulkan-Loader/issues/1242 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24289>	2023-07-24 08:24:13 +00:00
Marcin Ślusarz	48885c7fe3	intel/compiler: load debug mesh compaction options once Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20407>	2023-07-24 07:55:29 +00:00
Marcin Ślusarz	c1685f08dd	intel/compiler,anv: put some vertex and primitive data in headers Both per-primitive and per-vertex space is allocated in MUE in 8 dword chunks and those 8-dword chunks (granularity of 3DSTATE_SBE_MESH.Per[Primitive|Vertex]URBEntryOutputReadLength) are passed to fragment shaders as inputs (either non-interpolated for per-primitive and flat vertex attributes or interpolated for non-flat vertex attributes). Some attributes have a special meaning and must be placed in separate 8/16-dword slot called Primitive Header or Vertex Header. Primitive Header contains 4 such attributes (Cull Primitive, ViewportIndex, RTAIndex, CPS), leaving 4 dwords (the rest of 8-dword slot) potentially unused. Vertex Header is similar - it starts with 3 unused dwords, 1 dword for Point Size (but if we declare that shader doesn't produce Point Size then we can reuse it), followed by 4 dwords for Position and optionally 8 dwords for clip distances. This means we have an interesting optimization problem - we can put some user attributes into holes in Primitive and Vertex Headers, which may lead to smaller MUE size and potentially more mesh threads running in parallel, but we have to be careful to use those holes only when we need it, otherwise we could force HW to pass too much data to fragment shader. Example 1: Let's assume that Primitive Header is enabled and user defined 12 dwords of per-primitive attributes. Without packing we would consume 8 + ALIGN(12, 8) = 24 dwords of MUE space and pass ALIGN(12, 8) = 16 dwords to fragment shader. With packing, we'll consume 4 + 4 + ALIGN(12 - 4, 8) = 16 dwords of MUE space and pass ALIGN(4, 8) + ALIGN(12 - 4, 8) = 16 dwords to fragment shader. 16/16 is better than 24/16, so packing makes sense. Example 2: Now let's assume that Primitive Header is enabled and user defined 16 dwords of per-primitive attributes. Without packing we would consume 8 + ALIGN(16, 8) = 24 dwords of MUE space and pass ALIGN(16, 8) = 16 dwords to fragment shader. With packing, we'll consume 4 + 4 + ALIGN(16 - 4, 8) = 24 dwords of MUE space and pass ALIGN(4, 8) + ALIGN(16 - 4, 8) = 24 dwords to fragment shader. 24/24 is worse than 24/16, so packing doesn't make sense. This change doesn't affect vk_meshlet_cadscene in default configuration, but it speeds it up by up to 25% with "-extraattributes N", where N is some small value divisible by 2 (by default N == 1) and we are bound by URB size. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20407>	2023-07-24 07:55:29 +00:00
Marcin Ślusarz	a252123363	intel/compiler/mesh: compactify MUE layout Instead of using 4 dwords for each output slot, use only the amount of memory actually needed by each variable. There are some complications from this "obvious" idea: - flat and non-flat variables can't be merged into the same vec4 slot, because flat inputs mask has vec4 stride - multi-slot variables can have different layout: float[N] requires N 1-dword slots, but i64vec3 requires 1 fully occupied 4-dword slot followed by 2-dword slot - some output variables occur both in single-channel/component split and combined variants - crossing vec4 boundary requires generating more writes, so avoiding them if possible is beneficial This patch fixes some issues with arrays in per-vertex and per-primitive data (func.mesh.ext.outputs.*.indirect_array.q0 in crucible) and by reduction in single MUE size it allows spawning more threads at the same time. Note: this patch doesn't improve vk_meshlet_cadscene performance because default layout is already optimal enough. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20407>	2023-07-24 07:55:29 +00:00
Samuel Pitoiset	fb765a65c8	radv: add radv_compile_cs() to compile a compute shader This doesn't rely on the pipeline. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24280>	2023-07-24 07:04:44 +00:00
Samuel Pitoiset	8ccabbfc50	radv: stop using an array of binaries when compiling a compute shader Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24280>	2023-07-24 07:04:44 +00:00
Zhang Ning	06db9bd3f6	Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR" This reverts commit `da4b5b4a47`. Signed-off-by: Zhang Ning <zhangn1985@outlook.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23815>	2023-07-24 03:02:14 +00:00
Timothy Arceri	2cf8c8cba4	nir/opt_copy_prop_vars: drop reuse of dynamic arrays After the previous commit there are so few to reuse that this is no longer worth doing and actually causes compilation to slow down. The Blender shader compile time in issue #9326 improves as follows: 21.11 seconds -> 9.90 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 0.92 seconds -> 0.68 seconds Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9326 Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	d56e739417	nir/opt_copy_prop_vars: skip cloning of copies arrays until needed Most of the variables in the hash table will never actually be looked up for any given block so cloning every possible value just creates a bunch of unrequired memcpy calls. Here we change the code to only clone the copies array once it is actually looked up for the first time. The Blender shader compile time in issue #9326 improves as follows: 151.09 seconds -> 21.11 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 1.67 seconds -> 0.92 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	869b5a562e	nir/opt_copy_prop_vars: remove var hash entry on kill alias If kill alias results in the hash table entry holding an empty copies array then remove the hash entry and return the dynamic array to the unused pool. This helps avoid hash table size getting out of control in very large shaders. 151.09 seconds -> 118.60 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	9b4c7cc611	nir/opt_copy_prop_vars: speedup cloning of copy tables Here we change things to simply clone the entire hash table. This is much faster than trying to rebuild it and is needed to avoid slow compilation of very large shaders. The Blender shader compile time in issue #9326 improves as follows: 251.29 seconds -> 151.09 seconds The CTS test dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 improves as follows: 2.38 seconds -> 1.67 seconds Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Timothy Arceri	e9804bdc4c	nir/opt_copy_prop_vars: don't clone copies if branch empty There is no point doing an expensive clone of the copies if the if-branch is empty. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24227>	2023-07-24 02:29:54 +00:00
Qiang Yu	527cc3ad29	radeonsi: enable aco compile for mono merged ES/GS Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	b313d950e2	radeonsi: enable aco compile for mono merged LS/HS Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	1b53708a62	radeonsi: calculate lds size for merged shaders Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	339ea9e344	radeonsi: aco compile support merged mono shader Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	21ae5909a4	radeonsi: refine si_llvm_es_build_end 1. merge si_set_es_return_value_for_gs into si_llvm_es_build_end 2. stop return value when mono mode in which case GS use ES input as input instead of ES output Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	401a40a5f4	radeonsi: refine si_llvm_ls_build_end 1. merge si_set_ls_return_value_for_tcs into si_llvm_ls_build_end because they do the same job to return value 2. stop return value when mono mode with different thread count, in which case TCS use LS input as its input instead of LS output 3. use si_insert_input_ret_float Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	07fcb4aa19	radeonsi: remove param type check in wrapper function Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	7ebf667360	radeonsi: move vertex shader vb desc input sgpr args to last ACO use same args for merged shader stages, but vb desc input sgpr args is not present when second stage of merged shader. In order to share same shaders args, move it to last so other args have same index. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	a4b4f9a62a	radeonsi: simplify si_build_wrapper_function We only need it to merge LS/HS or ES/GS now, prolog and epilog have been lowered in nir already. So we just need to handle two parts and they are sure to be first and second stage of a merged shader. This also remove the needs SGPRs must be before VGPRs, which is required by following commits to move some SGPRs after VGPRs. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	d9f7902afb	radeonsi: init aco shader info for merged LS/HS Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	7daa0857c0	radeonsi: extract si_get_prev_stage_nir_shader to be shared with aco Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	ec17cc345f	radeonsi: aco does not pass LS outputs to HS by arg aco has global input/output variables for this. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:21 +00:00
Qiang Yu	599b50b448	aco,radv: replace tess_input_vertices shader info param Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24204>	2023-07-24 01:49:20 +00:00
David Heidelberg	826c570ab3	ci/freedreno: cover all texture gather flakes Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24300>	2023-07-24 01:37:01 +02:00
Konstantin Seurer	01266f8119	llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE Fixes: `36eb75d` ("llvmpipe: move to common sampler/image binding code") Closes: #9359 Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24243>	2023-07-23 19:11:40 +00:00
Bas Nieuwenhuizen	c2e3986326	nir: Fix 16-component nir_replicate. Fixes: `f534c2c539` ("nir/builder: Add nir_replicate helper") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24286>	2023-07-22 22:11:15 +00:00
Bas Nieuwenhuizen	e536d31a46	aco: Fix some constant patterns in 16-bit vec4 construction with s_pack. Fixes: `04e3d7ad93` ("aco: improve nir_op_vec with constant operands") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24286>	2023-07-22 22:11:15 +00:00
Bas Nieuwenhuizen	2fcf7c7014	aco: fix nir_op_vec8/16 with 16-bit elements. Fixes: `5718347c2b` ("aco: implement vec2/3/4 with subdword operands") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24286>	2023-07-22 22:11:15 +00:00
Alyssa Rosenzweig	e890bb0e75	asahi: Don't depend on glibc to decode fopencookie is a glibc feature, so we can't use it on macOS (and probably other libc's?). It's only used for the hypervisor interface, though, so we can just make the hypervisor piece glibc-only while otherwise fixing the wrap.dylib build. Fixes: `ee83453f69` ("asahi: Add a shared library interface for decode") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24293>	2023-07-22 12:42:58 -04:00
Eric Engestrom	f997d32f9f	asahi: drop unused include paths Signed-off-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24287>	2023-07-22 10:10:03 +00:00
Christian Gmeiner	2572a96162	ci/etnaviv: update ci expectations Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24291>	2023-07-22 04:16:32 +00:00
Chia-I Wu	5cca1124d1	amd/ci: update radv-stoney-aco-fails.txt for depth/stencil resolve image_2d_16_64_6 ones have been fixed by the previous commit. The others are outdated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23959>	2023-07-22 02:32:31 +00:00
Chia-I Wu	e7c4ebc0cd	radv: disable tc-compat htile for layered images on gfx8 sliceInterleaved may be true for layered images on gfx8. Such a htile cannot be cleared with radv_clear_htile. Fixes 24 failures in dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_16_64_6.* on GFX8. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23959>	2023-07-22 02:32:31 +00:00
Thomas H.P. Andersen	d84d5ff0ce	tgsi: drop two unused functions Removes: * tgsi_util_get_src_from_ind * tgsi_full_src_register_from_dst The last usage of these got removed in !24175 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24283>	2023-07-22 02:04:57 +00:00
Yiwei Zhang	2ed4f04869	venus: use in_render_pass to skip present_src counting It's an early return also benefiting dynamic rendering. We then no longer need to track the legacy pass from inheritance info. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
Yiwei Zhang	e47da97be6	venus: refactor more cmd states into cmd builder This change: - adds helpers for cmd begin/end rendering - simplifies cmd reset - updates ordering to align with cmd builder Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
Yiwei Zhang	10c791619c	venus: avoid redundant tracking of render pass Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
Yiwei Zhang	540242f9ff	venus: add helpers to track subpass view mask Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
Yiwei Zhang	311a0eeb21	venus: cleanup vn_cmd_begin_render_pass usage For secondary command buffers, vn_cmd_begin_render_pass was only used to track inherited render pass previously. So we clean it up. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
Yiwei Zhang	81b69f8e8b	venus: use tracked queue_family_index from the cmd pool Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
Yiwei Zhang	72728f83ed	venus: remove redundant fb tracking from cmd builder Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
Yiwei Zhang	f0b5a6335d	venus: move transient storage from cmd to pool The storage is for command scope usage, so it fits better for the pool. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
Yiwei Zhang	566df7821b	venus: log and doc the broken query feedback in suspended render pass Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24103>	2023-07-22 01:49:43 +00:00
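
The packing arithmetic in the two examples of commit c1685f08dd can be reproduced with a short sketch. Only the ALIGN formulas and the 8-dword Primitive Header with 4 free dwords come from the commit message. The function names, the handling of fewer than 4 user dwords, and the decision rule in `packing_helps` are hypothetical illustrations, not the driver's actual code.

```python
HEADER_DWORDS = 8  # size of the Primitive Header slot, per the commit message
HEADER_FREE = 4    # dwords left unused after the 4 special attributes


def align(value: int, alignment: int) -> int:
    """Round value up to the next multiple of alignment."""
    return (value + alignment - 1) // alignment * alignment


def unpacked_cost(user_dwords: int) -> tuple[int, int]:
    """Return (MUE dwords consumed, dwords passed to the fragment shader)."""
    mue = HEADER_DWORDS + align(user_dwords, 8)
    passed = align(user_dwords, 8)
    return mue, passed


def packed_cost(user_dwords: int) -> tuple[int, int]:
    """Same pair when user attributes first fill the free header dwords."""
    # Assumption: with fewer than HEADER_FREE user dwords, all fit in the header.
    in_header = min(user_dwords, HEADER_FREE)
    rest = user_dwords - in_header
    mue = (HEADER_DWORDS - HEADER_FREE) + HEADER_FREE + align(rest, 8)
    passed = align(in_header, 8) + align(rest, 8)
    return mue, passed


def packing_helps(user_dwords: int) -> bool:
    """Hypothetical rule: pack only if neither number gets worse and one improves."""
    packed = packed_cost(user_dwords)
    unpacked = unpacked_cost(user_dwords)
    no_worse = packed[0] <= unpacked[0] and packed[1] <= unpacked[1]
    return no_worse and packed != unpacked
```

With 12 user dwords the sketch yields 24/16 unpacked against 16/16 packed, and with 16 user dwords it yields 24/16 against 24/24, matching Example 1 and Example 2 of the commit message.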
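
The idea behind commit d56e739417 (clone a copies array only when it is first looked up) can be illustrated with a minimal sketch. The class and attribute names below are hypothetical and the data model is a plain dictionary of lists; the real pass operates on NIR hash tables and dynamic arrays in C.

```python
class LazyCopyTable:
    """Per-block view of a parent table that clones an entry on first lookup."""

    def __init__(self, parent: dict):
        self._parent = parent  # maps variable name -> list of copy entries
        self._own = {}         # entries cloned so far for this block
        self.clones = 0        # counts how many arrays were actually cloned

    def lookup(self, var: str):
        """Return this block's private copies list for var, or None."""
        if var not in self._own:
            if var not in self._parent:
                return None
            # The clone happens here, not when the table is created.
            self._own[var] = list(self._parent[var])
            self.clones += 1
        return self._own[var]
```

A block that looks up one variable out of many pays for one clone; the parent's entries stay untouched when the block mutates its private list.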
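
Commit 21f0aca948 removes control-flow nodes that follow a jump before other dead-CF handling runs. A simplified sketch of that step, assuming a control-flow list modelled as a flat list of strings (a hypothetical stand-in for NIR's CF nodes and instructions):

```python
# Assumed set of jump kinds for this sketch.
JUMPS = {"break", "continue", "return", "halt"}


def remove_nodes_after_jump(nodes: list) -> list:
    """Return nodes truncated just after the first jump, if there is one."""
    for index, node in enumerate(nodes):
        if node in JUMPS:
            # Everything after an unconditional jump is unreachable.
            return nodes[: index + 1]
    return list(nodes)
```

For the shape described in the commit message (a `halt` followed by an `if` and a trailing block), the sketch keeps everything up to and including the `halt` and drops the rest.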


174663 commits