fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 13:38:19 +02:00

Author	SHA1	Message	Date
Ian Romanick	a2292f53b5	nir: Optimize uniform vote_all and vote_any No shader-db changes on any Intel platform. fossil-db: All Ice Lake and newer platforms had similar results. (Ice Lake) Totals: Instrs: 165513303 -> 165511820 (-0.00%) Cycles: 15125314947 -> 15125211500 (-0.00%); split: -0.00%, +0.00% Totals from 82 (0.01% of 656120) affected shaders: Instrs: 544627 -> 543144 (-0.27%) Cycles: 22616493 -> 22513046 (-0.46%); split: -0.46%, +0.00% No fossil-db changes on Gfx9. Suggested-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:32 -08:00
Ian Romanick	535caaf3e0	nir: Optimize uniform iadd, fadd, and ixor reduction operations This adds optimizations for iadd, fadd, and ixor with reduce, inclusive scan, and exclusive scan. NOTE: The fadd and ixor optimizations had no shader-db or fossil-db changes on any Intel platform. NOTE 2: This change "fixes" arb_compute_variable_group_size-local-size and base-local-size.shader_test on DG2 and MTL. This is just changing the code path taken to not use whatever path was not working properly before. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. The min, max, iand, and ior exclusive_scan optimizations are not implemented. Broadwell on shader-db is not happy. I have not investigated. v2: Silence some warnings about discarding const. v3: Rename mbcnt to count_active_invocations. Add a big comment explaining the differences between the two paths. Suggested by Rhys. shader-db: All Gfx9 and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20300384 -> 20299545 (<.01%) instructions in affected programs: 19167 -> 18328 (-4.38%) helped: 35 / HURT: 0 total cycles in shared programs: 842809750 -> 842766381 (<.01%) cycles in affected programs: 2160249 -> 2116880 (-2.01%) helped: 33 / HURT: 2 total spills in shared programs: 4632 -> 4626 (-0.13%) spills in affected programs: 206 -> 200 (-2.91%) helped: 3 / HURT: 0 total fills in shared programs: 5594 -> 5581 (-0.23%) fills in affected programs: 664 -> 651 (-1.96%) helped: 3 / HURT: 1 fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Totals: Instrs: 165551893 -> 165513303 (-0.02%) Cycles: 15132539132 -> 15125314947 (-0.05%); split: -0.05%, +0.00% Spill count: 45258 -> 45204 (-0.12%) Fill count: 74286 -> 74157 (-0.17%) Scratch Memory Size: 2467840 -> 2451456 (-0.66%) Totals from 712 (0.11% of 656120) affected shaders: Instrs: 598931 -> 560341 (-6.44%) Cycles: 184650167 -> 177425982 (-3.91%); split: -3.95%, +0.04% Spill count: 983 -> 929 (-5.49%) Fill count: 2274 -> 2145 (-5.67%) Scratch Memory Size: 52224 -> 35840 (-31.37%) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:11 -08:00
Ian Romanick	f10d1ef372	nir: Initial framework for optimizing uniform subgroup operations The first commit just optimizes operation where the result of the subgroup operation is the same as each of the individual channel results. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:38:31 -08:00
Ian Romanick	75de4458a1	nir: Mark nir_intrinsic_load_global_block_intel as divergent This is divergent because it specifically loads sequential values into successive SIMD lanes. No shader-db or fossil-db changes on any Intel platform. Fixes: `9f44a26462` ("nir/divergence: handle load_global_block_intel") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:42 -08:00
Ian Romanick	5da5106727	nir: Add documentation for subgroup_.._mask v2: Fix reference to GL_ARB_shader_ballot. Noticed by Lionel. Suggested-by: Lionel Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:09 -08:00
Sagar Ghuge	30ead72e80	nir: Allow nir_texop_tg4 in implicit derivative This allow us to invoke the quad helper. v2: (Georg) - Add check for is_gather_implicit_lod Fixes: `48158636bf` ("nir: add is_gather_implicit_lod") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27447>	2024-02-27 00:22:46 +00:00
Alyssa Rosenzweig	6825902bb6	treewide: use ralloc_memdup @@ expression memctx, dst, src, size; @@ -dst = ralloc_size(memctx, size); -memcpy(dst, src, size); +dst = ralloc_memdup(memctx, src, size); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27762>	2024-02-26 15:37:58 +00:00
Timur Kristóf	cc1501628f	nir: Clean up divergence analysis for TES patch input loads. Just make the code a little bit easier to follow. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27680>	2024-02-26 14:53:23 +00:00
Timur Kristóf	870a2e4197	nir: Cleanup divergence analysis for mesh shaders. 1. Mesh shaders don't have inputs (only task payload), so remove them from handling load_input. 2. Clarify in comments that loading any mesh shader output is an NV_mesh_shader only feature. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27680>	2024-02-26 14:53:23 +00:00
Timur Kristóf	9553d67373	nir: Fix divergence analysis of load_patch_vertices_in. load_patch_vertices_in can only occur in tessellation shaders, and contains the number of vertices in an input patch. * TCS: patch_vertices_in is equal to the input patch size * TES: patch_vertices_in is equal to the TCS output patch size The patch sizes may be set by a pipeline or dynamic states, however in both cases it is definitely uniform within a subgroup. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27680>	2024-02-26 14:53:23 +00:00
Timur Kristóf	537c0029dd	nir: Fix divergence of reductions. By accident, the function would return without setting the divergence information. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27680>	2024-02-26 14:53:23 +00:00
Christian Gmeiner	028080c716	isaspec: encode.py: Include util/log.h Generated encode functions are making use of mesa_loge(..). Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27714>	2024-02-23 20:29:57 +00:00
Timothy Arceri	0f0fa64eed	glsl: move some lowering to the compiler Rather than doing this lowering potentially multiple times when a shader is relinked we can instead do it once in the compiler. This change also gets us closer to converting to NIR at compile time. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27690>	2024-02-22 05:26:16 +00:00
Timothy Arceri	82d617e8b1	glsl: fix potential crash in expression flattening The base_ir variable used by this pass is set via visit_list_elements() however this pass was skipping visit_list_elements() for the initial list of instructions i.e. it was skipping it for globals so if we ended up trying to flatten an expression on a global we would segfault. To quote the code comment on the base_ir variable: "This is implemented by visit_list_elements -- if the visitor is not called by it, nothing good will happen" Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27743>	2024-02-22 04:44:44 +00:00
Karol Herbst	815a6647eb	meson: do not pull in clc for clover Fixes: `01d0d94319` ("meson: Simplify clc expression") Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27663>	2024-02-21 20:53:36 +00:00
Karol Herbst	6474f8c2ce	clc: include opencl-c.h for extensions needing it This also allows tools build on clc to drop their workaround to include it themselves. Rusticl might need it once it supports extensions which need this file pulled in. Later if the need to include it changes based on llvm version, we can easily handle this in clc. The main reason to include it only conditionally is the massively reduction in compilation time. It also removes the mental burden from users of clc to deal with any of this themselves. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10633 Fixes: `37a1346347` ("meson: remove opencl-external-clang-headers option and rely on shared-llvm") Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27663>	2024-02-21 20:53:36 +00:00
Samuel Pitoiset	78ea304a06	spirv: only consider IO variables when adjusting patch locations for TES With TES, the primitive ID is an input variable but it's considered a sysval by SPIRV->NIR. Though, its value is greater than VARYING_SLOT_VAR0 which means its location was adjusted by mistake. This fixes compiling a tessellation evaluation shader in debug build with Enshrouded. Fixes: `dfbc03fa88` ("spirv: Fix locations for per-patch varyings") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27413>	2024-02-21 10:36:07 +00:00
Timothy Arceri	74534397ac	glsl: split var copies before lowering named interfaces Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10593 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27669>	2024-02-20 23:29:17 +00:00
Timothy Arceri	4c11119825	glsl: support array wildcards in lower named interface blocks Will be required with the change in the following patch. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27669>	2024-02-20 23:29:17 +00:00
Timothy Arceri	ec240e2cd8	nir: allow gather info to handle nir_deref_type_array_wildcard Needed for some changes to the glsl nir linker in the following patches. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27669>	2024-02-20 23:29:17 +00:00
Bas Nieuwenhuizen	c7b2ac3377	radv: Remove ray_launch_size_addr_amd system value. Not used anymore, so clean it up. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27664>	2024-02-17 11:08:16 +00:00
Caio Oliveira	a88084f8be	intel/compiler: Rename brw_image_param to isl_image_param And move them to ISL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27475>	2024-02-14 22:31:23 -08:00
Timothy Arceri	219be55807	glsl: add missing error check for half float varying We should never get here currently as the parser should not even process float16_t without half float enabled. However it seems like a good idea to add this for completeness. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27585>	2024-02-14 23:50:21 +00:00
Alyssa Rosenzweig	cb0b027c59	asahi: make clip_halfz dynamic we could move this to the linker but meh, this is good enough for now Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:32 +00:00
Alyssa Rosenzweig	6673924b7e	asahi: make gs topology dynamic even with shobjs, we know the class of topology statically, so we just need to select between the (up to) 3 compatible topologies, and luckily there are common subexpressions we can factor out when calculating all 3 at once. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:32 +00:00
Alyssa Rosenzweig	17896f1699	nir: rm load_vert_id_in_prim_agx now unused since we separate vs/gs Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:31 +00:00
Alyssa Rosenzweig	c6c8262ce1	asahi: implement pipeline stats as a checkbox real impl is blocked on uapi to plumb thru hw perf counters. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:30 +00:00
Asahi Lina	b89da92a5e	agx: compiler: Add fence_helper_exit_agx barrier This is used by the helper program on exit. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:29 +00:00
Asahi Lina	b07dbf7b0f	nir: Add AGX-specific helper opcodes These opcodes are used by the helper program to fetch the current operation info and core ID. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:29 +00:00
Alyssa Rosenzweig	311070f7af	nir: add active_subgroup_invocation_agx sysval Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:29 +00:00
Alyssa Rosenzweig	5dc0f5ccba	asahi: implement VBO robustness GL semantics. GLES (weaker) and VK (stronger) semantics are left as a todo, with explanations given. Enabled always to deal with null VBOs, this should be optimized once we have soft fault. This necessitates a rework of VBO keys, but hopefully for the best. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:29 +00:00
Alyssa Rosenzweig	9753cd44f7	asahi: Implement skeleton for tessellation This implements a rough skeleton of what's needed for tessellation. It contains the relevant lowerings to merge the VS and TCS, running them as a compute kernel, and to lower the TES to a new VS (possibly merged in with a subsequent GS). This is sufficient for both standalone tessellation and tess + geom/xfb together. It does not yet contain a GPU accellerated tessellator, simply falling back to the CPU for that for now. Nevertheless the data structures are engineered with that end goal in mind, in particular to be able to tessellate all patches in parallel without needing any prefix sums etc (using simple watermark allocation for the heap). Work on fleshing out the skeleton continues in parallel. For now, this does pass the tests and lets the harder stuff get regression tested more easily. And merging early will ease rebase. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:28 +00:00
Alyssa Rosenzweig	2d37d1b704	asahi: lower poly stipple Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:28 +00:00
Alyssa Rosenzweig	db144685a9	compiler: add a vs.tes_agx bit So we can distinguish lowered tess eval shaders masquerading as hardware vertex shaders from actual software vertex shaders, for determining what stage to pull descriptors. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:28 +00:00
Mike Blumenkrantz	9e2c7314f2	nir/lower_io: fix handling for compact arrays with indirect derefs this logic relies on constant indexing for compact arrays, but this is frequently not the case for compact array builtins (e.g., gl_TessLevelOuter). the usual strategy of lowering to temps isn't viable in TCS, which means io lowering has to be able to handle indirect access to these builtins without crashing cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27534>	2024-02-13 16:13:13 +00:00
Karol Herbst	727cddd338	nir/lower_cl_images: record image_buffers and msaa_images Cc: mesa-stable Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27385>	2024-02-13 10:12:13 +00:00
Connor Abbott	6a744ddebc	ir3: Initial support for pushing globals with ldg.k Add a separate pass which uses the analyze_ubo_ranges machinery to construct ranges of readonly globals accessed in the shader and push them to constants in the preamble, using ldg.k if possible. This is enough to handle inline uniforms in turnip but also provides a base for OpenCL, although the pass would need further work for that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26934>	2024-02-12 22:05:13 +00:00
Connor Abbott	45c71803f9	tu: Add more info to ldg inline uniform path This will let us push the ldg into the preamble. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26934>	2024-02-12 22:05:13 +00:00
Sagar Ghuge	c984d6e2fc	nir: Drop intel specific lowering code In previous patches, we have moved the Intel specific lowering code in brw_nir_lower_texture file. We can go ahead and drop the Intel specific texture source too. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27458>	2024-02-12 21:25:48 +00:00
Timothy Arceri	6fbf336788	compiler/types: Add a contains_32bit helper Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	5f1f6d7496	glsl: add half float AMD_shader_trinary_minmax functions Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	d619c16c3f	glsl: add half float derivative functions Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	14de2eff89	glsl: add half float interpolation functions Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	9dc5eec02c	glsl: allow half float varyings Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	3dc67c2c7e	glsl: add half float vector relational functions Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	e7f1be1ceb	glsl: add half float matrix functions Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	99a80ac930	glsl: add half float geometric functions Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	6a170051a9	glsl: add support for half float packing functions Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	c386d56915	glsl: add half float support for common functions Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00
Timothy Arceri	eea1c1fa7b	glsl: add f2f16() helper to ir_builder Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18540>	2024-02-12 13:23:14 +00:00

1 2 3 4 5 ...

9075 commits