fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 13:48:06 +02:00

Author	SHA1	Message	Date
Daniel Schürmann	9808ef0349	nir/opt_loop: move loop control-flow optimizations into separate pass This new pass aims to simplify loop control-flow by reducing the number of break and continue statements. It also supersedes nir_opt_trivial_continues(). For this purpose, it implements 3 optimizations: - opt_loop_terminator(), as previously - opt_loop_merge_break_continue(), similar to opt_merge_breaks() incl. continues - opt_loop_last_block(), a generalization of opt_if_loop_last_continue() Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24940>	2024-01-03 20:48:04 +00:00
Christian Gmeiner	0158075b22	nir/opt_peephole_select: handle speculative ubo loads Some platforms may be able to speculate ubo loads safely. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8299>	2024-01-03 20:02:25 +00:00
Karol Herbst	3ee6339206	clc: remove code supporting pre llvm-10 we require llvm-10+ already anyway, see meson.build:1726 Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26871>	2024-01-03 18:30:32 +00:00
Yonggang Luo	18abdb8596	compiler/glsl: Move glsl specific _mesa_glsl_initialize_types out and glsl_symbol_table of glsl_types.h To make sure C-ABI compat, struct _mesa_glsl_parse_state; struct gl_shader_program; struct gl_builtin_uniform_desc; are wrapped with extern "C" And getting _mesa_glsl_initialize_variables c-compat for consistence Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26804>	2024-01-03 06:38:19 +00:00
Caio Oliveira	0b5abf2512	spirv: Use value_id_bound to set initial memory allocated Don't rely on the current default (which is 2048 bytes) buffer size for blocks -- which ends up being too small for most shaders. Since we already rely on value_id_bound to allocate an array of vtn_value, use that to estimate a better value. In addition to space for the array, we approximate the extra size of extra data structures with the size of vtn_ssa_value, and skip it to the next size (double it) to cover the CFG related allocations. This results in only single system allocation necessary to back the temporary data for the majority of the shaders. Parsing code was slightly reordered so we can validate and read the value_id_bound before the temporary allocator is created. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25279>	2024-01-02 16:07:06 +00:00
Caio Oliveira	d5b4b7356e	spirv: Use linear_alloc for parsing-only data All the vtn_* structures and arrays are used only during the lifetime of spirv_to_nir(); we don't need to free them individually nor steal them out; and some of them are smaller than the 5-pointer header required for ralloc allocations. These properties make them a good candidate for using an arena-style allocation. Change the code to create a linear_parent and use that for all the vtn_* allocation. Note that NIR data structures still go through ralloc, since we steal them (through the nir_shader) at the end, i.e. they outlive the parsing. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25279>	2024-01-02 16:07:06 +00:00
Konstantin Seurer	b88ac6b381	nir: Optimize fpow with small constant exponents They would be turned into exp(log(a)*b) instead, which is slow. Totals from 2146 (2.52% of 85071) affected shaders: MaxWaves: 35769 -> 35779 (+0.03%); split: +0.03%, -0.01% Instrs: 6476835 -> 6465494 (-0.18%); split: -0.18%, +0.00% CodeSize: 35382288 -> 35347092 (-0.10%); split: -0.10%, +0.00% SpillSGPRs: 1055 -> 1017 (-3.60%) Latency: 75211743 -> 75063623 (-0.20%); split: -0.20%, +0.00% InvThroughput: 17525115 -> 17501745 (-0.13%); split: -0.14%, +0.00% VClause: 200089 -> 200077 (-0.01%); split: -0.01%, +0.01% SClause: 293566 -> 293480 (-0.03%); split: -0.03%, +0.00% Copies: 649631 -> 640516 (-1.40%); split: -1.44%, +0.03% Branches: 268441 -> 268325 (-0.04%) PreSGPRs: 146868 -> 146045 (-0.56%) PreVGPRs: 134125 -> 134128 (+0.00%); split: -0.00%, +0.01% Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26727>	2024-01-02 11:16:14 +01:00
Rhys Perry	10e0518a85	nir/loop_analyze: remove invariance analysis compute_invariance_information() wasn't doing anything. The only variables not skipped in the list are phis (which are never considered invariant) and ALU instructions which use the phi as one of it's sources. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23726>	2024-01-01 14:15:39 +00:00
Yonggang Luo	0210b554d6	treewide: Replace the include of nir_types.h with glsl_types.h Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26753>	2023-12-30 15:08:11 +00:00
Ian Romanick	6b14da33ad	intel/fs: nir: Add nir_intrinsic_dpas_intel v2: Fix parameter order in nir_intrinsic_dpas_intel to DPAS conversion. v3: Fix float16 destination DPAS on DG2. v4: Use nir_component_mask(...) instead of 0xffff. Suggested by Caio. v5: Rebase on !26323. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25994>	2023-12-29 20:28:43 -08:00
Caio Oliveira	6fccacda1e	compiler/types: Use a typedef for glsl_type Most of the code now will see `const glsl_type ` instead of `const struct glsl_type `. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26708>	2023-12-22 07:53:25 -08:00
Caio Oliveira	550fdc2026	compiler/types: Remove glsl_type C++ helpers All code now use the C functions. Remove glsl_type_impl.h that contained the inline C++ wrappers around those. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:51:01 -08:00
Caio Oliveira	d06f0305f6	glsl: Use glsl_type C helpers Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:51:01 -08:00
Caio Oliveira	db5f73dc9f	compiler/types: Add a few more glsl_type C helpers These will be used once the C++ ones are removed. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Caio Oliveira	582c20c431	nir: Use glsl_type C helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Karol Herbst	f8afd41667	clc: add workaround for clang always defining __IMAGE_SUPPORT_ and __opencl_c_int64 Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26764>	2023-12-20 11:31:30 +00:00
Bas Nieuwenhuizen	da6a5e1f63	nir: Add pass for clearing memory at the end of a shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26679>	2023-12-20 09:15:45 +00:00
Bas Nieuwenhuizen	bc99b73d70	nir: Add nir_static_workgroup_size helper. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26679>	2023-12-20 09:15:45 +00:00
Faith Ekstrand	3e042173e4	nir/lower_doubles: Add lowering for fmin/fmax/fsat Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26587>	2023-12-20 02:40:25 +00:00
Timothy Arceri	52dbf44d2e	glsl: add support for inout params to glsl_to_nir() Supporting these means we don't have to depend on calling the GLSL IR optimisation loop for shaders that contain these parameter types. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26755>	2023-12-20 01:47:27 +00:00
Timothy Arceri	3d3ba9f428	glsl: move glsl ir lowering out of glsl_to_nir() The main motivation for doing this is that some tests and even the st tracker linking code dump out the GLSL IR for debugging before glsl_to_nir() is called expecting it to already be in its final form. Moving these to the linker makes those assumptions true. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26755>	2023-12-20 01:47:27 +00:00
Timothy Arceri	bb1873faad	glsl: add additional lower mediump test There were tests for inputs and inout, but no test for out which turned out to not be behaving correctly. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26755>	2023-12-20 01:47:27 +00:00
Timothy Arceri	d42f9d94af	glsl: copy precision val of function output params We need to copy the precision to our temp values when converting to nir or this information will be lost. This change fixes the new test introduced in the following patch. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26755>	2023-12-20 01:47:27 +00:00
Timothy Arceri	37e83a93d7	glsl: remove some unused linker code These were missed when removing code in `72ad0db505`. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26747>	2023-12-19 23:45:30 +00:00
Timothy Arceri	4584acca6b	glsl: tidy up validation loop in linker There is no need to have a separate loop to determine the first stage in the shader program. Previously there were other users of this but since this is the last remain user this patch changes the code to simply detect the first stage directly. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26747>	2023-12-19 23:45:30 +00:00
Sviatoslav Peleshko	a6459e0f7b	nir/loop_analyze: Don't test non-positive iterations count Testing negative iterations count makes no sense, and can cause issues when the unsigned type is used. Testing 0 iterations is already covered with will_break_on_first_iteration, so it can be skipped too. Fixes: `6772a17a` ("nir: Add a loop analysis pass") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9913 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26173>	2023-12-19 12:53:52 +00:00
Job Noorman	6cad2fc230	nir: add helper to create cursor after all @decl_regs @decl_reg intrinsics must be in the first block so it's convenient to be able to create an insertion point after all @decl_regs when the first block needs to be split. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26737>	2023-12-18 14:52:02 +00:00
Christian Gmeiner	a8a33ac5ae	isaspec: Add bool_inv type to print inverted bools Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20144>	2023-12-16 14:34:18 +00:00
Job Noorman	6e7a61df4c	nir: add _safe variants of nir_foreach_reg_load/store Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26175>	2023-12-15 17:19:28 +00:00
Faith Ekstrand	1cf1b9d741	nir: Scalarize bounds checked loads and stores Fixes: `39da1deb49` ("nir/lower_io: Add a bounds-checked 64-bit global address format") Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26526>	2023-12-15 03:53:54 +00:00
Caio Oliveira	81e3b28f78	compiler: Remove C++ static member pointers to builtin types When we moved the bulk of glsl_type to C, these globals were kept to avoid changes to compiler/glsl code in the MR. Now that landed, change the code to use the actual bultins directly. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26658>	2023-12-15 03:09:19 +00:00
Caio Oliveira	90e364edb0	compiler/types: Add a few more helpers to get builtin types Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26658>	2023-12-15 03:09:19 +00:00
Caio Oliveira	f17e23e116	compiler/glsl: Reduce scope of is_anonymous This a GLSL parser specific detail, so move it there. Also add a comment pointing to where #anon prefix is used. Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26656>	2023-12-13 15:44:40 +00:00
Friedrich Vock	f1817ab7e0	radv,vtn,driconf: Add and use radv_rt_ssbo_non_uniform workaround for Crysis 2/3 Remastered Crysis 2 and 3 Remastered's RT shaders non-uniformly index into SSBO descriptor arrays without specifying the NonUniformEXT qualifier on the relevant access chains/load ops. This leads to artifacts around objects. To add insult to injury, the game fails to provide a meaningful applicationName/engineName in the Vulkan part of the DX11-Vulkan interop solution used for RT. Both of these fields are set to "nvpro-sample" (perhaps the code has been copied from NVIDIA's sample applications). Therefore, fall back to executable name matching. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9883 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26391>	2023-12-12 21:16:39 +00:00
Karol Herbst	8c73b1eb90	nir/algebraic: add support for custom arguments Those are passed as an optional argument and are declared as a list of (type, name) tuples. At the moment this can only be used for conditions. Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26214>	2023-12-12 18:48:11 +00:00
Karol Herbst	c674db05e8	clc: use addMacroDef/Undef instead of -D/-U flags It always felt weird having the extension management in two different places. Later once we require LLVM-14 we might even be able to clean it up a little more. Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26641>	2023-12-12 14:24:48 +00:00
Lionel Landwerlin	f53748c481	nir: fixup nir_printf intrinsic description Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26505>	2023-12-12 11:11:10 +00:00
Lionel Landwerlin	dc3e69af1a	nir/serialize: untangle printf serialization from a particular stage This allows any stage to carry printf instructions. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26505>	2023-12-12 11:11:10 +00:00
Lionel Landwerlin	4e4a3820ab	nir/divergence: handle printf intrinsic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26505>	2023-12-12 11:11:10 +00:00
Lionel Landwerlin	f7ae92b868	nir: include printfs from linked shaders Once lowered low enough, it's not always possible to tell what strings are used. So include them all when linking another shader. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26505>	2023-12-12 11:11:10 +00:00
Lionel Landwerlin	81b3dea993	nir/clone: fix missing printf_info clone Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26505>	2023-12-12 11:11:10 +00:00
Lionel Landwerlin	603f039708	nir: make printf_info (de)serializer available Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26505>	2023-12-12 11:11:10 +00:00
Timothy Arceri	5147e9a26e	glsl: combine shader stage loops in linker The gs validation that was run between these loops can be run after merging them without any issue. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26628>	2023-12-12 02:28:33 +00:00
Timothy Arceri	fe44414662	glsl/st: move remaining glsl ir lowering to linker This is a tidy up but also allows us to drop an additional validate_ir_tree() call. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26628>	2023-12-12 02:28:33 +00:00
Karol Herbst	7e78802028	clc: add support for cl_khr_subgroup_shuffle and shuffle_relative Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26504>	2023-12-11 23:08:51 +00:00
Eric Engestrom	c51e40dd8b	spirv: add missing build dependency Fixes: `59a72570b6` ("compiler: Move spirv into a module of its own") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10277 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26624>	2023-12-11 21:47:37 +00:00
Ian Romanick	7fce0a5598	nir: Handle divergence for decl_reg Once decl_reg is handled, src[0].ssa->divergent will be properly set, so load_reg and load_reg_indirect do not need special treatment. shader-db can run to completion on HSW, IVB, and SNB now. No other testing was done. v2: Refactor nir_intrinsic_load_reg and nir_intrinsic_load_reg_indirect handling. Suggested by Daniel Schürmann. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `4fd257d20f` ("nir: Properly handle divergence for load_reg") Fixes: `6dbb5f1e07` ("intel/fs: rerun divergence analysis prior to convert_from_ssa") Closes: #10233 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26436>	2023-12-11 17:10:51 +00:00
Jesse Natalie	37c0e8beda	compiler/clc: Don't fail to parse SPIR-V if there's no kernels It's valid to have library SPIR-V being parsed that has no entrypoints. We still want to get spec constant info for them. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26582>	2023-12-11 16:28:28 +00:00
Faith Ekstrand	aac1e3f595	nir: Add a new has_fmulz_no_denorms flag Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26569>	2023-12-11 15:29:17 +00:00
Alyssa Rosenzweig	c43c90a5fa	asahi: rewrite pointsize handling In the wise words of Mike Blumenkrantz, "I hate gl_PointSize and so can you". The mesa/st lowering won't mesh well with vertex shader epilogues, and it falls over in various circumstances. I am too tired to go against the grain, so let's just pretend to be a normal gallium driver and trust in the rasterizer CSO, lowering point size internally. This properly handles transform feedback without any hacks, both GL and GLES behaviours, etc. Fixes: KHR-GL31.transform_feedback.capture_vertex_separate_test gl-2.0-large-point-fs Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26614>	2023-12-09 12:08:39 -04:00

1 2 3 4 5 ...

8923 commits