fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 07:10:09 +01:00

Author	SHA1	Message	Date
Ian Romanick	535caaf3e0	nir: Optimize uniform iadd, fadd, and ixor reduction operations This adds optimizations for iadd, fadd, and ixor with reduce, inclusive scan, and exclusive scan. NOTE: The fadd and ixor optimizations had no shader-db or fossil-db changes on any Intel platform. NOTE 2: This change "fixes" arb_compute_variable_group_size-local-size and base-local-size.shader_test on DG2 and MTL. This is just changing the code path taken to not use whatever path was not working properly before. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. The min, max, iand, and ior exclusive_scan optimizations are not implemented. Broadwell on shader-db is not happy. I have not investigated. v2: Silence some warnings about discarding const. v3: Rename mbcnt to count_active_invocations. Add a big comment explaining the differences between the two paths. Suggested by Rhys. shader-db: All Gfx9 and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20300384 -> 20299545 (<.01%) instructions in affected programs: 19167 -> 18328 (-4.38%) helped: 35 / HURT: 0 total cycles in shared programs: 842809750 -> 842766381 (<.01%) cycles in affected programs: 2160249 -> 2116880 (-2.01%) helped: 33 / HURT: 2 total spills in shared programs: 4632 -> 4626 (-0.13%) spills in affected programs: 206 -> 200 (-2.91%) helped: 3 / HURT: 0 total fills in shared programs: 5594 -> 5581 (-0.23%) fills in affected programs: 664 -> 651 (-1.96%) helped: 3 / HURT: 1 fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Totals: Instrs: 165551893 -> 165513303 (-0.02%) Cycles: 15132539132 -> 15125314947 (-0.05%); split: -0.05%, +0.00% Spill count: 45258 -> 45204 (-0.12%) Fill count: 74286 -> 74157 (-0.17%) Scratch Memory Size: 2467840 -> 2451456 (-0.66%) Totals from 712 (0.11% of 656120) affected shaders: Instrs: 598931 -> 560341 (-6.44%) Cycles: 184650167 -> 177425982 (-3.91%); split: -3.95%, +0.04% Spill count: 983 -> 929 (-5.49%) Fill count: 2274 -> 2145 (-5.67%) Scratch Memory Size: 52224 -> 35840 (-31.37%) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:11 -08:00
Ian Romanick	f10d1ef372	nir: Initial framework for optimizing uniform subgroup operations The first commit just optimizes operations where the result of the subgroup operation is the same as each of the individual channel results. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:38:31 -08:00
Sagar Ghuge	c984d6e2fc	nir: Drop intel specific lowering code In previous patches, we have moved the Intel specific lowering code in brw_nir_lower_texture file. We can go ahead and drop the Intel specific texture source too. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27458>	2024-02-12 21:25:48 +00:00
Marek Olšák	e98bbcad17	nir: add vertex divergence into nir_divergence_analysis This is a prerequisite for the new nir_opt_varyings pass. It reuses the same divergent field in nir_def and nir_loop. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26918>	2024-02-02 16:45:52 -05:00
Marek Olšák	5ffa4d879c	nir: add a lower_mediump_io callback into options This will be called by the GLSL linker before nir_opt_varyings. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26918>	2024-02-02 16:45:51 -05:00
Marek Olšák	ecf0fe09f0	nir: replace lower_io_variables with a GLSL NIR flag This stops using it in nir_lower_io_passes because all callers call it only when it's true. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26918>	2024-02-02 16:45:49 -05:00
Marek Olšák	c4acab77a8	nir: remove and replace underused option pack_varying_options This will also be used by nir_opt_varyings. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26918>	2024-02-02 16:45:47 -05:00
Ian Romanick	c8ba2bc2f0	nir: Pack texture LOD and array index to a single 32-bit value v2: Fix clamped_ai calculation in nir_lower_tex.c. Add nir_tex_src_combined_lod_and_array_index_intel to print_tex_instr. Suggested by Sagar. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27305>	2024-02-02 02:39:10 +00:00
Friedrich Vock	6c845ed548	nir: Make is_trivial_deref_cast public Cc: mesa-stable Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27197>	2024-01-24 12:39:48 +00:00
Karol Herbst	f2b7c4ce29	nir: rework and fix rotate lowering No driver supports urol/uror on all bit sizes. Intel gen11+ only for 16 and 32 bit, Nvidia GV100+ only for 32 bit. Etnaviv can support it on 8, 16 and 32 bit. Also turn the `lower` into a `has` option as only two drivers actually support `uror` and `urol` at this moment. Fixes crashes with CL integer_rotate on iris and nouveau since we emit urol for `rotate`. v2: always lower 64 bit Fixes: `fe0965afa6` ("spirv: Don't use libclc for rotate") Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by (Intel and nir): Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27090>	2024-01-22 10:27:44 +00:00
Georg Lehmann	d641750573	nir: add lowering for boolean shuffle Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27116>	2024-01-19 20:13:34 +00:00
Karol Herbst	36012af17f	nir/printf: remove treat_doubles_as_floats It is broken and clang uses fp32 for float constants if the fp64 extension isn't enabled anyway. SPIRVs can't use fp64 constants with printf unless they enable the Float64 cap, which also requires cl_khr_fp64 to be supported. So just remove it and rely on clang handling -cl-single-precision-constant correctly, which at the moment doesn't seem to be the case, but we can think about that once we plan to support cl_khr_fp64. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26541>	2024-01-18 13:16:13 +01:00
Alyssa Rosenzweig	fcae4b469f	nir/lower_io_arrays_to_elements: return prog Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26976>	2024-01-12 01:13:02 +00:00
Alyssa Rosenzweig	70fd20d2bc	nir/lower_passthrough_edgeflags: return progress Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26976>	2024-01-12 01:13:02 +00:00
Alyssa Rosenzweig	460d2ca4f3	nir/lower_point_size_mov: return prog Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26976>	2024-01-12 01:13:02 +00:00
Alyssa Rosenzweig	8b7d765e59	nir/lower_alpha_test: rewrite with intrinsics_pass returns progress now Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26976>	2024-01-12 01:13:02 +00:00
Alyssa Rosenzweig	086cbe5da2	nir/lower_bitmap: return prog Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26976>	2024-01-12 01:13:02 +00:00
Alyssa Rosenzweig	4833e42721	nir: return prog from drawpixels Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26976>	2024-01-12 01:13:02 +00:00
Alyssa Rosenzweig	6fa32b5b83	nir/lower_clip_cull_distance_arrays: return prog Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26976>	2024-01-12 01:13:02 +00:00
Alyssa Rosenzweig	a36812d9b4	nir/lower_io_to_temporaries: return prog Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26976>	2024-01-12 01:13:02 +00:00
Rhys Perry	0477421f7d	nir: add msad_4x8 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26907>	2024-01-05 18:55:22 +00:00
Daniel Schürmann	bdbf873b0f	nir: remove redundant passes from nir_opt_if() These are now covered by nir_opt_loop(): - opt_if_loop_last_continue() - opt_merge_breaks() - opt_if_loop_terminator() Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24940>	2024-01-03 20:48:05 +00:00
Daniel Schürmann	5b1b5cd794	nir: remove nir_opt_trivial_continues() This pass is superseded by nir_opt_loop() Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24940>	2024-01-03 20:48:04 +00:00
Daniel Schürmann	9808ef0349	nir/opt_loop: move loop control-flow optimizations into separate pass This new pass aims to simplify loop control-flow by reducing the number of break and continue statements. It also supersedes nir_opt_trivial_continues(). For this purpose, it implements 3 optimizations: - opt_loop_terminator(), as previously - opt_loop_merge_break_continue(), similar to opt_merge_breaks() incl. continues - opt_loop_last_block(), a generalization of opt_if_loop_last_continue() Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24940>	2024-01-03 20:48:04 +00:00
Yonggang Luo	0210b554d6	treewide: Replace the include of nir_types.h with glsl_types.h Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26753>	2023-12-30 15:08:11 +00:00
Bas Nieuwenhuizen	da6a5e1f63	nir: Add pass for clearing memory at the end of a shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26679>	2023-12-20 09:15:45 +00:00
Bas Nieuwenhuizen	bc99b73d70	nir: Add nir_static_workgroup_size helper. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26679>	2023-12-20 09:15:45 +00:00
Faith Ekstrand	3e042173e4	nir/lower_doubles: Add lowering for fmin/fmax/fsat Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26587>	2023-12-20 02:40:25 +00:00
Job Noorman	6cad2fc230	nir: add helper to create cursor after all @decl_regs @decl_reg intrinsics must be in the first block so it's convenient to be able to create an insertion point after all @decl_regs when the first block needs to be split. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26737>	2023-12-18 14:52:02 +00:00
Job Noorman	6e7a61df4c	nir: add _safe variants of nir_foreach_reg_load/store Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26175>	2023-12-15 17:19:28 +00:00
Faith Ekstrand	aac1e3f595	nir: Add a new has_fmulz_no_denorms flag Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26569>	2023-12-11 15:29:17 +00:00
Karol Herbst	6979a1aa07	nir/opt_preamble: make load_workgroup_size handling optional not all drivers support it being in the preamble, e.g. asahi. Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26614>	2023-12-09 10:56:37 -04:00
Timothy Arceri	c468aa6059	nir: add nir_fixup_deref_types() This will be used to fix up types after arrays have been resized. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26534>	2023-12-07 03:05:09 +00:00
Faith Ekstrand	09fc5e1c4d	nir: Split has_[su]dot_4x8 bits into regular and _sat versions Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26533>	2023-12-06 23:15:33 +00:00
Faith Ekstrand	629af540ca	spirv: Plumb variable alignments through to NIR Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26522>	2023-12-06 18:37:36 +00:00
Alyssa Rosenzweig	e14633fa7d	nir/lower_tex: Add 1D lowering From amd/common. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26377>	2023-11-28 20:32:03 +00:00
Marek Olšák	27a9ddad28	nir: return progress from nir_remove_sysval_output Changing IO semantics doesn't affect the SSA structure. Same as nir_remove_varying. Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26274>	2023-11-24 15:37:24 +00:00
Marek Olšák	32ee6376ad	nir: add lowering from FS LAYER input to LAYER_ID sysval Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26274>	2023-11-24 15:37:24 +00:00
Timothy Arceri	110887de2b	glsl/nir: implement a nir based lower distance pass This is based off the original GLSL IR pass but it is much much simpler as it doesn't need to do all of the hackery required in GLSL IR to achieve the lowering. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25860>	2023-11-21 00:45:21 +00:00
Qiang Yu	7e4aac46ad	nir: add force_f2f16_rtz option to lower f2f16 to f2f16_rtz Used by OpenGL driver like radeonsi which has undefined rounding mode. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25990>	2023-11-20 02:20:17 +00:00
Jesse Natalie	df4fc66649	nir: Add a flag to opt_if to prevent fighting with splitting 64bit phis Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26210>	2023-11-17 19:19:11 +00:00
Alyssa Rosenzweig	6d73f62e42	nir: Simplify nir_alu_instr_channel_used definition Deduplicates the "get # of channels" logic which was the same between the helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Connor Abbott	1dab2c5bd2	nir/subgroups: Add option to lower Boolean subgroup reductions This will be useful for AMD, and probably Intel as well. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/218>	2023-11-17 09:45:40 +00:00
Rhys Perry	288e9db053	nir/lower_fp16_casts: add option to split fp64 casts Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25566>	2023-11-16 11:02:30 +00:00
Karol Herbst	41f814df6f	nir: allow vec derefs on system values There is no real reason to prevent this as far as I know. And some of the SPIR-V generated by DPCPP is running into this. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25701>	2023-11-15 08:34:57 +00:00
Alyssa Rosenzweig	cc3f20ca6c	nir: Also gather decomposed primitive count Simple extension. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Antonino Maniscalco <antonino.maniscalco@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26056>	2023-11-07 00:05:54 +00:00
Alyssa Rosenzweig	b65636ca40	nir/lower_gs_intrinsics: Count decomposed primitives too We need both: decomposed primitives for transform feedback and regular primitives for sizing the index buffer. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Antonino Maniscalco <antonino.maniscalco@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26056>	2023-11-07 00:05:54 +00:00
Alyssa Rosenzweig	0a35aa3a2b	nir/lower_gs_intrinsics: Append EndPrimitive This is simpler for generic GS lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Antonino Maniscalco <antonino.maniscalco@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26056>	2023-11-07 00:05:54 +00:00
Faith Ekstrand	1793adbd3a	nir/validate: Allow array derefs on vectors on function/shader_temp This is required by OpenCL. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22580>	2023-11-02 20:28:46 +00:00
Alyssa Rosenzweig	23bea25207	nir: Add nir_remove_non_exported For libraries. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25498>	2023-11-02 11:37:46 +00:00

1394 commits