fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 00:38:06 +02:00

Author	SHA1	Message	Date
Qiang Yu	33b4b923ee	nir: add nir_intrinsic_load_lshs_vertex_stride_amd For loading LS-HS vertex stride by shader argument in radeonsi. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16418>	2022-06-07 01:40:14 +00:00
Timothy Arceri	4237932685	glsl: tidy up link_varyings_and_uniforms() All uniform linking is now done via nir based linker not via this code so we drop that from its name. We also drop a bunch of unused parameters. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16880>	2022-06-07 01:11:19 +00:00
Timothy Arceri	f00be793e4	glsl: drop extra optimise swizzles call As per the comment this was meant to tidy things up after varying linking but varying linking has been moved into a nir based linker so this extra call is no longer needed. This optimisation pass is still called in the regular glsl ir optimisation loop. No shader-db change on Iris (BDW). Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16880>	2022-06-07 01:11:19 +00:00
Qiang Yu	19f3737262	mesa: pass select result buffer offset as attribute/varying Will be used by geometry shader to store hit result. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Qiang Yu	ff8ae4e589	nir/builder: add load/store array variable helper functions Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15765>	2022-06-06 18:23:49 +00:00
Mike Blumenkrantz	06859ba69c	mesa: handle atomic counter lowering for drivers with big ssbo offset aligns according to the spec, atomic counters can be bound at any offset divisible by 4, which means that any driver that uses the ssbo lowering pass and doesn't have a min offset align of 4 is potentially broken to handle this, use a statevar to inject the misaligned remainder of the offset into the shader as a uniform. for well-aligned counter binds, the uniform offset will be 0 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16749>	2022-06-05 23:16:36 +00:00
Vinson Lee	3e679219a1	clc: Fix build with llvm-15. opencl_c_h is defined only for llvm < 15. Fixes: `bcc2df4890` ("clc: speed up compilation by not relying on opencl-c.h") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16808>	2022-06-04 22:27:55 -07:00
Timothy Arceri	5aec67a1e1	glsl: remove the now unused GLSL IR loop unrolling code This code was slow, buggy and hard to understand. All drivers have now switched to using the NIR unrolling code \o/ Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Alyssa Rosenzweig	dc2d8a643f	nir: Export nir_io_add_intrinsic_xfb_info This is useful for drivers which wish to consume XFB information. These hopefully-uncontroversial hunks are extracted from the much more controversial "st,nir,radeons: Move nir_lower_io_passes to si_nir_lower_io" by Jason. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15720>	2022-06-04 14:35:56 +00:00
Alyssa Rosenzweig	5c79d649af	nir: Add transform feedback system values These will be used to facilitate transform feedback lowering for Panfrost, although other backends could use the sysvals in the future. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15720>	2022-06-04 14:35:56 +00:00
Timothy Arceri	87aaa0f915	glsl: remove now unused lower_const_arrays_to_uniforms() We now use a NIR version instead. Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	c573260c9b	glsl: switch to NIR based implementation of lower_const_arrays_to_uniforms() Shader-db results iris (BDW): total instructions in shared programs: 17523543 -> 17513909 (-0.05%) instructions in affected programs: 218091 -> 208457 (-4.42%) helped: 69 HURT: 327 helped stats (abs) min: 2 max: 2919 x̄: 160.84 x̃: 12 helped stats (rel) min: 0.21% max: 96.88% x̄: 14.87% x̃: 6.40% HURT stats (abs) min: 1 max: 47 x̄: 4.48 x̃: 1 HURT stats (rel) min: 0.10% max: 22.02% x̄: 3.33% x̃: 0.18% 95% mean confidence interval for instructions value: -45.02 -3.63 95% mean confidence interval for instructions %-change: -1.16% 1.47% Inconclusive result (%-change mean confidence interval includes 0). total loops in shared programs: 4875 -> 4868 (-0.14%) loops in affected programs: 7 -> 0 helped: 7 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% 95% mean confidence interval for loops value: -1.00 -1.00 95% mean confidence interval for loops %-change: -100.00% -100.00% Loops are helped. total cycles in shared programs: 858032406 -> 857984712 (<.01%) cycles in affected programs: 22940290 -> 22892596 (-0.21%) helped: 155 HURT: 312 helped stats (abs) min: 1 max: 49696 x̄: 1697.70 x̃: 62 helped stats (rel) min: <.01% max: 70.84% x̄: 5.60% x̃: 0.82% HURT stats (abs) min: 1 max: 19640 x̄: 690.54 x̃: 100 HURT stats (rel) min: <.01% max: 217.23% x̄: 33.57% x̃: 0.92% 95% mean confidence interval for cycles value: -436.09 231.84 95% mean confidence interval for cycles %-change: 15.39% 25.75% Inconclusive result (value mean confidence interval includes 0). 
total spills in shared programs: 16289 -> 15205 (-6.65%) spills in affected programs: 2753 -> 1669 (-39.38%) helped: 9 HURT: 1 total fills in shared programs: 20347 -> 20324 (-0.11%) fills in affected programs: 1642 -> 1619 (-1.40%) helped: 9 HURT: 1 total sends in shared programs: 972151 -> 971960 (-0.02%) sends in affected programs: 1910 -> 1719 (-10.00%) helped: 25 HURT: 20 helped stats (abs) min: 1 max: 50 x̄: 9.00 x̃: 2 helped stats (rel) min: 0.87% max: 53.76% x̄: 13.89% x̃: 6.25% HURT stats (abs) min: 1 max: 8 x̄: 1.70 x̃: 1 HURT stats (rel) min: 8.33% max: 200.00% x̄: 52.36% x̃: 33.33% 95% mean confidence interval for sends value: -8.19 -0.29 95% mean confidence interval for sends %-change: -1.07% 32.18% Inconclusive result (%-change mean confidence interval includes 0). LOST: 3 GAINED: 27 Note a small number of tests fail on lima and r300 after this patch. However since we are doing the correct thing here and they only fail due to a slight increase in instruction count pushing them over their instruction count limit, we are deferring that issue to a different bug report for further discussion. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6540 Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	1805ee8d7b	glsl: move gl_nir_link_opts() call out of the st code Calling this directly in the linker code allows us to place it between the varying linker and uniform linker calls which allows for better optimisation/removal of uniforms. Also in a later patch it allows us to insert a new nir based lower_const_arrays_to_uniforms() call after the gl_nir_link_opts() call. This is important because it allows the linking opts to move constant arrays to later stages if possible before lower_const_arrays_to_uniforms() turns them into uniforms. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6541 Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	a14e2733ce	glsl: move common link time optimisation calls to linker code In the following patch we will move the users of this function to this file too and make it static again. Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	64dbc3f03a	glsl/nir: allow the nir linker to remove dead uniforms we created Some backends lower constant arrays to uniforms in GLSL IR. These create so called hidden uniforms. Since we know these are added per stage it is safe to remove them if we detect they are dead. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	4488b577a1	glsl/nir: skip adding hidden uniforms to the remap tables The remap tables are used with the GL API so there is no need to add hidden uniforms to them. Also when we switch to lowering some constant arrays to uniforms in NIR in a following patch there will no longer be enough room in the tables as we assign their size in the GLSL IR linker not the NIR linker currently. Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Timothy Arceri	44d6068c5b	nir: add nir based version of the lower_const_arrays_to_uniforms pass Doing this in NIR should give better results, but also allows us to stop calling more GLSL IR optimisations passes. v2: Skip 8bit and 16bit type that would require further processing I believe this is an existing bug in the GLSL IR pass also. v3: rebuild constant initialisers as we want to call this pass after nir has already lowered them and performed optimisations. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (v1) Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Daniel Schürmann	b56fcefa0f	nir/opt_vectorize: refactor src rewriting to avoid unnecessary mov instructions Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15647>	2022-06-03 08:53:18 +00:00
Danylo Piliaiev	eb5f4c2f6b	spirv: Workaround for RelaxedPrecision on OpLogical* in 3DMark Per spec RelaxedPrecision cannot be applied to bool types, however 3DMark Wild Life does it: OpDecorate %171 RelaxedPrecision ... %171 = OpLogicalAnd %bool %169 %170 Fixes crash in 3DMark Wild Life on Android. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16746>	2022-06-03 07:50:53 +00:00
Jason Ekstrand	d8df87056c	nir: xfb_buffer_info::stride is in bytes For the NIR XFB gathering as well as all the Vulkan drivers, buffer strides in nir_xfb_info are in bytes. When Marek started using nir_xfb_info for GLSL on radeonsi, he copied directly from the GLSL struct which has strides in dwords. This inconsistency didn't show up until I went through and started us using the NIR passes for GL drivers directly without going through the GLSL structs. We could change the nir_xfb_buffer_info field to be in dwords to be consistent with shader_info but that would mean changing all the Vulkan drivers but, for now, it's easier to always use bytes in nir_xfb_info. Fixes: `2a22885a45` ("st,nir: Use nir_shader::xfb_info in nir_lower_io_passes") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16819>	2022-06-02 14:06:31 +00:00
Jason Ekstrand	7c876a6b2f	nir/glsl: Use rzalloc for nir_xfb_info A lot of the fields get fully overwritten but outputs/buffers_written are both bitfields that we set one bit at a time. Fixes: `7c5dc0b11a` ("glsl/nir: Populate nir_shader::xfb_info after linking varyings") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16819>	2022-06-02 14:06:31 +00:00
Erik Faye-Lund	18246ed06a	include: drop c99_math.h Since we now depend on C11, we know that we have support for the C99 math functionality. So let's drop the c99_math.h compatibility wrapper, and just include <math.h> directly. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812>	2022-06-02 13:09:16 +00:00
Emma Anholt	6e087f96c9	nir_lower_mediump: Drop assertion about not containing movs. A 1D texture operation may need to do a mov to turn a reference to a channel of an SSA value into a scalar value to be passed as the texture coordinate (since texture srcs can't do swizzles). Seen in amnesia-the-dark-descent/low/46.shader_test() for example, where a 1D texture is used to remap each of r,g,b from a previous texture result. Besides, the nir_op_is_vec() case will (perhaps surprisingly) look through a mov, anyway. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Georg Lehmann	bfc25d6ec9	nir: Add optional lowering for mul_32x16. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13895>	2022-06-01 17:09:25 +00:00
Daniel Schürmann	be01e8711b	nir: introduce new nir_alu_alu_width() with nir_vectorize_cb callback This function allows to only scalarize instructions down to a desired vectorization width. nir_lower_alu_to_scalar() was changed to use the new function with a width of 1. Swizzles outside vectorization width are considered and reduce the target width. This prevents ending up with code like vec2 16 ssa_2 = iadd ssa_0.xz, ssa_1.xz which requires to emit shuffle code in backends and usually is not beneficial. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080>	2022-06-01 11:41:44 +00:00
Daniel Schürmann	bd151a256e	nir/opt_vectorize: add callback for max vectorization width The callback allows to request different vectorization factors per instruction depending on e.g. bitsize or opcode. This patch also removes using the vectorize_vec2_16bit option from nir_opt_vectorize(). Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080>	2022-06-01 11:41:44 +00:00
Emma Anholt	7472bb4bad	glsl,nir: Move i/umulExtended lowering to NIR. NIR already has the necessary lowering, and the GLSL lowering violates GLSL IR validation rules. Once quadop lowering was turned off, the IR validation at the end of the compile path on DEBUG builds caught the problem. In order to move the lowering to NIR, though, we need to make sure that drivers supporting these functions actually have the lowering flag set. xfails added for t860, where apparently this tickles a variety of existing 64-bit bugs in the backend. Fixes: #6461 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16437>	2022-06-01 10:56:35 +00:00
Lionel Landwerlin	5078b4fff1	nir/divergence: handle load_ray_num_dss_rt_stacks_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16797>	2022-06-01 04:58:50 +00:00
Lionel Landwerlin	d3c1b0ac28	nir/divergence: handle load_scratch_base_ptr v2: divergent (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16797>	2022-06-01 04:58:50 +00:00
Jason Ekstrand	2a22885a45	st,nir: Use nir_shader::xfb_info in nir_lower_io_passes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	16b0719441	glsl/nir: Stash the xfb_info in the nir_shader when linking XFB This pass is used for shaders coming in from SPIR-V. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	36d8a2f1d7	glsl/nir: Stop leaking varyings_info Fixes: `34b3b92bbe` ("nir/xfb: move varyings info out of nir_xfb_info") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	7c5dc0b11a	glsl/nir: Populate nir_shader::xfb_info after linking varyings Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	64cc35d2ac	nir: Drop nir_shader_get_xfb_info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	23b55dcff4	nir: Add a nir_xfb_info to nir_shader We want to be able to carry this along with the shader instead of always having to re-generate it from scratch. A new nir_gather_xfb_info() helper is also added which, instead of returning it, adds it to the shader. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	3e04432b3a	nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jesse Natalie	f812cc0fe6	nir: Consider PNTC to be a varying Fixes: `3528dcdf` ("nir: add nir_io_semantics::no_varying, no_sysval_output, and helpers") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6091 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16761>	2022-05-31 20:51:22 +00:00
Jesse Natalie	f61788d7d3	nir_lower_task_shader: Fix return from lower_task_intrin (bool, not void*) Fixes: `8aff8d3d` ("nir: Add common task shader lowering to make the backend's job easier.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16756>	2022-05-31 18:32:59 +00:00
Jason Ekstrand	eb0d571ce4	nir: Add a correctness note for nir_lower_phis_to_regs_block Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16111>	2022-05-31 14:12:21 +00:00
Jason Ekstrand	4a4d6cdc80	nir: Handle register sources in lower_phis_to_regs_block During certain control-flow manipulation passes, we go out-of-SSA temporarily in certain areas of the code to make control-flow manipulation easier. This can result in registers being in phi sources temporarily. If two sub-passes run before we get a chance to do clean-up, we can end up doing some out-of-SSA and then a bit more out-of-SSA and trigger this case. It's easy enough to handle. Fixes: `a620f66872` ("nir: Add a couple quick-and-dirty out-of-SSA helpers") Fixes: `79a987ad2a` ("nir/opt_if: also merge break statements with ones after the branch") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6370 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16111>	2022-05-31 14:12:21 +00:00
Karol Herbst	9ff04985b9	nir/gce: pin call instructions Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16202>	2022-05-31 12:36:48 +00:00
Karol Herbst	ad34d81c48	nir/gather_info: allow to run it before inlining Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16202>	2022-05-31 12:36:48 +00:00
Timothy Arceri	00313effdb	nir/gcm: fix pushing instructions into if blocks The previous logic would just set the block to the instruction's original location if we couldn't evict it from a loop. For now we only push const loads to a later block inside ifs but we can add more heuristics later. This change helps a handful of shaders but also stops a CTS regression caused by excess spilling after a series I'm working on to disable more of the GLSL IR optimisation passes. Shader-db results iris (BDW): total instructions in shared programs: 17529759 -> 17529749 (<.01%) instructions in affected programs: 15929 -> 15919 (-0.06%) helped: 5 HURT: 2 helped stats (abs) min: 1 max: 5 x̄: 2.40 x̃: 2 helped stats (rel) min: 0.06% max: 0.15% x̄: 0.11% x̃: 0.12% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.06% max: 0.06% x̄: 0.06% x̃: 0.06% 95% mean confidence interval for instructions value: -3.34 0.49 95% mean confidence interval for instructions %-change: -0.14% 0.02% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 861109994 -> 861099681 (<.01%) cycles in affected programs: 7027698 -> 7017385 (-0.15%) helped: 95 HURT: 72 helped stats (abs) min: 1 max: 7995 x̄: 138.54 x̃: 9 helped stats (rel) min: <.01% max: 15.96% x̄: 0.54% x̃: 0.11% HURT stats (abs) min: 1 max: 474 x̄: 39.56 x̃: 12 HURT stats (rel) min: <.01% max: 1.17% x̄: 0.20% x̃: 0.11% 95% mean confidence interval for cycles value: -159.05 35.54 95% mean confidence interval for cycles %-change: -0.45% 0.01% Inconclusive result (value mean confidence interval includes 0). 
total spills in shared programs: 17606 -> 17605 (<.01%) spills in affected programs: 323 -> 322 (-0.31%) helped: 1 HURT: 0 total fills in shared programs: 22599 -> 22598 (<.01%) fills in affected programs: 1348 -> 1347 (-0.07%) helped: 1 HURT: 0 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14940>	2022-05-31 01:03:43 +00:00
Mike Blumenkrantz	3394e81eb1	vtn: assert that composite members have the same bit size as the result Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16667>	2022-05-27 14:06:32 +00:00
Mike Blumenkrantz	54e1072ff6	vtn: assert that vector shuffle indices are in-bounds Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16667>	2022-05-27 14:06:32 +00:00
Timur Kristóf	112a856813	nir: Keep track of cross-invocation mesh shader output access. On some implementations eg. AMD RDNA2 the driver can generate a more optimal code path knowing whether outputs are indexed using the local invocation index or not. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16736>	2022-05-27 11:22:07 +00:00
Timur Kristóf	8aff8d3dd4	nir: Add common task shader lowering to make the backend's job easier. 1. Lowers NV_mesh_shader TASK_COUNT output to launch_mesh_workgroups. 2. Removes all code after launch_mesh_workgroups, enforcing the fact that it's a terminating instruction. 3. Ensures that task shaders always have at least one launch_mesh_workgroups instruction, so the backend doesn't need to implement a special case when the shader doesn't have it. 4. Optionally, implements task_payload using shared memory when task_payload atomics are used. This is useful when the backend is otherwise not capable of handling the same atomic features as it can for shared memory. If this is used, the backend only has to implement the basic load/store operations for task_payload. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16720>	2022-05-27 07:52:03 +00:00
Timur Kristóf	9eaf918ed2	nir: Add new launch_mesh_workgroups intrinsic. The new intrinsic launches mesh shader workgroups from a task shader, with explicit task_payload. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16720>	2022-05-27 07:52:03 +00:00
Marcin Ślusarz	b95d9bca1d	nir: add load_task_payload intrinsic to nir_divergence_analysis It's divergent depending on sources. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16668>	2022-05-24 17:53:29 +00:00
Marcin Ślusarz	95dbdbf063	nir: add load_mesh_inline_data_intel intrinsic to nir_divergence_analysis It's not divergent. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16668>	2022-05-24 17:53:29 +00:00
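
Commit 06859ba69c above describes handling atomic counter buffers bound at offsets that are a multiple of 4 but not of the driver's minimum SSBO offset alignment: the misaligned remainder is passed to the shader as a uniform, and it is 0 for well-aligned binds. The arithmetic might look roughly like the sketch below. The struct and function names are hypothetical and are not taken from the Mesa source; the sketch also assumes the minimum alignment is a power of two.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper type for illustration only; not a Mesa struct. */
struct counter_offset_split {
   uint32_t aligned_base; /* offset rounded down to the driver's alignment */
   uint32_t remainder;    /* leftover bytes, supplied to the shader as a uniform */
};

/* Split a counter buffer bind offset into an aligned base and a remainder.
 * Assumes min_align is a power of two and offset is a multiple of 4.
 */
static struct counter_offset_split
split_counter_offset(uint32_t offset, uint32_t min_align)
{
   assert(min_align != 0 && (min_align & (min_align - 1)) == 0);
   assert(offset % 4 == 0);

   struct counter_offset_split s;
   s.remainder = offset & (min_align - 1);
   s.aligned_base = offset - s.remainder;
   return s;
}
```

With a minimum alignment of 16, a bind at offset 20 would split into a base of 16 and a remainder of 4, while a bind at offset 64 would leave a remainder of 0.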

7067 commits