fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 05:08:06 +02:00

Author	SHA1	Message	Date
Timothy Arceri	44d6068c5b	nir: add nir based version of the lower_const_arrays_to_uniforms pass Doing this in NIR should give better results, but also allows us to stop calling more GLSL IR optimisations passes. v2: Skip 8bit and 16bit type that would require further processing I believe this is an existing bug in the GLSL IR pass also. v3: rebuild constant initialisers as we want to call this pass after nir has already lowered them and performed optimisations. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> (v1) Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16770>	2022-06-04 03:13:36 +00:00
Daniel Schürmann	b56fcefa0f	nir/opt_vectorize: refactor src rewriting to avoid unnecessary mov instructions Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15647>	2022-06-03 08:53:18 +00:00
Jason Ekstrand	d8df87056c	nir: xfb_buffer_info::stride is in bytes For the NIR XFB gathering as well as all the Vulkan drivers, buffer strides in nir_xfb_info are in bytes. When Marek started using nir_xfb_info for GLSL on radeonsi, he copied directly from the GLSL struct which has strides in dwords. This inconsistency didn't show up until I went through and started us using the NIR passes for GL drivers directly without going through the GLSL structs. We could change the nir_xfb_buffer_info field to be in dwords to be consistent with shader_info but that would mean changing all the Vulkan drivers but, for now, it's easier to always use bytes in nir_xfb_info. Fixes: `2a22885a45` ("st,nir: Use nir_shader::xfb_info in nir_lower_io_passes") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16819>	2022-06-02 14:06:31 +00:00
Erik Faye-Lund	18246ed06a	include: drop c99_math.h Since we now depend on C11, we know that we have support for the C99 math functionality. So let's drop the c99_math.h compatibility wrapper, and just include <math.h> directly. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16812>	2022-06-02 13:09:16 +00:00
Emma Anholt	6e087f96c9	nir_lower_mediump: Drop assertion about not containing movs. A 1D texture operation may need to do a mov to turn a reference to a channel of an SSA value into a scalar value to be passed as the texture coordinate (since texture srcs can't do swizzles). Seen in amnesia-the-dark-descent/low/46.shader_test() for example, where a 1D texture is used to remap each of r,g,b from a previous texture result. Besides, the nir_op_is_vec() case will (perhaps surprisingly) look through a mov, anyway. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16616>	2022-06-01 22:19:44 +00:00
Georg Lehmann	bfc25d6ec9	nir: Add optional lowering for mul_32x16. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13895>	2022-06-01 17:09:25 +00:00
Daniel Schürmann	be01e8711b	nir: introduce new nir_alu_alu_width() with nir_vectorize_cb callback This function allows to only scalarize instructions down to a desired vectorization width. nir_lower_alu_to_scalar() was changed to use the new function with a width of 1. Swizzles outside vectorization width are considered and reduce the target width. This prevents ending up with code like vec2 16 ssa_2 = iadd ssa_0.xz, ssa_1.xz which requires to emit shuffle code in backends and usually is not beneficial. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080>	2022-06-01 11:41:44 +00:00
Daniel Schürmann	bd151a256e	nir/opt_vectorize: add callback for max vectorization width The callback allows to request different vectorization factors per instruction depending on e.g. bitsize or opcode. This patch also removes using the vectorize_vec2_16bit option from nir_opt_vectorize(). Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080>	2022-06-01 11:41:44 +00:00
Lionel Landwerlin	5078b4fff1	nir/divergence: handle load_ray_num_dss_rt_stacks_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16797>	2022-06-01 04:58:50 +00:00
Lionel Landwerlin	d3c1b0ac28	nir/divergence: handle load_scratch_base_ptr v2: divergent (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16797>	2022-06-01 04:58:50 +00:00
Jason Ekstrand	2a22885a45	st,nir: Use nir_shader::xfb_info in nir_lower_io_passes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	16b0719441	glsl/nir: Stash the xfb_info in the nir_shader when linking XFB This pass is used for shaders coming in from SPIR-V. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	64cc35d2ac	nir: Drop nir_shader_get_xfb_info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	23b55dcff4	nir: Add a nir_xfb_info to nir_shader We want to be able to carry this along with the shader instead of always having to re-generate it from scratch. A new nir_gather_xfb_info() helper is also added which, instead of returning it, adds it to the shader. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jason Ekstrand	3e04432b3a	nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16750>	2022-05-31 23:09:30 +00:00
Jesse Natalie	f812cc0fe6	nir: Consider PNTC to be a varying Fixes: `3528dcdf` ("nir: add nir_io_semantics::no_varying, no_sysval_output, and helpers") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6091 Reviewed-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16761>	2022-05-31 20:51:22 +00:00
Jesse Natalie	f61788d7d3	nir_lower_task_shader: Fix return from lower_task_intrin (bool, not void*) Fixes: `8aff8d3d` ("nir: Add common task shader lowering to make the backend's job easier.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16756>	2022-05-31 18:32:59 +00:00
Jason Ekstrand	eb0d571ce4	nir: Add a correctness note for nir_lower_phis_to_regs_block Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16111>	2022-05-31 14:12:21 +00:00
Jason Ekstrand	4a4d6cdc80	nir: Handle register sources in lower_phis_to_regs_block During certain control-flow manipulation passes, we go out-of-SSA temporarily in certain areas of the code to make control-flow manipulation easier. This can result in registers being in phi sources temporarily. If two sub-passes run before we get a chance to do clean-up, we can end up doing some out-of-SSA and then a bit more out-of-SSA and trigger this case. It's easy enough to handle. Fixes: `a620f66872` ("nir: Add a couple quick-and-dirty out-of-SSA helpers") Fixes: `79a987ad2a` ("nir/opt_if: also merge break statements with ones after the branch") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6370 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16111>	2022-05-31 14:12:21 +00:00
Karol Herbst	9ff04985b9	nir/gce: pin call instructions Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16202>	2022-05-31 12:36:48 +00:00
Karol Herbst	ad34d81c48	nir/gather_info: allow to run it before inlining Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16202>	2022-05-31 12:36:48 +00:00
Timothy Arceri	00313effdb	nir/gcm: fix pushing instructions into if blocks The previous logic would just set the block to the instructions original location if we couldn't evict it from a loop. For now we only push const loads to a later block inside ifs but we can add more heuristics later. This change helps a hand full of shaders but also stops a CTS regression caused by excess spilling after a series I'm working on to disable more of the GLSL IR optimisation passes. Shader-db results iris (BDW): total instructions in shared programs: 17529759 -> 17529749 (<.01%) instructions in affected programs: 15929 -> 15919 (-0.06%) helped: 5 HURT: 2 helped stats (abs) min: 1 max: 5 x̄: 2.40 x̃: 2 helped stats (rel) min: 0.06% max: 0.15% x̄: 0.11% x̃: 0.12% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.06% max: 0.06% x̄: 0.06% x̃: 0.06% 95% mean confidence interval for instructions value: -3.34 0.49 95% mean confidence interval for instructions %-change: -0.14% 0.02% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 861109994 -> 861099681 (<.01%) cycles in affected programs: 7027698 -> 7017385 (-0.15%) helped: 95 HURT: 72 helped stats (abs) min: 1 max: 7995 x̄: 138.54 x̃: 9 helped stats (rel) min: <.01% max: 15.96% x̄: 0.54% x̃: 0.11% HURT stats (abs) min: 1 max: 474 x̄: 39.56 x̃: 12 HURT stats (rel) min: <.01% max: 1.17% x̄: 0.20% x̃: 0.11% 95% mean confidence interval for cycles value: -159.05 35.54 95% mean confidence interval for cycles %-change: -0.45% 0.01% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 17606 -> 17605 (<.01%) spills in affected programs: 323 -> 322 (-0.31%) helped: 1 HURT: 0 total fills in shared programs: 22599 -> 22598 (<.01%) fills in affected programs: 1348 -> 1347 (-0.07%) helped: 1 HURT: 0 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14940>	2022-05-31 01:03:43 +00:00
Timur Kristóf	112a856813	nir: Keep track of cross-invocation mesh shader output access. On some implementations eg. AMD RDNA2 the driver can generate a more optimal code path knowing whether outputs are indexed using the local invocation index or not. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16736>	2022-05-27 11:22:07 +00:00
Timur Kristóf	8aff8d3dd4	nir: Add common task shader lowering to make the backend's job easier. 1. Lowers NV_mesh_shader TASK_COUNT output to launch_mesh_workgroups. 2. Removes all code after launch_mesh_workgroups, enforcing the fact that it's a terminating instruction. 3. Ensures that task shaders always have at least one launch_mesh_workgroups instruction, so the backend doesn't need to implement a special case when the shader doesn't have it. 4. Optionally, implements task_payload using shared memory when task_payload atomics are used. This is useful when the backend is otherwise not capable of handling the same atomic features as it can for shared memory. If this is used, the backend only has to implement the basic load/store operations for task_payload. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16720>	2022-05-27 07:52:03 +00:00
Timur Kristóf	9eaf918ed2	nir: Add new launch_mesh_workgroups intrinsic. The new intrinsic launches mesh shader workgroups from a task shader, with explicit task_payload. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16720>	2022-05-27 07:52:03 +00:00
Marcin Ślusarz	b95d9bca1d	nir: add load_task_payload intrinsic to nir_divergence_analysis It's divergent depending on sources. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16668>	2022-05-24 17:53:29 +00:00
Marcin Ślusarz	95dbdbf063	nir: add load_mesh_inline_data_intel intrinsic to nir_divergence_analysis It's not divergent. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16668>	2022-05-24 17:53:29 +00:00
Timur Kristóf	47da245ff2	nir: Add explicit task payload atomic intrinsics. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16693>	2022-05-24 17:21:22 +00:00
Icecream95	9f9ed959bd	nir: Add store_combined_output_pan BASE back It's meaningful for this intrinsic and so does not add noise to the lowering pass. (Although dual-source writes must be to RT 0, depth and stencil writes, which store_combined_output_pan is also used for, can still be done with MRT enabled.) Fixes: `5c168f09eb` ("nir: Eliminate store_combined_output_pan BASE") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16685>	2022-05-24 16:13:33 +00:00
Jason Ekstrand	836ff4b586	nir/algebraic: Add two more pack/unpack rules Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16591>	2022-05-23 14:10:54 +00:00
Rhys Perry	6087f1951e	nir: call nir_metadata_preserve in nir_lower_memory_model Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Rhys Perry	3eed871f41	nir: call nir_metadata_preserve in nir_vectorize_tess_levels This is necessary to use this pass with the NIR_PASS() macro. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Rhys Perry	f10d4bf963	nir: call nir_metadata_preserve in nir_io_add_const_offset_to_base This is necessary to use this pass with the NIR_PASS() macro. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Rhys Perry	0d9ead8ca2	nir: print file when validation fails This should make it clear whether a validation failure happens in RADV or zink. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Rhys Perry	836470d433	nir: allow NIR_PASS(_, ) If a user wants to skip printing the shader if no changes were made without declaring a dummy variable for the progress. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12448>	2022-05-19 13:37:20 +00:00
Timothy Arceri	c4cec84231	nir/i915g/r300/nv30: skip marking varyings as flat in some drivers Some older drivers don't support GLSL versions with the concept of flat varyings and also don't support integers. Here we add a new setting to make sure we don't use the optimisation that sets varyings to flat. This setting helps us avoid marking varyings as flat and therefore potentially having them changed to ints via varying packing. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6500 Fixes: `7647023f3b` ("glsl: enable the use of the nir based varying linker") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16573>	2022-05-19 01:05:32 +00:00
Timothy Arceri	ff8ddcb23e	nir: add support for forced sampler indirect loop unrolling Some drivers don't support these indirects and therefore require loop unrolling if a shader uses a loop induction variable to access a sampler array. Here we add a new nir shader compiler option that drivers can set, this will be the equivalent of the EmitNoIndirectSampler setting used in the GLSL IR unrolling pass. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16543>	2022-05-17 02:12:21 +00:00
Gert Wollny	3749a6ecd2	nir: honor lower_double options for ffloor and ffract v2: Don't lower ffloor@64 to ffract@64 when both ops are to be lowered. Settle on ffloor in opt_algebraic because in can be lowered to other ops in lower_double_ops. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>(v1) Jason Ekstrand <jason.ekstrand@collabora.com> (v1) Reviewed-by: Emma Anholt <emma@anholt.net> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16431>	2022-05-16 15:03:05 +00:00
Timothy Arceri	c1fbd0b8ab	nir: skip lowering io to scalar for must_be_shader_input These varyings cannot be packed by the GLSL linkers packing pass so we need to skip this lowering until later when we can properly handle them. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	99ab530617	nir: abort io info gathering if location is not set or is a temp value Unlike spirv glsl varyings might not have explicit locations set. nir_shader_gather_info() was once only called at the end of linking but these days it even gets called in NIR optimisation loops via nir_opt_phi_precision. In the following patches we implement a NIR version of the GLSL varying linker which means we will have varyings with no location set when nir_shader_gather_info() gets called the first few times, and temp values set only for the purpose of removing unmatched varyings between shaders for some calls after that. Here rather than asserting we simply abort the io info gathering, when we hit these values. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Timothy Arceri	cba2fd51a2	nir: add variable data fields required for NIR glsl varying linking These will be used in the following patches that add a NIR based varying linker. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15731>	2022-05-16 03:33:18 +00:00
Jason Ekstrand	98cc4c3a20	nir: Use nir_shader_instructions_pass in nir_lower_input_attachments This simplifies things a bit and also fixes metadata handling. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	a170448a18	nir: Put the builder first in lower_input_attachments helpers This is more idiomatic. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	5410f4ee89	mesa/st: Use lower_indirect_var_derefs in st_nir_lower_builtin Instead of having a special NIR helper for GL stuff, we can now use the more generic helper and do so directly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	e16197c46e	nir: Add a var set version of lower_indirect_derefs This version takes a set of variables and totally lowers indirects on any variable in the set. We also rewrite the builtin_uniform version to use the new helper internally. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Jason Ekstrand	c23b20d43a	nir: Preserve metadata if remove_dead_derefs makes no progress Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16482>	2022-05-13 22:51:38 +00:00
Georg Lehmann	bc5c68fc08	nir/opt_algebraic: Optimize Doom Eternal's word extract by LSB. Foz-db GFX10_3: Totals from 419 (0.31% of 134913) affected shaders: CodeSize: 4126032 -> 4121756 (-0.10%) Instrs: 783608 -> 782541 (-0.14%) Latency: 7889664 -> 7888521 (-0.01%); split: -0.02%, +0.00% InvThroughput: 1315690 -> 1314863 (-0.06%); split: -0.06%, +0.00% VClause: 11826 -> 11830 (+0.03%) SClause: 27736 -> 27734 (-0.01%) Copies: 50493 -> 50428 (-0.13%); split: -0.13%, +0.01% PreSGPRs: 23264 -> 23265 (+0.00%) Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16436>	2022-05-12 17:10:41 +00:00
Konstantin Seurer	938c9d9615	nir: Add a ray launch size addr intrinsic Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15712>	2022-05-12 15:04:31 +00:00
Timothy Arceri	0f98ed4afe	nir: remove unreachable loop terminators Remove the conditional break statements associated with all terminators that are associated with a fixed iteration count, except for the one associated with the limiting terminator. This logic matches similiar functionality that exists in the old GLSL IR unrolling code. This change helps a piglit test pass on the r300 driver once we switch off the old GLSL IR unrolling code. Shader-db results IRIS (BDW): total instructions in shared programs: 17538619 -> 17538595 (<.01%) instructions in affected programs: 216 -> 192 (-11.11%) helped: 3 HURT: 0 helped stats (abs) min: 7 max: 10 x̄: 8.00 x̃: 7 helped stats (rel) min: 10.00% max: 12.07% x̄: 11.38% x̃: 12.07% total cycles in shared programs: 858674910 -> 858672810 (<.01%) cycles in affected programs: 79540 -> 77440 (-2.64%) helped: 3 HURT: 0 helped stats (abs) min: 620 max: 800 x̄: 700.00 x̃: 680 helped stats (rel) min: 2.45% max: 2.83% x̄: 2.63% x̃: 2.62% Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16399>	2022-05-12 02:06:31 +00:00
Timothy Arceri	4c3d138e5d	nir: always set the exact_trip_count_unknown loop terminator property Previously we only cared if this was set for the limiting terminator. However in the following patch we will make use of this information on other terminators to decide if we can eliminate them. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16399>	2022-05-12 02:06:31 +00:00

1 2 3 4 5 ...

3721 commits