fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 13:58:05 +02:00

Author	SHA1	Message	Date
Daniel Schürmann	2a92452a0e	nir/opt_shrink_vectors: Remove shrinking of store intrinsics data source This is done via nir_opt_shrink_stores. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:50:47 +01:00
Daniel Schürmann	2dba7e6056	nir: split nir_opt_shrink_stores from nir_opt_shrink_vectors This patch moves the shrinking of store sources into a separate pass. The reasoning behind this is that this pass usually only needs to be called once while nir_shrink_vectors might better be called several times. This allows to move the pass(es) out of the optimization loops. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:34:01 +01:00
Ian Romanick	1cb3d1a6ae	nir: Produce correct results for atan with NaN Properly handling NaN adversely affects several hundred shaders in shader-db (lots of Skia and a few others from various synthetic benchmarks) and fossil-db (mostly Talos and some Doom 2016). Only apply the NaN handling work-around when the shader demands it. v2: Add comment explaining the 1.0*y_over_x. Suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `2098ae16c8` ("nir/builder: Move nir_atan and nir_atan2 from SPIR-V translator") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	7d0d9b9fbc	nir: Properly handle various exceptional values in frexp frexp_sig of ±0, ±Inf, or NaN should just return the input unmodified. frexp_exp of ±Inf or NaN is undefined, and frexp_exp of ±0 should return the input unmodified. This seems to already work. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `23d30f4099` ("spirv,nir: lower frexp_exp/frexp_sig inside a new NIR pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	38800b385c	nir: All set-on-comparison opcodes can take all float types Extend `4195a9450b` so that the next poor fool doesn't come along and say, "sge does the right thing for 16-bit sources, but slt gives a NIR validation failure. What the deuce?" NOTE: This commit is necessary to prevent regressions in GLSLstd450Step tests of 16-bit sources at "spriv: Produce correct result for GLSLstd450Step with NaN". Fixes: `4195a9450b` ("nir: sge operation is defined for floating-point types") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	97ce3a56bd	nir/search: Constify instr parameter to nir_search_expression::cond Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	4dd4135551	nir: Constify def parameter to nir_ssa_def_bits_used Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Otavio Pontes	510d248299	nir: Use proper macro to set bits of variable correctly When slots is 64 only the first bit was being set, instead of setting all 64 bits of the variable, so for that case the function get_variable_io_mask() always returned 0. This behaviour caused variables that are being used both on producer and consumer to be considered unused and thus being removed on nir_remove_unused_io_vars(). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14955>	2022-02-10 17:19:54 +00:00
Georg Lehmann	c2168f845e	nir/lower_mediump: Treat u2u16 like i2i16. There is a comment in nir_fold_16bit_sampler_conversions saying that these are the same, but the code only checks for i2i16. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14893>	2022-02-10 16:13:54 +00:00
Emma Anholt	20469009c7	nir: Delete the per-instr SSA liveness impl. It was introduced for nir-to-tgsi, and I found that it was the wrong approach. There's a reason nobody else does RA this way. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14404>	2022-02-10 00:36:57 +00:00
Emma Anholt	f4ee7146f9	nir: Split the flag for lowering of fabs and fneg to source modifiers. i915 and r300 have fneg source modifier but not fabs, and doing it in NIR can save us some backend pain. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14938>	2022-02-08 18:50:01 -08:00
Lionel Landwerlin	c78be5da30	intel/fs: lower ray query intrinsics v2: Add helper for acceleration->root_node computation (Caio) v3: Update comment on "done" bit (Caio) Remove progress bool value for impl function (Caio) Don't use nir_shader_instructions_pass to search the shader (Caio) v4: Rename variable for if/else block (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	e06f9d49bc	nir/lower_shader_calls: consider relocated constants as rematerializable After all they're constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	6d9ae6ec1e	intel: add a new intrinsic to get the shader stage from bindless shaders We'll use this to apply ray tracing operations in our trivial return shader based on the stage we're in. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	b8f087b0e6	nir/builder: add nir_ior_imm() helper Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	bb40e999d1	intel/nir: use a single intel intrinsic to deal with ray traversal In the future we'll want to reuse this intrinsic to deal with ray queries. Ray queries will use a different global pointer and programmatically change the control/level arguments of the trace send instruction. v2: Comment on barrier after sync trace instruction (Caio) Generalize lsc helper (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	4deb8e86df	nir: change intel dss_id intrinsic to topology_id This will allow to reuse the same intrinsic for various topology based ID. v2: fix intrinsic comment (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Alyssa Rosenzweig	aaf00a1b4d	nir,zink: Make lower_discard_if a common pass This pass (lowering discard_if to control flow and unconditional discard) was originally written for Zink, but is useful for hardware that lacks conditional discard instructions like AGX. In theory AGX could implement a conditional discard with CSEL, but the vendor compiler uses a lowering like this one. Since I like not writing code, I'd like to use the pass that's already in tree. v2: Don't preserve dominance (Jason). Assert we don't see demotes or terminates (Jason). Add Mike's ack. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14217>	2022-02-04 19:09:23 +00:00
Lionel Landwerlin	e227bb9fd5	nir/builder: add ishl_imm helper v2: add (y >= x->bit_size) condition (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Connor Abbott	503a5bae59	nir: Add support for lowering shuffle to a waterfall loop Qualcomm doesn't natively support shuffle, but it does natively support relative shuffles where the delta is a constant. Therefore we'll expose emulated support for both. Add support for this emulation of subgroupShuffle() to NIR. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:45 +00:00
Connor Abbott	913bec10c4	nir/lower_subgroups: Rename lower_shuffle to lower_relative_shuffle This option only applies to relative shuffles (up/down/xor), and in a moment we're going to add an option to lower normal shuffles, so rename it. While we're here, rename lower_shuffle() to lower_to_shuffle() for similar reasons. Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:45 +00:00
Marcin Ślusarz	24fef8f33d	intel/compiler: Use Task/Mesh InlineData for the first few push constants Replace load_mesh_global_arg_addr_intel with a more general intrinsic load_mesh_inline_data_intel, since inline data now hold both a pointer descriptor information and the first few push constants. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Christian Gmeiner	e5f9cdac1f	nir/nir_lower_tex_shadow: support tex_instr without deref src Use texture_index if there is no deref src. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14308>	2022-01-28 18:40:53 +00:00
Christian Gmeiner	e67bca3fe7	nir: make lower_sample_tex_compare a common pass This pass was originally written for d3d12, but is useful for hardware that lacks sample compare support like some etnaviv GPU models. Also rename the lowering pass and some surrounding code to nir_lower_tex_shadow as suggested by Emma. I'd like to use the pass that's already in tree. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14308>	2022-01-28 18:40:53 +00:00
Emma Anholt	61400f8a2d	nir/lower_locals_to_regs: Do an ad-hoc copy propagate on our generated MOV. I noticed the inefficiency in NIR-to-TGSI output while trying to debug a failure handling some arrays in r600. While this makes reading CTS shaders easier, the effect in the real world is pretty limited. From softpipe shader-db: total instructions in shared programs: 2929840 -> 2929836 (<.01%) instructions in affected programs: 118 -> 114 (-3.39%) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14321>	2022-01-25 06:01:13 +00:00
Bas Nieuwenhuizen	d1530a3f3b	Revert "nir/algebraic: distribute fmul(fadd(a, b), c) when b and c are constants" This reverts commit `a1af902531`. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5423 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14532>	2022-01-21 16:58:11 +00:00
Rhys Perry	af51efe195	nir/builder: assume scalar alignment if not provided Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	e9e1a44872	nir/builder: set write mask if not provided Zero isn't really a valid write mask. If it's provided, use a full write mask. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	495debebad	nir/algebraic: optimize expressions using fmulz/ffmaz Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	14b8227083	nir: add some missing nir_alu_type_get_base_type Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	f2fbba7920	nir/algebraic: optimize open-coded fmulz/ffmaz This pattern will be found in future versions of D3D9 DXVK. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	312a284980	nir/algebraic: add ignore_exact() wrapper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	7f05ea3793	nir: add nir_op_fmulz and nir_op_ffmaz Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Emma Anholt	cac6f633b2	nir/opt_offsets: Use nir_ssa_scalar to chase offset additions. For nir_to_tgsi, I want to be able to fold into the base from a vector load_const, which the ad-hoc scalar chasing couldn't handle. r300: total instructions in shared programs: 1278731 -> 1256502 (-1.74%) instructions in affected programs: 457909 -> 435680 (-4.85%) total flowcontrol in shared programs: 8316 -> 8313 (-0.04%) flowcontrol in affected programs: 5 -> 2 (-60.00%) total temps in shared programs: 213687 -> 213774 (0.04%) temps in affected programs: 13140 -> 13227 (0.66%) total consts in shared programs: 952850 -> 949929 (-0.31%) consts in affected programs: 386352 -> 383431 (-0.76%) Fixes: #5781 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	645ca56425	nir/opt_offsets: Also apply the max offset to top-level constant folding. nir_to_tgsi wants this for disabling folding into shared var accesses at all. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	ec4b9909f0	nir/opt_offsets: Disable unsigned wrap checks on non-native-integers HW. Since we don't have 32-bit ints, these checks for 32-bit unsigned wrapping don't help and just reduce optimization opportunities (particularly for DX9 addressing math). Doesn't affect any current consumers. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	700d2fbd0a	nir: Add a .base field to nir_load_ubo_vec4. This lets nir-to-tgsi fold the constant offset of addressing calculations into the CONST[] reference, which is important for D3D9-era compatibility: HW of that age has limited uniform space, and if we do the addressing math as math in the shader for dynamic indexing, the nir_load_consts end up taking up uniforms we don't have available. r300: total instructions in shared programs: 1279699 -> 1279167 (-0.04%) instructions in affected programs: 134796 -> 134264 (-0.39%) total instructions in shared programs: 1279699 -> 1279167 (-0.04%) instructions in affected programs: 134796 -> 134264 (-0.39%) total temps in shared programs: 213912 -> 213736 (-0.08%) temps in affected programs: 2166 -> 1990 (-8.13%) total consts in shared programs: 953237 -> 952973 (-0.03%) consts in affected programs: 45980 -> 45716 (-0.57%) Acked-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Dave Airlie	ccbf700d6c	nir: remove gl.h include from nir headers. This saves a lot of pointless gl.h includes across the board, it moves the one place that needs GLenum into a separate file only used in those passes that require it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Dave Airlie	1352e0ba0c	mesa/*: add a shader primitive type to get away from GL types. This creates an internal shader_prim enum, I've fixed up most users to use it instead of GL types. don't store the enum in shader_info as it changes size, and confuses other things. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Connor Abbott	9c9e8c3349	nir: Reorder ffma and fsub combining It's relatively common to do something like "a * b - c", which on most GPUs can be implemented in a single instruction. Before opt_algebraic_late this will be something like "fadd(fmul(a, b), fneg(c))", and we want to turn it info "ffma(a, b, fneg(c))". But because the fsub pattern was first we instead turned it into "fsub(fmul(a, b), c)". Fix this by reordering them. Selected shader-db results on freedreno: total instructions in shared programs: 1561330 -> 1551619 (-0.62%) instructions in affected programs: 780272 -> 770561 (-1.24%) helped: 1941 HURT: 491 helped stats (abs) min: 1 max: 147 x̄: 7.98 x̃: 4 helped stats (rel) min: 0.07% max: 30.77% x̄: 4.36% x̃: 3.17% HURT stats (abs) min: 1 max: 307 x̄: 11.76 x̃: 5 HURT stats (rel) min: 0.09% max: 18.71% x̄: 2.26% x̃: 1.38% 95% mean confidence interval for instructions value: -4.57 -3.41 95% mean confidence interval for instructions %-change: -3.21% -2.84% Instructions are helped. total nops in shared programs: 358926 -> 356263 (-0.74%) nops in affected programs: 167116 -> 164453 (-1.59%) helped: 1395 HURT: 859 helped stats (abs) min: 1 max: 108 x̄: 6.80 x̃: 3 helped stats (rel) min: 0.17% max: 100.00% x̄: 19.15% x̃: 10.57% HURT stats (abs) min: 1 max: 307 x̄: 7.95 x̃: 3 HURT stats (rel) min: 0.00% max: 381.82% x̄: 20.04% x̃: 10.00% 95% mean confidence interval for nops value: -1.77 -0.59 95% mean confidence interval for nops %-change: -5.55% -2.87% Nops are helped. total non-nops in shared programs: 1202404 -> 1195356 (-0.59%) non-nops in affected programs: 496682 -> 489634 (-1.42%) helped: 1951 HURT: 265 helped stats (abs) min: 1 max: 39 x̄: 4.02 x̃: 3 helped stats (rel) min: 0.07% max: 15.38% x̄: 2.97% x̃: 1.96% HURT stats (abs) min: 1 max: 22 x̄: 2.97 x̃: 2 HURT stats (rel) min: 0.05% max: 10.00% x̄: 1.14% x̃: 0.75% 95% mean confidence interval for non-nops value: -3.38 -2.99 95% mean confidence interval for non-nops %-change: -2.60% -2.36% Non-nops are helped. total systall in shared programs: 288317 -> 292975 (1.62%) systall in affected programs: 87876 -> 92534 (5.30%) helped: 388 HURT: 431 helped stats (abs) min: 1 max: 214 x̄: 14.39 x̃: 8 helped stats (rel) min: 0.25% max: 100.00% x̄: 22.12% x̃: 11.96% HURT stats (abs) min: 1 max: 232 x̄: 23.77 x̃: 7 HURT stats (rel) min: 0.00% max: 1300.00% x̄: 51.71% x̃: 17.30% 95% mean confidence interval for systall value: 3.07 8.30 95% mean confidence interval for systall %-change: 9.49% 23.97% Systall are HURT. (The systall hurt is probably just due to having having fewer instructions to hide latency with.) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14554>	2022-01-18 17:44:50 +00:00
Qiang Yu	2cee73f0f7	nir: fix nir_tex_instr hash not count is_sparse field This fixes nir_opt_cse miss replace a non-sparse tex instruction with a sparse tex instruction and fail the nir_validate_shader(). Fixes: `3a7972f72a` ("nir,spirv: add sparse texture fetches") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Rhys Perry	d95a0b52e4	nir/unsigned_upper_bound: don't follow 64-bit f2u32() Fixes Doom Eternal crash. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Fixes: `72ac3f6026` ("nir: add nir_unsigned_upper_bound and nir_addition_might_overflow") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14555>	2022-01-17 10:59:21 +00:00
Emma Anholt	f6ffefba3e	nir: Apply nir_opt_offsets to nir_intrinsic_load_uniform as well. Doing this for ir3 required adding a struct for limits of how much base to fold in (which NTT wants as well for its case of shared vars), otherwise the later work to lower to the 1<<9 word limit would emit more instructions. The shader-db results are that sometimes the reduction in NIR instruction count results in the fewer sampler prefetches due to the shader being estimated to be shorter (dota2, nexuiz): total instructions in shared programs: 8996651 -> 8996776 (<.01%) total cat5 in shared programs: 86561 -> 86577 (0.02%) Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Emma Anholt	b024102d7c	freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars. Saves some work in carchase and manhattan31: instructions in affected programs: 2842 -> 2818 (-0.84%) nops in affected programs: 1131 -> 1105 (-2.30%) non-nops in affected programs: 1236 -> 1238 (0.16%) mov in affected programs: 57 -> 61 (7.02%) dwords in affected programs: 2144 -> 2150 (0.28%) cat0 in affected programs: 1195 -> 1169 (-2.18%) cat1 in affected programs: 151 -> 155 (2.65%) cat2 in affected programs: 142 -> 140 (-1.41%) sstall in affected programs: 190 -> 178 (-6.32%) (ss) in affected programs: 63 -> 63 (0.00%) systall in affected programs: 532 -> 511 (-3.95%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Daniel Schürmann	79a987ad2a	nir/opt_if: also merge break statements with ones after the branch This optimizations turns loop { ... if (cond1) { if (cond2) { do_work_1(); break; } else { do_work_2(); } do_work_3(); break; } else { ... } } into: loop { ... if (cond1) { if (cond2) { do_work_1(); } else { do_work_2(); do_work_3(); } break; } else { ... } } As this optimizations moves code into the NIF statement, it re-iterates on the branch legs in case of success. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7587>	2022-01-13 02:30:32 +00:00
Daniel Schürmann	dad609d152	nir/opt_if: merge two break statements from both branch legs This optimization turns loop { ... if (cond) { do_work_1(); break; } else { do_work_2(); break; } } into: loop { ... if (cond) { do_work_1(); } else { do_work_2(); } break; } Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7587>	2022-01-13 02:30:32 +00:00
Daniel Schürmann	8a78706643	nir: refactor nir_opt_move This patch is a rewrite of nir_opt_move. Differently from the previous version, each instruction is checked if it can be moved downwards and then inserted before the first user of the definition. The advantage is that less insert operations are performed, the original order is kept if two movable instructions have the same first user, and instructions without user in the same block are moved towards the end. v2: Only return true if an instruction really changed the position. Don't care for discards, this will be handled by another MR. v3: fix self-referring phis and update according to nir_can_move_instr(). v4: use nir_can_move_instr() and nir_instr_ssa_def() v5: deduplicate some code Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3657>	2022-01-12 13:41:54 +00:00
Marcin Ślusarz	f286ecf906	nir: handle per-view clip/cull distances Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	0d6f83cbf1	nir: remove invalid assert affecting per-view variables per-view variables can have arbitrary (but > 0) number of array levels Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	4fed440724	nir: add load_mesh_view_count and load_mesh_view_indices intrinsics Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00

... 23 24 25 26 27 ...

4732 commits