fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 18:28:05 +02:00

Author	SHA1	Message	Date
Christian Gmeiner	c0fe111d64	nir: use intrinsic builders Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8295>	2021-01-06 14:34:41 +00:00
Mike Blumenkrantz	b5fb66a5ed	nir: preserve explicit_binding in lower_atomics_to_ssbo it's important to be able to tell whether this is explicitly set by the user Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7489>	2021-01-06 12:56:09 +00:00
Jesse Natalie	4d83306a9a	nir: Update saturated float->int/uint conversion algorithm The mantissa for a float doesn't contain enough data to accurately represent the min/max values for some destination types. Instead of clamping before converting, clamp after converting when coming from floats. This improves conformance of CL conversions, specifically for float -> long/ulong with int64 emulation enabled. Refactors the limit determination from the clamp, so we can determine limits for the dest type (int/uint) in both the source (float) and dest type. The limit as a float is used for comparison, while the limit as a dest type is used for bcsel. Important note is that the comparison is inverted to fge instead of flt, so the bcsel chooses the direct int/uint over the converted float in the case where the comparison comes up equal, but the conversion can't produce the exact min/max value. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8256>	2021-01-05 19:46:25 +00:00
Alexander von Gluck IV	c7486c996e	glsl/builtin_functions: Rename int64 function to int64_avail * int64 is a core type on Haiku (and potentially other platforms) * rename to int64_avail matching other similar calls Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2021-01-04 21:18:55 -06:00
Ian Romanick	539c25c2da	nir/algebraic: Move the flrp -> bcsel rule earlier If multiple rules could match, the rule that appears first in the file is used. Only Tiger Lake and Ice Lake are affected. Other platforms either have a LRP instruction or can't run any shaders from shader-db that would benefit. v2: Fix issues created when this commit was rebased on top of `3c8934a644` ("nir/algebraic: add flrp patterns for 16 and 64 bits"). Noticed by Caio. Tiger Lake and Ice Lake had similar results. total instructions in shared programs: 20908672 -> 20908661 (<.01%) instructions in affected programs: 419 -> 408 (-2.63%) helped: 5 HURT: 0 helped stats (abs) min: 1 max: 3 x̄: 2.20 x̃: 3 helped stats (rel) min: 1.85% max: 3.19% x̄: 2.49% x̃: 2.65% 95% mean confidence interval for instructions value: -3.56 -0.84 95% mean confidence interval for instructions %-change: -3.24% -1.73% Instructions are helped. total cycles in shared programs: 473513940 -> 473513793 (<.01%) cycles in affected programs: 7176 -> 7029 (-2.05%) helped: 12 HURT: 0 helped stats (abs) min: 5 max: 22 x̄: 12.25 x̃: 12 helped stats (rel) min: 0.84% max: 3.24% x̄: 2.09% x̃: 1.80% 95% mean confidence interval for cycles value: -15.43 -9.07 95% mean confidence interval for cycles %-change: -2.57% -1.61% Cycles are helped. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	ec16f935fe	nir/algebraic: Mark comparisons generated from lowered fsign precise This prevents other transformations from converting them to 'a != 0'. For example, both of these transformations can do this: (('~flt', 0.0, ('fabs', a)), ('fne', a, 0.0)), (('~flt', ('fneg', ('fabs', a)), 0.0), ('fne', a, 0.0)), Both fsign(fabs(NaN)) and fsign(fneg(fabs(NaN))) should produce zero, but, since 'NaN != 0.0' is true, cascading these transformations could cause them to generate 1.0 or -1.0 respecively. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	9771af5dde	nir/algebraic: Fix broken NaN and -0.0 behavior No shader-db or fossil-db changes on any Intel platform. v2: Add a coding line to fix SCons build problems caused by the ± character. Fixes: `25bfba3335` ("nir/algebraic: Recognize open-coded copysign(1.0, a)") Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	010e663cc3	spir-v: Mark floating point comparisons exact OpenGL GLSL, OpenGL ARB assembly shaders, and DX9 are pretty loose about the behavior in the presence of NaNs. Many GPUs that implement these specifications do not even have a representation of NaN. However, OpenCL and Vulkan SPIR-V are not so lax. Both actually have some required behavior in the presence of NaN, and, of the two, OpenCL is the most strict. For years we have implemented SPIR-V by using the same comparison opcodes as we use for OpenGL GLSL and OpenGL assembly shaders. This has repeatedly caused problems where an optimization that is valid in the NaN-relaxed world is not valid in Vulkan or OpenCL. To fix this, set the "exact" flag on comparisons instructions generated from SPIR-V. This will block optimizations that may have different NaN behavior. v2: Set the exact flag in the nir_builder, not in the vtn_builder. v3: Add an assertion in vtn_handle_constant that the exact flag wasn't set (because it's ignored). Rebase on `80163bbec3` ("nir/vtn: Support OpOrdered and OpUnordered opcodes"). Mark the NIR generated for those opcodes as exact as well. v4: s/unused_exact/exact/ in a couple places, and assert that exact has the expected value (true in one place, false in the other). Suggested by Caio. Closes: #3345 Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Tested-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Fixes: `8513b12590` ("nir/opt_if: split ALU from Phi more aggressively") This commit doesn't really fix anything in `8513b12590`. However, without `8513b12590`, a regression is triggered in RADV on No Man's Sky. I want to ensure that this change is only applied on top of `8513b12590`, and Fixes: seems the safest way to do that. No shader-db changes on any Intel platform. This only affects SPIR-V, and we have no OpenGL SPIR-V shaders in shader-db. 124 shaders in Shadow of the Tomb Raider (Steam "native") were hurt by 1 spill and 1 fill each. All Intel platforms had similar results. (Tiger Lake shown) Instructions in all programs: 155668276 -> 155685764 (+0.0%) SENDs in all programs: 6474570 -> 6474570 (+0.0%) Loops in all programs: 35271 -> 35271 (+0.0%) Cycles in all programs: 3198055373 -> 3198628031 (+0.0%) Spills in all programs: 231522 -> 231646 (+0.1%) Fills in all programs: 347571 -> 347695 (+0.0%) Vega Totals: SGPRs: 20955712 -> 20956756 (+0.00%); split: -0.02%, +0.03% VGPRs: 13476920 -> 13473132 (-0.03%); split: -0.07%, +0.04% CodeSize: 613371940 -> 613339348 (-0.01%); split: -0.06%, +0.05% MaxWaves: 3111886 -> 3112481 (+0.02%); split: +0.02%, -0.00% Instrs: 120723785 -> 120746991 (+0.02%); split: -0.04%, +0.06% Cycles: 626658992 -> 626862708 (+0.03%); split: -0.05%, +0.08% VMEM: 216330854 -> 216343196 (+0.01%); split: +0.04%, -0.04% SMEM: 32079391 -> 32081972 (+0.01%); split: +0.05%, -0.04% VClause: 2688784 -> 2688789 (+0.00%); split: -0.03%, +0.03% SClause: 6554669 -> `6556251` (+0.02%); split: -0.01%, +0.03% Copies: 5356667 -> 5353283 (-0.06%); split: -0.36%, +0.29% Branches: 954466 -> 954716 (+0.03%); split: -0.01%, +0.04% PreSGPRs: 9078300 -> 9081626 (+0.04%); split: -0.01%, +0.05% PreVGPRs: 10972090 -> 10966576 (-0.05%); split: -0.06%, +0.01% Totals from 48239 (12.08% of 399432) affected shaders: SGPRs: 2713984 -> 2715028 (+0.04%); split: -0.16%, +0.19% VGPRs: 1997804 -> 1994016 (-0.19%); split: -0.46%, +0.27% CodeSize: 172094092 -> 172061500 (-0.02%); split: -0.21%, +0.19% MaxWaves: 337327 -> 337922 (+0.18%); split: +0.20%, -0.02% Instrs: 33053657 -> 33076863 (+0.07%); split: -0.15%, +0.22% Cycles: 254961228 -> 255164944 (+0.08%); split: -0.12%, +0.20% VMEM: 15165226 -> 15177568 (+0.08%); split: +0.59%, -0.51% SMEM: 3304938 -> 3307519 (+0.08%); split: +0.49%, -0.41% VClause: 766225 -> 766230 (+0.00%); split: -0.12%, +0.12% SClause: 1332645 -> 1334227 (+0.12%); split: -0.04%, +0.16% Copies: 2040651 -> 2037267 (-0.17%); split: -0.94%, +0.77% Branches: 743668 -> 743918 (+0.03%); split: -0.01%, +0.05% PreSGPRs: 1697667 -> 1700993 (+0.20%); split: -0.07%, +0.27% PreVGPRs: 1718424 -> 1712910 (-0.32%); split: -0.39%, +0.07% Polaris Totals: SGPRs: 21349172 -> 21354376 (+0.02%); split: -0.02%, +0.04% VGPRs: 13690680 -> 13686920 (-0.03%); split: -0.07%, +0.04% CodeSize: 613745824 -> 613704988 (-0.01%); split: -0.06%, +0.05% MaxWaves: 2775012 -> 2775189 (+0.01%); split: +0.01%, -0.00% Instrs: 120735079 -> 120756209 (+0.02%); split: -0.04%, +0.06% Cycles: 627906100 -> 628076156 (+0.03%); split: -0.05%, +0.08% VMEM: 216623065 -> 216641838 (+0.01%); split: +0.04%, -0.04% SMEM: 32295618 -> 32299338 (+0.01%); split: +0.05%, -0.04% VClause: 2711025 -> 2711141 (+0.00%); split: -0.03%, +0.04% SClause: 6545185 -> 6546769 (+0.02%); split: -0.01%, +0.03% Copies: 5387723 -> 5383249 (-0.08%); split: -0.37%, +0.29% Branches: 953775 -> 953954 (+0.02%); split: -0.01%, +0.03% PreSGPRs: 9148814 -> 9153211 (+0.05%); split: -0.01%, +0.06% PreVGPRs: 11029429 -> 11023915 (-0.05%); split: -0.06%, +0.01% Totals from 48239 (12.00% of 402052) affected shaders: SGPRs: 2682056 -> 2687260 (+0.19%); split: -0.16%, +0.35% VGPRs: 1994436 -> 1990676 (-0.19%); split: -0.46%, +0.27% CodeSize: 170857060 -> 170816224 (-0.02%); split: -0.21%, +0.19% MaxWaves: 295429 -> 295606 (+0.06%); split: +0.07%, -0.01% Instrs: 32808802 -> 32829932 (+0.06%); split: -0.16%, +0.22% Cycles: 254633252 -> 254803308 (+0.07%); split: -0.13%, +0.20% VMEM: 14897934 -> 14916707 (+0.13%); split: +0.65%, -0.52% SMEM: 3289726 -> 3293446 (+0.11%); split: +0.53%, -0.42% VClause: 775318 -> 775434 (+0.01%); split: -0.11%, +0.13% SClause: 1304867 -> 1306451 (+0.12%); split: -0.04%, +0.16% Copies: 2026334 -> 2021860 (-0.22%); split: -0.99%, +0.77% Branches: 742554 -> 742733 (+0.02%); split: -0.02%, +0.04% PreSGPRs: 1690887 -> 1695284 (+0.26%); split: -0.07%, +0.33% PreVGPRs: 1717709 -> 1712195 (-0.32%); split: -0.40%, +0.07% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	55621c6d1c	nir/algebraic: Add some compare-with-zero optimizations that are exact This prevents some fossil-db regressions in "spir-v: Mark floating point comparisons exact". v2: Note that the patterns and replacements produce the same value when isnan(b). Suggested by Caio. v3: Use C99 isfinite() instead of (obsolete) BSD finite(). Fixes various Windows builds. No fossil-db changes on any Inetl platform, Vega, or Polaris10. All Intel platforms had similar results. (Tiger Lake shown) total instructions in shared programs: 20908670 -> 20908672 (<.01%) instructions in affected programs: 69 -> 71 (2.90%) helped: 0 HURT: 1 total cycles in shared programs: 473515288 -> 473513940 (<.01%) cycles in affected programs: 4942 -> 3594 (-27.28%) helped: 2 HURT: 0 Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	9167324a86	nir/algebraic: Mark some logic-joined comparison reductions as exact This also prevents some fossil-db regressions in "spir-v: Mark floating point comparisons exact". v2: Mark the fmin / fmax in the replacement exact to prevent other optimizations from ruining the NaN-clensing property of the fmin / fmax. Suggested by Rhys. Don't assume that constants are not NaN because some components of a vector might be NaN while others are numbers. Noticed by Rhys. This causes ~8 more shaders in Age of Wonders III (dxvk) to regress on cycles (not instructions) by less than 1% when "spir-v: Mark floating point comparisons exact" is applied. This difference is too small to care. All Intel platforms had similar results. (Tiger Lake shown) total instructions in shared programs: 20908668 -> 20908670 (<.01%) instructions in affected programs: 9196 -> 9198 (0.02%) helped: 10 HURT: 5 helped stats (abs) min: 1 max: 2 x̄: 1.40 x̃: 1 helped stats (rel) min: 0.02% max: 5.41% x̄: 2.20% x̃: 2.16% HURT stats (abs) min: 2 max: 6 x̄: 3.20 x̃: 3 HURT stats (rel) min: 2.44% max: 16.67% x̄: 9.39% x̃: 12.50% 95% mean confidence interval for instructions value: -1.22 1.49 95% mean confidence interval for instructions %-change: -2.08% 5.41% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 473515330 -> 473515288 (<.01%) cycles in affected programs: 67146 -> 67104 (-0.06%) helped: 10 HURT: 7 helped stats (abs) min: 1 max: 36 x̄: 15.90 x̃: 17 helped stats (rel) min: 0.01% max: 1.29% x̄: 0.66% x̃: 0.89% HURT stats (abs) min: 1 max: 48 x̄: 16.71 x̃: 4 HURT stats (rel) min: 0.08% max: 1.94% x̄: 0.87% x̃: 0.19% 95% mean confidence interval for cycles value: -13.88 8.94 95% mean confidence interval for cycles %-change: -0.56% 0.49% Inconclusive result (value mean confidence interval includes 0). Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	71961c73a9	nir: Correctly constant fold fsign(NaN) and fsign(-0) GLSL and SPIR-V GLSL.std.450 don't have any requirements for fsign(NaN), and both only require that FSign(-0.0) == 0.0. OpenCL, on the other hand, requires sign(-0.0) be exactly -0.0. It also requires that sign(NaN) be exactly 0.0. In practice, this change is difficult to test. Our GLSL frontend already constant folds sign(NaN) to 0.0 before even getting to NIR. As far as I can tell, glslang does the same. I don't have a good way to run an OpenCL SPIR-V test. Maybe SPIR-V GLSL.std.450 assembly? No shader-db or fossil-db changes on any Intel platform. Acked-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	fe3c518277	nir/algebraic: Don't add reordered version of patterns for commutative instructions The reordered are automatically considered by nir_algebraic rules for commutative instructions. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	314a40c902	Revert "nir: Replace an odd comparison involving fmin of -b2f" I originally noticed that `3b30814791` ("nir/algebraic: Optimize 1-bit Booleans") caused this pattern no longer be matched by incorrectly replacing b@32 with b@1. Making that correct had no effect on shader-db. When this pattern originally was added, it only affected 4 shaders, so it's not worth the effort to debug further. This reverts commit `f50400cc80`. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	aec0547838	nir/algebraic: Make some notes about comparison rearrangements versus infinity The original comment was a little terse and a little incorrect. The rearrangements are fine w.r.t. NaN. However, they produce incorrect results if one operand is +Inf and the other is -Inf. A later commit, "nir/algebraic: Add some compare-with-zero optimizations that are exact", will add some more patterns here. It may be reasonable to squash this commit (forward) into that commit. v2: Fix some incorrect comparisons operators in the comment (<= vs >=). Add commentary that subtraction works like addition w.r.t. NaN. Both noticed / suggested by Caio. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	363efc2823	nir: Make some notes about fsign versus NaN This commit only documents the current behavior, even if that behavior is not the behavior preferred by the relevant specs. In SPIR-V, there are two flavors of the sign instruction, and each lives in an extended instruction set. The GLSL.std.450 FSign instruction is defined as: Result is 1.0 if x > 0, 0.0 if x = 0, or -1.0 if x < 0. This also matches the GLSL 4.60 definition. However, the OpenCL.ExtendedInstructionSet.100 sign instruction is defined as: Returns 1.0 if x > 0, -0.0 if x = -0.0, +0.0 if x = +0.0, or -1.0 if x < 0. Returns 0.0 if x is a NaN. There are two differences. Each treats -0.0 differently, and each also treats NaN differently. Specifically, GLSL.std.450 FSign does not define any specific behavior for NaN. There has been some discussion in Khronos about the NaN behavior of GLSL.std.450 FSign. As part of that discussion, I did some research into how we treat NaN for nir_op_fsign, and this commit just captures some of those notes. v2: Document the expected behavior of nir_op_fsign more thoroughly. Suggested by Rhys. Note that the current implementation of constant folding does not produce the expected result for NaN. Suggested by Caio. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> [v1] Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Danylo Piliaiev	81132983cd	nir: fix missing nir_lower_pntc_ytransform.c in the makefile Fixes: `33fd9e5d` "nir: account for point-coord origin when lowering it" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8308>	2021-01-04 15:37:20 +00:00
Danylo Piliaiev	33fd9e5d8a	nir: account for point-coord origin when lowering it The resulting point-coord origin not only depends on whether the draw buffer is flipped but also on GL_POINT_SPRITE_COORD_ORIGIN state. Which makes its transform differ from a transform of wpos. On freedreno fixes: gl-3.2-pointsprite-origin gl-3.2-pointsprite-origin -fbo Fixes: `d934d320` "nir: Add flipping of gl_PointCoord.y in nir_lower_wpos_ytransform." Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8200>	2021-01-04 13:41:33 +00:00
Icecream95	1840404783	nir: Handle load_kernel_input in nir_get_io_offset_src Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8264>	2021-01-01 02:58:49 +00:00
Daniel Schürmann	a3785e3481	nir/opt_vectorize: hash whether a swizzle accesses elements beyond the maximum vectorization factor Swizzles that access components outside of the maximum vector size cannot be vectorized with each other. This patch creates different hash bins for this case. For example accesses to .x and .y are considered different variables compared to accesses to .z and .w for 16-bit vec2. This prevents the vectorization of things like vec2 16 ssa_3 = iadd ssa_1.xz, ssa_2.xz Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Daniel Schürmann	46e7428031	nir/opt_vectorize: rehash users of vectorized instructions This ensures that chains of ALU instructions are vectorized in a single iteration. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Daniel Schürmann	8eaf9c61d1	nir/opt_vectorize: don't hash filtered instructions This patch also changes nir_opt_vectorize_cb to use only one instruction as parameter. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Daniel Schürmann	23b2885514	nir/opt_vectorize: don't hash instructions which are already vectorized This guarantees that the hashset contains exactly the instructions which can be vectorized. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Daniel Schürmann	ad37e4df73	nir/opt_vectorize: use a single instruction per hash entry instead of a vector This drastically simplifies vectorization but may potentially lead to slightly worse vectorizations. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Samuel Pitoiset	0b503d8de9	nir: fix determining if an addition might overflow for phi sources nir_addition_might_overflow() expects the parent instruction to be an alu instr but it might be a phi instr. Fix it by assuming that the addition might overflow. This fixes compiler crashes with Horizon Zero Dawn. No fossils-db changes. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8268>	2020-12-31 16:17:08 +00:00
Timothy Arceri	6c8cc9be12	glsl: default to compat shaders in compat profile If the shader does not specify "core" or "compatibility" in shaders above 1.40 we were defaulting these shaders to core shaders when in a compat profile. Instead default to compat shaders. This brings us inline with the behaviour of the binary drivers and fixes a crash on start-up for the game Foundation. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3594 Fixes: `c7e3d31b0b` ("glsl: fix compat shaders in GLSL 1.40") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6993>	2020-12-30 11:47:49 +11:00
Danylo Piliaiev	9fe5859d5d	nir/lower_fragcolor: handle dual source blending With dual source blending only 1 draw buffer could be used. For zink fixes tests: arb_blend_func_extended-fbo-extended-blend-pattern_gles2 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8247>	2020-12-29 19:19:16 +00:00
Jesse Natalie	3f77901342	nir: Add an algebraic optimization for float->double->float As part of this series, it removes the need for float->double conversion, just to be able to print a single float. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8254>	2020-12-29 09:07:24 +10:00
Jesse Natalie	24669a672f	nir: Add a printf lowering pass (v5) This pass creates a SSBO var for the printf buffer. It does an atomic increment at the beginning of the buffer to determine where to write, then dumps the args after that. v2: [airlied] Enhanced to use an index into a set of format info that is passed back to the caller. The format info contains the number of args, argument sizes and the format string. v3: move format string lowering to vtn v4: Jason reworked it. v5: assume buffer has initial offset prebaked in and work from there. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8254>	2020-12-29 09:07:24 +10:00
Jesse Natalie	9524e9dbd0	nir/vtn: Implement printf opcode in terms of intrinsic (v9) [airlied: rebase fixup types] v2: add support for storing strings in a sideband storage, just store the index in print buffer. v3: move the format strings into the nir shader as well v4: simplify the write constant string + explicit sizes move printf cap definition. v5: just parse the format string to find string specifiers using util code. add vtn_fail_if if we can't get the correct type. v6: use ralloc + avoid instr handler for srcs > 5 v7: use a packed struct 4 bytes align all of it v8: simplify constant copy v9: rework to use a single string and common string extract code, (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8254>	2020-12-29 09:07:23 +10:00
Jesse Natalie	2a3fe68c33	nir: Add intrinsic and string ptrs This just adds the basic nir support for printf, intrinsic, and support for storing the printf info. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8254>	2020-12-29 09:07:23 +10:00
Vinson Lee	671e1571d9	nir/tests: Initialize nir_serialize_test member dup. Fix defect reported by Coverity Scan. Uninitialized pointer field (UNINIT_CTOR) uninit_member: Non-static class member dup is not initialized in this constructor nor in any functions that it calls. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8168>	2020-12-24 21:15:38 +00:00
Caio Marcelo de Oliveira Filho	f41ae4d592	spirv2nir: Add --opengl (-g) argument for OpenGL SPIR-V Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8156>	2020-12-22 09:34:00 -08:00
cheyang	ffe87c2689	glsl: remove unused state variable Because compile_shader function state variable not determine whether the compilation is successful. Signed-off-by: cheyang <cheyang@bytedance.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8178>	2020-12-21 14:49:53 +00:00
Caio Marcelo de Oliveira Filho	c4c9c780b1	spirv: Remove more dead variables SPIR-V modules can have multiple shaders (including of the same stage), but the global variables are all declared for the whole module. This can result in variables with same Binding but incompatible types, so those need to be removed before we use. Previously, a similar issue but with a narrower scope was fixed by `6775665e5e` ("spirv: Eliminate dead input/output variables after translation."). This patch depends on the previous patch that prevents variables used only in pointer initializers to be considered dead. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3686 Fixes: `3a266a18` ("nir/spirv: Add support for declaring variables") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8133>	2020-12-18 17:41:02 +00:00
Caio Marcelo de Oliveira Filho	acce4ce04e	nir: Consider pointer initializers in nir_remove_dead_variables Between the creation of a shader (from GLSL or SPIRV frontends) and nir_lower_variable_initializers is called, variables may refer to other variables for initialization. Those referred variables need to be kept alive, so consider that in the pass. Fixes: `7acc81056f` ("compiler/nir: Add support for variable initialization from a pointer") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8133>	2020-12-18 17:41:02 +00:00
Caio Marcelo de Oliveira Filho	577bfba959	nir: Remove unused parameter in remove_dead_var_writes And destroy the live set earlier in nir_remove_dead_variables. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8133>	2020-12-18 17:41:02 +00:00
Caio Marcelo de Oliveira Filho	617d8d3788	nir: Fix outdated name in comment Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8133>	2020-12-18 17:41:02 +00:00
Caio Marcelo de Oliveira Filho	f65750d221	spirv: Implement OpArrayLength for OpenGL Uses same NIR intrinsic as glsl_to_nir. Make it an option so it is easy later to move Vulkan drivers incrementally to use it. Fixes piglit test spec/arb_gl_spirv/execution/ssbo/unsized-array-length. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3691 Fixes: `15e43907` ("iris: Enable ARB_gl_spirv and ARB_spirv_extensions") Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8136>	2020-12-18 17:13:46 +00:00
Marek Olšák	884e84035e	mesa: remove unused LastUniformIndex This is now dead code. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8046>	2020-12-18 04:53:44 +00:00
Daniel Schürmann	8513b12590	nir/opt_if: split ALU from Phi more aggressively If the only user is a trivial bcsel which in a second step can be turned into a phi, this conversion is also worth it even if the previous result is not undefined or constant. Allows for some more loop unrolling or saves a few instructions. Totals from 62 (0.04% of 139391) affected shaders (NAVI10): SGPRs: 4976 -> 4992 (+0.32%) VGPRs: 4408 -> 4472 (+1.45%); split: -0.45%, +1.91% CodeSize: 453632 -> 464000 (+2.29%); split: -0.32%, +2.60% MaxWaves: 527 -> 511 (-3.04%); split: +0.38%, -3.42% Instrs: 84940 -> 86681 (+2.05%); split: -0.36%, +2.41% Cycles: 11946844 -> 11783708 (-1.37%); split: -1.40%, +0.04% VMEM: 9403 -> 10357 (+10.15%); split: +11.59%, -1.45% SMEM: 3003 -> 3025 (+0.73%); split: +1.07%, -0.33% VClause: 1756 -> 1997 (+13.72%); split: -0.11%, +13.84% SClause: 2914 -> 2915 (+0.03%); split: -0.10%, +0.14% Copies: 6426 -> 6768 (+5.32%); split: -4.14%, +9.46% Branches: 2105 -> 2102 (-0.14%); split: -1.66%, +1.52% PreSGPRs: 2921 -> 2909 (-0.41%); split: -0.55%, +0.14% PreVGPRs: 4151 -> 4179 (+0.67%); split: -0.24%, +0.92% cc: mesa-stable Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8123>	2020-12-18 01:29:10 +00:00
Mike Blumenkrantz	4d47b22bf0	glsl/float64: make this compatible with glsl 330 just a minor tweak Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7483>	2020-12-17 15:47:30 +00:00
Vinson Lee	193888fd75	glsl: Initialize glsl_type member name. Fix defect reported by Coverity Scan. Uninitialized pointer field (UNINIT_CTOR) uninit_member: Non-static class member name is not initialized in this constructor nor in any functions that it calls. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7766>	2020-12-16 19:31:25 -08:00
Vinson Lee	836b9e1d88	glsl: Fix typos in comments. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7767>	2020-12-16 02:19:56 +00:00
Eric Anholt	0d23f4e56f	glsl/standalone: Fix memory leaks Needed for meson test with asan enabled. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7936>	2020-12-15 19:39:29 +00:00
Eric Anholt	39dbf47bfe	glsl/uniform_initializer_tests: Fix memory leak Needed for meson test with asan enabled. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7936>	2020-12-15 19:39:29 +00:00
Eric Anholt	49c22c2155	glsl/general_ir_test: Fix leaks. Needed for meson test with asan enabled. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7936>	2020-12-15 19:39:29 +00:00
Eric Anholt	1d026e2275	disk_cache: Fix memory leaks in the unit test. Needed for meson test with asan enabled. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7936>	2020-12-15 19:39:29 +00:00
Eric Anholt	d3d28f6c2d	nir: Redefine start/end_ip of blocks to fix NIR-to-TGSI liveness bugs. With the block's end_ip accidentally being the ip of the next instruction, contrary to the comment, you would end up doing end-of-block freeing early and have the value missing when it came time to emit the next instruction. Just expand the ips to have separate ones for start and end of block -- while it means that nir_instr->index is no longer incremented by 1 per instruction, it makes sense for use in liveness because a backend is likely to need to do other things at block boundaries (like emit the if statement's code), and having an ip to identify that stuff is useful. Fixes: `a206b58157` ("nir: Add a block start/end ip to live instr index metadata.") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7658>	2020-12-11 23:02:12 +00:00
Christian Gmeiner	533f6debb1	nir: change return type to void init_liveness_block(..) only gets called without using the always true return value. Switch from bool to void. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8051>	2020-12-11 16:45:57 +00:00
Adam Jackson	c0e7a2cd02	mesa: Remove silly "dummy_false" extension support This removes the memory of some old OES extensions from the table, which seems to be harmless. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8037>	2020-12-11 16:10:48 +00:00

1 2 3 4 5 ...

5815 commits