fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 13:38:19 +02:00

Author	SHA1	Message	Date
Timothy Arceri	6eec8fcbfa	glsl/nir: free GLSL IR right after we convert to NIR Gives us memory back faster which is useful for pathological CTS tests. The GLSL IR was previously used after converting to NIR for things like building the GL resource list but we have had a NIR version for this for some time and I don't believe there are any other use cases left for keeping the old IR hanging around this long. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15127>	2022-02-24 01:10:49 +00:00
Erik Faye-Lund	2cd0779f83	nir/spirv: guard macros in case of redefinition On some systems, these macros are already defined, and being defined slightly differently causes them to emit redefinition-warnings. Let's wrap them in ifdefs to avoid the warnings. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15084>	2022-02-21 19:47:17 +00:00
Rhys Perry	a9ac270c5f	nir/validate: don't add instrs not present in shader to shader_gc_list This makes the set smaller and GC list validation faster. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13547>	2022-02-21 11:57:22 +00:00
Rhys Perry	925c5f817d	nir/validate: don't validate the GC list by default This seems really slow. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13547>	2022-02-21 11:57:18 +00:00
Connor Abbott	7e8d885919	spirv: Rewrite determinant calculation The old calculation for mat3 was clever, but it turns out that a straightforward application of subdeterminants similar to how mat4 is handled is more efficient: on a scalar architecture with some sort of combined multiply+add instruction with a negate modifier (both fairly common), the new determinant is 9 instructions vs. 15 for the old one, and without the multiply-add it's 14 instructions vs. 18 for the old one. When used as a routine for inverse() the savings are compounded, because we now use the same method as used to compute the adjugate matrix and so CSE can combine most of the calculations with the adjugate matrix ones. Once mat3 and mat4 use the same method for computing determinants, we can combine them into a single recursive function. I also pulled up the mat_subdet() function because it was doing basically what we need, so it's now shared between determinant and inverse. This shrinks the implementation significantly, as can be seen from the diffstat. The real reason I want to change this, though, is that it fixes dEQP-VK.glsl.builtin.precision_fp16_storage16b.inverse.compute.mat3 with turnip. Qualcomm uses round-to-zero for 16-bit frcp, which combined with some inaccuracy in the old method of calculating the determinant led us to fail. Qualcomm's driver uses something like the new method to calculate the determinant in the inverse. We could argue that Mesa's method should be allowed, because round-to-zero for floating-point division is within spec and there are no precision guarantees given for determinant() or inverse(). However we might as well use the more efficient method. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14652>	2022-02-19 02:03:25 +00:00
Connor Abbott	6761550357	nir/serialize: Don't access blob->data directly It won't work if the blob is fixed-size and we overrun the size, which will be the case with the Vulkan pipeline cache. This gets a bit tricky for the repeated-header optimization, because we can't read the header from the blob. Instead we have to store the header itself. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15028>	2022-02-19 01:25:46 +00:00
Alyssa Rosenzweig	2d6233d04f	nir: Check all sizes in nir_alu_instr_is_comparison nir_alu_instr_is_comparison needs to consider all comparison opcodes regardless of size. Otherwise, they will be missed by nir_opt_move/sink. Without this change, lowering booleans to integers regresses register pressure (and spills/fills) significantly in certain shaders on Panfrost, like android/com.miHoYo.GenshinImpact/1420.shader_test. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15073>	2022-02-18 19:22:01 +00:00
Jose Maria Casanova Crespo	90f966e05f	v3dv/v3d: Fix copyright holder to Raspberry Pi Ltd Acked-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15057>	2022-02-18 11:50:07 +01:00
Alyssa Rosenzweig	7ec1d96e5e	nir: Set internal=true in nir_builder_init_simple_shader Matches the expected use by callers. We do need to fix up a few callers which use this call for external shaders. v2: Fix up a radv call site (Rhys). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> [v1] Acked-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14936>	2022-02-17 23:30:46 +00:00
Ian Romanick	a01b262990	nir: Add missing dependency on nir_opcodes.py Commit `38800b38` changed nir_opcodes.py, but that doesn't seem to have triggered nir_opt_algebraic.py. The change in `75ef5991` depends on opt_algebraic lowering 16-bit versions of slt, but if opt_algebraic is not rebuilt, this may not happen. This resulted in some people seeing assertion failures in, for example, dEQP-VK.spirv_assembly.instruction.compute.float16.arithmetic_3.step, due to the backend seeing nir_op_slt that it didn't know how to handle. v2: Add nir_opcodes.py to nir_algebraic_py so that all the per-driver algebraic passes pick up the dependency too. Rename it to nir_algebraic_depends. Suggested by Emma. Closes: #6047 Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15050>	2022-02-17 22:57:33 +00:00
Lionel Landwerlin	768930a73a	nir: fix lower_memcpy memcpy is divided into chunks that are vec4 sized max. The problem here happens with a structure of 24 bytes: struct { float3 a; float3 b; } If you memcpy that struct, the lowering will emit 2 load/store pairs, one of size 8 and the next of size 16. But both end up located at offset 0, so we effectively drop 2 floats. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `a3177cca99` ("nir: Add a lowering pass to lower memcpy") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15049>	2022-02-17 15:12:45 +00:00
Emma Anholt	3f4bfecee6	nir: Add some notes about const/uniform array access rules in GL. I was doing some RE on freedreno and we had some questions about when the hardware might need non-uniform or non-constant array access for various descriptor types, so let's leave some notes for the next person. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13621>	2022-02-16 20:06:21 +00:00
Samuel Pitoiset	74b932f8d3	nir: add nir_intrinsic_load_vrs_rates_amd This intrinsic specific to RADV will be used to load VRS rates from a user SGPR when RADV_FORCE_VRS is enabled by the application. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:12 +01:00
Timur Kristóf	e6cfd1ed64	spirv: Create PRIMITIVE_INDICES for NV_mesh_shader on-demand. The shader can have SpvOpWritePackedPrimitiveIndices4x8NV while the output variable may not exist. This seems to be a defect in the NV_mesh_shader SPIR-V spec, let's work around it by creating the variable on-demand. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15005>	2022-02-14 11:13:45 +01:00
Timur Kristóf	0445802ab2	compiler: Extract num_mesh_vertices_per_primitive function. Prevent code duplication. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15005>	2022-02-14 11:13:42 +01:00
Ilia Mirkin	b91b036322	isaspec: add gen-based leaf bitset separation This is necessary for some ops which have slightly different encoding on a4xx/a5xx, but are otherwise identical. This helps keep the compiler from having to worry about these details and creating separate ops. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:07 -05:00
Ilia Mirkin	40468430a4	isaspec: fix gen_max to be 2^32-1 The minus sign has higher precedence than shift: >>> 1 << 32 - 1 2147483648 >>> (1 << 32) - 1 4294967295 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:45:57 -05:00
Ian Romanick	59889eb3ae	Re-indentation after the previous commit Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:34 +00:00
Ian Romanick	912299cb39	glsl: Eliminate ir_assignment::condition Reformatting is left for the next commit. v2: Remove assignments from the constructors. :face_palm: Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	fb630cd783	glsl: Make ir_assignment::condition private And add get_condition(). This proves that nothing remains that could possibly set ::condition to anything other than NULL. v2: Fix bad rebase. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	5208c116f2	glsl: Don't visit rvalues in the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	1c22f06970	glsl: Don't lower vector indexing in the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	2652b9a83d	glsl: Don't split structures in the condition of an assignment At this point, this should always be NULL. v2: Fix bad rebase. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	d07551ad4e	glsl: Don't split arrays in the condition of an assignment At this point, this should always be NULL. v2: Fix bad rebase. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	41f6b42b08	glsl: Don't tree graft in the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	97ffca80a8	glsl: Don't dead-built-in varying eliminate in the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	231459ad26	glsl: Remove unused condition parameter from ir_assignment constructor Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	ec5c9649ba	glsl: Don't constant-fold the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	697a460e49	glsl: Don't clone assignment conditions At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	d27e3c6adc	glsl: Eliminate unused conditional assignment constructor Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	50a0d0d875	glsl: Remove the ability to read text IR with conditional assignments Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	710adf2e60	glsl: Add ir_assignment constructor that takes just a write mask The other constructor that takes a write mask and a condition will be removed shortly. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	afee5dc63f	glsl: Lower if to conditional select instead of conditional assignment Platforms that don't have flow control also don't have anything that could be written that has a side effect. It should be safe to implement these condition writes as foo = csel(condition, bar, foo); This should eliminate the last thing in the GLSL compiler that can create new conditions on assignments. Everything else that can store something in ir_assignment::condition derives it from a pre-existing condition. v2: Fix bad rebase. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	c3626e1580	glsl/ir_builder: Eliminate unused conditional assignment builders Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	46a9df6aac	glsl: Don't try to emit the "linear sequence" in lower_variable_index_to_cond_assign When there are four or fewer elements left in the array partition, the strategy changes from a binary search of nested flow control to a sequence of conditional assignments like (assign, dest, src[constant_i+0], index == constant_i+0) (assign, dest, src[constant_i+1], index == constant_i+1) (assign, dest, src[constant_i+2], index == constant_i+2) (assign, dest, src[constant_i+3], index == constant_i+3) or (assign, dest[constant_i+0], src, index == constant_i+0) (assign, dest[constant_i+1], src, index == constant_i+1) (assign, dest[constant_i+2], src, index == constant_i+2) (assign, dest[constant_i+3], src, index == constant_i+3) Realistically, the first case should use ir_triop_csel instead. The second case will either get turned back into flow control like if (index == constant_i+0) (assign, dest[constant_i+0], src) if (index == constant_i+1) (assign, dest[constant_i+1], src) if (index == constant_i+2) (assign, dest[constant_i+2], src) if (index == constant_i+3) (assign, dest[constant_i+3], src) or a sequence of conditional selects like (assign, dest[constant_i+0], (csel, index == constant_i+0, src, dest[constant_i+0])) (assign, dest[constant_i+1], (csel, index == constant_i+1, src, dest[constant_i+1])) (assign, dest[constant_i+2], (csel, index == constant_i+2, src, dest[constant_i+2])) (assign, dest[constant_i+3], (csel, index == constant_i+3, src, dest[constant_i+3])) The former case should continue to use the binary search. The latter case could be generated from the binary search by other lowering passes. At the end of the day, conditional assignments don't really help anything here, so stop using them.
Radeon R430 total instructions in shared programs: 2398683 -> 2398419 (-0.01%) instructions in affected programs: 5143 -> 4879 (-5.13%) helped: 9 HURT: 8 total vinst in shared programs: 616292 -> 616010 (-0.05%) vinst in affected programs: 4467 -> 4185 (-6.31%) helped: 9 HURT: 8 total sinst in shared programs: 315417 -> 315667 (0.08%) sinst in affected programs: 2568 -> 2818 (9.74%) helped: 2 HURT: 15 total flowcontrol in shared programs: 1049 -> 1048 (-0.10%) flowcontrol in affected programs: 7 -> 6 (-14.29%) helped: 1 HURT: 0 total presub in shared programs: 47027 -> 47027 (0.00%) presub in affected programs: 127 -> 127 (0.00%) helped: 1 HURT: 1 total omod in shared programs: 3618 -> 3615 (-0.08%) omod in affected programs: 8 -> 5 (-37.50%) helped: 3 HURT: 0 total temps in shared programs: 450757 -> 451312 (0.12%) temps in affected programs: 837 -> 1392 (66.31%) helped: 8 HURT: 6 total consts in shared programs: 1031928 -> 1031920 (<.01%) consts in affected programs: 1211 -> 1203 (-0.66%) helped: 6 HURT: 7 The shaders that were hurt for temps... are all lies. None of those shaders should have compiled as all 6 had more than 32 temps to begin with. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	05703a49f9	glsl: Use csel in do_vec_index_to_cond_assign This matches what NIR does. See nir_vector_extract. This improves code generation for several reasons. First, it only requires 3 comparisons instead of 4 (vec3(i > 0, i > 1, i > 2) vs vec4(i == 0, i == 1, i == 2, i == 3)). Secondly, it shortens the liverange of some values, possibly quite dramatically. Consider a loop in the old version (after lowering if-statements to selects): loop { ... x = csel(i == 0, a[0], x); x = csel(i == 1, a[1], x); x = csel(i == 2, a[2], x); x = csel(i == 3, a[3], x); ... } x is live for the whole loop across iterations. In the new version, x is only live while its value is needed: loop { ... t0 = csel(i > 0 , a[1], a[0]); t1 = csel(i > 2 , a[3], a[2]); x = csel(i > 1, t1, t0); ... } Outside a loop, this also means more values of the array may have their liveness reduced sooner (by consuming two values at once). All Intel platforms had similar results. (Tigerlake shown) total instructions in shared programs: 21171336 -> 21163615 (-0.04%) instructions in affected programs: 89680 -> 81959 (-8.61%) helped: 40 HURT: 4 helped stats (abs) min: 1 max: 450 x̄: 193.68 x̃: 196 helped stats (rel) min: 0.41% max: 13.32% x̄: 6.01% x̃: 6.22% HURT stats (abs) min: 1 max: 12 x̄: 6.50 x̃: 6 HURT stats (rel) min: 0.50% max: 0.66% x̄: 0.58% x̃: 0.58% 95% mean confidence interval for instructions value: -229.68 -121.28 95% mean confidence interval for instructions %-change: -6.93% -3.89% Instructions are helped. 
total cycles in shared programs: 832879641 -> 829513122 (-0.40%) cycles in affected programs: 44738430 -> 41371911 (-7.52%) helped: 35 HURT: 2 helped stats (abs) min: 2 max: 189948 x̄: 96186.49 x̃: 116154 helped stats (rel) min: 0.37% max: 11.08% x̄: 5.88% x̃: 6.47% HURT stats (abs) min: 4 max: 4 x̄: 4.00 x̃: 4 HURT stats (rel) min: 0.69% max: 0.69% x̄: 0.69% x̃: 0.69% 95% mean confidence interval for cycles value: -112881.94 -69092.06 95% mean confidence interval for cycles %-change: -6.77% -4.27% Cycles are helped. total spills in shared programs: 8061 -> 7338 (-8.97%) spills in affected programs: 873 -> 150 (-82.82%) helped: 24 HURT: 0 total fills in shared programs: 7501 -> 6388 (-14.84%) fills in affected programs: 1389 -> 276 (-80.13%) helped: 24 HURT: 0 Radeon R430 total instructions in shared programs: 2449852 -> 2449136 (-0.03%) instructions in affected programs: 6285 -> 5569 (-11.39%) helped: 64 HURT: 0 helped stats (abs) min: 4 max: 12 x̄: 11.19 x̃: 12 helped stats (rel) min: 8.16% max: 21.62% x̄: 12.09% x̃: 10.91% total consts in shared programs: 1032517 -> 1032482 (<.01%) consts in affected programs: 966 -> 931 (-3.62%) helped: 35 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 2.94% max: 10.00% x̄: 4.26% x̃: 3.57% Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	ed66d7c385	glsl/lower_vector_derefs: Don't emit conditional assignments Use if-statements instead. Any hardware that supports this sort of tessellation has flow control, so it will probably emit the conditional assignment using an if-statement anyway. This is definitely what st_glsl_to_nir does. v2: Fix copy-and-paste bug in the ir_type_swizzle handling. This bug caused segfaults in tests/spec/arb_tessellation_shader/execution/variable-indexing/tcs-patch-vec4-swiz-index-wr.shader_test. Reviewed-by: Matt Turner <mattst88@gmail.com> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Daniel Schürmann	2a92452a0e	nir/opt_shrink_vectors: Remove shrinking of store intrinsics data source This is done via nir_opt_shrink_stores. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:50:47 +01:00
Daniel Schürmann	2dba7e6056	nir: split nir_opt_shrink_stores from nir_opt_shrink_vectors This patch moves the shrinking of store sources into a separate pass. The reasoning behind this is that this pass usually only needs to be called once while nir_shrink_vectors might better be called several times. This allows moving the pass(es) out of the optimization loops. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:34:01 +01:00
Ian Romanick	1cb3d1a6ae	nir: Produce correct results for atan with NaN Properly handling NaN adversely affects several hundred shaders in shader-db (lots of Skia and a few others from various synthetic benchmarks) and fossil-db (mostly Talos and some Doom 2016). Only apply the NaN handling work-around when the shader demands it. v2: Add comment explaining the 1.0*y_over_x. Suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `2098ae16c8` ("nir/builder: Move nir_atan and nir_atan2 from SPIR-V translator") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	7d0d9b9fbc	nir: Properly handle various exceptional values in frexp frexp_sig of ±0, ±Inf, or NaN should just return the input unmodified. frexp_exp of ±Inf or NaN is undefined, and frexp_exp of ±0 should return the input unmodified. This seems to already work. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `23d30f4099` ("spirv,nir: lower frexp_exp/frexp_sig inside a new NIR pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	93ed87af28	spirv: Produce correct result for GLSLstd450Tanh with NaN No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `9f9432d56c` ("Revert "spirv: Use a simpler and more correct implementaiton of tanh()"") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	e442b9d792	spirv: Produce correct result for GLSLstd450Modf with Inf GLSLstd450ModfStruct too. No shader-db or fossil-db changes on any Intel platform. v2: Fix handling 16-bit (and presumably 64-bit) values. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `f92a35d831` ("vtn: Fix Modf.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	75ef5991f5	spriv: Produce correct result for GLSLstd450Step with NaN NOTE: This commit needs "nir: All set-on-comparison opcodes can take all float types" or regressions will occur in other Vulkan SPIR-V tests. No shader-db changes on any Intel platform. NOTE: This commit depends on "nir: All set-on-comparison opcodes can take all float types". v2: Fix handling 16-bit (and presumably 64-bit) values. About 280 shaders in Talos are hurt by a few instructions, and a couple shaders in Doom 2016 are hurt by a few instructions. Tiger Lake Instructions in all programs: 159893290 -> 159895026 (+0.0%) SENDs in all programs: 6936431 -> 6936431 (+0.0%) Loops in all programs: 38385 -> 38385 (+0.0%) Cycles in all programs: 7019260087 -> 7019254134 (-0.0%) Spills in all programs: 101389 -> 101389 (+0.0%) Fills in all programs: 131532 -> 131532 (+0.0%) Ice Lake Instructions in all programs: 143624235 -> 143625691 (+0.0%) SENDs in all programs: 6980289 -> 6980289 (+0.0%) Loops in all programs: 38383 -> 38383 (+0.0%) Cycles in all programs: 8440083238 -> 8440090702 (+0.0%) Spills in all programs: 102246 -> 102246 (+0.0%) Fills in all programs: 131908 -> 131908 (+0.0%) Skylake Instructions in all programs: 134185495 -> 134186618 (+0.0%) SENDs in all programs: 6938790 -> 6938790 (+0.0%) Loops in all programs: 38356 -> 38356 (+0.0%) Cycles in all programs: 8222366923 -> 8222365826 (-0.0%) Spills in all programs: 98821 -> 98821 (+0.0%) Fills in all programs: 125218 -> 125218 (+0.0%) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `1feeee9cf4` ("nir/spirv: Add initial support for GLSL 4.50 builtins") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	38800b385c	nir: All set-on-comparison opcodes can take all float types Extend `4195a9450b` so that the next poor fool doesn't come along and say, "sge does the right thing for 16-bit sources, but slt gives a NIR validation failure. What the deuce?" NOTE: This commit is necessary to prevent regressions in GLSLstd450Step tests of 16-bit sources at "spriv: Produce correct result for GLSLstd450Step with NaN". Fixes: `4195a9450b` ("nir: sge operation is defined for floating-point types") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	97ce3a56bd	nir/search: Constify instr parameter to nir_search_expression::cond Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	4dd4135551	nir: Constify def parameter to nir_ssa_def_bits_used Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Otavio Pontes	510d248299	nir: Use proper macro to set bits of variable correctly When slots is 64 only the first bit was being set, instead of setting all 64 bits of the variable, so for that case the function get_variable_io_mask() always returned 0. This behaviour caused variables that are being used both on producer and consumer to be considered unused and thus being removed on nir_remove_unused_io_vars(). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14955>	2022-02-10 17:19:54 +00:00
Georg Lehmann	c2168f845e	nir/lower_mediump: Treat u2u16 like i2i16. There is a comment in nir_fold_16bit_sampler_conversions saying that these are the same, but the code only checks for i2i16. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14893>	2022-02-10 16:13:54 +00:00
Emma Anholt	20469009c7	nir: Delete the per-instr SSA liveness impl. It was introduced for nir-to-tgsi, and I found that it was the wrong approach. There's a reason nobody else does RA this way. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14404>	2022-02-10 00:36:57 +00:00
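
Commit 05703a49f9 above describes replacing four equality-based selects with a three-comparison select tree when indexing a vec4 dynamically. A minimal Python sketch of that equivalence follows. It is not the Mesa code: `csel`, `extract_old` and `extract_new` are hypothetical names used only for illustration, and the index is assumed to be in the range 0..3.

```python
def csel(cond, if_true, if_false):
    """Conditional select: if_true when cond holds, otherwise if_false."""
    return if_true if cond else if_false


def extract_old(a, i):
    # Old shape: four equality tests, with x carried through every select,
    # so x stays live across the whole sequence.
    x = None
    x = csel(i == 0, a[0], x)
    x = csel(i == 1, a[1], x)
    x = csel(i == 2, a[2], x)
    x = csel(i == 3, a[3], x)
    return x


def extract_new(a, i):
    # New shape: three greater-than tests arranged as a small select tree.
    t0 = csel(i > 0, a[1], a[0])
    t1 = csel(i > 2, a[3], a[2])
    return csel(i > 1, t1, t0)


if __name__ == "__main__":
    vec = [10, 20, 30, 40]
    for idx in range(4):
        print(idx, extract_old(vec, idx), extract_new(vec, idx))
```

Both functions agree for every in-range index. The second one needs one fewer comparison and does not thread a partially built result through each step, which is the live-range argument the commit message makes.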

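
Commit 7e8d885919 above mentions computing mat3 and mat4 determinants with one recursive function built on subdeterminants. The following is a rough Python sketch of that general technique (cofactor expansion), not the SPIR-V translator's implementation. The names `subdet` and `det` are hypothetical, and expanding along the first row is an assumption made for illustration.

```python
def subdet(m, row, col):
    """Determinant of m with the given row and column removed."""
    minor = [[v for c, v in enumerate(r) if c != col]
             for k, r in enumerate(m) if k != row]
    return det(minor)


def det(m):
    """Determinant of a square matrix given as a list of rows."""
    if len(m) == 1:
        return m[0][0]
    # Expand along the first row, alternating the sign of each term.
    return sum((-1) ** c * m[0][c] * subdet(m, 0, c) for c in range(len(m)))


if __name__ == "__main__":
    print(det([[1, 2], [3, 4]]))
    print(det([[1, 2, 3], [4, 5, 6], [7, 8, 10]]))
```

Because the same `subdet` helper also yields the entries of the adjugate matrix, an inverse routine written this way repeats many of the determinant's subexpressions, which is what lets common subexpression elimination merge them.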

6795 commits