fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 09:38:05 +02:00

Author	SHA1	Message	Date
Rhys Perry	b729cd58d7	nir/algebraic: eliminate exact a*0.0 if float execution mode allow it fossil-db (GFX10): Totals from 611 (0.44% of 139391) affected shaders: SGPRs: 40528 -> 40288 (-0.59%) VGPRs: 16136 -> 16152 (+0.10%); split: -0.15%, +0.25% CodeSize: 970192 -> 951036 (-1.97%) MaxWaves: 10561 -> 10557 (-0.04%); split: +0.08%, -0.11% Instrs: 174874 -> 172879 (-1.14%); split: -1.18%, +0.04% fossil-db (GFX10.3): Totals from 611 (0.44% of 139391) affected shaders: SGPRs: 40680 -> 40488 (-0.47%) VGPRs: 18368 -> 18276 (-0.50%); split: -0.57%, +0.07% CodeSize: 1050712 -> 1033624 (-1.63%); split: -1.64%, +0.02% MaxWaves: 8658 -> 8674 (+0.18%) Instrs: 205364 -> 201220 (-2.02%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5523>	2021-01-26 11:36:13 +00:00
Rhys Perry	614ab26afd	nir/algebraic: optimize out exact a+0.0 if it's used only as a float fossil-db (GFX10): Totals from 133 (0.10% of 139391) affected shaders: SGPRs: 7864 -> 7856 (-0.10%); split: -0.20%, +0.10% VGPRs: 4884 -> 4836 (-0.98%) CodeSize: 288932 -> 287084 (-0.64%) MaxWaves: 1973 -> 1979 (+0.30%) Instrs: 53899 -> 53550 (-0.65%) fossil-db (GFX10.3): Totals from 133 (0.10% of 139391) affected shaders: SGPRs: 7832 -> 7835 (+0.04%); split: -0.06%, +0.10% VGPRs: 5144 -> 5088 (-1.09%) CodeSize: 318912 -> 316696 (-0.69%) MaxWaves: 1735 -> 1746 (+0.63%) Instrs: 65367 -> 64853 (-0.79%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5523>	2021-01-26 11:36:13 +00:00
Rhys Perry	2849f0b5aa	nir/algebraic: optimize out exact a*1.0 if it's used only as a float fossil-db (GFX10): Totals from 10180 (7.30% of 139391) affected shaders: SGPRs: 549392 -> 549448 (+0.01%); split: -0.00%, +0.01% VGPRs: 243228 -> 243008 (-0.09%); split: -0.11%, +0.02% CodeSize: 12939080 -> 12603996 (-2.59%); split: -2.59%, +0.00% MaxWaves: 186948 -> 186976 (+0.01%) Instrs: 2497266 -> 2414648 (-3.31%) fossil-db (GFX10.3): Totals from 10180 (7.30% of 139391) affected shaders: SGPRs: 549672 -> 549280 (-0.07%); split: -0.23%, +0.16% VGPRs: 289296 -> 283672 (-1.94%); split: -2.83%, +0.88% CodeSize: 13920180 -> 13255560 (-4.77%); split: -4.77%, +0.00% MaxWaves: 151789 -> 153165 (+0.91%) Instrs: 2756978 -> 2671517 (-3.10%); split: -3.10%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5523>	2021-01-26 11:36:13 +00:00
Caio Marcelo de Oliveira Filho	cb7352ae95	nir: Add a data pointer to the callback in nir_remove_dead_variables Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8706>	2021-01-26 05:58:34 +00:00
Rhys Perry	f8072c133d	nir/opt_uniform_atomics: fix elect detection fossil-db (GFX10.3): Totals from 30 (0.02% of 139391) affected shaders: SGPRs: 1736 -> 1712 (-1.38%) CodeSize: 262116 -> 254728 (-2.82%) Instrs: 50341 -> 48857 (-2.95%) Cycles: 486384 -> 477556 (-1.82%) VMEM: 4821 -> 4589 (-4.81%) Copies: 5013 -> 4890 (-2.45%) Branches: 2108 -> 1983 (-5.93%) PreSGPRs: 1444 -> 1418 (-1.80%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8654>	2021-01-25 21:04:52 +00:00
Rhys Perry	eb70c52abe	nir/opt_uniform_atomics: recognize more complicated invocation comparisons For example, gl_LocalInvocationID.x + gl_LocalInvocationID.y * 8. fossil-db (GFX10.3): Totals from 8 (0.01% of 139391) affected shaders: CodeSize: 15224 -> 14800 (-2.79%) Instrs: 2880 -> 2798 (-2.85%) Cycles: 44556 -> 44204 (-0.79%) VMEM: 407 -> 473 (+16.22%); split: +17.69%, -1.47% Copies: 491 -> 483 (-1.63%) Branches: 200 -> 192 (-4.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8654>	2021-01-25 21:04:52 +00:00
Erik Faye-Lund	bc0222d471	compiler/nir: add texcoord replace lowering pass This lowering pass allows us to replace point-sprites to gl_PointCoord, which better match what modern hardware does. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6473>	2021-01-25 17:32:33 +00:00
Connor Abbott	6ca1ab3bb4	nir/lower_tex: Assume that nir_tex_instr::dest_type is sized This reverts the code back to the form it was before, but with an explicitly sized float32 instead of float, now that all producers are switched over. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7989>	2021-01-25 11:21:59 +01:00
Connor Abbott	708c47e663	nir: Validate nir_tex_instr::dest_type bitsize In theory, we could also verify this against the sampler type for sampler derefs, but there are a number of complications there: - SPIR-V 1.4 lets you override the signedness of integer samplers per-instruction. So the base type may not match. - mediump/RelaxedPrecision samplers may get lowered to f16 in the instruction or may not. So the bitsize may not match. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7989>	2021-01-25 11:21:53 +01:00
Connor Abbott	b2da598ff9	nir: Use sized types for nir_tex_instr::dest_type Revieweeviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7989>	2021-01-25 11:21:48 +01:00
Connor Abbott	3d803893da	nir/lower_bool: Rewrite dest_type for boolean destinations This happens with nir_texop_samples_identical, and we need to keep things consistent and (soon) keep the validator happy when expanding booleans once we switch that to having a dest_type of bool1. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7989>	2021-01-25 11:21:42 +01:00
Connor Abbott	acd6616eab	nir/lower_tex: Handle sized tex destination types Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7989>	2021-01-25 11:21:26 +01:00
Jason Ekstrand	178820212b	nir/lower_int64: Lower 64-bit vote_ieq Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7329>	2021-01-22 18:38:37 +00:00
Jason Ekstrand	731adf1e17	nir/lower_int64: Add lowering for 64-bit iadd shuffle/reduce Lowering iadd is a bit trickier because we have to deal with potential overflow but it's still not bad to do in NIR. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7329>	2021-01-22 18:38:37 +00:00
Jason Ekstrand	bf7a114246	nir/lower_int64: Add lowering for some 64-bit subgroup ops These are all pretty trivial because we can just split the op into one subgroup op per half of the value. There's some question as to whether these belong in lower_int64 or lower_subgroups but, on Intel, they key decider of whether or not we need the lowering is based on whether or not we have hardware int64 support. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7329>	2021-01-22 18:38:37 +00:00
Jason Ekstrand	da331f814f	nir/lower_int64: Fix lowering of f2[ui]64 for 16-bit float Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7329>	2021-01-22 18:38:37 +00:00
Jason Ekstrand	70b4524de5	nir/lower_int64: Add a level of wrapper functions We're about to start lowering a few intrinsics so we need support more than just ALU. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7329>	2021-01-22 18:38:37 +00:00
Rhys Perry	a6d92eaf4f	nir/sink,nir/move: sink/move reorderable load_ssbo Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6490>	2021-01-21 18:07:03 +00:00
Rhys Perry	e200ce0996	nir/lower_io: fix array_length lowering if buffer is smaller than offset Matches SPIR-V -> NIR implementation of OpArrayLength. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8163>	2021-01-21 11:53:12 +00:00
Jesse Natalie	13b21156e4	nir: Work around MSVC x86 internal compiler error Fixes: `1fd8b466` ("nir,spirv: add sparse image loads") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4108 Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8581>	2021-01-20 20:42:48 +00:00
Mike Blumenkrantz	652e51e1f3	nir/lower_uniforms_to_ubo: set explicit_binding on uniform_0 this variable is always bound to buffer index 0, so the binding info here is actually useful Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7935>	2021-01-14 17:29:09 +00:00
Mike Blumenkrantz	491e7decad	util/set: add the found param to search_or_add this brings parity with the internal api Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8450>	2021-01-14 13:51:35 +00:00
Rhys Perry	dfe429eb41	nir/loop_unroll: unroll more aggressively if it can improve load scheduling Significantly improves performance of a Control compute shader. Also seems to increase FPS at the very start of the game by ~5% (RX 580, 1080p, medium settings, no MSAA). fossil-db (Sienna): Totals from 81 (0.06% of 139391) affected shaders: SGPRs: 3848 -> 4362 (+13.36%); split: -0.99%, +14.35% VGPRs: 4132 -> 4648 (+12.49%) CodeSize: 275532 -> 659188 (+139.24%) MaxWaves: 986 -> 906 (-8.11%) Instrs: 54422 -> 126865 (+133.11%) Cycles: 1057240 -> 750464 (-29.02%); split: -42.61%, +13.60% VMEM: 26507 -> 61829 (+133.26%); split: +135.56%, -2.30% SMEM: 4748 -> 5895 (+24.16%); split: +31.47%, -7.31% VClause: 1933 -> 6802 (+251.89%); split: -0.72%, +252.61% SClause: 1179 -> 1810 (+53.52%); split: -3.14%, +56.66% Branches: 1174 -> 1157 (-1.45%); split: -23.94%, +22.49% PreVGPRs: 3219 -> 3387 (+5.22%); split: -0.96%, +6.18% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6538>	2021-01-13 18:54:18 +00:00
Daniel Schürmann	08fbd5d454	nir/divergence_analysis: mark load_push_constant as uniform Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8439>	2021-01-12 14:46:13 +00:00
Daniel Schürmann	bd8e84eb8d	nir: replace .lower_sub with .has_fsub and .has_isub This allows a more fine-grained control about whether a backend supports one of these instructions. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6597>	2021-01-11 19:13:51 +00:00
Daniel Schürmann	b3ce55b445	nir,vc4: Lower fneg to fmul(x, -1.0) This patch also replaces lower_negate with lower_ineg / lower_fneg. The fneg semantics have been clarified as of Version 1.5, Revision 1 of the SPIR-V specification, which means that the previous lowering to fsub is not a viable solution anymore, and is replaced with lowering to fmul(x, -1.0). Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6597>	2021-01-11 19:13:51 +00:00
Erico Nunes	faaba0d6af	nir/lower_vec_to_movs: don't vectorize unsupports ops If the instruction being coalesced would be vectorized but the target doesn't support vectorizing that op, skip coalescing. Reuse the callbacks from alu_to_scalar to describe which ops should not be vectorized. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6506>	2021-01-11 13:13:30 +00:00
Rhys Perry	b634d7f3e2	nir/opt_vectorize: fix srcs_equal() with two different non-const To match hash_alu_src(), this should return false if both are different non-const ssa defs. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8391>	2021-01-09 11:14:05 +00:00
Rhys Perry	bdf316ae7b	nir/opt_vectorize: fix typo in instr_can_rewrite() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8391>	2021-01-09 11:14:05 +00:00
Eric Anholt	670944ba04	nir/lower_locals_to_regs: Use the imul_imm helper instead of forcing it. Cleaned up a bit of addressing math in the shader I just had to debug. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8373>	2021-01-08 21:04:31 +00:00
Rhys Perry	f5adf27fb9	nir,radv: add and use nir_vectorize_tess_levels() fossil-db (Sienna): Totals from 1342 (0.97% of 138791) affected shaders: CodeSize: 3287996 -> 3269572 (-0.56%); split: -0.56%, +0.00% Instrs: 629896 -> 628191 (-0.27%); split: -0.31%, +0.04% Cycles: 2619244 -> 2612424 (-0.26%); split: -0.30%, +0.04% VMEM: 388807 -> 389273 (+0.12%); split: +0.14%, -0.02% SMEM: 90655 -> 90700 (+0.05%); split: +0.06%, -0.01% VClause: 21831 -> 21812 (-0.09%) PreVGPRs: 44155 -> 44058 (-0.22%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	f199b7188b	nir/load_store_vectorize: add data as callback args Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	00c8bec47b	nir: add nir_load_store_vectorize_options Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	f4eb833a12	nir/load_store_vectorize: don't ignore subgroup memory barriers Not sure why I thought this was correct, but we should consider them for optimization purposes. Fixes: `ce9205c03b` ('nir: add a load/store vectorization pass') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	c73c246e05	nir: gather whether a compute shader uses non-quad subgroup intrinsics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7918>	2021-01-07 15:01:02 +00:00
Rhys Perry	7d1d4acbd5	nir/lower_tex: fix lower_tg4_offsets with sparse fetches Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	2d2decc905	nir: add sparse_residency_code_and Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	4cbdf9ec4d	nir,spirv: implement SpvOpImageSparseTexelsResident Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	1fd8b46667	nir,spirv: add sparse image loads Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	3a7972f72a	nir,spirv: add sparse texture fetches Like SPIR-V and GL_ARB_sparse_texture2, these return a residency code. It is placed in the destination after the rest of the result. If it's zero, then the texel is resident. Otherwise, it's not resident. Besides the larger destination and the residency code, sparse fetches work the same as normal fetches. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	95819663b7	nir: allow 5 component vectors These will be useful for sparse texture instructions and image load intrinsics. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Rhys Perry	ba4a73a502	nir/tests: fix callback for load/store vectorizer tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7774>	2021-01-06 20:36:38 +00:00
Daniel Schürmann	22b89d9a52	nir/opt_vectorize: fix call to filter function Due to the typo, it could happen that instructions got further vectorized than intended. Fixes: `8eaf9c61d1` ('nir/opt_vectorize: don't hash filtered instructions') Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8352>	2021-01-06 19:03:07 +00:00
Christian Gmeiner	c0fe111d64	nir: use intrinsic builders Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8295>	2021-01-06 14:34:41 +00:00
Mike Blumenkrantz	b5fb66a5ed	nir: preserve explicit_binding in lower_atomics_to_ssbo it's important to be able to tell whether this is explicitly set by the user Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7489>	2021-01-06 12:56:09 +00:00
Jesse Natalie	4d83306a9a	nir: Update saturated float->int/uint conversion algorithm The mantissa for a float doesn't contain enough data to accurately represent the min/max values for some destination types. Instead of clamping before converting, clamp after converting when coming from floats. This improves conformance of CL conversions, specifically for float -> long/ulong with int64 emulation enabled. Refactors the limit determination from the clamp, so we can determine limits for the dest type (int/uint) in both the source (float) and dest type. The limit as a float is used for comparison, while the limit as a dest type is used for bcsel. Important note is that the comparison is inverted to fge instead of flt, so the bcsel chooses the direct int/uint over the converted float in the case where the comparison comes up equal, but the conversion can't produce the exact min/max value. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8256>	2021-01-05 19:46:25 +00:00
Ian Romanick	539c25c2da	nir/algebraic: Move the flrp -> bcsel rule earlier If multiple rules could match, the rule that appears first in the file is used. Only Tiger Lake and Ice Lake are affected. Other platforms either have a LRP instruction or can't run any shaders from shader-db that would benefit. v2: Fix issues created when this commit was rebased on top of `3c8934a644` ("nir/algebraic: add flrp patterns for 16 and 64 bits"). Noticed by Caio. Tiger Lake and Ice Lake had similar results. total instructions in shared programs: 20908672 -> 20908661 (<.01%) instructions in affected programs: 419 -> 408 (-2.63%) helped: 5 HURT: 0 helped stats (abs) min: 1 max: 3 x̄: 2.20 x̃: 3 helped stats (rel) min: 1.85% max: 3.19% x̄: 2.49% x̃: 2.65% 95% mean confidence interval for instructions value: -3.56 -0.84 95% mean confidence interval for instructions %-change: -3.24% -1.73% Instructions are helped. total cycles in shared programs: 473513940 -> 473513793 (<.01%) cycles in affected programs: 7176 -> 7029 (-2.05%) helped: 12 HURT: 0 helped stats (abs) min: 5 max: 22 x̄: 12.25 x̃: 12 helped stats (rel) min: 0.84% max: 3.24% x̄: 2.09% x̃: 1.80% 95% mean confidence interval for cycles value: -15.43 -9.07 95% mean confidence interval for cycles %-change: -2.57% -1.61% Cycles are helped. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	ec16f935fe	nir/algebraic: Mark comparisons generated from lowered fsign precise This prevents other transformations from converting them to 'a != 0'. For example, both of these transformations can do this: (('~flt', 0.0, ('fabs', a)), ('fne', a, 0.0)), (('~flt', ('fneg', ('fabs', a)), 0.0), ('fne', a, 0.0)), Both fsign(fabs(NaN)) and fsign(fneg(fabs(NaN))) should produce zero, but, since 'NaN != 0.0' is true, cascading these transformations could cause them to generate 1.0 or -1.0 respecively. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	9771af5dde	nir/algebraic: Fix broken NaN and -0.0 behavior No shader-db or fossil-db changes on any Intel platform. v2: Add a coding line to fix SCons build problems caused by the ± character. Fixes: `25bfba3335` ("nir/algebraic: Recognize open-coded copysign(1.0, a)") Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00
Ian Romanick	55621c6d1c	nir/algebraic: Add some compare-with-zero optimizations that are exact This prevents some fossil-db regressions in "spir-v: Mark floating point comparisons exact". v2: Note that the patterns and replacements produce the same value when isnan(b). Suggested by Caio. v3: Use C99 isfinite() instead of (obsolete) BSD finite(). Fixes various Windows builds. No fossil-db changes on any Inetl platform, Vega, or Polaris10. All Intel platforms had similar results. (Tiger Lake shown) total instructions in shared programs: 20908670 -> 20908672 (<.01%) instructions in affected programs: 69 -> 71 (2.90%) helped: 0 HURT: 1 total cycles in shared programs: 473515288 -> 473513940 (<.01%) cycles in affected programs: 4942 -> 3594 (-27.28%) helped: 2 HURT: 0 Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6358>	2021-01-05 02:07:09 +00:00

1 2 3 4 5 ...

2936 commits