fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 20:00:10 +01:00

Author	SHA1	Message	Date
Eric Anholt	1e12dafcac	glsl: Optimize triop_csel with all-true or all-false. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:48 -08:00
Eric Anholt	de796b0ef0	glsl: Optimize various cases of fma (aka MAD). Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:48 -08:00
Eric Anholt	44577c4857	glsl: Optimize lrp(x, x, coefficient) --> x. total instructions in shared programs: 1627754 -> 1624534 (-0.20%) instructions in affected programs: 45748 -> 42528 (-7.04%) GAINED: 3 LOST: 0 (serious sam, humus domino demo) Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:48 -08:00
Eric Anholt	d72956790f	glsl: Optimize pow(x, 1) -> x. total instructions in shared programs: 1627826 -> 1627754 (-0.00%) instructions in affected programs: 6640 -> 6568 (-1.08%) GAINED: 0 LOST: 0 (HoN and savage2) Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:48 -08:00
Eric Anholt	6d7c123d6c	glsl: Optimize log(exp(x)) and exp(log(x)) into x. Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-07 12:46:47 -08:00
Eric Anholt	2c2aa35336	glsl: Optimize ~~x into x. v2: Fix pasteo of an extra abs being inserted (caught by many). Rewrite to drop the silly switch statement. Reviewed-by: Matt Turner <mattst88@gmail.com> (v1)	2014-02-07 12:46:47 -08:00
Jordan Justen	8d37e9915a	glsl: Optimize open-coded lrp into lrp. total instructions in shared programs: 1498191 -> 1487051 (-0.74%) instructions in affected programs: 669388 -> 658248 (-1.66%) GAINED: 1 LOST: 0 Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2014-01-21 14:20:44 -08:00
Kenneth Graunke	847bc36a38	glsl: Optimize pow(2, x) --> exp2(x). On Haswell, POW takes 24 cycles, while EXP2 only takes 14. Plus, using POW requires putting 2.0 in a register, while EXP2 doesn't. I believe that EXP2 will be faster than POW on basically all GPUs, so it makes sense to optimize it. Looking at the savage2 subset of shader-db: total instructions in shared programs: 113225 -> 113179 (-0.04%) instructions in affected programs: 2139 -> 2093 (-2.15%) instances of 'math pow': 795 -> 749 (-6.14%) instances of 'math exp': 389 -> 435 (11.8%) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-01-07 12:54:57 -08:00
Kenneth Graunke	d6c1d66d3a	glsl: Optimize pow(1.0, X) --> 1.0. Surprisingly, this helps one vertex shader in 3DMMES. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-01-07 12:54:57 -08:00
Eric Anholt	aa6d7bc6d6	glsl: Apply the transformation "1/rsq(x) == sqrt(x)" in opt_algebraic. The comment was stale, because the lowering in question wasn't happening in lower_instructions.cpp. Presumably if the lowering ever moves there, we can plumb the lowering mask through to opt_algebraic. total instructions in shared programs: 1618696 -> 1616810 (-0.12%) instructions in affected programs: 243018 -> 241132 (-0.78%) GAINED: 0 LOST: 0 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 11:33:07 -08:00
Eric Anholt	477f8cd08b	glsl: Apply the transformation "(a ^^ a) -> false" in opt_algebraic. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 11:33:07 -08:00
Eric Anholt	58a98d32e4	glsl: Apply the transformation "(a && a) -> a" in opt_algebraic. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 11:33:07 -08:00
Eric Anholt	ee27048262	glsl: Apply the transformation "(a \|\| a) -> a" in opt_algebraic. total instructions in shared programs: 1732385 -> 1732373 (-0.00%) instructions in affected programs: 416 -> 404 (-2.88%) GAINED: 0 LOST: 0 (That's 4 already-short fragment shaders in dota2) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2013-11-15 11:33:07 -08:00
Eric Anholt	08bf52712e	glsl: Drop no-op shifts involving 0. I noticed this in a shader in Unigine Heaven that was spilling. While it doesn't really reduce register pressure, it shaves a few instructions anyway (7955 -> 7882). v2: Fix turning "0 >> x" into "x" instead of "0" (caught by Erik Faye-Lund). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-10-28 14:07:31 -07:00
Eric Anholt	3a0fdf2ab6	glsl: Use ir_builder more in opt_algebraic. While ir_builder is slightly less efficient, we're only increasing the work when there's actual optimization being done, and it's way more readable code. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-10-28 14:07:31 -07:00
Eric Anholt	27bcb5063f	glsl: Move common code out of opt_algebraic's handle_expression(). Matt and I had each screwed up these common required patterns recently, in ways that wouldn't have been noticed for a long time if not for code review. Just enforce it in the caller so that we don't rely on code review catching these bugs. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2013-10-28 14:07:31 -07:00
Matt Turner	64c081e8b7	glsl: Optimize (not A) and (not B) into not (A or B). No shader-db changes, but seems like a good idea. Reviewed-by: Eric Anholt <eric@anholt.net>	2013-10-25 10:35:18 -07:00
Matt Turner	65a600f58a	glsl: Optimize (not A) or (not B) into not (A and B). A few Serious Sam 3 shaders affected: instructions in affected programs: 4384 -> 4344 (-0.91%) Reviewed-by: Eric Anholt <eric@anholt.net>	2013-10-25 10:35:13 -07:00
Matt Turner	f1e605f1ad	glsl: Optimize -(-expr) into expr. Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2013-10-21 22:53:36 -07:00
Matt Turner	963df4d37d	glsl: Optimize abs(-expr) and abs(abs(expr)) into abs(expr). Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2013-10-21 22:53:36 -07:00
Matt Turner	5b3aec412e	glsl: Use saved values instead of recomputing them. Reviewed-by: Paul Berry <stereotype441@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2013-10-21 22:53:36 -07:00
Matt Turner	a360ca7476	glsl: Optimize mul(a, -1) into neg(a). Two extra instructions in some heroesofnewerth shaders, but a win for everything else. total instructions in shared programs: 1531352 -> 1530815 (-0.04%) instructions in affected programs: 121898 -> 121361 (-0.44%) Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-10-16 20:49:43 -07:00
Matt Turner	499d8c6545	glsl: Add support for new bit built-ins in ARB_gpu_shader5. v2: Move use of ir_binop_bfm and ir_triop_bfi to a later patch. Reviewed-by: Chris Forbes <chrisf@ijw.co.nz>	2013-05-06 10:17:13 -07:00
Matt Turner	af2c64063e	glsl: Optimize ir_triop_lrp(x, y, a) with a = 0.0f or 1.0f Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2013-02-28 13:18:59 -08:00
Kenneth Graunke	93066ce129	glsl: Convert mix() to use a new ir_triop_lrp opcode. Many GPUs have an instruction to do linear interpolation which is more efficient than simply performing the algebra necessary (two multiplies, an add, and a subtract). Pattern matching or peepholing this is more desirable, but can be tricky. By using an opcode, we can at least make shaders which use the mix() built-in get the more efficient behavior. Currently, all consumers lower ir_triop_lrp. Subsequent patches will actually generate different code. v2 [mattst88]: - Add LRP_TO_ARITH flag to ir_to_mesa.cpp. Will be removed in a subsequent patch and ir_triop_lrp translated directly. v3 [mattst88]: - Move changes from the next patch to opt_algebraic.cpp to accept 3-src operations. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2013-02-28 13:18:59 -08:00
Matt Turner	ae419a0159	glsl: Transform dot product by a basis vector into a swizzle Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-12 18:51:25 -04:00
Matt Turner	d7bef19c7f	glsl: Check for zero vectors in ir_binop_dot Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-12 18:51:25 -04:00
Eric Anholt	337d9c955b	glsl: Put a bunch of optimization visitors under anonymous namespaces. Because these classes are used entirely from their own source files and not from separate DSOs, the linker gets to produce massively less code. This cuts about 13k of text in the libdricore case. In the non-libdricore case, the additional linkage information allows the compiler to inline some code, so libglsl.a size actually increases by about 300 bytes. For a dricore build, improves shader_runner runtime on glsl-fs-copy-propagation-texcoords-1 by 0.21% +/- 0.03% (n=353574, outliers removed). No statistically significant difference with n=322 on glslparsertest on a yofrankie shader intended to test compiler performance. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2012-06-11 09:28:00 -07:00
Kenneth Graunke	d3073f58c1	Convert everything from the talloc API to the ralloc API.	2011-01-31 10:17:09 -08:00
Aras Pranckevicius	4ce084c707	glsl: fix matrix type check in ir_algebraic Fixes glsl-mat-mul-1.	2010-11-30 13:32:00 -08:00
Ian Romanick	11d6f1c698	glsl: Add ir_quadop_vector expression The vector operator collects 2, 3, or 4 scalar components into a vector. Doing this has several advantages. First, it will make ud-chain tracking for components of vectors much easier. Second, a later optimization pass could collect scalars into vectors to allow generation of SWZ instructions (or similar as operands to other instructions on R200 and i915). It also enables an easy way to generate IR for SWZ instructions in the ARB_vertex_program assembler.	2010-11-19 15:00:26 -08:00
Ian Romanick	fc92e87b97	glsl: Eliminate assumptions about size of ir_expression::operands This may grow in the near future.	2010-11-19 15:00:25 -08:00
Chad Versace	df883eb157	glsl: Fix Doxygen tag \file in recently renamed files	2010-11-17 12:07:24 -08:00
Ian Romanick	38e55153af	glsl: Refactor is_vec_{zero,one} to be methods of ir_constant These predicates will be used in other places soon.	2010-11-16 12:11:02 -08:00
Kenneth Graunke	32aaf89823	glsl: Rename various ir_* files to lower_* and opt_*. This helps distinguish between lowering passes, optimization passes, and other compiler code.	2010-11-15 16:34:20 -08:00

35 commits