fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 17:50:12 +01:00

Author	SHA1	Message	Date
Daniel Schürmann	a3785e3481	nir/opt_vectorize: hash whether a swizzle accesses elements beyond the maximum vectorization factor Swizzles that access components outside of the maximum vector size cannot be vectorized with each other. This patch creates different hash bins for this case. For example accesses to .x and .y are considered different variables compared to accesses to .z and .w for 16-bit vec2. This prevents the vectorization of things like vec2 16 ssa_3 = iadd ssa_1.xz, ssa_2.xz Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Daniel Schürmann	46e7428031	nir/opt_vectorize: rehash users of vectorized instructions This ensures that chains of ALU instructions are vectorized in a single iteration. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Daniel Schürmann	8eaf9c61d1	nir/opt_vectorize: don't hash filtered instructions This patch also changes nir_opt_vectorize_cb to use only one instruction as parameter. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Daniel Schürmann	23b2885514	nir/opt_vectorize: don't hash instructions which are already vectorized This guarantees that the hashset contains exactly the instructions which can be vectorized. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Daniel Schürmann	ad37e4df73	nir/opt_vectorize: use a single instruction per hash entry instead of a vector This drastically simplifies vectorization but may potentially lead to slightly worse vectorizations. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6666>	2020-12-31 16:44:58 +00:00
Marek Olšák	656d8edd9e	nir/opt_vectorize: don't lose exact and no_*_wrap flags This fixes a bunch of dEQP GLES tests. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6694>	2020-09-11 17:41:14 -04:00
Jason Ekstrand	d86e38af2c	nir: More NIR_MAX_VEC_COMPONENTS fixes A couple of these probably aren't strictly necessary but they won't hurt. The one that's particularly tricky is a fixed-length array in nir_search.h. However, to avoid blowing up the binary size of nir_opt_algebraic by about 2x, we just assert that only small ops are used. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6655>	2020-09-09 20:19:42 +00:00
Eric Anholt	f25e169897	nir/opt_vectorize: Add a callback for filtering of vectorizing. For NIR-to-TGSI, we don't want to revectorize 64-bit ops that we split to scalar beyond vec2 width. We even have some ops that we would rather retain as scalar due to TGSI opcodes being scalar, or having more unusual requirements. This could be used to do the vectorize_vec2_16bit filtering, but that shader compiler option is also used in algebraic so leave it in place for now. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6567>	2020-09-02 09:59:17 -07:00
Marek Olšák	116e006693	nir: add options::vectorize_vec2_16bit to limit vectorization to vec2 16 for hardware that is scalar but can do 2 16-bit operations on low and high 16 bits of registers at once. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5002>	2020-06-02 20:01:18 +00:00
Dmitriy Nester	0e9af02323	nir: replace fnv1a hash function with xxhash xxhash is faster than fnv1a in almost all circumstances, so we're switching to it globally. Signed-off-by: Dmytro Nester <dmytro.nester@globallogic.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4020>	2020-05-25 19:41:09 +00:00
Timothy Arceri	7f106a2b5d	util: rename list_empty() to list_is_empty() This makes it clear that it's a boolean test and not an action (eg. "empty the list"). Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-10-28 11:24:38 +00:00
Eric Engestrom	bbeb507543	compiler/nir: add an ASSERTED Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Jonathan Marek	0b5a483baa	nir: opt_vectorize: combine different constant sources We can vectorize instructions with different constant sources by creating a new load_const and using that. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-26 14:56:28 -04:00
Connor Abbott	47e7c6961a	nir: add a vectorization pass This effectively does the opposite of nir_lower_alus_to_scalar, trying to combine per-component ALU operations with the same sources but different swizzles into one larger ALU operation. It uses a similar model as CSE, where we do a depth-first approach and keep around a hash set of instructions to be combined, but there are a few major differences: 1. For now, we only support entirely per-component ALU operations. 2. Since it's not always guaranteed that we'll be able to combine equivalent instructions, we keep a stack of equivalent instructions around, trying to combine new instructions with instructions on the stack. The pass isn't comprehensive by far; it can't handle operations where some of the sources are per-component and others aren't, and it can't handle phi nodes. But it should handle the more common cases, and it should be reasonably efficient. [Alyssa: Rebase on latest master, updating with respect to typeless moves] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-06-18 06:43:30 -07:00

14 commits