fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 00:18:09 +02:00

Author	SHA1	Message	Date
Connor Abbott	d7971b41ce	nir/cf: reimplement nir_cf_node_remove() using the new API This gives us some testing of it. Also, the old nir_cf_node_remove() wasn't handling phi nodes correctly and was calling cleanup_cf_node() too late. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	fc7f2d2364	nir/cf: add new control modification API's These will help us do a number of things, including: - Early return elimination. - Dead control flow elimination. - Various optimizations, such as replacing: if (foo) { ... } if (!foo) { ... } with: if (foo) { ... } else { ... } Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	476eb5e4a1	nir/cf: use a cursor for inserting control flow Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	d356f84d4c	nir/cf: add split_block_cursor() This is a helper that will be shared between the new control flow insertion and modification code. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	58a360c6b8	nir/cf: add split_block_before_instr() Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	6e47a34b29	nir/cf: add a cursor structure For now, it allows us to refactor the control flow insertion API's so that there's a single entrypoint (with some wrappers). More importantly, it will allow us to reduce the combinatorial explosion in the extract function. There, we need to specify two points to extract, which may be at the beginning of a block, the end of a block, or in the middle of a block. And then there are various wrappers based off of that (before a control flow node, before a control flow list, etc.). Rather than having 9 different functions, we can have one function and push the actual logic of determining which variant to use down to the split function, which will be shared with nir_cf_node_insert(). In the future, we may want to make the instruction insertion API's as well as the builder use this, but that's a future cleanup. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	6f5c81f86f	nir/cf: fix link_blocks() when there are no successors When we insert a single basic block A into another basic block B, we will split B into C and D, insert A in the middle, and then splice together C, A, and D. When we splice together C and A, we need to move the successors of A into C -- except A has no successors, since it hasn't been inserted yet. So in move_successors(), we need to handle the case where the block whose successors are to be moved doesn't have any successors. Fixing link_blocks() here prevents a segfault and makes it work correctly. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	6d028749ac	nir/cf: clean up jumps when cleaning up CF nodes We may delete a control flow node which contains structured jumps to other parts of the program. We need to remove the jump as a predecessor, as well as remove any phi node sources which reference it. Right now, the same problem exists for blocks that don't end in a jump instruction, but with the new API it shouldn't be an issue, since blocks that don't end in a jump must either point to another block in the same extracted CF list or not point to anything at all. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	211c79515d	nir/cf: remove uses of SSA definitions that are being deleted Unlike calling nir_instr_remove(), calling nir_cf_node_remove() (and later in the series, the nir_cf_list_delete()) implies that you're removing instructions that may still have uses, except those instructions are never executed so any uses will be undefined. When cleaning up a CF node for deletion, we must clean up any uses of the deleted instructions by making them point to undef instructions instead. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	633cbbc068	nir/cf: handle jumps better in stitch_blocks() In particular, handle the case where the earlier block ends in a jump and the later block is empty. In that case, we want to preserve the jump and remove any traces of the later block. Before, we would only hit this case when removing a control flow node after a jump, which wasn't a common occurance, but we'll need it to handle inserting a control flow list which ends in a jump, which should be more common/useful. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	940873bf22	nir/cf: handle jumps in split_block_end() Before, we would only split a block with a jump at the end if we were inserting something after a block with a jump, which never happened in practice. But now, we want to use this to extract control flow lists which may end in a jump, in which case we really need to do the correct patching up. As a side effect, when removing jumps we now correctly insert undef phi sources in some corner cases, which can't hurt. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	f596e4021c	nir/cf: add block_ends_in_jump() Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	788d45cb47	nir/cf: handle phi nodes better in split_block_beginning() Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	747ddc3cdd	nir/cf: split up and improve nir_handle_remove_jumps() Before, the process of removing a jump and wiring up the remaining block correctly was atomic, but with the new control flow modification it's split into two parts: first, we extract the jump, which creates a new block with re-wired successors as well as a free-floating jump, and then we delete the control flow containing the jump, which removes the entry in the predecessors and any phi node sources. Split up nir_handle_remove_jumps() to accomodate this, and add the missing support for removing phi node sources. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:42 -07:00
Connor Abbott	13482111d0	nir/cf: add remove_phi_src() helper Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:41 -07:00
Connor Abbott	f41e108d8b	nir: add nir_foreach_phi_src_safe() Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:41 -07:00
Connor Abbott	762ae436ea	nir/cf: add insert_phi_undef() helper Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:41 -07:00
Connor Abbott	b49371b8ed	nir: move control flow modification to its own file We want to start reworking and expanding this code, but it'll be a lot easier to do once we disentangle it from the rest of the stuff in nir.c. Unfortunately, there are a few unavoidable dependencies in nir.c on methods we'd rather not expose publicly, since if not used in very specific situations they can cause Bad Things (tm) to happen. Namely, we need to do some magical control flow munging when adding/removing jumps. In the future, we may disallow adding/removing jumps in nir_instr_insert_*() and nir_instr_remove(), and use separate functions that are part of the control flow modification code, but for now we expose them and put them in a separate, private header. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:41 -07:00
Connor Abbott	1c53f89696	nir: make cleanup_cf_node() not use remove_defs_uses() cleanup_cf_node() is part of the control flow modification code, which we're going to split into its own file, but remove_defs_uses() is an internal function used by nir_instr_remove(). Break the dependency by making cleanup_cf_node() use nir_instr_remove() instead, which simply calls remove_defs_uses() and then removes the instruction from the list. nir_instr_remove() does do extra things for jumps, though, so we avoid calling it on jumps which matches the previous behavior (this will be fixed later in the series). Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:41 -07:00
Connor Abbott	9d5944053c	nir: inline block_add_pred() a few places It was being used to initialize function impls and loops, even though it's really a control flow modification helper. It's pretty trivial, so just inline it to avoid the dependency. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:41 -07:00
Connor Abbott	c7df141c71	nir/validate: check successors/predecessors more carefully We should be checking almost everything now. Signed-off-by: Connor Abbott <connor.w.abbott@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-08-24 13:31:41 -07:00
Kenneth Graunke	8e0d4ef341	nir: Delete the nir_function_impl::start_block field. It's simply the first nir_cf_node in the nir_function_impl::body list, which is easy enough to access - we don't to store a pointer to it explicitly. Removing it means we don't need to maintain the pointer when, say, splitting the start block when modifying control flow. Thanks to Connor Abbott for suggesting this. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-08-24 13:31:41 -07:00
Martin Peres	80b1707e26	nir: convert the glsl intrinsic image_size to nir_intrinsic_image_size v2, review from Francisco Jerez: - make the destination variable as large as what the nir instrinsic defines (4) instead of the size of the return variable of glsl. This is still safe for the already existing code because all the intrinsics affected returned the same amount of components as expected by glsl IR. In the case of image_size, it is not possible to do so because the returned number of component depends on the image type and this case is not well handled by nir. v3: - Style fix Signed-off-by: Martin Peres <martin.peres@linux.intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2015-08-20 14:07:46 +03:00
Kenneth Graunke	ab83be590d	nir: Use nir_builder in nir_lower_io's get_io_offset(). Much more readable. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-08-19 19:29:39 -07:00
Kenneth Graunke	ed2afec3fc	nir: Pull nir_lower_io's load_op selection into a helper function. Makes the function a bit smaller. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-08-19 19:29:22 -07:00
Thomas Helland	49d0a36bd6	nir: Simplify feq(fneg(a), a)) -> feq(a, 0.0) The positive and negative value of a float can only be equal to each other if it is -0.0f and 0.0f. This is safe for Nan and Inf, as -Nan != Nan, and -Inf != Inf This gives no changes in my shader-db Signed-off-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-08-18 11:34:44 -07:00
Thomas Helland	a39167d594	nir: Simplify fne(fneg(a), a) -> fne(a, 0.0) -NaN != NaN, and -Inf != Inf, so this should be safe. Found while working on my VRP pass. Shader-db results on my IVB: total instructions in shared programs: 1698267 -> 1698067 (-0.01%) instructions in affected programs: 15785 -> 15585 (-1.27%) helped: 36 HURT: 0 GAINED: 0 LOST: 0 Some shaders was found to have the following pattern in NIR: vec1 ssa_26 = fneg ssa_21 vec1 ssa_27 = fne ssa_21, ssa_26 Make that: vec1 ssa_27 = fne ssa_21, 0.0f This is found in Dota2 and Brutal Legend. One shader is cut by 8%, from 323 -> 296 instructons in SIMD8 Signed-off-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-08-18 11:34:44 -07:00
Kenneth Graunke	afccbd7256	nir: Add a glsl_uint_type() wrapper. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au>	2015-08-16 21:44:19 -07:00
Eric Anholt	a6e75e3cd7	nir: Add support for CSE on textures. NIR instruction count results on i965: total instructions in shared programs: 1261954 -> 1261937 (-0.00%) instructions in affected programs: 455 -> 438 (-3.74%) One in yofrankie, two in tropics. Apparently i965 had also optimized all of these out anyway. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-08-14 11:39:18 -07:00
Eric Anholt	fb2425a641	nir: Zero out texture instructions when creating them. There are so many flags in textures, that the CSE pass would have a hard time referencing the correct set when figuring out if two texture ops are the same. By zeroing, we can avoid that fragility. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-08-14 11:39:18 -07:00
Eric Anholt	d50c182671	nir: Don't try to scalarize unpack ops. Avoids regressions in vc4 when trying to do our blending in NIR. v2: Add the other unpack ops I meant to when writing the original commit message. Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-08-14 11:39:18 -07:00
Eric Anholt	9e6dc5b64d	nir: Add a nir_opt_undef() to handle csels with undef. We may find a cause to do more undef optimization in the future, but for now this fixes up things after if flattening. vc4 was handling this internally most of the time, but a GLB2.7 shader that did a conditional discard and assign gl_FragColor in the else was still emitting some extra code. total instructions in shared programs: 100809 -> 100795 (-0.01%) instructions in affected programs: 37 -> 23 (-37.84%) v2: Use nir_instr_rewrite_src() to update def/use on src[0] (by Thomas Helland). v3: Make sure to flag metadata dirties, and copy the swizzle and abs/neg over to src[0], too (by anholt). Reviewed-by: Thomas Helland <thomashelland90@gmail.com> (v2) Tested-by: Thomas Helland <thomashelland90@gmail.com> (v2)	2015-08-14 11:39:18 -07:00
Timothy Arceri	2c61d583f8	nir: add missing type to type_size_vec4() Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-08-05 21:16:45 +10:00
Eric Anholt	6c28ee2041	nir: Add a nir_lower_load_const_to_scalar() pass. This is useful to increase the CSE opportunities for a scalar backend. It avoids regressions when dropping vc4's custom CSE implementation. v2: Cleanups by Matt (decl in the for loop, and unreachable()). Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-08-04 20:03:10 -07:00
Eric Anholt	a70f63ab20	nir: Add algebraic opt for no-op iand. I lazily generated some of these in VC4 NIR lowering. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-08-04 17:19:25 -07:00
Eric Anholt	eae9c3286e	Revert "nir: Use a single bit for the dual-source blend index" This reverts commit `ab5b7a0fe6`. We use more than one bit of value in tgsi_to_nir.	2015-08-04 17:19:01 -07:00
Samuel Iglesias Gonsalvez	418c004f80	nir: Fix output swizzle in get_mul_for_src Avoid copying an overwritten swizzle, use the original values. Example: Former swizzle[] = xyzw src->swizzle[] = zyxx The expected output swizzle = zyxx but if we reuse swizzle in the loop, then output swizzle would be zyzz. Signed-off-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-08-03 09:40:50 -07:00
Iago Toral Quiroga	01f6235020	nir/nir_lower_io: Add vec4 support The current implementation operates in scalar mode only, so add a vec4 mode where types are padded to vec4 sizes. This will be useful in the i965 driver for its vec4 nir backend (and possbly other drivers that have vec4-based shaders). Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-08-03 09:40:47 -07:00
Timothy Arceri	ab5b7a0fe6	nir: Use a single bit for the dual-source blend index The only values allowed are 0 and 1, and the value is checked before assigning. This is a copy of `8eeca7a56c` that seems to have been made to the glsl ir type after it was copied for use in nir but before nir landed. Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-08-03 21:36:50 +10:00
Matt Turner	4251ccb47b	nir: Avoid double promotion. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-07-29 09:34:51 -07:00
Matt Turner	5c7fd67045	glsl: Remove MSVC implementations of copysign and isnormal. Non-Gallium parts of Mesa require MSVC 2013 which provides these.	2015-07-29 09:34:51 -07:00
Dave Airlie	80511d176a	i965: add support for ARB_shader_subroutine This just adds some missing pieces to nir/i965, it is lightly tested on my Haswell. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-07-24 10:25:08 +10:00
Dave Airlie	57f24299b7	glsl/types: add new subroutine type (v3.2) This type will be used to store the name of subroutine types as in subroutine void myfunc(void); will store myfunc into a subroutine type. This is required to the parser can identify a subroutine type in a uniform decleration as a valid type, and also for looking up the type later. Also add contains_subroutine method. v2: handle subroutine to int comparisons, needed for lowering pass. v3: do subroutine to int with it's own IR operation to avoid hacking on asserts (Kayden) v3.1: fix warnings in this patch, fix nir, fix tgsi v3.2: fixup tests Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Signed-off-by: Dave Airlie <airlied@redhat.com> tests: fix warnings	2015-07-23 17:25:25 +10:00
Connor Abbott	eaf799ddff	nir: add nir_foreach_instr_safe_reverse() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Signed-off-by: Connor Abbott <connor.w.abbott@intel.com>	2015-07-17 09:49:53 -07:00
Connor Abbott	8eea091747	nir: add nir_instr_is_first() and nir_instr_is_last() helpers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Signed-off-by: Connor Abbott <connor.w.abbott@intel.com>	2015-07-17 09:47:22 -07:00
Iago Toral Quiroga	6b09598d63	nir: add nir_var_shader_storage Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2015-07-14 07:04:03 +02:00
Kenneth Graunke	efb36271a9	nir: Fix comment above nir_convert_from_ssa() prototype. Connor renamed the parameter, inverting the sense. Update the comment accordingly. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-07-08 11:28:08 -07:00
Rob Clark	959b47262b	nir/lower_phis_to_scalar: undef is trivially scalarizable Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-07-03 08:56:09 -04:00
Jason Ekstrand	89bd5ee64c	nir: Don't allow copying SSA destinations Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-07-02 15:42:33 -07:00
Connor Abbott	aa7d4cecec	nir: remove parent_instr from nir_register It's no longer used. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-06-30 11:18:27 -07:00

1 2 3 4 5 ...

420 commits