fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 00:58:13 +02:00

Author	SHA1	Message	Date
Jason Ekstrand	b84f74f9b7	nir/lower_io: Support generic pointer access If the pointer is generic and we haven't yet figured out what kind of pointer it is yet, we emit an if-ladder based on a mode check. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	a451f037ff	nir/lower_io: Add support for lowering deref_mode_is The guts are still missing so it will blow up if it sees any deref_mode_is intrinsic that it can't constant-fold from the mode. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	57943112d9	nir/lower_io: Add support for 32/64bit_global for shared Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	c50332fbc2	nir/lower_io: Add a mode parameter to addr_format_is_* Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	7007d06898	nir/lower_io: Add a mode parameter to build_addr_iadd Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	ff838abc46	nir/opt_deref: Add an optimization for deref_mode_is If opt_restrict_deref_modes makes progress, we may be able to figure out the mode well enough to turn a deref_mode_is intrinsic into a constant. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	df51518dc5	nir/opt_deref: Add a deref mode specialization optimization Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	a8e53a772f	spirv: Add generic pointer support Most of this is fairly straightforward; we just set all the modes on any derefs which are generic. The one tricky bit is OpGenericCastToPtrExplicit. Instead of adding NIR intrinsics to do the cast, we add NIR intrinsics to do a storage class check and then bcsel based on that. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	d6415b5d2b	nir: Add support for generic pointers The way they're handled is that deref->modes is treated as a bitfield of possible modes. Variables are required to have a specific mode and derefs with deref_type_var are as well. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	9d377c01d0	nir: Make nir_deref_instr::mode a bitfield We rename it to "modes" to make it clear that it may contain more than one mode and adjust all the uses of nir_deref_instr::modes to attempt to handle multiple modes. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	7d5f3b5c0e	nir/split_*_vars: Prepare for generic pointers All three passes check the variables for complex uses and don't split them if they have any complex uses. Most of these checks are just early returns to avoid chasing the deref to the variable and a hash table lookup if we can quickly determine it has the wrong mode. In a couple of cases, we need to re-arrange or add other checks to ensure that it's safe for generic pointers. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	d50a4dbc13	nir/find_array_copies: Prepare for generic pointers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	ced9b6f0d8	nir: Use nir_deref_mode_may_be in deref optimizations All the checks being replaced are for potential aliasing so we want to flush stores whenever the mode might be something that aliases. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	379d535480	nir/vec3_to_vec4: Use nir_deref_must_be We use the same nir_deref_mode_is_in_set helper that we use in nir_lower_vars_to_explicit_types for the same reason. If there are any generic pointers in play, we have to lower all generic pointer modes at the same time or else we risk types getting out-of-sync. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	8a2cda1d53	nir/vars_to_ssa: Use nir_deref_must_be We can only lower a deref to SSA in this pass if it's guaranteed to be nir_var_function_temp. We already flag any variables with complex uses (i.e. casts) as not being lowerable and refuse to lower any derefs to them so we don't have to worry about false negatives. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	0f94ff8a6a	nir: Only force loop unrolling if we know it's a in/out/temp If we don't know the actual mode then we can't get to the variable so it's going to be a scratch or other indirect load anyway and we aren't saving ourselves anything by unrolling the loop. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	fff78fc1c5	nir/phis_to_scalar,gcm: Use nir_deref_mode_may_be In both cases, we're trying to determine if a load is scalarizable. We don't want to scalarize if it's a function_temp or shader_temp because it might turn into something we can't scalarize. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	9f3e3dfd2f	nir/lower_io: Use nir_deref_mode_* helpers For non-explicit nir_lower_io, we use nir_deref_mode_is because there's no way it works for generic pointers. For nir_lower_vars_to_explicit_types, and nir_lower_explicit_io, we use nir_deref_mode_is_in_set to ensure we never get type confusion. For generic pointers, this means that they must be called with the full set of generic pointer modes. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	9ae87a6c31	nir/lower_array_deref_of_vec: Use nir_deref_mode_must_be Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	3cc58e6470	nir: Add and use some deref mode helpers NIR derefs currently have exactly one variable mode. This is about to change so we can handle OpenCL generic pointers. In order to transition safely, we need to audit every deref->mode check. This commit adds a set of helpers that provide more nuanced mode checks and converts most of NIR to use them. For simple cases, we add nir_deref_mode_is and nir_deref_mode_is_one_of helpers. These can be used in passes which don't have to bother with generic pointers and just want to know what mode a thing is. If the pass ever encounters generic pointers in a way that this check would be unsafe, it will assert-fail to alert developers that they need to think harder about things and fix the pass. For more complex passes which require a more nuanced understanding of modes, we add nir_deref_mode_may_be and nir_deref_mode_must_be helpers which accurately describe the compiler's best knowledge about the given deref. Unfortunately, we may not be able to exactly identify the mode in a generic pointers scenario so we have to be very careful when we use these. Conversion of these passes is left to later commits. For the case of mass lowering of a particular mode (nir_lower_explicit_io is one good example), we add nir_deref_mode_is_in_set. This is also pretty assert-happy like nir_deref_mode_is but is for a set containment comparison on deref modes where you expect the deref to either be all-in or all-out. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	74886cabaa	nir/opt_find_array_copies: Allow copies from mem_constant Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	89abfbeb7a	nir: Disallow writes to system values and mem_constant Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	bb5d5029b7	nir: Use var->data.mode instead of deref->mode in a few cases We already have the variable so we know the mode exactly. Just use that instead of the deref mode. If these paths ever have to handle variable pointers (not likely since they're OpenGL-specific), we can fix them to handle crazy deref modes then. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	5664713d7b	nir: Handle incomplete derefs in split_struct_vars In split_var_list_structs where we initialize the splitting, we already use get_complex_used_vars to avoid splitting any variables that have a complex use. However, we weren't actually handling the complex uses properly in the case where we can't actually find the variable. Fixes: `f1cb3348f1` "nir/split_vars: Properly bail in the presence of ..." Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	6b72004f12	nir/phis_to_scalar: Use a deny-list for load_deref modes I can't think of any reason why shared and output aren't in this list. The real thing we're trying to do is avoid premature scalarization because of a shader or function temporary variable because we might lower it to something we don't want scalarized later. Also fix the version we copy+pasted into GCM. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Jason Ekstrand	3f0a29fffb	nir/builder: Add a nir_ieq_imm helper This shows up surprisingly often. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6332>	2020-11-03 22:18:28 +00:00
Rhys Perry	89c4bba8bc	nir/algebraic: better propagate constants up fadd chains Make the optimization create more mad-friendly code if the order of the fadd's operands is unlucky. fossil-db (Navi): Totals from 9259 (8.07% of 114665) affected shaders: SGPRs: 615991 -> 616191 (+0.03%); split: -0.05%, +0.08% VGPRs: 442184 -> 443568 (+0.31%); split: -0.10%, +0.41% CodeSize: 32674876 -> 32625572 (-0.15%); split: -0.17%, +0.02% MaxWaves: 108560 -> 108152 (-0.38%); split: +0.07%, -0.44% Instrs: 6126473 -> 6120463 (-0.10%); split: -0.13%, +0.03% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5631>	2020-11-03 14:56:00 +00:00
Rhys Perry	24a18b1a4b	nir: scalarize fdot in reverse This will create code that is easier to combine into MADs/FMA when the last component is 1.0. nir_opt_algebraic_late has an optimization to do something similar but it only works for inexact code, if the multiplication-by-1 optimization is done before it and if the backend enables fuse_ffma. fossil-db (Navi): Totals from 85583 (74.64% of 114665) affected shaders: SGPRs: 4556060 -> 4558596 (+0.06%); split: -0.07%, +0.12% VGPRs: 3315060 -> 3312984 (-0.06%); split: -0.23%, +0.17% SpillSGPRs: 13552 -> 13553 (+0.01%) CodeSize: 184962756 -> 184431388 (-0.29%); split: -0.32%, +0.03% MaxWaves: 1208693 -> 1209361 (+0.06%); split: +0.17%, -0.11% Instrs: 35678819 -> 35361617 (-0.89%); split: -0.91%, +0.02% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5631>	2020-11-03 14:56:00 +00:00
Jason Ekstrand	78a420ce46	nir/validate: Explain why we don't use nir_foreach_block Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7409>	2020-11-02 23:21:13 +00:00
Yevhenii Kolesnikov	ea81889ea4	nir/large_constants: only search for constant duplicates Fixes: `b6d4753568` ("nir/large_constants: De-duplicate constants") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3706 Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7350>	2020-11-02 17:30:31 +00:00
James Park	ce5e2e2131	nir: Stabilize compact_components sort Incorporate location_frac into qsort comparison. qsort is not required to be stable, and MSVC implementation is not. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7399>	2020-11-02 17:15:15 +00:00
Danylo Piliaiev	8077f3f4c4	nir/lower_returns: Append missing phis' sources after "break" insertion After we lowered `return` into `break` - the control flow is changed and the block with this change has a new successor, which means that in this new successor phis should have additional source. Since the instructions that use phis in the successor are predicated - it's ok for a new phi source to be undef. If `return` is lowered in a nested loop, `break` is inserted in the outer loops, so all new blocks with break require the same changes to phis described above. Examples of NIR before lowering: block block_0: loop { block block_1: if ssa_2 { block block_2: return // succs: block_6 } else { block block_2: break; // succs: block_5 } block block_4: } block block_5: // preds: block_3 vec1 32 ssa_4 = phi block_3: ssa_1 // succs: block_6 block block_6: Here converting return to break should add block_2 to the phis of block_5. block block_0: loop { block block_1: loop { block block_2: if ssa_2 { block block_3: return // succs: block_8 } else { block block_4: break; // succs: block_6 } block block_5: } block block_6: break; // succs: block_7 } block block_7: // preds: block_6 vec1 32 ssa_4 = phi block_6: ssa_1 // succs: block_8 block block_8: Here converting return to break will insert conditional break in the outer loop, changing block_6 predecessors. Cc: <mesa-stable@lists.freedesktop.org> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3322 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3498 Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6186>	2020-11-02 14:12:21 +00:00
Daniel Schürmann	bd0468ed33	nir: add options to lower nir_op_pack_[64/32]_* via nir_lower_alu_to_scalar() Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6527>	2020-10-28 10:14:26 +00:00
Jason Ekstrand	3d9ffdcc72	nir/lower_memcpy: Don't mask the store For constant-size memcpys, we can do as much as a vec4 at a time. We were accidentally masking the store to only the .x component. Fixes: `a3177cca99` "nir: Add a lowering pass to lower memcpy" Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7305>	2020-10-26 14:47:19 +00:00
Connor Abbott	4ca38a1995	nir/lower_clip_cull: Store array size for FS inputs I think the rationale for not setting the size for inputs is that when passed between geometry stages the clip and cull distances are supposed to be treated like any other varying. However, this isn't 100% the case for the FS, since when it's read by the FS it's also used by the fixed-function stage. In freedreno we setup varying locations when compiling the FS, and then tack on VS-only outputs like gl_Position at the end. Furthermore there's code to compact input locations based on what's actually read. But this compaction can't happen for clip and cull distances, because then we won't have space for components that are only read by the clipper. So, we need to know the original number of components for both arrays. Modify this pass so that we don't have to go digging around for it ourselves. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6959>	2020-10-23 11:09:18 +00:00
Andrii Simiklit	d972a6ac4c	nir: get rid of OOB dereferences in nir_lower_io_arrays_to_elements This patch fixes mesa compiler crash in i965 on shaders like the following one: ``` in VS_OUTPUT { mat4 data; } vs_output; out vec4 fs_output; vec4 convert(in float val) { return vec4(val); } void main() { fs_output = vec4(0.0); for (int a = -1; a < 5; a++) { for (int b = -1; b < 5; b++) { fs_output += convert(vs_output.data[b][a]); } } } ``` Section 5.11 (Out-of-Bounds Accesses) of the GLSL 4.60 spec says: In the subsections described above for array, vector, matrix and structure accesses, any out-of-bounds access produced undefined behavior.... Out-of-bounds reads return undefined values, which include values from other variables of the active program or zero. Out-of-bounds writes may be discarded or overwrite other variables of the active program. GL_KHR_robustness and GL_ARB_robustness encourage us to return zero for reads. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6560>	2020-10-23 09:51:38 +00:00
Ian Romanick	67956689bb	nir: Rename replicated-result dot-product instructions All these instructions replicate the result of a N-component dot-product to a vec4. Naming them fdot_replicatedN gives the impression that they are some sort of abstract dot-product that replicates the result to a vecN. They also deviate from fdph_replicated... which nobody would reasonably consider naming fdot_replicatedh. Naming these opcodes fdotN_replicated more closely matches what they are, and it matches the pattern of fdph_replicated. I believe that the only reason these opcodes were named this way was because it simplified the implementation of the binop_reduce function in nir_opcodes.py. I made some fairly simple changes to that function, and I think the end result is ok. The bulk of the changes come from the sed rename: sed --in-place -e 's/fdot_replicated\([234]\)/fdot\1_replicated/g' \ $(grep -r 'fdot_replicated[234]' src/) v2: Use a named parameter to binop_reduce instead of using isinstance(name, str). Suggested by Jason. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5725>	2020-10-22 18:00:19 +00:00
Gert Wollny	b739bb7168	compile/nir: Correct printing dest_type Fixes: `0aa08ae2f6` nir: Split NIR_INTRINSIC_TYPE into separate src/dest indices Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7261>	2020-10-22 11:39:34 +00:00
Rhys Perry	4735c8a522	nir/loop_analyze: adjust force unrolling to only include interesting modes Instead of force-unrolling any loop which reads an entire array, only do it for arrays which might be faster to access with constant indices. Significantly improves compile-time for these CTS tests, which could previously timeout: dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.struct_mixed_types.uniform_buffer_block_geom dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.struct_mixed_types.uniform_geom dEQP-VK.spirv_assembly.instruction.graphics.8bit_storage.struct_mixed_types.storage_buffer_geom dEQP-VK.spirv_assembly.instruction.graphics.spirv_ids_abuse.lots_ids_geom fossil-db (Navi): Totals from 19 (0.01% of 137413) affected shaders: SGPRs: 1728 -> 1688 (-2.31%) VGPRs: 1176 -> 1168 (-0.68%) CodeSize: 198496 -> 136580 (-31.19%) MaxWaves: 154 -> 156 (+1.30%) Instrs: 38889 -> 26029 (-33.07%) Cycles: 446108 -> 1059924 (+137.59%); split: -0.91%, +138.51% VMEM: 3245 -> 2926 (-9.83%) SMEM: 850 -> 828 (-2.59%); split: +4.71%, -7.29% VClause: 549 -> 533 (-2.91%) SClause: 1810 -> 1522 (-15.91%) Copies: 2209 -> 1705 (-22.82%); split: -22.95%, +0.14% Branches: 854 -> 603 (-29.39%); split: -29.86%, +0.47% PreSGPRs: 1512 -> 1506 (-0.40%); split: -0.53%, +0.13% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7161>	2020-10-22 12:07:45 +01:00
Caio Marcelo de Oliveira Filho	8cf0024432	nir: Use a switch in nir_lower_explicit_io_instr Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7255>	2020-10-21 12:00:09 -07:00
Erik Faye-Lund	33ccf0e9bc	nir: drop unused alpha_ref_float Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7251>	2020-10-21 16:33:43 +00:00
Erik Faye-Lund	42ee423e3a	nir: drop support for using load_alpha_ref_float Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7251>	2020-10-21 16:33:43 +00:00
Marek Olšák	233520035a	nir: consider load_color intrinsics as both inputs and sysval in gathering src/mesa expects this somewhere. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6950>	2020-10-21 16:10:08 +00:00
Eric Anholt	fdbc45d1d4	nir: Only validate in passes that might have changed things. If a pass returning boolean progress reports no change, we shouldn't need to re-validate. If a pass breaks the NIR but also fails to report progress correctly, it would be up to the next pass to catch that. This should hopefully help with test timeouts on KHR-GL33.texture_swizzle.functional since switching softpipe to nir-to-tgsi and enabling NIR validation in CI (27s to 20s on my system). Suggested-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7239>	2020-10-21 05:00:17 +00:00
Jason Ekstrand	ef68f740a6	nir/lower_io: Assert non-zero power-of-two alignments The way the ALIGN_POT macro works, an alignment of 0 may cause ALIGN_POT(x, 0) to return 0 for any x. Throw in an assert to guard against this case. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7069>	2020-10-20 23:46:42 +02:00
Eric Anholt	d867e7c974	nir: Add an option to not lower source mods for f64/u64/i64. TGSI can't handle them, but we want to use this pass for nir-to-tgsi. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:54:06 -07:00
Eric Anholt	c730feacc0	nir: Add a call to get a struct describing SSA liveness per instruction. nir-to-tgsi will use this to release temporaries for SSA storage back to ureg's linear register allocation once they're dead. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:54:06 -07:00
Eric Anholt	a206b58157	nir: Add a block start/end ip to live instr index metadata. I wanted it for the per-instruction live intervals metadata, and it's not much to store in general. Make the ip explicitly 32-bit, on suggestion by jekstrand. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:54:06 -07:00
Eric Anholt	2f5d18403a	nir: Replace nir_ssa_def->live_index with nir_instr->index. live_index had two things going on: 0 meant the instr was an undef and always dead, and otherwise ssa defs had increasing numbers by instruction order. We already have a field in the instruction for storing instruction order, and ssa defs don't need that number to be contiguous (if you want a compact per-ssa-def number, use ssa->index after reindexing). We don't use ssa->index for this, because reindexing those would change nir_print, and that would be rude to people trying to track what's happening in optimization passes. This opened up a hole in nir_ssa_def, so we move nir_ssa_def->index toward the end to shrink the struct from 64 bytes to 56. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:54:01 -07:00
Eric Anholt	b6cb184e86	nir: Introduce nir_metadata_instr_index for nir_index_instr() being current. This will be useful to remove the live_index field from nir_ssa_def. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3395>	2020-10-20 08:53:36 -07:00
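
Note on the deref mode helpers (commits 3cc58e6470, 9d377c01d0, d6415b5d2b above): the messages describe deref modes as a bitfield of possible modes, with "may be", "must be" and "is in set" style queries. The following is a minimal standalone sketch of that idea, not the Mesa implementation. The type `mode_mask`, the `MODE_*` constants and the function names are all hypothetical stand-ins chosen for illustration; only the bitfield semantics come from the commit messages.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-ins for variable-mode bits; values are illustrative. */
typedef uint32_t mode_mask;
enum {
   MODE_FUNCTION_TEMP = 1u << 0,
   MODE_SHADER_TEMP   = 1u << 1,
   MODE_MEM_SHARED    = 1u << 2,
   MODE_MEM_GLOBAL    = 1u << 3,
};

/* True if the deref could point into at least one mode in `query`.
 * This is the conservative check for "might this alias?". */
static inline bool mode_may_be(mode_mask deref_modes, mode_mask query)
{
   assert(deref_modes != 0);
   return (deref_modes & query) != 0;
}

/* True only if every mode the deref could have is contained in `query`.
 * This is the check for "is it guaranteed to be one of these?". */
static inline bool mode_must_be(mode_mask deref_modes, mode_mask query)
{
   assert(deref_modes != 0);
   return (deref_modes & ~query) == 0;
}

/* Set-containment check that expects the deref to be all-in or all-out.
 * A partial overlap trips the assert, mirroring the "assert-happy"
 * behaviour the commit message describes for mass lowering. */
static inline bool mode_is_in_set(mode_mask deref_modes, mode_mask set)
{
   bool all_in = mode_must_be(deref_modes, set);
   assert(all_in || !mode_may_be(deref_modes, set));
   return all_in;
}
```

Under these assumptions, a generic pointer whose mask is `MODE_FUNCTION_TEMP | MODE_MEM_SHARED | MODE_MEM_GLOBAL` "may be" shared but does not "must be" shared, which is why a lowering pass in this scheme has to be given the full set of generic pointer modes at once.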

2720 commits
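
Note on commit ef68f740a6 (nir/lower_io: Assert non-zero power-of-two alignments): the message says an alignment of 0 may make ALIGN_POT(x, 0) return 0 for any x. The sketch below shows why, assuming the usual round-up-by-mask definition of such a macro. The functions `align_pot` and `align_pot_checked` are hypothetical helpers written for this illustration and are not the Mesa macro itself.

```c
#include <assert.h>
#include <stdint.h>

/* Round x up to a multiple of pot_align using a mask.
 * Only meaningful when pot_align is a non-zero power of two.
 * With pot_align == 0, (pot_align - 1) wraps to all ones, so the
 * inverted mask is 0 and the result is 0 regardless of x. */
static inline uint32_t align_pot(uint32_t x, uint32_t pot_align)
{
   return (x + pot_align - 1) & ~(pot_align - 1);
}

/* Same computation, guarded by the kind of assert the commit adds:
 * the alignment must be non-zero and a power of two. */
static inline uint32_t align_pot_checked(uint32_t x, uint32_t pot_align)
{
   assert(pot_align != 0 && (pot_align & (pot_align - 1)) == 0);
   return align_pot(x, pot_align);
}
```

With this definition, `align_pot(13, 4)` gives 16 while `align_pot(13, 0)` silently gives 0, which is the failure mode the assert is meant to catch early.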