fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 07:38:14 +02:00

Author	SHA1	Message	Date
Erik Faye-Lund	b9c61379ab	microsoft/compiler: translate nir to dxil Here's the code to emit DXIL code from NIR. It's big and bulky as-is, and it needs to be split up a bit. This is the combination of a lot of commits from our development branch, containing code by several authors. Co-authored-by: Bill Kristiansen <billkris@microsoft.com> Co-authored-by: Boris Brezillon <boris.brezillon@collabora.com> Co-authored-by: Daniel Stone <daniels@collabora.com> Co-authored-by: Gert Wollny <gert.wollny@collabora.com> Co-authored-by: Jesse Natalie <jenatali@microsoft.com> Co-authored-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7477>	2020-11-10 15:37:07 +00:00
Erik Faye-Lund	10dfd3cfb4	compiler: add SYSTEM_BIT_FRONT_FACE Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7477>	2020-11-10 15:37:07 +00:00
Gert Wollny	449c4baf50	nir/print: print GS extra info Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7477>	2020-11-10 15:37:07 +00:00
Samuel Pitoiset	1c5271346a	nir/algebraic: optimize bitfield_select(a, b, 0) to iand(a, b) (src0 & src1) \| (~src0 & src2) to (src0 & src1). fossils-db (Polaris10): Totals from 873 (0.63% of 138014) affected shaders: SGPRs: 33781 -> 33733 (-0.14%) VGPRs: 37704 -> 37520 (-0.49%); split: -0.51%, +0.02% CodeSize: 3861460 -> 3853424 (-0.21%); split: -0.21%, +0.00% MaxWaves: 5306 -> 5305 (-0.02%) Instrs: 743798 -> 743486 (-0.04%); split: -0.04%, +0.00% Cycles: 10962244 -> 10960936 (-0.01%); split: -0.01%, +0.00% VMEM: 128309 -> 128350 (+0.03%); split: +0.33%, -0.30% SMEM: 44797 -> 44113 (-1.53%); split: +0.02%, -1.54% Copies: 71875 -> 71674 (-0.28%); split: -0.31%, +0.03% PreSGPRs: 23484 -> 23479 (-0.02%) PreVGPRs: 34582 -> 34529 (-0.15%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7479>	2020-11-09 19:51:27 +00:00
Jason Ekstrand	f95665cfeb	nir/lower_bit_size: Add support for lowering subgroup ops Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7482>	2020-11-09 18:58:51 +00:00
Jason Ekstrand	2c4b47184d	nir/lower_bit_size: Pass a nir_instr to the callback This way we can start supporting more than just ALU ops. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7482>	2020-11-09 18:58:51 +00:00
Jason Ekstrand	15c6e05a72	nir/lower_bit_size: Don't cast comparison results Some ALU ops (comparisons being the primary example) have a fixed bit-size destination and, in that case, we don't want to insert a conversion on the destination. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7482>	2020-11-09 18:58:51 +00:00
Jason Ekstrand	2bbe01b186	spirv: Add support for SPV_EXT_shader_image_atomic_int64 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7509>	2020-11-09 17:17:40 +00:00
Jason Ekstrand	5a3e22018d	nir: Allow 64-bit image atomics Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7509>	2020-11-09 17:17:39 +00:00
Jason Ekstrand	79f477c3c6	compiler/types: Add 64-bit image types Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7509>	2020-11-09 17:17:39 +00:00
Jason Ekstrand	b725fbd191	nir: Validate image atomic formats GLSL requires that image atomics have formats and there are rules about things matching properly. We should enforce those in NIR unless we have reason to do otherwise. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7509>	2020-11-09 17:17:39 +00:00
Jason Ekstrand	72f1c9aef5	nir: Print formats on image intrinsics as text Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7509>	2020-11-09 17:17:39 +00:00
Jason Ekstrand	d22fafa20b	spirv: Update headers and metadata from latest Khronos commit This corresponds to 5ab5c96198f30804a6a29961b8905f292a8ae600 ("Reserve additional loop control bit for Intel extension (NoFusionINTEL) (#175)") in https://github.com/KhronosGroup/SPIRV-Headers. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7509>	2020-11-09 17:17:39 +00:00
Erik Faye-Lund	92374aebe2	spirv: correct sematic-typo Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7504>	2020-11-09 16:31:02 +00:00
Rhys Perry	a0b42da0a2	spirv: fix GLSLstd450Modf/GLSLstd450Frexp when the destination is vector We can't write to an individual component in a function_temp vector, so we have to use vtn_variable_store() which does a load+insert+store. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3484 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6231>	2020-11-06 17:21:16 +00:00
Daniel Schürmann	f0a88dbefa	nir/lcssa: consider loops with no back-edge invariant Polaris: Totals from 6233 (4.52% of 138014) affected shaders: SpillSGPRs: 47860 -> 48976 (+2.33%) CodeSize: 69764704 -> 69120700 (-0.92%); split: -0.97%, +0.04% Instrs: 13801184 -> 13594107 (-1.50%) Cycles: 1628800928 -> 1516137888 (-6.92%) VMEM: 910459 -> 910208 (-0.03%); split: +0.00%, -0.03% SMEM: 436625 -> 435194 (-0.33%); split: +0.06%, -0.38% SClause: 534750 -> 534620 (-0.02%); split: -0.03%, +0.00% Copies: 1587121 -> 1542867 (-2.79%); split: -2.81%, +0.03% Branches: 545016 -> 509354 (-6.54%) PreSGPRs: 618545 -> 619354 (+0.13%); split: -0.09%, +0.22% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5924>	2020-11-06 15:56:18 +00:00
Samuel Pitoiset	77d6fda0f5	nir/algebraic: distribute imul(iadd(a, b), c) when b and c are constants This distributes imul(iadd(a, b), c) to iadd(imul(a, c), b * c) when both b and c are constants. This might allow some compiler backends to create more MADs. For ACO, this allows to combine more DS additions. fossilds-db (Vega10): Totals from 673 (0.49% of 136546) affected shaders: VGPRs: 44548 -> 44516 (-0.07%); split: -0.11%, +0.04% CodeSize: 8301552 -> 8286220 (-0.18%); split: -0.19%, +0.01% MaxWaves: 2731 -> 2735 (+0.15%); split: +0.26%, -0.11% Instrs: 1642684 -> 1638725 (-0.24%); split: -0.24%, +0.00% Cycles: 20846156 -> 20793444 (-0.25%); split: -0.25%, +0.00% VMEM: 108870 -> 108106 (-0.70%); split: +0.03%, -0.73% SMEM: 35718 -> 35674 (-0.12%); split: +0.22%, -0.34% VClause: 20603 -> 20622 (+0.09%); split: -0.01%, +0.10% SClause: 48527 -> 48539 (+0.02%) Copies: 156735 -> 156742 (+0.00%); split: -0.05%, +0.05% PreSGPRs: 43169 -> 43166 (-0.01%); split: -0.02%, +0.02% PreVGPRs: 41369 -> 41330 (-0.09%) shader-db results on Intel: Ice Lake total instructions in shared programs: 20027588 -> 20027446 (<.01%) instructions in affected programs: 71766 -> 71624 (-0.20%) helped: 70 HURT: 0 helped stats (abs) min: 1 max: 7 x̄: 2.03 x̃: 1 helped stats (rel) min: 0.10% max: 2.50% x̄: 0.29% x̃: 0.15% 95% mean confidence interval for instructions value: -2.42 -1.64 95% mean confidence interval for instructions %-change: -0.38% -0.20% Instructions are helped. total cycles in shared programs: 977525222 -> 977494323 (<.01%) cycles in affected programs: 8884593 -> 8853694 (-0.35%) helped: 56 HURT: 16 helped stats (abs) min: 2 max: 7852 x̄: 681.29 x̃: 400 helped stats (rel) min: <.01% max: 19.84% x̄: 2.79% x̃: 0.41% HURT stats (abs) min: 2 max: 1212 x̄: 453.31 x̃: 120 HURT stats (rel) min: 0.05% max: 1.09% x̄: 0.32% x̃: 0.11% 95% mean confidence interval for cycles value: -802.75 -55.56 95% mean confidence interval for cycles %-change: -3.19% -1.01% Cycles are helped. total sends in shared programs: 1032273 -> 1032272 (<.01%) sends in affected programs: 41 -> 40 (-2.44%) helped: 1 HURT: 0 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7445>	2020-11-06 07:49:02 +00:00
Dave Airlie	9790fdf2ce	vtn/opencl: add ctz support ctz is a CL2.0 opcode but 3.0 requires it as well so just add support for it. Tested against CTS integer_ops integer_ctz test. (long line broken up) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7468>	2020-11-06 17:03:05 +10:00
Jason Ekstrand	03683b9b2e	nir: Handle ray-tracing intrinsics and storage classes in copy-prop etc. We need to consider shader calls as potential writes to their payloads. For other ray-tracing intrinsics, we may not have a shader payload pointer and have to treat them more like a barrier. We also need to ensure that global and SSBO reads/writes aren't propagated across shader call intrinsics. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Jason Ekstrand	5a28893279	spirv,nir: Add ray-tracing intrinsics Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Jason Ekstrand	21b1b91549	nir,spirv: Add support for the ShaderCallKHR scope It's currently entirely trivial. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Jason Ekstrand	6b8fd65e84	spirv: Implement the new ray-tracing storage classes The SPV_KHR_ray_tracing extension adds 6 new storage classes which is a bit on the ridiculous side. In order to avoid adding that many variable modes to NIR, we make a few simplifying assumptions: 1. CallableData and RayPayload data actually lives on the stack somewhere, presumably in the caller's stack. We assume that these are no different from global variables and use nir_var_shader_temp for them. We still need a separate storage class for the incoming variants but only so we can figure out which one the incoming one is and lower it to something useful. 2. There's no difference between incoming CallableData and RayPaolad data. We can use a single storage class for both. 3. ShaderRecordBuffer data is just a global memory access. This lets us avoid NIR variables entirely and just fetch the pointer via the shader_record_ptr system value and it's accessed using a 64-bit global memory pointer. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Jason Ekstrand	84a8ca1db8	nir: Add new variable modes for ray-tracing If we were desperate to reduce bits, we could probably also use shader_in/out for hit attributes as they really are an output from intersection shaders and read-only in any-hit and closest-hit shaders. However, other passes such as nir_gether_info like to assume that anything with nir_var_shader_in/out is indexed using vec4 locations for interface matching. It's easier to just add a new variable mode. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Jason Ekstrand	aa4ea9c7ea	nir: Add intrinsics for object to/from world RT sysvals These are a bit more tricky than most because they're matrix system values. We make the intentional choice here to not bother with allowing indirect addressing of columns for these. Since they're system values, they may be magically constructed somehow or come from weird hardware so it's easier on back-ends to just handle any indirects with bcsel. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Jason Ekstrand	07635a3284	nir/builder: Add a select_from_ssa_def_array helper This is an operation we have to do already for nir_vector_extract and I'm about to do something very similar for matrix columns. Having a more generic helper is useful. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Jason Ekstrand	46cd91bb45	spirv,nir: Add support for ray-tracing built-ins Missing in this commit are NIR intrinsics for the ObjectToWorld and WorldToObject built-ins. Those are matrices and so they take a bit more work and justify a separate commit. For now, we add the enums and leave the SYSTEM_VALUE <-> nir_intrinsic conversion commented out. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:46 +00:00
Jason Ekstrand	ed907e5d84	spirv: Add support for OpTypeAccelerationStructureKHR For now, we assume its a 64-bit global pointer. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:45 +00:00
Jason Ekstrand	2df055ab47	spirv: Pass the deref type to storage_class_to_mode for non-forward pointers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:45 +00:00
Jason Ekstrand	2bef02696d	spirv: Add a guard for OpTypeForwardPointer storage classes Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:45 +00:00
Jason Ekstrand	aabe37b969	spirv: Remove a redundant vtn_fail_if We already fail in these same cases in vk_desc_type_for_mode. These additional assertions are just extra code to update. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:45 +00:00
Caio Marcelo de Oliveira Filho	3bcebe1b27	spirv: Add Ray Tracing execution models Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:45 +00:00
Jason Ekstrand	d8dbdf20de	spirv: Add basic plumbing for ray-tracing capabilities Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:45 +00:00
Caio Marcelo de Oliveira Filho	f65182931d	compiler: Add new Vulkan shader stages This particular ordering makes them conveniently match VkShaderStageFlagBits, which is a property we already take advantage of in the previous shader stages. Abbreviations are based on the ones used in glslangValidator. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6479>	2020-11-05 23:36:45 +00:00
Mike Blumenkrantz	0b0f152c54	nir/clip_disable: handle 2x vec4 case some drivers may have pre-lowered gl_ClipDistance to 2x vec4 to match hw usage, so for those cases we'll be getting deref_var here and then components will be stored to the deref at some point fixes mesa/mesa#3480 Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6563>	2020-11-05 21:32:27 +00:00
Mike Blumenkrantz	5e43ba39e1	nir/clip_disable: try for better no-op we can just check the bits using clip_distance_array_size here to simplify everything and more easily determine if we need to be running this pass Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6563>	2020-11-05 21:32:27 +00:00
Mike Blumenkrantz	1d23a88c6e	nir/clip_disable: write 0s instead of undefs for disabled clip planes this should yield more reliable and ideally even correct results Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6563>	2020-11-05 21:32:27 +00:00
Jason Ekstrand	61d2badbf4	nir/deref: Fix a typo Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3754 Fixes: `df51518dc5` "nir/opt_deref: Add a deref mode specialization..." Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7459>	2020-11-05 16:31:25 +00:00
Caio Marcelo de Oliveira Filho	eb03f29655	spirv: Implement SpvCapabilitySubgroupBufferBlockIOINTEL Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7448>	2020-11-04 20:24:48 +00:00
Caio Marcelo de Oliveira Filho	dd39e311b3	nir: Add nir_intrinsic_{load,store}_deref_block_intel Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7448>	2020-11-04 20:24:48 +00:00
Caio Marcelo de Oliveira Filho	b86ce274f9	spirv: Implement SpvCapabilitySubgroupShuffleINTEL from SPV_INTEL_subgroups Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7448>	2020-11-04 20:24:48 +00:00
Alyssa Rosenzweig	a05921b9f2	nir: Add SRC_TYPE to store_combined_output_pan Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7446>	2020-11-04 11:21:08 -05:00
Rhys Perry	475077c790	nir/lower_bit_size: optimize upcast of b2i8/b2i16 This also seems to be done by nir_opt_algebraic, but RADV will be moving nir_lower_bit_size() to after that (so it doesn't create unsupported 8/16-bit instructions) and it doesn't seem worth creating a new pass just for this simple optimization. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4791>	2020-11-04 11:50:37 +00:00
Rhys Perry	4e5c85526b	nir: add shader_info::bit_sizes_used Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4791>	2020-11-04 11:50:37 +00:00
Jason Ekstrand	58e7088628	nir/find_array_copies: Don't assume all children exist Fixes: `9f3c595dfc` "nir/find_array_copies: Handle cast derefs" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7437>	2020-11-04 05:57:07 +00:00
Jason Ekstrand	4ff4d4e569	nir/opt_intrinsic: Optimize bcsel(b, shuffle(x, i), shuffle(x, j)) The shuffles provided by the SPV_INTEL_subgroups extension generate bcsel(b, shuffle(x, i), shuffle(y, j)) In the case where x and y are the same, we can turn this into a shuffle with the bcsel on the index which lets us drop a whole shuffle. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7366>	2020-11-03 16:51:26 -06:00
Jason Ekstrand	2f5b56ae23	nir/opt_intrinsics: Refactor a bit Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7366>	2020-11-03 16:51:26 -06:00
Jason Ekstrand	3b281861c1	nir/constant_folding: Fold subgroup shuffle intrinsics Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7366>	2020-11-03 16:51:26 -06:00
Jason Ekstrand	e59d6350d1	nir: Move constant folding of vote to opt_constant_folding Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7366>	2020-11-03 16:51:26 -06:00
Jason Ekstrand	9492ab2864	nir/constant_folding: Use the standard variable naming convention Typically, if we have one alu instruction, we call it "alu" and if we have one intrinsic we call it "intrin". Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7366>	2020-11-03 16:51:26 -06:00
Jason Ekstrand	9d2ccbfc15	nir/constant_folding: Use a switch in try_fold_intrinsic Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7366>	2020-11-03 16:51:26 -06:00

1 2 3 4 5 ...

5629 commits