fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 22:08:10 +02:00

Author	SHA1	Message	Date
Rhys Perry	89b759c4f9	nir/opt_load_store_vectorize: loop internally To vectorize to vec8/16 or vec4 (without vec3), we can't incrementally add components to a load/store. This patch loops vectorization so that two new vec2/4/8 operations can be combined into a larger operation. fossil-db (GFX10.3): Totals from 22 (0.02% of 139391) affected shaders: SpillVGPRs: 1749 -> 1771 (+1.26%) CodeSize: 901212 -> 892532 (-0.96%); split: -1.19%, +0.22% Scratch: 178176 -> 184320 (+3.45%) Instrs: 159358 -> 158027 (-0.84%); split: -0.99%, +0.16% Cycles: 37046772 -> 36738544 (-0.83%); split: -1.00%, +0.17% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10384>	2021-04-21 20:26:58 +00:00
Rhys Perry	447820d003	nir/opt_load_store_vectorize: ignore load_vulkan_descriptor These mess with alignment calculation. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10384>	2021-04-21 20:26:58 +00:00
Rhys Perry	6ca11b4a66	nir/opt_load_store_vectorize: improve handling of swizzles Previously (for simplicity), it could have skipped vectorization if swizzles were involved. fossil-db (GFX10.3): Totals from 498 (0.36% of 139391) affected shaders: SGPRs: 25328 -> 26608 (+5.05%); split: -1.36%, +6.41% VGPRs: 9988 -> 9996 (+0.08%) SpillSGPRs: 40 -> 65 (+62.50%) CodeSize: 1410188 -> 1385584 (-1.74%); split: -1.76%, +0.02% Instrs: 257149 -> 250579 (-2.55%); split: -2.57%, +0.01% Cycles: 1096892 -> 1070600 (-2.40%); split: -2.41%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10384>	2021-04-21 20:26:58 +00:00
Rhys Perry	4df3654c79	nir/load_store_vectorize: assume CAN_REORDER ops don't alias with stores fossil-db (GFX10.3): Totals from 20 (0.01% of 139391) affected shaders: SGPRs: 688 -> 712 (+3.49%); split: -1.16%, +4.65% CodeSize: 35488 -> 34424 (-3.00%); split: -3.04%, +0.05% Instrs: 6405 -> 6259 (-2.28%); split: -2.44%, +0.16% Cycles: 51768 -> 51268 (-0.97%); split: -1.21%, +0.24% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10384>	2021-04-21 20:26:58 +00:00
Mike Blumenkrantz	3ccd0891d3	nir/lower_fragcolor: set outputs_written for fragdata members normal gather_info stuff Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10080>	2021-04-21 19:36:16 +00:00
Jesse Natalie	09440ce3fb	nir: Fix MSVC warning C4334 (32bit shift cast to 64bit) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-By: Bill Kristiansen <billkris@microsoft.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10331>	2021-04-20 00:28:34 +00:00
Alyssa Rosenzweig	899dd8e60a	nir: Update some comments referring to imov This was renamed when I was in high school. I remember updating the Midgard compiler while sitting in AP Physics. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10296>	2021-04-19 20:07:35 +00:00
Danylo Piliaiev	f17b41ab4f	nir: add lowering pass for helperInvocationEXT() Some hardware doesn't have a way to check if invocation was demoted, in such case we have to track it ourselves. OpIsHelperInvocationEXT is specified as: "An invocation is currently a helper invocation if it was originally invoked as a helper invocation or if it has been demoted to a helper invocation by OpDemoteToHelperInvocationEXT." Therefore we: - Set gl_IsHelperInvocationEXT = gl_HelperInvocation - Add "gl_IsHelperInvocationEXT = true" right before each demote - Add "gl_IsHelperInvocationEXT = gl_IsHelperInvocationEXT \|\| condition" right before each demote_if Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9460>	2021-04-19 17:11:36 +00:00
Erik Faye-Lund	7886983835	nir/lower_tex: do not stumble on 16-bit inputs If a has been lowered to float16 here, then we end up trying to construct a vector of mixed precision, which the validator asserts about. So let's make sure we use the same type for all arguments. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10201>	2021-04-19 14:28:05 +00:00
Eric Anholt	5de3cbbb2e	nir: Generate load_ubo_vec4 directly for !PIPE_CAP_NATIVE_INTEGERS The prog_to_nir->NIR-to-TGSI change ended up causing regressions on r300, and svga against r300-class hardware, because nir_lower_uniforms_to_ubo() introduced shifts that nir_lower_ubo_vec4() tried to reverse, but that NIR couldn't prove are no-ops (since shifting up and back down may drop bits), and the hardware can't do the integer ops. Instead, make it so that nir_lower_uniforms_to_ubo can generate nir_intrinsic_load_ubo_vec4 directly for !INTEGER hardware. Fixes: `cf3fc79cd0` ("st/mesa: Replace mesa_to_tgsi() with prog_to_nir() and nir_to_tgsi().") Closes: #4602 Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10194>	2021-04-16 21:58:00 +00:00
Michel Dänzer	2928c21eb7	Convert most remaining free-form fall-through comments to FALLTHROUGH One exception is src/amd/addrlib/, for which -Wimplicit-fallthrough is explicitly disabled. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10220>	2021-04-15 16:01:22 +00:00
Marek Olšák	165a69d2f7	nir: handle mediump varyings in varying compaction helpers Group mediump varyings and don't put 16-bit and 32-bit components in the same vec4. ... and reply to the comment there. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10224>	2021-04-14 01:42:49 +00:00
Alyssa Rosenzweig	5d32cf642f	nir: Add varying precision linking helper (v2) It is useful for the precisions of varyings to match across shader stages at link-time to enable precision lowering optimizations, which would otherwise require costly draw-time fixups. The goal is to enable `producer->precision == consumer->precision` to be an invariant drivers may rely on for linked shaders. v2: keep transform feedback outputs at mediump - mareko Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> (v1) Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Marek Olšák	fb29cef8dd	nir: add many passes that lower and optimize 16-bit input/outputs and samplers Added: * a pass that renumbers bases of IO intrinsics * a pass that converts mediump IO to 16 bits, optionally using the new packed varying slots * a pass that sets (forces) mediump in IO intrinsics (for testing) * a pass that remaps VARYING_SLOT_VAR[0..15]_16BIT to VARYING_SLOT_VAR[0..31] (if some shader stages don't want packed varyings) * a pass that folds type conversions around texture opcodes into those opcodes (e.g. tex(f2f32(coord), ..) is changed into tex accepting f16) * a pass that changes (legalizes) sampler src and dst types based on specified hw constraints (e.g. derivatives must be the same type as coordinates) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Marek Olšák	73f532e5bf	nir: add new VARYING_SLOTs and shader info for packed 16-bit varyings This allows mediump inputs and outputs to be trivially lowered into packed 16-bit varyings where 1 slot is occupied by 2 16-bit vec4s, without any packing instructions in NIR and without any conflicts with 32-bit varyings. The only thing that is changed is IO semantics in intrinsics to get packed 16-bit varyings. This simplifies supporting 16-bit types for drivers that have 32-bit slots everywhere except the fragment shader where they can do 16-bit interpolation on either the low or high half of each slot. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Marek Olšák	5f7c7c9a7f	nir: add src and dest types to all IO loads and stores for mediump Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Jesse Natalie	4b69ae8e1e	nir_opt_deref: ptr_as_array(deref_cast<T*>(x))[0] isn't the same as x[0] if the cast has alignment This breaks CLOn12's handling of CL CTS test_basic vector_creation for char3 (at least). Removing this cast causes us to try to load from a deref with no alignment info. Fixes: `99bb2a4d` ("nir/opt_deref: Don't remove casts with alignment information") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10165>	2021-04-13 03:40:23 +00:00
Rhys Perry	e9dc3df868	nir/loop_unroll: fix is_indirect_load() with load_global load_global only has one source. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `dfe429eb41` ("nir/loop_unroll: unroll more aggressively if it can improve load scheduling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10186>	2021-04-12 20:28:57 +00:00
Rhys Perry	0f2bf55c7e	nir/lcssa: fix nondeterminism in predecessor iteration set_foreach()'s order on a list of nir_block * isn't deterministic, so we need to sort the predecessor list. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3364>	2021-04-12 18:17:19 +00:00
Rhys Perry	7050896be0	nir: add nir_block_get_predecessors_sorted() helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3364>	2021-04-12 18:17:19 +00:00
Rhys Perry	254360d96c	nir/lower_idiv: make lowered divisions exact I can't imagine any reasonable optimization which could break this, but since it's lowered from an integer instructions, we shouldn't do anything which could change the result. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10081>	2021-04-12 16:19:46 +00:00
Rhys Perry	a2619b97f5	nir/lower_idiv: add options to use fp32 for 8-bit division lowering Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10081>	2021-04-12 16:19:46 +00:00
Jesse Natalie	3c8bcdc863	nir: Add a new opcode for [un]packing doubles HLSL doesn't support bitcasting a 64bit integer to a double. DXIL doesn't have generic pack/unpack instructions, so we lower those to integer bitwise ops. As a result, NIR generic double pack/unpack would require our backend to emit a bitcast to get a double, but we want to match HLSL semantics and emit MakeDouble/SplitDouble. Adding a dedicated opcode for double pack/unpack allows us to add a pass to emit that instead, which lets our backend emit the right instruction to pack and unpack doubles. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10063>	2021-04-09 01:54:33 +00:00
Rhys Perry	5f62083c26	nir/gather_info: fix partial masking of compact I/O with location_frac!=0 nir_lower_clip_cull_distance_arrays() can create compact variables with location_frac!=0. Fixes: `cc7a187411` ("nir/gather_info: implement partial masking of struct and compact I/O") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4554 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10002>	2021-04-08 16:39:48 +00:00
Bas Nieuwenhuizen	edb89e7c4d	nir: Do not reset shared_size in nir_lower_io. I'd like to use raw shared intrinsics already for some raytracing stuff before this pass gets called and this was a real pitfall. This mirrors scratch_size and constant_data_size. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Bas Nieuwenhuizen	4ca4de50f7	nir: Remove nir_shader->shared_size. The same info is in shader_info. Dedupe. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Bas Nieuwenhuizen	580f1ac473	nir: Extract shader_info->cs.shared_size out of union. It is valid for all stages, just 0 for most of them. In particular mesh/task shaders might be using it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Bas Nieuwenhuizen	84e0f6dbd8	nir: Fix shader calls with nir_opt_dead_write_vars. Fixes: `5a28893279` ("spirv,nir: Add ray-tracing intrinsics") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10096>	2021-04-08 11:10:52 +00:00
Alyssa Rosenzweig	1286e73c2c	nir/lower_idiv: Add 8-bit and 16-bit lowering path Roundtrip to a larger float and divide there. The extra details for mod/rem are handled directly in integer space to simplify verification of rounding details. The one issue is that the mantissa might be rounded down which will cause issues; adding 1 unconditionally (proposed by Jonathan Marek) fixes this. The lowerings here were tested exhaustively on all pairs of 16-bit integers. v2: Update idiv lowering per Rhys Perry's comment. v3: Rewrite lowerings. v4: Remove useless ftrunc, fix 8-bit issue, simplify code. v5: Remove useless ffloor Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Danylo Piliaiev <dpiliaiev@igalia.com> Tested-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Alyssa Rosenzweig	e91dec1327	nir/lower_idiv: Factor out numer/denom load No need to duplicate across paths. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Alyssa Rosenzweig	7b0eb4aa00	nir/lower_idiv: Convert to lower_instructions Helps deduplicate some code between the two lowering paths. In particular, it ports the missing 32-bit? check to the precise pass. This does not change anything immediately: drivers depending on this to lower 16-bit did not work before due to type mismatches and will not work now since it'll refuse to lower. But that means sub-32-bit idiv can be lowered more efficiently in an algebraic pass. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Alyssa Rosenzweig	e4da24bd24	nir: Add {i2f, u2f, f2i, f2u} helpers Convenient for bitsize independent lowerings, will be used in the idiv lowering. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Alyssa Rosenzweig	6b19711645	nir: Add nir_type_convert Generalizes nir_convert_to_bit_size, which we implement as a special-case. v2: Take a sized dest type but allow unsized or sized source to address Jason's feedback. Shorten name. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Rhys Perry	292ac71a4a	nir/lower_tex: handle deref casts A RDR2 shader has a undef->texture cast which is eventually optimized out. Without handling NULL from nir_deref_instr_get_variable(), compiling this shader will result in a crash. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Fixes: `bc438c91d9` ("nir/lower_tex: ignore texture_index if tex_instr has deref src") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10038>	2021-04-06 08:35:39 +00:00
Pierre-Eric Pelloux-Prayer	bc438c91d9	nir/lower_tex: ignore texture_index if tex_instr has deref src texture_index is meaningless when a tex_instr has deref src. Use var->data.binding instead. This fixes the incorrect lowering on radeonsi where the same lowering steps was applied to all tex_instr based on the needs of the first one (since texture_index is always 0). CC: mesa-stable Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9931>	2021-04-05 10:14:07 +02:00
Rhys Perry	cc7a187411	nir/gather_info: implement partial masking of struct and compact I/O fossil-db (Sienna): Totals from 138 (0.10% of 138791) affected shaders: CodeSize: 504060 -> 482136 (-4.35%) Instrs: 97318 -> 94518 (-2.88%) Cycles: 389272 -> 378072 (-2.88%) VMEM: 14397 -> 14614 (+1.51%); split: +1.76%, -0.25% SMEM: 9088 -> 9024 (-0.70%) VClause: 2915 -> 2430 (-16.64%) SClause: 1790 -> 1791 (+0.06%) PreVGPRs: 5013 -> 4998 (-0.30%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8364>	2021-04-01 10:15:44 +00:00
Alyssa Rosenzweig	8578adeaa6	nir: Unify memory atomics Avoids some copypaste and makes it easier to see how the different types relate. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8847>	2021-03-30 00:11:01 +00:00
Eric Anholt	683d3972a6	nir: Update clip_distance_array_size in clip lowering. If we've added the array, then we should update the info. This is the value that gallium drivers setting !PIPE_CAP_CLIP_PLANES have to use in place of rasterizer->clip_planes_enabled. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9815>	2021-03-26 20:51:18 +00:00
Danylo Piliaiev	2bff8fd53b	nir: add nir_shader_as_str function It would be later used by Turnip in implementation of VK_KHR_pipeline_executable_properties. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8877>	2021-03-25 13:53:33 +00:00
Mike Blumenkrantz	6900498faa	nir: add nir_lower_indirect_builtin_uniform_derefs() this is a special version of indirect deref lowering which is used by mesa/st to remove dynamic indexing from builtin uniforms for the lowering pass in non-packed uniform case Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9741>	2021-03-23 14:44:48 +00:00
Gert Wollny	318701b803	nir: Add r600 specific sin and cos variants r600 expect the input values to be normalited by divinding by 2 *PI, so add an opcode to be able to lower this in nir. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9452>	2021-03-22 15:19:46 +01:00
Gert Wollny	0f5b3c37c5	nir: Add opcodes for fused comp + csel and optimizations Some backends, like r600 support a fused version of int and float compare against zero and and csel. Adding these opcodes here makes it possible to optimize this in nir. v2: Add rules for float compare + csel Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9452>	2021-03-22 15:19:46 +01:00
Gert Wollny	a5747f8ab3	nir: add opcodes for find_msb_rev and lowering Some hardware supports a version of find_msb where the bits are counted starting at the high bit, and this needs some lowering to obtain the value that is expected by find_msb Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9452>	2021-03-22 15:19:46 +01:00
Dave Airlie	48080e5bdf	nir: lower 64-bit floats to 32-bit first. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9643>	2021-03-22 12:17:14 +10:00
Dave Airlie	01dfd65a2d	nir: port fp16 casting code from dxil This moves the dxil pass to common code and makes dxil use the new code. Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9643>	2021-03-22 12:16:59 +10:00
Jesse Natalie	55d153b9f5	nir: Temporarily disable optimizations for MSVC ARM64 There's currently an MSVC optimizer bug which causes a stack overflow in the compiler if it attempts to optimize fsat. Acked-by: Rob Clark <robdclark@chromium.org> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9700>	2021-03-21 21:41:41 +00:00
Jason Ekstrand	1ba9c262fd	nir: Add image atomic_fmin/fmax intrinsics Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8750>	2021-03-18 00:13:40 +00:00
Caio Marcelo de Oliveira Filho	302183d635	nir: Handle deref_atomic_fadd in a couple of passes Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8750>	2021-03-18 00:13:40 +00:00
Jason Ekstrand	4079279051	anv/apply_pipeline_layout: Add support for A64 descriptor access Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8635>	2021-03-17 17:49:59 +00:00
Jason Ekstrand	c8748771bb	nir/lower_io: Support global addresses for UBOs in nir_lower_explicit_io For nir_address_format_64bit_global_32bit_offset and nir_address_format_64bit_bounded_global, we use a new intrinsics which take the base address and offset as separate parameters. For bounds- checked access, the bound is also included in the intrinsic. This gives the drive more control over the bounds checking so that UBOs don't suddenly become massively more expensive. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8635>	2021-03-17 17:49:59 +00:00

1 2 3 4 5 ...

3080 commits