fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 04:38:09 +02:00

Author	SHA1	Message	Date
Connor Abbott	77fcb01f7f	nir/lower_clip_disable: Fix store writemask We're storing into the array element, not the whole variable. Fixes: `fb2fe80` ("nir: add lowering pass for clip plane enabling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7274>	2021-04-26 17:07:02 +00:00
Jesse Natalie	2775b9139b	nir_lower_readonly_images_to_tex: Use nir_shader_lower_instructions Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10356>	2021-04-23 23:16:15 +00:00
Jesse Natalie	fa677c8644	nir_lower_readonly_images_to_tex: Support non-CL semantics For non-CL, intrinsic access isn't set, because the image type doesn't have access qualifier. Instead, the access qualifier is set on the variable. So, add a mode to this pass which can chase back to the variable in addition to the intrinsic access. Also, update the variable type and the deref chain types so everything is consistent, that the tex is accessing a sampler. Note we can't do this for CL, because void-typed samplers don't exist. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10356>	2021-04-23 23:16:15 +00:00
Jesse Natalie	29c9731400	nir: Rename nir_lower_cl_images_to_tex, replace 'cl' with 'readonly' Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10356>	2021-04-23 23:16:15 +00:00
Alyssa Rosenzweig	c84804f167	nir/lower_fragcolor: Take max cbufs as argument One step closer to generalizing this pass to more drivers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10411>	2021-04-23 17:20:43 +00:00
Alyssa Rosenzweig	73eb497b86	nir/lower_fragcolor: Fix driver_location assignment Fixes crash in dEQP-GLES31.functional.shaders.framebuffer_fetch.basic.last_frag_data when using this pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10411>	2021-04-23 17:20:43 +00:00
Alyssa Rosenzweig	0f4ba349e9	nir/lower_fragcolor: Handle fp16 outputs Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10391>	2021-04-21 22:17:28 +00:00
Alyssa Rosenzweig	49c6157b15	nir/lower_fragcolor: Use shader_instructions_pass While I was in the area. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10391>	2021-04-21 22:17:28 +00:00
Rhys Perry	89b759c4f9	nir/opt_load_store_vectorize: loop internally To vectorize to vec8/16 or vec4 (without vec3), we can't incrementally add components to a load/store. This patch loops vectorization so that two new vec2/4/8 operations can be combined into a larger operation. fossil-db (GFX10.3): Totals from 22 (0.02% of 139391) affected shaders: SpillVGPRs: 1749 -> 1771 (+1.26%) CodeSize: 901212 -> 892532 (-0.96%); split: -1.19%, +0.22% Scratch: 178176 -> 184320 (+3.45%) Instrs: 159358 -> 158027 (-0.84%); split: -0.99%, +0.16% Cycles: 37046772 -> 36738544 (-0.83%); split: -1.00%, +0.17% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10384>	2021-04-21 20:26:58 +00:00
Rhys Perry	447820d003	nir/opt_load_store_vectorize: ignore load_vulkan_descriptor These mess with alignment calculation. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10384>	2021-04-21 20:26:58 +00:00
Rhys Perry	6ca11b4a66	nir/opt_load_store_vectorize: improve handling of swizzles Previously (for simplicity), it could have skipped vectorization if swizzles were involved. fossil-db (GFX10.3): Totals from 498 (0.36% of 139391) affected shaders: SGPRs: 25328 -> 26608 (+5.05%); split: -1.36%, +6.41% VGPRs: 9988 -> 9996 (+0.08%) SpillSGPRs: 40 -> 65 (+62.50%) CodeSize: 1410188 -> 1385584 (-1.74%); split: -1.76%, +0.02% Instrs: 257149 -> 250579 (-2.55%); split: -2.57%, +0.01% Cycles: 1096892 -> 1070600 (-2.40%); split: -2.41%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10384>	2021-04-21 20:26:58 +00:00
Rhys Perry	4df3654c79	nir/load_store_vectorize: assume CAN_REORDER ops don't alias with stores fossil-db (GFX10.3): Totals from 20 (0.01% of 139391) affected shaders: SGPRs: 688 -> 712 (+3.49%); split: -1.16%, +4.65% CodeSize: 35488 -> 34424 (-3.00%); split: -3.04%, +0.05% Instrs: 6405 -> 6259 (-2.28%); split: -2.44%, +0.16% Cycles: 51768 -> 51268 (-0.97%); split: -1.21%, +0.24% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10384>	2021-04-21 20:26:58 +00:00
Mike Blumenkrantz	3ccd0891d3	nir/lower_fragcolor: set outputs_written for fragdata members normal gather_info stuff Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10080>	2021-04-21 19:36:16 +00:00
Jesse Natalie	09440ce3fb	nir: Fix MSVC warning C4334 (32bit shift cast to 64bit) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-By: Bill Kristiansen <billkris@microsoft.com> Cc: mesa-stable@lists.freedesktop.org Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10331>	2021-04-20 00:28:34 +00:00
Alyssa Rosenzweig	899dd8e60a	nir: Update some comments referring to imov This was renamed when I was in high school. I remember updating the Midgard compiler while sitting in AP Physics. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10296>	2021-04-19 20:07:35 +00:00
Danylo Piliaiev	f17b41ab4f	nir: add lowering pass for helperInvocationEXT() Some hardware doesn't have a way to check if invocation was demoted, in such case we have to track it ourselves. OpIsHelperInvocationEXT is specified as: "An invocation is currently a helper invocation if it was originally invoked as a helper invocation or if it has been demoted to a helper invocation by OpDemoteToHelperInvocationEXT." Therefore we: - Set gl_IsHelperInvocationEXT = gl_HelperInvocation - Add "gl_IsHelperInvocationEXT = true" right before each demote - Add "gl_IsHelperInvocationEXT = gl_IsHelperInvocationEXT \|\| condition" right before each demote_if Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9460>	2021-04-19 17:11:36 +00:00
Erik Faye-Lund	7886983835	nir/lower_tex: do not stumble on 16-bit inputs If a has been lowered to float16 here, then we end up trying to construct a vector of mixed precision, which the validator asserts about. So let's make sure we use the same type for all arguments. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10201>	2021-04-19 14:28:05 +00:00
Eric Anholt	5de3cbbb2e	nir: Generate load_ubo_vec4 directly for !PIPE_CAP_NATIVE_INTEGERS The prog_to_nir->NIR-to-TGSI change ended up causing regressions on r300, and svga against r300-class hardware, because nir_lower_uniforms_to_ubo() introduced shifts that nir_lower_ubo_vec4() tried to reverse, but that NIR couldn't prove are no-ops (since shifting up and back down may drop bits), and the hardware can't do the integer ops. Instead, make it so that nir_lower_uniforms_to_ubo can generate nir_intrinsic_load_ubo_vec4 directly for !INTEGER hardware. Fixes: `cf3fc79cd0` ("st/mesa: Replace mesa_to_tgsi() with prog_to_nir() and nir_to_tgsi().") Closes: #4602 Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10194>	2021-04-16 21:58:00 +00:00
Michel Dänzer	2928c21eb7	Convert most remaining free-form fall-through comments to FALLTHROUGH One exception is src/amd/addrlib/, for which -Wimplicit-fallthrough is explicitly disabled. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10220>	2021-04-15 16:01:22 +00:00
Marek Olšák	165a69d2f7	nir: handle mediump varyings in varying compaction helpers Group mediump varyings and don't put 16-bit and 32-bit components in the same vec4. ... and reply to the comment there. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10224>	2021-04-14 01:42:49 +00:00
Alyssa Rosenzweig	5d32cf642f	nir: Add varying precision linking helper (v2) It is useful for the precisions of varyings to match across shader stages at link-time to enable precision lowering optimizations, which would otherwise require costly draw-time fixups. The goal is to enable `producer->precision == consumer->precision` to be an invariant drivers may rely on for linked shaders. v2: keep transform feedback outputs at mediump - mareko Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> (v1) Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Marek Olšák	fb29cef8dd	nir: add many passes that lower and optimize 16-bit input/outputs and samplers Added: * a pass that renumbers bases of IO intrinsics * a pass that converts mediump IO to 16 bits, optionally using the new packed varying slots * a pass that sets (forces) mediump in IO intrinsics (for testing) * a pass that remaps VARYING_SLOT_VAR[0..15]_16BIT to VARYING_SLOT_VAR[0..31] (if some shader stages don't want packed varyings) * a pass that folds type conversions around texture opcodes into those opcodes (e.g. tex(f2f32(coord), ..) is changed into tex accepting f16) * a pass that changes (legalizes) sampler src and dst types based on specified hw constraints (e.g. derivatives must be the same type as coordinates) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Marek Olšák	73f532e5bf	nir: add new VARYING_SLOTs and shader info for packed 16-bit varyings This allows mediump inputs and outputs to be trivially lowered into packed 16-bit varyings where 1 slot is occupied by 2 16-bit vec4s, without any packing instructions in NIR and without any conflicts with 32-bit varyings. The only thing that is changed is IO semantics in intrinsics to get packed 16-bit varyings. This simplifies supporting 16-bit types for drivers that have 32-bit slots everywhere except the fragment shader where they can do 16-bit interpolation on either the low or high half of each slot. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Marek Olšák	5f7c7c9a7f	nir: add src and dest types to all IO loads and stores for mediump Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9050>	2021-04-13 05:07:42 +00:00
Jesse Natalie	4b69ae8e1e	nir_opt_deref: ptr_as_array(deref_cast<T*>(x))[0] isn't the same as x[0] if the cast has alignment This breaks CLOn12's handling of CL CTS test_basic vector_creation for char3 (at least). Removing this cast causes us to try to load from a deref with no alignment info. Fixes: `99bb2a4d` ("nir/opt_deref: Don't remove casts with alignment information") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10165>	2021-04-13 03:40:23 +00:00
Rhys Perry	e9dc3df868	nir/loop_unroll: fix is_indirect_load() with load_global load_global only has one source. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `dfe429eb41` ("nir/loop_unroll: unroll more aggressively if it can improve load scheduling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10186>	2021-04-12 20:28:57 +00:00
Rhys Perry	0f2bf55c7e	nir/lcssa: fix nondeterminism in predecessor iteration set_foreach()'s order on a list of nir_block * isn't deterministic, so we need to sort the predecessor list. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3364>	2021-04-12 18:17:19 +00:00
Rhys Perry	7050896be0	nir: add nir_block_get_predecessors_sorted() helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3364>	2021-04-12 18:17:19 +00:00
Rhys Perry	254360d96c	nir/lower_idiv: make lowered divisions exact I can't imagine any reasonable optimization which could break this, but since it's lowered from an integer instructions, we shouldn't do anything which could change the result. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10081>	2021-04-12 16:19:46 +00:00
Rhys Perry	a2619b97f5	nir/lower_idiv: add options to use fp32 for 8-bit division lowering Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10081>	2021-04-12 16:19:46 +00:00
Jesse Natalie	3c8bcdc863	nir: Add a new opcode for [un]packing doubles HLSL doesn't support bitcasting a 64bit integer to a double. DXIL doesn't have generic pack/unpack instructions, so we lower those to integer bitwise ops. As a result, NIR generic double pack/unpack would require our backend to emit a bitcast to get a double, but we want to match HLSL semantics and emit MakeDouble/SplitDouble. Adding a dedicated opcode for double pack/unpack allows us to add a pass to emit that instead, which lets our backend emit the right instruction to pack and unpack doubles. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10063>	2021-04-09 01:54:33 +00:00
Rhys Perry	5f62083c26	nir/gather_info: fix partial masking of compact I/O with location_frac!=0 nir_lower_clip_cull_distance_arrays() can create compact variables with location_frac!=0. Fixes: `cc7a187411` ("nir/gather_info: implement partial masking of struct and compact I/O") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4554 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10002>	2021-04-08 16:39:48 +00:00
Bas Nieuwenhuizen	edb89e7c4d	nir: Do not reset shared_size in nir_lower_io. I'd like to use raw shared intrinsics already for some raytracing stuff before this pass gets called and this was a real pitfall. This mirrors scratch_size and constant_data_size. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Bas Nieuwenhuizen	4ca4de50f7	nir: Remove nir_shader->shared_size. The same info is in shader_info. Dedupe. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Bas Nieuwenhuizen	580f1ac473	nir: Extract shader_info->cs.shared_size out of union. It is valid for all stages, just 0 for most of them. In particular mesh/task shaders might be using it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Bas Nieuwenhuizen	84e0f6dbd8	nir: Fix shader calls with nir_opt_dead_write_vars. Fixes: `5a28893279` ("spirv,nir: Add ray-tracing intrinsics") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10096>	2021-04-08 11:10:52 +00:00
Alyssa Rosenzweig	1286e73c2c	nir/lower_idiv: Add 8-bit and 16-bit lowering path Roundtrip to a larger float and divide there. The extra details for mod/rem are handled directly in integer space to simplify verification of rounding details. The one issue is that the mantissa might be rounded down which will cause issues; adding 1 unconditionally (proposed by Jonathan Marek) fixes this. The lowerings here were tested exhaustively on all pairs of 16-bit integers. v2: Update idiv lowering per Rhys Perry's comment. v3: Rewrite lowerings. v4: Remove useless ftrunc, fix 8-bit issue, simplify code. v5: Remove useless ffloor Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Danylo Piliaiev <dpiliaiev@igalia.com> Tested-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Alyssa Rosenzweig	e91dec1327	nir/lower_idiv: Factor out numer/denom load No need to duplicate across paths. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Alyssa Rosenzweig	7b0eb4aa00	nir/lower_idiv: Convert to lower_instructions Helps deduplicate some code between the two lowering paths. In particular, it ports the missing 32-bit? check to the precise pass. This does not change anything immediately: drivers depending on this to lower 16-bit did not work before due to type mismatches and will not work now since it'll refuse to lower. But that means sub-32-bit idiv can be lowered more efficiently in an algebraic pass. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Alyssa Rosenzweig	e4da24bd24	nir: Add {i2f, u2f, f2i, f2u} helpers Convenient for bitsize independent lowerings, will be used in the idiv lowering. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Alyssa Rosenzweig	6b19711645	nir: Add nir_type_convert Generalizes nir_convert_to_bit_size, which we implement as a special-case. v2: Take a sized dest type but allow unsized or sized source to address Jason's feedback. Shorten name. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8339>	2021-04-07 15:48:15 +00:00
Rhys Perry	292ac71a4a	nir/lower_tex: handle deref casts A RDR2 shader has a undef->texture cast which is eventually optimized out. Without handling NULL from nir_deref_instr_get_variable(), compiling this shader will result in a crash. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Fixes: `bc438c91d9` ("nir/lower_tex: ignore texture_index if tex_instr has deref src") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10038>	2021-04-06 08:35:39 +00:00
Pierre-Eric Pelloux-Prayer	bc438c91d9	nir/lower_tex: ignore texture_index if tex_instr has deref src texture_index is meaningless when a tex_instr has deref src. Use var->data.binding instead. This fixes the incorrect lowering on radeonsi where the same lowering steps was applied to all tex_instr based on the needs of the first one (since texture_index is always 0). CC: mesa-stable Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9931>	2021-04-05 10:14:07 +02:00
Rhys Perry	cc7a187411	nir/gather_info: implement partial masking of struct and compact I/O fossil-db (Sienna): Totals from 138 (0.10% of 138791) affected shaders: CodeSize: 504060 -> 482136 (-4.35%) Instrs: 97318 -> 94518 (-2.88%) Cycles: 389272 -> 378072 (-2.88%) VMEM: 14397 -> 14614 (+1.51%); split: +1.76%, -0.25% SMEM: 9088 -> 9024 (-0.70%) VClause: 2915 -> 2430 (-16.64%) SClause: 1790 -> 1791 (+0.06%) PreVGPRs: 5013 -> 4998 (-0.30%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8364>	2021-04-01 10:15:44 +00:00
Alyssa Rosenzweig	8578adeaa6	nir: Unify memory atomics Avoids some copypaste and makes it easier to see how the different types relate. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8847>	2021-03-30 00:11:01 +00:00
Eric Anholt	683d3972a6	nir: Update clip_distance_array_size in clip lowering. If we've added the array, then we should update the info. This is the value that gallium drivers setting !PIPE_CAP_CLIP_PLANES have to use in place of rasterizer->clip_planes_enabled. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9815>	2021-03-26 20:51:18 +00:00
Danylo Piliaiev	2bff8fd53b	nir: add nir_shader_as_str function It would be later used by Turnip in implementation of VK_KHR_pipeline_executable_properties. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8877>	2021-03-25 13:53:33 +00:00
Mike Blumenkrantz	6900498faa	nir: add nir_lower_indirect_builtin_uniform_derefs() this is a special version of indirect deref lowering which is used by mesa/st to remove dynamic indexing from builtin uniforms for the lowering pass in non-packed uniform case Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9741>	2021-03-23 14:44:48 +00:00
Gert Wollny	318701b803	nir: Add r600 specific sin and cos variants r600 expect the input values to be normalited by divinding by 2 *PI, so add an opcode to be able to lower this in nir. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9452>	2021-03-22 15:19:46 +01:00
Gert Wollny	0f5b3c37c5	nir: Add opcodes for fused comp + csel and optimizations Some backends, like r600 support a fused version of int and float compare against zero and and csel. Adding these opcodes here makes it possible to optimize this in nir. v2: Add rules for float compare + csel Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9452>	2021-03-22 15:19:46 +01:00

1 2 3 4 5 ...

3088 commits