fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 22:20:14 +01:00

Author	SHA1	Message	Date
Georg Lehmann	b93c92eba3	nir: lower ballot_bit_count_exclusive/inclusive to mbcnt_amd Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22783>	2023-05-03 10:39:20 +00:00
Lionel Landwerlin	53605f226b	nir/lower_non_uniform_access: add get_ssbo_size handling Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22793>	2023-05-02 16:27:15 +00:00
Erik Faye-Lund	4e8b532db3	nir: remove nir_state_slot::swizzle This is only ever written to, never read from. Let's just get rid of it! This also saves us a few needless includes. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22620>	2023-04-26 05:51:39 +00:00
antonino	a0645e3383	nir/zink: use sysvals in `nir_create_passthrough_gs` Previously the passthrough gs shader loaded some values with uniform loads using sevaral hardcoded values. This was not flexible for other drivers and started becoming too unflexible for zink itself. Use system values instead and use a lowering pass in zink. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22667>	2023-04-25 13:11:59 +00:00
Thomas H.P. Andersen	0ddf98e85d	nir/nir_lower_two_sided_color: Use the nir_shader_instructions_pass() helper Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11683>	2023-04-22 23:35:37 +00:00
Thomas H.P. Andersen	087b082f3d	nir/nir_lower_viewport_transform: Use the nir_shader_instructions_pass() helper Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11683>	2023-04-22 23:35:36 +00:00
Marek Olšák	e6e406b483	nir: add next_stage parameter to nir_remove_varying so that e.g. the POS output is removed if the next stage is not FS. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21861>	2023-04-19 21:42:11 +00:00
Marek Olšák	42822413cf	nir: add next_stage parameter to nir_slot_is_sysval_output to return better info If we know the next stage, we can tell whether an output is a sysval, such as POS. For example, POS is not a sysval output if the next stage is not FS. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21861>	2023-04-19 21:42:11 +00:00
Marek Olšák	ea9156edc3	nir: return a status from nir_remove_varying whether it removed the instruction Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21861>	2023-04-19 21:42:11 +00:00
Marek Olšák	9d78fec684	nir: rework nir_lower_color_inputs to work with lowered IO intrinsics also only call it from radeonsi and remove the option Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21861>	2023-04-19 21:42:11 +00:00
Rhys Perry	48158636bf	nir: add is_gather_implicit_lod Needed for SPV_AMD_texture_gather_bias_lod. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22315>	2023-04-18 10:42:07 +00:00
Alyssa Rosenzweig	dcb59a7672	nir: Remove nir_if_rewrite_condition_ssa Now unused. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	e9e0956d62	nir: Factor out nir_src_rewrite_ssa helper Like nir_instr_rewrite_ssa but without the asserted extra argument. Works on ifs too, now that we have a unified use list. We do need to assert that the source has actually been inserted and has valid use/def chains. Previously, asserting on the parent instruction accomplished that indirectly. For the more general helper, we instead directly assert that there exists a non-null parent, whatever it is. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	f3b420692b	nir: Remove 2nd argument from nir_before_src We can now determine whether a nir_src is for an if without a sideband, so simplify the function signature. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Faith Ekstrand <faith@gfxstrand.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	7f6491b76d	nir: Combine if_uses with instruction uses Every nir_ssa_def is part of a chain of uses, implemented with doubly linked lists. That means each requires 2 * 64-bit = 16 bytes per def, which is memory intensive. Together they require 32 bytes per def. Not cool. To cut that memory use in half, we can combine the two linked lists into a single use list that contains both regular instruction uses and if-uses. To do this, we augment the nir_src with a boolean "is_if", and reimplement the abstract if-uses operations on top of that list. That boolean should fit into the padding already in nir_src so should not actually affect memory use, and in the future we sneak it into the bottom bit of a pointer. However, this creates a new inefficiency: now iterating over regular uses separate from if-uses is (nominally) more expensive. It turns out virtually every caller of nir_foreach_if_use(_safe) also calls nir_foreach_use(_safe) immediately before, so we rewrite most of the callers to instead call a new single `nir_foreach_use_including_if(_safe)` which predicates the logic based on `src->is_if`. This should mitigate the performance difference. There's a bit of churn, but this is largely a mechanical set of changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
antonino	0b65514775	nir/zink: handle provoking vertex mode in `nir_create_passthrough_gs` Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	1a5bdca2dd	zink: implement flat shading using inlined uniforms Zink will now handle flat interpolation correctly when line loops are generated from primitives. The flat shading information is passed to the emulation gs using constant uniforms which get inlined. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	3b5fb8b060	nir: allow to force line strip out in nir_create_passthrough_gs `nir_create_passthrough_gs` now allows the user to force the generated GS to always output a line strip from the primitive regardless of whether edgeflags are present. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	24535ffb3d	nir: handle edge flags in nir_create_passthrough_gs `nir_create_passthrough_gs` will now take a boolean argument to decide whether it needs to handle edgeflags. When true is passed it will output a line strip where edges that shouldn't be visible are not emitted. This is usefull because geometry shaders will generally throw away edgeflags so for a passthrough GS to act transparently it needs to emulate them. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
antonino	a0751e8088	nir: calculate number of vertices in nir_create_passthrough_gs `nir_create_passthrough_gs` has been changed to take the type of primitive as opposed to the number of vertices as an argument. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21238>	2023-03-29 19:18:40 +00:00
Ian Romanick	28311f9d02	nir: intel/compiler: Move ufind_msb lowering to NIR Fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Cycles in all programs: 9098346105 -> 9098333765 (-0.0%) Cycles helped: 6 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	0cc7bf63b7	nir: intel/compiler: Move ifind_msb lowering to NIR Unlike ufind_msb, ifind_msb is only defined in NIR for 32-bit values, so no @32 annotation is required. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Alyssa Rosenzweig	952bd63d6d	nir/opt_barrier: Generalize to control barriers For GLSL, we want to optimize code like memoryBarrierBuffer(); controlBarrier(); into a single scoped_barrier intrinsic for the backend to consume. Now that backends can get scoped_barriers everywhere, what's left is enabling backends to combine these barriers together. We already have an Intel-specific pass for combining memory barriers; it just needs a teensy bit of generalization to allow combining all sorts of barriers together. This avoids code quality regression on Asahi when switching to purely scoped barriers. It's probably useful for other backends too. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21661>	2023-03-06 22:09:27 +00:00
Alyssa Rosenzweig	282aeb9b9c	nir/lower_tex: Add lower_index_to_offset Some backends can handle a constant texture index or a dynamic texture index but not a constant texture index plus a dynamic texture offset. Add a nir_lower_tex option to lower to one of these options. v2: Use more straightforward code proposed by Faith. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21546>	2023-03-06 21:38:32 +00:00
Erik Faye-Lund	c305f97257	nir: add a print_internal debug-flag It can sometimes be useful to also print the shaders that are marked as internal, so let's add a flag that lets us do that. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21681>	2023-03-06 09:13:52 +00:00
Alyssa Rosenzweig	586da7b329	nir: Add nir_lower_helper_writes pass This NIR pass lowers stores in fragment shaders to: if (!gl_HelperInvocaton) { store(); } This implements the API requirement that helper invocations do not have visible side effects, and the lowering is required on any hardware that cannot directly mask helper invocation's side effects. The pass was originally written for Midgard (which has this issue) but is also needed for Asahi. Let's share the code, and fix it while we're at it. Changes from the Midgard pass: 1. Add an option to only lower atomics. AGX hardware can mask helper invocations for "plain" stores but not for atomics. Accordingly, the AGX compiler wants this lowering for atomics but not store_global. By contrast, Midgard cannot mask any stores and needs the lowering for all store intrinsics. Add an option to the common pass to accommodate both cases. This is an optimization for AGX. It is not required for correctness, this lowering is always legal. 2. Fix dominance issues. It's invalid to have NIR like if ... { ssa_1 = ... } foo ssa_1 Instead we need to rewrite as if ... { ssa_1 = ... } else { ssa_2 = undef } ssa_3 = phi ssa_1, ssa_2 foo ssa_3 By default, neither nir_validate nor the backends check this, so this doesn't currently fix a (known) real bug. But it's still invalid and fails validation with NIR_DEBUG=validate_ssa_dominance. Fix this in lower_helper_writes for intrinsics that return data (atomics). 3. Assert that the pass is run only for fragment shaders. This encourages backends to be judicious about which passes they call instead of just throwing everything in a giant lower everything spaghetti. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21413>	2023-03-04 13:31:05 -05:00
Marek Olšák	ec38758e86	nir: return progress from nir_lower_io_to_scalar oversight? Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19399>	2023-03-03 03:27:40 +00:00
Faith Ekstrand	eb9a56b6ca	nir: Rename nir_mem_access_size_align::align_mul to align It's a simple alignment so calling it align_mul is a bit misleading. Suggested-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	ca4d73ba36	nir: Add a combined alignment helper Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@colllabora.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	116a851264	nir: Add mode filtering to lower_mem_access_bit_sizes Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Georg Lehmann	a00b50d820	nir: change 16bit image dest folding option to per type Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21404>	2023-02-27 09:55:34 +00:00
Alyssa Rosenzweig	8058d31a25	nir: Add nir_texop_lod_bias_agx Add a new texture opcode that returns the LOD bias of the sampler. This will be used on AGX to lower sampler LOD bias to txb and friends. This needs to be a texture op (and not a new intrinsic) to handle both bindless and bindful samplers across GL and Vulkan in a uniform way. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21276>	2023-02-27 02:35:41 +00:00
Faith Ekstrand	e41753cf17	nir/lower_io: Handle buffer_array_length for more address modes Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21446>	2023-02-24 20:37:10 +00:00
Caio Oliveira	3328714295	nir/lower_subgroups: Add option lower_rotate_to_shuffle Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19797>	2023-02-24 06:33:51 +00:00
Daniel Schürmann	c20751d61d	nir: add lowering for Loop Continue Constructs This pass lowers Loop Continue Constructs to the previous solution by inserting it at the beginning of the loop: loop { if (i != 0) { continue construct } loop body } Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	d4b97bf3fa	nir: add Continue Construct to nir_loop The added continue_list corresponds to the SPIR-V Continue Construct and serves as a converged control-flow construct and is executed after each continue statement and before the next iteration of the loop body. Also adds validation rules for loops with Continue Construct Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Faith Ekstrand	2e2d7803c7	nir: Add a load/store bit size lowering pass This is based on brw_nir_lower_mem_access_bit_sizes() but ended up being substantially different. While the core concepts are all the same, the brw_* version made a lot of Intel-specific assumptions. The new version takes a callback which takes a number of bytes of data and an alignment pair and returns a bit size and number of components to load/store. Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21232>	2023-02-17 00:55:54 +00:00
Jesse Natalie	25ee07373c	nir_lower_fp16_casts: Allow opting out of lowering certain rounding modes Reviewed-by: Giancarlo Devich <gdevich@microsoft.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21029>	2023-02-11 06:12:23 +00:00
Ian Romanick	682e83f012	nir/inline_uniforms: Make add_inlinable_uniforms public This is step 5 in an attempt to unify a bunch of nir_inline_uniforms.c and lvp_inline_uniforms.c code. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Ian Romanick	cdd23b1efa	nir/inline_uniforms: Make src_only_uses_uniforms public, change name While making the function public, rename it to nir_collect_src_uniforms. The old name makes it sound like it's just a query that doesn't have side effects. That is, however, not the case. This is step 4 in an attempt to unify a bunch of nir_inline_uniforms.c and lvp_inline_uniforms.c code. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21179>	2023-02-10 03:18:23 +00:00
Jason Ekstrand	9c62e0c77d	nir: Remove nir_lower_io_force_sample_interpolation It's no longer used. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:17 +00:00
Alyssa Rosenzweig	071ac59960	nir: Add a late texcoord replacement pass Add a second NIR pass for lowering point/texture coordinate replacement (i.e. point sprites). Why a second one? The current pass works on derefs/variables, which is good for drivers that don't lower I/O at all (like Zink, where the pass originates). However, it is problematic for hardware drivers: the inputs to this pass depend on the shader key, so we want to run the pass as late as possible to minimize the cost of building/compiling the associated shader variants. In particular, we need to be able to lower point sprites after lowering I/O if we would like to lower I/O when preprocessing NIR. The logic for early lowering and late lowering is considerably different (the late lowering is a lot simpler), so I've split this out into a second pass rather than trying to weld them together into one. This pass will be used on Asahi, which currently uses the early pass. It may be useful for other drivers as well. (Actually, it's been shipping on Asahi for a little while now, just hasn't been sent upstream yet.) Tested with Neverball. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Asahi Lina <lina@asahilina.net> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21065>	2023-02-03 15:03:06 +00:00
Amber	c384690ab7	nir: support lowering nir_intrinsic_image_samples to a constant load This can be used by multiple drivers that do not support ms images Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Rob Clark <robclark@freedesktop.org> Reviewer-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Amber Amber <amber@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20813>	2023-02-01 19:52:49 +00:00
Marcin Ślusarz	2255375c4d	nir: add nir_mod_analysis & its tests Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20050>	2023-01-31 13:50:08 +00:00
Timur Kristóf	12652cc549	nir: Add pack_half_2x16_rtz_split opcode. Same as pack_half_2x16_rtz_split, but always uses RTZ mode. Note that pack_half_2x16 rounding mode is unspecified. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15838>	2023-01-26 12:24:24 +00:00
Gert Wollny	2e05cfa179	nir: Add range_base to atomic_counter and an option to use it Some drivers may encode constant offsets in the instruction, so make it possible for the drivers to request lowering the atomic uniform offset into the range_base variable of the intrinsic. v2: drop patch to use build-in array offset evaluation, it makes problems with zink, and update the code accordingly v3: always initialize range base Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19980>	2023-01-17 13:19:04 +00:00
Gert Wollny	c4cde91c1b	nir: Add possibility to store image var offset in range_base Add the intrinsic range_base value to the image intrinsics and add the option to store the image array offset into range_base instead of adding it to the image array index if the driver requests it. v2: Always initialize range_base v3: fix for bindless intrinsics Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19980>	2023-01-17 13:19:04 +00:00
Pavel Ondračka	3fcdd9e4a7	nir/lower_bool: ntt: Generate a good opcode for bcsel This is heavily copy-pasted from a patch of Ian Romanick, including the commit message. Previously, this pass always generated fcsel for bcsel. This was the only place that generate fcsel, so various drivers assumed (and needed!) that src0 was a Boolean with 0.0 or 1.0 as the only values. Specifically, many DX9 / GL_ARB_vertex_program platforms lack a CMP instruction in vertex shaders. In those cases, they would use LRP to implement fcsel. The bummer is that many plaforms have a real fcsel instruction, and those platforms would benefit from other places generating that opcode. Instead of leaving assumptions in drivers about the sources of an opcode that they can't really support, allow them to control the way the lowering pass translates bcsel. Two flags are used to control this: - If the driver sets has_fused_comp_and_csel in nir_options, fcsel_gt will be used. Since the Boolean value is 0.0 or 1.0, this is equivalent to fcsel. - If the parameter has_fcsel_ne is set, fcsel will be used. This is the old path. - Otherwise, the lowering pass assumes we're on a crufty, old DX9 vertex program, and it emits flrp. With this, the assumptions about src0 of fcsel in NTT can be removed. If a platform can't handle fcsel, it should ensure that the lowering pass won't generate it. No change in shader-db. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20162>	2023-01-12 23:01:05 +00:00
Danylo Piliaiev	1c9ee30838	nir/fold_16bit_tex_image: Add type granularity for dst folding Some HW may be able to fold only some of dst types, e.g. for Adreno folding i32 -> i16 could cause a different result since folded variant clamps the result instead of masking it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20396>	2022-12-23 15:48:18 +01:00
Lionel Landwerlin	3af08b9c30	nir/divergence: handle shader_record_ptr intrinsic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes `6b8fd65e84` ("spirv: Implement the new ray-tracing storage classes") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20413>	2022-12-23 09:22:13 +00:00

... 4 5 6 7 8 ...

1306 commits