fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 11:20:11 +01:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	82ae8b1d33	treewide: simplify nir_def_rewrite_uses_after Most of the time with nir_def_rewrite_uses_after, you want to rewrite after the replacement. Make that the default thing to be more ergonomic and to drop parent_instr uses. We leave nir_def_rewrite_uses_after_instr defined if you really want the old signature with an arbitrary after point. Via Coccinelle patch: @@ expression a, b; @@ -nir_def_rewrite_uses_after(a, b, b->parent_instr) +nir_def_rewrite_uses_after_def(a, b) Followed by a bunch of sed. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Alyssa Rosenzweig	114bf69956	nir: add nir_def_block helper Another common composition. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Alyssa Rosenzweig	3624f054f2	nir: add nir_def_as_* helpers We want to get rid of nir_def::parent_instr eventually, requiring an accessor function instead nir_def_parent_instr(def), so to mitigate the hit to NIR ergonomics, let's add helpers for common patterns using parent_instr. This gets us an immediate win for NIR ergonomics and then reduces the surface area for the later flag day hiding parent_instr. This commit starts us off by adding compositions for nir_instr_as_* with parent_instr's, which are common. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Marek Olšák	5531f01326	nir: move list.h outside the glsl directory Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36425>	2025-07-31 20:23:02 +00:00
Antonio Ospite	ddf2aa3a4d	build: avoid redefining unreachable() which is standard in C23 In the C23 standard unreachable() is now a predefined function-like macro in <stddef.h> See https://android.googlesource.com/platform/bionic/+/HEAD/docs/c23.md#is-now-a-predefined-function_like-macro-in And this causes build errors when building for C23: ----------------------------------------------------------------------- In file included from ../src/util/log.h:30, from ../src/util/log.c:30: ../src/util/macros.h:123:9: warning: "unreachable" redefined 123 \| #define unreachable(str) \ \| ^~~~~~~~~~~ In file included from ../src/util/macros.h:31: /usr/lib/gcc/x86_64-linux-gnu/14/include/stddef.h:456:9: note: this is the location of the previous definition 456 \| #define unreachable() (__builtin_unreachable ()) \| ^~~~~~~~~~~ ----------------------------------------------------------------------- So don't redefine it with the same name, but use the name UNREACHABLE() to also signify it's a macro. Using a different name also makes sense because the behavior of the macro was extending the one of __builtin_unreachable() anyway, and it also had a different signature, accepting one argument, compared to the standard unreachable() with no arguments. This change improves the chances of building mesa with the C23 standard, which for instance is the default in recent AOSP versions. All the instances of the macro, including the definition, were updated with the following command line: git grep -l '[^_]unreachable(' -- "src/**" \| sort \| uniq \| \ while read file; \ do \ sed -e 's/$[^_]$unreachable(/\1UNREACHABLE(/g' -i "$file"; \ done && \ sed -e 's/#undef unreachable/#undef UNREACHABLE/g' -i src/intel/isl/isl_aux_info.c Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36437>	2025-07-31 17:49:42 +00:00
Marek Olšák	d61edf079b	nir: add nir_move_only_convergent/divergent This will be needed by nir_opt_move_reorder_loads, which will use the move flags. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36357>	2025-07-29 16:20:53 -04:00
Marek Olšák	35bbc8405b	nir: add more nir_move_options Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36357>	2025-07-29 16:20:51 -04:00
Marek Olšák	8d3e76c250	nir: split nir_move_load_frag_coord from nir_move_load_input It's a pure system value on AMD, not an input. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36357>	2025-07-29 16:20:48 -04:00
Marek Olšák	5083769fcb	nir: renumber nir_move_options for future commits Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36357>	2025-07-29 16:20:46 -04:00
Marek Olšák	2eea9b968d	nir/group_loads: rename to nir_opt_group_loads Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It's meant to be an optimization pass. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36100>	2025-07-29 19:28:59 +00:00
Georg Lehmann	845961ab77	nir: remove NIR_PASS_V Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10409 Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36381>	2025-07-25 23:09:56 +00:00
Marek Olšák	688a639117	nir: add nir_tex_instr::can_speculate Set to true everywhere except: - spirv_to_nir used by Vulkan - bindless handles in GLSL - some internal shaders and driver-specific code Acked-by: Job Noorman <job@noorman.info> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36099>	2025-07-24 18:41:38 +00:00
Marek Olšák	b4afe848a1	nir: add nir_instr_can_speculate helper (for LICM) Acked-by: Job Noorman <job@noorman.info> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36099>	2025-07-24 18:41:37 +00:00
Alyssa Rosenzweig	8716012b21	glsl,nir: factor out nir_opt_varyings_bulk Correctly/optimally using nir_opt_varyings directly is pretty tricky. For GL, we have all the right logic in the GLSL linker. for VK, we don't want to duplicate this dance in every driver. Wrap it all up in a nir_opt_varyings_bulk helper that operates on an entire pipeline of nir_shader's, following the GLSL linker's logic. This is suitable for Vulkan drivers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36265>	2025-07-23 14:15:57 +00:00
Alyssa Rosenzweig	e0b0f7e73c	nir: add ALU reassocation pass See the comment at the top file :-) The ideas in this pass are based on LLVM. The implementation itself is from scratch because have you /tried/ to read that thing? Because LLVM and therefore prop drivers do these optimizations, this should help narrow Mesa's performance gap with the blobs. This probably needs some tuning for best results on other ISAs, but the stats for AGX speak for themselves, see next commits. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36147>	2025-07-22 23:17:01 +00:00
Alyssa Rosenzweig	22b37c16c8	nir: add nir_alu_src_rewrite_scalar helper this is a little annoying. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Acked-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36147>	2025-07-22 23:17:01 +00:00
Alyssa Rosenzweig	e466b8735b	nir: introduce "inexact associative" property nothing currently uses the associative flag, but they will change soon. we need to stop incorrectly marking fmul/fadd/etc as associative, because they're not, but they almost are. distinguish these properties so we can correctly handle floating point rules without any opcode-based special casing. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36257>	2025-07-21 11:42:19 +00:00
Faith Ekstrand	9fbb57e0a4	nir,nak: Add a nir_texop_sample_pos_nv and plumb it through Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36207>	2025-07-18 22:21:46 +00:00
Alyssa Rosenzweig	58cc66238a	nir/opt_preamble: add sampler class AGX will use. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36127>	2025-07-16 18:27:17 +00:00
Marek Olšák	6286c1c66f	nir/opt_vectorize_io: optionally vectorize loads with holes e.g. load X; load W; ==> load XYZW. Verified with a shader test. This will be used by AMD drivers. See the code comments. Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36098>	2025-07-15 16:29:30 +00:00
Marek Olšák	b0494f9485	nir/opt_varyings: optimize the consumer after constant propagation and dedupli. A TF2 shader propagates 0 to the consumer, which eliminates 1 input if we run algebraic opts and DCE before compaction. This is a prerequisite for removing all IO var optimizations from the GLSL linker that are redundant with nir_opt_varyings. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36091>	2025-07-15 13:38:29 +00:00
Alyssa Rosenzweig	d55bdb4ec5	nir/opt_preamble: add "register class" concept Class represents an indexed "ideal" register class, where non-general classes only allow defs that choose that class in the def_size callback. nir_opt_preamble will try to assign specialized classes where possible, falling back to the general class once the special-purpose classes are exhausted. AGX will use this mechanism to promote bindless texture handles to bound texture registers where possible, falling back to pushing the handle as a uniform where not possible. Supporting multiple classes in nir_opt_preamble allows this multi-level hoisting to work in a single nir_opt_preamble call with proper global behaviour. Add this concept to nir_opt_preamble so we can use it in AGX later in this MR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Job Noorman <job@noorman.info> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35949>	2025-07-10 14:55:17 -04:00
Marek Olšák	a4e522f8b0	nir: add new pass nir_opt_move_to_top This can be used to move input loads to top after we stop using nir_lower_io_vars_to_temporaries that does it unconditionally. It's more flexible than what nir_lower_io_vars_to_temporaries was doing, and can be extended to handle any instructions. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36018>	2025-07-10 16:37:44 +00:00
Marek Olšák	3dd9a9782b	nir: add new pass nir_lower_io_indirect_loads This is a partial replacement for nir_lower_io_vars_to_temporaries. It supports all input and output loads. It doesn't handle stores. The motivation is to improve compile times. The main differences compared to nir_lower_io_vars_to_temporaries are: - it only lowers indirect loads to temps and doesn't touch direct loads which improves compile times and removes the need for nir_lower_vars_to_ssa afterward because indirect temp access can't be lowered to SSA - it doesn't move all input loads to the top; it only moves those input loads to the top whose indirect loads are lowered (which improves register usage because direct loads are not moved) - it doesn't have to deal with complexities of variables Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36018>	2025-07-10 16:37:44 +00:00
Daniel Schürmann	2c51a8870d	nir: add nir_vectorize_cb callback parameter to nir_lower_phis_to_scalar() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Similar to nir_lower_alu_width(), the callback can return the desired number of components for a phi, or 0 for no lowering. The previous behavior of nir_lower_phis_to_scalar() with lower_all=true can be elicited via nir_lower_all_phis_to_scalar() while the previous behavior with lower_all=false now corresponds to nir_lower_phis_to_scalar() with NULL callback. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35783>	2025-07-08 15:33:59 +00:00
Alyssa Rosenzweig	d13b321201	nir/lower_gs_intrinsics: drop stuff added for AGX AGX now vendors a significantly different version of this pass, so the common one doesn't need the stuff added for AGX. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35802>	2025-06-30 16:24:10 +00:00
Robert Mader	a166d7609f	gles: Add support for 10/12/16 bit SW decoder YCbCr formats Signed-off-by: Robert Mader <robert.mader@collabora.com> Co-Authored-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34303>	2025-06-30 11:56:23 +00:00
Mel Henning	10acb44c64	nir: Split lower_vote_eq into int/float versions Recent nvidia hardware has a native instruction for nir_intrinsic_vote_ieq but not for nir_intrinsic_vote_feq. So, split this boolean into two so we can contol the lowering separately for each instruction. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35778>	2025-06-28 16:10:50 +00:00
Konstantin Seurer	aacfc663cb	nir: Add nir_lower_halt_to_return This is a lowering pass that was implemented by multiple drivers. Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33003>	2025-06-26 20:12:12 +00:00
Marek Olšák	1754507d49	nir: rename nir_lower_io_to_temporaries -> nir_lower_io_vars_to_temporaries Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	1e03827c77	nir: rename nir_lower_io_arrays_to_elements -> nir_lower_io_array_vars_to_elements same for *_no_indirects Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	3713e2d580	nir: rename nir_lower_clip_cull_distance_arrays -> nir_lower_clip_cull_distance_array_vars Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:53 +00:00
Marek Olšák	adb17a8609	nir: move nir_recompute_io_bases into its own file Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:53 +00:00
Marek Olšák	97743980ce	nir: remove unused nir_force_mediump_io & nir_unpack_16bit_varying_slots I think I added these. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:52 +00:00
Marek Olšák	5bd3e0c08c	nir: move nir_assign_var_locations to freedreno (its only use) Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:52 +00:00
Marek Olšák	12df9b3def	nir: rename nir_vectorize_tess_levels -> nir_lower_tess_level_array_vars_to_vec Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:50 +00:00
Marek Olšák	2aa94caf82	nir: rename nir_lower_io_to_vector -> nir_opt_vectorize_io_vars Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:50 +00:00
Marek Olšák	439d805291	nir: rename nir_lower_io_to_scalar_early -> nir_lower_io_vars_to_scalar Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:49 +00:00
Georg Lehmann	7de352e99e	nir,radv: add an option to not move 8/16bit vecs ACO will overestimate the register demand of the sources, so we don't want to create the vector later. Foz-DB Navi48: Totals from 240 (0.30% of 80265) affected shaders: MaxWaves: 6429 -> 6435 (+0.09%) Instrs: 3406069 -> 3406646 (+0.02%); split: -0.01%, +0.03% CodeSize: 18231596 -> 18233288 (+0.01%); split: -0.01%, +0.02% VGPRs: 14768 -> 14732 (-0.24%) Latency: 18981274 -> 18979170 (-0.01%); split: -0.02%, +0.01% InvThroughput: 4247331 -> 4246634 (-0.02%); split: -0.02%, +0.01% VClause: 85453 -> 85458 (+0.01%); split: -0.01%, +0.01% Copies: 262046 -> 261971 (-0.03%); split: -0.06%, +0.03% PreVGPRs: 10899 -> 10775 (-1.14%) VALU: 1923441 -> 1923485 (+0.00%); split: -0.01%, +0.01% SALU: 457983 -> 457982 (-0.00%) VOPD: 4980 -> 4861 (-2.39%); split: +0.48%, -2.87% Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35729>	2025-06-26 09:29:43 +00:00
Emma Anholt	bc8994cb48	nir: Add a pass to reassociate multiplication of matmatvec. The typical case of mat4mat4vec4 is 80 scalar multiplications, but mat4(mat4vec4) is only 32. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35622>	2025-06-23 17:49:51 +00:00
Faith Ekstrand	bb4c5edda1	nir: Add more tex_src helpers This adds a variant of nir_steal_tex_src() which is for derefs as well as versions that just return the source without removing it. Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35623>	2025-06-23 14:25:30 +00:00
Faith Ekstrand	2b40fa09f2	nir: Move nir_steal_tex_src() to nir.h Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35623>	2025-06-23 14:25:30 +00:00
Faith Ekstrand	9f9cde04ec	nir: Add a new load_input_attachment_coord intrinsic This hoists all the annoyance of figuring out the current pixel's input attachment coordinates to the driver. The pass still deals with all the annoyance of turning an image instruciton into a texture instruction but it gives the driver more control over the position. For most drivers, this will be something like ivec3(int(gl_FragCoord.xy), gl_Layer) or similar, some drivers need something more nuanced. Turnip, for instance, needs unscaled coordinates for some attachments and NVK doesn't really want gl_Layer or gl_ViewIndex for the layer. It's better to just have a new system value that drivers can make what they want. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35551>	2025-06-19 02:14:04 +00:00
Marek Olšák	fa2e7c3dfd	nir: return progress from nir_group_loads, nir_inline_uniforms so that NIR_PASS is usable Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:37 +00:00
Marek Olšák	b636e5ca66	nir: add nir_clear_mediump_io_flag Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:36 +00:00
Marek Olšák	bf2ed20eb9	nir: remove unused nir_io_semantics::invariant Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Acked-by: Alyssa on IRC Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Kenneth Graunke	deb1d47155	nir: Add a new optimization for acquire/release atomics & barriers Some shaders contain back-to-back atomic accesses in SPIR-V with AcquireRelease semantics. In NIR, we translate these to a release memory barrier, the atomic, then an acquire memory barrier. This results in a lot of unnecessary memory barriers in the middle of the sequence of atomics: 0. Release barrier 1. Atomic 2. Acquire barrier 3. Release barrier 4. Atomic 5. Acquire barrier 6. Release barrier 7. Atomic 8. Acquire barrier In the absence of loads/stores, and when the atomic destinations are unused, these barriers in-between atomics shouldn't be required. This optimization pass would drop them (lines 2-3 and 5-6 above) while leaving the first and last barriers (0 and 8), so the sequence remains synchronized against other access elsewhere in the program. One common example where this occurs is a sequence of min and max atomics to clamp a certain memory location's value within a range. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33504>	2025-05-16 00:29:13 +00:00
Marek Olšák	deda05e2b7	nir: move nir_lower_color_inputs into radeonsi it's the only user Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34492>	2025-05-14 20:19:17 +00:00
Konstantin Seurer	5926b63f66	nir: Print struct type declarations Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26267>	2025-05-12 18:28:50 +00:00
Karol Herbst	f0fa2209a8	nir: add nir_opt_algebraic_integer_promotion This handles basic operations where clang promotes integers to 32 bits according to the C99 spec in OpenCL C source code. This is its own opt_algerbraic pass, because we don't wanna fight with nir_lower_bit_size. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34641>	2025-05-12 09:29:20 +00:00

1 2 3 4 5 ...

1620 commits