mesa/src/compiler/nir
Daniel Schürmann 0089d81fb3 nir/tests: change opt_loop_peel_initial_break test to not use nir_jump_continue
We are going to disallow continue statements without
loop continue constructs.

Replaced with a test that checks that the optimization is not
applied in absense of actual work after the conditional break.

Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com>
Reviewed-by: Georg Lehmann <dadschoorse@gmail.com>
Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39942>
2026-03-21 07:42:55 +00:00
tests nir/tests: change opt_loop_peel_initial_break test to not use nir_jump_continue 2026-03-21 07:42:55 +00:00
.clang-format
meson.build nir/tests: Test nir_opt_large_constants 2026-03-19 06:59:16 +00:00
nir.c nir: rework nir_alu_src_is_trivial_ssa to take an alu src 2026-03-20 08:50:41 +00:00
nir.h nir: Use better calculation for alpha-to-coverage mask 2026-03-20 18:09:48 +00:00
nir_algebraic.py nir/algebraic/tests: invert all excluded fp_math_ctrl flags 2026-03-16 13:03:50 +00:00
nir_blend_equation_advanced_helper.h nir: Extract blend equation helpers to shared header 2026-01-24 21:44:01 +00:00
nir_builder.c nir/search: never insert movs for alu uses 2026-03-20 08:50:41 +00:00
nir_builder.h nir/search: never insert movs for alu uses 2026-03-20 08:50:41 +00:00
nir_builder_opcodes_h.py src/compiler: Fix warning C4244 'argument' : conversion from 'type1' to 'type2', possible loss of data 2025-01-15 21:40:20 +00:00
nir_builtin_builder.c nir: add nir_tex_instr::can_speculate 2025-07-24 18:41:38 +00:00
nir_builtin_builder.h nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_clip_cull_distance_io_utils.c nir/clip_cull_distance_utils: add more assertions validating the type & sizes 2026-01-05 21:24:10 +00:00
nir_clone.c nir/clone: Fix cloning indirect call instructions 2026-02-13 11:27:59 +00:00
nir_constant_expressions.h nir: Let nir_eval_const_opcode() return a poison mask in case of UB. 2026-01-15 19:09:32 +00:00
nir_constant_expressions.py nir/constant_expressions: flush input denorms if denorms have to be flushed 2026-01-19 16:11:28 +00:00
nir_control_flow.c nir: Update progress info in nir_sort_unstructured_blocks 2025-11-17 10:30:37 +00:00
nir_control_flow.h nir/cf: have nir_remove_after_cf_node remove phis at the start too 2025-07-23 09:06:36 +00:00
nir_control_flow_private.h
nir_conversion_builder.h nir: fix nir_round_int_to_float for fp16 2026-03-04 14:32:35 +00:00
nir_defines.h nir: add support for pixel_local_storage variables 2025-11-18 20:25:42 +00:00
nir_deref.c nir: Use instr_clone in rematerialize_deref_in_block 2025-12-10 22:07:45 +00:00
nir_deref.h nir: Don't include the full nir.h when not necessary. 2025-02-12 22:33:07 +01:00
nir_divergence_analysis.c tu, ir3, nir: Plumb through driver param for alpha-to-coverage 2026-03-20 18:09:49 +00:00
nir_dominance.c treewide: Spell indices correctly 2025-09-08 23:03:13 +00:00
nir_dominance_lca.c nir: Add a faster lowest common ancestor algorithm 2025-09-08 23:03:13 +00:00
nir_downgrade_pls_vars.c nir: Add a pass to downgrade inout PLS vars to {in,out} only ones 2025-11-18 20:25:43 +00:00
nir_fixup_is_exported.c nir: fix nir_fixup_is_exported for LLVM-22 2026-01-30 16:06:25 +00:00
nir_format_convert.c nir/format_convert: use nan/inf preserve flag for fmax instead of exact 2026-02-10 18:42:02 +00:00
nir_format_convert.h nir: add a high precision conversion unorm->float 2025-02-06 08:20:21 +00:00
nir_from_ssa.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_functions.c nir: Use STACK_ARRAY instead of NIR_VLA 2026-02-14 01:19:27 +00:00
nir_gather_info.c nir/gather_info: use nir_intrinsic_has_io_semantics 2026-03-12 17:00:25 +00:00
nir_gather_output_deps.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_gather_tcs_info.c nir/gather_tcs_info: use nir_analyze_fp_class directly 2026-03-07 05:01:44 +00:00
nir_gather_types.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_gather_xfb_info.c nir: merge xfb and xfb2 into one 64bit intrinsic index 2026-03-10 07:46:22 +00:00
nir_gs_count_vertices.c nir: make nir_block::predecessors & dom_frontier sets non-malloc'd 2025-08-21 06:13:48 +00:00
nir_inline_helpers.h nir: add a cmat call instruction type. 2025-11-17 23:33:58 +00:00
nir_inline_sysval.c nir: add nir_inline_sysval pass 2025-08-03 21:27:47 +00:00
nir_inline_uniforms.c nir: speedup nir_find_inlinable_uniforms() 2026-02-05 23:19:29 +00:00
nir_instr_set.c nir: don't assume indicies are always 32bit when accessing them as raw data 2026-03-10 07:46:20 +00:00
nir_instr_set.h nir/instr_set: don't ralloc the set 2025-08-21 06:13:48 +00:00
nir_intrinsics.py tu, ir3, nir: Plumb through driver param for alpha-to-coverage 2026-03-20 18:09:49 +00:00
nir_intrinsics_c.py nir: support intrinsic indicies larger than 32 bits 2026-03-10 07:46:21 +00:00
nir_intrinsics_h.py meson: remove '--outdir' argument in script 2025-10-08 20:51:20 +00:00
nir_intrinsics_indices_h.py nir: fix nir_intrinsic_copy_const_indices for large indices 2026-03-11 16:47:15 +00:00
nir_io_add_xfb_info.c nir: merge xfb and xfb2 into one 64bit intrinsic index 2026-03-10 07:46:22 +00:00
nir_legacy.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_legacy.h nir: Add bool return value to nir_legacy_trivialize(..) 2025-03-06 03:29:20 +00:00
nir_linking_helpers.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_liveness.c nir, vk: fix MSVC unused variable warning 2025-11-28 01:52:12 +00:00
nir_loop_analyze.c nir/loop_analyze: also set force_unroll if the array_size is larger than max_trip_count 2026-02-10 09:24:23 +00:00
nir_loop_analyze.h treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_alpha.c nir: Use better calculation for alpha-to-coverage mask 2026-03-20 18:09:48 +00:00
nir_lower_alu.c nir/opt_algebraic: make bcsel(fcmp(b, a), b, a) -> fmin/fmax patterns exact 2026-02-10 18:42:03 +00:00
nir_lower_alu_width.c nir: remove fall_equal/fany_nequal opcodes 2026-03-04 19:50:27 +00:00
nir_lower_amul.c nir: add intrinsics for pixel local storage 2025-11-18 20:25:42 +00:00
nir_lower_array_deref_of_vec.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_atomics.c nir/lower_atomics: use data helper 2026-02-19 14:47:11 +00:00
nir_lower_atomics_to_ssbo.c nir: add ACCESS_ATOMIC 2025-10-07 17:41:30 +00:00
nir_lower_bit_size.c nir: Add intrinsics for ShuffleUpINTEL and ShuffleDownINTEL 2026-03-17 17:21:52 +00:00
nir_lower_bitmap.c nir: add nir_tex_instr::can_speculate 2025-07-24 18:41:38 +00:00
nir_lower_blend.c nir/lower_blend: Add advanced blending support 2026-01-24 21:44:01 +00:00
nir_lower_blend.h nir/lower_blend: Add advanced blending support 2026-01-24 21:44:01 +00:00
nir_lower_bool_to_float.c nir/lower_bool_to_float: assert that vector comparisons were lowered 2026-03-04 19:50:27 +00:00
nir_lower_bool_to_int32.c treewide: use nir_metadata_control_flow 2024-06-17 16:28:14 -04:00
nir_lower_calls_to_builtins.c nir/nir_lower_calls_to_builtins: trivially handle IA64 mangled functions 2025-09-02 16:04:19 +00:00
nir_lower_cl_images.c treewide: use BITSET_*_COUNT 2025-12-16 17:42:10 +00:00
nir_lower_clamp_color_outputs.c nir: remove handling IO variables from passes used by st/mesa 2025-01-22 02:15:04 +00:00
nir_lower_clip.c nir/lower_clip: Correctly handle driver_location in VS lowering 2026-01-21 20:54:15 +00:00
nir_lower_clip_disable.c nir: remove handling IO variables from passes used by st/mesa 2025-01-22 02:15:04 +00:00
nir_lower_clip_halfz.c nir: Add bool return value to nir_lower_clip_halfz(..) 2025-02-19 21:16:05 +00:00
nir_lower_const_arrays_to_uniforms.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_continue_constructs.c nir/lower_continue_constructs: Remove unnecessary handling of multiple continue statements 2026-03-21 07:42:55 +00:00
nir_lower_convert_alu_types.c nir/lower_convert_alu_types: use intrinsics_pass 2024-11-20 16:53:50 +00:00
nir_lower_cooperative_matrix.c nir: suppress clang warnings for cooperative matrix lowering 2025-12-08 19:36:05 +00:00
nir_lower_discard_if.c nir: make nir_block::predecessors & dom_frontier sets non-malloc'd 2025-08-21 06:13:48 +00:00
nir_lower_double_ops.c nir/lower_double_ops: don't create more exact ops than the input requires 2026-02-10 18:42:02 +00:00
nir_lower_explicit_io.c nir: allow get_ssbo_size to return a 64-bit result 2026-02-16 12:59:36 +00:00
nir_lower_fb_read.c nir: add nir_tex_instr::can_speculate 2025-07-24 18:41:38 +00:00
nir_lower_flatshade.c nir/lower_flatshade: clean up 2025-09-18 13:37:03 +00:00
nir_lower_floats.c nir, glsl: Add support for softfloat32 2025-11-14 19:31:17 +00:00
nir_lower_flrp.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_lower_fp16_conv.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_frag_coord_to_pixel_coord.c nir: Add SYSTEM_VALUE_FRAG_COORD_Z/W. 2025-06-18 23:11:36 +00:00
nir_lower_fragcolor.c nir: don't use variables as ralloc parents, use the shader instead 2025-08-05 22:55:13 +00:00
nir_lower_fragcoord_wtrans.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_frexp.c nir/lower_frexp: preserve fp_math_ctrl 2026-03-13 07:13:09 +00:00
nir_lower_global_vars_to_local.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_lower_goto_ifs.c nir/lower_goto_ifs: Add and lower loop continue constructs 2026-03-21 07:42:55 +00:00
nir_lower_gs_intrinsics.c nir: make nir_block::predecessors & dom_frontier sets non-malloc'd 2025-08-21 06:13:48 +00:00
nir_lower_halt_to_return.c nir: Add nir_lower_halt_to_return 2025-06-26 20:12:12 +00:00
nir_lower_helper_writes.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_idiv.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_lower_image.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_image_atomics_to_global.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_indirect_derefs_to_if_else_trees.c nir: rename nir_lower_indirect_derefs -> nir_lower_indirect_derefs_to_if_else_trees 2025-11-20 05:42:11 +00:00
nir_lower_input_attachments.c nir, tu: Add and use load_frag_coord_gmem_ir3 2025-12-08 20:44:45 +00:00
nir_lower_int64.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_int_to_float.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_lower_interpolation.c nir/lower_interpolation: check IO location correctly 2025-11-10 16:44:36 +00:00
nir_lower_io.c pan/bi: Add is_psiz_store flag in bi_instr 2026-03-19 11:25:30 +00:00
nir_lower_io_array_vars_to_elements.c all: rename gl_shader_stage to mesa_shader_stage 2025-08-06 10:28:40 +08:00
nir_lower_io_indirect_loads.c nir: Add missing #include for c99_alloca.h 2025-08-22 22:33:50 +00:00
nir_lower_io_to_scalar.c nir: merge xfb and xfb2 into one 64bit intrinsic index 2026-03-10 07:46:22 +00:00
nir_lower_io_vars_to_scalar.c nir: move nir_lower_io_vars_to_scalar into its own file 2025-06-26 18:20:49 +00:00
nir_lower_io_vars_to_temporaries.c nir/lower_io_vars: don't insert output stores for unrelated streams before emits 2025-12-06 02:27:46 +00:00
nir_lower_is_helper_invocation.c treewide: use nir_def_replace sometimes 2024-06-21 15:36:56 +00:00
nir_lower_load_const_to_scalar.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_lower_locals_to_regs.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_mediump.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_lower_mem_access_bit_sizes.c nir: don't assume indicies are always 32bit when accessing them as raw data 2026-03-10 07:46:20 +00:00
nir_lower_memcpy.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_memory_model.c nir/lower_memory_model: remove empty lowered barriers 2025-08-04 15:36:51 +00:00
nir_lower_multiview.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_non_uniform_access.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_lower_packing.c nir: clang-format 2025-02-26 15:19:53 +00:00
nir_lower_passthrough_edgeflags.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_lower_patch_vertices.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_lower_phis_to_scalar.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_pntc_ytransform.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_point_size.c Revert "nir: Add a type parameter to nir_lower_point_size()" 2026-03-12 22:59:13 +00:00
nir_lower_point_smooth.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_poly_line_smooth.c nir/lower_poly_line_smooth: use intrinsics_pass 2025-02-05 11:23:35 +00:00
nir_lower_printf.c nir/compiler_options: add nir_load_pixel_coord 2026-01-21 16:00:50 +00:00
nir_lower_readonly_images_to_tex.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_reg_intrinsics_to_ssa.c nir: Don't require nir_metadata_control_flow 2025-09-08 23:03:13 +00:00
nir_lower_returns.c nir: make nir_block::predecessors & dom_frontier sets non-malloc'd 2025-08-21 06:13:48 +00:00
nir_lower_robust_access.c nir: Handle format swizzles for OOB image loads 2026-03-12 19:14:24 +00:00
nir_lower_sample_shading.c nir/lower_sample_shading: Set the sample qualifier on in vars. 2025-08-03 20:27:39 +00:00
nir_lower_samplers.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_scratch.c nir: Introduce nir_lower_vars_to_scratch_global(). 2025-12-17 19:50:28 +00:00
nir_lower_scratch_to_var.c nir: rename nir_copy_prop -> nir_opt_copy_prop 2025-11-15 02:16:38 +00:00
nir_lower_shader_calls.c nir: add a new push_data_intel intrinsic 2026-01-09 14:19:46 +00:00
nir_lower_single_sampled.c nir: add nir_lower_single_sampled::lower_sample_mask_in option 2025-12-11 22:50:10 +00:00
nir_lower_ssbo.c treewide: use nir_store_global alias of nir_build_store_global 2025-10-21 12:37:58 +02:00
nir_lower_subgroups.c nir: Add intrinsics for ShuffleUpINTEL and ShuffleDownINTEL 2026-03-17 17:21:52 +00:00
nir_lower_system_values.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_sysvals_to_varyings.c nir: Support sysval intrinsics in lower_sysvals_to_varyings() 2025-11-29 00:50:32 +00:00
nir_lower_task_shader.c treewide: Replace calling to function ALIGN with align 2025-11-12 21:58:40 +00:00
nir_lower_terminate_to_demote.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_tess_coord_z.c nir: clang-format 2025-02-26 15:19:53 +00:00
nir_lower_tex.c compiler/nir: use common ycbcr math 2026-03-17 15:00:54 +00:00
nir_lower_tex_shadow.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_texcoord_replace.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_texcoord_replace_late.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_two_sided_color.c nir/lower_two_sided_color: cleanup 2025-10-29 15:52:27 +00:00
nir_lower_ubo_vec4.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_undef_to_zero.c panfrost,nir: drop my lonely Authors tags 2026-01-05 17:47:52 +00:00
nir_lower_uniforms_to_ubo.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_var_copies.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_variable_initializers.c treewide: use nir_break_if 2025-06-30 14:51:24 -04:00
nir_lower_vars_to_ssa.c nir/lower_vars_to_ssa: return early if there is no local variables to lower 2025-11-14 09:09:15 +00:00
nir_lower_vec3_to_vec4.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_vec_to_regs.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_view_index_to_device_index.c nir: add pass to convert ViewIndex to DeviceIndex 2024-08-07 19:09:55 +00:00
nir_lower_viewport_transform.c nir: Suppress 'potentially uninitialized local pointer variable used' warning 2026-01-13 23:31:28 +00:00
nir_lower_workgroup_size.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_lower_wpos_center.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_wpos_ytransform.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_wrmasks.c nir/lower_wrmasks: clean up & deprecate pass 2025-11-26 03:20:39 +00:00
nir_metadata.c treewide: Never preserve nir_metadata_dominance without nir_metadata_block_index 2025-11-14 09:09:14 +00:00
nir_mod_analysis.c nir: Fix mod analysis of ishl to shift the recursive result 2025-12-16 00:58:37 +00:00
nir_move_output_stores_to_end.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_move_vec_src_uses_to_dest.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_normalize_cubemap_coords.c nir/normalize_cubemap_coords: Handle the projector before the normalization 2025-12-30 16:25:09 +00:00
nir_opcodes.py nir: Fix constant folding for iadd_sat 2026-03-06 22:26:07 +00:00
nir_opcodes_c.py nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_opcodes_h.py
nir_opt_access.c compiler: rename gl_shader_stage_is_callable to mesa_shader_stage_is_callable 2025-08-06 10:28:41 +08:00
nir_opt_algebraic.py nir/opt_algebraic: optimize b2f(a) * b 2026-03-20 08:50:41 +00:00
nir_opt_barriers.c nir: convert nir_instr_worklist to init/fini semantics w/out allocation 2025-08-21 06:13:49 +00:00
nir_opt_barycentric.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_call.c nir: add a new push_data_intel intrinsic 2026-01-09 14:19:46 +00:00
nir_opt_clip_cull_const.c nir: rename nir_lower_io_to_temporaries -> nir_lower_io_vars_to_temporaries 2025-06-26 18:20:54 +00:00
nir_opt_combine_stores.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_comparison_pre.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_opt_constant_folding.c nir/opt_constant_folding: optimize ballot(false) 2026-02-18 23:40:44 +00:00
nir_opt_copy_prop_vars.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_copy_propagate.c nir: rename nir_copy_prop -> nir_opt_copy_prop 2025-11-15 02:16:38 +00:00
nir_opt_cse.c nir/opt_cse: update potential future plans merging copy propagation with CSE 2025-12-13 06:41:59 +00:00
nir_opt_dce.c nir: add a cmat call instruction type. 2025-11-17 23:33:58 +00:00
nir_opt_dead_cf.c nir/dead_cf: Add missing load_global_nv handling 2026-03-20 20:19:35 +00:00
nir_opt_dead_write_vars.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_find_array_copies.c nir/opt_find_array_copies: don't allocate the hash tables 2025-08-21 06:13:48 +00:00
nir_opt_fp_math_ctrl.c nir/opt_fp_math_ctrl: handle input/output no_signed_zero flag 2026-03-11 16:47:15 +00:00
nir_opt_frag_coord_to_pixel_coord.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_opt_fragdepth.c nir/opt_fragdepth: use data helper 2026-02-19 14:47:11 +00:00
nir_opt_gcm.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_generate_bfi.c nir/opt_generate_bfi: create vector and non 32bit bitfield_select 2025-07-21 20:42:32 +00:00
nir_opt_group_loads.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_idiv_const.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_opt_if.c nir: rework nir_alu_src_is_trivial_ssa to take an alu src 2026-03-20 08:50:41 +00:00
nir_opt_intrinsics.c nir: rework nir_alu_src_is_trivial_ssa to take an alu src 2026-03-20 08:50:41 +00:00
nir_opt_large_constants.c nir/opt_large_constants: only use 16bit float alu when supported 2026-03-19 06:59:18 +00:00
nir_opt_licm.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_load_skip_helpers.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_load_store_vectorize.c nir/opt_load_store_vectorize: use nir_intrinsic_has_align_mul 2026-03-12 17:00:25 +00:00
nir_opt_loop.c nir/opt_loop: Relax restrictions on opt_loop_peel_initial_break() for more loops 2026-01-26 12:02:49 +00:00
nir_opt_loop_unroll.c nir/opt_loop_unroll: Always unroll loops with a known trip-count of 0 2026-02-10 09:24:23 +00:00
nir_opt_memcpy.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_opt_move.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_move_discards_to_top.c nir: add a cmat call instruction type. 2025-11-17 23:33:58 +00:00
nir_opt_move_to_top.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_mqsad.c nir: convert nir_instr_worklist to init/fini semantics w/out allocation 2025-08-21 06:13:49 +00:00
nir_opt_non_uniform_access.c nir/opt_non_uniform_access: use new query flags 2026-02-16 12:59:36 +00:00
nir_opt_offsets.c nir, nvk, nak: Add base to isbewr_nv and isberd_nv 2026-03-11 19:41:34 +00:00
nir_opt_peephole_select.c nir: add a new push_data_intel intrinsic 2026-01-09 14:19:46 +00:00
nir_opt_phi_precision.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_opt_phi_to_bool.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_preamble.c nir,aco: Clean up useless lowering of sbt_base_amd 2026-01-14 14:19:07 +00:00
nir_opt_ray_queries.c nir: Ignore ray query ranges that don't start with rq_initialize 2025-12-11 15:56:29 +00:00
nir_opt_reassociate.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_opt_reassociate_bfi.c nir/opt_reassociate_bfi: use alu_pass 2024-08-10 13:40:21 +00:00
nir_opt_rematerialize_compares.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_opt_remove_phis.c nir/opt_remove_phis: recursively check loop header phis for triviality 2026-03-04 14:03:40 +00:00
nir_opt_shrink_stores.c nir/shrink_stores: Don't shrink stores to an invalid num_components. 2025-10-20 19:24:45 +00:00
nir_opt_shrink_vectors.c nir: rework nir_alu_src_is_trivial_ssa to take an alu src 2026-03-20 08:50:41 +00:00
nir_opt_sink.c nir/opt_sink: sink pack_64_2x32_split 2026-03-13 17:03:00 +00:00
nir_opt_undef.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_uniform_atomics.c nir/opt_uniform_atomics: use data helper 2026-02-19 14:47:11 +00:00
nir_opt_uniform_subgroup.c nir: Add intrinsics for ShuffleUpINTEL and ShuffleDownINTEL 2026-03-17 17:21:52 +00:00
nir_opt_uub.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_varyings.c nir/opt_varyings: Skip code-motion for upconversions 2026-03-11 23:52:10 +00:00
nir_opt_vectorize.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_opt_vectorize_io.c nir: add no_signed_zero flag to io semantics 2026-03-11 16:47:15 +00:00
nir_opt_vectorize_io_vars.c treewide: use UTIL_DYNARRAY_INIT 2025-11-04 13:39:48 +00:00
nir_passthrough_gs.c nir: don't use variables as ralloc parents, use the shader instead 2025-08-05 22:55:13 +00:00
nir_passthrough_tcs.c
nir_phi_builder.c nir/phi_builder: Adjust valid_metadata assert 2025-09-08 23:03:13 +00:00
nir_phi_builder.h nir: Don't include the full nir.h when not necessary. 2025-02-12 22:33:07 +01:00
nir_precompiled.h panfrost: fix cl_local_size for precompiled shaders 2025-10-22 00:15:49 +00:00
nir_print.c pan/bi,nir: Divide memory_access from segments 2026-03-19 11:25:30 +00:00
nir_propagate_invariant.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_range_analysis.c nir: Add intrinsics for ShuffleUpINTEL and ShuffleDownINTEL 2026-03-17 17:21:52 +00:00
nir_range_analysis.h nir/range_analysis: use a dense array 2026-03-13 15:38:54 +00:00
nir_recompute_io_bases.c nir/recompute_io_bases: Fix num_inputs with dual-slot VS inputs 2026-01-21 20:54:14 +00:00
nir_remove_dead_variables.c nir/remove_dead_variables: don't ralloc the set 2025-08-21 06:13:48 +00:00
nir_remove_outputs.c nir: Add new nir_remove_outputs pass 2026-01-01 21:25:42 -06:00
nir_remove_tex_shadow.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_repair_ssa.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_scale_fdiv.c nir/scale_fdiv: handle fp16 fdiv 2025-06-05 13:17:27 +00:00
nir_schedule.c nir: add a cmat call instruction type. 2025-11-17 23:33:58 +00:00
nir_schedule.h nir/schedule: return progress and fix metadata 2025-07-23 06:47:58 +00:00
nir_search.c nir/search: never insert movs for alu uses 2026-03-20 08:50:41 +00:00
nir_search.h nir: add nir_fp_analysis_state 2026-03-05 11:26:25 +00:00
nir_search_helpers.h nir/opt_algebraic: optimize b2f(a) * b 2026-03-20 08:50:41 +00:00
nir_separate_merged_clip_cull_io.c nir: add nir_separate_merged_clip_cull_io 2025-11-15 03:30:10 +00:00
nir_serialize.c nir: don't assume indicies are always 32bit when accessing them as raw data 2026-03-10 07:46:20 +00:00
nir_serialize.h nir: Don't include the full nir.h when not necessary. 2025-02-12 22:33:07 +01:00
nir_shader_bisect.c nir: Add a shader bisect tool. 2025-10-09 17:56:30 +00:00
nir_shader_bisect.py nir/shader_bisect: Allow passing in a --lo / --hi to continue a run. 2025-12-04 22:47:25 +00:00
nir_shader_compiler_options.h nir: replace lower_ldexp with has_ldexp 2026-03-20 08:15:08 +00:00
nir_softfloat.h nir, glsl: Add support for softfloat32 2025-11-14 19:31:17 +00:00
nir_split_64bit_vec3_and_vec4.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_split_conversions.c nak,nir: Generalize nak_nir_split_64bit_conversions and move it to NIR 2025-03-29 03:02:17 +00:00
nir_split_per_member_structs.c nir: clang-format 2025-02-26 15:19:53 +00:00
nir_split_var_copies.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_split_vars.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_stub.c nir: Only build nir headers for mediafoundation/d3d12-no-graphics paired build 2025-05-09 16:34:00 +00:00
nir_sweep.c nir/sweep: fix use-after-free with dominance LCA 2025-11-13 20:17:22 +00:00
nir_tcs_info.h nir/tcs_info: use range analysis to determine the range of tess levels 2025-05-28 06:46:56 +00:00
nir_to_lcssa.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_trivialize_registers.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_unlower_io_to_vars.c mesa/st: fix unlower_io_to_vars to work with mesh shaders 2026-03-12 22:02:57 +00:00
nir_use_dominance.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_validate.c nir: Allow 8-bit vertex output stores 2026-03-12 22:59:13 +00:00
nir_vla.h
nir_worklist.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_worklist.h nir/search: add nir_search_state 2025-08-22 15:45:55 +00:00
nir_xfb_info.h nir/xfb_info: add new fields to describe 16-bit XFB better 2025-06-02 23:08:58 +00:00
README

New IR, or NIR, is an IR for Mesa intended to sit below GLSL IR and Mesa IR.
Its design inherits from the various IRs that Mesa has used in the past, as
well as Direct3D assembly, and it includes a few new ideas as well. It is a
flat (in terms of using instructions instead of expressions), typeless IR,
similar to TGSI and Mesa IR.  It also supports SSA (although it doesn't require
it).

Variables
=========

NIR includes support for source-level GLSL variables through a structure mostly
copied from GLSL IR. These will be used for linking and conversion from GLSL IR
(and later, from an AST), but for the most part, they will be lowered to
registers (see below) and loads/stores.

Registers
=========

Registers are lightweight: each one is a structure that contains only its
size, its index for liveness analysis, and an optional name for debugging. In
addition, a register can be local to a function or global to the entire shader;
the latter will be used in ARB_shader_subroutine for passing parameters and
getting return values from subroutines. A register can also be an array, in
which case it can be accessed indirectly. Each ALU instruction (add, subtract,
etc.) works directly with registers or SSA values (see below).
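
As a rough illustration of how small such a record can be, here is a minimal
sketch in C. All names (sketch_register and its fields) are hypothetical and
chosen for this example only; they are not NIR's actual definitions.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical register record; names are illustrative, not NIR's. */
struct sketch_register {
   unsigned size;            /* size of the register */
   unsigned index;           /* dense index, e.g. for liveness analysis */
   const char *name;         /* optional debug name, may be NULL */
   unsigned num_array_elems; /* 0 means "not an array" */
   bool is_global;           /* global to the shader vs. local to a function */
};

/* Indirect access is only meaningful for array registers. */
static bool
sketch_register_allows_indirect(const struct sketch_register *reg)
{
   assert(reg != NULL);
   return reg->num_array_elems > 0;
}
```

Keeping the array length in the same record lets a pass decide whether indirect
addressing is possible without consulting any side table.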

SSA
===

Everywhere a register can be loaded/stored, an SSA value can be used instead.
The only exception is that arrays/indirect addressing are not supported with
SSA. Research has been done on extending SSA to arrays, but it usually targets
parallelization (which we're not interested in) and adds overhead in the form
of extra copies or extra arrays (which is much more expensive than introducing
copies between non-array registers). Each SSA use points directly to its
corresponding definition, which in turn points to the instruction it is part
of. This creates an implicit use-def chain and avoids the need for an external
structure for each SSA register.
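
The pointer layout described above might look roughly like the following
sketch. The type and field names (sketch_ssa_def, sketch_ssa_use, sketch_instr)
are assumptions made for illustration and do not reproduce NIR's real types.

```c
#include <assert.h>
#include <stddef.h>

struct sketch_instr;

/* An SSA definition knows which instruction produces it. */
struct sketch_ssa_def {
   struct sketch_instr *parent_instr;
};

/* A use (source operand) points straight at the definition it reads. */
struct sketch_ssa_use {
   struct sketch_ssa_def *def;
};

struct sketch_instr {
   const char *opcode;            /* illustrative only */
   struct sketch_ssa_def def;     /* value produced by this instruction */
   struct sketch_ssa_use srcs[2]; /* operands read by this instruction */
   unsigned num_srcs;
};

/* Follow the implicit use-def chain: use -> def -> defining instruction. */
static struct sketch_instr *
sketch_use_to_instr(const struct sketch_ssa_use *use)
{
   assert(use != NULL && use->def != NULL);
   return use->def->parent_instr;
}
```

Because the chain is just two pointer hops, no separate per-value table is
needed to answer "which instruction produced this operand?".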

Functions
=========

Support for function calls is mostly similar to GLSL IR. Each shader contains a
list of functions, and each function has a list of overloads. Each overload
contains a list of parameters, and may contain an implementation which specifies
the variables that correspond to the parameters and return value. Inlining a
function, assuming it has a single return point, is as simple as copying its
instructions, registers, and local variables into the target function and then
inserting copies to and from the new parameters as appropriate. After functions
are inlined and any non-subroutine functions are deleted, parameters and return
variables will be converted to global variables and then global registers. We
don't do this lowering earlier (i.e. the fortranizer idea) for a few reasons:

- If we want to do optimizations before link time, we need to have the function
signature available at link time.

- If we do any inlining before link time, then we might wind up with the
inlined function and the non-inlined function using the same global
variables/registers, which would preclude optimization.

Intrinsics
==========

Any operation (other than function calls and textures) which touches a variable
or is not referentially transparent is represented by an intrinsic. Intrinsics
are similar to the idea of a "builtin function," i.e. a function declaration
whose implementation is provided by the backend, except they are more powerful
in the following ways:

- They can also load and store registers when appropriate, which limits the
number of variables needed in later stages of the IR while obviating the need
for a separate load/store variable instruction.

- Intrinsics can be marked as side-effect free, which permits them to be
treated like any other instruction when it comes to optimizations. This allows
load intrinsics to be represented as intrinsics while still being optimized
away by dead code elimination, common subexpression elimination, etc.
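
To make the second point concrete, here is a minimal sketch of how a
side-effect-free flag could feed a dead code elimination check. The flag name,
struct, and helper are hypothetical and are not taken from NIR.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical per-intrinsic flag; the name is an assumption. */
#define SKETCH_INTRINSIC_NO_SIDE_EFFECTS (1u << 0)

struct sketch_intrinsic {
   const char *name;
   unsigned flags;
   unsigned num_uses; /* how many instructions read its result */
};

/* An intrinsic can be dropped only if nothing reads its result and
 * executing it has no observable effect. */
static bool
sketch_intrinsic_is_dead(const struct sketch_intrinsic *intr)
{
   assert(intr != NULL);
   return intr->num_uses == 0 &&
          (intr->flags & SKETCH_INTRINSIC_NO_SIDE_EFFECTS) != 0;
}
```

Under this scheme an unused load is removable like any ALU instruction, while
something like a barrier stays in place even though nothing reads a result
from it.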

Intrinsics are used for:

- Atomic operations
- Memory barriers
- Subroutine calls
- Geometry shader emitVertex and endPrimitive
- Loading and storing variables (before lowering)
- Loading and storing uniforms, shader inputs and outputs, etc. (after lowering)
- Copying variables (cases where in GLSL the destination is a structure or
array)
- The kitchen sink
- ...

Textures
========

Unfortunately, there are far too many texture operations to represent each one
of them with an intrinsic, so there's a special texture instruction similar to
the GLSL IR one. The biggest difference is that, while the texture instruction
has a sampler dereference field used just like in GLSL IR, this gets lowered to
a texture unit index (with a possible indirect offset) while the type
information of the original sampler is kept around for backends. Also, all the
non-constant sources are stored in a single array to make it easier for
optimization passes to iterate over all the sources.
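
The benefit of keeping all sources in one array can be sketched as follows.
The source kinds, struct layout, and helper name are assumptions for
illustration only and do not mirror the real texture instruction.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical tags saying what role each source plays. */
enum sketch_tex_src_type {
   SKETCH_TEX_SRC_COORD,
   SKETCH_TEX_SRC_LOD,
   SKETCH_TEX_SRC_BIAS,
};

struct sketch_tex_src {
   enum sketch_tex_src_type type;
   int value_id; /* stand-in for a register or SSA value */
};

#define SKETCH_TEX_MAX_SRCS 4

struct sketch_tex_instr {
   unsigned texture_unit; /* lowered from the sampler dereference */
   struct sketch_tex_src srcs[SKETCH_TEX_MAX_SRCS];
   unsigned num_srcs;
};

/* Return the array slot holding the given kind of source, or -1. */
static int
sketch_tex_find_src(const struct sketch_tex_instr *tex,
                    enum sketch_tex_src_type type)
{
   assert(tex != NULL && tex->num_srcs <= SKETCH_TEX_MAX_SRCS);
   for (unsigned i = 0; i < tex->num_srcs; i++) {
      if (tex->srcs[i].type == type)
         return (int)i;
   }
   return -1;
}
```

A pass that rewrites operands can then use one loop over srcs instead of
special-casing a separate field for each kind of source.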

Control Flow
============

Like in GLSL IR, control flow consists of a tree of "control flow nodes", which
include if statements and loops, and jump instructions (break, continue, and
return). Unlike GLSL IR, though, the leaves of the tree aren't statements but
basic blocks. Each basic block also keeps track of its successors and
predecessors, and function implementations keep track of the beginning basic
block (the first basic block of the function) and the ending basic block (a fake
basic block that every return statement points to). Together, these elements
make up the control flow graph, in this case a redundant piece of information on
top of the control flow tree that will be used by almost all the optimizations.
There are helper functions to add and remove control flow nodes that also update
the control flow graph, and so usually it doesn't need to be touched by passes
that modify control flow nodes.
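
The graph side of this can be sketched as a helper that keeps successor and
predecessor information in sync. This is only an illustration under stated
assumptions: sketch_block, the fixed-size predecessor array, and
sketch_link_blocks are hypothetical and are not NIR's actual helpers.

```c
#include <assert.h>
#include <stddef.h>

#define SKETCH_MAX_PREDS 8

/* Hypothetical basic block: at most two successors, a few predecessors. */
struct sketch_block {
   struct sketch_block *successors[2];
   struct sketch_block *predecessors[SKETCH_MAX_PREDS];
   unsigned num_predecessors;
};

/* Add the edge pred -> succ, updating both ends so the graph stays
 * consistent no matter which direction a pass walks it. */
static void
sketch_link_blocks(struct sketch_block *pred, struct sketch_block *succ)
{
   assert(pred != NULL && succ != NULL);
   unsigned slot = pred->successors[0] == NULL ? 0 : 1;
   assert(pred->successors[slot] == NULL);
   assert(succ->num_predecessors < SKETCH_MAX_PREDS);
   pred->successors[slot] = succ;
   succ->predecessors[succ->num_predecessors++] = pred;
}
```

Linking the four blocks of a simple if statement (entry, then, else, merge)
with this helper leaves the merge block with two predecessors, which is the
kind of bookkeeping passes would otherwise have to repeat by hand.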