mesa/src/compiler/nir
Georg Lehmann 6936282bd3 nir/opt_algebraic: remove min(a, >= 1.0) before fsat
Foz-DB Navi48:
Totals from 86 (0.08% of 114655) affected shaders:
Instrs: 217553 -> 217408 (-0.07%); split: -0.07%, +0.01%
CodeSize: 1159992 -> 1159380 (-0.05%); split: -0.06%, +0.01%
Latency: 1657600 -> 1657533 (-0.00%); split: -0.01%, +0.00%
InvThroughput: 203205 -> 203178 (-0.01%); split: -0.02%, +0.00%
SClause: 5245 -> 5244 (-0.02%)
Copies: 13726 -> 13716 (-0.07%); split: -0.14%, +0.07%
VALU: 130151 -> 130039 (-0.09%); split: -0.09%, +0.00%
SALU: 26476 -> 26474 (-0.01%); split: -0.02%, +0.01%

Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40281>
2026-03-09 21:11:25 +00:00
tests nir: add fp class analysis tests 2026-03-07 05:01:45 +00:00
.clang-format
meson.build nir: add a pass to optimize fp_math_ctrl 2026-03-07 08:16:27 +01:00
nir.c nir,spirv: Add support for SPV_QCOM_image_processing. 2026-01-27 02:00:40 +00:00
nir.h nir: add a pass to optimize fp_math_ctrl 2026-03-07 08:16:27 +01:00
nir_algebraic.py nir/algebraic: remove ignore_exact 2026-02-18 14:04:22 +00:00
nir_blend_equation_advanced_helper.h nir: Extract blend equation helpers to shared header 2026-01-24 21:44:01 +00:00
nir_builder.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_builder.h nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_builder_opcodes_h.py src/compiler: Fix warning C4244 'argument' : conversion from 'type1' to 'type2', possible loss of data 2025-01-15 21:40:20 +00:00
nir_builtin_builder.c nir: add nir_tex_instr::can_speculate 2025-07-24 18:41:38 +00:00
nir_builtin_builder.h nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_clip_cull_distance_io_utils.c nir/clip_cull_distance_utils: add more assertions validating the type & sizes 2026-01-05 21:24:10 +00:00
nir_clone.c nir/clone: Fix cloning indirect call instructions 2026-02-13 11:27:59 +00:00
nir_constant_expressions.h nir: Let nir_eval_const_opcode() return a poison mask in case of UB. 2026-01-15 19:09:32 +00:00
nir_constant_expressions.py nir/constant_expressions: flush input denorms if denorms have to be flushed 2026-01-19 16:11:28 +00:00
nir_control_flow.c nir: Update progress info in nir_sort_unstructured_blocks 2025-11-17 10:30:37 +00:00
nir_control_flow.h nir/cf: have nir_remove_after_cf_node remove phis at the start too 2025-07-23 09:06:36 +00:00
nir_control_flow_private.h
nir_conversion_builder.h nir: fix nir_round_int_to_float for fp16 2026-03-04 14:32:35 +00:00
nir_defines.h nir: add support for pixel_local_storage variables 2025-11-18 20:25:42 +00:00
nir_deref.c nir: Use instr_clone in rematerialize_deref_in_block 2025-12-10 22:07:45 +00:00
nir_deref.h nir: Don't include the full nir.h when not necessary. 2025-02-12 22:33:07 +01:00
nir_divergence_analysis.c nir: Fix divergence of Intel URB input/output handle intrinsics 2026-03-09 02:38:59 +00:00
nir_dominance.c treewide: Spell indices correctly 2025-09-08 23:03:13 +00:00
nir_dominance_lca.c nir: Add a faster lowest common ancestor algorithm 2025-09-08 23:03:13 +00:00
nir_downgrade_pls_vars.c nir: Add a pass to downgrade inout PLS vars to {in,out} only ones 2025-11-18 20:25:43 +00:00
nir_fixup_is_exported.c nir: fix nir_fixup_is_exported for LLVM-22 2026-01-30 16:06:25 +00:00
nir_format_convert.c nir/format_convert: use nan/inf preserve flag for fmax instead of exact 2026-02-10 18:42:02 +00:00
nir_format_convert.h nir: add a high precision conversion unorm->float 2025-02-06 08:20:21 +00:00
nir_from_ssa.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_functions.c nir: Use STACK_ARRAY instead of NIR_VLA 2026-02-14 01:19:27 +00:00
nir_gather_info.c nir/gather_info: Add support for panfrost tile load/store intrinsics 2026-02-23 21:00:01 +00:00
nir_gather_output_deps.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_gather_tcs_info.c nir/gather_tcs_info: use nir_analyze_fp_class directly 2026-03-07 05:01:44 +00:00
nir_gather_types.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_gather_xfb_info.c util/dynarray: infer type in append 2025-10-24 18:32:07 +00:00
nir_gs_count_vertices.c nir: make nir_block::predecessors & dom_frontier sets non-malloc'd 2025-08-21 06:13:48 +00:00
nir_inline_helpers.h nir: add a cmat call instruction type. 2025-11-17 23:33:58 +00:00
nir_inline_sysval.c nir: add nir_inline_sysval pass 2025-08-03 21:27:47 +00:00
nir_inline_uniforms.c nir: speedup nir_find_inlinable_uniforms() 2026-02-05 23:19:29 +00:00
nir_instr_set.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_instr_set.h nir/instr_set: don't ralloc the set 2025-08-21 06:13:48 +00:00
nir_intrinsics.py brw/nir: add new intrinsics to load data from the indirect address 2026-03-06 06:34:43 +00:00
nir_intrinsics_c.py meson: remove '--outdir' argument in script 2025-10-08 20:51:20 +00:00
nir_intrinsics_h.py meson: remove '--outdir' argument in script 2025-10-08 20:51:20 +00:00
nir_intrinsics_indices_h.py nir: Disable gcc warning -Wstringop-overflow for nir_intrinsic_set_* for latter commit 2025-11-12 21:22:13 +00:00
nir_io_add_xfb_info.c nir: move nir_io_add_intrinsic_xfb_info into its own file 2025-06-26 18:20:51 +00:00
nir_legacy.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_legacy.h nir: Add bool return value to nir_legacy_trivialize(..) 2025-03-06 03:29:20 +00:00
nir_linking_helpers.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_liveness.c nir, vk: fix MSVC unused variable warning 2025-11-28 01:52:12 +00:00
nir_loop_analyze.c nir/loop_analyze: also set force_unroll if the array_size is larger than max_trip_count 2026-02-10 09:24:23 +00:00
nir_loop_analyze.h treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_alpha.c nir/lower_alpha: extend to support dynamic a2c 2025-09-30 12:15:53 +00:00
nir_lower_alu.c nir/opt_algebraic: make bcsel(fcmp(b, a), b, a) -> fmin/fmax patterns exact 2026-02-10 18:42:03 +00:00
nir_lower_alu_width.c nir: remove fall_equal/fany_nequal opcodes 2026-03-04 19:50:27 +00:00
nir_lower_amul.c nir: add intrinsics for pixel local storage 2025-11-18 20:25:42 +00:00
nir_lower_array_deref_of_vec.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_atomics.c nir/lower_atomics: use data helper 2026-02-19 14:47:11 +00:00
nir_lower_atomics_to_ssbo.c nir: add ACCESS_ATOMIC 2025-10-07 17:41:30 +00:00
nir_lower_bit_size.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_bitmap.c nir: add nir_tex_instr::can_speculate 2025-07-24 18:41:38 +00:00
nir_lower_blend.c nir/lower_blend: Add advanced blending support 2026-01-24 21:44:01 +00:00
nir_lower_blend.h nir/lower_blend: Add advanced blending support 2026-01-24 21:44:01 +00:00
nir_lower_bool_to_bitsize.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_lower_bool_to_float.c nir/lower_bool_to_float: assert that vector comparisons were lowered 2026-03-04 19:50:27 +00:00
nir_lower_bool_to_int32.c treewide: use nir_metadata_control_flow 2024-06-17 16:28:14 -04:00
nir_lower_calls_to_builtins.c nir/nir_lower_calls_to_builtins: trivially handle IA64 mangled functions 2025-09-02 16:04:19 +00:00
nir_lower_cl_images.c treewide: use BITSET_*_COUNT 2025-12-16 17:42:10 +00:00
nir_lower_clamp_color_outputs.c nir: remove handling IO variables from passes used by st/mesa 2025-01-22 02:15:04 +00:00
nir_lower_clip.c nir/lower_clip: Correctly handle driver_location in VS lowering 2026-01-21 20:54:15 +00:00
nir_lower_clip_disable.c nir: remove handling IO variables from passes used by st/mesa 2025-01-22 02:15:04 +00:00
nir_lower_clip_halfz.c nir: Add bool return value to nir_lower_clip_halfz(..) 2025-02-19 21:16:05 +00:00
nir_lower_const_arrays_to_uniforms.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_continue_constructs.c nir: Add an option to make lower_phis_to_regs_block() less clever 2025-08-29 01:24:56 +00:00
nir_lower_convert_alu_types.c nir/lower_convert_alu_types: use intrinsics_pass 2024-11-20 16:53:50 +00:00
nir_lower_cooperative_matrix.c nir: suppress clang warnings for cooperative matrix lowering 2025-12-08 19:36:05 +00:00
nir_lower_discard_if.c nir: make nir_block::predecessors & dom_frontier sets non-malloc'd 2025-08-21 06:13:48 +00:00
nir_lower_double_ops.c nir/lower_double_ops: don't create more exact ops than the input requires 2026-02-10 18:42:02 +00:00
nir_lower_explicit_io.c nir: allow get_ssbo_size to return a 64-bit result 2026-02-16 12:59:36 +00:00
nir_lower_fb_read.c nir: add nir_tex_instr::can_speculate 2025-07-24 18:41:38 +00:00
nir_lower_flatshade.c nir/lower_flatshade: clean up 2025-09-18 13:37:03 +00:00
nir_lower_floats.c nir, glsl: Add support for softfloat32 2025-11-14 19:31:17 +00:00
nir_lower_flrp.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_lower_fp16_conv.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_frag_coord_to_pixel_coord.c nir: Add SYSTEM_VALUE_FRAG_COORD_Z/W. 2025-06-18 23:11:36 +00:00
nir_lower_fragcolor.c nir: don't use variables as ralloc parents, use the shader instead 2025-08-05 22:55:13 +00:00
nir_lower_fragcoord_wtrans.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_frexp.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_global_vars_to_local.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_lower_goto_ifs.c nir: Add an option to make lower_phis_to_regs_block() less clever 2025-08-29 01:24:56 +00:00
nir_lower_gs_intrinsics.c nir: make nir_block::predecessors & dom_frontier sets non-malloc'd 2025-08-21 06:13:48 +00:00
nir_lower_halt_to_return.c nir: Add nir_lower_halt_to_return 2025-06-26 20:12:12 +00:00
nir_lower_helper_writes.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_idiv.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_lower_image.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_image_atomics_to_global.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_indirect_derefs_to_if_else_trees.c nir: rename nir_lower_indirect_derefs -> nir_lower_indirect_derefs_to_if_else_trees 2025-11-20 05:42:11 +00:00
nir_lower_input_attachments.c nir, tu: Add and use load_frag_coord_gmem_ir3 2025-12-08 20:44:45 +00:00
nir_lower_int64.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_int_to_float.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_lower_interpolation.c nir/lower_interpolation: check IO location correctly 2025-11-10 16:44:36 +00:00
nir_lower_io.c nir/lower_io: handle Intel URB intrinsics 2026-03-06 13:28:32 +00:00
nir_lower_io_array_vars_to_elements.c all: rename gl_shader_stage to mesa_shader_stage 2025-08-06 10:28:40 +08:00
nir_lower_io_indirect_loads.c nir: Add missing #include for c99_alloca.h 2025-08-22 22:33:50 +00:00
nir_lower_io_to_scalar.c nir: Use nir_shader_intrinsics_pass in nir_lower_io_to_scalar 2025-12-05 22:30:22 +00:00
nir_lower_io_vars_to_scalar.c nir: move nir_lower_io_vars_to_scalar into its own file 2025-06-26 18:20:49 +00:00
nir_lower_io_vars_to_temporaries.c nir/lower_io_vars: don't insert output stores for unrelated streams before emits 2025-12-06 02:27:46 +00:00
nir_lower_is_helper_invocation.c treewide: use nir_def_replace sometimes 2024-06-21 15:36:56 +00:00
nir_lower_load_const_to_scalar.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_lower_locals_to_regs.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_mediump.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_lower_mem_access_bit_sizes.c nir: add a new push_data_intel intrinsic 2026-01-09 14:19:46 +00:00
nir_lower_memcpy.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_memory_model.c nir/lower_memory_model: remove empty lowered barriers 2025-08-04 15:36:51 +00:00
nir_lower_multiview.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_non_uniform_access.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_lower_packing.c nir: clang-format 2025-02-26 15:19:53 +00:00
nir_lower_passthrough_edgeflags.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_lower_patch_vertices.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_lower_phis_to_scalar.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_pntc_ytransform.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_point_size.c nir: Add a type parameter to nir_lower_point_size() 2025-11-12 01:34:36 +00:00
nir_lower_point_smooth.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_poly_line_smooth.c nir/lower_poly_line_smooth: use intrinsics_pass 2025-02-05 11:23:35 +00:00
nir_lower_printf.c nir/compiler_options: add nir_load_pixel_coord 2026-01-21 16:00:50 +00:00
nir_lower_readonly_images_to_tex.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_reg_intrinsics_to_ssa.c nir: Don't require nir_metadata_control_flow 2025-09-08 23:03:13 +00:00
nir_lower_returns.c nir: make nir_block::predecessors & dom_frontier sets non-malloc'd 2025-08-21 06:13:48 +00:00
nir_lower_robust_access.c nir: allow get_ssbo_size to return a 64-bit result 2026-02-16 12:59:36 +00:00
nir_lower_sample_shading.c nir/lower_sample_shading: Set the sample qualifier on in vars. 2025-08-03 20:27:39 +00:00
nir_lower_samplers.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_scratch.c nir: Introduce nir_lower_vars_to_scratch_global(). 2025-12-17 19:50:28 +00:00
nir_lower_scratch_to_var.c nir: rename nir_copy_prop -> nir_opt_copy_prop 2025-11-15 02:16:38 +00:00
nir_lower_shader_calls.c nir: add a new push_data_intel intrinsic 2026-01-09 14:19:46 +00:00
nir_lower_single_sampled.c nir: add nir_lower_single_sampled::lower_sample_mask_in option 2025-12-11 22:50:10 +00:00
nir_lower_ssbo.c treewide: use nir_store_global alias of nir_build_store_global 2025-10-21 12:37:58 +02:00
nir_lower_subgroups.c nir/lower_subgroups: fix boolean clustered reductions 2026-03-09 14:50:37 +00:00
nir_lower_system_values.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_sysvals_to_varyings.c nir: Support sysval intrinsics in lower_sysvals_to_varyings() 2025-11-29 00:50:32 +00:00
nir_lower_task_shader.c treewide: Replace calling to function ALIGN with align 2025-11-12 21:58:40 +00:00
nir_lower_terminate_to_demote.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_lower_tess_coord_z.c nir: clang-format 2025-02-26 15:19:53 +00:00
nir_lower_tex.c nir/lower_tex: use f2f32 instead of unpack_half 2026-02-06 06:12:36 +00:00
nir_lower_tex_shadow.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_texcoord_replace.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_texcoord_replace_late.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_two_sided_color.c nir/lower_two_sided_color: cleanup 2025-10-29 15:52:27 +00:00
nir_lower_ubo_vec4.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_undef_to_zero.c panfrost,nir: drop my lonely Authors tags 2026-01-05 17:47:52 +00:00
nir_lower_uniforms_to_ubo.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_var_copies.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_lower_variable_initializers.c treewide: use nir_break_if 2025-06-30 14:51:24 -04:00
nir_lower_vars_to_ssa.c nir/lower_vars_to_ssa: return early if there is no local variables to lower 2025-11-14 09:09:15 +00:00
nir_lower_vec3_to_vec4.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_vec_to_regs.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_lower_view_index_to_device_index.c nir: add pass to convert ViewIndex to DeviceIndex 2024-08-07 19:09:55 +00:00
nir_lower_viewport_transform.c nir: Suppress 'potentially uninitialized local pointer variable used' warning 2026-01-13 23:31:28 +00:00
nir_lower_workgroup_size.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_lower_wpos_center.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_wpos_ytransform.c treewide: simplify nir_def_rewrite_uses_after 2025-08-01 15:34:24 +00:00
nir_lower_wrmasks.c nir/lower_wrmasks: clean up & deprecate pass 2025-11-26 03:20:39 +00:00
nir_metadata.c treewide: Never preserve nir_metadata_dominance without nir_metadata_block_index 2025-11-14 09:09:14 +00:00
nir_mod_analysis.c nir: Fix mod analysis of ishl to shift the recursive result 2025-12-16 00:58:37 +00:00
nir_move_output_stores_to_end.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_move_vec_src_uses_to_dest.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_normalize_cubemap_coords.c nir/normalize_cubemap_coords: Handle the projector before the normalization 2025-12-30 16:25:09 +00:00
nir_opcodes.py nir: Fix constant folding for iadd_sat 2026-03-06 22:26:07 +00:00
nir_opcodes_c.py nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_opcodes_h.py
nir_opt_access.c compiler: rename gl_shader_stage_is_callable to mesa_shader_stage_is_callable 2025-08-06 10:28:41 +08:00
nir_opt_algebraic.py nir/opt_algebraic: remove min(a, >= 1.0) before fsat 2026-03-09 21:11:25 +00:00
nir_opt_barriers.c nir: convert nir_instr_worklist to init/fini semantics w/out allocation 2025-08-21 06:13:49 +00:00
nir_opt_barycentric.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_call.c nir: add a new push_data_intel intrinsic 2026-01-09 14:19:46 +00:00
nir_opt_clip_cull_const.c nir: rename nir_lower_io_to_temporaries -> nir_lower_io_vars_to_temporaries 2025-06-26 18:20:54 +00:00
nir_opt_combine_stores.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_comparison_pre.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_opt_constant_folding.c nir/opt_constant_folding: optimize ballot(false) 2026-02-18 23:40:44 +00:00
nir_opt_copy_prop_vars.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_copy_propagate.c nir: rename nir_copy_prop -> nir_opt_copy_prop 2025-11-15 02:16:38 +00:00
nir_opt_cse.c nir/opt_cse: update potential future plans merging copy propagation with CSE 2025-12-13 06:41:59 +00:00
nir_opt_dce.c nir: add a cmat call instruction type. 2025-11-17 23:33:58 +00:00
nir_opt_dead_cf.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_dead_write_vars.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_find_array_copies.c nir/opt_find_array_copies: don't allocate the hash tables 2025-08-21 06:13:48 +00:00
nir_opt_fp_math_ctrl.c nir: add a pass to optimize fp_math_ctrl 2026-03-07 08:16:27 +01:00
nir_opt_frag_coord_to_pixel_coord.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_opt_fragdepth.c nir/opt_fragdepth: use data helper 2026-02-19 14:47:11 +00:00
nir_opt_gcm.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_generate_bfi.c nir/opt_generate_bfi: create vector and non 32bit bitfield_select 2025-07-21 20:42:32 +00:00
nir_opt_group_loads.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_idiv_const.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_opt_if.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_opt_intrinsics.c nir/opt_intrinsics: use data helpers 2026-02-19 14:47:11 +00:00
nir_opt_large_constants.c nir/opt_large_constants: Fix dead deref instructions accessing lowered variables 2025-11-14 09:09:14 +00:00
nir_opt_licm.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_opt_load_skip_helpers.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_load_store_vectorize.c brw/nir: add new intrinsics to load data from the indirect address 2026-03-06 06:34:43 +00:00
nir_opt_loop.c nir/opt_loop: Relax restrictions on opt_loop_peel_initial_break() for more loops 2026-01-26 12:02:49 +00:00
nir_opt_loop_unroll.c nir/opt_loop_unroll: Always unroll loops with a known trip-count of 0 2026-02-10 09:24:23 +00:00
nir_opt_memcpy.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_opt_move.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_move_discards_to_top.c nir: add a cmat call instruction type. 2025-11-17 23:33:58 +00:00
nir_opt_move_to_top.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_mqsad.c nir: convert nir_instr_worklist to init/fini semantics w/out allocation 2025-08-21 06:13:49 +00:00
nir_opt_non_uniform_access.c nir/opt_non_uniform_access: use new query flags 2026-02-16 12:59:36 +00:00
nir_opt_offsets.c brw/nir: enable constant offsets for global_constant_uniform_block_intel 2026-03-06 06:34:43 +00:00
nir_opt_peephole_select.c nir: add a new push_data_intel intrinsic 2026-01-09 14:19:46 +00:00
nir_opt_phi_precision.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_opt_phi_to_bool.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_preamble.c nir,aco: Clean up useless lowering of sbt_base_amd 2026-01-14 14:19:07 +00:00
nir_opt_ray_queries.c nir: Ignore ray query ranges that don't start with rq_initialize 2025-12-11 15:56:29 +00:00
nir_opt_reassociate.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_opt_reassociate_bfi.c nir/opt_reassociate_bfi: use alu_pass 2024-08-10 13:40:21 +00:00
nir_opt_rematerialize_compares.c treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
nir_opt_remove_phis.c nir/opt_remove_phis: recursively check loop header phis for triviality 2026-03-04 14:03:40 +00:00
nir_opt_shrink_stores.c nir/shrink_stores: Don't shrink stores to an invalid num_components. 2025-10-20 19:24:45 +00:00
nir_opt_shrink_vectors.c nir: add a new push_data_intel intrinsic 2026-01-09 14:19:46 +00:00
nir_opt_sink.c nir: don't sink alu that uses ballot(true) 2025-12-08 09:07:54 +00:00
nir_opt_undef.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_uniform_atomics.c nir/opt_uniform_atomics: use data helper 2026-02-19 14:47:11 +00:00
nir_opt_uniform_subgroup.c nir/lower_uniform_subgroup: use nan/inf preserve instead of exact for feq 2026-02-10 18:42:02 +00:00
nir_opt_uub.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_opt_varyings.c nir/opt_varyings: Support implementations that cannot compact 16-bits 2026-01-14 20:44:41 +00:00
nir_opt_vectorize.c nir: move exact bit to nir_fp_math_control 2026-01-07 09:40:57 +00:00
nir_opt_vectorize_io.c nir/opt_vectorize_io: Fix allow_holes option 2026-01-01 17:38:01 -06:00
nir_opt_vectorize_io_vars.c treewide: use UTIL_DYNARRAY_INIT 2025-11-04 13:39:48 +00:00
nir_passthrough_gs.c nir: don't use variables as ralloc parents, use the shader instead 2025-08-05 22:55:13 +00:00
nir_passthrough_tcs.c
nir_phi_builder.c nir/phi_builder: Adjust valid_metadata assert 2025-09-08 23:03:13 +00:00
nir_phi_builder.h nir: Don't include the full nir.h when not necessary. 2025-02-12 22:33:07 +01:00
nir_precompiled.h panfrost: fix cl_local_size for precompiled shaders 2025-10-22 00:15:49 +00:00
nir_print.c nir: Do NIR_DEBUG=print under a lock. 2026-03-06 19:50:38 +00:00
nir_propagate_invariant.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_range_analysis.c nir: remove nir_analyze_fp_range 2026-03-07 05:01:44 +00:00
nir_range_analysis.h nir: remove nir_analyze_fp_range 2026-03-07 05:01:44 +00:00
nir_recompute_io_bases.c nir/recompute_io_bases: Fix num_inputs with dual-slot VS inputs 2026-01-21 20:54:14 +00:00
nir_remove_dead_variables.c nir/remove_dead_variables: don't ralloc the set 2025-08-21 06:13:48 +00:00
nir_remove_outputs.c nir: Add new nir_remove_outputs pass 2026-01-01 21:25:42 -06:00
nir_remove_tex_shadow.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_repair_ssa.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_scale_fdiv.c nir/scale_fdiv: handle fp16 fdiv 2025-06-05 13:17:27 +00:00
nir_schedule.c nir: add a cmat call instruction type. 2025-11-17 23:33:58 +00:00
nir_schedule.h nir/schedule: return progress and fix metadata 2025-07-23 06:47:58 +00:00
nir_search.c nir: add nir_fp_analysis_state 2026-03-05 11:26:25 +00:00
nir_search.h nir: add nir_fp_analysis_state 2026-03-05 11:26:25 +00:00
nir_search_helpers.h nir: remove more fsat using range analysis 2026-03-07 05:01:45 +00:00
nir_separate_merged_clip_cull_io.c nir: add nir_separate_merged_clip_cull_io 2025-11-15 03:30:10 +00:00
nir_serialize.c nir/serialize: omit serializing fp_math_ctrl if it has to be 0 2026-02-19 15:21:27 +00:00
nir_serialize.h nir: Don't include the full nir.h when not necessary. 2025-02-12 22:33:07 +01:00
nir_shader_bisect.c nir: Add a shader bisect tool. 2025-10-09 17:56:30 +00:00
nir_shader_bisect.py nir/shader_bisect: Allow passing in a --lo / --hi to continue a run. 2025-12-04 22:47:25 +00:00
nir_shader_compiler_options.h nir: remove lower_vector_cmp 2026-03-04 19:50:28 +00:00
nir_softfloat.h nir, glsl: Add support for softfloat32 2025-11-14 19:31:17 +00:00
nir_split_64bit_vec3_and_vec4.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_split_conversions.c nak,nir: Generalize nak_nir_split_64bit_conversions and move it to NIR 2025-03-29 03:02:17 +00:00
nir_split_per_member_structs.c nir: clang-format 2025-02-26 15:19:53 +00:00
nir_split_var_copies.c treewide: use nir_def_as_* 2025-08-01 15:34:24 +00:00
nir_split_vars.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_stub.c nir: Only build nir headers for mediafoundation/d3d12-no-graphics paired build 2025-05-09 16:34:00 +00:00
nir_sweep.c nir/sweep: fix use-after-free with dominance LCA 2025-11-13 20:17:22 +00:00
nir_tcs_info.h nir/tcs_info: use range analysis to determine the range of tess levels 2025-05-28 06:46:56 +00:00
nir_to_lcssa.c nir: Handle nir_instr_type_cmat_call in more places 2026-02-20 13:09:45 -08:00
nir_trivialize_registers.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_unlower_io_to_vars.c nir: Move is is_compact() out of unlower_io_to_vars 2026-01-21 20:54:13 +00:00
nir_use_dominance.c build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
nir_validate.c nir: only set fp_math_ctrl if meaningful 2026-02-18 14:04:22 +00:00
nir_vla.h
nir_worklist.c treewide: add & use parent instr helpers 2025-11-12 21:22:13 +00:00
nir_worklist.h nir/search: add nir_search_state 2025-08-22 15:45:55 +00:00
nir_xfb_info.h nir/xfb_info: add new fields to describe 16-bit XFB better 2025-06-02 23:08:58 +00:00
README

New IR, or NIR, is an IR for Mesa intended to sit below GLSL IR and Mesa IR.
Its design inherits from the various IRs that Mesa has used in the past, as
well as Direct3D assembly, and it includes a few new ideas as well. It is a
flat (in terms of using instructions instead of expressions), typeless IR,
similar to TGSI and Mesa IR.  It also supports SSA (although it doesn't require
it).
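
As a rough illustration of what "flat" means here, the sketch below turns a
nested expression into a list of instructions whose operands are only names.
The tuple encoding, the function names, and the t0/t1 temporaries are
assumptions made for this example; they are not NIR's actual representation.

```python
from itertools import count

def flatten(expr, out, ids):
    """Flatten a nested (op, arg, ...) tuple into `out`; return the operand name."""
    if isinstance(expr, str):          # a leaf: an already-named value
        return expr
    op, *args = expr
    srcs = [flatten(a, out, ids) for a in args]
    dest = f"t{next(ids)}"
    out.append((dest, op, srcs))       # one instruction; operands are names only
    return dest

def flatten_expr(expr):
    """Return (instruction list, name of the final result)."""
    out = []
    result = flatten(expr, out, count())
    return out, result

# (a + b) * c written as an expression tree
instrs, result = flatten_expr(("fmul", ("fadd", "a", "b"), "c"))
for dest, op, srcs in instrs:
    print(f"{dest} = {op} {', '.join(srcs)}")
```

Each printed line is one instruction with no nested sub-expressions, which is
the property the paragraph above describes.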

Variables
=========

NIR includes support for source-level GLSL variables through a structure mostly
copied from GLSL IR. These will be used for linking and conversion from GLSL IR
(and later, from an AST), but for the most part, they will be lowered to
registers (see below) and loads/stores.

Registers
=========

Registers are light-weight; each consists of a structure that contains only its
size, its index for liveness analysis, and an optional name for debugging. In
addition, registers can be local to a function or global to the entire shader;
the latter will be used in ARB_shader_subroutine for passing parameters and
getting return values from subroutines. Registers can also be an array, in which
case they can be accessed indirectly. Each ALU instruction (add, subtract, etc.)
works directly with registers or SSA values (see below).
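
A minimal sketch of such a register record, assuming hypothetical field names
(num_components, index, name, is_global, num_array_elems) chosen for this
example rather than taken from the NIR headers:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Register:
    num_components: int          # the register's size
    index: int                   # dense index, usable as a liveness-bitset slot
    name: Optional[str] = None   # debugging aid only
    is_global: bool = False      # shader-wide rather than function-local
    num_array_elems: int = 0     # 0 means "not an array"

    def allows_indirect(self) -> bool:
        # Only array registers can be addressed indirectly.
        return self.num_array_elems > 0

class RegisterFile:
    """Hands out registers with consecutive indices."""
    def __init__(self):
        self.regs = []

    def create(self, num_components, **kwargs):
        reg = Register(num_components, len(self.regs), **kwargs)
        self.regs.append(reg)
        return reg
```

Keeping the index dense is one plausible way to let a liveness pass use a plain
bitset with one bit per register.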

SSA
=========

Everywhere a register can be loaded/stored, an SSA value can be used instead.
The only exception is that arrays/indirect addressing are not supported with
SSA; although research has been done on extensions of SSA to arrays before, it's
usually for the purpose of parallelization (which we're not interested in), and
adds some overhead in the form of adding copies or extra arrays (which is much
more expensive than introducing copies between non-array registers). Each SSA
use points directly to its corresponding definition, which in turn points to the
instruction it is part of. This creates an implicit use-def chain and avoids the
need for an external structure for each SSA register.
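
The implicit use-def chain can be sketched as follows. The class and attribute
names (Instr, SSADef, parent_instr) are assumptions for this illustration, and
each instruction is assumed to define exactly one value:

```python
class SSADef:
    def __init__(self, parent_instr):
        self.parent_instr = parent_instr   # definition -> defining instruction

class Instr:
    def __init__(self, op, srcs=()):
        self.op = op
        self.srcs = list(srcs)             # each source is a direct SSADef reference
        self.dest = SSADef(self)           # the single value this instruction defines

def producers(instr):
    """Follow use -> definition -> instruction without any side table."""
    return [src.parent_instr for src in instr.srcs]
```

Because every source already references its definition, a pass can reach the
producing instruction in two pointer hops.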

Functions
=========

Support for function calls is mostly similar to GLSL IR. Each shader contains a
list of functions, and each function has a list of overloads. Each overload
contains a list of parameters, and may contain an implementation which specifies
the variables that correspond to the parameters and return value. Inlining a
function, assuming it has a single return point, is as simple as copying its
instructions, registers, and local variables into the target function and then
inserting copies to and from the new parameters as appropriate. After functions
are inlined and any non-subroutine functions are deleted, parameters and return
variables will be converted to global variables and then global registers. We
don't do this lowering earlier (i.e. the fortranizer idea) for a few reasons:

- If we want to do optimizations before link time, we need to have the function
signature available during link time.

- If we do any inlining before link time, then we might wind up with the
inlined function and the non-inlined function using the same global
variables/registers, which would preclude optimization.
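
The inlining recipe described above (copy the body, then insert copies to and
from the parameters) might look like this for a callee with a single return
point. The dict layout, the (dest, op, srcs) tuples, and the renaming scheme
are assumptions for this sketch; it also assumes every source in the callee
body is a callee-local name.

```python
def inline_call(callee, args, result, suffix):
    """Return instructions that replace `result = call callee(args)`.

    callee: dict with 'params' (list of names), 'ret' (name) and
    'body' (list of (dest, op, srcs) tuples).
    """
    def rename(name):
        # Give each inlined copy its own set of locals.
        return f"{name}.{suffix}"

    out = []
    for param, arg in zip(callee["params"], args):
        out.append((rename(param), "mov", [arg]))             # copy arguments in
    for dest, op, srcs in callee["body"]:
        out.append((rename(dest), op, [rename(s) for s in srcs]))
    out.append((result, "mov", [rename(callee["ret"])]))      # copy result out
    return out
```

Using a distinct suffix per call site keeps two inlined copies of the same
function from sharing locals.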

Intrinsics
=========

Any operation (other than function calls and textures) which touches a variable
or is not referentially transparent is represented by an intrinsic. Intrinsics
are similar to the idea of a "builtin function," i.e. a function declaration
whose implementation is provided by the backend, except they are more powerful
in the following ways:

- They can also load and store registers when appropriate, which limits the
number of variables needed in later stages of the IR while obviating the need
for a separate load/store variable instruction.

- Intrinsics can be marked as side-effect free, which permits them to be
treated like any other instruction when it comes to optimizations. This allows
load intrinsics to be represented as intrinsics while still being optimized
away by dead code elimination, common subexpression elimination, etc.
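
To show why the side-effect-free marking matters, here is a small dead code
elimination sketch over straight-line code. The info table, its contents, and
the (dest, op, srcs) tuple encoding are hypothetical and only stand in for
whatever per-intrinsic metadata the IR keeps.

```python
# Hypothetical per-intrinsic metadata; ops missing from the table are
# treated as ordinary ALU instructions without side effects.
INTRINSIC_INFO = {
    "load_uniform": {"side_effect_free": True},
    "store_output": {"side_effect_free": False},
}

def has_side_effects(op):
    return not INTRINSIC_INFO.get(op, {"side_effect_free": True})["side_effect_free"]

def dce(instrs):
    """Drop instructions whose result is unused and that have no side effects."""
    live = set()
    kept = []
    for dest, op, srcs in reversed(instrs):
        if dest in live or has_side_effects(op):
            kept.append((dest, op, srcs))
            live.update(srcs)              # whatever a kept instruction reads is live
    kept.reverse()
    return kept
```

The unused load is removed by the same rule as an unused ALU instruction, while
the store is always kept.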

Intrinsics are used for:

- Atomic operations
- Memory barriers
- Subroutine calls
- Geometry shader emitVertex and endPrimitive
- Loading and storing variables (before lowering)
- Loading and storing uniforms, shader inputs and outputs, etc. (after lowering)
- Copying variables (cases where in GLSL the destination is a structure or
array)
- The kitchen sink
- ...

Textures
=========

Unfortunately, there are far too many texture operations to represent each one
of them with an intrinsic, so there's a special texture instruction similar to
the GLSL IR one. The biggest difference is that, while the texture instruction
has a sampler dereference field used just like in GLSL IR, this gets lowered to
a texture unit index (with a possible indirect offset) while the type
information of the original sampler is kept around for backends. Also, all the
non-constant sources are stored in a single array to make it easier for
optimization passes to iterate over all the sources.
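
A sketch of the single-source-array idea, with hypothetical class and field
names (TexInstr, TexSrc, texture_index, sampler_dim) that are not taken from
the NIR headers:

```python
from dataclasses import dataclass, field

@dataclass
class TexSrc:
    kind: str      # what the source means, e.g. "coord", "lod", "bias"
    value: str     # name of the value feeding this source

@dataclass
class TexInstr:
    op: str                    # which texture operation this is
    texture_index: int         # lowered from the sampler dereference
    sampler_dim: str           # sampler type information kept for backends
    srcs: list = field(default_factory=list)

    def src(self, kind):
        """Look up one source by kind; None if the instruction lacks it."""
        return next((s.value for s in self.srcs if s.kind == kind), None)

def rewrite_sources(instr, mapping):
    """A generic pass can visit every source with a single loop."""
    for s in instr.srcs:
        s.value = mapping.get(s.value, s.value)
```

With one array, a pass such as copy propagation does not need a separate case
per kind of texture source.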

Control Flow
=========

Like in GLSL IR, control flow consists of a tree of "control flow nodes", which
include if statements and loops, and jump instructions (break, continue, and
return). Unlike GLSL IR, though, the leaves of the tree aren't statements but
basic blocks. Each basic block also keeps track of its successors and
predecessors, and function implementations keep track of the beginning basic
block (the first basic block of the function) and the ending basic block (a fake
basic block that every return statement points to). Together, these elements
make up the control flow graph, in this case a redundant piece of information on
top of the control flow tree that will be used by almost all the optimizations.
There are helper functions to add and remove control flow nodes that also update
the control flow graph, and so usually it doesn't need to be touched by passes
that modify control flow nodes.
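
The relationship between the control flow tree and the control flow graph can
be sketched as below. Only if statements are modelled (loops and jumps are left
out), and all names (Block, If, link, build_if, leaf_blocks) are assumptions
for this example rather than the real helper functions.

```python
class Block:
    """A basic block: a leaf of the control flow tree and a node of the CFG."""
    def __init__(self, name):
        self.name = name
        self.successors = []
        self.predecessors = []

class If:
    """An if node whose branches are lists of control flow nodes."""
    def __init__(self, then_list, else_list):
        self.then_list = then_list
        self.else_list = else_list

def link(pred, succ):
    pred.successors.append(succ)
    succ.predecessors.append(pred)

def build_if(before, then_block, else_block, after):
    """Build the tree for `before; if {then} else {else}; after` and add the
    matching CFG edges, so the caller never edits the graph directly."""
    for branch in (then_block, else_block):
        link(before, branch)
        link(branch, after)
    return [before, If([then_block], [else_block]), after]

def leaf_blocks(cf_list):
    """Walk the tree in order; its leaves are basic blocks."""
    for node in cf_list:
        if isinstance(node, Block):
            yield node
        else:
            yield from leaf_blocks(node.then_list)
            yield from leaf_blocks(node.else_list)
```

The graph edges here are redundant with the tree, as the text notes, but they
let an analysis ask for a block's predecessors without walking the tree.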