mesa/src/compiler/nir
Alyssa Rosenzweig 7229bffcb1 nir: Add intrinsics for register access
Note the writemask handling is chosen for consistency with the rest of NIR. In
every other instance, writemask=w requires a vec4 source. This is hardcoded into
nir_validate and nir_print as what it means to have a writemask.

More importantly, consistency with how register writemasks currently work.
nir_print hides it, but r0.w = fneg ssa_1.x is actually a vec4 instruction with
source ssa_1.xxxx. As a silly example nir_dest_num_components(that) = 4 in the
old model. I realize this is quite strange coming from a scalar ISA, but it's
perfectly natural for the class of vec4 hardware for which this was designed. In
that hardware, conceptually all instructions are vec4`, so the sequence "fneg
ssa_1 and write to channel w" is implemented as "fneg a vec4 with ssa_1.x in the
last component and write that vec4 out but mask to write only the w channel".

Isn't this inefficient? It can be. To save power, Midgard has scalar ALUs in
addition to vec4 ALUs. Those details are confined to the backend VLIW scheduler;
the instruction selection is still done as vec4. This mechanism has little in
common with AMD's SALUs. Midgard has a wave size of 1, with special hacks for
derivatives.

As a result, all backends consuming register writemasks are expecting this
pattern of code. Changing the store to take a vec1 instead of a vec4 would
require changing every backend to reswizzle the sources to resurrect the vec4. I
started typing a branch to do this yesterday, but it made a mess of both Midgard
and nir-to-tgsi. Without any good reason to think it'd actually help
performance, I abandoned the idea. Getting all 15 backends converted to the
helpers is enough of a challenge without forcing 10 backends to reswizzle their
sources too.

Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>
Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23089>
2023-07-12 01:34:26 +00:00
..
tests nir: use imm-helpers 2023-06-29 07:08:19 +00:00
meson.build nir: Add nir_lower_robust_access pass 2023-06-29 22:36:50 +00:00
nir.c nir: use nir_intrinsic_get_var 2023-07-10 16:06:40 +02:00
nir.h nir: Add intrinsics for register access 2023-07-12 01:34:26 +00:00
nir_algebraic.py nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_builder.c nir: Remove nir_builder_init, it's not used anymore 2023-07-10 19:20:18 +00:00
nir_builder.h nir: Add intrinsics for register access 2023-07-12 01:34:26 +00:00
nir_builder_opcodes_h.py nir: Add intrinsics for register access 2023-07-12 01:34:26 +00:00
nir_builtin_builder.c nir: Add and use nir_tex_src_ssa 2023-06-06 18:52:24 +00:00
nir_builtin_builder.h nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_clone.c treewide: Switch to use nir_foreach_function_with_impl when possible 2023-06-29 08:36:03 +00:00
nir_constant_expressions.h
nir_constant_expressions.py nir: Drop a bunch of Authors tags 2023-03-26 00:16:25 +00:00
nir_control_flow.c nir: Add undef phi srcs when adding successors 2023-05-26 18:31:30 +00:00
nir_control_flow.h nir: create nir_push_continue() and related helpers 2023-02-21 10:41:11 +00:00
nir_control_flow_private.h
nir_conversion_builder.h nir: use generated immediate comparison helpers 2023-06-05 13:40:08 +00:00
nir_deref.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_deref.h
nir_divergence_analysis.c nir: Add pixel_coord, frag_coord_zw intrinsics 2023-06-27 14:38:21 +00:00
nir_dominance.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_format_convert.h nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_from_ssa.c nir: Rename load/store_reg -> load/store_register 2023-06-30 18:19:51 -04:00
nir_gather_info.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_gather_ssa_types.c nir: Extract logic to get dest and srcs types from intrinsic 2023-06-28 20:17:18 +00:00
nir_gather_xfb_info.c nir: remove an obsolete comment from nir_gather_xfb_info_from_intrinsics 2023-04-19 21:42:11 +00:00
nir_group_loads.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_gs_count_vertices.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_inline_functions.c nir: Update the comment to call nir_remove_non_entrypoints directly 2023-07-03 21:45:35 +00:00
nir_inline_helpers.h
nir_inline_uniforms.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_instr_set.c nir: Do not consider phis with incompatible dests equal 2022-12-11 22:13:32 +00:00
nir_instr_set.h
nir_intrinsics.py nir: Add intrinsics for register access 2023-07-12 01:34:26 +00:00
nir_intrinsics_c.py
nir_intrinsics_h.py
nir_intrinsics_indices_h.py
nir_linking_helpers.c nir: use nir_intrinsic_get_var 2023-07-10 16:06:40 +02:00
nir_liveness.c nir: Use nir_foreach_phi(_safe) 2023-05-12 14:02:23 +00:00
nir_loop_analyze.c nir: Combine if_uses with instruction uses 2023-04-07 23:48:03 +00:00
nir_loop_analyze.h nir: update nir_is_supported_terminator_condition() 2022-09-08 01:01:14 +00:00
nir_lower_alpha_test.c nir: use nir_intrinsic_get_var 2023-07-10 16:06:40 +02:00
nir_lower_alu.c nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_lower_alu_width.c aco,ac/llvm,ac/nir,vtn: unify cube opcodes 2023-06-30 15:35:03 +00:00
nir_lower_amul.c nir/lower_amul: make use nir_shader_clear_pass_flags(..) 2023-06-29 19:13:19 +00:00
nir_lower_array_deref_of_vec.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_atomics_to_ssbo.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_bit_size.c nir/lower_bit_size: mask bitz/bitnz src1 like shifts 2023-06-29 13:39:30 +00:00
nir_lower_bitmap.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_blend.c nir/lower_blend: Optimize masked out RTs 2023-06-27 14:38:21 +00:00
nir_lower_blend.h nir/lower_blend: Consume dual stores 2023-02-26 17:35:08 -05:00
nir_lower_bool_to_bitsize.c nir: Remove 2nd argument from nir_before_src 2023-04-07 23:48:03 +00:00
nir_lower_bool_to_float.c nir: Eliminate nir_op_f2b 2023-02-03 22:39:57 +00:00
nir_lower_bool_to_int32.c nir/lower_bool_to_int32: Fix progress reporting 2023-06-26 08:22:03 -04:00
nir_lower_cl_images.c nir: Use nir_builder_create 2023-06-27 18:13:02 +00:00
nir_lower_clamp_color_outputs.c nir: use nir_intrinsic_get_var 2023-07-10 16:06:40 +02:00
nir_lower_clip.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_clip_cull_distance_arrays.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_clip_disable.c nir: use generated immediate comparison helpers 2023-06-05 13:40:08 +00:00
nir_lower_clip_halfz.c nir: use nir_shader_instructions_pass in nir_lower_clip_halfz 2022-09-26 11:13:03 +00:00
nir_lower_const_arrays_to_uniforms.c nir: Use nir_builder_create 2023-06-27 18:13:02 +00:00
nir_lower_continue_constructs.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_convert_alu_types.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_discard_if.c nir: Make nir_lower_discard_if() handle demotes and terminates, too. 2022-08-31 18:26:19 +00:00
nir_lower_discard_or_demote.c
nir_lower_double_ops.c nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_lower_drawpixels.c treewide: Use nir_trim_vector more 2023-06-06 18:52:25 +00:00
nir_lower_fb_read.c treewide: Use nir_tex_src_for_ssa 2023-06-06 18:52:25 +00:00
nir_lower_flatshade.c
nir_lower_flrp.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_fp16_conv.c nir: isub -> iadd_imm 2023-06-15 13:34:48 +00:00
nir_lower_frag_coord_to_pixel_coord.c nir: Add lower_frag_coord_to_pixel_coord pass 2023-06-27 14:38:21 +00:00
nir_lower_fragcolor.c nir: use nir_intrinsic_get_var 2023-07-10 16:06:40 +02:00
nir_lower_fragcoord_wtrans.c nir_lower_fragcoord_wtrans: Support Vulkan shaders 2023-01-10 04:25:26 +00:00
nir_lower_frexp.c nir: Drop a bunch of Authors tags 2023-03-26 00:16:25 +00:00
nir_lower_global_vars_to_local.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_goto_ifs.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_gs_intrinsics.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_helper_writes.c nir: Drop legacy atomics in simple cases 2023-05-16 22:36:21 +00:00
nir_lower_idiv.c nir: use generated immediate comparison helpers 2023-06-05 13:40:08 +00:00
nir_lower_image.c nir: Drop unused name from nir_ssa_dest_init 2023-05-17 23:46:16 +00:00
nir_lower_image_atomics_to_global.c nir: Add pass to lower image atomics 2023-05-22 14:33:13 +00:00
nir_lower_indirect_derefs.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_input_attachments.c nir: Add and use nir_tex_src_ssa 2023-06-06 18:52:24 +00:00
nir_lower_int64.c nir: split nir_lower_mov64 2023-07-03 10:38:27 +00:00
nir_lower_int_to_float.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_interpolation.c nir: use nir_shader_instructions_pass in nir_lower_interpolation 2022-09-26 11:13:03 +00:00
nir_lower_io.c nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_lower_io_arrays_to_elements.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_io_to_scalar.c nir: Drop unused name from nir_ssa_dest_init 2023-05-17 23:46:16 +00:00
nir_lower_io_to_temporaries.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_io_to_vector.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_is_helper_invocation.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_load_const_to_scalar.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_locals_to_regs.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_mediump.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_mem_access_bit_sizes.c nir: Use nir_ instead of nir_build_ helpers 2023-06-27 17:37:54 +00:00
nir_lower_memcpy.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_memory_model.c nir: Drop legacy atomics in simple cases 2023-05-16 22:36:21 +00:00
nir_lower_multiview.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_non_uniform_access.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_packing.c
nir_lower_passthrough_edgeflags.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_patch_vertices.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_phis_to_scalar.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_pntc_ytransform.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_point_size.c nir/lower_point_size: Use shader_instructions_pass 2023-03-11 16:42:36 +00:00
nir_lower_point_size_mov.c treewide: Use nir_builder_create more 2023-06-27 18:13:02 +00:00
nir_lower_point_smooth.c nir: Use nir_ instead of nir_build_ helpers 2023-06-27 17:37:54 +00:00
nir_lower_poly_line_smooth.c nir: lower smooth lines conditionally using the new intrinsic 2023-05-22 07:58:34 +00:00
nir_lower_printf.c nir: use generated immediate comparison helpers 2023-06-05 13:40:08 +00:00
nir_lower_readonly_images_to_tex.c nir: Add and use nir_tex_src_ssa 2023-06-06 18:52:24 +00:00
nir_lower_regs_to_ssa.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_returns.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_robust_access.c nir: Add nir_lower_robust_access pass 2023-06-29 22:36:50 +00:00
nir_lower_samplers.c nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_lower_scratch.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_shader_calls.c nir/lower_shader_calls: Remat derefs after shader calls 2023-07-11 17:32:55 +00:00
nir_lower_single_sampled.c nir: Add a pass for lowering shaders to single-sampled 2022-07-13 20:28:42 +00:00
nir_lower_ssbo.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_subgroups.c nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_lower_system_values.c nir: add cheap shortcut for wg id to wg idx lowering 2023-07-04 09:15:08 +00:00
nir_lower_sysvals_to_varyings.c
nir_lower_task_shader.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_tex.c nir: add options to lower y_vu, yv_yu, yx_xvxu and xy_vxux 2023-07-10 16:29:13 +00:00
nir_lower_tex_shadow.c nir: Drop unused name from nir_ssa_dest_init 2023-05-17 23:46:16 +00:00
nir_lower_texcoord_replace.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_texcoord_replace_late.c nir: Add a late texcoord replacement pass 2023-02-03 15:03:06 +00:00
nir_lower_to_source_mods.c nir: Remove integer and 64-bit modifiers 2023-06-22 19:55:49 +00:00
nir_lower_two_sided_color.c nir: Add helpers for lazy var creation. 2023-05-16 18:57:28 +00:00
nir_lower_ubo_vec4.c nir_lower_ubo_vec4: Delete an invalid assert 2023-06-13 00:43:36 +00:00
nir_lower_undef_to_zero.c
nir_lower_uniforms_to_ubo.c nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_lower_var_copies.c nir/nir_lower_var_copies: Use the nir_shader_instructions_pass() helper 2023-04-22 23:35:37 +00:00
nir_lower_variable_initializers.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_lower_vars_to_ssa.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_vec3_to_vec4.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_lower_vec_to_movs.c nir: Combine if_uses with instruction uses 2023-04-07 23:48:03 +00:00
nir_lower_viewport_transform.c treewide: Use nir_trim_vector more 2023-06-06 18:52:25 +00:00
nir_lower_wpos_center.c nir/nir_lower_wpos_center: Use the nir_shader_instructions_pass() helper 2023-04-22 23:35:36 +00:00
nir_lower_wpos_ytransform.c nir: use generated immediate comparison helpers 2023-06-05 13:40:08 +00:00
nir_lower_wrmasks.c
nir_metadata.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_mod_analysis.c nir: add nir_mod_analysis & its tests 2023-01-31 13:50:08 +00:00
nir_move_vec_src_uses_to_dest.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_normalize_cubemap_coords.c nir: Drop a bunch of Authors tags 2023-03-26 00:16:25 +00:00
nir_opcodes.py nir: Add b32fcsel_mdg opcode for Midgard 2023-06-30 16:29:35 -04:00
nir_opcodes_c.py nir: Eliminate nir_op_f2b 2023-02-03 22:39:57 +00:00
nir_opcodes_h.py
nir_opt_access.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_algebraic.py nir/opt_algebraic: combine bitz/bitnz 2023-06-29 13:39:30 +00:00
nir_opt_barriers.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_combine_stores.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_comparison_pre.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_conditional_discard.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_constant_folding.c nir: Add is_null_constant to nir_constant 2023-06-13 00:43:36 +00:00
nir_opt_copy_prop_vars.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_copy_propagate.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_opt_cse.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_dce.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_dead_cf.c nir/opt_dead_cf: Clarify comment 2023-07-11 17:32:55 +00:00
nir_opt_dead_write_vars.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_find_array_copies.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_fragdepth.c nir: extend nir_opt_fragdepth to handle lowered IO 2023-04-19 21:42:11 +00:00
nir_opt_gcm.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_idiv_const.c nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_opt_if.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_intrinsics.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_large_constants.c nir: Use nir_builder_create 2023-06-27 18:13:02 +00:00
nir_opt_load_store_vectorize.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_opt_loop_unroll.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_memcpy.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_move.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_move_discards_to_top.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_non_uniform_access.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_offsets.c nir/nir_opt_offsets: Prevent offsets going above max 2022-12-02 15:04:52 +00:00
nir_opt_peephole_select.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_opt_phi_precision.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_preamble.c treewide: Remove all usage of nir_builder_init with nir_builder_create and nir_builder_at 2023-07-10 19:20:17 +00:00
nir_opt_ray_queries.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_reassociate_bfi.c nir: Add optimization pass to reassociate some bfi instructions 2023-06-14 18:49:53 +00:00
nir_opt_rematerialize_compares.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_remove_phis.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_shrink_stores.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_shrink_vectors.c nir/opt_shrink_vectors: enable sparse intrinsics shrinking 2023-07-06 13:16:13 +00:00
nir_opt_sink.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_trivial_continues.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_undef.c nir/opt_undef: add a pass to clean up 64bit undefs 2022-09-27 18:38:25 +00:00
nir_opt_uniform_atomics.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_opt_vectorize.c nir: Use nir_builder_at 2023-07-03 15:21:37 +00:00
nir_passthrough_gs.c nir: use generated immediate comparison helpers 2023-06-05 13:40:08 +00:00
nir_passthrough_tcs.c nir: Add helpers for lazy var creation. 2023-05-16 18:57:28 +00:00
nir_phi_builder.c nir: Drop unused name from nir_ssa_dest_init 2023-05-17 23:46:16 +00:00
nir_phi_builder.h
nir_print.c nir/print: Reformat the preds/succs block information 2023-07-03 22:18:07 +00:00
nir_propagate_invariant.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_range_analysis.c nir: Fix use of alloca() without #include c99_alloca.h 2023-03-29 16:56:42 +00:00
nir_range_analysis.h util: reinstate ENUM_PACKED 2023-06-21 21:51:59 +00:00
nir_remove_dead_variables.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_remove_tex_shadow.c nir: Propagate the type sampler type change to the used variable. 2023-05-24 07:48:18 +00:00
nir_repair_ssa.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_scale_fdiv.c nir: use imm-helpers 2023-06-29 07:08:19 +00:00
nir_schedule.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_schedule.h nir/schedule: allow drivers to decide about instruction latency 2022-03-09 15:53:04 +00:00
nir_search.c nir: Use nir_builder_create 2023-06-27 18:13:02 +00:00
nir_search.h util: reinstate ENUM_PACKED 2023-06-21 21:51:59 +00:00
nir_search_helpers.h nir/algebraic: Simplify various trivial bfi 2023-06-14 18:49:53 +00:00
nir_serialize.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_serialize.h
nir_split_64bit_vec3_and_vec4.c nir/split_64bit_vec3_and_vec4: Use the right number of components 2023-06-29 10:59:57 +00:00
nir_split_per_member_structs.c nir: use nir_shader_instructions_pass in nir_split_per_member_structs 2022-09-26 11:13:03 +00:00
nir_split_var_copies.c nir: Drop a bunch of Authors tags 2023-03-26 00:16:25 +00:00
nir_split_vars.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_sweep.c nir: add assertions that loops don't have a Continue Construct 2023-02-21 10:41:11 +00:00
nir_to_lcssa.c nir: Convert to nir_foreach_function_impl 2023-06-27 22:44:04 +00:00
nir_validate.c nir: Add intrinsics for register access 2023-07-12 01:34:26 +00:00
nir_vla.h
nir_worklist.c nir: Drop a bunch of Authors tags 2023-03-26 00:16:25 +00:00
nir_worklist.h nir: Drop a bunch of Authors tags 2023-03-26 00:16:25 +00:00
nir_xfb_info.h nir/xfb_info: nir_gather_xfb_info_from_intrinsics update nir xfb_info 2023-01-18 05:30:14 +00:00
README

New IR, or NIR, is an IR for Mesa intended to sit below GLSL IR and Mesa IR.
Its design inherits from the various IRs that Mesa has used in the past, as
well as Direct3D assembly, and it includes a few new ideas as well. It is a
flat (in terms of using instructions instead of expressions), typeless IR,
similar to TGSI and Mesa IR.  It also supports SSA (although it doesn't require
it).

Variables
=========

NIR includes support for source-level GLSL variables through a structure mostly
copied from GLSL IR. These will be used for linking and conversion from GLSL IR
(and later, from an AST), but for the most part, they will be lowered to
registers (see below) and loads/stores.

Registers
=========

Registers are light-weight; they consist of a structure that only contains its
size, its index for liveness analysis, and an optional name for debugging. In
addition, registers can be local to a function or global to the entire shader;
the latter will be used in ARB_shader_subroutine for passing parameters and
getting return values from subroutines. Registers can also be an array, in which
case they can be accessed indirectly. Each ALU instruction (add, subtract, etc.)
works directly with registers or SSA values (see below).

SSA
========

Everywhere a register can be loaded/stored, an SSA value can be used instead.
The only exception is that arrays/indirect addressing are not supported with
SSA; although research has been done on extensions of SSA to arrays before, it's
usually for the purpose of parallelization (which we're not interested in), and
adds some overhead in the form of adding copies or extra arrays (which is much
more expensive than introducing copies between non-array registers). SSA uses
point directly to their corresponding definition, which in turn points to the
instruction it is part of. This creates an implicit use-def chain and avoids the
need for an external structure for each SSA register.

Functions
=========

Support for function calls is mostly similar to GLSL IR. Each shader contains a
list of functions, and each function has a list of overloads. Each overload
contains a list of parameters, and may contain an implementation which specifies
the variables that correspond to the parameters and return value. Inlining a
function, assuming it has a single return point, is as simple as copying its
instructions, registers, and local variables into the target function and then
inserting copies to and from the new parameters as appropriate. After functions
are inlined and any non-subroutine functions are deleted, parameters and return
variables will be converted to global variables and then global registers. We
don't do this lowering earlier (i.e. the fortranizer idea) for a few reasons:

- If we want to do optimizations before link time, we need to have the function
signature available during link-time.

- If we do any inlining before link time, then we might wind up with the
inlined function and the non-inlined function using the same global
variables/registers which would preclude optimization.

Intrinsics
=========

Any operation (other than function calls and textures) which touches a variable
or is not referentially transparent is represented by an intrinsic. Intrinsics
are similar to the idea of a "builtin function," i.e. a function declaration
whose implementation is provided by the backend, except they are more powerful
in the following ways:

- They can also load and store registers when appropriate, which limits the
number of variables needed in later stages of the IR while obviating the need
for a separate load/store variable instruction.

- Intrinsics can be marked as side-effect free, which permits them to be
treated like any other instruction when it comes to optimizations. This allows
load intrinsics to be represented as intrinsics while still being optimized
away by dead code elimination, common subexpression elimination, etc.

Intrinsics are used for:

- Atomic operations
- Memory barriers
- Subroutine calls
- Geometry shader emitVertex and endPrimitive
- Loading and storing variables (before lowering)
- Loading and storing uniforms, shader inputs and outputs, etc (after lowering)
- Copying variables (cases where in GLSL the destination is a structure or
array)
- The kitchen sink
- ...

Textures
=========

Unfortunately, there are far too many texture operations to represent each one
of them with an intrinsic, so there's a special texture instruction similar to
the GLSL IR one. The biggest difference is that, while the texture instruction
has a sampler dereference field used just like in GLSL IR, this gets lowered to
a texture unit index (with a possible indirect offset) while the type
information of the original sampler is kept around for backends. Also, all the
non-constant sources are stored in a single array to make it easier for
optimization passes to iterate over all the sources.

Control Flow
=========

Like in GLSL IR, control flow consists of a tree of "control flow nodes", which
include if statements and loops, and jump instructions (break, continue, and
return). Unlike GLSL IR, though, the leaves of the tree aren't statements but
basic blocks. Each basic block also keeps track of its successors and
predecessors, and function implementations keep track of the beginning basic
block (the first basic block of the function) and the ending basic block (a fake
basic block that every return statement points to). Together, these elements
make up the control flow graph, in this case a redundant piece of information on
top of the control flow tree that will be used by almost all the optimizations.
There are helper functions to add and remove control flow nodes that also update
the control flow graph, and so usually it doesn't need to be touched by passes
that modify control flow nodes.