fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 14:08:07 +02:00

Author	SHA1	Message	Date
Caio Oliveira	ea44879d2d	nir/print: Use symbols % for SSA and @ for intrinsic The variable uniquifying now uses # instead of @. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23564>	2023-07-03 22:18:06 +00:00
Yonggang Luo	c4d3bc03c4	nir: Add nir_foreach_function_safe and use it Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23902>	2023-07-03 21:45:35 +00:00
Yonggang Luo	1238a65251	nir: Update the comment to call nir_remove_non_entrypoints directly Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23902>	2023-07-03 21:45:35 +00:00
Konstantin Seurer	82aaf1893d	nir/builder_opcodes: Do not generate empty intrinsic indices Gets rid of all the struct nir_*_indices { int _; /* exists to avoid empty initializers */ }; declarations. 14293 loc -> 12900 loc Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23906>	2023-07-03 21:12:45 +00:00
Konstantin Seurer	e379b9ad8c	nir/opt_dead_cf: Handle if statements ending in a jump correctly If a then/else block ends in a jump, the phi nodes do not necessarily have to reference the always taken branch because they are dead code. Avoid crashing in this case by only rewriting phis, if the block does not end in a jump. cc: mesa-stable Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23150>	2023-07-03 20:36:51 +00:00
Konstantin Seurer	574079e354	nir: Use nir_builder_at Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23883>	2023-07-03 15:21:37 +00:00
Konstantin Seurer	a7cd206937	nir: Add nir_builder_at Creates and returns a nir_builder from a cursor. The nir_function_impl is retrieved using said cursor. This should be fine as long as it is not used on extracted control flow. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23883>	2023-07-03 15:21:37 +00:00
Rhys Perry	3d0e997e99	nir: split nir_lower_mov64 ACO will want to lower the conversions, but preserve the bcsels. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23926>	2023-07-03 10:38:27 +00:00
Yonggang Luo	21a0ca7ce5	nir: Strip the const modifier on nir_function * in nir_foreach_function_with_impl The function iterator should be modifiable inside this foreach loop, and later patches need this. Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23960>	2023-07-01 17:39:28 +08:00
Alyssa Rosenzweig	7e42fdac6b	nir: Rename nir_reg_{src,dest} -> nir_register_{src,dest} This frees up the shorter names for the intrinsic-based versions that will replace them. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23956>	2023-06-30 18:20:48 -04:00
Alyssa Rosenzweig	bed2f3f8e6	nir: Rename load/store_reg -> load/store_register This frees up the shorter names for the new register-based intrinsics. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23956>	2023-06-30 18:19:51 -04:00
Alyssa Rosenzweig	d1f6bcd1d0	nir: Add b32fcsel_mdg opcode for Midgard Midgard has both int and float version of b32csel. The backend needs some way to pick between the two, and it's a lot more convenient to choose in NIR before going out-of-SSA than in the backend. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23769>	2023-06-30 16:29:35 -04:00
Rhys Perry	25c49e491f	aco,ac/llvm,ac/nir,vtn: unify cube opcodes fossil-db (navi21): Totals from 17068 (12.79% of 133461) affected shaders: Instrs: 24743703 -> 24743572 (-0.00%); split: -0.00%, +0.00% CodeSize: 132579952 -> 132580620 (+0.00%); split: -0.00%, +0.00% VGPRs: 1227840 -> 1227984 (+0.01%) Latency: 403180114 -> 403251188 (+0.02%); split: -0.00%, +0.02% InvThroughput: 75311302 -> 75320892 (+0.01%); split: -0.00%, +0.01% VClause: 415400 -> 415402 (+0.00%); split: -0.00%, +0.00% Copies: 1715404 -> 1715258 (-0.01%); split: -0.01%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> (r600) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23930>	2023-06-30 15:35:03 +00:00
Alyssa Rosenzweig	f0fb8d05e3	nir: Add nir_lower_robust_access pass Add a pass for bounds checking UBOs, SSBOs, and images to implement robustness. This pass is based on v3d_nir_lower_robust_access.c, with significant modifications to be appropriate for common code. Notably: * v3d-isms are removed. * Stop generating invalid imageSize() instructions for cube maps, this blows up nir_validate with asahi's lowerings. * Logic to wrap an intrinsic in an if-statement is extracted in anticipation of future robustness2 support that will reuse that code path for buffers. * Misc cleanups to follow modern NIR best practice. This pass is noticeably shorter than the original v3d version. For future support of robustness2, I envision the booleans turning into tristate enums. There's a few more knobs added for Asahi's benefit. Apple hardware can do imageLoad and imageStore to non-buffer images (only). There is no support for image atomics. To handle, Asahi implements software lowering for buffer images and for image atomics. While the hardware is robust, the software paths are not. So we would like to use this pass to lower robustness for the software paths but not the hardware paths. Or maybe we want a filter callback? Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23895>	2023-06-29 22:36:50 +00:00
Christian Gmeiner	36b0cff774	nir/lower_amul: make use nir_shader_clear_pass_flags(..) Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23931>	2023-06-29 19:13:19 +00:00
Christian Gmeiner	fada46cf99	nir: add helper to clear all pass_flags Will be used in different places so lets move it to a common place. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23931>	2023-06-29 19:13:19 +00:00
Alyssa Rosenzweig	e81b5b972e	nir/validate: Assert txf(_ms) matches dimension We can't txf_ms on non-MS images and we can't txf on MS images. This would have caught a regression on Asahi. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23892>	2023-06-29 14:17:30 +00:00
Georg Lehmann	44d0b785cc	nir/opt_algebraic: combine bitz/bitnz Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23298>	2023-06-29 13:39:30 +00:00
Georg Lehmann	6585209cdd	nir/lower_bit_size: mask bitz/bitnz src1 like shifts Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23298>	2023-06-29 13:39:30 +00:00
Georg Lehmann	481a34e82e	nir: add single bit test opcodes These directly map to amd's SALU s_bitcmp0/1. For VALU we can use v_cmp_class_f32 if the second source is constant. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23298>	2023-06-29 13:39:30 +00:00
Corentin Noël	a8d669b593	nir/split_64bit_vec3_and_vec4: Use the right number of components Always make sure to correctly deref and store a 64bits variable from the right number of components. This fixes the `spec@arb_enhanced_layouts@matching_fp64_types_` piglit tests for virgl. Corrects this validation issue: ``` decl_var INTERP_MODE_FLAT dvec2[] var_7@2 decl_var INTERP_MODE_FLAT dvec2[] var_7@3 ... vec1 32 ssa_302 = deref_var &var_7@2 (function_temp dvec2[]) vec1 32 ssa_303 = deref_var &var_7@3 (function_temp dvec2[]) vec1 32 ssa_304 = deref_array &(ssa_302)[ssa_301] (function_temp dvec2) /* &var_7@2[ssa_301] */ vec1 32 ssa_305 = deref_array &(ssa_303)[ssa_301] (function_temp dvec2) /* &var_7@3[ssa_301] */ vec1 64 ssa_306 = mov ssa_110.z intrinsic store_deref (ssa_305, ssa_306) (wrmask=x, access=0) error: instr->num_components == glsl_get_vector_elements(dst->type) (../src/compiler/nir/nir_validate.c:632) vec4 64 ssa_111 = vec4 ssa_14, ssa_13, ssa_12, ssa_109 vec1 32 ssa_307 = load_const (0x00000000 = 0.000000) vec1 32 ssa_308 = iadd ssa_307, ssa_61 vec1 32 ssa_309 = deref_var &var_7@2 (function_temp dvec2[]) vec1 32 ssa_310 = deref_var &var_7@3 (function_temp dvec2[]) vec1 32 ssa_311 = deref_array &(ssa_309)[ssa_308] (function_temp dvec2) /* &var_7@2[ssa_308] */ vec1 32 ssa_312 = deref_array &(ssa_310)[ssa_308] (function_temp dvec2) /* &var_7@3[ssa_308] */ vec1 64 ssa_313 = mov ssa_111.w intrinsic store_deref (ssa_312, ssa_313) (wrmask=, access=0) error: (nir_intrinsic_write_mask(instr) & ~component_mask) == 0 (../src/compiler/nir/nir_validate.c:803) ``` Fixes: `496fd59d71` (add pass to split 64 bit vec3/4 variable access) Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23880>	2023-06-29 10:59:57 +00:00
Yonggang Luo	62ce223245	treewide: Switch to use nir_foreach_function_with_impl when possible Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23903>	2023-06-29 08:36:03 +00:00
Yonggang Luo	fde6b51749	nir: Split macro nir_foreach_function_with_impl out of nir_foreach_function_impl This macro nir_foreach_function_with_impl can be used when func and func->impl are both accessed in foreach loop Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23903>	2023-06-29 08:36:03 +00:00
Erik Faye-Lund	afa79cd9b8	nir: use imm-helpers We have to use 1ull instead of 1u because MSVC is stupid... Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23855>	2023-06-29 07:08:19 +00:00
Caio Oliveira	f4c2025e2c	nir/print: Print more representations in load_const In addition to the hexadecimal and float (when applicable), print the signed and unsigned representations. Representations may be omitted based on information about the value: - If gather types has unambiguous information, we use it; - Float is omitted for 8 bit values; - Signed decimal is omitted for positive values; - Unsigned decimal is omitted for small values (representation is same as hex); Note for now the "terse form" that appears in SSA uses is unchanged. Based on a patch by Mike Blumenkrantz. Examples: ``` // Just used as float. Omitted decimals. vec4 32 ssa_81 = load_const (0x3f800000, 0x3f800000, 0x3e4ccccd, 0x3f800000) = (1.000000, 1.000000, 0.200000, 1.000000) vec1 32 ssa_28 = load_const (0x3e4ccccd = 0.200000) // Just a small integer. Omitted float and decimal. vec1 32 ssa_45 = load_const (0x00000001) // Larger positive integers. Omitted float. vec1 32 ssa_39 = load_const (0x00002000 = 8192) vec1 32 ssa_30 = load_const (0x000000ff = 255) vec1 32 ssa_28 = load_const (0x00000010 = 16) // Integers with negative values. load_const (0xff = -1 = 255) load_const (0xff80 = -128 = 65408) load_const (0xffff = -1 = 65535) // Same value, in the first case we know is used as an integer. load_const (0xffffffe0 = -32 = 4294967264) load_const (0xffffffe0 = -nan = -32 = 4294967264) ``` Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	a185736a42	nir/print: Use src_type when printing consts in SSA uses If the src_type is not available, untie by looking at the results from nir_gather_ssa_types(). If that is ambiguous, just pick uint. Now in print_const_from_load() when the type is invalid, print the full constant form (with both padded hex and float); when the passed type is valid, print the terse form based on it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	5d15f4ef28	nir: Extract logic to get dest and srcs types from intrinsic Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	7de530d3df	nir: Make a const-friendly way to get the offset_src and arrayed_io_src from intrinsic The existing helper returns a `nir_src *` so expects a non-const instr. We plan to use this function in queries that don't modify the shader, so create (and use internally) a variant that returns the index instead. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	8f64415af7	nir/print: Make NIR_DEBUG=print_consts behavior the default Now there's a NIR_DEBUG=print_no_inline_consts to omit them. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	260a9167db	nir/print: Improve NIR_PRINT=print_consts by using nir_gather_ssa_types() The two representations are always used for `load_const`, but when inlining the value as SSA source, use just a single terse representation. The choice between integer or float is based on the result of nir_gather_ssa_types(), with a bias for integer when in doubt. Also remove extra comment `/* */` syntax since the value is already enclosed by parenthesis. --- For illustration, here's some instructions from crucible test func.shader.averageRounded.uint64_t with NIR_DEBUG=print_consts: BEFORE: ``` vec1 32 con ssa_23 = load_const (0xfffffffc = -nan) vec1 32 div ssa_24 = iand ssa_13, ssa_23 /*(0xfffffffc = -nan)*/ vec1 32 con ssa_25 = load_const (0x00000024 = 0.000000) vec1 32 con ssa_26 = intrinsic load_ubo (ssa_1 /*(0x00000002 = 0.000000)*/, ssa_25 /*(0x00000024 = 0.000000)*/) (access=0, align_mul=1073741824, align_offset=36, range_base=0, range=-1) vec1 32 con ssa_27 = load_const (0x00000008 = 0.000000) vec1 32 con ssa_28 = load_const (0x00000007 = 0.000000) vec1 32 con ssa_29 = iand ssa_4.y, ssa_1 /*(0x00000002 = 0.000000)*/ vec1 32 con ssa_30 = ishl ssa_29, ssa_28 /*(0x00000007 = 0.000000)*/ vec1 32 con ssa_31 = load_const (0x7b000808 = 664776890994587263929995856502063104.000000) vec1 32 con ssa_32 = ior ssa_31 /*(0x7b000808 = 664776890994587263929995856502063104.000000)*/, ssa_30 ``` AFTER: ``` vec1 32 con ssa_23 = load_const (0xfffffffc = -nan) vec1 32 div ssa_24 = iand ssa_13, ssa_23 (0xfffffffc) vec1 32 con ssa_25 = load_const (0x00000024 = 0.000000) vec1 32 con ssa_26 = intrinsic load_ubo (ssa_1 (0x2), ssa_25 (0x24)) (access=0, align_mul=1073741824, align_offset=36, range_base=0, range=-1) vec1 32 con ssa_27 = load_const (0x00000008 = 0.000000) vec1 32 con ssa_28 = load_const (0x00000007 = 0.000000) vec1 32 con ssa_29 = iand ssa_4.y, ssa_1 (0x2) vec1 32 con ssa_30 = ishl ssa_29, ssa_28 (0x7) vec1 32 con ssa_31 = load_const (0x7b000808 = 664776890994587263929995856502063104.000000) vec1 32 con ssa_32 = ior ssa_31 (0x7b000808), ssa_30 ``` and some instructions from crucible test func.gs.basic with NIR_DEBUG=print_consts, now showing float representation being selected: BEFORE: ``` vec4 32 ssa_10 = load_const (0x3e4ccccd, 0x3e4ccccd, 0x00000000, 0x00000000) = (0.200000, 0.200000, 0.000000, 0.000000) vec4 32 ssa_9 = intrinsic load_deref (ssa_42) (access=0) vec4 32 ssa_11 = fadd ssa_9, ssa_10 /*(0x3e4ccccd, 0x3e4ccccd, 0x00000000, 0x00000000) = (0.200000, 0.200000, 0.000000, 0.000000)*/ ``` AFTER: ``` vec4 32 ssa_10 = load_const (0x3e4ccccd, 0x3e4ccccd, 0x00000000, 0x00000000) = (0.200000, 0.200000, 0.000000, 0.000000) vec4 32 ssa_9 = intrinsic load_deref (ssa_42) (access=0) vec4 32 ssa_11 = fadd ssa_9, ssa_10 (0.200000, 0.200000, 0.000000, 0.000000) ``` Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	3cfdab8f92	nir: Allow nir_gather_ssa_types() to ignore regs instead of assert If we infer a type for a reg, just ignore and keep going. This will allow to use this pass even when registers are present. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Alyssa Rosenzweig	190b1fdc64	nir: Convert to nir_foreach_function_impl Done by hand at each call site but going very quickly with funny Vim motions and common regexes. This is a very common idiom in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23807>	2023-06-27 22:44:04 +00:00
Alyssa Rosenzweig	19daa9283c	nir: Add nir_foreach_function_impl helper Most users of nir_foreach_function actually want the nir_function_impl, not the nir_function, and want to skip empty functions (though some graphics-specific passes sometimes fail to do that part). Add a nir_foreach_function_impl macro to make that case more ergonomic. nir_foreach_function_impl(impl, shader) { ... foo(impl) } is equivalent to: nir_foreach_function(func, shader) { if (func->impl) { ... foo(func->impl); } } Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23807>	2023-06-27 22:44:04 +00:00
Rhys Perry	8649bde78f	nir/opt_intrinsic: optimize quad vote Optimizes a quadAll()/quadAny() pattern created by dxil-spirv: `7adc87d4de` dxil-spirv can't use clustered reductions because they are not guaranteed to include helper invocations. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23621>	2023-06-27 18:53:50 +00:00
Rhys Perry	58f8e0e2a0	nir,aco: add INCLUDE_HELPERS index to reduce intrinsic Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23621>	2023-06-27 18:53:50 +00:00
Rhys Perry	48674a1799	nir/peephole_select: allow some invocation broadcast intrinsics fossil-db (navi21): Totals from 3 (0.00% of 133428) affected shaders: Instrs: 2074 -> 2083 (+0.43%) CodeSize: 10596 -> 10692 (+0.91%) Latency: 75754 -> 75946 (+0.25%) InvThroughput: 16900 -> 16975 (+0.44%) Copies: 312 -> 309 (-0.96%) Branches: 150 -> 132 (-12.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23621>	2023-06-27 18:53:49 +00:00
Alyssa Rosenzweig	069cca9d66	treewide: Remove unused builders -Wunused-variables kicks in now that it can see through the init. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	173b9ee69a	treewide: Use nir_builder_create more perl -p0e 's/nir_builder_init\(&([^,]*), /\1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	815efcdf7e	nir: Use nir_builder_create perl -p0e 's/nir_builder ([^;]*);\s*nir_builder_init\(&\1, /nir_builder \1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	e5410f9b00	nir: Add nir_builder_create returning nir_builder More ergonomic. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Konstantin Seurer	ddb7cf7a25	nir/builder_opcodes: Remove nir_build_ prefixed helpers This patch decreases the size of nir_builder_opcodes.h from 14292 loc to 13763 loc. nir_build_* versions are still needed if the nir_* is a custom helper. Intrinsics which need such a helper have to be added to build_prefixed_intrinsics. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23858>	2023-06-27 17:37:54 +00:00
Konstantin Seurer	400645a565	nir: Use nir_ instead of nir_build_ helpers Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23858>	2023-06-27 17:37:54 +00:00
Alyssa Rosenzweig	c24b753378	nir/lower_blend: Optimize masked out RTs While debugging KHR-GLES31.core.draw_buffers_indexed.color_masks, the noise from piles of store_output(load_output) instructions got in the way. Optimize it out. This does not fix the test, but if this case ever happened in a real app it would improve performance. This is only load bearing on Asahi (and PanVK?), since Panfrost wouldn't call nir_lower_blend at all in this case. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23836>	2023-06-27 14:38:21 +00:00
Alyssa Rosenzweig	f318cab4a1	nir: Add lower_frag_coord_to_pixel_coord pass We've open coded this in a few backends. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23836>	2023-06-27 14:38:21 +00:00
Alyssa Rosenzweig	c7067660b2	nir: Add pixel_coord, frag_coord_zw intrinsics On some architectures, gl_FragCoord.xy is available as an integer but gl_FragCoord.zw requires interpolation. Add dedicated intrinsics so we can lower it all in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23836>	2023-06-27 14:38:21 +00:00
Alyssa Rosenzweig	6689c678fe	nir/lower_locals_to_regs: Add bool bitsize knob GLSL booleans (and hence bool derefs) may be translated either as 1-bit or 32-bit NIR registers, depending whether the backend uses nir_lower_bool_to_int32 or not. Add a knob for this and choose the right type for different backends. Fixes nir_validate failure on dEQP-VK.subgroups.ballot_broadcast.graphics.subgroupbroadcast_bvec3 run under lavapipe. That test indexes into a bvec3 array, and gallivm first lowers bools and then lowers derefs to registers, resulting in random 1-bit booleans mixed in with 32-bit bools. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23804>	2023-06-26 08:22:06 -04:00
Alyssa Rosenzweig	5c8f21412f	nir/lower_bool_to_int32: Fix progress reporting If we only lower parameters, that's still progress. Technically. Fixes: `6a29cb2654` ("nir/lower_bool_to_int32: add support for lowering functions.") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23804>	2023-06-26 08:22:03 -04:00
Yonggang Luo	5b29463746	nir: Add function nir_function_set_impl This function is added to create a strong relationship between nir_function_impl and nir_function, so that nir_function->impl->function == nir_function is always true when (nir_function->impl != NULL && nir_function->impl != NIR_SERIALIZE_FUNC_HAS_IMPL). Indeed, this invariant is already checked in the functions validate_function and validate_function_impl of nir_validate. Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23820>	2023-06-24 14:48:47 +00:00
Alyssa Rosenzweig	942c206cd1	nir: Add discard_agx intrinsic sample_mask_agx corresponds directly to the hardware's 2-source instruction, but it's hard to use correctly and even harder to legalize after the fact, since it's responsible for not only discard but also late depth/stencil testing. For our various high-level lowering passes, it's easier to use a one-source discard (where we don't have to worry about sample masks), which the compiler will internally lower to the two-source instruction. Introduce such an instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Diederik de Haas	231fa269ea	treewide: spelling fixes Debian's lintian tool flagged some spelling issues: assumtion -> assumption unkown -> unknown memeber -> member sucess -> success perfomance -> performance Signed-off-by: Diederik de Haas <didi.debian@cknow.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23618>	2023-06-23 12:20:59 +00:00
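
Commit 19daa9283c above describes nir_foreach_function_impl as equivalent to looping over all functions and skipping those without an impl. The following is a minimal sketch of that equivalence using mock types only. Every name prefixed with `mock_` is hypothetical and is not Mesa's actual definition; the back-pointer from impl to function is modeled on the invariant mentioned in commit 5b29463746.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration; NOT Mesa's real structures. */
struct mock_function;

struct mock_impl {
   int id;
   struct mock_function *function;   /* back-pointer to the owning function */
};

struct mock_function {
   struct mock_impl *impl;           /* NULL for a function without a body */
   struct mock_function *next;
};

struct mock_shader {
   struct mock_function *functions;  /* singly linked list of functions */
};

/* Return the first function at or after f that has an impl, or NULL. */
static struct mock_function *
mock_skip_to_impl(struct mock_function *f)
{
   while (f != NULL && f->impl == NULL)
      f = f->next;
   return f;
}

static struct mock_impl *
mock_first_impl(struct mock_shader *shader)
{
   struct mock_function *f = mock_skip_to_impl(shader->functions);
   return f ? f->impl : NULL;
}

static struct mock_impl *
mock_next_impl(struct mock_impl *impl)
{
   /* Relies on impl->function->impl == impl holding for every impl. */
   assert(impl->function != NULL && impl->function->impl == impl);
   struct mock_function *f = mock_skip_to_impl(impl->function->next);
   return f ? f->impl : NULL;
}

/* Visit every function, whether or not it has an impl. */
#define mock_foreach_function(func, shader) \
   for (struct mock_function *func = (shader)->functions; func != NULL; \
        func = func->next)

/* Visit only the impls, skipping functions whose impl is NULL. */
#define mock_foreach_function_impl(it, shader) \
   for (struct mock_impl *it = mock_first_impl(shader); it != NULL; \
        it = mock_next_impl(it))

/* The short form: the loop body only ever sees a valid impl. */
static int
mock_sum_ids_impl_loop(struct mock_shader *shader)
{
   int sum = 0;
   mock_foreach_function_impl(impl, shader) {
      sum += impl->id;
   }
   return sum;
}

/* The long form the commit message says it replaces. */
static int
mock_sum_ids_manual_check(struct mock_shader *shader)
{
   int sum = 0;
   mock_foreach_function(func, shader) {
      if (func->impl) {
         sum += func->impl->id;
      }
   }
   return sum;
}
```

Both helper functions should return the same result for any list in which each impl points back at its owning function. Functions with a NULL impl contribute nothing in either form.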


4558 commits