fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 00:58:13 +02:00

Author	SHA1	Message	Date
Georg Lehmann	44d0b785cc	nir/opt_algebraic: combine bitz/bitnz Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23298>	2023-06-29 13:39:30 +00:00
Georg Lehmann	6585209cdd	nir/lower_bit_size: mask bitz/bitnz src1 like shifts Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23298>	2023-06-29 13:39:30 +00:00
Georg Lehmann	481a34e82e	nir: add single bit test opcodes These directly map to amd's SALU s_bitcmp0/1. For VALU we can use v_cmp_class_f32 if the second source is constant. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23298>	2023-06-29 13:39:30 +00:00
Yonggang Luo	e1bf96dd56	glsl: Remove the extra scope in gl_nir_link_uniforms.c Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23920>	2023-06-29 11:29:54 +00:00
Yonggang Luo	dcf9cfd297	glsl: Switch to use nir_foreach_function_impl from nir_foreach_function Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23920>	2023-06-29 11:29:54 +00:00
Corentin Noël	a8d669b593	nir/split_64bit_vec3_and_vec4: Use the right number of components Always make sure to correctly deref and store a 64bits variable from the right number of components. This fixes the `spec@arb_enhanced_layouts@matching_fp64_types_` piglit tests for virgl. Corrects this validation issue: ``` decl_var INTERP_MODE_FLAT dvec2[] var_7@2 decl_var INTERP_MODE_FLAT dvec2[] var_7@3 ... vec1 32 ssa_302 = deref_var &var_7@2 (function_temp dvec2[]) vec1 32 ssa_303 = deref_var &var_7@3 (function_temp dvec2[]) vec1 32 ssa_304 = deref_array &(ssa_302)[ssa_301] (function_temp dvec2) /* &var_7@2[ssa_301] */ vec1 32 ssa_305 = deref_array &(ssa_303)[ssa_301] (function_temp dvec2) /* &var_7@3[ssa_301] */ vec1 64 ssa_306 = mov ssa_110.z intrinsic store_deref (ssa_305, ssa_306) (wrmask=x, access=0) error: instr->num_components == glsl_get_vector_elements(dst->type) (../src/compiler/nir/nir_validate.c:632) vec4 64 ssa_111 = vec4 ssa_14, ssa_13, ssa_12, ssa_109 vec1 32 ssa_307 = load_const (0x00000000 = 0.000000) vec1 32 ssa_308 = iadd ssa_307, ssa_61 vec1 32 ssa_309 = deref_var &var_7@2 (function_temp dvec2[]) vec1 32 ssa_310 = deref_var &var_7@3 (function_temp dvec2[]) vec1 32 ssa_311 = deref_array &(ssa_309)[ssa_308] (function_temp dvec2) /* &var_7@2[ssa_308] */ vec1 32 ssa_312 = deref_array &(ssa_310)[ssa_308] (function_temp dvec2) /* &var_7@3[ssa_308] */ vec1 64 ssa_313 = mov ssa_111.w intrinsic store_deref (ssa_312, ssa_313) (wrmask=, access=0) error: (nir_intrinsic_write_mask(instr) & ~component_mask) == 0 (../src/compiler/nir/nir_validate.c:803) ``` Fixes: `496fd59d71` (add pass to split 64 bit vec3/4 variable access) Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23880>	2023-06-29 10:59:57 +00:00
Yonggang Luo	62ce223245	treewide: Switch to use nir_foreach_function_with_impl when possible Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23903>	2023-06-29 08:36:03 +00:00
Yonggang Luo	fde6b51749	nir: Split macro nir_foreach_function_with_impl out of nir_foreach_function_impl This macro nir_foreach_function_with_impl can be used when func and func->impl are both accessed in foreach loop Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23903>	2023-06-29 08:36:03 +00:00
Erik Faye-Lund	afa79cd9b8	nir: use imm-helpers We have to use 1ull instead of 1u because MSVC is stupid... Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23855>	2023-06-29 07:08:19 +00:00
Yonggang Luo	75ac852253	compiler: set alignment=1 by default for handling empty struct/interface in glsl_types.cpp When there is no elements in struct/interface, the alignment of it should be 1 instead of 0. Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23841>	2023-06-28 21:16:05 +00:00
Caio Oliveira	f4c2025e2c	nir/print: Print more representations in load_const In addition to the hexadecimal and float (when applicable), print the signed and unsigned representations. Representations may be omitted based on information about the value: - If gather types has unambiguous information, we use it; - Float is omitted for 8 bit values; - Signed decimal is omitted for positive values; - Unsigned decimal is omitted for small values (representation is same as hex); Note for now the "terse form" that appear in SSA uses is unchaged. Based on a patch by Mike Blumenkrantz. Examples: ``` // Just used as float. Omitted decimals. vec4 32 ssa_81 = load_const (0x3f800000, 0x3f800000, 0x3e4ccccd, 0x3f800000) = (1.000000, 1.000000, 0.200000, 1.000000) vec1 32 ssa_28 = load_const (0x3e4ccccd = 0.200000) // Just a small integer. Omitted float and decimal. vec1 32 ssa_45 = load_const (0x00000001) // Larger positive integers. Omitted float. vec1 32 ssa_39 = load_const (0x00002000 = 8192) vec1 32 ssa_30 = load_const (0x000000ff = 255) vec1 32 ssa_28 = load_const (0x00000010 = 16) // Integers with negative values. load_const (0xff = -1 = 255) load_const (0xff80 = -128 = 65408) load_const (0xffff = -1 = 65535) // Same value, in the first case we know is used as an integer. load_const (0xffffffe0 = -32 = 4294967264) load_const (0xffffffe0 = -nan = -32 = 4294967264) ``` Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	a185736a42	nir/print: Use src_type when printing consts in SSA uses If the src_type is not available, untie by looking at the results from nir_gather_ssa_types(). If that is ambiguous, just pick uint. Now in print_const_from_load() when the type is invalid, print the full constant form (with both padded hex and float); when the passed type is valid, print the terse form based on it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	5d15f4ef28	nir: Extract logic to get dest and srcs types from intrinsic Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	7de530d3df	nir: Make a const-friendly way to get the offset_src and arrayed_io_src from intrinsic The existing helper returns a `nir_src *` so expects a non-const instr. We plan to use this function in queries that don't modify the shader, so create (and use internally) a variant that returns the index instead. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	8f64415af7	nir/print: Make NIR_DEBUG=print_consts behavior the default Now there's a NIR_DEBUG=print_no_inline_consts to omit them. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	260a9167db	nir/print: Improve NIR_PRINT=print_consts by using nir_gather_ssa_types() The two representations are always used for `load_const`, but when inlining the value as SSA source, use just a single terse representation. The choice between integer or float is based on the result of nir_gather_ssa_types(), with a bias for integer when in doubt. Also remove extra comment `/* */` syntax since the value is already enclosed by parenthesis. --- For illustration, here's some instructions from crucible test func.shader.averageRounded.uint64_t with NIR_DEBUG=print_consts: BEFORE: ``` vec1 32 con ssa_23 = load_const (0xfffffffc = -nan) vec1 32 div ssa_24 = iand ssa_13, ssa_23 /*(0xfffffffc = -nan)*/ vec1 32 con ssa_25 = load_const (0x00000024 = 0.000000) vec1 32 con ssa_26 = intrinsic load_ubo (ssa_1 /*(0x00000002 = 0.000000)*/, ssa_25 /*(0x00000024 = 0.000000)*/) (access=0, align_mul=1073741824, align_offset=36, range_base=0, range=-1) vec1 32 con ssa_27 = load_const (0x00000008 = 0.000000) vec1 32 con ssa_28 = load_const (0x00000007 = 0.000000) vec1 32 con ssa_29 = iand ssa_4.y, ssa_1 /*(0x00000002 = 0.000000)*/ vec1 32 con ssa_30 = ishl ssa_29, ssa_28 /*(0x00000007 = 0.000000)*/ vec1 32 con ssa_31 = load_const (0x7b000808 = 664776890994587263929995856502063104.000000) vec1 32 con ssa_32 = ior ssa_31 /*(0x7b000808 = 664776890994587263929995856502063104.000000)*/, ssa_30 ``` AFTER: ``` vec1 32 con ssa_23 = load_const (0xfffffffc = -nan) vec1 32 div ssa_24 = iand ssa_13, ssa_23 (0xfffffffc) vec1 32 con ssa_25 = load_const (0x00000024 = 0.000000) vec1 32 con ssa_26 = intrinsic load_ubo (ssa_1 (0x2), ssa_25 (0x24)) (access=0, align_mul=1073741824, align_offset=36, range_base=0, range=-1) vec1 32 con ssa_27 = load_const (0x00000008 = 0.000000) vec1 32 con ssa_28 = load_const (0x00000007 = 0.000000) vec1 32 con ssa_29 = iand ssa_4.y, ssa_1 (0x2) vec1 32 con ssa_30 = ishl ssa_29, ssa_28 (0x7) vec1 32 con ssa_31 = load_const (0x7b000808 = 664776890994587263929995856502063104.000000) vec1 32 con ssa_32 = ior ssa_31 (0x7b000808), ssa_30 ``` and some instructions from crucible test func.gs.basic with NIR_DEBUG=print_consts, now showing float representation being selected: BEFORE: ``` vec4 32 ssa_10 = load_const (0x3e4ccccd, 0x3e4ccccd, 0x00000000, 0x00000000) = (0.200000, 0.200000, 0.000000, 0.000000) vec4 32 ssa_9 = intrinsic load_deref (ssa_42) (access=0) vec4 32 ssa_11 = fadd ssa_9, ssa_10 /*(0x3e4ccccd, 0x3e4ccccd, 0x00000000, 0x00000000) = (0.200000, 0.200000, 0.000000, 0.000000)*/ ``` AFTER: ``` vec4 32 ssa_10 = load_const (0x3e4ccccd, 0x3e4ccccd, 0x00000000, 0x00000000) = (0.200000, 0.200000, 0.000000, 0.000000) vec4 32 ssa_9 = intrinsic load_deref (ssa_42) (access=0) vec4 32 ssa_11 = fadd ssa_9, ssa_10 (0.200000, 0.200000, 0.000000, 0.000000) ``` Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Caio Oliveira	3cfdab8f92	nir: Allow nir_gather_ssa_types() to ignore regs instead of assert If we infer a type for a reg, just ignore and keep going. This will allow to use this pass even when registers are present. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23562>	2023-06-28 20:17:18 +00:00
Alyssa Rosenzweig	190b1fdc64	nir: Convert to nir_foreach_function_impl Done by hand at each call site but going very quickly with funny Vim motions and common regexes. This is a very common idiom in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23807>	2023-06-27 22:44:04 +00:00
Alyssa Rosenzweig	19daa9283c	nir: Add nir_foreach_function_impl helper Most users of nir_foreach_function actually want the nir_function_impl, not the nir_function, and want to skip empty functions (though some graphics-specific passes sometimes fail to do that part). Add a nir_foreach_function_impl macro to make that case more ergonomic. nir_foreach_function_impl(impl, shader) { ... foo(impl) } is equivalent to: nir_foreach_function(func, shader) { if (func->impl) { ... foo(func->impl); } } Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23807>	2023-06-27 22:44:04 +00:00
Rhys Perry	8649bde78f	nir/opt_intrinsic: optimize quad vote Optimizes a quadAll()/quadAny() pattern created by dxil-spirv: `7adc87d4de` dxil-spirv can't use clustered reductions because they are not guaranteed to include helper invocations. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23621>	2023-06-27 18:53:50 +00:00
Rhys Perry	58f8e0e2a0	nir,aco: add INCLUDE_HELPERS index to reduce intrinsic Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23621>	2023-06-27 18:53:50 +00:00
Rhys Perry	48674a1799	nir/peephole_select: allow some invocation broadcast intrinsics fossil-db (navi21): Totals from 3 (0.00% of 133428) affected shaders: Instrs: 2074 -> 2083 (+0.43%) CodeSize: 10596 -> 10692 (+0.91%) Latency: 75754 -> 75946 (+0.25%) InvThroughput: 16900 -> 16975 (+0.44%) Copies: 312 -> 309 (-0.96%) Branches: 150 -> 132 (-12.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23621>	2023-06-27 18:53:49 +00:00
Alyssa Rosenzweig	069cca9d66	treewide: Remove unused builders -Wunused-variables kicks in now that it can see through the init. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	173b9ee69a	treewide: Use nir_builder_create more perl -p0e 's/nir_builder_init\(&([^,]*), /\1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	815efcdf7e	nir: Use nir_builder_create perl -p0e 's/nir_builder ([^;]);\snir_builder_init\(&\1, /nir_builder \1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	e5410f9b00	nir: Add nir_builder_create returning nir_builder More ergonomic. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Konstantin Seurer	ddb7cf7a25	nir/builder_opcodes: Remove nir_build_ prefixed helpers This patch decreases the size of nir_builder_opcodes.h from 14292 loc to 13763 loc. nir_build_ versions are still needed if the nir_ is a custom helper. Intrinsics which need such a helper have to be added to build_prefixed_intrinsics. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23858>	2023-06-27 17:37:54 +00:00
Konstantin Seurer	400645a565	nir: Use nir_ instead of nir_build_ helpers Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23858>	2023-06-27 17:37:54 +00:00
Konstantin Seurer	083f7dba5b	vtn: Use nir_ instead of nir_build_ helpers Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23858>	2023-06-27 17:37:54 +00:00
Alyssa Rosenzweig	c24b753378	nir/lower_blend: Optimize masked out RTs While debugging KHR-GLES31.core.draw_buffers_indexed.color_masks, the noise from piles of store_output(load_output) instructions got in the way. Optimize it out. This does not fix the test, but if this case ever happened in a real app it would improve performance. This is only load bearing on Asahi (and PanVK?), since Panfrost wouldn't call nir_lower_blend at all in this case. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23836>	2023-06-27 14:38:21 +00:00
Alyssa Rosenzweig	f318cab4a1	nir: Add lower_frag_coord_to_pixel_coord pass We've open coded this in a few backends. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23836>	2023-06-27 14:38:21 +00:00
Alyssa Rosenzweig	c7067660b2	nir: Add pixel_coord, frag_coord_zw intrinsics On some architectures, gl_FragCoord.xy is available as an integer but gl_FragCoord.zw requires interpolation. Add dedicated intrinsics so we can lower it all in NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23836>	2023-06-27 14:38:21 +00:00
Alyssa Rosenzweig	6689c678fe	nir/lower_locals_to_regs: Add bool bitsize knob GLSL booleans (and hence bool derefs) may be translated either as 1-bit or 32-bit NIR registers, depending whether the backend uses nir_lower_bool_to_int32 or not. Add a knob for this and choose the right type for different backends. Fixes nir_validate failure on dEQP-VK.subgroups.ballot_broadcast.graphics.subgroupbroadcast_bvec3 run under lavapipe. That test indexes into a bvec3 array, and gallivm first lowers bools and then lowers derefs to registers, resulting in random 1-bit booleans mixed in with 32-bit bools. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23804>	2023-06-26 08:22:06 -04:00
Alyssa Rosenzweig	5c8f21412f	nir/lower_bool_to_int32: Fix progress reporting If we only lower parameters, that's still progress. Technically. Fixes: `6a29cb2654` ("nir/lower_bool_to_int32: add support for lowering functions.") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23804>	2023-06-26 08:22:03 -04:00
Corentin Noël	bc2828a436	compiler: Allow the explicit_stride of aoa types to be zero The explicit stride doesn't have to be defined to aoa and therefore can be zero in some cases, like in arrays of arrays of uniform blocks. Resolves crash with spec@arb_gl_spirv@execution@ubo@aoa-2.shader_test piglit test for virgl. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Gert Wollny <gert.wollny@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23648>	2023-06-26 09:19:43 +02:00
Yonggang Luo	5b29463746	nir: Add function nir_function_set_impl This function is added for create strong relationship between nir_function_impl and nir_function. So that nir_function->impl->function == nir_function is always true when (nir_function->impl != NULL && nir_function->impl != NIR_SERIALIZE_FUNC_HAS_IMPL) And indeed this invariant is already done in functions validate_function and validate_function_impl of nir_validate Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23820>	2023-06-24 14:48:47 +00:00
Yonggang Luo	9fa38cf142	vtn: Do not assign main_entry_point->impl twice Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23820>	2023-06-24 14:48:47 +00:00
Alyssa Rosenzweig	942c206cd1	nir: Add discard_agx intrinsic sample_mask_agx corresponds directly to the hardware's 2-source instruction, but it's hard to use correctly and even harder to legalize after the fact, since it's responsible for not only discard but also late depth/stencil testing. For our various high-level lowering passes, it's easier to use a one-source discard (where we don't have to worry about sample masks), which the compiler will internally lower to the two-source instruction. Introduce such an instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23832>	2023-06-23 17:37:41 +00:00
Diederik de Haas	231fa269ea	treewide: spelling fixes Debian's lintian tool flagged some spelling issues: assumtion -> assumption unkown -> unknown memeber -> member sucess -> success perfomance -> performance Signed-off-by: Diederik de Haas <didi.debian@cknow.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23618>	2023-06-23 12:20:59 +00:00
Timothy Arceri	d336bc3926	glsl: call nir_opt_find_array_copies() when linking shader-db results IRIS (BDW): total instructions in shared programs: 17883388 -> 17859658 (-0.13%) instructions in affected programs: 48100 -> 24370 (-49.33%) helped: 6 HURT: 0 helped stats (abs) min: 1450 max: 7028 x̄: 3955.00 x̃: 3387 helped stats (rel) min: 40.31% max: 51.92% x̄: 47.07% x̃: 48.96% 95% mean confidence interval for instructions value: -6613.28 -1296.72 95% mean confidence interval for instructions %-change: -52.73% -41.40% Instructions are helped. total cycles in shared programs: 866961809 -> 863521521 (-0.40%) cycles in affected programs: 9179396 -> 5739108 (-37.48%) helped: 6 HURT: 0 helped stats (abs) min: 252584 max: 972430 x̄: 573381.33 x̃: 495130 helped stats (rel) min: 21.80% max: 48.65% x̄: 35.01% x̃: 34.58% 95% mean confidence interval for cycles value: -917157.00 -229605.67 95% mean confidence interval for cycles %-change: -47.61% -22.40% Cycles are helped. total spills in shared programs: 20417 -> 15521 (-23.98%) spills in affected programs: 6966 -> 2070 (-70.28%) helped: 6 HURT: 0 total fills in shared programs: 25151 -> 21005 (-16.48%) fills in affected programs: 4374 -> 228 (-94.79%) helped: 6 HURT: 0 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9055 Fixes: `d75a36a9ee` ("glsl: remove do_copy_propagation_elements() optimisation pass") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23737>	2023-06-23 09:10:15 +10:00
Karol Herbst	570c263ea3	nir/load_libclc: run some opt passes for everybody Cuts down serialized size from 2850288 to 1377780 bytes. Reduces clinfo with Rusticl time by 40% for debug builds. (Old data, but the point stands) Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15996>	2023-06-22 21:02:57 +00:00
Faith Ekstrand	f278b30e94	nir/opt_if: Use block_ends_in_jump Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23782>	2023-06-22 19:55:49 +00:00
Alyssa Rosenzweig	7ddfc43fdf	nir: Remove integer and 64-bit modifiers Now that Intel and R600 both do their own modifier propagation, the only backends that still lower modifiers in NIR are: * nir-to-tgsi * lima * etnaviv * a2xx The latter 3 backends do not support integers, and certainly do not support fp64. So they don't use these. TGSI in theory supports integer negate modifiers but NTT doesn't use them, so they're unused there too. Since they're unused, we remove NIR support for integer and 64-bit modifiers, leaving only 16/32-bit float modifiers. This will reduce the scope needed for a replacement to NIR modifiers, being pursued in !23089. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23782>	2023-06-22 19:55:49 +00:00
Yonggang Luo	ff29016753	meson: Guard the glsl tests that only working when OpenGL ES2 is enabled Reviewed-by: Eric Engestrom <eric@igalia.com> Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23793>	2023-06-22 11:08:11 +00:00
Pavel Ondračka	b4ca45911d	nir_opt_algebraic: don't use i32csel without native integer support Otherwise nir_lower_int_to_float (or specifically nir_gather_ssa_types) will fail to recognize we already have float constants and converts them again. Example from spec/glsl-1.10/execution/vs-loop-array-index-unroll.shader_test with r300 driver (after enabling has_fused_comp_and_csel). impl main { block block_0: /* preds: */ vec1 32 ssa_0 = load_const (0x00000000 = 0.000000) vec4 32 ssa_1 = intrinsic load_input (ssa_0) (base=0, component=0, dest_type=float32, io location=VERT_ATTRIB_POS slots=1) /* gl_Vertex */ vec3 32 ssa_2 = load_const (0x00000000, 0x3e800000, 0x3f800000) = (0.000000, 0.250000, 1.000000) vec3 32 ssa_3 = load_const (0x00000000, 0x3f000000, 0x3f800000) = (0.000000, 0.500000, 1.000000) vec3 32 ssa_4 = load_const (0x00000000, 0x3f400000, 0x3f800000) = (0.000000, 0.750000, 1.000000) vec2 32 ssa_5 = load_const (0x00000000, 0x3f800000) = (0.000000, 1.000000) vec1 32 ssa_6 = load_const (0x3f800000 = 1.000000) vec1 32 ssa_7 = intrinsic load_ubo_vec4 (ssa_0, ssa_0) (access=0, base=0, component=0) vec4 32 ssa_8 = load_const (0x00000000, 0x00000001, 0x00000002, 0x00000003) = (0.000000, 0.000000, 0.000000, 0.000000) vec4 1 ssa_9 = ilt ssa_8, ssa_7.xxxx vec3 32 ssa_10 = bcsel ssa_9.www, ssa_5.xyy, ssa_4 vec3 32 ssa_11 = bcsel ssa_9.zzz, ssa_10, ssa_3 vec3 32 ssa_12 = bcsel ssa_9.yyy, ssa_11, ssa_2 vec3 32 ssa_15 = i32csel_gt ssa_7.xxx, ssa_12, ssa_6.xxx vec4 32 ssa_14 = fsat ssa_15.xyxz intrinsic store_output (ssa_14, ssa_0) (base=1, wrmask=xyzw, component=0, src_type=float32, io location=VARYING_SLOT_COL0 slots=1, xfb(), xfb2()) /* gl_FrontColor */ intrinsic store_output (ssa_1, ssa_0) (base=0, wrmask=xyzw, component=0, src_type=float32, io location=VARYING_SLOT_POS slots=1, xfb(), xfb2()) /* gl_Position */ /* succs: block_1 */ block block_1: } and after nir_lower_int_to_float impl main { block block_0: /* preds: */ vec1 32 ssa_0 = load_const (0x00000000 = 0.000000) vec4 32 ssa_1 = intrinsic load_input (ssa_0) (base=0, component=0, dest_type=float32, io location=VERT_ATTRIB_POS slots=1) /* gl_Vertex */ vec3 32 ssa_2 = load_const (0x00000000, 0x4e7a0000, 0x4e7e0000) = (0.000000, 1048576000.000000, 1065353216.000000) vec3 32 ssa_3 = load_const (0x00000000, 0x4e7c0000, 0x4e7e0000) = (0.000000, 1056964608.000000, 1065353216.000000) vec3 32 ssa_4 = load_const (0x00000000, 0x4e7d0000, 0x4e7e0000) = (0.000000, 1061158912.000000, 1065353216.000000) vec2 32 ssa_5 = load_const (0x00000000, 0x4e7e0000) = (0.000000, 1065353216.000000) vec1 32 ssa_6 = load_const (0x4e7e0000 = 1065353216.000000) vec1 32 ssa_7 = intrinsic load_ubo_vec4 (ssa_0, ssa_0) (access=0, base=0, component=0) vec4 32 ssa_8 = load_const (0x00000000, 0x3f800000, 0x40000000, 0x40400000) = (0.000000, 1.000000, 2.000000, 3.000000) vec4 1 ssa_9 = flt ssa_8, ssa_7.xxxx vec3 32 ssa_10 = bcsel ssa_9.www, ssa_5.xyy, ssa_4 vec3 32 ssa_11 = bcsel ssa_9.zzz, ssa_10, ssa_3 vec3 32 ssa_12 = bcsel ssa_9.yyy, ssa_11, ssa_2 vec3 32 ssa_13 = fcsel_gt ssa_7.xxx, ssa_12, ssa_6.xxx vec4 32 ssa_14 = fsat ssa_13.xyxz intrinsic store_output (ssa_14, ssa_0) (base=1, wrmask=xyzw, component=0, src_type=float32, io location=VARYING_SLOT_COL0 slots=1, xfb(), xfb2()) /* gl_FrontColor */ intrinsic store_output (ssa_1, ssa_0) (base=0, wrmask=xyzw, component=0, src_type=float32, io location=VARYING_SLOT_POS slots=1, xfb(), xfb2()) /* gl_Position */ /* succs: block_1 */ block block_1: } Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23704>	2023-06-22 07:25:44 +00:00
Mike Blumenkrantz	402ae3b132	nir/lower_tex: ignore saturate for txf ops saturate is used for GL_CLAMP emulation, and GL_CLAMP cannot be used with txf ref #9226 cc: mesa-stable Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23750>	2023-06-21 23:13:50 +00:00
Michel Zou	badb85edb8	util: reinstate ENUM_PACKED gets rid of warning: 'gcc_struct' attribute ignored [-Wattributes] introduced by !23338 Fixes: `86532fa21d` ("util: Use the gcc_struct attribute for packed structures in mingw") Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23478>	2023-06-21 21:51:59 +00:00
Caio Oliveira	af9be8c024	nir/print: Print whether the shader is internal or not Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23756>	2023-06-21 00:01:10 +00:00
Caio Oliveira	59a72570b6	compiler: Move spirv into a module of its own For historical reasons, nir and vtn were compiled together, and a bunch of vtn specific targets were defined in src/compiler/meson.build. Now that we can, make src/compiler/spirv produce an internal library that depends on NIR, and is used by the drivers/tools. Also move the vtn specific targets into that directory's meson.build. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23668>	2023-06-20 16:18:08 +00:00
Caio Oliveira	cb588d5d6e	compiler/clc: Move related NIR passes to the common mesa clc These were historically in the spirv+nir combo, but the common mesa clc is a better home for them. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Nora Allen <blackcatgames@protonmail.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23667>	2023-06-20 03:43:41 +00:00
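
Note on 481a34e82e and 6585209cdd (single bit test opcodes): the sketch below models what a single-bit test could look like in plain C. It is not Mesa code. The function names are hypothetical, and the index-masking behavior is an assumption inferred from the "mask bitz/bitnz src1 like shifts" commit title.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Illustrative model only (hypothetical names, not the NIR definitions).
 * Returns true when bit `index` of `value` is set. The index is masked
 * to the bit size, the same way a shift count would be. */
static bool bitnz32(uint32_t value, uint32_t index)
{
    return ((value >> (index & 31u)) & 1u) != 0;
}

/* Returns true when bit `index` of `value` is clear. */
static bool bitz32(uint32_t value, uint32_t index)
{
    return !bitnz32(value, index);
}

/* Model of running a 16-bit test on 32-bit hardware: the value is
 * widened, and the index is first masked to the original 16-bit range
 * so an out-of-range index still addresses the same bit as before. */
static bool bitnz16_via32(uint16_t value, uint32_t index)
{
    return bitnz32((uint32_t)value, index & 15u);
}
```

Without the extra mask in `bitnz16_via32`, an index of 16 would select bit 16 of the widened value (always zero) instead of wrapping to bit 0, which is the kind of mismatch a bit-size lowering has to avoid.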
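
Note on 19daa9283c and 5b29463746 (nir_foreach_function_impl, nir_function_set_impl): the following self-contained sketch shows the iteration idiom and the impl/function back-pointer invariant using mock types. Every `mock_` name is hypothetical and stands in for the real NIR structures; the actual Mesa macro may be implemented differently.

```c
#include <assert.h>
#include <stddef.h>

typedef struct mock_function mock_function;

/* Stand-in for nir_function_impl: keeps a back pointer to its function. */
typedef struct mock_impl {
    mock_function *function;
    int num_blocks;
} mock_impl;

/* Stand-in for nir_function: impl is NULL for a bodiless declaration. */
struct mock_function {
    mock_function *next;
    mock_impl *impl;
};

typedef struct mock_shader {
    mock_function *functions;
} mock_shader;

/* Sets both directions at once so func->impl->function == func holds. */
static void mock_function_set_impl(mock_function *func, mock_impl *impl)
{
    func->impl = impl;
    impl->function = func;
}

/* Skips functions that have no implementation. */
static mock_impl *mock_first_impl_from(mock_function *func)
{
    while (func != NULL && func->impl == NULL)
        func = func->next;
    return func != NULL ? func->impl : NULL;
}

/* Advancing relies on the back pointer established by set_impl. */
static mock_impl *mock_next_impl(mock_impl *impl)
{
    assert(impl->function != NULL && impl->function->impl == impl);
    return mock_first_impl_from(impl->function->next);
}

#define mock_foreach_function_impl(it, shader)                        \
    for (mock_impl *it = mock_first_impl_from((shader)->functions);   \
         it != NULL; it = mock_next_impl(it))

/* Example pass: visits only functions that actually have a body. */
static int mock_count_blocks(mock_shader *shader)
{
    int total = 0;
    mock_foreach_function_impl(impl, shader)
        total += impl->num_blocks;
    return total;
}
```

The loop body never sees a NULL impl, so the `if (func->impl)` check from the longer form disappears from every call site.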
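
Note on e5410f9b00 and 069cca9d66 (nir_builder_create): a minimal sketch of the two initialization styles, using a hypothetical `mock_builder` type rather than the real `nir_builder`. The field layout is an assumption made only for illustration.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical stand-in for a builder object. */
typedef struct mock_builder {
    void *impl;
    int cursor;
} mock_builder;

/* Out-parameter style: declaration and initialization are separate
 * statements, and the variable's address is taken. */
static void mock_builder_init(mock_builder *b, void *impl)
{
    memset(b, 0, sizeof(*b));
    b->impl = impl;
}

/* Return-by-value style: the builder can be declared and initialized
 * in a single statement. */
static mock_builder mock_builder_create(void *impl)
{
    mock_builder b;
    memset(&b, 0, sizeof(b));
    b.impl = impl;
    return b;
}
```

With the by-value form, `mock_builder b = mock_builder_create(impl);` is an ordinary initialized local, so a compiler can report it when nothing else uses it. That matches the follow-up commit's remark that unused-variable warnings start firing once the compiler can see through the init.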
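
Note on b4ca45911d (i32csel without native integer support): the constants in that commit's dump can be reproduced with a few lines of C. The sketch assumes IEEE-754 binary32 `float`; the helper names are hypothetical. It shows what happens when a bit pattern that already encodes a float is mistaken for an integer and converted to float again.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Returns the raw bit pattern of a float (assumes IEEE-754 binary32). */
static uint32_t float_bits(float f)
{
    uint32_t u;
    memcpy(&u, &f, sizeof(u));
    return u;
}

/* Models lowering an integer constant to float: the 32-bit pattern is
 * read as a signed integer and its numeric value is converted. */
static uint32_t lower_int_const_to_float(uint32_t raw)
{
    return float_bits((float)(int32_t)raw);
}
```

For a real integer constant this is the desired result: 0x00000001 becomes 0x3f800000 (1.0). Applied to 0x3f800000, which is already the float 1.0, it yields 0x4e7e0000 (1065353216.0), the corrupted value visible in the "after nir_lower_int_to_float" dump.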


8177 commits