fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 00:38:06 +02:00

Author	SHA1	Message	Date
Yonggang Luo	3261a54c79	glsl: replace tab with 3 space in glcpp-parse.y Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19875>	2022-12-16 19:02:17 +00:00
Yonggang Luo	c5a4520b3c	glsl: Fixes ident issue in glsl_parser.yy and update editorconfig for it Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19875>	2022-12-16 19:02:17 +00:00
Karol Herbst	6d6c6caff1	nir_lower_io_to_scalar: handle load/store_global Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20106>	2022-12-16 08:02:32 +00:00
Karol Herbst	3cd641bebd	nir_lower_io_to_scalar: make use of nir_get_io_offset_src Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20106>	2022-12-16 08:02:32 +00:00
Kenneth Graunke	0521027182	nir: Allow more than just ALU instructions in 'weak' GVN This removes the ALU-only restriction on the "weak" GVN introduced by the previous commit. This makes it slightly more aggressive, allowing it to coalesce things like UBO loads (still within sister then/else blocks). This also can have surprisingly large cascading effects. I was concerned that this might increase register pressure, but shader-db and fossil-db show effectively no change in spills/fills, so it seems to be fine. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19823>	2022-12-14 20:56:55 +00:00
Kenneth Graunke	d5d03a7273	nir: Perform 'weak' global value numbering in all GCM passes Full global value numbering (GVN) can be pretty aggressive, moving values far away from their original locations, even out of loops, and can extend their live ranges a lot. So we've left it disabled. This patch introduces a weaker form of GVN: we only allow coalescing identical values when they appear on either side of the same if/else construct. For now, we also only allow ALU instructions. This allows nir_opt_gcm to clean up identical instructions appearing on both sides of if/then/else control flow. But it avoids aggressively combining every other occurrence of a value in the program. This can still have surprisingly large cascading effects, as simple constructs are cleaned up, leading to more opportunities to do the same clean up, up a chain of nested ifs. It also enables greater use of the select peephole as ifs are cleaned up. shader-db and fossil-db results show a reduction in spills/fills on Icelake, so it doesn't seem to be hurting register pressure. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19823>	2022-12-14 20:56:55 +00:00
Samuel Pitoiset	877c10efd1	spirv: add support for AMD_shader_early_and_late_fragment_tests Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19738>	2022-12-14 08:16:27 +00:00
Ian Romanick	eb76cee9f8	nir: Eliminate nir_op_i2b There are a lot of optimizations in opt_algebraic that match ('ine', a, 0), but there are almost none that match i2b. Instead of adding a huge pile of additional patterns (including variations that include both ine and i2b), always lower i2b to a != 0. At this point in the series, it should be impossible for anything to generate i2b, so there /should not/ be any changes. The failing test on d3d12 is a pre-existing bug that is triggered by this change. I talked to Jesse about it, and, after some analysis, he suggested just adding it to the list of known failures. v2: Don't rematerialize i2b instructions in dxil_nir_lower_x2b. v3: Don't rematerialize i2b instructions in zink_nir_algebraic.py. v4: Fix zink-on-TGL CI failures by calling nir_opt_algebraic after nir_lower_doubles makes progress. The latter can generate b2i instructions, but nir_lower_int64 can't handle them (anymore). v5: Add back most of the hunk at line 2125 of nir_opt_algebraic.py. I had accidentally removed the f2b(bf2(x)) optimization. v6: Just eliminate the i2b instruction. v7: Remove missed i2b32 in midgard_compile.c. Remove (now unused) emit_alu_i2orf2_b1 function from sfn_instr_alu.cpp. Previously this function was still used. 🤷 No shader-db changes on any Intel platform. All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141165875 -> 141165873 (-0.0%) Instructions helped: 2 Cycles in all programs: 9098956382 -> 9098956350 (-0.0%) Cycles helped: 2 The two Vulkan shaders are helped because of the "new" (('b2i32', ('ine', ('ubfe', a, b, 1), 0)), ('ubfe', a, b, 1)) algebraic pattern. Acked-by: Jesse Natalie <jenatali@microsoft.com> [earlier version] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> [earlier version] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	8b37046765	nir/builder: Handle i2b conversions specially in nir_type_convert The shaders affected here are ones that were previously affected when i2b was unconditionally lowered in opt_algebraic. There are a few places where some transformations happen in a different order, so some algebraic patterns are missed. All Broadwell and newer Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 19914369 -> 19914566 (<.01%) instructions in affected programs: 92375 -> 92572 (0.21%) helped: 0 / HURT: 90 total cycles in shared programs: 853851470 -> 853867215 (<.01%) cycles in affected programs: 12400663 -> 12416408 (0.13%) helped: 28 / HURT: 69 Haswell and Ivy Bridge had similar results. (Haswell shown) total instructions in shared programs: 16710721 -> 16710700 (<.01%) instructions in affected programs: 108010 -> 107989 (-0.02%) helped: 57 / HURT: 103 total cycles in shared programs: 884299412 -> 884306546 (<.01%) cycles in affected programs: 12986423 -> 12993557 (0.05%) helped: 87 / HURT: 102 total spills in shared programs: 14937 -> 14925 (-0.08%) spills in affected programs: 12 -> 0 helped: 9 / HURT: 0 total fills in shared programs: 17569 -> 17557 (-0.07%) fills in affected programs: 12 -> 0 helped: 9 / HURT: 0 Sandy Bridge total instructions in shared programs: 13902341 -> 13902347 (<.01%) instructions in affected programs: 7311 -> 7317 (0.08%) helped: 3 / HURT: 8 total cycles in shared programs: 741795500 -> 741792266 (<.01%) cycles in affected programs: 273308 -> 270074 (-1.18%) helped: 9 / HURT: 2 No shader-db changes on any other Intel platform. No fossil-db changes on any Intel platform. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	58164794f4	spirv: Use nir_type_convert instead of nir_type_conversion_op In a future commit, nir_type_conversion_op won't be able to handle i2b (and in a much later commit f2b), so switch many users to the fully featured function. No shader-db or fossil-db changes on any Intel platform. v2: Use the actual bit size of the source to determine the conversion op. With mediump, the "planned" bit size and the actual bit size might be different. Fixes many, many Vulkan CTS assertion failures on any platform that sets mediump_16bit_alu (e.g., Freedreno). Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	ded3572947	nir: Use nir_type_convert instead of nir_type_conversion_op In a future commit, nir_type_conversion_op won't be able to handle i2b (and in a much later commit f2b), so switch many users to the fully featured function. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	1197030727	glsl: Use nir_type_convert instead of nir_type_conversion_op In a future commit, nir_type_conversion_op won't be able to handle i2b (and in a much later commit f2b), so switch many users to the fully featured function. In gl_nir_lower_packed_varyings, all of the type conversions are between int32 and uint32 types. In NIR, those are just moves, so elide them. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	9f86d18b2d	nir/builder: Add rounding mode parameter to nir_type_convert Later changes will use this. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	43da822312	glsl_to_nir: Fix NIR bit-size of ir_triop_bitfield_extract and ir_quadop_bitfield_insert Previously these would return result->bit_size of 32 even though the type might have been int16_t or uint16_t. This prevents many assertion failures in "glsl: Use nir_type_convert instead of nir_type_conversion_op" on zink. Fixes: `5e922fbc16` ("glsl_to_nir: fix bitfield_extract with 16-bit operands") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	9342c14eeb	nir/builder: Emit x != 0 for nir_i2b There are a lot of optimizations in opt_algebraic that match ('ine', a, 0), but there are almost none that match i2b. Instead of adding a huge pile of additional patterns (including variations that include both ine and i2b), just emit a != 0 instead of i2b(a). I think that the changes to the unit tests weaken them slightly, but perhaps that's okay? No shader-db changes on any Intel platform. The GLSL paths use other means to generate i2b operations, but the SPIR-V paths use nir_i2b. Presumably since `4676b3d3dd` (nir: Use nir_test_mask instead of i2b(iand)), no fossil-db changes either. v2: Use nir_ine_imm. Suggested by Jesse. Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	7a5e9df39d	nir: Use nir_i2b wrapper everywhere instead of using nir_i2b1 directly No shader-db or fossil-db changes on any Intel platform. v2: Add missed i2b1 in ir3_nir_opt_preamble.c. v3: Add missed i2b1 in ac_nir_lower_ngg.c. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	b60b2f2add	nir/algebraic: Optimize some b2i involved in masking operations v2: Remove the ineg from the b2i in the ior pattern. Suggested by Jason. All Ivy Bridge and newer Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 19914441 -> 19914369 (<.01%) instructions in affected programs: 63507 -> 63435 (-0.11%) helped: 24 / HURT: 0 total cycles in shared programs: 853869766 -> 853851470 (<.01%) cycles in affected programs: 10551542 -> 10533246 (-0.17%) helped: 24 / HURT: 0 All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141163061 -> 141092683 (-0.0%) Instructions helped: 14103 Instructions hurt: 55 Cycles in all programs: 9132376195 -> 9133183045 (+0.0%) Cycles helped: 13775 Cycles hurt: 380 Spills in all programs: 18286 -> 18284 (-0.0%) Spills helped: 1 Fills in all programs: 30647 -> 30643 (-0.0%) Fills helped: 1 Gained: 133 Lost: 130 Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	ba0b248ac2	nir/algebraic: Eliminate unary op on src of integer comparison w/ zero This helps because it enables cmod propagation to do more. The removed patterns involving b2i will be handled by other existing patterns after the unary operations are removed. All Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 19914458 -> 19914441 (<.01%) instructions in affected programs: 5456 -> 5439 (-0.31%) helped: 17 / HURT: 0 total cycles in shared programs: 855302118 -> 853869766 (-0.17%) cycles in affected programs: 327354347 -> 325921995 (-0.44%) helped: 291 / HURT: 81 All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141205979 -> 141205961 (-0.0%) Instructions helped: 4 Instructions hurt: 3 SENDs in all programs: 7466919 -> 7466913 (-0.0%) SENDs helped: 1 Cycles in all programs: 9133387327 -> 9133384475 (-0.0%) Cycles helped: 3 Cycles hurt: 12 In the shader that was helped for sends, it appears that a NIR pass that moves code out of loops was able to move 3 send operations outside a loop after this change. I did not investigate further. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:20 +00:00
Ian Romanick	ee15d89322	nir/algebraic: Simplify min and max of b2i This prevents ~400 shader-db regressions and a handful of fossil-db regressions after i2b is always lowered. All Ivy Bridge and newer Intel platforms had similar results. (Ice Lake shown) total cycles in shared programs: 855301494 -> 855302118 (<.01%) cycles in affected programs: 52787 -> 53411 (1.18%) helped: 4 / HURT: 5 All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141206055 -> 141205979 (-0.0%) Instructions helped: 14 Cycles in all programs: 9133376616 -> 9133387327 (+0.0%) Cycles helped: 13 Cycles hurt: 3 Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:20 +00:00
Ian Romanick	19222867e4	nir/algebraic: Reassociate some iand to eliminate an operation No shader-db changes on any Intel platform. All of the helped shaders were presumably regressed by `4676b3d3dd` (nir: Use nir_test_mask instead of i2b(iand)). v2: Add some comments explaining why specific replacements are used. In the umin pattern, only markup the first usage of 'b' in the source pattern. Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) Instructions in all programs: 141384970 -> 141200966 (-0.1%) Instructions helped: 45842 Cycles in all programs: 9133648977 -> 9133282672 (-0.0%) Cycles helped: 26812 Cycles hurt: 6025 Gained: 23 Lost: 135 Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:20 +00:00
Ian Romanick	d48ce1f47d	nir/algebraic: Remove redundant i2b(b2i(x)) patterns A loop below already adds all the permutations... including the 1-bit version that isn't included in this group. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:20 +00:00
Ian Romanick	14a9bb04e4	nir/algebraic: Remove redundant i2b(-x) pattern The exact same pattern appears later (around line 1323). No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:20 +00:00
Ian Romanick	8d90b13954	nir/algebraic: Catch some kinds of copy-and-paste bugs in algebraic patterns A later commit adds a pattern (('umin', ('iand', a, '#b(is_pos_power_of_two)'), ('iand', c, '#b(is_pos_power_of_two)')), ('iand', ('iand', a, b), ('iand', c, b))), When I originally made that pattern, I copied and pasted the search to the replacement as (('umin', ('iand', a, '#b(is_pos_power_of_two)'), ('iand', c, '#b(is_pos_power_of_two)')), ('iand', ('iand', a, '#b(is_pos_power_of_two)'), ('iand', c, '#b(is_pos_power_of_two)'))), This caused the variables in the replacement to be marked is_constant, and that resulted in an assertion failure deep inside nir_search. src/compiler/nir/nir_search.c:530: construct_value: Assertion `!var->is_constant' failed. These extra validation rules catch this kind of error at compile time rather than at run time. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:20 +00:00
Marek Olšák	a3aea98a2a	nir: validate that store_buffer_amd doesn't use a non-trivial writemask Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19422>	2022-12-13 20:33:05 +00:00
Marek Olšák	150c2cec63	nir: add ACCESS_USES_FORMAT_AMD for typed buffer opcodes Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19422>	2022-12-13 20:33:05 +00:00
Marek Olšák	716ac4a55d	nir: replace IS_SWIZZLED flag with ACCESS_IS_SWIZZLED_AMD Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19422>	2022-12-13 20:33:05 +00:00
Marek Olšák	7998c3bdd3	nir: remove redundant SLC_AMD in favor of ACCESS_STREAM_CACHE_POLICY ACCESS_STREAM_CACHE_POLICY was added to map to SLC for AMD. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19422>	2022-12-13 20:33:05 +00:00
Marek Olšák	c0d69b40bc	nir: add nir_texop_sampler_descriptor_amd We'll use it to query the min/mag filter in the shader. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19422>	2022-12-13 20:33:05 +00:00
Qiang Yu	9a6416b374	nir,ac/llvm,radv: add stream id index to nir_load_ring_gsvs_amd Used by legacy GS to store output to a different ring according to stream id. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20158>	2022-12-13 11:43:45 +08:00
Qiang Yu	796a150196	nir: add nir_load_ring_gs2vs_offset_amd Used by legacy GS output lowering. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20158>	2022-12-13 11:42:33 +08:00
Qiang Yu	fd240f759f	nir,radv,radeonsi: add nir_atomic_add_gs_invocation_count_amd For shader query emulation. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20156>	2022-12-13 01:26:42 +00:00
Timothy Arceri	9e9b8dc7f8	glsl: fix function inlining for images Here we skip replacing parameters with their actual values for images as glsl_to_nir() expects them to be copied to temps first. Tree grafting has a similar rule to avoid this happening also. Fixes: `8d10a6835f` ("glsl: dont create temps for builtin function inputs") Tested-by: Martin Roukala <martin.roukala@mupuf.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20274>	2022-12-12 21:28:44 +00:00
Konstantin Seurer	7a994d92ff	spirv: Add a debug option to force non uniform texture sampling Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20243>	2022-12-12 18:18:32 +00:00
Friedrich Vock	e20564cfdb	nir/lower_shader_calls: Remove phis after dead control flow This potentially gets rid of some more phis without sources. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19960>	2022-12-11 22:13:32 +00:00
Friedrich Vock	a54c2c8289	nir: Do not consider phis with incompatible dests equal CSE tries to collapse equal instructions, and collapsing two phis with incompatible dests is illegal. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `6bdce55c` ("nir: Add a basic CSE pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19960>	2022-12-11 22:13:32 +00:00
Rhys Perry	907fbf22dd	nir/gather_info: use nir_ssa_scalar_resolved This lets us skip copies. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rhys Perry	085828ea4d	vtn: add mesh output and task_payload to vtn_mode_is_cross_invocation This fixes a potential race condition, and removes output loads (which should not exist in the EXT_mesh_shader). Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7391 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rhys Perry	e1f5100311	nir: add task_payload and shader_out to nir_var_vec_indexable_modes Since these can be cross-invocation, we need this to write individual components without race conditions or loads. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7391 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Georg Lehmann	4dff3ff005	nir/opt_algebraic: Optimize open coded bfm. Foz-DB Navi21: Totals from 1553 (1.15% of 134913) affected shaders: SpillVGPRs: 2246 -> 2223 (-1.02%); split: -1.42%, +0.40% CodeSize: 10409156 -> 10410720 (+0.02%); split: -0.03%, +0.04% Instrs: 1899725 -> 1898773 (-0.05%); split: -0.07%, +0.02% Latency: 71225814 -> 71118314 (-0.15%); split: -0.21%, +0.06% InvThroughput: 13384926 -> 13330369 (-0.41%); split: -0.47%, +0.06% VClause: 38309 -> 38284 (-0.07%); split: -0.17%, +0.11% SClause: 70743 -> 70706 (-0.05%) Copies: 167296 -> 167230 (-0.04%); split: -0.28%, +0.24% Branches: 42446 -> 42444 (-0.00%); split: -0.01%, +0.00% PreVGPRs: 95191 -> 95188 (-0.00%) Some minor instructions count regressions in parallel-rdp because v_bfm_b32 can't use SDWA, but overall an improvement. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18887>	2022-12-09 14:59:16 +00:00
Konstantin Seurer	36125598c8	nir: Add intrinsics for hit attribute io Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19866>	2022-12-09 07:07:10 +00:00
Konstantin Seurer	5bfc4c293f	nir/split_vars: Handle ray hit attributes Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19866>	2022-12-09 07:07:10 +00:00
Timothy Arceri	8d10a6835f	glsl: dont create temps for builtin function inputs It's not valid to be copying input variables to temps when inlining atomic memory, interpolateAt functions, etc. We got away with this previously because tree grafting would clean up the mess but we shouldn't depend on an optimisation to clean up invalid IR. Also I hope to remove tree grafting in a follow up merge request. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19890>	2022-12-08 05:22:27 +00:00
Timothy Arceri	7b9ec592aa	glsl: use ir_rvalue_visitor for function inlining This allows us to drop some duplicate code that is already in the ir_rvalue_visitor. It also allows us to better replace rvalues and handle swizzle in the following patch without having to add even more duplicate code. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19890>	2022-12-08 05:22:27 +00:00
Mihai Preda	613e9b8e7a	nir: fix digit order in print_bitset() Also fix the leading curly for the new function definitions. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	0320dbaff5	nir: print shader_info bools with the value Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	da2d36a9d5	nir: print shader_info inputs/outputs as bit ranges e.g. inputs_read: 15-17 outputs_written: 0,32 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	e9f3f80b1d	nir: print_shader_info(): brief output Make the shader_info printing less verbose by skipping the fields that are likely not used (being zero). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	814ba7d13d	nir: print_shader_info: print stage-specific shader info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	37b7233c15	nir: print_shader_info() print bitsets Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	4ed85c16f9	nir: print more in print_shader_info() Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
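
The `da2d36a9d5` entry above describes printing shader_info input/output masks as bit ranges, with the examples `inputs_read: 15-17` and `outputs_written: 0,32`. A minimal sketch of that formatting idea follows. It is not Mesa's implementation: the function name `format_bit_ranges` is hypothetical, and the choice to print any run of two or more adjacent bits as `start-end` is an assumption.

```python
def format_bit_ranges(mask: int) -> str:
    """Render the set bits of `mask` as comma-separated indices and
    inclusive ranges (hypothetical helper, not a Mesa function)."""
    parts = []
    bit = 0
    # Stop once no set bits remain at or above the current position.
    while mask >> bit:
        if (mask >> bit) & 1:
            start = bit
            # Extend the run while the next higher bit is also set.
            while (mask >> (bit + 1)) & 1:
                bit += 1
            # Assumption: a single bit prints alone, a run prints as start-end.
            parts.append(str(start) if start == bit else f"{start}-{bit}")
        bit += 1
    return ",".join(parts)


# The two examples quoted in the commit message:
print("inputs_read:", format_bit_ranges(0b111 << 15))
print("outputs_written:", format_bit_ranges((1 << 0) | (1 << 32)))
```

Under these assumptions the two calls reproduce the `15-17` and `0,32` strings from the commit message, and an empty mask yields an empty string.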

7520 commits