fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 19:18:11 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	0afd691f29	panfrost: clang-format the tree This switches us over to Mesa's code style [1], normalizing us within the tree. The results aren't perfect, but they bring us a hell of a lot closer to the rest of the tree. Panfrost doesn't feel so foreign relative to Mesa with this, which I think (in retrospect after a bunch of years of being "different") is the right call. I skipped PanVK because that's paused right now. find panfrost/ -type f -name '.h' \| grep -v vulkan \| xargs clang-format -i; find panfrost/ -type f -name '.c' \| grep -v vulkan \| xargs clang-format -i; clang-format -i gallium/drivers/panfrost/.c gallium/drivers/panfrost/.h ; find panfrost/ -type f -name '*.cpp' \| grep -v vulkan \| xargs clang-format -i [1] https://docs.mesa3d.org/codingstyle.html Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	a4705afe63	panfrost: Fix up some formatting for clang-format clang-format will make a mess of these otherwise. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	e35719be6f	panfrost: Add missing #includes Found shuffling headers with clang format. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	8dd35e0ac7	pan/mdg: Remove unused disassembler functions Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20420>	2022-12-23 16:27:16 +00:00
Ian Romanick	eb76cee9f8	nir: Eliminate nir_op_i2b There are a lot of optimizations in opt_algebraic that match ('ine', a, 0), but there are almost none that match i2b. Instead of adding a huge pile of additional patterns (including variations that include both ine and i2b), always lower i2b to a != 0. At this point in the series, it should be impossible for anything to generate i2b, so there /should not/ be any changes. The failing test on d3d12 is a pre-existing bug that is triggered by this change. I talked to Jesse about it, and, after some analysis, he suggested just adding it to the list of known failures. v2: Don't rematerialize i2b instructions in dxil_nir_lower_x2b. v3: Don't rematerialize i2b instructions in zink_nir_algebraic.py. v4: Fix zink-on-TGL CI failures by calling nir_opt_algebraic after nir_lower_doubles makes progress. The latter can generate b2i instructions, but nir_lower_int64 can't handle them (anymore). v5: Add back most of the hunk at line 2125 of nir_opt_algebraic.py. I had accidentally removed the f2b(bf2(x)) optimization. v6: Just eliminate the i2b instruction. v7: Remove missed i2b32 in midgard_compile.c. Remove (now unused) emit_alu_i2orf2_b1 function from sfn_instr_alu.cpp. Previously this function was still used. 🤷 No shader-db changes on any Intel platform. All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141165875 -> 141165873 (-0.0%) Instructions helped: 2 Cycles in all programs: 9098956382 -> 9098956350 (-0.0%) Cycles helped: 2 The two Vulkan shaders are helped because of the "new" (('b2i32', ('ine', ('ubfe', a, b, 1), 0)), ('ubfe', a, b, 1)) algebraic pattern. Acked-by: Jesse Natalie <jenatali@microsoft.com> [earlier version] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> [earlier version] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Alyssa Rosenzweig	976405907e	pan/mdg: Emulate 8-bit with the 16-bit pipe We don't care to support i8vec16, we just need a bit of 8-bit support to implement format packing/unpacking in blend shaders. We're already doing this by using the 16-bit pipe, we just need to commit to it all the way -- reporting the correct sizes in max_bitsize_for_alu so the mask packing logic works as intended -- and dropping the imov-specific hack that was introduced to workaround a similar class of bugs. With the previous patch, fixes: dEQP-GLES31.functional.draw_buffers_indexed.random.max_required_draw_buffers.1 Fixes: `39e4b7279d` ("pan/midg: Fix swizzling on 8-bit sources") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19763>	2022-12-01 00:52:53 +00:00
Alyssa Rosenzweig	261d48fc9b	pan/mdg: Refuse to schedule CSEL.vector to SMUL Even if we only mask a single component from the result of CSEL.vector, in our IR we treat its semantics as vector which causes trouble with when scheduled to a scalar unit. The problematic bundle looks like this: vmul.MOV.i32 R31, TMP0.xxxx, R0.yzww sadd.MAX.i32 TMP0.y, R0.y, #65408 smul.CSEL.vector.i32 R0.y, TMP0.y, #127 As the comment in midgard.h illuminates, these CSEL instructions are actually operating per-bit, lining up with the all-1's booleans in Midgard. The Bifrost analogue is MUX.i32.bit, not CSEL.i32. We should probably rename the Midgard instruction to make that clear. Anyhoo, on the scalar unit, CSEL/MUX operates on the bottom 32-bits of its source. That's ok for the usual r31.w case, because that's secretly replicating to its nonexistent register, I think? But that doesn't work with the CSEL.vector (MUX.vector) form, because the condition it's actually muxing on is r31.x, which here is R0.y, not the intended R0.x. Rather than adding more special cases to the already overcomplicated scheduler (for the dubious benefit of avoiding a small shaderdb regression), just avoid scheduling CSEL.vector to smul. With the next patch, fixes: dEQP-GLES31.functional.draw_buffers_indexed.random.max_required_draw_buffers.1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19763>	2022-12-01 00:52:53 +00:00
Alyssa Rosenzweig	044428211c	pan/mdg: Fix out-of-order execution We can go up to 15 instructions out of order (performance fix) but we can't go past a branch (bug fix). Fixes: `30a393f458` ("pan/mdg: Enable out-of-order execution after texture ops") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19762>	2022-11-23 20:23:50 +00:00
Yonggang Luo	40a9fc57aa	tree-wide: Use __func__ instead of __FUNCTION__ in non-gallium code Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19861>	2022-11-22 06:53:46 +00:00
M Henning	f3ee9be836	glsl: Drop borrow/carry lowerings in favor of nir Unconditionally lowering prevents GL drivers from natively implementing these ops. Drivers that need lowering should set lower_uadd_carry and lower_usub_borrow on nir_shader_compiler_options to get the nir lowerings. Tested with dEQP-GLES31.functional.shaders.builtin_functions.integer.* Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19704>	2022-11-15 21:51:04 +00:00
Alyssa Rosenzweig	2316b80d77	panfrost: Don't use nir_variable to link varyings NIR deemphasizes nir_variable. We want to transition off it. Instead of walking the list of variables and playing games with the GLSL types to collect varying information, walk the list of instructions and use the I/O semantics to collect similar information. In addition to avoiding the reliance on nir_variable, this fixes handling of struct varyings under certain circumstances. Such programs are compiled by the GLES3.1 CTS but not used, so without this fix, the affected tests would regress when precompiling. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19363>	2022-11-02 16:52:11 +00:00
Alyssa Rosenzweig	d0281fc16a	pan/mdg: Use bifrost_nir_lower_store_component Move the pass from the Bifrost compiler to the Midgard/Bifrost common code directory, and take advantage of it on Midgard, where it fixes the same tests as it fixed originally on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19363>	2022-11-02 16:52:11 +00:00
Alyssa Rosenzweig	17589be72b	pan/mdg: Use .u32 for flat shading This is simple and matches what we do on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19363>	2022-11-02 16:52:11 +00:00
Alyssa Rosenzweig	225a8f6e27	pan/mdg: Don't pair ST_VARY.a32 with other instrs For some reason, LD_ATTR/ST_VARY.a32 bundles raise INSTR_INVALID_ENC, at least on Mali-T860. Don't construct such pairs. This is a blunt hack but I don't know where this curveball requirement is coming from and this unblocks the rest of this series. total instructions in shared programs: 99879 -> 99788 (-0.09%) instructions in affected programs: 3179 -> 3088 (-2.86%) helped: 49 HURT: 9 helped stats (abs) min: 1.0 max: 6.0 x̄: 2.04 x̃: 2 helped stats (rel) min: 0.93% max: 10.53% x̄: 5.46% x̃: 4.88% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.61% max: 2.13% x̄: 1.41% x̃: 1.14% 95% mean confidence interval for instructions value: -1.93 -1.20 95% mean confidence interval for instructions %-change: -5.37% -3.41% Instructions are helped. total bundles in shared programs: 43778 -> 45102 (3.02%) bundles in affected programs: 10737 -> 12061 (12.33%) helped: 10 HURT: 369 helped stats (abs) min: 1.0 max: 3.0 x̄: 1.50 x̃: 1 helped stats (rel) min: 2.90% max: 18.75% x̄: 6.93% x̃: 5.21% HURT stats (abs) min: 1.0 max: 10.0 x̄: 3.63 x̃: 4 HURT stats (rel) min: 0.82% max: 44.44% x̄: 15.27% x̃: 13.33% 95% mean confidence interval for bundles value: 3.29 3.69 95% mean confidence interval for bundles %-change: 13.68% 15.69% Bundles are HURT. total quadwords in shared programs: 76783 -> 77914 (1.47%) quadwords in affected programs: 18633 -> 19764 (6.07%) helped: 9 HURT: 370 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.22 x̃: 1 helped stats (rel) min: 0.87% max: 8.33% x̄: 3.71% x̃: 3.85% HURT stats (abs) min: 1.0 max: 7.0 x̄: 3.09 x̃: 3 HURT stats (rel) min: 0.82% max: 35.00% x̄: 7.82% x̃: 6.11% 95% mean confidence interval for quadwords value: 2.82 3.15 95% mean confidence interval for quadwords %-change: 7.02% 8.06% Quadwords are HURT. total registers in shared programs: 7266 -> 7076 (-2.61%) registers in affected programs: 1224 -> 1034 (-15.52%) helped: 171 HURT: 25 helped stats (abs) min: 1.0 max: 3.0 x̄: 1.27 x̃: 1 helped stats (rel) min: 8.33% max: 50.00% x̄: 21.85% x̃: 20.00% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.12 x̃: 1 HURT stats (rel) min: 10.00% max: 100.00% x̄: 35.73% x̃: 33.33% 95% mean confidence interval for registers value: -1.10 -0.84 95% mean confidence interval for registers %-change: -17.69% -11.32% Registers are helped. total threads in shared programs: 4956 -> 5019 (1.27%) threads in affected programs: 99 -> 162 (63.64%) helped: 43 HURT: 6 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.74 x̃: 2 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.91 1.66 95% mean confidence interval for threads %-change: 67.36% 95.90% Threads are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19363>	2022-11-02 16:52:11 +00:00
Alyssa Rosenzweig	e04156b42a	pan/mdg: Disassemble the .a32 bit Corresponds to .auto32 on Bifrost. This is helpful for a conformant implementation of flat shading. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19363>	2022-11-02 16:52:11 +00:00
Alyssa Rosenzweig	2a6338722e	panfrost: Don't use nir_variable in the compilers More future proof, simpler, and works with early I/O lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19456>	2022-11-02 04:22:06 +00:00
Alyssa Rosenzweig	78785f3b18	pan/mdg: Don't schedule across memory barrier Fixes KHR-GLES31.core.shader_image_load_store.basic-glsl-misc-cs Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19238>	2022-10-27 20:13:11 +00:00
Alyssa Rosenzweig	0955fe8fe2	panfrost: Use compute-based XFB on Midgard Now we're back to a single XFB implementation for all gens. Fixes: KHR-GLES31.core.draw_indirect.advanced-twoPasses-transformFeedback-arrays KHR-GLES31.core.draw_indirect.advanced-twoPasses-transformFeedback-elements Cc: mesa-stable Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19238>	2022-10-27 20:13:11 +00:00
Alyssa Rosenzweig	9e2ce225e6	pan/mdg: Fix 64-bit address arithmetic Cc: mesa-stable Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19238>	2022-10-27 20:13:11 +00:00
Alyssa Rosenzweig	21a4dbb720	panfrost: Don't use lower_wpos_pntc on Midgard gl_PointCoord is implemented via a special attribute descriptor on Midgard. This descriptor has an orientation bit, the orientation is driver-controlled. That means we can map rast->sprite_coord_mode to this bit, rather than lowering in the shader. This is a bug fix for point sprites, which are implemented natively on Midgard for dubious reasons and need to be flipped this way. It is also an optimization for apps reading gl_PointCoord, removing the extra arithmetic to flip, although the value of this is somewhat dubious. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19237>	2022-10-26 01:56:08 +00:00
Alyssa Rosenzweig	829f769e60	pan/mdg: Fix 16-bit alignment with spiller The loop over sources has to happen for every instruction, regardless of whether we also need to register allocate the destination. The other source loops handle this properly, but this one was missed. Fixes spilling failure in shaders/android/angle/aztec_ruins/16.shader_test when the input NIR is shuffled a bit (from reordering passes). Fixes: `129d390bd8` ("pan/mdg: Fix bound setting in RA for sources") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19093>	2022-10-17 19:11:10 +00:00
Alyssa Rosenzweig	2c446b6636	pan/mdg: Limit work registers for large workgroups When more than 8 registers are used, Midgard can only fit 64 threads in a thread group. For barriers to work properly, a threadgroup must fit an entire work group. The GL driver configures the hardware to have threadgroups the size of work groups. That means if more than 64 threads are used in a workgroup, and more than 8 registers are used, the hardware will fault spawning threads. To workaround this hardware limitation, we need to limit the number of work registers used depending on the size of the workgroup. Typically, the work group size is known at compile-time so that determination can usually be made without variants. To avoid variants, we make a pessimistic estimate in the case when it's not known at compile-time. shader-db shows 6 shaders affected. I expect that all of these would fault with DATA_INVALID_FAULT if they tried to execute before this patch, due to the oversize local size, and faulting is even slower than spilling ;-) Fixes dEQP-GLES31.functional.synchronization.* on Mali-T860. instructions HURT: shaders/android/gfxbench/carchase/6.shader_test MESA_SHADER_COMPUTE: 121 -> 157 (29.75%) instructions HURT: shaders/android/gfxbench/carchase/386.shader_test MESA_SHADER_COMPUTE: 121 -> 157 (29.75%) instructions HURT: shaders/android/gfxbench/carchase/374.shader_test MESA_SHADER_COMPUTE: 141 -> 184 (30.50%) instructions HURT: shaders/android/gfxbench/carchase/4-1.shader_test MESA_SHADER_COMPUTE: 141 -> 184 (30.50%) instructions HURT: shaders/android/com.miHoYo.GenshinImpact/18.shader_test MESA_SHADER_COMPUTE: 513 -> 933 (81.87%) instructions HURT: shaders/android/com.miHoYo.GenshinImpact/16.shader_test MESA_SHADER_COMPUTE: 505 -> 1002 (98.42%) bundles HURT: shaders/android/gfxbench/carchase/374.shader_test MESA_SHADER_COMPUTE: 73 -> 116 (58.90%) bundles HURT: shaders/android/gfxbench/carchase/4-1.shader_test MESA_SHADER_COMPUTE: 73 -> 116 (58.90%) bundles HURT: shaders/android/gfxbench/carchase/6.shader_test MESA_SHADER_COMPUTE: 61 -> 97 (59.02%) bundles HURT: shaders/android/gfxbench/carchase/386.shader_test MESA_SHADER_COMPUTE: 61 -> 97 (59.02%) bundles HURT: shaders/android/com.miHoYo.GenshinImpact/18.shader_test MESA_SHADER_COMPUTE: 281 -> 701 (149.47%) bundles HURT: shaders/android/com.miHoYo.GenshinImpact/16.shader_test MESA_SHADER_COMPUTE: 278 -> 775 (178.78%) registers helped: shaders/android/gfxbench/carchase/374.shader_test MESA_SHADER_COMPUTE: 11 -> 8 (-27.27%) registers helped: shaders/android/gfxbench/carchase/4-1.shader_test MESA_SHADER_COMPUTE: 11 -> 8 (-27.27%) registers helped: shaders/android/gfxbench/carchase/6.shader_test MESA_SHADER_COMPUTE: 14 -> 8 (-42.86%) registers helped: shaders/android/gfxbench/carchase/386.shader_test MESA_SHADER_COMPUTE: 14 -> 8 (-42.86%) registers helped: shaders/android/com.miHoYo.GenshinImpact/16.shader_test MESA_SHADER_COMPUTE: 16 -> 8 (-50.00%) registers helped: shaders/android/com.miHoYo.GenshinImpact/18.shader_test MESA_SHADER_COMPUTE: 16 -> 8 (-50.00%) threads helped: shaders/android/gfxbench/carchase/6.shader_test MESA_SHADER_COMPUTE: 1 -> 2 (100.00%) threads helped: shaders/android/gfxbench/carchase/386.shader_test MESA_SHADER_COMPUTE: 1 -> 2 (100.00%) threads helped: shaders/android/gfxbench/carchase/374.shader_test MESA_SHADER_COMPUTE: 1 -> 2 (100.00%) threads helped: shaders/android/gfxbench/carchase/4-1.shader_test MESA_SHADER_COMPUTE: 1 -> 2 (100.00%) threads helped: shaders/android/com.miHoYo.GenshinImpact/16.shader_test MESA_SHADER_COMPUTE: 1 -> 2 (100.00%) threads helped: shaders/android/com.miHoYo.GenshinImpact/18.shader_test MESA_SHADER_COMPUTE: 1 -> 2 (100.00%) spills HURT: shaders/android/gfxbench/carchase/374.shader_test MESA_SHADER_COMPUTE: 0 -> 5 spills HURT: shaders/android/gfxbench/carchase/4-1.shader_test MESA_SHADER_COMPUTE: 0 -> 5 spills HURT: shaders/android/gfxbench/carchase/6.shader_test MESA_SHADER_COMPUTE: 0 -> 8 spills HURT: shaders/android/gfxbench/carchase/386.shader_test MESA_SHADER_COMPUTE: 0 -> 8 spills HURT: shaders/android/com.miHoYo.GenshinImpact/18.shader_test MESA_SHADER_COMPUTE: 0 -> 112 spills HURT: shaders/android/com.miHoYo.GenshinImpact/16.shader_test MESA_SHADER_COMPUTE: 0 -> 146 fills HURT: shaders/android/gfxbench/carchase/6.shader_test MESA_SHADER_COMPUTE: 0 -> 26 fills HURT: shaders/android/gfxbench/carchase/386.shader_test MESA_SHADER_COMPUTE: 0 -> 26 fills HURT: shaders/android/gfxbench/carchase/374.shader_test MESA_SHADER_COMPUTE: 0 -> 33 fills HURT: shaders/android/gfxbench/carchase/4-1.shader_test MESA_SHADER_COMPUTE: 0 -> 33 fills HURT: shaders/android/com.miHoYo.GenshinImpact/18.shader_test MESA_SHADER_COMPUTE: 0 -> 209 fills HURT: shaders/android/com.miHoYo.GenshinImpact/16.shader_test MESA_SHADER_COMPUTE: 0 -> 234 total instructions in shared programs: 1521691 -> 1522766 (0.07%) instructions in affected programs: 1542 -> 2617 (69.71%) helped: 0 HURT: 6 HURT stats (abs) min: 36.0 max: 497.0 x̄: 179.17 x̃: 43 HURT stats (rel) min: 29.75% max: 98.42% x̄: 50.13% x̃: 30.50% 95% mean confidence interval for instructions value: -49.36 407.69 95% mean confidence interval for instructions %-change: 17.14% 83.12% Inconclusive result (value mean confidence interval includes 0). total bundles in shared programs: 649296 -> 650371 (0.17%) bundles in affected programs: 827 -> 1902 (129.99%) helped: 0 HURT: 6 HURT stats (abs) min: 36.0 max: 497.0 x̄: 179.17 x̃: 43 HURT stats (rel) min: 58.90% max: 178.78% x̄: 94.01% x̃: 59.02% 95% mean confidence interval for bundles value: -49.36 407.69 95% mean confidence interval for bundles %-change: 36.20% 151.83% Inconclusive result (value mean confidence interval includes 0). total registers in shared programs: 90681 -> 90647 (-0.04%) registers in affected programs: 82 -> 48 (-41.46%) helped: 6 HURT: 0 helped stats (abs) min: 3.0 max: 8.0 x̄: 5.67 x̃: 6 helped stats (rel) min: 27.27% max: 50.00% x̄: 40.04% x̃: 42.86% 95% mean confidence interval for registers value: -8.03 -3.30 95% mean confidence interval for registers %-change: -50.95% -29.13% Registers are helped. total threads in shared programs: 55717 -> 55723 (0.01%) threads in affected programs: 6 -> 12 (100.00%) helped: 6 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% 95% mean confidence interval for threads value: 1.00 1.00 95% mean confidence interval for threads %-change: 100.00% 100.00% Threads are helped. total spills in shared programs: 1108 -> 1392 (25.63%) spills in affected programs: 0 -> 284 helped: 0 HURT: 6 total fills in shared programs: 4721 -> 5282 (11.88%) fills in affected programs: 0 -> 561 helped: 0 HURT: 6 Cc: mesa-stable Closes: #7228 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19092>	2022-10-17 18:56:13 +00:00
Alyssa Rosenzweig	847361ba07	panfrost: Remove load_kernel_input path Now the state tracker's responsible to lower away for us (and the state tracker can do it correctly, our implementation is incorrect with a strict reading of the Gallium contract). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18658>	2022-10-05 16:09:21 +00:00
Alyssa Rosenzweig	e55b60d0bb	panfrost: Route shader-db to debug, not stderr This brings us in line with the rest of Mesa, fixing multithreaded shader-db reports and the Total CPU Time report at the end. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18351>	2022-09-01 14:50:24 +00:00
Alyssa Rosenzweig	6fed616187	pan/mdg: Print 3 sources for CSEL The third source exists logically but not architecturally. We still need to print it. Caught by the assertion. Fixes: `0ee24c46e0` ("pan/mdg: Only print 2 sources for ALU") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18338>	2022-08-31 14:07:53 +00:00
Alyssa Rosenzweig	4fe755e803	pan/mdg: Always write return address to r1.w This might not be optimal but it matches our current behaviour and is much more justified than the "accidental" code before. Caught by the gcc warning: ../src/panfrost/midgard/midgard_schedule.c:1227:48: warning: the comparison will always evaluate as ‘true’ for the address of ‘writeout_branch’ will never be NULL [-Waddress] Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18338>	2022-08-31 14:07:53 +00:00
Rhys Perry	69ba1c4d59	nir: adjust nir_src_copy signature to take a nir_instr * This is almost always a nir_instr and updating the src of a nir_if will have to work slightly differently in the future. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12910>	2022-08-30 18:21:44 +00:00
Rhys Perry	aa2d6e020b	Revert "nir: Drop the unused instr arg for src/dest copy functions." This reverts commit `c3a0184118`. Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12910>	2022-08-30 18:21:44 +00:00
Alyssa Rosenzweig	5777f99fc5	pan/mdg: Use correct idiv lowering Rip off the bandaid. We can't tolerate straight-up wrong results. We have an efficient umul_high implementation so it's not so bad. total instructions in shared programs: 1537404 -> 1537204 (-0.01%) instructions in affected programs: 143299 -> 143099 (-0.14%) helped: 89 HURT: 283 helped stats (abs) min: 1.0 max: 41.0 x̄: 5.87 x̃: 6 helped stats (rel) min: 0.39% max: 6.67% x̄: 1.41% x̃: 1.44% HURT stats (abs) min: 1.0 max: 7.0 x̄: 1.14 x̃: 1 HURT stats (rel) min: 0.24% max: 5.71% x̄: 0.35% x̃: 0.27% 95% mean confidence interval for instructions value: -0.96 -0.12 95% mean confidence interval for instructions %-change: -0.17% 0.03% Inconclusive result (%-change mean confidence interval includes 0). total bundles in shared programs: 647521 -> 648154 (0.10%) bundles in affected programs: 45833 -> 46466 (1.38%) helped: 92 HURT: 228 helped stats (abs) min: 1.0 max: 13.0 x̄: 3.10 x̃: 3 helped stats (rel) min: 0.69% max: 7.14% x̄: 2.11% x̃: 1.99% HURT stats (abs) min: 1.0 max: 7.0 x̄: 4.03 x̃: 5 HURT stats (rel) min: 0.59% max: 7.22% x̄: 2.93% x̃: 3.40% 95% mean confidence interval for bundles value: 1.58 2.38 95% mean confidence interval for bundles %-change: 1.21% 1.76% Bundles are HURT. total quadwords in shared programs: 1135141 -> 1138268 (0.28%) quadwords in affected programs: 101064 -> 104191 (3.09%) helped: 30 HURT: 342 helped stats (abs) min: 1.0 max: 30.0 x̄: 4.97 x̃: 3 helped stats (rel) min: 0.24% max: 5.99% x̄: 1.72% x̃: 1.06% HURT stats (abs) min: 1.0 max: 16.0 x̄: 9.58 x̃: 10 HURT stats (rel) min: 0.73% max: 17.14% x̄: 3.64% x̃: 3.80% 95% mean confidence interval for quadwords value: 7.84 8.97 95% mean confidence interval for quadwords %-change: 2.99% 3.43% Quadwords are HURT. total registers in shared programs: 91938 -> 92265 (0.36%) registers in affected programs: 2639 -> 2966 (12.39%) helped: 0 HURT: 280 HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.17 x̃: 1 HURT stats (rel) min: 9.09% max: 50.00% x̄: 12.75% x̃: 11.11% 95% mean confidence interval for registers value: 1.12 1.22 95% mean confidence interval for registers %-change: 12.05% 13.45% Registers are HURT. total threads in shared programs: 55280 -> 55268 (-0.02%) threads in affected programs: 24 -> 12 (-50.00%) helped: 0 HURT: 11 HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.09 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: -1.29 -0.89 95% mean confidence interval for threads %-change: -50.00% -50.00% Threads are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17860>	2022-08-24 19:54:23 +00:00
Alyssa Rosenzweig	5bc830cbf2	pan/mdg: Reexpress umul_high packing There are a bunch of subtle details of how 32-bit sources are zero-extended to 64-bit, how their swizzles work, how 64-bit destinations are shrunk to 32-bit, and how those two interact. This fixes the interactions... mostly. Fixes umul_high, all such tests should be passing now. Unblocks idiv lowering that depends on umul_high. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17860>	2022-08-24 19:54:23 +00:00
Alyssa Rosenzweig	7b78e05ba8	pan/mdg: Replicate swizzles for scalar sources This works around issue packing 32-bit scalar swizzles zero-extended to 64-bit, seen with the umul_high implementation. I tried for a while figuring out the root cause (even rewrote a big chunk of disassembler) but am still a bit lost. Nevertheless this is a safe workaround with no performance impact (and avoids relying on NIR undefined behaviour to implement GPU undefined behaviour), so let's do this for now to fix umul_high. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17860>	2022-08-24 19:54:23 +00:00
Alyssa Rosenzweig	d7e6174c2b	pan/mdg: Remove disassembler stats They're now unused and they were never especially useful. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18094>	2022-08-17 17:25:56 +00:00
Konstantin Kharlamov	91362340f3	meson: remove source_root() call in nir compiler path source_root function is deprecated in Meson version 0.56.0, so let's use instead a current_source_dir() function, available in all Meson versions. This also allows to deduplicate some code by declaring commonly used string at the top meson.build file. Signed-off-by: Konstantin Kharlamov <Hi-Angel@yandex.ru> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17974>	2022-08-12 13:11:03 +00:00
Alyssa Rosenzweig	e596a0423b	pan/mdg: Print outmods when printing IR In particular, this lets us distinguish mul_high from regular mul. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16798>	2022-06-01 14:24:10 -04:00
Alyssa Rosenzweig	a099834b97	pan/mdg: Distinguish SSA vs reg when printing IR This makes it easy to match the printed IR with the indices in the NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16798>	2022-06-01 14:24:10 -04:00
Alyssa Rosenzweig	520204ae18	pan/mdg: Only print 1 source for moves This makes the printed IR easier to read at a glance. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16798>	2022-06-01 14:24:10 -04:00
Alyssa Rosenzweig	0ee24c46e0	pan/mdg: Only print 2 sources for ALU ..and assert the other sources are null. The one place this might fail in the future is for real FMA, but we don't support that for GL. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16798>	2022-06-01 14:24:10 -04:00
Alyssa Rosenzweig	9c9db27e3c	pan/mdg: Only print masked components of swizzle This matches the IR printer with the disassembler, making the output of the IR printer much easier to parse at a glance. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16798>	2022-06-01 14:24:10 -04:00
Alyssa Rosenzweig	c9093554d0	pan/mdg: Use "<<" instead of "lsl" Easier to read and consistent with C code. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16798>	2022-06-01 14:24:10 -04:00
Alyssa Rosenzweig	8c11f4809b	pan/mdg: Remove uppercase write masks These do not convey any additional information, and fail to account for shrinking. In particular, a 64-bit writemask with .keephi would fail to disassemble and instead trip the assertion, since that would be the ZW components. Just delete the broken code. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16798>	2022-06-01 14:24:10 -04:00
Alyssa Rosenzweig	9e4b457958	pan/mdg: Scalarize with 64-bit sources Otherwise, we can get vec3 with u2u32 with 64-bit sources which we need lowered. Since our current approach is "scalarize all 64-bit ops", we need to check for conversions too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16798>	2022-06-01 14:24:05 -04:00
Daniel Schürmann	bd151a256e	nir/opt_vectorize: add callback for max vectorization width The callback allows to request different vectorization factors per instruction depending on e.g. bitsize or opcode. This patch also removes using the vectorize_vec2_16bit option from nir_opt_vectorize(). Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13080>	2022-06-01 11:41:44 +00:00
Emma Anholt	7ae206d76e	panfrost: always print the bad ALU op if we're failing to translate. CI failure could have told me what needed fixing, but no... Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16437>	2022-06-01 10:56:35 +00:00
Emma Anholt	7472bb4bad	glsl,nir: Move i/umulExtended lowering to NIR. NIR already has the necessary lowering, and the GLSL lowering violates GLSL IR validation rules. Once quadop lowering was turned off, the IR validation at the end of the compile path on DEBUG builds caught the problem. In order to move the lowering to NIR, though, we need to make sure that drivers supporting these functions actually have the lowering flag set. xfails added for t860, where apparently this tickles a variety of existing 64-bit bugs in the backend. Fixes: #6461 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16437>	2022-06-01 10:56:35 +00:00
Icecream95	0a53ebabcd	pan/mdg: Read base for combined stores Fixes depth/stencil writes with MRT. Fixes: `b3d7272753` ("pan/mdg: Don't read base for combined stores") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16685>	2022-05-24 16:13:33 +00:00
Timothy Arceri	d7a071a28f	gallium/drivers: set force_indirect_unrolling_sampler for all required drivers This is set to true for all drivers that have a GLSL level of support lower than 4.00. This matches the rule for setting the GLSL IR option EmitNoIndirectSampler. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16543>	2022-05-17 02:12:21 +00:00
Jason Ekstrand	f0a47d8602	bifrost,midgard: Allow providing a fixed sysval layout Vulkan doesn't need nearly as many system values and would like to bake its layout up-front instead of having it provided by the back-end compiler. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16276>	2022-05-12 10:53:16 +00:00
Jason Ekstrand	4e60f0655a	panfrost,panvk: Make fixed_sysval_ubo < 0 mean compiler-assigned In `3559efb9bf` ("panfrost: Allow passing an explicit UBO index for the sysval UBO"), an explicit UBO index was added and it was implicitly assumed that it would be > num_ubos. This was convenient because it meant 0, the default for designated initializers, implicitly meant compiler-assigned. However, we're about to move the sysval UBO to 0 which breaks this assumption. Also, we don't want the back-end compiler to even look at num_ubos since it's meaningless in Vulkan. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16276>	2022-05-12 10:53:15 +00:00
Icecream95	c65afe541b	pan/mdg: Fix multiple spilt writes in the same bundle If two instructions in a single bundle both write to a spilt destination, then we need to reuse the fill and spill instructions, otherwise the value will be overwritten. This and the rest of this set of Midgard bug fixes were found from a vertex shader in Firefox WebRender that is used when a video is clipped, for example by setting the border-radius CSS property. CC: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16382>	2022-05-10 13:16:50 +00:00
Icecream95	7b9c976c2d	pan/mdg: Return the instruction from mir_insert_instruction_*_scheduled We can't return a pointer to the bundle itself because it might move about in memory. CC: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16382>	2022-05-10 13:16:50 +00:00

1 2 3 4 5 ...

1031 commits