fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 04:48:07 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	22ab505a3d	agx: Augment if/else/while_cmp with a target Add an optional pointer to a target block for these instructions. This does NOT act like a logical branch, and does NOT get added to the logical control flow. It is ignored wholesale until after RA, when physical edges may be inserted by a pass we add later in this series. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io>	2023-10-01 12:32:11 -04:00
Alyssa Rosenzweig	7895d5b79c	agx: Add unit test for cmp+sel fusing Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25052>	2023-09-05 18:50:34 +00:00
Alyssa Rosenzweig	bdad7992bc	agx: Add unit test for if_cmp fusing Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25052>	2023-09-05 18:50:34 +00:00
Alyssa Rosenzweig	d459de85b7	agx: Optimize swaps of 2x16 channels We can use extr to swap the low and high halves of a 32-bit register in one instruction. No shader-db changes, but it reduces xor's on a deqp I'm looking at. Yes, I'm procrastinating on debugging deqps, how'd you guess? Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Alyssa Rosenzweig	afa38c7d4f	agx: Vectorize 16-bit parallel copies If we have two 16-bit copies to/from adjacent 16-bit registers, we can instead use a single 32-bit copy from the 32-bit register pair. Since 32-bit integer arithmetic is (almost) as efficient as 16-bit on AGX, this (almost) doubles performance of affected parallel copies. total instructions in shared programs: 1788606 -> 1788301 (-0.02%) instructions in affected programs: 17057 -> 16752 (-1.79%) helped: 150 HURT: 0 Instructions are helped. total bytes in shared programs: 12196492 -> 12194662 (-0.02%) bytes in affected programs: 122894 -> 121064 (-1.49%) helped: 150 HURT: 0 Bytes are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	3a0d1f83d5	agx: Stop bit-inexact conversion propagation Despite being mathematically equivalent, the following code sequences are not bit-identical under IEEE 754 rules due to differing internal precision: fadd16 r0l, r2, 0.0 z = f2f16 x fadd16 r1h, r0l, r0h w = fadd z, y versus fadd32 r1h, r2, r0h f2f16(w) = fadd x, f2f32(y) This is probably fine under GL's relaxed floating point precision rules, but it's definitely not ok with the more strict OpenCL or Vulkan. It also is a potential problem with GL invariance rules, if we get different results for the same shader depending on whether we did a monolithic compile or a fast link. The place for doing inexact transformations is NIR, when we have the information available to do so correctly. By the time we get to the backend, everything we do needs to be bit-exact to preserve sanity. Fixes dEQP-GLES2.functional.shaders.algorithm.rgb_to_hsl_vertex. We believe that this is a CTS bug, but it's a useful one since it uncovered a serious driver bug that would bite us in the much less friendly Vulkan (or god forbid OpenCL) CTS later. It also seems like a magnet for GL app bugs, the fp16 support we do now is uncovering bad enough bugs as it is. shader-db results are pretty abysmal, though :| total instructions in shared programs: 1537964 -> 1571328 (2.17%) instructions in affected programs: 670231 -> 703595 (4.98%) total bytes in shared programs: 10533984 -> 10732316 (1.88%) bytes in affected programs: 4662414 -> 4860746 (4.25%) total halfregs in shared programs: 483448 -> 474541 (-1.84%) halfregs in affected programs: 58867 -> 49960 (-15.13%) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23480>	2023-06-07 03:21:49 +00:00
Konstantin Seurer	13c9b490a7	asahi: Reformat using the new style Now, that the foreach macro list is complete (I hope), let's reformat drivers that enforce correct formatting in CI. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23275>	2023-05-29 21:06:12 +00:00
Alyssa Rosenzweig	0f974d1f90	asahi: Convert to SPDX headers Also drop my email address in the copyright lines and fix some "Copyright 208 Alyssa Rosenzweig" lines, I'm not that old. Together this drops a lot of boilerplate without losing any meaningful licensing information. SPDX is already in use for the MIT-licensed code in turnip, venus, and a few other scattered parts of the tree, so this should be ok from a Mesa licensing standpoint. This reduces friction to create new files, by parsing the copy/paste boilerplate and being short enough you can easily type it out if you want. It makes new files seem less daunting: 20 lines of header for 30 lines of code is discouraging, but 2 lines of header for 30 lines of code is reasonable for a simple compiler pass. This has technical effects, as lowering the barrier to making new files should encourage people to split code into more modular files with (hopefully positive) effects on project compile time. This helps with consistency between files. Across the tree we have at least a half dozen variants of the MIT license text (probably more), plus code that uses SPDX headers instead. I've already been using SPDX headers in Asahi manually, so you can tell old vs new code based on the headers. Finally, it means less for reviewers to scroll through adding files. Minimal actual cognitive burden for reviewers thanks to banner blindness, but the big headers still bloat diffs that add/delete files. I originally proposed this in December (for much more of the tree) but someone requested I wait until January to discuss. I've been trying to get in touch with them since then. It is now almost April and, with still no response, I'd like to press forward with this. So with a joint sign-off from the major authors of the code in question, let's do this. 
Signed-off-by: Asahi Lina <lina@asahilina.net> Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Rose Hudson <rose@krx.sh> Acked-by: Lyude Paul [over IRC: "yes I'm fine with that"] Meh'd-by: Rob Clark <robdclark@chromium.org> Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22062>	2023-03-28 05:14:00 +00:00
Alyssa Rosenzweig	5ea9c2e634	agx: Make partial DCE optional Our dead code elimination pass does two things: 1. delete instructions that are entirely unnecessary 2. delete unnecessary destinations of necessary instructions To deal with pass ordering issues, we sometimes want to do #1 without #2. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	f603d8ce9e	asahi: Clang-format the subtree See `0afd691f29` ("panfrost: clang-format the tree") for why I'm doing this. Asahi already mostly follows Mesa style so this doesn't do much. But this means we can all stop thinking about formatting and trust the robot poets to do that for us. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20434>	2022-12-27 22:46:29 +00:00
Alyssa Rosenzweig	680c873b35	agx: Undo sed fail Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20434>	2022-12-27 22:46:29 +00:00
Alyssa Rosenzweig	98f0ebf264	agx: Pass agx_index to agx_copy More straightforward interface and will allow including immediates later if we want to. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	c9a96d4615	agx: Preload vertex/instance ID only at start This means we don't reserve the registers, which improves RA considerably. Using a special preload pseudo-op instead of a regular move allows us to constrain semantics and guarantee coalescing. shader-db on glmark2 subset: total instructions in shared programs: 6448 -> 6442 (-0.09%) instructions in affected programs: 230 -> 224 (-2.61%) helped: 4 HURT: 0 total bytes in shared programs: 42232 -> 42196 (-0.09%) bytes in affected programs: 1530 -> 1494 (-2.35%) helped: 4 HURT: 0 total halfregs in shared programs: 2291 -> 1926 (-15.93%) halfregs in affected programs: 2185 -> 1820 (-16.70%) helped: 75 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18804>	2022-10-14 01:37:39 +00:00
Alyssa Rosenzweig	1dcaade3e2	agx: Rename "combine" to "collect" For consistency with ir3 and bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18804>	2022-10-14 01:37:39 +00:00
Alyssa Rosenzweig	7c9fba34bc	agx: Switch to dynamic allocation of srcs/dests So we can handle parallel copies later. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18804>	2022-10-14 01:37:39 +00:00
Alyssa Rosenzweig	c2bc8c1384	agx: Don't prefix pseudo-ops It's not really buying us anything and it clutters the IR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18804>	2022-10-14 01:37:39 +00:00
Alyssa Rosenzweig	640fd089a2	agx: Ensure that the optimizer sees legitimate SSA Expecting it to keep unused definitions around is wishful. Add an "anchoring" unit_test instruction to consume the results so they don't have to be precoloured registers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	52467c2d1e	agx: Test fsat+f2f16 together Something I hit when mucking with this pass. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	4f85a7be8c	agx: Make p_combine take a dynamic src count For larger vectors. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18525>	2022-09-13 16:04:28 +00:00
Alyssa Rosenzweig	18bb64fd3a	agx: Add more unit tests for float copyprop Would have caught the bug fixed by the previous commit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	8066ef9d30	agx: Port minifloat tests to GTest These tests predate using GTest in the compiler. Now that we do, we'd like to have the tests together so they run regularly. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17824>	2022-08-01 18:34:11 +00:00
Alyssa Rosenzweig	c712043b9c	agx: Unit test parallel copy lowering It's pretty tricky. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16268>	2022-05-01 22:00:00 -04:00
Alyssa Rosenzweig	3f1e926bf4	agx: Use a dynarray for predecessors This imposes a fixed ordering, allowing phi sources to be implicitly ordered. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16268>	2022-05-01 22:00:00 -04:00
Alyssa Rosenzweig	bb1fb0a9db	agx: Dynamically allocate agx_instr->src Required for phi nodes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16268>	2022-05-01 21:58:29 -04:00
Alyssa Rosenzweig	d39b1c3426	agx: Implement simple copyprop Cleans up some of the mess. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16268>	2022-05-01 21:58:29 -04:00
Alyssa Rosenzweig	7d38bcb7ee	agx: Use pseudo ops for mov/not/and/xor/or Rather than using builder magic (implicitly lowered on emit), add actual pseudo operations (explicitly lowered before encoding). In theory this is slower, but I doubt it matters. This makes the instruction aliases first-class for IR printing and machine inspection, which will make optimization passes easier to write. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18804>	2022-10-14 01:37:39 +00:00
Alyssa Rosenzweig	3d8c2f2693	agx: Add unit test infrastructure Lifted from Bifrost. Add some basic optimizer tests (they pass!) to show the compiler is ready to be unit tested. Given we can't have hardware CI for Asahi yet -- and dEQP is still pretty janky -- unit testing should prove quite useful. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16268>	2022-05-01 21:58:29 -04:00
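
Note on d459de85b7 ("agx: Optimize swaps of 2x16 channels"): the commit replaces an XOR-based swap of the two 16-bit halves of a 32-bit register with a single extr instruction. The C sketch below only illustrates the arithmetic idea. It models the one-instruction form as a plain rotate by 16, which is an assumption, since the log does not describe the operands of extr. The function names are hypothetical and are not taken from the Mesa tree.

```c
#include <stdint.h>

/* Swap the two 16-bit halves with the classic three-XOR sequence,
 * i.e. the kind of code the commit message says it reduces. */
static uint32_t swap_halves_xor(uint32_t reg)
{
   uint16_t lo = (uint16_t)(reg & 0xffffu);
   uint16_t hi = (uint16_t)(reg >> 16);

   lo ^= hi;
   hi ^= lo; /* hi now holds the original lo */
   lo ^= hi; /* lo now holds the original hi */

   return ((uint32_t)hi << 16) | lo;
}

/* Same result in one operation: rotate the 32-bit value by 16 bits.
 * Assumption: this stands in for what a single extr achieves. */
static uint32_t swap_halves_rotate(uint32_t reg)
{
   return (reg << 16) | (reg >> 16);
}
```

Both functions return the same value for every input, so the single-operation form can stand in wherever the two halves live in one aligned 32-bit register.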

27 commits
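
Note on afa38c7d4f ("agx: Vectorize 16-bit parallel copies"): the commit merges two 16-bit copies to/from adjacent 16-bit registers into one 32-bit copy of the register pair. A minimal sketch of that pairing test follows. The `copy16` struct, the function names, and the even-alignment requirement are illustrative assumptions rather than the compiler's actual data structures, and the sketch ignores the ordering constraints that real parallel copy lowering has to respect.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical copy record. Indices count 16-bit half-registers, so
 * halves 2k and 2k+1 are assumed to form one 32-bit register. */
struct copy16 {
   unsigned dest;
   unsigned src;
};

/* True if a (low half) and b (high half) can become one 32-bit copy:
 * both ends must start on an even half and be adjacent. */
static bool can_vectorize_pair(struct copy16 a, struct copy16 b)
{
   bool aligned = (a.dest % 2 == 0) && (a.src % 2 == 0);
   bool adjacent = (b.dest == a.dest + 1) && (b.src == a.src + 1);

   return aligned && adjacent;
}

/* Count the copies left after greedily merging neighbouring entries. */
static size_t count_after_vectorize(const struct copy16 *copies, size_t n)
{
   size_t count = 0;

   for (size_t i = 0; i < n; ++i) {
      if (i + 1 < n && can_vectorize_pair(copies[i], copies[i + 1]))
         ++i; /* the partner is folded into this 32-bit copy */

      ++count;
   }

   return count;
}
```

For example, the copies (dest 0, src 4) and (dest 1, src 5) merge into one, while (dest 1, src 5) and (dest 2, src 6) do not, because they straddle two 32-bit registers.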