fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-28 22:58:13 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	f2740ac69c	pan/decode: Add support for decoding CSF Add support to pandecode for Mali architecture v10, featuring the new command stream frontend (CSF). This replaces the "job chain" with a new Command Execution Unit (CEU) that runs a domain-specific assembly language. That requires us to refactor pandecode substantially, splitting out JM-only code from shared JM/CSF common code, and adding new CSF-only decode routines to disassemble and interpret CSF command streams and pretty-printing the data structures hit. This is of course impossible to do properly, since the CEU is pretty easily Turing-complete and hence subject to the halting problem. But we implement some simple heuristics to follow jumps that are just good enough for the simple command streams emitting by both the DDK and Panfrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20837>	2023-02-13 15:24:10 +00:00
Alyssa Rosenzweig	102d4292d5	panfrost: Fix some fields in v10.xml Correct some errors from the file's initial check in, as we're about to add corresponding pandecode changes for the file. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20837>	2023-02-13 15:24:10 +00:00
Alyssa Rosenzweig	7968c474b8	panvk: Disable SNORM rendering Driver isn't ready for this yet. `7f98a9ba2b` ("panfrost: Implement GL_EXT_render_snorm on Bifrost+") caused piles of tests to go from NotSupported -> Fail, so let's functionally revert that. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21257>	2023-02-13 14:04:52 +00:00
Alyssa Rosenzweig	6142d50375	panvk: Fix varying linking Since `2316b80d77` ("panfrost: Don't use nir_variable to link varyings"), we can only get correct type information from the fragment shader inputs (not the vertex shader output). Fixes piles of CTS regressions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21257>	2023-02-13 14:04:52 +00:00
Alyssa Rosenzweig	1ba20868c4	panvk: Take lock when tracing We're not supposed to call the GENX(pandecode_jc) routines (e.g. pandecode_jc_v7), since it's an internal interface that expects the caller to take a lock first. Instead we're supposed to call the non-GenXML pandecode_jc entrypoint which does the locking properly. Fixes assertion failures when tracing with recent pandecode: deqp-vk: ../src/util/simple_mtx.h:142: simple_mtx_assert_locked: Assertion `mtx->val' failed. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21257>	2023-02-13 14:04:52 +00:00
Eric Engestrom	5a2326f9b2	panfrost: drop no-longer-needed libglsl Fixes: `551c2aadd4` ("pan/bi: Remove standalone compiler") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21240>	2023-02-10 14:09:37 +00:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Alyssa Rosenzweig	7f98a9ba2b	panfrost: Implement GL_EXT_render_snorm on Bifrost+ It turns out it's really easy. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20684>	2023-02-03 17:21:34 +00:00
Erik Faye-Lund	b6a344f4ba	meson: do not reconstruct ICD paths Meson will already construct these paths for us, so let's reuse them instead of throwing away the result and recontstructing them. Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20907>	2023-01-27 11:35:50 +00:00
Emma Anholt	f6c06ef2f6	ci: Add manual rules variations to disable irrelevant driver jobs. If you're only affecting one or a couple of drivers, it would be nice if your pipeline buttons on the web UI weren't full of manual run buttons for all the other drivers. This is a bunch of duplicated lines, but less than it could have been now that we have !references. In some of these cases (i915g, nouveau, etnaviv), we have no non-manual jobs for those drivers, so I could have just rewritten the original "driver-rules" to "driver-manual-rules". I decided to keep things consistent between drivers, though, because this is all esoteric enough to readers already without making different drivers' rules look different. Fixes: #4891 Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17445>	2023-01-26 00:48:19 +00:00
Emma Anholt	849af68dbd	ci/piglit: Add some common piglit skips for Mesa CI's testing of glx. Since our X servers don't have a compositor, and we run tests in parallel, various swap and frontbuffer tests won't ever be stable. Rather than having every driver have to track those flakes, make a general X11 skips list as a known issue of our CI rather than pointing fingers at drivers. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Martin Roukala <martin.roukala@mupuf.org> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20798>	2023-01-24 00:13:02 +00:00
Dylan Baker	c31629ee78	meson: remove version checks for < 0.59 Which is now required, so these are useless Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20752>	2023-01-19 23:06:07 +00:00
Alyssa Rosenzweig	f02354d3e2	pan/mdg: Remove MSGS debug These should all be unreachable and what's left is dead-code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19350>	2023-01-16 22:20:43 +00:00
Alyssa Rosenzweig	23968aeeb5	pan/mdg: Scalarize LUT instructions in NIR Simpler. Small shaderdb regressions from using IR registers instead of SSA, but that's probably what we needed for correctness (given that SSA is violated otherwise) hence the Cc. total instructions in shared programs: 1520220 -> 1518127 (-0.14%) instructions in affected programs: 167437 -> 165344 (-1.25%) helped: 662 HURT: 206 helped stats (abs) min: 1.0 max: 46.0 x̄: 3.65 x̃: 2 helped stats (rel) min: 0.18% max: 22.22% x̄: 2.43% x̃: 1.71% HURT stats (abs) min: 1.0 max: 7.0 x̄: 1.56 x̃: 1 HURT stats (rel) min: 0.17% max: 8.33% x̄: 2.66% x̃: 2.33% 95% mean confidence interval for instructions value: -2.65 -2.18 95% mean confidence interval for instructions %-change: -1.45% -0.99% Instructions are helped. total bundles in shared programs: 649844 -> 649345 (-0.08%) bundles in affected programs: 59278 -> 58779 (-0.84%) helped: 577 HURT: 249 helped stats (abs) min: 1.0 max: 39.0 x̄: 1.56 x̃: 1 helped stats (rel) min: 0.26% max: 30.00% x̄: 3.13% x̃: 2.19% HURT stats (abs) min: 1.0 max: 12.0 x̄: 1.61 x̃: 1 HURT stats (rel) min: 0.58% max: 25.00% x̄: 5.25% x̃: 4.00% 95% mean confidence interval for bundles value: -0.78 -0.43 95% mean confidence interval for bundles %-change: -0.98% -0.23% Bundles are helped. total quadwords in shared programs: 1136767 -> 1134956 (-0.16%) quadwords in affected programs: 141780 -> 139969 (-1.28%) helped: 744 HURT: 311 helped stats (abs) min: 1.0 max: 9.0 x̄: 3.13 x̃: 2 helped stats (rel) min: 0.14% max: 26.67% x̄: 2.77% x̃: 2.13% HURT stats (abs) min: 1.0 max: 8.0 x̄: 1.68 x̃: 1 HURT stats (rel) min: 0.35% max: 10.00% x̄: 3.17% x̃: 1.69% 95% mean confidence interval for quadwords value: -1.89 -1.54 95% mean confidence interval for quadwords %-change: -1.27% -0.77% Quadwords are helped. total registers in shared programs: 90461 -> 90273 (-0.21%) registers in affected programs: 2833 -> 2645 (-6.64%) helped: 250 HURT: 82 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.08 x̃: 1 helped stats (rel) min: 6.67% max: 33.33% x̄: 14.06% x̃: 12.50% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 6.67% max: 50.00% x̄: 13.90% x̃: 12.50% 95% mean confidence interval for registers value: -0.67 -0.47 95% mean confidence interval for registers %-change: -8.62% -5.69% Registers are helped. total threads in shared programs: 55685 -> 55686 (<.01%) threads in affected programs: 76 -> 77 (1.32%) helped: 20 HURT: 17 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.30 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.47 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: -0.47 0.52 95% mean confidence interval for threads %-change: 5.81% 56.35% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 1387 -> 1379 (-0.58%) spills in affected programs: 283 -> 275 (-2.83%) helped: 5 HURT: 1 total fills in shared programs: 5256 -> 5176 (-1.52%) fills in affected programs: 557 -> 477 (-14.36%) helped: 5 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19350>	2023-01-16 22:20:43 +00:00
Alyssa Rosenzweig	10759d1708	pan/mdg: Use special NIR ops for trig scaling Otherwise the lowering is fundamentally unsound due to incorrect constant folding, even though it worked by chance with the old pass ordering. We're about to change slightly the way we handle fsin/fcos, which was enough to trigger this unsoundness. shader-db results are mostly a toss-up. total instructions in shared programs: 1520675 -> 1520220 (-0.03%) instructions in affected programs: 96841 -> 96386 (-0.47%) helped: 397 HURT: 3 helped stats (abs) min: 1.0 max: 4.0 x̄: 1.15 x̃: 1 helped stats (rel) min: 0.22% max: 6.25% x̄: 1.15% x̃: 0.40% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.58% max: 2.08% x̄: 1.08% x̃: 0.58% 95% mean confidence interval for instructions value: -1.19 -1.08 95% mean confidence interval for instructions %-change: -1.26% -1.01% Instructions are helped. total bundles in shared programs: 650088 -> 649844 (-0.04%) bundles in affected programs: 31132 -> 30888 (-0.78%) helped: 229 HURT: 23 helped stats (abs) min: 1.0 max: 4.0 x̄: 1.21 x̃: 1 helped stats (rel) min: 0.49% max: 7.14% x̄: 1.28% x̃: 0.71% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.48 x̃: 1 HURT stats (rel) min: 0.83% max: 8.33% x̄: 2.38% x̃: 1.85% 95% mean confidence interval for bundles value: -1.08 -0.86 95% mean confidence interval for bundles %-change: -1.15% -0.74% Bundles are helped. total quadwords in shared programs: 1137388 -> 1136767 (-0.05%) quadwords in affected programs: 71826 -> 71205 (-0.86%) helped: 367 HURT: 17 helped stats (abs) min: 1.0 max: 8.0 x̄: 1.80 x̃: 1 helped stats (rel) min: 0.31% max: 17.24% x̄: 2.27% x̃: 0.96% HURT stats (abs) min: 1.0 max: 6.0 x̄: 2.29 x̃: 2 HURT stats (rel) min: 0.44% max: 11.11% x̄: 2.18% x̃: 1.47% 95% mean confidence interval for quadwords value: -1.76 -1.47 95% mean confidence interval for quadwords %-change: -2.36% -1.78% Quadwords are helped. total registers in shared programs: 90483 -> 90461 (-0.02%) registers in affected programs: 890 -> 868 (-2.47%) helped: 67 HURT: 44 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 8.33% max: 25.00% x̄: 10.52% x̃: 9.09% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.02 x̃: 1 HURT stats (rel) min: 9.09% max: 50.00% x̄: 31.15% x̃: 33.33% 95% mean confidence interval for registers value: -0.39 -0.01 95% mean confidence interval for registers %-change: 1.75% 10.25% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total threads in shared programs: 55694 -> 55685 (-0.02%) threads in affected programs: 21 -> 12 (-42.86%) helped: 1 HURT: 5 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: -2.79 -0.21 95% mean confidence interval for threads %-change: -89.26% 39.26% Inconclusive result (%-change mean confidence interval includes 0). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19350>	2023-01-16 22:20:43 +00:00
Alyssa Rosenzweig	7c7c38b126	panfrost: Remove unused debug parameter We removed this path. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20707>	2023-01-16 16:57:47 +00:00
Alyssa Rosenzweig	ea03d0652d	panfrost: Remove PAN_MESA_DEBUG=deqp Now unused. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20707>	2023-01-16 16:57:46 +00:00
Alyssa Rosenzweig	41d99c10d1	panfrost: Fix logic ops on Bifrost opaque should not be set when logicops are enabled, that needs blending even on Bifrost. Fixes is for when I believe the bug became possible to hit. The logical error is older. Fixes Piglit logicop tests again. Fixes: `d849d9779a` ("panfrost: Avoid blend shader when not blending") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20685>	2023-01-16 16:02:23 +00:00
Alyssa Rosenzweig	2f97883276	pan/bi: Add a unit test for fsat(reg.yx) This would have caught the issue from the previous commit. Split out to make backporting the previous change less onerous. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20683>	2023-01-16 15:29:38 +00:00
Alyssa Rosenzweig	ed46c617b0	pan/bi: Fix incorrect compilation of fsat(reg.yx) Future changes to nir_lower_blend cause fsat(reg.yx) instructions to be generated, which correspond to "FCLAMP.v2f16 x.h10" pseudoinstructions. These get their swizzles lowered, but we forgot to clear the swizzle out, so we end up with extra swap (cancelling out the intended swizzle). Fix the lowering logic. Fixes: `ac636f5adb` ("pan/bi: Use FCLAMP pseudo op for clamp prop") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20683>	2023-01-16 15:29:38 +00:00
Alyssa Rosenzweig	5fdfd8044d	panfrost: Don't use AFBC of sRGB luminance-alpha This isn't allowed for the same reason that AFBC of regular luminance-alpha isn't allowed (and will raise DATA_INVALID_FAULTs). Reorder the checks to ensure these formats are checked. Fixes Piglit texwrap GL_EXT_texture_sRGB-s3tc. Fixes: `476be5cb27` ("panfrost: Don't use texture format swizzles on v7") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20686>	2023-01-14 20:00:37 +00:00
Eric Engestrom	aab4a260db	meson: add missing dependency Now that renderonly.h includes util/simple_mtx.h, which itself includes valgrind.h, dep_valgrind is required by any module that includes renderonly.h. In file included from ../src/gallium/auxiliary/renderonly/renderonly.h:33, from ../src/gallium/winsys/kmsro/drm/kmsro_drm_winsys.c:39: ../src/util/simple_mtx.h:34:12: fatal error: valgrind.h: No such file or directory 34 \| # include <valgrind.h> \| ^~~~~~~~~~~~ compilation terminated. dep_valgrind is part of idep_mesautil, which should be used instead of copying the list of deps for each util header included (which would have to be updated every time a util header changes its own includes), so let's add idep_mesautil everywhere that includes renderonly.h. Fixes: `ad4d7ca833` ("kmsro: Fix renderonly_scanout BO aliasing") Tested-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20530>	2023-01-06 15:40:39 +00:00
Alyssa Rosenzweig	980df9ede1	pan/bi: Move Bifrost specific C code to src/compiler/bifrost The goal is to make files at the root of src/compiler/ apply to both Bifrost and Valhall, while ISA-specific code (e.g. instruction packing) code goes in compiler/bifrost/ or compiler/valhall/. This is what Valhall is already doing, the Bifrost specific stuff was just grandfathered in. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20455>	2023-01-02 17:54:49 +00:00
Alyssa Rosenzweig	551c2aadd4	pan/bi: Remove standalone compiler This functionality is now available on Linux with drm-shim + shader-db, and I suspect the version bundled here is broken anyway. Strictly this drops Windows/macOS support for the known-broken frontend to the shader compiler but I can't say I'm terribly worried about that. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20455>	2023-01-02 17:54:48 +00:00
Alyssa Rosenzweig	1a35acd8d9	pan/bi: Rename panfrost/bifrost -> panfrost/compiler This is the compiler for both Bifrost and Valhall, and presumably future Mali GPUs too. Give it a more generic name so we can use the bifrost/ path for something a bit more specific. For historical reasons the compiler's name is still "bifrost" and uses the prefix `bi_`. I think that's ok in the same way that i915 in the kernel supports way more than just i915. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20455>	2023-01-02 17:54:48 +00:00
Aleksey Komarov	dcae301828	pan/va: Fix MUX.i32 and MUX.v2i16 description. Should be: `(A & mask) \| (B & ~mask)` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20441>	2022-12-28 21:36:54 +00:00
Aleksey Komarov	d14d7c49db	pan/va: Fix d0 description in enum "Load lane (8-bit)" Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20441>	2022-12-28 21:36:54 +00:00
Aleksey Komarov	f102b57423	pan/va: Fix description for constant 0xFAFCFDFE: -2, -3, -4, -6 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20441>	2022-12-28 21:36:54 +00:00
nihui	e584447aed	panvk: Fix null pointer dereference on cmd_buffer->ops Fixes: `84cd81e104` (panvk: Use common code for command buffer lifecycle management) Signed-off-by: Hui Ni <shuizhuyuanluo@126.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20406>	2022-12-26 12:57:07 +00:00
Asahi Lina	bb4aa8a3ea	panfrost: Fix race condition in BO imports When importing a BO, if it is already imported, then the handle will alias an existing BO instance. It is possible for the existing owner to free the BO after the import and leave a dangling handle before we get a chance to increase the refcount, so we need to lock the BO table mutex before importing, to make sure nobody else goes through the free path during that window. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20403>	2022-12-25 22:04:24 +00:00
Alyssa Rosenzweig	b53fa25587	panfrost: Clang-format pan_layout.c Messed up the "clang-format off" for this file. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Aleksey Komarov <q4arus@ya.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20431>	2022-12-23 21:43:08 -05:00
Alyssa Rosenzweig	0afd691f29	panfrost: clang-format the tree This switches us over to Mesa's code style [1], normalizing us within the tree. The results aren't perfect, but they bring us a hell of a lot closer to the rest of the tree. Panfrost doesn't feel so foreign relative to Mesa with this, which I think (in retrospect after a bunch of years of being "different") is the right call. I skipped PanVK because that's paused right now. find panfrost/ -type f -name '.h' \| grep -v vulkan \| xargs clang-format -i; find panfrost/ -type f -name '.c' \| grep -v vulkan \| xargs clang-format -i; clang-format -i gallium/drivers/panfrost/.c gallium/drivers/panfrost/.h ; find panfrost/ -type f -name '*.cpp' \| grep -v vulkan \| xargs clang-format -i [1] https://docs.mesa3d.org/codingstyle.html Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	a4705afe63	panfrost: Fix up some formatting for clang-format clang-format will make a mess of these otherwise. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	e35719be6f	panfrost: Add missing #includes Found shuffling headers with clang format. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	90e128ae03	panfrost: Remove perfetto-specific .clang-format We'll use the one in src/panfrost/.clang-format instead, which isn't identical but should be good enough. This way they don't conflict with each other. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	ee2dcdc3df	panfrost: Add clang-format file Based on freedreno settings, tweaked for panfrost's foreach macros. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	bd83e5ddaf	pan/bi: Use write masks on Valhall texture instrs I noticed a sequence like the following in a scheduled SuperTuxKart shader: TEX_SINGLE.slot0 @r0:r1, .. LD_VAR.wait0 @r2, ... FMA r1, ... Why do we stall waiting for the TEX_SINGLE instruction when it's not actually read? Because its upper channels are never read, leading to a write-after-write dependency when the register allocator puts some unrelated ALU destination in there. By appropriately masking the texture instruction's write, that false dependency disappears, avoiding the stall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20426>	2022-12-23 19:05:10 +00:00
Alyssa Rosenzweig	7d9c771b9b	pan/va: Pack texture write masks We'll generate nontrivial ones in a moment. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20426>	2022-12-23 19:05:10 +00:00
Alyssa Rosenzweig	f6d73ea7b4	pan/lower_framebuffer: Remove unused pack Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20420>	2022-12-23 16:27:16 +00:00
Alyssa Rosenzweig	8dd35e0ac7	pan/mdg: Remove unused disassembler functions Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20420>	2022-12-23 16:27:16 +00:00
Alyssa Rosenzweig	9cd6d0873d	panfrost: Remove experimental v7-only indirect draw path There are too many problems with indirect draws on v7 that we never got this code path to the finish line, and none of us have a good plan (or reason) to fix this. Proper indirect draws are only possible since v10 on Mali. There was interest in using this path to implement indexed draws in PanVK, that MR is stalled and it's not clear how much sense it makes to do Vulkan on anything older than v9 or v10 at this point. This code isn't gone, it'll still be in git history, but I don't see a lot of reason in keeping it in tree if it's unused and complicating e.g. the sysval upload path of the driver. Indirect dispatch remains supported on v7, as that path is working and flipped on for end users. Indirect dispatch on v7 is considerably less complicated than indirect draws. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20420>	2022-12-23 16:27:16 +00:00
Alyssa Rosenzweig	486c341769	panfrost: Add architecture description XML for v10 Add the GenXML hardware description for Mali architecture v10, as implemented in Mali-G610. This is not 100% complete but it should be good enough for parity with v9. The XML itself is forked off of v9, with all Job Managerisms replaced with CSFisms. This notably includes a large number of new structures defining the instructions that run on the Command Execution Unit (CEU). This is the first step towards supporting Mali-G610 (i.e. RK3588) upstream. Next up will be pandecode support. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20360>	2022-12-17 20:33:39 +00:00
Yonggang Luo	9f5ace9857	panvk: Fixes -Werror,-Wunused-but-set-variable for clang-15 in panvk_descriptor_set.c ../../src/panfrost/vulkan/panvk_descriptor_set.c:67:13: error: variable 'dynoffset_idx' set but not used [-Werror,-Wunused-but-set-variable] unsigned dynoffset_idx = 0, img_idx = 0; ^ Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19875>	2022-12-16 19:02:17 +00:00
Alyssa Rosenzweig	476be5cb27	panfrost: Don't use texture format swizzles on v7 They're too restricted for AFBC. Fix up instead. There are two problems at play: 1. We can't just map the format swizzle to the pixel format ordering on v7, because the "reordered" values aren't allowed with compression. 2. We can't just compose the format swizzle with the API swizzle, because the composed swizzle is applied to the border colour, so we need to be able to apply an inverted swizzle to the border colour. That only works for bijective format swizzles. Fortunately, there's a neat solution: decompose the format's swizzle into two swizzles, the first mapping to a reordering that IS allowed for compression, and the second a bijection. Then we use the allowed reordering when texturing, apply the bijective swizzle to the API swizzle, and apply the inverse of the bijective swizzle to the border colour. When we're sampling a border colour, what's now happening mathematically is: (API swizzle o bijective swizzle)((bijective swizzle^-1)(border colour)) = (API swizzle o (bijective swizzle o bijective swizzle^-1))(border colour) = API swizzle(border colour) which is exactly what we wanted. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20311>	2022-12-16 18:27:47 +00:00
Alyssa Rosenzweig	f159ff530e	panfrost: Allow swizzled AFBC on v9+ On v6 and earlier, the hardware supports arbitrary format swizzles for AFBC, so there's no restriction on AFBC. On v8 and newer, the format swizzle gets applied to the decompressed interchange format, so we can effectively support BGRA of AFBC images without any special handling. (Confirmed working on v9. Obviously I can't test on v8 but the expression is cleaner if we assume optimistically it's like v9. Without hardware, we get to make that assumption :-p) That just leaves v7 as the only architecture where format swizzles are restricted for compression but there are no plane descriptor. Don't apply the restriction to the newer parts. This gets us AFBC of window surfaces on v9+. As the limiting case, fullscreen glmark2-es2-wayland -btexture (1080p) in sway on Mali-G57 from 1300fps to 2353fps. 45% reduction in frame time is nothing to sneeze at. Achoo. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20311>	2022-12-16 18:27:47 +00:00
Alyssa Rosenzweig	cb5e417c01	panfrost: Introduce pan_afbc_mode Introduce an enum to represent an AFBC compression mode. These modes are not formats, on Valhall they are decoupled from the format. As such, it does not make sense to use a pipe_format to represent them. Add an enum that we can use in a straightforward way on Midgard and Bifrost to fallback for texture views, and can map 1:1 to the Valhall hardware enum. In addition to being less overloaded semantically, this lets -Wswitch kick in to ensure that we handle all enums when translating. The straightforward translation raises the following warnings: ../src/panfrost/lib/pan_cs.c:437:9: warning: enumeration value ‘PAN_AFBC_MODE_R5G5B5A1’ not handled in switch [-Wswitch] 437 \| switch (panfrost_afbc_format(PAN_ARCH, format)) { \| ^~~~~~ ...indicating that some formats were missed, leading to assertion fails "unknown canonical AFBC format" when rendering RGB5A1, which dEQP-GLES31 does. Fixes regressions in dEQP-GLES31.functional.draw_buffers_indexed.random.max_required_draw_buffers.* on Valhall. Given how scarce v9 hardware is, that v10 isn't upstream yet, and the offending code was merged a week ago, this should not have actually affected anyone. At any rate, it's a good reminder we really do need CI for v9... Fixes: `8e125b6c15` ("panfrost: Enable AFBC of more formats") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20311>	2022-12-16 18:27:47 +00:00
Alyssa Rosenzweig	0784adc668	panfrost: Luminance-alpha AFBC unsupported on v7+ The L8_UNORM, A8_UNORM, and L8A8_UNORM v7 formats do not support AFBC, regardless of swizzling. We're about to lift the restrictions on swizzling with AFBC on v7, so we'll need to handle these cases explicitly to avoid using AFBC in these cases. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20311>	2022-12-16 18:27:47 +00:00
Alyssa Rosenzweig	a3f9aa3b3e	panfrost: Align WSI strides for tiled AFBC When calculating legacy WSI strides for tiled AFBC, we need to account for the greater alignment requirement of tiled AFBC, or importing resources will fail later. Since tiled AFBC is only supported on v7 and later, and AFBC of window surfaces isn't being used on Linux on v7 and later, this probably hasn't been hit in practice. Probably. We're about to fix AFBC of window surfaces so we need to fix this side first. Fixes: `0255f554f3` ("panfrost: Advertise 16x16 tiled AFBC") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20311>	2022-12-16 18:27:47 +00:00
Alyssa Rosenzweig	a861501632	panfrost: Add tool to print supported texture formats While all Panfrost-supported Mali GPUs support all the compressed texture formats architecturally, the system integrator decides which formats will actually be wired up in the production system-on-chip. In the past there may have been legal considerations, I'm neither a lawyer nor a system integrator so couldn't say. It's useful for users to know which compressed texture formats are supported by their hardware, to understand its performance characteristics (and perhaps to buy systems that support their needs, especially if they need BCn formats which are omitted in many Mali implementations). To help with that, this commit adds a small standalone tool that prints which formats are supported. It is tested so far on Mali-T860 and Mali-G57. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Tested-by: Chris Healy <healych@amazon.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20086>	2022-12-14 22:48:47 +00:00
Ian Romanick	eb76cee9f8	nir: Eliminate nir_op_i2b There are a lot of optimizations in opt_algebraic that match ('ine', a, 0), but there are almost none that match i2b. Instead of adding a huge pile of additional patterns (including variations that include both ine and i2b), always lower i2b to a != 0. At this point in the series, it should be impossible for anything to generate i2b, so there /should not/ be any changes. The failing test on d3d12 is a pre-existing bug that is triggered by this change. I talked to Jesse about it, and, after some analysis, he suggested just adding it to the list of known failures. v2: Don't rematerialize i2b instructions in dxil_nir_lower_x2b. v3: Don't rematerialize i2b instructions in zink_nir_algebraic.py. v4: Fix zink-on-TGL CI failures by calling nir_opt_algebraic after nir_lower_doubles makes progress. The latter can generate b2i instructions, but nir_lower_int64 can't handle them (anymore). v5: Add back most of the hunk at line 2125 of nir_opt_algebraic.py. I had accidentally removed the f2b(bf2(x)) optimization. v6: Just eliminate the i2b instruction. v7: Remove missed i2b32 in midgard_compile.c. Remove (now unused) emit_alu_i2orf2_b1 function from sfn_instr_alu.cpp. Previously this function was still used. 🤷 No shader-db changes on any Intel platform. All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141165875 -> 141165873 (-0.0%) Instructions helped: 2 Cycles in all programs: 9098956382 -> 9098956350 (-0.0%) Cycles helped: 2 The two Vulkan shaders are helped because of the "new" (('b2i32', ('ine', ('ubfe', a, b, 1), 0)), ('ubfe', a, b, 1)) algebraic pattern. Acked-by: Jesse Natalie <jenatali@microsoft.com> [earlier version] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> [earlier version] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00

1 2 3 4 5 ...

4521 commits