fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 13:08:09 +02:00

Author	SHA1	Message	Date
David Heidelberg	63781071db	panfrost: drop leftover definition after pan_nir_lower_64bit_intrin removal Fixes: `bd0d3c7b1c` ("panfrost: drop pan_nir_lower_64bit_intrin") Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30994>	2024-09-05 11:02:29 +00:00
Eric R. Smith	9e04c0a818	panfrost: add support for image2DMSArray on bifrost On bifrost we only can use 3 coordinates for images, but image2DMSArray needs 4 (x, y, sample#, and array index). We work around this by making the image nr_samples times higher than the original image, using the Y coordinate to address the sample plane. This limits the maximum image height (to 4K pixels instead of 64K pixels in the 16 sample case) but at least allows us to use the images. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30521>	2024-08-23 16:57:58 +00:00
Daniel Stone	e05415a82e	format: Generate endian-independent format aliases Instead of having a hardcoded list of endian-independent format aliases in the header, generate them from the format definitions. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29649>	2024-07-19 13:50:42 +00:00
David Heidelberg	68215332a8	build: pass licensing information in SPDX form Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Dylan Baker <dylan.c.baker@intel.com> Acked-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29972>	2024-06-29 12:42:49 -07:00
Alyssa Rosenzweig	15257b65c6	treewide: use nir_metadata_control_flow Via Coccinelle patch: @@ @@ -nir_metadata_block_index \| nir_metadata_dominance +nir_metadata_control_flow ...plus some manual fixups for call sites missed by coccinelle. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Juan A. Suarez Romero <jasuarez@igalia.com> [broadcom] Acked-by: Vasily Khoruzhick <anarsoul@gmail.com> [lima] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29745>	2024-06-17 16:28:14 -04:00
Emma Anholt	3beae0f98e	nir,panfrost,agx: Fix driver PIXEL_COORD_INTEGER setting and drop workaround. nir_lower_frag_coord_to_pixel_coord was adding .5 to work around that the drivers were mistakenly setting PIXEL_COORD_HALF_INTEGER. With the setting corrected, the GL frontend handles it appropriately (instead of subtracting half in the frontend for ARB_fragment_coord_conventions integer setting and then adding the half back here), and makes the pass reusable from Intel. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29585>	2024-06-10 16:59:38 +00:00
Alexandre Marquet	ee9809c889	pan/mdg: quirk to disable auto32 For some reason, flat shading on T604 does not work when using auto32 varyings type. This commit introduces a quirk for T60x, and some plumbing in pan_nir, allowing to explicitly use appropriate types, rather than always using .u32 for flat shading. Backport-to: 24.1 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10632 Signed-off-by: Alexandre Marquet <tb@a-marquet.fr> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28146>	2024-05-09 21:21:32 +00:00
Boris Brezillon	8cba497701	panfrost: Move the image attribute offset adjustment to a NIR pass The gallium and vulkan drivers deal with vertex attribute emission differently. The gallium driver re-emits the VS attributes on each draw, while the vulkan driver uses explicit attribute/image descriptor dirtiness tracking, and could keep the attribute array around if a new pipeline using a different number of attributes is bound. If we want to be able to do that, we need to assign a fixed offset for image attributes, such that the Vulkan descriptor lowering pass knows where the images are in the attribute table. We could teach the Bifrost backend how to deal with a custom offset but doing that in a lowering pass also simplifies the Midgard code. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28200>	2024-03-26 09:24:25 +01:00
Yonggang Luo	d2229304dc	panfrost/meson: remove redundant gallium include from meson files Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24439>	2024-03-14 17:23:55 +00:00
Mary Guillemard	fbe820f5a0	panfrost, pan/lib: Move pan_resource_table to panfrost pan_blitter now uses its own table definition. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27846>	2024-03-11 09:23:56 +00:00
Mary Guillemard	652e1c2e13	pan/bi: Rework indices for attributes on Valhall This also fixes missing encoding of indices with non immediate index. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27846>	2024-03-11 09:23:56 +00:00
Mary Guillemard	ce52b6d359	pan/bi: Rework indices for tex on Valhall Lower tex/sampler table in indices on panfrost. This also implements wide indices and changes the format of texture and sampler indices received by the compiler. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27846>	2024-03-11 09:23:56 +00:00
Boris Brezillon	4477daf957	panfrost: Rework the way we compute thread info Rework the way we compute thread info to make it mostly GPU-agnostic outside of the kmod backend. The new logic is based on the following information extracted from GPU registers: - maximum number of threads per core - maximum number of threads per workgroup - number of registers per core If the GPU doesn't provide this information (registers are zero), we pick the per-arch defaults we had in panfrost_max_thread_count(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Antonino Maniscalco <antonino.maniscalco@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26358>	2024-03-01 10:42:43 +00:00
Eric R. Smith	c9831a4d34	panfrost: add lowering pass for multisampled images Panfrost generally treats 2D multisampled images like 3D images, with the R coordinate holding the sample index. This commit adds a lowering pass to convert 2DMS images to 3D for the compiler. It is not actually invoked yet. Signed-off-by: Eric R. Smith <eric.smith@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27626>	2024-02-26 19:01:32 +00:00
Christian Duerr	49c1b404e5	panfrost: Fix dual-source blending If dual blending is enabled, only 1 output is supported. Multiple outputs confuse the write combining pass in this case, leading to incorrect output and/or an assert failure in emit_fragment_store. The fix is straightforward, just skip the speculative emitting of multiple outputs in the case where dual source blending is enabled. This also adds an extra sanity check in `pan_nir_lower_zs_store` to check for only one blend store being present. Fixes: `c65a9be421` ("panfrost: Preprocess shaders at CSO create time") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9487 Co-Authored-By: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26474>	2024-02-05 13:25:56 +00:00
Alyssa Rosenzweig	d1eb17e92e	treewide: Drop nir_ssa_for_src users Via Coccinelle patch: @@ expression b, s, n; @@ -nir_ssa_for_src(b, *s, n) +s->ssa @@ expression b, s, n; @@ -nir_ssa_for_src(b, s, n) +s.ssa Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25247>	2023-09-18 10:25:17 -04:00
Karol Herbst	bd0d3c7b1c	panfrost: drop pan_nir_lower_64bit_intrin It's dead code now. Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24939>	2023-08-30 16:25:40 +00:00
Karol Herbst	7550f59178	panfrost: drop 64 bit handling for cl workgroup intrinsics Signed-off-by: Karol Herbst <git@karolherbst.de> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24905>	2023-08-30 07:04:33 +00:00
Alyssa Rosenzweig	cda1961835	treewide: Also handle struct nir_builder form Via Coccinelle patch: @def@ typedef bool; typedef nir_builder; typedef nir_instr; typedef nir_def; identifier fn, instr, intr, x, builder, data; @@ static fn(struct nir_builder* builder, -nir_instr instr, +nir_intrinsic_instr intr, ...) { ( - if (instr->type != nir_instr_type_intrinsic) - return false; - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); \| - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); - if (instr->type != nir_instr_type_intrinsic) - return false; ) <... ( -instr->x +intr->instr.x \| -instr +&intr->instr ) ...> } @pass depends on def@ identifier def.fn; expression shader, progress; @@ ( -nir_shader_instructions_pass(shader, fn, +nir_shader_intrinsics_pass(shader, fn, ...) \| -NIR_PASS_V(shader, nir_shader_instructions_pass, fn, +NIR_PASS_V(shader, nir_shader_intrinsics_pass, fn, ...) \| -NIR_PASS(progress, shader, nir_shader_instructions_pass, fn, +NIR_PASS(progress, shader, nir_shader_intrinsics_pass, fn, ...) ) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24852>	2023-08-24 15:48:02 +00:00
Alyssa Rosenzweig	465b138f01	treewide: Use nir_shader_intrinsic_pass sometimes This converts a lot of trivial passes. Nice boilerplate deletion. Via Coccinelle patch (with a small manual fix-up for panfrost where coccinelle got confused by genxml + ninja clang-format squashed in, and for Zink because my semantic patch was slightly buggy). @def@ typedef bool; typedef nir_builder; typedef nir_instr; typedef nir_def; identifier fn, instr, intr, x, builder, data; @@ static fn(nir_builder* builder, -nir_instr instr, +nir_intrinsic_instr intr, ...) { ( - if (instr->type != nir_instr_type_intrinsic) - return false; - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); \| - nir_intrinsic_instr intr = nir_instr_as_intrinsic(instr); - if (instr->type != nir_instr_type_intrinsic) - return false; ) <... ( -instr->x +intr->instr.x \| -instr +&intr->instr ) ...> } @pass depends on def@ identifier def.fn; expression shader, progress; @@ ( -nir_shader_instructions_pass(shader, fn, +nir_shader_intrinsics_pass(shader, fn, ...) \| -NIR_PASS_V(shader, nir_shader_instructions_pass, fn, +NIR_PASS_V(shader, nir_shader_intrinsics_pass, fn, ...) \| -NIR_PASS(progress, shader, nir_shader_instructions_pass, fn, +NIR_PASS(progress, shader, nir_shader_intrinsics_pass, fn, ...) ) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24852>	2023-08-24 15:48:02 +00:00
Faith Ekstrand	de063a1481	nir: Drop most uses of nir_instr_rewrite_src_ssa() Generated with the following semantic patch: @@ expression I, S, D; @@ -nir_instr_rewrite_src_ssa(I, S, D); +nir_src_rewrite(S, D); Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24729>	2023-08-18 01:00:15 +00:00
Faith Ekstrand	4695bebc79	nir: Drop nir_dest Instead, we replace every use of it with nir_def. Most of this commit was generated by sed: sed -i -e 's/dest.ssa/def/g' src/*/.h src/*/.c src/*/.cpp A few manual fixups were required in lima and the nir_legacy code. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24674>	2023-08-14 21:22:53 +00:00
Faith Ekstrand	9d81f13a75	nir: Get rid of nir_dest_num_components() We could add a nir_def_num_components() helper but we use ssa.num_components about 3x as often as nir_dest_num_components() today so that's a major Coccinelle refactor anyway and this doesn't make it much worse. Most of this commit was generated by the following semantic patch: @@ expression D; @@ <... -nir_dest_num_components(D) +D.ssa.num_components ... Some manual fixup was needed, especially in cpp files where Coccinelle tends to give up the moment it sees any interesting C++. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24674>	2023-08-14 21:22:53 +00:00
Faith Ekstrand	80a1836d8b	nir: Get rid of nir_dest_bit_size() We could add a nir_def_bit_size() helper but we use ->bit_size about 3x as often as nir_dest_bit_size() today so that's a major Coccinelle refactor anyway and this doesn't make it much worse. Most of this commit was generated by the following semantic patch: @@ expression D; @@ <... -nir_dest_bit_size(D) +D.ssa.bit_size ... Some manual fixup was needed, especially in cpp files where Coccinelle tends to give up the moment it sees any interesting C++. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24674>	2023-08-14 21:22:53 +00:00
Alyssa Rosenzweig	09d31922de	nir: Drop "SSA" from NIR language Everything is SSA now. sed -e 's/nir_ssa_def/nir_def/g' \ -e 's/nir_ssa_undef/nir_undef/g' \ -e 's/nir_ssa_scalar/nir_scalar/g' \ -e 's/nir_src_rewrite_ssa/nir_src_rewrite/g' \ -e 's/nir_gather_ssa_types/nir_gather_types/g' \ -i $(git grep -l nir \| grep -v relnotes) git mv src/compiler/nir/nir_gather_ssa_types.c \ src/compiler/nir/nir_gather_types.c ninja -C build/ clang-format cd src/compiler/nir && find .c .h -type f -exec clang-format -i \{} \; Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24585>	2023-08-12 16:44:41 -04:00
Alyssa Rosenzweig	95e3df39c0	treewide: sed out more is_ssa Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:28 +00:00
Alyssa Rosenzweig	5fead24365	treewide: Drop is_ssa asserts We only see SSA now. Via Coccinelle patch: @@ expression x; @@ -assert(x.is_ssa); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24432>	2023-08-03 22:40:28 +00:00
Alyssa Rosenzweig	a8c0b6695f	panfrost: Remove unused helpers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24253>	2023-07-21 11:25:48 +00:00
Alyssa Rosenzweig	64ff2b3ed6	panfrost: Lower vertex_id for XFB Even on Valhall, vertex_id is zero-based in a transform feedback program. Lower that for transform feedback programs properly since it wouldn't happen automatically on Valhall. Fixes assertion fails. Fixes: `91ffd10351` ("pan/bi: Lower gl_VertexID in NIR") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24198>	2023-07-20 01:25:34 +00:00
Konstantin Seurer	ed08305549	panfrost: Use nir_builder_at Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23883>	2023-07-03 15:21:37 +00:00
Yonggang Luo	d86bcc39d6	panfrost: Convert to use nir_foreach_function_impl when possible Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23960>	2023-07-01 17:39:50 +08:00
Alyssa Rosenzweig	f9a0423a20	pan/mdg: Propagate modifiers in the backend It really isn't that hard. This drops the roundmode optimization but otherwise should be at parity to what there was before, and it's massively more competent at it anyway. total instructions in shared programs: 1514477 -> 1508444 (-0.40%) instructions in affected programs: 645848 -> 639815 (-0.93%) helped: 2712 HURT: 187 Instructions are helped. total bundles in shared programs: 645069 -> 642999 (-0.32%) bundles in affected programs: 136233 -> 134163 (-1.52%) helped: 1242 HURT: 319 Bundles are helped. total quadwords in shared programs: 1130469 -> 1125969 (-0.40%) quadwords in affected programs: 379780 -> 375280 (-1.18%) helped: 1878 HURT: 376 Quadwords are helped. total registers in shared programs: 90577 -> 90633 (0.06%) registers in affected programs: 5627 -> 5683 (1.00%) helped: 309 HURT: 294 Inconclusive result (value mean confidence interval includes 0). total threads in shared programs: 55594 -> 55607 (0.02%) threads in affected programs: 118 -> 131 (11.02%) helped: 43 HURT: 33 Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 1399 -> 1371 (-2.00%) spills in affected programs: 345 -> 317 (-8.12%) helped: 10 HURT: 4 total fills in shared programs: 5273 -> 5133 (-2.66%) fills in affected programs: 1035 -> 895 (-13.53%) helped: 12 HURT: 4 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23769>	2023-06-30 16:29:35 -04:00
Erik Faye-Lund	45e7e16222	pan: use imm-helpers Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23855>	2023-06-29 07:08:18 +00:00
Alyssa Rosenzweig	815efcdf7e	nir: Use nir_builder_create perl -p0e 's/nir_builder ([^;]);\snir_builder_init\(&\1, /nir_builder \1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	99a00e2247	treewide: Use nir_trim_vector more Via Coccinelle patches @@ expression a, b, c; @@ -nir_channels(b, a, (1 << c) - 1) +nir_trim_vector(b, a, c) @@ expression a, b, c; @@ -nir_channels(b, a, BITFIELD_MASK(c)) +nir_trim_vector(b, a, c) @@ expression a, b; @@ -nir_channels(b, a, 3) +nir_trim_vector(b, a, 2) @@ expression a, b; @@ -nir_channels(b, a, 7) +nir_trim_vector(b, a, 3) Plus a fixup for pointless trimming an immediate in RADV and radeonsi. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23352>	2023-06-06 18:52:25 +00:00
Erik Faye-Lund	28b1c5bca1	nir: use nir_i{ne,eq}_imm helpers We already have these, so let's use them more. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23393>	2023-06-05 13:40:07 +00:00
Alyssa Rosenzweig	2b2685f551	pan/lower_framebuffer: Use nir_replicate Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Italo Nicola <italonicola@collabora.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23259>	2023-05-30 16:24:21 -04:00
Alyssa Rosenzweig	7f6491b76d	nir: Combine if_uses with instruction uses Every nir_ssa_def is part of a chain of uses, implemented with doubly linked lists. That means each requires 2 * 64-bit = 16 bytes per def, which is memory intensive. Together they require 32 bytes per def. Not cool. To cut that memory use in half, we can combine the two linked lists into a single use list that contains both regular instruction uses and if-uses. To do this, we augment the nir_src with a boolean "is_if", and reimplement the abstract if-uses operations on top of that list. That boolean should fit into the padding already in nir_src so should not actually affect memory use, and in the future we sneak it into the bottom bit of a pointer. However, this creates a new inefficiency: now iterating over regular uses separate from if-uses is (nominally) more expensive. It turns out virtually every caller of nir_foreach_if_use(_safe) also calls nir_foreach_use(_safe) immediately before, so we rewrite most of the callers to instead call a new single `nir_foreach_use_including_if(_safe)` which predicates the logic based on `src->is_if`. This should mitigate the performance difference. There's a bit of churn, but this is largely a mechanical set of changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Karol Herbst	87aeea20ac	panfrost: move max_thread_count and take reg_count into account We'll need it to report proper thread counts for OpenCL. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19855>	2023-03-31 20:29:00 +00:00
Faith Ekstrand	e001995dc5	util,mesa,panfrost: Drop some author tags This is what git blame is for Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22120>	2023-03-26 00:16:25 +00:00
Alyssa Rosenzweig	f888994679	panfrost: Move panfrost_sysvals to GL driver This shouldn't be used by anything else at this point. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	3e64b13193	panfrost: Move sysvals to GL driver struct Only the GL driver produces/consumes these, they shouldn't be in the common shader_info. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	ffb9919c2f	panfrost: Lower sysvals in GL Drop the backend compiler sysval handling in favour of the pass in the GL driver, bringing us into compliance with Ekstrand's rule. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	2745daa05a	pan/lower_framebuffer: Lower MSAA blend shaders Do it explicitly in NIR rather than implicitly in the Midgard compiler. This avoids a nasty sideband input for the render target formats and sample count, for blend shaders on midgard only. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	ca2042f359	panfrost: Preprocess shaders in the driver This is a flag-day change to how we compile. We split preprocessing NIR into a separate step from compiling, giving the driver a chance to apply its own lowerings on the preprocessed NIR before the final optimization loop. During that time, the different producers of NIR (panfrost, panvk, blend shaders, blit shaders...) will be able to (differently) lower system values. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	bccd6d3880	pan/lower_framebuffer: Use nir_shader_instructions_pass Removes a lot of indentation, and improves metadata handling. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	8059eb1577	pan/lower_framebuffer: Only call for FS It doesn't make sense for shader stages other than fragment (and blend which is fragment-like), assert this. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	c333c0ea57	panfrost: Remove unused inputs.nr_cbufs Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	da0815fb9b	panfrost: Remove inputs->blend.rt This sideband input is now unused, as the information is available locally within the NIR as it should be. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00
Alyssa Rosenzweig	8db30010dc	pan/bi: Lower load_output to make sysval explicit See previous commits for justification. Later, we'll split up NIR processing in a few steps to give the caller a chance to lower the sysval, at which point the goofy inputs here will go away. v2: Only lower in fragment shaders. Likely harmless to run elsewhere but still wrong because the location enum is defined per-stage. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00


238 commits