fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 17:38:08 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	eab4d6a96f	agx: Add and use agx_nir_ssa_index helper Common subexpression that we'll repeat once more in the next patch. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21431>	2023-02-21 08:10:15 +00:00
Alyssa Rosenzweig	e93a221024	agx: Handle group_memory_barrier A combination of control_barrier + memory_barrier but it's always seen with those. This would be safer with scoped barriers... Fixes dEQP-GLES31.functional.synchronization.inter_invocation.ssbo Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:40 +00:00
Alyssa Rosenzweig	e9cec96633	agx: Implement b2b32 Shows up with store_shared. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:40 +00:00
Alyssa Rosenzweig	955797bb00	agx: Pack local atomics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	14f546726e	agx: Lower shared memory offsets to 16-bit Per the hardware requirement. This simplifies instruction selection (it avoids the need to constant fold u2u16 in the backend). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	a21f6f8cb0	agx: Translate load/store_shared Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	f8b9dfbbad	agx: Translate NIR atomics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	2a021b1818	agx: Pack local load/store instructions Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	96904f83b4	agx: Pack global atomics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	eea3674f36	agx: Disallow immediate bases to device_load Lina pointed this out in review. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	6b0ef2b462	agx: Model local loads/stores Aka shared memory or threadgroup memory. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	0d07d27173	agx: Model atomic instructions Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	cf96edff1c	agx: Implement gathers (nir_texop_tg4) Passes dEQP-GLES31.functional.texture.gather.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21264>	2023-02-20 17:27:21 +00:00
Alyssa Rosenzweig	978d3fefa8	agx: Model and pack gathers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21264>	2023-02-20 17:27:21 +00:00
Alyssa Rosenzweig	8dc861dbb5	agx: Lower offsets in NIR Rather than the backend. This way we can handle non-constant offsets as well as constants with a single code path (with the constant offset code subsumed as a special case via NIR's constant folding). This nets us dynamic offset support. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21264>	2023-02-20 17:27:21 +00:00
Alyssa Rosenzweig	0e0825013d	agx: Do more work in agx_preprocess_nir agx_preprocess_nir runs once per shader, whereas agx_optimize_nir runs once per variant. That means we want to do as much work as possible in agx_preprocess_nir to make shader variants as cheap as possible to compile. So, move our standard suite of lowering and optimizing to the preprocess loop, leaving just a single (easy) trip through the optimizer for simple variant processing. Plus, we can remove variables when preprocessing, since we no longer use variables anywhere. We remove them to reduce the RAM and disk cache footprint of shader variants. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21104>	2023-02-20 11:34:58 +00:00
Alyssa Rosenzweig	5b92bd99db	agx: Don't treat clip distances specially We've been using the clip lowering, but it's been broken upstream because of this artefact from the (non-lowered implementation) sneaking in from downstream. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21104>	2023-02-20 11:34:58 +00:00
Alyssa Rosenzweig	888492ecd3	asahi: Vectorize background colour load No point to scalarizing this, the backend can handle the vector load fine since `bfa7ec0aa0` ("agx: Don't scalarize preambles in NIR"). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21327>	2023-02-16 06:36:49 +00:00
Asahi Lina	b39947ee0c	asahi: Drop agx_device.memctx No longer used. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21348>	2023-02-15 22:16:51 +00:00
Asahi Lina	6ad64387dd	asahi: Do not use memctx for pools / meta cache ralloc is not thread-safe, so we can't use dev->memctx for allocating context-specific things without locking. On top of that, we always need to explicitly clean up pools anyway since we need to unref the BOs, so there is no point to using a memctx. And since pools need to be explicitly cleaned up, the meta cache code needs explicit cleanup, so add that and drop memctx from there too. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21348>	2023-02-15 22:16:51 +00:00
Alyssa Rosenzweig	e4731ec335	asahi: Remove default=true on index list values These will cause issues with indirect draws. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21273>	2023-02-13 09:51:42 -05:00
Alyssa Rosenzweig	8e1eee8b5e	asahi: Add XML for VDM memory barriers We'll use these in our implementation of transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21272>	2023-02-13 11:45:03 +00:00
Alyssa Rosenzweig	8e0e68510f	asahi: Add XML for indirect draws Nice and simple. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21272>	2023-02-13 11:45:03 +00:00
Alyssa Rosenzweig	c3b8928b84	asahi: Add XML for indirect dispatch This splits up the CDM commands into their subparts, after which indirect dispatch is straightforward. Also fix the pipeline bits. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21272>	2023-02-13 11:45:03 +00:00
Alyssa Rosenzweig	2c2f189fe7	agx: Write sample mask even with no colour output Needed for discard to work properly, which has visible side effects with occlusion queries. Fixes no_attachment framebuffers together with the next commit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21267>	2023-02-13 11:28:07 +00:00
Alyssa Rosenzweig	e785ae6125	agx: Implement load_helper_invocation Passes dEQP-GLES31.functional.shaders.helper_invocation.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21265>	2023-02-13 11:12:05 +00:00
Alyssa Rosenzweig	6214c9921a	agx: Remove bogus gl_Position assertion It is reasonable not to write gl_Position in a transform feedback program. Fixes rendering of the apitrace of Domekeeper in #7798. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21266>	2023-02-13 10:48:13 +00:00
Alyssa Rosenzweig	eeae9b93de	agx: Fix AGX_MAX_CF_BINDINGS Potentially could be larger with aliasing of component offsets, though that would be silly. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21266>	2023-02-13 10:48:13 +00:00
Alyssa Rosenzweig	fbe8878dcb	agx: Respect component in frag load_input Fixes fails in dEQP-GLES31.functional.separate_shader.random.*. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21266>	2023-02-13 10:48:13 +00:00
Alyssa Rosenzweig	a5d478d17c	agx: Remove unused AGX_MAX_VARYINGS Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21266>	2023-02-13 10:48:13 +00:00
Alyssa Rosenzweig	bfa7ec0aa0	agx: Don't scalarize preambles in NIR Scalarizing preambles in NIR isn't really necessary, we can do it more efficiently in the backend. This makes the final NIR a lot less annoying to read; the backend IR was already nice to read thanks to all the scalarized moves being copypropped. Plus, this is a lot simpler. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	7edd42cbc0	agx: Lower uniform sources with a dedicated pass Move the decision of "can I copyprop this uniform?" from copyprop to a standalone lowering pass. This is more straightforward and will enable the next patch. This has the side effect of sinking load_preamble instructions, for a nice reduction in register pressure. Instruction count increase is from rematerializing some moves, which should be more than balanced out by the reduced register pressure. total instructions in shared programs: 1523285 -> 1523317 (<.01%) instructions in affected programs: 1148 -> 1180 (2.79%) helped: 0 HURT: 13 HURT stats (abs) min: 1.0 max: 4.0 x̄: 2.46 x̃: 2 HURT stats (rel) min: 0.69% max: 7.69% x̄: 3.65% x̃: 2.61% 95% mean confidence interval for instructions value: 1.78 3.14 95% mean confidence interval for instructions %-change: 2.16% 5.15% Instructions are HURT. total bytes in shared programs: 10444532 -> 10444724 (<.01%) bytes in affected programs: 7386 -> 7578 (2.60%) helped: 0 HURT: 13 HURT stats (abs) min: 6.0 max: 24.0 x̄: 14.77 x̃: 12 HURT stats (rel) min: 0.63% max: 7.14% x̄: 3.40% x̃: 2.48% 95% mean confidence interval for bytes value: 10.68 18.85 95% mean confidence interval for bytes %-change: 2.02% 4.78% Bytes are HURT. total halfregs in shared programs: 419444 -> 416434 (-0.72%) halfregs in affected programs: 27080 -> 24070 (-11.12%) helped: 634 HURT: 0 helped stats (abs) min: 1.0 max: 30.0 x̄: 4.75 x̃: 2 helped stats (rel) min: 2.90% max: 54.55% x̄: 13.13% x̃: 8.51% 95% mean confidence interval for halfregs value: -5.08 -4.41 95% mean confidence interval for halfregs %-change: -14.03% -12.23% Halfregs are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	e44a53f5dc	agx: Run DCE twice Needed to combine fsat with vectors due to nir_lower_blend changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	cd8b5427c7	agx: Allow uniform sources on phis The parallel copy lowering has been able to handle uniform sources since `98f0ebf264` ("agx: Pass agx_index to agx_copy"), and uniform sources work fine with phis. It's not super common but there's no need to restrict them. This is a small instruction count win and will greatly simplify the lowering later in this series. total instructions in shared programs: 1523806 -> 1523285 (-0.03%) instructions in affected programs: 17088 -> 16567 (-3.05%) helped: 38 HURT: 1 helped stats (abs) min: 1.0 max: 44.0 x̄: 13.95 x̃: 7 helped stats (rel) min: 0.42% max: 18.64% x̄: 4.73% x̃: 1.26% HURT stats (abs) min: 9.0 max: 9.0 x̄: 9.00 x̃: 9 HURT stats (rel) min: 8.57% max: 8.57% x̄: 8.57% x̃: 8.57% 95% mean confidence interval for instructions value: -17.95 -8.77 95% mean confidence interval for instructions %-change: -6.35% -2.43% Instructions are helped. total bytes in shared programs: 10447658 -> 10444532 (-0.03%) bytes in affected programs: 118850 -> 115724 (-2.63%) helped: 38 HURT: 1 helped stats (abs) min: 6.0 max: 264.0 x̄: 83.68 x̃: 45 helped stats (rel) min: 0.36% max: 16.51% x̄: 4.14% x̃: 1.09% HURT stats (abs) min: 54.0 max: 54.0 x̄: 54.00 x̃: 54 HURT stats (rel) min: 7.30% max: 7.30% x̄: 7.30% x̃: 7.30% 95% mean confidence interval for bytes value: -107.68 -52.62 95% mean confidence interval for bytes %-change: -5.55% -2.13% Bytes are helped. total halfregs in shared programs: 419446 -> 419444 (<.01%) halfregs in affected programs: 29 -> 27 (-6.90%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	f857795e83	agx: Implement barriers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	251f6fb224	agx: Implement compute ID intrinsics These NIR intrinsics map to vectors of special registers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	da91a78ab7	asahi: Identify more compute-related XML Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Hampus Linander	b73b5cc71a	agx: Optimize lower_resinfo for cube maps We can avoid reading both width and height when the texture is a cube map, and we do so more simply by relying on CSE+DCE (Alyssa). Closes: #7541 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Hampus Linander	9ab1c0d83b	agx: Use AGX extr for tex lowering Replaces a number of bit operations by a single extr instruction, optimizing the extraction of the width from the packed value. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Hampus Linander	f3d6524a2d	agx: Add extr instruction to AGX backend Encoding is similar to bfeil; in particular, the immediate has the same encoding as BFI_MASK, hence its reuse. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Alyssa Rosenzweig	e765ec21ec	asahi: Implement custom border colours Implement custom border colours, as required by OpenGL's CLAMP_TO_BORDER and Vulkan with customBorderColor. This uses an extended sampler descriptor, which has space for the custom border values. The trouble is that the border must be packed into an internal interchange format that depends on the original format in a complex way. That said, we're not solving NP-complete problems here, and it passes the tests (dEQP-GLES31.functional.texture.border_clamp.* and piglit texwrap). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:37:02 -05:00
Alyssa Rosenzweig	507ca71f3e	agx/decode: Handle extended samplers These include a border colour field. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:38 -05:00
Alyssa Rosenzweig	afce5be659	agx/decode: Add a data parameter to stateful So we can handle extended samplers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:38 -05:00
Alyssa Rosenzweig	10eaa4a2ec	asahi: Add XML for custom border colours These use extended sampler descriptors. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:24 -05:00
Alyssa Rosenzweig	221311e1e9	agx: Handle constant-offset in address matching Match iadd(x, #y). The format shift will get constant-folded away and, if y is sufficiently small, the constant will be inlined by the AGX backend optimizer. This gets rid of piles of 64-bit arithmetic from lowering UBOs. It probably doesn't matter for perf since that's happening in preamble shaders but it is noisy. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21108>	2023-02-04 08:41:37 +00:00
Alyssa Rosenzweig	c3f7abaaef	agx: Fix storing to varying arrays The offset is in vec4s, not words (unlike the component). This doesn't matter right now since we get everything lowered (offset -> 0) but it will come up if we implement clip distances natively (instead of lowering in FS). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21097>	2023-02-04 08:28:43 +00:00
Alyssa Rosenzweig	13b25a6114	asahi: Don't use 16-bit inputs to 32-bit st_tile The hardware doesn't extend in this case, we need to extend for it. This fixes 32-bit render target formats with lower_mediump_io. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21082>	2023-02-04 08:14:32 +00:00
Alyssa Rosenzweig	6b0322d441	agx: Keep varyings forwarded to texture as fp32 This works around bugs in a LOT of applications, since fp16 texture coordinates are almost never appropriate even though it's a valid implementation of the GLES spec. It also doesn't seem to matter for perf. Code from the Bifrost compiler which implements the same workaround for slightly different reasons. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21082>	2023-02-04 08:14:32 +00:00
Alyssa Rosenzweig	5678fbe010	asahi: Merge fragment control XML Same struct specified twice and merged in the hw. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	3706da1d1a	agx: Support uniform registers as LODs This will avoid regressing moves when we lower sampler LOD bias. Corresponding disassembler change: https://github.com/dougallj/applegpu/pull/22 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20833>	2023-02-04 07:33:08 +00:00

698 commits