fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 21:58:10 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	7bb8112fd1	agx: Refactor vector creation agx_vec4 is unused, drop in, and split out the common logic since we'll use it in a new helper. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	037609f1dc	agx: Constify agx_print Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	a9c5956f2f	agx: Inline 16-bit load/store offsets Most integer immediates are only 8-bit, but load/store instructions allow their immediate offsets to be 16-bit instead. Take advantage of this in the optimizer. This eliminates 36% of the instructions in dEQP-GLES31.functional.ssbo.layout.random.all_shared_buffer.36, a fitting percentage. Insignificant effect on dEQP-GLES31.functional.ssbo.* performance... Only a small % of our compile-time pie is actually spent in the backend anyway (as opposed to NIR passes or GLSL IR). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	c9728b41d5	agx: Factor out allows_16bit_immediate check The optimizer needs this information to inline immediates effectively. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	445ca949cd	agx: Clean up after lowering address arithmetic This avoids creating silly preambles that don't actually do anything except push a constant that we could've inlined for cheaper anyway, since nir_opt_preamble's cost model is sensitive to e.g. constant folding. This avoids a pointless preamble in split-hell. As a nice bonus, this also improves compile-time on address-heavy shaders. With a release build, CPU time in dEQP-GLES31.functional.ssbo.* reduces from 12.87s to 10.77... a 16% improvement is nothing to sneeze at. shader-db results are mostly noise. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	4b1f4b86ea	agx: Add AGX_MESA_DEBUG=nopreamble option Useful both for ruling out issues with shader preambles as well as (in some cases) making for a nicer reading experience of the compiled assembly. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	c22a18c9af	agx: Don't write sample mask from preambles It doesn't make sense, they're basically little compute kernel environments. Noticed while debugging dEQP-GLES31.functional.fbo.no_attachments.multisample.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21710>	2023-03-05 08:20:09 +00:00
Alyssa Rosenzweig	8bb40ce4ad	agx: Fix 2D MSAA array texture register allocation Sample index and layer index are both 16-bits, even though they are zero extended for compiler simplicity in some cases. In particular this means that 2D MSAA arrays consume 6 half-regs for their coordinates, not 8. This is what the IR translation (actually agx_nir_lower_texture) produces, we just need to fix the calculation in agx_read_registers to agree. Fixes validation failure in tests like dEQP-GLES31.functional.texture.multisample.samples_4.use_texture_color_2d_array Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21708>	2023-03-05 08:06:43 +00:00
Alyssa Rosenzweig	3032e3ad23	agx: Mask shifts in the backend This gives our shifts SM5 behaviour at the cost of a little extra ALU. That way, we match NIR's shifts. This fixes unsoundness of GLSL expressions like "a << (b & 31)", where the & would mistakenly get optimized away. Closes: #8181 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reported-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21673>	2023-03-05 07:52:22 +00:00
Alyssa Rosenzweig	f4e2b22646	asahi: Advertise dual-source blending This is handled entirely in common code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21545>	2023-03-05 07:38:36 +00:00
Asahi Lina	1dd872ec17	asahi: Assert on TIB strides > 64 These just don't seem to work. macOS falls back to eMRT here... dEQP-GLES3.functional.draw_buffers_indexed.random.max_implementation_draw_buffers.13 from Fail -> Crash. Proper solution will come when we implement eMRT later on. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21705>	2023-03-04 10:58:10 -05:00
Asahi Lina	26c51bb8d8	asahi: clang-format the world again Some things were missed (like winsys) and there's still some bad include orders lying around and some other randomness. We should set up CI checks for this soon... ^^;; Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21687>	2023-03-03 22:55:59 +00:00
Asahi Lina	0a5f3556a1	asahi: Fix device fd leak in agx_close_device I'm not sure if this was always broken downstream or just got dropped at some point, but it's definitely UAPI-agnostic and missing now that we have all the non-UAPI bits upstream. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21677>	2023-03-03 21:11:47 +00:00
Alyssa Rosenzweig	f083e1807d	asahi/decode: Handle VDM barriers We emit these now (for transform feedback). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21675>	2023-03-03 20:54:18 +00:00
Asahi Lina	798fc2730b	asahi: Add agx_debug_fault() helper We expect to forward GPU fault information to userspace. Since Mesa can get that information, we can look up the fault address to log what was the containing or nearest BO. Add a helper for that, so it can be called from the driver. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21662>	2023-03-03 00:28:48 +00:00
Asahi Lina	240e9dc5dc	asahi: Add APIs for DMA-BUF sync file import/export These are generic ioctls, so it is safe to add them now. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21662>	2023-03-03 00:28:48 +00:00
Asahi Lina	d610f40e17	asahi: Implement Linux driver scaffolding, sans UAPI With macOS support out of the way, we can start implementing a lot of the Linux driver interface and bookkeeping without actually adding the UAPI proper. Let's do that to reduce the size of the UAPI patchset. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21662>	2023-03-03 00:28:48 +00:00
Asahi Lina	942d9cc17b	asahi: Align device submission API with upcoming UAPI Nothing implemented, but this lets us get the batch tracking bits in, including explicit sync/DMA-BUF integration which uses generic ioctls. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21662>	2023-03-03 00:28:48 +00:00
Asahi Lina	7f2e24d2ef	asahi: Add nocluster,sync,stats debug flags These are only useful with the upcoming Linux UAPI, but there's no harm in getting the debug scaffolding in now. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21662>	2023-03-03 00:28:48 +00:00
Asahi Lina	afe134a49c	asahi: Drop macOS backend This might be useful in the future, but it is best reimplemented in terms of the upcoming Linux UAPI instead of having parallel codepaths. Let's drop it. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21662>	2023-03-03 00:28:48 +00:00
Asahi Lina	70169c7488	asahi: Identify USC cache invalidate Signed-off-by: Asahi Lina <lina@asahilina.net> Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21538>	2023-03-01 01:04:29 +00:00
Asahi Lina	860ac5c149	asahi: Add readonly BO flag Signed-off-by: Asahi Lina <lina@asahilina.net> Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21538>	2023-03-01 01:04:29 +00:00
Asahi Lina	0498ad3e26	asahi: Add BO_SHAREABLE flag Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21538>	2023-03-01 01:04:29 +00:00
Alyssa Rosenzweig	760f367386	agx: Lower sampler LOD bias G13 does not support sampler descriptor LOD biasing, so this needs to be lowered to shader code for APIs that require this functionality. Add an option to do this lowering while doing our other backend texture lowerings. This generates lod_bias_agx texture instructions which the driver is expected to lower according to its binding model. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21276>	2023-02-27 02:35:41 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Alyssa Rosenzweig	029c686c6d	asahi: Implement color masks with masked stores Blend states can require masking colour. Currently, this is handled by nir_lower_blend, which lowers masks to a read-modify-write operation as required on Mali hardware. However, our "tilebuffer store" instruction supports a write mask, allowing us to write only a subset of channels to the tilebuffer. It's more efficient to use that than to emit pointless tilebuffer loads. Note that even without tilebuffer loads, non-opaque masks don't work with opaque pass types. Here, we handle this with a translucent pass type, which gets HSR to do the right thing and is consistent with the pass type used previously. However, it's a bit heavy handed -- Apple manages to use an opaque pass type with masking but with some unknown HSR fields twiddled. IMO reverse-engineering those details shouldn't block this because this gets us closer to optimal (just not all the way there) and is strictly better than what we had before. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21431>	2023-02-21 08:10:15 +00:00
Alyssa Rosenzweig	3084e6e689	agx: Add agx_internal_format_supports_mask helper Not all formats can be masked, add a query to check which can be. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21431>	2023-02-21 08:10:15 +00:00
Alyssa Rosenzweig	5e031867fe	agx: Handle ssa_undef as zero Masked stores may result in undefs after optimization. Rather than call lower_undef_to_zero late (but get no benefit), we may as well handle ourselves to prepare for proper undef support down the line. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21431>	2023-02-21 08:10:15 +00:00
Alyssa Rosenzweig	eab4d6a96f	agx: Add and use agx_nir_ssa_index helper Common subexpression that we'll repeat once more in the next patch. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21431>	2023-02-21 08:10:15 +00:00
Alyssa Rosenzweig	e93a221024	agx: Handle group_memory_barrier A combination of control_barrier + memory_barrier but it's always seen with those. This would be safer with scoped barriers... Fixes dEQP-GLES31.functional.synchronization.inter_invocation.ssbo Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:40 +00:00
Alyssa Rosenzweig	e9cec96633	agx: Implement b2b32 Shows up with store_shared. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:40 +00:00
Alyssa Rosenzweig	955797bb00	agx: Pack local atomics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	14f546726e	agx: Lower shared memory offsets to 16-bit Per the hardware requirement. This simplifies instruction selection (it avoids the need to constant fold u2u16 in the backend). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	a21f6f8cb0	agx: Translate load/store_shared Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	f8b9dfbbad	agx: Translate NIR atomics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	2a021b1818	agx: Pack local load/store instructions Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	96904f83b4	agx: Pack global atomics Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	eea3674f36	agx: Disallow immediate bases to device_load Lina pointed this out in review. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	6b0ef2b462	agx: Model local loads/stores Aka shared memory or threadgroup memory. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	0d07d27173	agx: Model atomic instructions Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21326>	2023-02-20 18:50:39 +00:00
Alyssa Rosenzweig	cf96edff1c	agx: Implement gathers (nir_texop_tg4) Passes dEQP-GLES31.functional.texture.gather.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21264>	2023-02-20 17:27:21 +00:00
Alyssa Rosenzweig	978d3fefa8	agx: Model and pack gathers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21264>	2023-02-20 17:27:21 +00:00
Alyssa Rosenzweig	8dc861dbb5	agx: Lower offsets in NIR Rather than the backend. This way we can handle non-constant offsets as well as constants with a single code path (with the constant offset code subsumed as a special case via NIR's constant folding). This nets us dynamic offset support. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21264>	2023-02-20 17:27:21 +00:00
Alyssa Rosenzweig	0e0825013d	agx: Do more work in agx_preprocess_nir agx_preprocess_nir runs once per shader, whereas agx_optimize_nir runs once per variant. That means we want to do as much work as possible in agx_preprocess_nir to make shader variants as cheap as possible to compiler. So, move our standard suite of lowering and optimizing to the preprocess loop, leaving just a single (easy) trip through the optimizer for simple variant processing. Plus, we can remove variables when preprocessing, since we no longer use variables anywhere. We remove them to reduce the RAM and disk cache footprint of shader variants. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21104>	2023-02-20 11:34:58 +00:00
Alyssa Rosenzweig	5b92bd99db	agx: Don't treat clip distances specially We've been using the clip lowering, but it's been broken upstream because of this artefact from the (non-lowered implementation) sneaking in from downstream. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21104>	2023-02-20 11:34:58 +00:00
Alyssa Rosenzweig	888492ecd3	asahi: Vectorize background colour load No point to scalarizing this, the background can handle the vector load fine since `bfa7ec0aa0` ("agx: Don't scalarize preambles in NIR"). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21327>	2023-02-16 06:36:49 +00:00
Asahi Lina	b39947ee0c	asahi: Drop agx_device.memctx No longer used. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21348>	2023-02-15 22:16:51 +00:00
Asahi Lina	6ad64387dd	asahi: Do not use memctx for pools / meta cache ralloc is not thread-safe, so we can't use dev->memctx for allocating context-specific things without locking. On top of that, we always need to explicitly clean up pools anyway since we need to unref the BOs, so there is no point to using a memctx. And since pools need to be explicitly cleaned up, the meta cache code needs explicit cleanup, so add that and drop memctx from there too. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21348>	2023-02-15 22:16:51 +00:00
Alyssa Rosenzweig	e4731ec335	asahi: Remove default=true on index list values These will cause issues with indirect draws. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21273>	2023-02-13 09:51:42 -05:00
Alyssa Rosenzweig	8e1eee8b5e	asahi: Add XML for VDM memory barriers We'll use these in our implementation of transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21272>	2023-02-13 11:45:03 +00:00

1 2 3 4 5 ...

726 commits