fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 16:58:10 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	231561d53a	asahi: Correct alignment for USC Uniform packets We only need 4 byte alignment, not 8 bytes. This isn't a big difference in practice, but it probably reduces padding in some cases. More importantly, it corrects our XML to match what the hardware actually does, which is great. (There is exactly enough room for a 40-bit address with 4 byte alignment.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	b0f1964771	asahi: Strengthen agx_usc_uniform contract Check the size explicitly, instead of just implicitly in the GenXML pack: it is the responsibility of the caller to split up larger uploads. While this is nominally more complicated, agx_usc_uniform is called in the draw hot path whereas the actual splitting decision can usually be done at compile-time. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	ea38709345	asahi: Fix encoding of uniform size Only 6-bits, with zero=64 like a groups() encoding. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	9b2dc92228	ail: Test 63x63 cube map This has a subtle interaction with page-aligned layers. Written while debugging dEQP-GLES3.functional.texture.filtering.cube.combinations.nearest_nearest_repeat_clamp Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Alyssa Rosenzweig	294351ff77	ail: Test mipmapped_z behaviour The mipmapped_z = true case is checked against Metal, the false case is smoke testing the old behaviour (which is still used for 2D arrays). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Alyssa Rosenzweig	c2bf66ab87	ail: Add layout->mipmapped_z input For 3D images, the full miptree depends on the depth of the image, in contrast to 2D arrays. We need to account for this to calculate the correct layer strides. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Alyssa Rosenzweig	6908a0dece	asahi: Run nir_lower_fragcolor during preprocessing This pass needs to run early (because it depends on early I/O), but it doesn't actually need the shader key. Why not? If we overestimate the number of render targets, extra store_output intrinsics will be generated, but they will be deleted by AGX tilebuffer lowering later. Note we'll probably want something smarter than this for fragment epilogues in the future to avoid piles of unnecessary moves. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21065>	2023-02-03 15:03:06 +00:00
Asahi Lina	ed6edc07e4	asahi: Split off macOS support into its own file All the ifdef __APPLE__ is getting really silly. Let's split off the macOS UAPI abstraction into its own file, so we can have parallel implementations. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21058>	2023-02-02 11:45:52 +00:00
Asahi Lina	2e51ccac82	asahi: Split off common BO code into its own file In preparation for splitting off the macOS backend implementation into its own file, pull out the shared BO code from agx_device.c into agx_bo.c. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21058>	2023-02-02 11:45:52 +00:00
Alyssa Rosenzweig	ea285aea8d	asahi: Use non-UAPI specific BO create flags So we're not tied to the macOS or Linux UAPIs and are not translating awkwardly from one to the other when creating BOs. They're not quite equivalent -- macOS doesn't include writeback information in this flag field, and Linux doesn't have a executable flag. (Maybe we should add one, though? Then we can enforce W^X.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21058>	2023-02-02 11:45:52 +00:00
Alyssa Rosenzweig	5e14792200	agx: Centralize texture lowering Lowering buffer textures will interact with multiple of our existing lowerings, and it's convenient to have it all in one place. This also keeps the pass ordering dependencies centralized. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21060>	2023-02-02 06:39:42 +00:00
Alyssa Rosenzweig	0f087b56d0	agx: Bump preamble_storage_size to 512 nir_opt_preamble is now aware of the internal uniforms we insert, so it can use the whole uniform file available to it. This lets us push more (all?) uniform loads in Dolphin ubershaders to the preamble. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20562>	2023-01-31 17:02:34 +00:00
Alyssa Rosenzweig	02fe57b7e9	agx: Lower system values in NIR in the driver To comply with The Ekstrand Rule. AGX has a large number of "uniform registers" available. These may be loaded with arbitrary ranges of GPU memory by the driver, or they can be written by the preamble shader. Currently, the compiler runs nir_opt_preamble on the first half of the uniform file, and then translates NIR sysvals to moves from the second half of the uniform file, passing back a uniform->sysval map for the GL driver to respect. This has (at least) two issues: * Since nir_opt_preamble runs before gathering sysvals, it has to assume the maximum number of sysvals are pushed, which can prevent it from moving some computation to the preamble due to running out of partitioned uniform registers. This is a problem for Dolphin's ubershaders, though it's unclear how much it matters for Dolphin perf. * This violates The Ekstrand Rule and apparently will be a problem for our Vulkan driver. I'm just a compiler+GL girl, so I wouldn't know. To fix this, we invert the order of operations. At the end of this series, we instead lower NIR system values to NIR load_preamble instructions in the GL driver. The compiler just translates directly to uniform registers reads. The Vulkan driver will need its own version of this code, but maybe it can do something clever and descriptor set aware. This means that there will already be some load_preamble instructions when nir_opt_preamble runs, so I've made minor changes to nir_opt_preamble to handle that gracefully. This is a bit lazy... The alternative is to introduce a `load_uniform_agx` intrinsic which `load_preamble` gets lowered to trivially. But that's another pass over the IR (and due to AGX's shader variant hell I'm sensitive to backend compile time) and it would be more complicated than what's implemented here. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Ella Stanforth <ella@iglunix.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20562>	2023-01-31 17:02:34 +00:00
Alyssa Rosenzweig	4a675f93b9	asahi: Omit extra call to clock_gettime It's cheap but it isn't free. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20973>	2023-01-29 16:26:48 +00:00
Alyssa Rosenzweig	862bf420a9	asahi: Handle sampler->compare_mode Instead of smashing unconditionally to 1. Not sure if this fixes anything but it gets rid of an unknown at least. Possibly slightly faster. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20561>	2023-01-13 19:43:14 +00:00
Alyssa Rosenzweig	61c7e1bf48	agx: Peephole select after opt_preamble Reduces control flow in Dolphin uber shaders, which saves us a few cycles. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20597>	2023-01-13 00:43:04 +00:00
Alyssa Rosenzweig	4311c636c2	agx: Don't crash trying to encoding minifloats Fixes assertion fails in piglit isinf-and-isnan, which uses a constant infinity, which has an out-of-bounds mantissa (but the function contract says that's fine and we just return something undefined.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20563>	2023-01-11 21:14:21 +00:00
Alyssa Rosenzweig	7859b531c2	agx: Use BITFIELD64_BIT for outputs_written Fix by inspection. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20563>	2023-01-11 21:14:20 +00:00
Alyssa Rosenzweig	93c40e3353	agx: Wire up nir_intrinsic_store_agx Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20558>	2023-01-11 20:36:51 +00:00
Alyssa Rosenzweig	baac17131d	agx: Remove load_global(_constant) support Now lowered in NIR to better instructions than we were selecting. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20558>	2023-01-11 20:36:51 +00:00
Alyssa Rosenzweig	ac3272be84	agx: Use load_global_constant for UBO lowering Rely on the common address arithmetic optimizations. We don't need the special formats for UBO loads anyway, so this is simpler and optimizes out the ushr. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20558>	2023-01-11 20:36:51 +00:00
Alyssa Rosenzweig	3a6a5281b3	agx: Lower global loads/stores to AGX versions This lets us do all the needed address arithmetic in a central place. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20558>	2023-01-11 20:36:51 +00:00
Alyssa Rosenzweig	90dea84ef6	agx: Remove dead arg Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20559>	2023-01-10 05:19:25 +00:00
Alyssa Rosenzweig	17d1559036	agx: Use i0/i1 variables Now that we've defined them. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20559>	2023-01-10 05:19:25 +00:00
Alyssa Rosenzweig	1e61f13ffd	agx: Get rid of emit_alu_bool Deduplicate lots of cases. Splitting this out was silly, bools aren't that special. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20559>	2023-01-10 05:19:25 +00:00
Alyssa Rosenzweig	5b25ee6cc7	agx: Use agx_subdivide_to for umul_high Helpers! Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20559>	2023-01-10 05:19:25 +00:00
Alyssa Rosenzweig	f6c5b2a5a3	agx: Remove dead code Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20559>	2023-01-10 05:19:25 +00:00
Alyssa Rosenzweig	9b67afb55d	agx: Fix missing #include Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20569>	2023-01-10 00:17:12 +00:00
Alyssa Rosenzweig	b4d8be165b	asahi: Implement ARB_texture_mirror_clamp_to_edge Guessing the enum value, passes texwrap piglit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20560>	2023-01-09 23:58:52 +00:00
Alyssa Rosenzweig	0e2d786579	asahi: Implement GL_CLAMP natively Turns out there's a hardware mode for this. Apple's GL driver uses this. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20560>	2023-01-09 23:58:52 +00:00
Alyssa Rosenzweig	fa96dfb2d7	agx: Lower discard to zs_emit when zs_emit used It is invalid to use both sample_mask and zs_emit in the same shader. We'll need to do something similar for sample mask writes. Fixes Dolphin ubershaders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:23 -05:00
Alyssa Rosenzweig	ebe40b15ea	agx: Fix discard with MRT The exact semantics of sample_mask aren't quite clear to me yet, but executing multiple sample_mask instructions seems to raise a fault :\| Fixes SuperTuxKart's advanced renderer. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:23 -05:00
Alyssa Rosenzweig	2b5519e865	agx: Introduce "no_varyings" instruction Must be used at the end of a vertex shader that does NOT write any varyings, has rasterizer discard enabled, and is run only for its side effects. The encoding looks like st_var, but I don't know what this actually does. I just know that the GPU faults if this is omitted. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:23 -05:00
Alyssa Rosenzweig	33e3418cfe	agx: Consider "stop" a control flow instruction ...and therefore it needs to be after a "logical end". This means that "after_block_logical" will do the right thing for the last block. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	f6aa43cf42	agx: Optimize waits locally Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	a01680b979	agx: Remove logical_end later So we can use after_block_logical in the wait insertion pass. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	73ac73308b	agx: Validate widths of vectors Check the invariant that the widths of vectors in the IR are consistent, by checking that write registers and read registers match up between the writers and readers respectively. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	6685dba75e	agx: Add agx_read_registers helper To be used for inserting waits post-RA. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	e6631ba5af	agx: Compact st_tile argument per mask Otherwise the number of read registers won't match the vector we input, which will trigger validation errors. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	545a3eb601	agx: Insert waits post-RA This is the first step towards reducing stalling. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	463744e4f9	agx: Pack texture scoreboard slots Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	01f948ee13	agx: Pack wait instructions For different scoreboard slots. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	640afb33b9	agx: Remove unused idiv const func This was used for instancing, but has been unused since `8dcf7648f1` ("agx: Lower VBOs in NIR") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	44925a142e	agx: Use metadata for VS varying linking Rather than variables. This gets rid of all backend nir_variable use. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	617f2f7a02	agx: Don't use nir_variable when gathering flat varyings Walk the IR instead. This happens when preprocessing so it doesn't really matter, but it complicates the nir_variable audit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	d00a43f682	agx: Hash agx_instr faster Prior to this change, agx_opt_cse is our most expensive backend pass, due to the time spent hashing instructions. hash_instr was calling into XXH32 a massive number of times, often to hash only a single bit. It's much faster to hash entire blocks of memory at a time. Optimize to do just that. With this change, agx_opt_cse is now cheaper than instruction selection as it should be. No shader-db changes (except CPU time decrease). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	f44afe766f	agx: Use texture write mask We do need to use undefs instead of zeroes in this internal collect. While this vector gets copypropped out, it'd cause us to fail compilation if noopt is on. Fix that. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	7284e4967c	agx: Note that textures clobber even masked Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00
Alyssa Rosenzweig	ddbec45b6f	agx: Plumb in store instruction This will be used for compute kernels (and transform feedback) in the (near) future. For now, let's get the opcode plumbed in the backend to reduce some of the rebase pain. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:49:22 -05:00

... 12 13 14 15 16 ...

1298 commits