fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 11:48:05 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	888492ecd3	asahi: Vectorize background colour load No point to scalarizing this, the backend can handle the vector load fine since `bfa7ec0aa0` ("agx: Don't scalarize preambles in NIR"). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21327>	2023-02-16 06:36:49 +00:00
Asahi Lina	b39947ee0c	asahi: Drop agx_device.memctx No longer used. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21348>	2023-02-15 22:16:51 +00:00
Asahi Lina	6ad64387dd	asahi: Do not use memctx for pools / meta cache ralloc is not thread-safe, so we can't use dev->memctx for allocating context-specific things without locking. On top of that, we always need to explicitly clean up pools anyway since we need to unref the BOs, so there is no point to using a memctx. And since pools need to be explicitly cleaned up, the meta cache code needs explicit cleanup, so add that and drop memctx from there too. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21348>	2023-02-15 22:16:51 +00:00
Alyssa Rosenzweig	e4731ec335	asahi: Remove default=true on index list values These will cause issues with indirect draws. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21273>	2023-02-13 09:51:42 -05:00
Alyssa Rosenzweig	8e1eee8b5e	asahi: Add XML for VDM memory barriers We'll use these in our implementation of transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21272>	2023-02-13 11:45:03 +00:00
Alyssa Rosenzweig	8e0e68510f	asahi: Add XML for indirect draws Nice and simple. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21272>	2023-02-13 11:45:03 +00:00
Alyssa Rosenzweig	c3b8928b84	asahi: Add XML for indirect dispatch This splits up the CDM commands into their subparts, after which indirect dispatch is straightforward. Also fix the pipeline bits. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21272>	2023-02-13 11:45:03 +00:00
Alyssa Rosenzweig	2c2f189fe7	agx: Write sample mask even with no colour output Needed for discard to work properly, which has visible side effects with occlusion queries. Fixes no_attachment framebuffers together with the next commit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21267>	2023-02-13 11:28:07 +00:00
Alyssa Rosenzweig	e785ae6125	agx: Implement load_helper_invocation Passes dEQP-GLES31.functional.shaders.helper_invocation.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21265>	2023-02-13 11:12:05 +00:00
Alyssa Rosenzweig	6214c9921a	agx: Remove bogus gl_Position assertion It is reasonable not to write gl_Position in a transform feedback program. Fixes rendering of the apitrace of Domekeeper in #7798. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21266>	2023-02-13 10:48:13 +00:00
Alyssa Rosenzweig	eeae9b93de	agx: Fix AGX_MAX_CF_BINDINGS Potentially could be larger with aliasing of component offsets, though that would be silly. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21266>	2023-02-13 10:48:13 +00:00
Alyssa Rosenzweig	fbe8878dcb	agx: Respect component in frag load_input Fixes fails in dEQP-GLES31.functional.separate_shader.random.*. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21266>	2023-02-13 10:48:13 +00:00
Alyssa Rosenzweig	a5d478d17c	agx: Remove unused AGX_MAX_VARYINGS Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21266>	2023-02-13 10:48:13 +00:00
Alyssa Rosenzweig	bfa7ec0aa0	agx: Don't scalarize preambles in NIR Scalarizing preambles in NIR isn't really necessary, we can do it more efficiently in the backend. This makes the final NIR a lot less annoying to read; the backend IR was already nice to read thanks to all the scalarized moves being copypropped. Plus, this is a lot simpler. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	7edd42cbc0	agx: Lower uniform sources with a dedicated pass Move the decision of "can I copyprop this uniform?" from copyprop to a standalone lowering pass. This is more straightforward and will enable the next patch. This has the side effect of sinking load_preamble instructions, for a nice reduction in register pressure. Instruction count increase is from rematerializing some moves, which should be more than balanced out by the reduced register pressure. total instructions in shared programs: 1523285 -> 1523317 (<.01%) instructions in affected programs: 1148 -> 1180 (2.79%) helped: 0 HURT: 13 HURT stats (abs) min: 1.0 max: 4.0 x̄: 2.46 x̃: 2 HURT stats (rel) min: 0.69% max: 7.69% x̄: 3.65% x̃: 2.61% 95% mean confidence interval for instructions value: 1.78 3.14 95% mean confidence interval for instructions %-change: 2.16% 5.15% Instructions are HURT. total bytes in shared programs: 10444532 -> 10444724 (<.01%) bytes in affected programs: 7386 -> 7578 (2.60%) helped: 0 HURT: 13 HURT stats (abs) min: 6.0 max: 24.0 x̄: 14.77 x̃: 12 HURT stats (rel) min: 0.63% max: 7.14% x̄: 3.40% x̃: 2.48% 95% mean confidence interval for bytes value: 10.68 18.85 95% mean confidence interval for bytes %-change: 2.02% 4.78% Bytes are HURT. total halfregs in shared programs: 419444 -> 416434 (-0.72%) halfregs in affected programs: 27080 -> 24070 (-11.12%) helped: 634 HURT: 0 helped stats (abs) min: 1.0 max: 30.0 x̄: 4.75 x̃: 2 helped stats (rel) min: 2.90% max: 54.55% x̄: 13.13% x̃: 8.51% 95% mean confidence interval for halfregs value: -5.08 -4.41 95% mean confidence interval for halfregs %-change: -14.03% -12.23% Halfregs are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	e44a53f5dc	agx: Run DCE twice Needed to combine fsat with vectors due to nir_lower_blend changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	cd8b5427c7	agx: Allow uniform sources on phis The parallel copy lowering has been able to handle uniform sources since `98f0ebf264` ("agx: Pass agx_index to agx_copy"), and uniform sources work fine with phis. It's not super common but there's no need to restrict them. This is a small instruction count win and will greatly simplify the lowering later in this series. total instructions in shared programs: 1523806 -> 1523285 (-0.03%) instructions in affected programs: 17088 -> 16567 (-3.05%) helped: 38 HURT: 1 helped stats (abs) min: 1.0 max: 44.0 x̄: 13.95 x̃: 7 helped stats (rel) min: 0.42% max: 18.64% x̄: 4.73% x̃: 1.26% HURT stats (abs) min: 9.0 max: 9.0 x̄: 9.00 x̃: 9 HURT stats (rel) min: 8.57% max: 8.57% x̄: 8.57% x̃: 8.57% 95% mean confidence interval for instructions value: -17.95 -8.77 95% mean confidence interval for instructions %-change: -6.35% -2.43% Instructions are helped. total bytes in shared programs: 10447658 -> 10444532 (-0.03%) bytes in affected programs: 118850 -> 115724 (-2.63%) helped: 38 HURT: 1 helped stats (abs) min: 6.0 max: 264.0 x̄: 83.68 x̃: 45 helped stats (rel) min: 0.36% max: 16.51% x̄: 4.14% x̃: 1.09% HURT stats (abs) min: 54.0 max: 54.0 x̄: 54.00 x̃: 54 HURT stats (rel) min: 7.30% max: 7.30% x̄: 7.30% x̃: 7.30% 95% mean confidence interval for bytes value: -107.68 -52.62 95% mean confidence interval for bytes %-change: -5.55% -2.13% Bytes are helped. total halfregs in shared programs: 419446 -> 419444 (<.01%) halfregs in affected programs: 29 -> 27 (-6.90%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	f857795e83	agx: Implement barriers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	251f6fb224	agx: Implement compute ID intrinsics These NIR intrinsics map to vectors of special registers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	da91a78ab7	asahi: Identify more compute-related XML Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Hampus Linander	b73b5cc71a	agx: Optimize lower_resinfo for cube maps We can avoid reading both width and height when the texture is a cube map, and we do so more simply by relying on CSE+DCE (Alyssa). Closes: #7541 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Hampus Linander	9ab1c0d83b	agx: Use AGX extr for tex lowering Replaces a number of bit operations by a single extr instruction, optimizing the extraction of the width from the packed value. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Hampus Linander	f3d6524a2d	agx: Add extr instruction to AGX backend Encoding is similar to bfeil, in particular the immediate has the same encoding as BFI_MASK, hence its reuse. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Alyssa Rosenzweig	e765ec21ec	asahi: Implement custom border colours Implement custom border colours, as required by OpenGL's CLAMP_TO_BORDER and Vulkan with customBorderColor. This uses an extended sampler descriptor, which has space for the custom border values. The trouble is that the border must be packed into an internal interchange format that depends on the original format in a complex way. That said, we're not solving NP-complete problems here, and it passes the tests (dEQP-GLES31.functional.texture.border_clamp.* and piglit texwrap). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:37:02 -05:00
Alyssa Rosenzweig	507ca71f3e	agx/decode: Handle extended samplers These include a border colour field. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:38 -05:00
Alyssa Rosenzweig	afce5be659	agx/decode: Add a data parameter to stateful So we can handle extended samplers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:38 -05:00
Alyssa Rosenzweig	10eaa4a2ec	asahi: Add XML for custom border colours These use extended sampler descriptors. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:24 -05:00
Alyssa Rosenzweig	221311e1e9	agx: Handle constant-offset in address matching Match iadd(x, #y). The format shift will get constant-folded away and, if y is sufficiently small, the constant will be inlined by the AGX backend optimizer. This gets rid of piles of 64-bit arithmetic from lowering UBOs. It probably doesn't matter for perf since that's happening in preamble shaders but it is noisy. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21108>	2023-02-04 08:41:37 +00:00
Alyssa Rosenzweig	c3f7abaaef	agx: Fix storing to varying arrays The offset is in vec4s, not words (unlike the component). This doesn't matter right now since we get everything lowered (offset -> 0) but it will come up if we implement clip distances natively (instead of lowering in FS). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21097>	2023-02-04 08:28:43 +00:00
Alyssa Rosenzweig	13b25a6114	asahi: Don't use 16-bit inputs to 32-bit st_tile The hardware doesn't extend in this case, we need to extend for it. This fixes 32-bit render target formats with lower_mediump_io. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21082>	2023-02-04 08:14:32 +00:00
Alyssa Rosenzweig	6b0322d441	agx: Keep varyings forwarded to texture as fp32 This works around bugs in a LOT of applications, since fp16 texture coordinates are almost never appropriate even though it's a valid implementation of the GLES spec. It also doesn't seem to matter for perf. Code from the Bifrost compiler which implements the same workaround for slightly different reasons. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21082>	2023-02-04 08:14:32 +00:00
Alyssa Rosenzweig	5678fbe010	asahi: Merge fragment control XML Same struct specified twice and merged in the hw. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	3706da1d1a	agx: Support uniform registers as LODs This will avoid regressing moves when we lower sampler LOD bias. Corresponding disassembler change: https://github.com/dougallj/applegpu/pull/22 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20833>	2023-02-04 07:33:08 +00:00
Alyssa Rosenzweig	231561d53a	asahi: Correct alignment for USC Uniform packets We only need 4 byte alignment, not 8 bytes. This isn't a big difference in practice, but it probably reduces padding in some cases. More importantly, it corrects our XML to match what the hardware actually does, which is great. (There is exactly enough room for a 40-bit address with 4 byte alignment.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	b0f1964771	asahi: Strengthen agx_usc_uniform contract Check the size explicitly, instead of just implicitly in the GenXML pack: it is the responsibility of the caller to split up larger uploads. While this is nominally more complicated, agx_usc_uniform is called in the draw hot path whereas the actual splitting decision can usually be done at compile-time. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	ea38709345	asahi: Fix encoding of uniform size Only 6-bits, with zero=64 like a groups() encoding. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	9b2dc92228	ail: Test 63x63 cube map This has a subtle interaction with page-aligned layers. Written while debugging dEQP-GLES3.functional.texture.filtering.cube.combinations.nearest_nearest_repeat_clamp Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Alyssa Rosenzweig	294351ff77	ail: Test mipmapped_z behaviour The mipmapped_z = true case is checked against Metal, the false case is smoke testing the old behaviour (which is still used for 2D arrays). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Alyssa Rosenzweig	c2bf66ab87	ail: Add layout->mipmapped_z input For 3D images, the full miptree depends on the depth of the image, in contrast to 2D arrays. We need to account for this to calculate the correct layer strides. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Alyssa Rosenzweig	6908a0dece	asahi: Run nir_lower_fragcolor during preprocessing This pass needs to run early (because it depends on early I/O), but it doesn't actually need the shader key. Why not? If we overestimate the number of render targets, extra store_output intrinsics will be generated, but they will be deleted by AGX tilebuffer lowering later. Note we'll probably want something smarter than this for fragment epilogues in the future to avoid piles of unnecessary moves. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21065>	2023-02-03 15:03:06 +00:00
Asahi Lina	ed6edc07e4	asahi: Split off macOS support into its own file All the ifdef __APPLE__ is getting really silly. Let's split off the macOS UAPI abstraction into its own file, so we can have parallel implementations. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21058>	2023-02-02 11:45:52 +00:00
Asahi Lina	2e51ccac82	asahi: Split off common BO code into its own file In preparation for splitting off the macOS backend implementation into its own file, pull out the shared BO code from agx_device.c into agx_bo.c. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21058>	2023-02-02 11:45:52 +00:00
Alyssa Rosenzweig	ea285aea8d	asahi: Use non-UAPI specific BO create flags So we're not tied to the macOS or Linux UAPIs and are not translating awkwardly from one to the other when creating BOs. They're not quite equivalent -- macOS doesn't include writeback information in this flag field, and Linux doesn't have an executable flag. (Maybe we should add one, though? Then we can enforce W^X.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21058>	2023-02-02 11:45:52 +00:00
Alyssa Rosenzweig	5e14792200	agx: Centralize texture lowering Lowering buffer textures will interact with multiple of our existing lowerings, and it's convenient to have it all in one place. This also keeps the pass ordering dependencies centralized. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21060>	2023-02-02 06:39:42 +00:00
Alyssa Rosenzweig	0f087b56d0	agx: Bump preamble_storage_size to 512 nir_opt_preamble is now aware of the internal uniforms we insert, so it can use the whole uniform file available to it. This lets us push more (all?) uniform loads in Dolphin ubershaders to the preamble. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20562>	2023-01-31 17:02:34 +00:00
Alyssa Rosenzweig	02fe57b7e9	agx: Lower system values in NIR in the driver To comply with The Ekstrand Rule. AGX has a large number of "uniform registers" available. These may be loaded with arbitrary ranges of GPU memory by the driver, or they can be written by the preamble shader. Currently, the compiler runs nir_opt_preamble on the first half of the uniform file, and then translates NIR sysvals to moves from the second half of the uniform file, passing back a uniform->sysval map for the GL driver to respect. This has (at least) two issues: * Since nir_opt_preamble runs before gathering sysvals, it has to assume the maximum number of sysvals are pushed, which can prevent it from moving some computation to the preamble due to running out of partitioned uniform registers. This is a problem for Dolphin's ubershaders, though it's unclear how much it matters for Dolphin perf. * This violates The Ekstrand Rule and apparently will be a problem for our Vulkan driver. I'm just a compiler+GL girl, so I wouldn't know. To fix this, we invert the order of operations. At the end of this series, we instead lower NIR system values to NIR load_preamble instructions in the GL driver. The compiler just translates directly to uniform registers reads. The Vulkan driver will need its own version of this code, but maybe it can do something clever and descriptor set aware. This means that there will already be some load_preamble instructions when nir_opt_preamble runs, so I've made minor changes to nir_opt_preamble to handle that gracefully. This is a bit lazy... The alternative is to introduce a `load_uniform_agx` intrinsic which `load_preamble` gets lowered to trivially. But that's another pass over the IR (and due to AGX's shader variant hell I'm sensitive to backend compile time) and it would be more complicated than what's implemented here. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Ella Stanforth <ella@iglunix.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20562>	2023-01-31 17:02:34 +00:00
Alyssa Rosenzweig	4a675f93b9	asahi: Omit extra call to clock_gettime It's cheap but it isn't free. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20973>	2023-01-29 16:26:48 +00:00
Alyssa Rosenzweig	862bf420a9	asahi: Handle sampler->compare_mode Instead of smashing unconditionally to 1. Not sure if this fixes anything but it gets rid of an unknown at least. Possibly slightly faster. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20561>	2023-01-13 19:43:14 +00:00
Alyssa Rosenzweig	61c7e1bf48	agx: Peephole select after opt_preamble Reduces control flow in Dolphin uber shaders, which saves us a few cycles. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20597>	2023-01-13 00:43:04 +00:00
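
Commit ea38709345 above describes the uniform size field as 6 bits wide, with zero standing for 64. The Python sketch below illustrates only that wrap-around encoding. The function names and the range checks are assumptions made for this illustration and are not taken from the Mesa source, and the unit the size is counted in is deliberately left unspecified because the commit message does not state it.

```python
FIELD_BITS = 6                      # width of the size field, per the commit message
FIELD_MASK = (1 << FIELD_BITS) - 1  # 0x3F
MAX_SIZE = 1 << FIELD_BITS          # 64, which is stored as zero


def encode_size(size: int) -> int:
    """Pack a size in 1..64 into a 6-bit field (hypothetical helper)."""
    if not 1 <= size <= MAX_SIZE:
        # Larger uploads would have to be split by the caller.
        raise ValueError(f"size {size} does not fit a zero=64 6-bit field")
    # 64 wraps around to 0; every other value is stored unchanged.
    return size & FIELD_MASK


def decode_size(field: int) -> int:
    """Unpack a 6-bit field back into a size in 1..64 (hypothetical helper)."""
    if not 0 <= field <= FIELD_MASK:
        raise ValueError(f"field {field} is wider than {FIELD_BITS} bits")
    return MAX_SIZE if field == 0 else field
```

Treating zero as 64 lets the field cover 1..64 instead of 0..63, at the cost of being unable to express an empty upload.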

681 commits