fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 22:00:13 +01:00

Author	SHA1	Message	Date
Konstantin Seurer	10ac51a52b	ac/llvm: Fix validation error with global io Fixes: `afd645f057` ("ac/llvm: remove LLVMBuildGEP usages") Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20521>	2023-02-05 12:16:05 +00:00
Konstantin Seurer	55175cd13c	radv/llvm: Use the shader names as module name This makes it easier to identify which (if any) shaders fail validation. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20521>	2023-02-05 12:16:05 +00:00
Konstantin Seurer	877e150ec8	radv/rq: Use 16 stack entries if there is only one ray query Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21120>	2023-02-05 11:51:42 +00:00
Asahi Lina	4ca4a05627	meson: Fix Asahi build on macOS !19950 introduced a dependency between NIR and Vulkan headers, and the Vulkan headers try to include X11 headers we cannot find on macOS. Disable this (we have no plans for Vulkan on the macOS testing platform anyway). Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21059>	2023-02-05 09:15:48 +00:00
Alyssa Rosenzweig	bfa7ec0aa0	agx: Don't scalarize preambles in NIR Scalarizing preambles in NIR isn't really necessary, we can do it more efficiently in the backend. This makes the final NIR a lot less annoying to read; the backend IR was already nice to read thanks to all the scalarized moves being copypropped. Plus, this is a lot simpler. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	7edd42cbc0	agx: Lower uniform sources with a dedicated pass Move the decision of "can I copyprop this uniform?" from copyprop to a standalone lowering pass. This is more straightforward and will enable the next patch. This has the side effect of sinking load_preamble instructions, for a nice reduction in register pressure. Instruction count increase is from rematerializing some moves, which should be more than balanced out by the reduced register pressure. total instructions in shared programs: 1523285 -> 1523317 (<.01%) instructions in affected programs: 1148 -> 1180 (2.79%) helped: 0 HURT: 13 HURT stats (abs) min: 1.0 max: 4.0 x̄: 2.46 x̃: 2 HURT stats (rel) min: 0.69% max: 7.69% x̄: 3.65% x̃: 2.61% 95% mean confidence interval for instructions value: 1.78 3.14 95% mean confidence interval for instructions %-change: 2.16% 5.15% Instructions are HURT. total bytes in shared programs: 10444532 -> 10444724 (<.01%) bytes in affected programs: 7386 -> 7578 (2.60%) helped: 0 HURT: 13 HURT stats (abs) min: 6.0 max: 24.0 x̄: 14.77 x̃: 12 HURT stats (rel) min: 0.63% max: 7.14% x̄: 3.40% x̃: 2.48% 95% mean confidence interval for bytes value: 10.68 18.85 95% mean confidence interval for bytes %-change: 2.02% 4.78% Bytes are HURT. total halfregs in shared programs: 419444 -> 416434 (-0.72%) halfregs in affected programs: 27080 -> 24070 (-11.12%) helped: 634 HURT: 0 helped stats (abs) min: 1.0 max: 30.0 x̄: 4.75 x̃: 2 helped stats (rel) min: 2.90% max: 54.55% x̄: 13.13% x̃: 8.51% 95% mean confidence interval for halfregs value: -5.08 -4.41 95% mean confidence interval for halfregs %-change: -14.03% -12.23% Halfregs are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	e44a53f5dc	agx: Run DCE twice Needed to combine fsat with vectors due to nir_lower_blend changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Alyssa Rosenzweig	cd8b5427c7	agx: Allow uniform sources on phis The parallel copy lowering has been able to handle uniform sources since `98f0ebf264` ("agx: Pass agx_index to agx_copy"), and uniform sources work fine with phis. It's not super common but there's no need to restrict them. This is a small instruction count win and will greatly simplify the lowering later in this series. total instructions in shared programs: 1523806 -> 1523285 (-0.03%) instructions in affected programs: 17088 -> 16567 (-3.05%) helped: 38 HURT: 1 helped stats (abs) min: 1.0 max: 44.0 x̄: 13.95 x̃: 7 helped stats (rel) min: 0.42% max: 18.64% x̄: 4.73% x̃: 1.26% HURT stats (abs) min: 9.0 max: 9.0 x̄: 9.00 x̃: 9 HURT stats (rel) min: 8.57% max: 8.57% x̄: 8.57% x̃: 8.57% 95% mean confidence interval for instructions value: -17.95 -8.77 95% mean confidence interval for instructions %-change: -6.35% -2.43% Instructions are helped. total bytes in shared programs: 10447658 -> 10444532 (-0.03%) bytes in affected programs: 118850 -> 115724 (-2.63%) helped: 38 HURT: 1 helped stats (abs) min: 6.0 max: 264.0 x̄: 83.68 x̃: 45 helped stats (rel) min: 0.36% max: 16.51% x̄: 4.14% x̃: 1.09% HURT stats (abs) min: 54.0 max: 54.0 x̄: 54.00 x̃: 54 HURT stats (rel) min: 7.30% max: 7.30% x̄: 7.30% x̃: 7.30% 95% mean confidence interval for bytes value: -107.68 -52.62 95% mean confidence interval for bytes %-change: -5.55% -2.13% Bytes are helped. total halfregs in shared programs: 419446 -> 419444 (<.01%) halfregs in affected programs: 29 -> 27 (-6.90%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21122>	2023-02-05 08:53:29 +00:00
Luc Ma	abe6d750e5	xlib: fix glXDestroyContext in Gallium frontends when glx is built with -Dglx=xlib, the mishandle in glXDestroyContext causes glmark2 to exit unexpectedly. Error: Glmark2 needs OpenGL(ES) version >= 2.0 to run (but version string is: '(null)')! Error: Failed to add vertex shader from file None: Error: Failed to create the new program [build] <default>: Set up failed Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3985 Signed-off-by: Luc Ma <luc@sietium.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21067>	2023-02-04 22:25:09 +00:00
SoroushIMG	8f928a95e1	zink: fix cap check for arb sparse texture2 arb_sparse_texture2 also enables multisampled sparse textures. bring back the check for msaa support. fixes #8229 Fixes: `4f8ba2b9aa` ("zink: fix sparse residency query and minLOD feature checks") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21121>	2023-02-04 18:05:48 +00:00
Alyssa Rosenzweig	93db6094a1	nir/print: Pretty-print color0/1_interp These are an enum. Furthermore, their 0 state is INTERP_MODE_NONE which we shouldn't bother printing at all. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21091>	2023-02-04 17:26:30 +00:00
Alyssa Rosenzweig	b235be1fd4	nir/print: Pretty-print I/O semantic locations Instead of printing the raw location number, which is pretty hard to interpret, let's print the name of the location. Example output: vec4 16 ssa_2 = intrinsic load_interpolated_input (ssa_0, ssa_1) (base=0, component=0, dest_type=float16 /144/, io location=VARYING_SLOT_VAR0 slots=1 mediump /8388768/) One of the "regressions" from moving to purely lowered I/O with all variables removed is a lack of debuggability, since otherwise these location strings don't show up anywhere in the printed shader! By contrast this should make the lowered I/O nice to read like the early I/O. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21091>	2023-02-04 17:26:30 +00:00
Alyssa Rosenzweig	435e7f5e6d	nir/print: Extract get_location_str Locations show up in two places: variables and lowered I/O semantics. We want to reuse the logic in both places, so extract it out. The extracted logic is IMO easier to read, too. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21091>	2023-02-04 17:26:30 +00:00
Alyssa Rosenzweig	f857795e83	agx: Implement barriers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	251f6fb224	agx: Implement compute ID intrinsics These NIR intrinsics map to vectors of special registers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	da91a78ab7	asahi: Identify more compute-related XML Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	57e0dbe55b	asahi: Implement load_ssbo_address/get_ssbo_size More uniforms that get pushed. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	78c9344a4d	asahi: Add compute batches Add a specialized agx_batch for compute commands (queued to the CDM instead of the VDM for graphics). This uses a sentinel value for the width. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	f54739396c	asahi: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS Seems arbitrary. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	580ed13779	asahi: Stub out MSAA for dEQP Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	5e7babfa1b	asahi: Advertise seamless cube maps These are already wired up. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	32cbcbcb50	asahi: Fake more caps for dEQP-GLES31 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	f4b553d55a	asahi: Add hooks for SSBO and images Copy paste from Panfrost. This should be close to what we need for Asahi, and this lets us run dEQP-GLES31 without crashing immediately. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	c1a6465644	asahi: Don't leak shader NIR create_shader_state passes ownership of the NIR to the driver, so we need to free it when we destroy the shader CSO later. Use ralloc to manage this in a uniform way between graphics and compute. Strategy from Panfrost. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	227d4f6d75	asahi: Add compute kernel scaffolding This adds the basic scaffolding for compute kernels. There's a bit of churn to make sure we don't need to hang onto the kernel NIR, since it's never used for anything else except looking up the shader stage. The compute kernels aren't actually wired up here, but they do get compiled. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Alyssa Rosenzweig	60121e3a42	asahi: Fix delete_vs_state implementation The generic free won't delete the shader variants, leaking them all! Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21062>	2023-02-04 17:10:15 +00:00
Hampus Linander	b73b5cc71a	agx: Optimize lower_resinfo for cube maps We can avoid reading both width and height when the texture is a cube map, and we do so more simply by relying on CSE+DCE (Alyssa). Closes: #7541 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Hampus Linander	9ab1c0d83b	agx: Use AGX extr for tex lowering Replaces a number of bit operations by a single extr instruction, optimizing the extraction of the width from the packed value. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Hampus Linander	f3d6524a2d	agx: Add extr instruction to AGX backend Encoding is similar to bfeil, in particular the immidiate has the same encoding as BFI_MASK hence its reuse. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:37 -05:00
Hampus Linander	4ffc7c3ff4	nir: Add extr_agx opcode The AGX extr instruction extracts a bitfield from two 32bit registers. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20628>	2023-02-04 11:13:24 -05:00
Alyssa Rosenzweig	e765ec21ec	asahi: Implement custom border colours Implement custom border colours, as required by OpenGL's CLAMP_TO_BORDER and Vulkan with customBorderColor. This uses an extended sampler descriptor, which has space for the custom border values. The trouble is that the border must be packed into an internal interchange format that depends on the original format in a complex way. That said, we're not solving NP-complete problems here, and it passes the tests (dEQP-GLES31.functional.texture.border_clamp.* and piglit texwrap). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:37:02 -05:00
Alyssa Rosenzweig	507ca71f3e	agx/decode: Handle extended samplers These include a border colour field. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:38 -05:00
Alyssa Rosenzweig	afce5be659	agx/decode: Add a data parameter to stateful So we can handle extended samplers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:38 -05:00
Alyssa Rosenzweig	10eaa4a2ec	asahi: Add XML for custom border colours These use extended sampler descriptors. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:24 -05:00
Timur Kristóf	3a819bd22e	ac/nir/ngg: Include culled primitives in query. Vulkan spec 18.8. Primitives Generated Queries: When a generated primitive query for a vertex stream is active, the primitives-generated count is incremented every time a primitive emitted to that stream reaches the transform feedback stage, whether or not transform feedback is active. We can see the order of stages in chapter 27 Fixed-Function Vertex Post-Processing, which shows that the transform feedback stage is before rasterization (and therefore culling). Conclusion is that culled primitives should be included in the primitives generated query. This commit makes sure to emit the primitives generated query code before culling and uses the input primitive count passed to the current wave instead of the exec mask after culling. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21037>	2023-02-04 11:53:07 +01:00
Alyssa Rosenzweig	221311e1e9	agx: Handle constant-offset in address matching Match iadd(x, #y). The format shift will get constant-folded away and, if y is sufficiently small, the constant will be inlined by the AGX backend optimizer. This gets rid of piles of 64-bit arithmetic from lowering UBOs. It probably doesn't matter for perf since that's happening in preamble shaders but it is noisy. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21108>	2023-02-04 08:41:37 +00:00
Alyssa Rosenzweig	c3f7abaaef	agx: Fix storing to varying arrays The offset is in vec4s, not words (unlike the component). This doesn't matter right now since we get everything lowered (offset -> 0) but it will come up if we implement clip distances natively (instead of lowering in FS). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21097>	2023-02-04 08:28:43 +00:00
Alyssa Rosenzweig	897c47aa1c	docs/asahi: Document clip distance varyings These implement gl_ClipDistance in hardware, avoiding the fragment shader lowering. Unfortunately, they can't be disabled on a per-plane basis and they can't be interpolated, so using them for OpenGL would still require a bunch of extra lowering steps. Still, we should document the hardware and the caveats. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21097>	2023-02-04 08:28:43 +00:00
Alyssa Rosenzweig	13b25a6114	asahi: Don't use 16-bit inputs to 32-bit st_tile The hardware doesn't extend in this case, we need to extend for it. This fixes 32-bit render target formats with lower_mediump_io. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21082>	2023-02-04 08:14:32 +00:00
Alyssa Rosenzweig	6b0322d441	agx: Keep varyings forwarded to texture as fp32 This works around bugs in a LOT of applications, since fp16 texture coordinates are almost never appropriate even though it's a valid implementation of the GLES spec. It also doesn't seem to matter for perf. Code from the Bifrost compiler which implements the same workaround for slightly different reasons. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21082>	2023-02-04 08:14:32 +00:00
Alyssa Rosenzweig	5678fbe010	asahi: Merge fragment control XML Same struct specified twice and merged in the hw. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	50e61e251b	asahi: Remove redundant tri merge disable bit Cargoculted from Metal. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	6ee38e2635	asahi: DRY dirty tracking conditions Ella did this in agxv and it made a lot more sense than the copypasta I did. Should get copypropped to similar code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	98b2657b9e	asahi: Implement nontrivial rasterizer discard For vertex shaders with side effects, as seen with transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	64ae63c41f	asahi: Prefer blit-based texture transfer This speeds up glReadPixels. Instead of reading from the write-combined framebuffer and converting colours on the CPU, this blits on the GPU to a writeback staging resource with the colour conversion for free, and memcpies from the writeback staging resource on the CPU. In general, due to textures being write combined and tiled/compressed by default by staging resources being linear writeback, blit-based texture transfer should win out (you were going to blit anyway), particularly when format conversion is involved 33% reduction in wall clock time for grim at 4K. No change in deqp-gles2 runtime. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	0a5c3764c7	asahi: Make STAGING resources linear As intended by the flag. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	e7b97899ac	asahi: Use writeback when it looks beneficial When playing the My Little Pony theme song at 1080p on T8103, with mpv's GPU compositing but software decoding, CPU usage drops from 200% to 50% due to proper caching of the staging resource. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Asahi Lina	a88aa3e835	asahi: Refuse to transfer out-of-bounds mip levels Fixes ail asserts on a pile of dEQP3 tests. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	3706da1d1a	agx: Support uniform registers as LODs This will avoid regressing moves when we lower sampler LOD bias. Corresponding disassembler change: https://github.com/dougallj/applegpu/pull/22 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20833>	2023-02-04 07:33:08 +00:00
Alyssa Rosenzweig	231561d53a	asahi: Correct alignment for USC Uniform packets We only need 4 byte alignment, not 8 bytes. This isn't a big difference in practice, but it probably reduces padding in some cases. More importantly, it corrects our XML to match what the hardware actually does, which is great. (There is exactly enough room for a 40-bit address with 4 byte alignment.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00

... 19 20 21 22 23 ...

167219 commits