fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 13:08:09 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	17d4486c6a	asahi: Add XML for linear 2D arrays These look a bit like compressed images, and elucidate one of the common fields. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:48:13 -05:00
Alyssa Rosenzweig	48c9a9676c	asahi: Add XML required for vertex shader side effects Basically for rasterizer discard. We'll use these in a moment to implement transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:48:13 -05:00
Alyssa Rosenzweig	6bda0f2a70	asahi: Dump uniforms when decoding These often have addresses in them. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:48:13 -05:00
Alyssa Rosenzweig	f603d8ce9e	asahi: Clang-format the subtree See `0afd691f29` ("panfrost: clang-format the tree") for why I'm doing this. Asahi already mostly follows Mesa style so this doesn't do much. But this means we can all stop thinking about formatting and trust the robot poets to do that for us. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20434>	2022-12-27 22:46:29 +00:00
Alyssa Rosenzweig	d9dc77f068	asahi: Add some clang-format commas Otherwise clang-format will mangle this. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20434>	2022-12-27 22:46:29 +00:00
Alyssa Rosenzweig	c1f175c9fa	asahi: Manually format some parts of the code clang-format will mangle these. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20434>	2022-12-27 22:46:29 +00:00
Alyssa Rosenzweig	0347d1c358	asahi: Identify seamful cube map bit Fixes dEQP-GLES2.functional.texture.mipmap.cube.basic.linear_nearest when run with a GLES2 version. We wire up seamless cube maps for GLES3+ only, working around an obscure mesa/st limitation. See `6148e3aae7` ("mesa: Fix ctx->Texture.CubeMapSeamless") for the full context. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Alyssa Rosenzweig	623a2bf488	asahi: Identify XML for more flatshading controls Names from PowerVR <3 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Alyssa Rosenzweig	15155268de	asahi: Allow texturing S8 portion of combined Z/S Comes up in gles3. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	9fc2c0f341	asahi: Put meta shader keys into the meta shader itself The hash table needs a key pointer with at least the lifetime of the hash entry, which the key pointer we get does not have (since it is stack-allocated by agx_build_meta). Copy it into the shader struct itself and use that for the hash table. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	c12153cd89	asahi: Identify & disable triangle merging for shaders using derivatives It seems triangle merging is incompatible with calculating derivatives along primitive edges correctly. Take the appropriate NIR shader info flags in the compiler and pass them down as a flag to the driver, so it can set the disable triangle merging flag (formerly called "lines or points"). TODO: Is this what macOS does when you set a sample mask there (which apparently fixes the same bug on the Darwinia Metal backend)? Do we also need to set this when sample masks are used? Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes Darwinia and dEQP2 projected tests. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	005f556065	asahi: Fix include guard comment on decode.h Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Alyssa Rosenzweig	c9144eff48	asahi: Model alignment of occlusion query indices 8-byte offsets. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	3a318e4265	asahi: Identify some more fields used with layered These values depend on the framebuffer width/height and maybe other stuff. Maybe strides. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	c3eb81fd16	asahi: Identify XML for anisotropic filtering Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	8dcf7648f1	agx: Lower VBOs in NIR Now we support all the vertex formats! This means we don't hit u_vbuf for format translation, which helps performance in lots of applications. By doing the lowering in NIR, the vertex fetch code itself can be optimized by NIR (e.g. nir_opt_algebraic) which can improve generated code quality. In my first implementation of this, I had a big switch statement mapping format enums to interchange formats and post-processing code. This ends up being really unwieldy, the combinatorics of bit packing + conversion + swizzles is enormous and for performance we want to support everything (no u_vbuf fallbacks). To keep the combinatorics in check, we rely on parsing the util_format_description to separate out the issues of bit packing, conversion, and swizzling, allowing us to handle bizarro formats like B10G10R10A2_SNORM with no special casing. In an effort to support everything in one shot, this handles all the formats needed for the extensions EXT_vertex_array_bgra, ARB_vertex_type_2_10_10_10_rev, and ARB_vertex_type_10f_11f_11f_rev. Passes dEQP-GLES3.functional.vertex_arrays.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Asahi Lina	112830f1a0	asahi: Pass through layer alignment flag to the hardware Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Alyssa Rosenzweig	597e303b5b	agx: Add merge helpers to GenXML From panfrost. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20013>	2022-11-28 16:48:38 +00:00
Alyssa Rosenzweig	debee344a2	agx: Make empty texture pack to all-zeroes So we can do partial textures. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20013>	2022-11-28 16:48:38 +00:00
Asahi Lina	f5a26cc646	asahi: Fix remaining build issues on macOS Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20030>	2022-11-28 16:10:19 +00:00
Alyssa Rosenzweig	20cdc35fdb	asahi: Add missing #include Noticed when shuffling headers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Asahi Lina	6f15873d44	asahi: Introduce compressed resource support Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Asahi Lina	78948c03f0	asahi: Identify compression-related XML Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Alyssa Rosenzweig	b102f045ab	asahi: Set GPR count accurately for background/EOT Better occupancy, which is especially important when the background shader does memory access (for reloads). On my 4K monitor, glmark2 -bdesktop fullscreen from 95fps to 133fps. At default settings, glmark2 -bterrain from 63fps to 71fps. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19997>	2022-11-25 18:02:42 +00:00
Alyssa Rosenzweig	04360a270e	asahi: Copy panfrost's bo cache Massive performance gains, some fps before/after numbers from glmark2: [shading] 1486 -> 2391 [refract] 87 -> 127 [terrain] 32 -> 56 ...and it's basically for free with enough copy/paste, so thank you to Boris Brezillon for an excellent Asahi patch, the LRU cache seems to work great on M1 :-p There are a few minor changes I made from panfrost, notably adjusting the constants to account for 16KiB pages and switching from pthread_mutex to simple_mtx to be less weird in Mesa. For context on the design, the following commits evolved it in Panfrost and their commit messages may be useful... The logic in this module is the product of years of mistakes and correcting course :-) `f06809cdca` ("panfrost: Evict the BO cache when allocation fails") `77d0498913` ("panfrost: Fix major flaw in BO cache") `ee82f9f07e` ("panfrost: Try to evict unused BOs from the cache") `2225383af8` ("panfrost: Make sure the BO is 'ready' when picked from the cache") `9af4aeaaf7` ("panfrost: Don't return imported/exported BOs to the cache") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	7c8e3963bd	asahi: Stop aligning pool allocations to 4KiB This defeats the point of specifying alignments and of packing allocations together with the BO cache. We're a real driver now, let's allocate memory like one. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	860f5d77c6	asahi: Label BOs internally This will help debugging memory usage problems. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	70f40ea4d3	asahi: Wire up all BCn formats We have these native. Passes the relevant piglits. Large reduction in memory usage on Xonotic on higher settings (8x less memory per texture), which allows Xonotic to run at high settings without OOMing. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Tested-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19903>	2022-11-21 22:33:43 +00:00
Alyssa Rosenzweig	74e92274af	asahi,agx: Use new tilebuffer infrastructure Flag day change to replace the previous hardcoded background/end-of-tile shaders and the API-style load/store_output in fragment shaders with the generated shaders and lowered *_agx intrinsics. This gets us working non-UNORM8 render targets and working MRT. It's also a step in the direction of working MSAA but that needs a lot more work, since the multisampling programming model on AGX is quite different from any of the APIs (including Metal). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	c5c0ea39f6	asahi: Add new clear/reload/store infrastructure With multiple render targets, it's not practical to generate all variants of the background and end-of-tile programs at start up. Rather than trying, add a hash table of meta program keys to background programs, and compile variants as they're needed. With the new infrastructure, it's sensible to handle clears with the same code path as reloads. In addition to getting us closer to multiple render target support, this gets us support for non-RGBA8 render targets, as the u8norm tilebuffer format was baked into the hardcoded clear shader and store shaders used before. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	b1f5004ee7	asahi: Add agx_usc_shared_none helper Convenience for vertex USC programs. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	c713197c25	asahi: Add R16 SNORM formats For completeness, since we do have hardware for this. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	d637189d36	asahi: Add more XML via PowerVR These bits are the same as RGX. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	a3907e92da	asahi: Add note to XML about 16-bit varyings Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	94a8fe51d5	asahi: Identify more depth-related fields in XML Needed for gl_FragDepth writes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	6ce615d852	asahi: Add XML for layered rendering We don't need to support this for a while but it's good to know the mechanism. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	74de571402	asahi: Add NIR pass to lower tilebuffer access The compiler can't handle load/store_output directly for nontrivial tilebuffer layouts. Add a NIR pass to lower these intrinsics, applying a given layout. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	66a680a043	asahi: Add tilebuffer layout helpers Laying out the tilebuffer is nontrivial and a task shared between GL and VK, so add unit-tested helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	5d3243ea2d	asahi: Add some notes about unknowns to the XML Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	363ffa779d	asahi: Identify multisampling fields of shared layout Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	5a20c90508	asahi: Add _with_bo pool uploads Will be useful for managing our meta shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	8781aef6b4	asahi: Make libasahi_lib depend on libasahi_decode The track_alloc and track_free symbols are used, we need to link them in. Depending on build flags / environment / etc, fixes the potential build error hit by a CI job: mold: error: undefined symbol: agxdecode_track_alloc >>> referenced by agx_device.c >>> src/asahi/lib/libasahi_lib.a(src/asahi/lib/libasahi_lib.a.p/agx_device.c.o):(agx_shmem_alloc)>>> referenced by agx_device.c >>> src/asahi/lib/libasahi_lib.a(src/asahi/lib/libasahi_lib.a.p/agx_device.c.o):(agx_bo_create) mold: error: undefined symbol: agxdecode_track_free >>> referenced by agx_device.c >>> src/asahi/lib/libasahi_lib.a(src/asahi/lib/libasahi_lib.a.p/agx_device.c.o):(agx_bo_unreference) ...when trying to link with libasahi_lib without libasahi_decode for unit tests. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	6ee6cfec41	asahi: Use PIPE_FORMATs for driver-compiler ABI This avoids exposing the ISA-internal agx_format to the driver, instead hiding it behind a real PIPE_FORMAT. This lets us use real pipe formats in formatted intrinsics in NIR, which is convenient; it will allow us to simplify the compiler/driver ABI; and it lets us use common format helpers (e.g. util_format_get_blocksize) for the internal formats in driver lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	f328207475	asahi: Split out agx_usc.h into a common file So the tilebuffer helpers can build the "shared" USC word. Also because Ella will probably want to use these O:) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:11 +00:00
Alyssa Rosenzweig	8be506039d	asahi: Note some magic bits used with memoryless RTs Obviously there can't actually be memoryless render targets, because how would partial renders work? The control stream with memoryless looks like everything would if it went to memory (e.g. full 2D MSAA attachments for the partial loads/stores even if only a resolved 2D image for the final store). Except the memoryless attachments all load from the same address 0xeeee0000. Clearly that's not actually what happens, so what gives? Unclear... but I see the magic bits mentioned here set, and I assume there are some firmware (or kernel) shenanigans used to JIT allocate the backing storage for partial renders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:11 +00:00
Alyssa Rosenzweig	3fa87e47d5	asahi: Identify "Sample mask after depth/stencil" bit Corresponds to Metal [[sample_mask,post_depth_coverage]]. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:11 +00:00
Alyssa Rosenzweig	ff616099ce	asahi: Identify the pass type enum Via PowerVR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	2e6369f5f6	asahi: Identify PBE sample count Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	1f0edc0158	asahi: Identify Dimension for Render Target Metal uses when rendering to multisampled 2D. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	9c52001a1d	asahi: Implement stencil texturing Stencil texturing is easy: S8_UINT is textured like R8_UINT (with a little swizzle fixup), and stencil is always S8_UINT thanks to u_transfer_helper. So we just need to do some fixups to make u_transfer_helper's separate_stencil work and everything will work out. Passes dEQP-GLES31.functional.stencil_texturing.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00


245 commits