fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 06:08:21 +02:00

Author	SHA1	Message	Date
Asahi Lina	c12153cd89	asahi: Identify & disable triangle merging for shaders using derivatives It seems triangle merging is incompatible with calculating derivatives along primitive edges correctly. Take the appropriate NIR shader info flags in the compiler and pass them down as a flag to the driver, so it can set the disable triangle merging flag (formerly called "lines or points"). TODO: Is this what macOS does when you set a sample mask there (which apparently fixes the same bug on the Darwinia Metal backend)? Do we also need to set this when sample masks are used? Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes Darwinia and dEQP2 projected tests. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	005f556065	asahi: Fix include guard comment on decode.h Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	b80fb31678	asahi: Allocate enough push ranges for the worst possible case We need one for every possible sysval, plus up to 16 VBOs. Fixes plasma-systemmonitor. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Alyssa Rosenzweig	eba2b182c8	agx: Fix packing of extension for block image stores Probably impossible to hit in practice but let's get it right. Found when forcing RA to use the upper half of the reg file. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Alyssa Rosenzweig	ef23bbfdbd	agx: Coalesce i2i16 and u2u16 Extract out the code for unpack_64_2x32_split_x and use it for other integer downcasts too to coalesce out a move. Pointless, but I wanted to have a little RA fun after getting stencil export working. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	58d02e4f59	ail: Assert that the mip level is in bounds This preempts possible out-of-bounds accesses and later asserts when trying to get the tile size. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Alyssa Rosenzweig	a8ec3135bb	ail: Fix tile sizes Fixes dEQP-GLES3.functional.texture.filtering.2d.sizes.3x7_nearest_mipmap_linear. Tested for all sizes 1..256x1..256. Tested-by: Asahi Lina <lina@asahilina.net> Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	d36a829fa1	ail: Fix typo Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	0d57fcaf28	ail: Always allocate the full miptree Layer strides are based on the full miptree, and even for single-layer images macOS always allocates a full one (possibly relevant for compression). Make sure we do the same, regardless of how many mip levels the user asked for. Fixes Darwinia. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Ian Romanick	eb76cee9f8	nir: Eliminate nir_op_i2b There are a lot of optimizations in opt_algebraic that match ('ine', a, 0), but there are almost none that match i2b. Instead of adding a huge pile of additional patterns (including variations that include both ine and i2b), always lower i2b to a != 0. At this point in the series, it should be impossible for anything to generate i2b, so there /should not/ be any changes. The failing test on d3d12 is a pre-existing bug that is triggered by this change. I talked to Jesse about it, and, after some analysis, he suggested just adding it to the list of known failures. v2: Don't rematerialize i2b instructions in dxil_nir_lower_x2b. v3: Don't rematerialize i2b instructions in zink_nir_algebraic.py. v4: Fix zink-on-TGL CI failures by calling nir_opt_algebraic after nir_lower_doubles makes progress. The latter can generate b2i instructions, but nir_lower_int64 can't handle them (anymore). v5: Add back most of the hunk at line 2125 of nir_opt_algebraic.py. I had accidentally removed the f2b(bf2(x)) optimization. v6: Just eliminate the i2b instruction. v7: Remove missed i2b32 in midgard_compile.c. Remove (now unused) emit_alu_i2orf2_b1 function from sfn_instr_alu.cpp. Previously this function was still used. 🤷 No shader-db changes on any Intel platform. All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141165875 -> 141165873 (-0.0%) Instructions helped: 2 Cycles in all programs: 9098956382 -> 9098956350 (-0.0%) Cycles helped: 2 The two Vulkan shaders are helped because of the "new" (('b2i32', ('ine', ('ubfe', a, b, 1), 0)), ('ubfe', a, b, 1)) algebraic pattern. Acked-by: Jesse Natalie <jenatali@microsoft.com> [earlier version] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> [earlier version] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Alyssa Rosenzweig	c9144eff48	asahi: Model alignment of occlusion query indices 8-byte offsets. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	3a318e4265	asahi: Identify some more fields used with layered These values depend on the framebuffer width/height and maybe other stuff. Maybe strides. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	c3eb81fd16	asahi: Identify XML for anisotropic filtering Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	b28fe26d7c	ail: Save level_offsets_compressed_B So we can bind specific mip levels for rendering into compressed Z/S. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Alyssa Rosenzweig	8dcf7648f1	agx: Lower VBOs in NIR Now we support all the vertex formats! This means we don't hit u_vbuf for format translation, which helps performance in lots of applications. By doing the lowering in NIR, the vertex fetch code itself can be optimized by NIR (e.g. nir_opt_algebraic) which can improve generated code quality. In my first implementation of this, I had a big switch statement mapping format enums to interchange formats and post-processing code. This ends up being really unwieldly, the combinatorics of bit packing + conversion + swizzles is enormous and for performance we want to support everything (no u_vbuf fallbacks). To keep the combinatorics in check, we rely on parsing the util_format_description to separate out the issues of bit packing, conversion, and swizzling, allowing us to handle bizarro formats like B10G10R10A2_SNORM with no special casing. In an effort to support everything in one shot, this handles all the formats needed for the extensions EXT_vertex_array_bgra, ARB_vertex_type_2_10_10_10_rev, and ARB_vertex_type_10f_11f_11f_rev. Passes dEQP-GLES3.functional.vertex_arrays.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Alyssa Rosenzweig	fb49715a2c	agx: Lower UBOs in NIR Simpler than lowering in the backend and makes the sysvals obvious in the NIR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Alyssa Rosenzweig	6b4ed663a8	agx: Implement 8-bit sign extensions Long term, I think having i2i16 and i2i32 available with 8-bit sources should make lowering the rest of 8-bit away a bit easier. Short term, this avoids special casing 8-bit in the VBO lowering code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Alyssa Rosenzweig	8127737c1e	agx: Allow some 8-bit sources 8-bit sources are useful for int8->float32 conversions, which we can do in a single hardware instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Alyssa Rosenzweig	ba209fe493	agx: Implement formatted loads These will be generated by the UBO and VBO lowerings. (and eventually by other lowerings too?) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Alyssa Rosenzweig	580f25a266	agx: Add shift to device_load We'll use this as an optimization soon. This acts in addition to the format's shift. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Alyssa Rosenzweig	1555ac6f0b	agx: Clamp point sizes Fixes vs-point_size-zero. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20017>	2022-12-01 05:58:30 +00:00
Alyssa Rosenzweig	7108619c0d	agx: Handle 32-bit gl_FragCoord.zw The coefficient register is 16-bit so our builder will make the iter 16-bit too (maybe not the best design...), force fp32 to match the NIR intrinsic. Fixes glsl-fs-fragcoord-zw-ortho Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20017>	2022-12-01 05:58:30 +00:00
Alyssa Rosenzweig	eb4187b02d	agx: Handle large varying indices Fixes glsl-max-varyings. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20017>	2022-12-01 05:58:30 +00:00
Asahi Lina	022d03013a	ail: Split off test-miptree.cpp from test-layout.cpp Keep test-layout.cpp for the simple smoke tests, and move the big pile of miptree tests to its own file. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Asahi Lina	d0532196a2	ail: Add uncompressed twiddled texture sizing tests Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Asahi Lina	50ee22f5a5	ail: Rename test-compression.cpp to test-comp-twiddled.cpp To better align with the analogous test-uncomp-twiddled.cpp Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Asahi Lina	c52d4bef2d	ail: Add more compression size test cases Also sort the table in a consistent way, to make it easier to add tests without creating duplicates in the future. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Asahi Lina	c39ca7007f	ail: Fix logic for buffer alignment It turns out that specifically Z/S single-layer textures have the main miptree padded to the page size, but not others. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Asahi Lina	ecdcb3e1aa	ail: Fix compression metadata buffer sizing corner cases Although the metadata is possibly one byte per 8x4 block, the logical block size for compression/allocation is a 16x16 block, so align to that. Also align the initial dimensions to that size, and change the minification to a simple DIV_ROUND_UP. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Asahi Lina	112830f1a0	asahi: Pass through layer alignment flag to the hardware Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Asahi Lina	d88b546e65	ail: Introduce layer_alignment flag The hardware uses this flag to determine whether layer strides are implicitly aligned to the page size or not. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Alyssa Rosenzweig	597e303b5b	agx: Add merge helpers to GenXML From panfrost. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20013>	2022-11-28 16:48:38 +00:00
Alyssa Rosenzweig	debee344a2	agx: Make empty texture pack to all-zeroes So we can do partial textures. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20013>	2022-11-28 16:48:38 +00:00
Asahi Lina	f5a26cc646	asahi: Fix remaining build issues on macOS Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20030>	2022-11-28 16:10:19 +00:00
Alyssa Rosenzweig	20cdc35fdb	asahi: Add missing #include Noticed when shuffling headers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Asahi Lina	6f15873d44	asahi: Introduce compressed resource support Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Asahi Lina	78948c03f0	asahi: Identify compression-related XML Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Asahi Lina	bea975b298	ail: Add unit tests for compression Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Asahi Lina	0ba63d5c26	ail: Introduce support for compression The main buffer is twiddled as before, but there's now also an auxiliary compression buffer that we need to reserve space for. With compression, the main buffer is aligned less. The macOS logic seems to be to align to the page size only if the texture is both 3D and mipmapped, and the layer stride is greater than the page size. That's gated on compression being enabled. Page alignment seems to be needed for uncompressed twiddled cube maps. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Alyssa Rosenzweig	b102f045ab	asahi: Set GPR count accurately for background/EOT Better occupancy, which is especially important when the background shader does memory access (for reloads). On my 4K monitor, glmark2 -bdesktop fullscreen from 95fps to 133fps. At default settings, glmark2 -bterrain from 63fps to 71fps. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19997>	2022-11-25 18:02:42 +00:00
Alyssa Rosenzweig	6de5bd5f41	agx: Fix signedness issues packing UBSan complains otherwise: ../src/asahi/compiler/agx_pack.c:701:21: runtime error: left shift of 1 by 31 places cannot be represented in type 'int' ../src/asahi/compiler/agx_pack.c:534:18: runtime error: left shift of 8 by 28 places cannot be represented in type 'int' Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	d608ca0363	agx: Handle vertex shaders that use <= 8 halfregs r5 and r6 are always getting lowered. Will prevent a regression with VBO lowering on a shader which has stride=0 and hence gets the vertex ID read optimized out with NIR: dEQP-GLES2.functional.draw.random.50 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	94124925ca	agx: Try to align sources of pack_64_2x32_split Helps with coalescing the pack. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	442e29890d	agx: Implement nir_op_pack_64_2x32_split This maps to a collect where the dest size is 64 and the src size is 32. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	04360a270e	asahi: Copy panfrost's bo cache Massive performance gains, some fps before/after numbers from glmark2: [shading] 1486 -> 2391 [refract] 87 -> 127 [terrain] 32 -> 56 ...and it's basically for free with enough copy/paste, so thank you to Boris Brezillon for an excellent Asahi patch, the LRU cache seems to work great on M1 :-p There are a few minor changes I made from panfrost, notably adjusting the constants to account for 16KiB pages and switching from pthread_mutex to simple_mtx to be less weird in Mesa. For context on the design, the following commits evolved it in Panfrost and their commit messages may be useful... The logic in this module is the product of years of mistakes and correcting course :-) `f06809cdca` ("panfrost: Evict the BO cache when allocation fails") `77d0498913` ("panfrost: Fix major flaw in BO cache") `ee82f9f07e` ("panfrost: Try to evict unused BOs from the cache") `2225383af8` ("panfrost: Make sure the BO is 'ready' when picked from the cache") `9af4aeaaf7` ("panfrost: Don't return imported/exported BOs to the cache") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	7c8e3963bd	asahi: Stop aligning pool allocations to 4KiB This defeats the point of specifying alignments and of packing allocations together with the BO cache. We're a real driver now, let's allocate memory like one. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	860f5d77c6	asahi: Label BOs internally This will help debugging memory usage problems. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Yonggang Luo	40a9fc57aa	tree-wide: Use __func__ instead of __FUNCTION__ in non-gallium code Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19861>	2022-11-22 06:53:46 +00:00
Alyssa Rosenzweig	70f40ea4d3	asahi: Wire up all BCn formats We have these native. Passes the relevant piglits. Large reduction in memory usage on Xonotic on higher settings (8x less memory per texture), which allows Xonotic to run at high settings without OOMing. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Tested-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19903>	2022-11-21 22:33:43 +00:00
Alyssa Rosenzweig	74e92274af	asahi,agx: Use new tilebuffer infrastructure Flag day change to replace the previous hardcoded background/end-of-tile shaders and the API-style load/store_output in fragment shaders with the generated shaders and lowered *_agx intrinsics. This gets us working non-UNORM8 render targets and working MRT. It's also a step in the direction of working MSAA but that needs a lot more work, since the multisampling programming model on AGX is quite different from any of the APIs (including Metal). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00

1 2 3 4 5 ...

581 commits