fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 11:18:11 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	ea38709345	asahi: Fix encoding of uniform size Only 6-bits, with zero=64 like a groups() encoding. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	862bf420a9	asahi: Handle sampler->compare_mode Instead of smashing unconditionally to 1. Not sure if this fixes anything but it gets rid of an unknown at least. Possibly slightly faster. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20561>	2023-01-13 19:43:14 +00:00
Alyssa Rosenzweig	b4d8be165b	asahi: Implement ARB_texture_mirror_clamp_to_edge Guessing the enum value, passes texwrap piglit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20560>	2023-01-09 23:58:52 +00:00
Alyssa Rosenzweig	0e2d786579	asahi: Implement GL_CLAMP natively Turns out there's a hardware mode for this. Apple's GL driver uses this. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20560>	2023-01-09 23:58:52 +00:00
Alyssa Rosenzweig	17d4486c6a	asahi: Add XML for linear 2D arrays These look a bit like compressed images, and elucidate one of the common fields. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:48:13 -05:00
Alyssa Rosenzweig	48c9a9676c	asahi: Add XML required for vertex shader side effects Basically for rasterizer discard. We'll use these in a moment to implement transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20446>	2023-01-05 11:48:13 -05:00
Alyssa Rosenzweig	0347d1c358	asahi: Identify seamful cube map bit Fixes dEQP-GLES2.functional.texture.mipmap.cube.basic.linear_nearest when run with a GLES2 version. We wire up seamless cube maps for GLES3+ only, working around an obscure mesa/st limitation. See `6148e3aae7` ("mesa: Fix ctx->Texture.CubeMapSeamless") for the full context. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Alyssa Rosenzweig	623a2bf488	asahi: Identify XML for more flatshading controls Names from PowerVR <3 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Asahi Lina	c12153cd89	asahi: Identify & disable triangle merging for shaders using derivatives It seems triangle merging is incompatible with calculating derivatives along primitive edges correctly. Take the appropriate NIR shader info flags in the compiler and pass them down as a flag to the driver, so it can set the disable triangle merging flag (formerly called "lines or points"). TODO: Is this what macOS does when you set a sample mask there (which apparently fixes the same bug on the Darwinia Metal backend)? Do we also need to set this when sample masks are used? Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes Darwinia and dEQP2 projected tests. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20365>	2022-12-17 18:10:28 +00:00
Alyssa Rosenzweig	c9144eff48	asahi: Model alignment of occlusion query indices 8-byte offsets. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	3a318e4265	asahi: Identify some more fields used with layered These values depend on the framebuffer width/height and maybe other stuff. Maybe strides. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	c3eb81fd16	asahi: Identify XML for anisotropic filtering Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Asahi Lina	112830f1a0	asahi: Pass through layer alignment flag to the hardware Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20031>	2022-11-28 19:50:18 +00:00
Alyssa Rosenzweig	debee344a2	agx: Make empty texture pack to all-zeroes So we can do partial textures. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20013>	2022-11-28 16:48:38 +00:00
Asahi Lina	78948c03f0	asahi: Identify compression-related XML Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19999>	2022-11-25 18:56:48 +00:00
Alyssa Rosenzweig	d637189d36	asahi: Add more XML via PowerVR These bits are the same as RGX. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	a3907e92da	asahi: Add note to XML about 16-bit varyings Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	94a8fe51d5	asahi: Identify more depth-related fields in XML Needed for gl_FragDepth writes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	6ce615d852	asahi: Add XML for layered rendering We don't need to support this for a while but it's good to know the mechanism. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	5d3243ea2d	asahi: Add some notes about unknowns to the XML Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	363ffa779d	asahi: Identify multisampling fields of shared layout Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	8be506039d	asahi: Note some magic bits used with memoryless RTs Obviously there can't actually be memoryless render targets, because how would partial renders work? The control stream with memoryless looks like everything would if it went to memory (e.g. full 2D MSAA attachments for the partial loads/stores even if only a resolved 2D image for the final store). Except the memoryless attachments all load from the same address 0xeeee0000. Clearly that's not actually what happens, so what gives? Unclear... but I see the magic bits mentioned here set, and I assume there are some firmware (or kernel) shenanigans used to JIT allocate the backing storage for partial renders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:11 +00:00
Alyssa Rosenzweig	3fa87e47d5	asahi: Identify "Sample mask after depth/stencil" bit Corresponds to Metal [[sample_mask,post_depth_coverage]]. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:11 +00:00
Alyssa Rosenzweig	ff616099ce	asahi: Identify the pass type enum Via PowerVR. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	2e6369f5f6	asahi: Identify PBE sample count Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	1f0edc0158	asahi: Identify Dimension for Render Target Metal uses when rendering to multisampled 2D. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	eac8cbb049	asahi: Identify counts for compute kernels In the same place as for vertex/fragment. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19265>	2022-10-29 19:23:51 +00:00
Alyssa Rosenzweig	721c4f2186	asahi: Remove "padding" field Trivial. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:58:48 -04:00
Alyssa Rosenzweig	06cb242a54	asahi: Identify more shader-related fields The big discovery is the "number of uniform registers" field. I learned about this one accidentally when my preamble shaders weren't working right, because we had inadvertently hardcoded "at most 32 registers" :-) In the course of identifying that field, I found that the pipeline address is used as a tagged pointer, with some unknown field in the bottom bits and alignment demanded. The XML is updated to account for this. I later found that there's also a "number of general purpose registers used by the preamble shader" field. I missed this one first, because the encoding is slightly different from the usual "number of general purpose registers in the main shader" field. The specification is slightly coarser. I don't know why the hardware needs that information anyway -- occupancy of the preamble shader should be irrelevant -- but it's not a big deal. Finally I found that the "more than 4 textures?" bit is... not that. I do not yet know what it is, but it is... not that. These all use the new groups() modifier for GenXML Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:58:37 -04:00
Alyssa Rosenzweig	24bfa5af88	asahi: Identify "Uniform high" USC word The start field in the Uniform USC word is only 8-bits, whereas 9-bits are required to address the entire uniform register file. This other word gets used for the high half, with start indexed from u128l in the natural way. Apparently spending the evening stuffing too many uniforms into Metal is paying off. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:54:07 -04:00
Alyssa Rosenzweig	ea58edaafb	asahi: Use a header more like Intel's GenXML We're trying to converge on a common schema. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18922>	2022-10-13 18:06:52 -04:00
Alyssa Rosenzweig	8eefda4ea9	asahi: Eliminate "Pixel Format" type from GenXML This is leaky and hurts compatibility with upstream GenXML. Just use the actual hardware fields. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18922>	2022-10-13 18:06:51 -04:00
Alyssa Rosenzweig	bcd75a13e0	asahi: Identify shared memory layouts Somehow maps to the tile size. Not sure about the details yet. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	b8b3c9fa2a	asahi: Identify pixel stride Number of bytes in a pixel in the tilebuffer, does not depend on the tile size. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	933a9e350e	asahi: Overhaul USC control packing Break up the monolithic SET_SHADER_EXTENDED packet into the separate underlying commands (some only 2-byte sized and aligned), and add a builder for USC control streams like we did for PPP updates to make that change manageable. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	22d3756207	asahi: Consolidate magic numbers for USC controls Aka "pipeline" states. It's another command/control stream. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	09cc736c42	asahi: Identify shared memory fields For compute kernels, this encodes how much workgroup-local memory is used ("shared memory" or "threadgroup memory" or "local memory"). This memory is partitioned by the hardware. For fragment shaders, this... encodes exactly the same thing. There is no traditional tilebuffer in AGX, instead local memory is interpreted as an imageblock, where each workgroup is a tile. This is a nifty design. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	2fbe1ae09c	asahi: Identify spill buffer histogram Histogram of sizes of the spill buffer, with logarithmic bucket sizes (relative to the amount spilled from the perspective of a single thread). Pretty funny. Also mark a few unknowns that are nonzero when spilling is used. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	a9c26df462	asahi: Identify IOGPU compute header Much simpler than the graphics one. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	58d138334d	asahi: Shuffle IOGPU structs We need the header to be common between gfx and compute, but everything else seems to be different. Shuffle so we can decode compute without any terrible hacks. I don't know the exact layout and don't care: the layout of the fields here is all software defined in macOS, even though the values are defined by hardware (or firmware in a few cases). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	4e8a586fd3	asahi: Identify CDM block types Same enum as PowerVR CDM, annoyingly different from the VDM block types. Split out the stream link / terminate structs (both observed with Metal for copious amounts of compute), in preparation for decoding "properly". Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	1400733320	asahi: Identify ZLS Control word from PowerVR We're into the cr.xml file now, which is the blob that gets passed through the kernel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	b0f8639382	asahi: Assert cache line alignment on Z/S buffers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	a7ddb8ebf7	asahi: Handle Stream Link VDM commands Jumps in the command streams, allowing us to chain ("link") command buffers. Naming is from PowerVR, which contains an identical command. PowerVR's has conditional jumps and function call support, it's likely that AGX inherited this too but I haven't tested that. (Those might be useful for conditional rendering and secondary command buffers respectively?) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	6f5c8d0e24	asahi: Express VDM commands according to PowerVR Piles of unknown bits go away, as we find they're either "field present" bits or block types. And yep, the block type enum lines up between AGX and RGX. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	80d8273705	asahi: Annotate VDM/CDM commands as per PVR Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	942bda7f2d	asahi: Match PPP data structures with PowerVR Looking at PowerVR's PPP definitions in tree in Mesa (src/imagination/csbgen/), we find that AGX's "tagged" data structures are actually sequences of state items prefixed by a header specifying which state follows. Rather than hardcoding the sequences in which Apple's driver chooses to bundle state, we need the XML to be flexible enough to encode or decode any valid combination of state. That means reworking the XML. While doing so, we find a number of fields that are identical between RGX and AGX, and fix the names while at it (for example, the W Clamp floating point). Names are from the PowerVR code in Mesa where sensible. Once we've reworked the XML, we need to rework the decoder. Instead of reading tags and printing the combined state packets, the decoder now must unpack the header and print the individual state items specified by the header, with slightly more complicated bounds checking. Finally, state emission in the driver becomes much more flexible. To prove the flexibility actually works, we now emit all PPP state (except for viewport and scissor state) as a single PPP update. This works. After this we can move onto more interesting arrangements of state for lower driver overhead. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	f7ef5eefdd	asahi: Identify object type field via PowerVR src/imagination/csbgen/rogue_ppp.xml STATE_ISPA bits 28. Looks like that got split into two structs in AGX (with info duplicated?) but yeah I have a lot to work with here. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	d93878f77a	asahi: Split RASTERIZER into constituent words As done in the PowerVR driver. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	02babc834a	asahi: Identify stencil test enable There are a pair of flags controlling the stencil test. One enables stencil testing in general, the other enables two-sided stencil. Compare the identical "twosided" flag in src/imagination/csbgen/rogue_ppp.xml's STATE_ISPCTL structure, at the same bit offset even. Evidently this word of the "Rasterizer" is, in fact, a derivative of STATE_ISPCTL. Fixes dEQP-GLES2.functional.fragment_ops.depth_stencil.* dEQP-GLES2.functional.fragment_ops.interaction.basic_shader.* dEQP-GLES2.functional.fragment_ops.random.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
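
Note on commit ea38709345 ("asahi: Fix encoding of uniform size"): the message describes a 6-bit field in which a stored zero stands for 64. The following is only a minimal sketch of that wrap-around encoding, not the driver's actual packing code. The names encode_size and decode_size are hypothetical, and the unit in which the size is counted is not stated in the commit message, so it is left abstract here.

```python
FIELD_BITS = 6               # field width stated in the commit message
MAX_SIZE = 1 << FIELD_BITS   # 64, the value represented by an encoded zero


def encode_size(size: int) -> int:
    """Pack a size in 1..64 into a 6-bit field, storing 64 as 0 (hypothetical helper)."""
    if not 1 <= size <= MAX_SIZE:
        raise ValueError(f"size {size} is not representable in {FIELD_BITS} bits")
    return size % MAX_SIZE


def decode_size(field: int) -> int:
    """Unpack a 6-bit field, reading 0 as 64 (hypothetical helper)."""
    if not 0 <= field < MAX_SIZE:
        raise ValueError(f"field {field} does not fit in {FIELD_BITS} bits")
    return field if field != 0 else MAX_SIZE
```

Under this scheme a size of zero cannot be expressed, which is presumably acceptable when the packet is only emitted if there is something to upload; that reading is an assumption, not something the commit message states.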


133 commits