fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 21:58:10 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	deb3810f1e	agx: Remove load_kernel_input path Unused and now won't be used. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18658>	2022-10-05 16:09:21 +00:00
Alyssa Rosenzweig	c17fcbaa2f	agx: Account for mask when writing registers To use fewer registers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	5cd2371318	agx: Pass mask into ld/st_tile instructions Properly handle render target formats with <4 components. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	640fd089a2	agx: Ensure that the optimizer sees legitimate SSA Expecting it to keep around unused definitions around is wishful. Add an "anchoring" unit_test instruction to consume the results so they don't have to be precoloured registers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	52467c2d1e	agx: Test fsat+f2f16 together Something I hit when mucking with this pass. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	3e86522cf2	agx: Validate immediates In particular the new sizing rules. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	14f2be1f33	agx: Use 16-bit immediates This is slightly more accurate in the IR, and means we instruction select the current 16-bit size floating point instructions when all non-immediate operands are 16-bit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	e302e5d527	agx: Emit fewer combines for intrinsics A bunch of the emitted combines were unnecessary, or unnecessarily large. Fix the accounting now that combines are variable size. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	e887a11b06	agx: Fix bfi_mask packing Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18687>	2022-09-22 03:23:36 +00:00
Alyssa Rosenzweig	a1faab0b90	agx: Convert and clamp array indices in NIR ..Rather than at backend IR translation time. This is considerably simpler because we can use the txs lowering instead of special casing array sizes. Unfortunately it generates worse code, but that gap should close once nir_opt_preamble is wired in. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18652>	2022-09-19 16:14:24 +00:00
Alyssa Rosenzweig	bcd75a13e0	asahi: Identify shared memory layouts Somehow maps to the tile size. Not sure about the details yet. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	b8b3c9fa2a	asahi: Identify pixel stride Number of bytes in a pixel in the tilebuffer, does not depend on the tile size. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	933a9e350e	asahi: Overhaul USC control packing Break up the monolithic SET_SHADER_EXTENDED packet into the separate underlying commands (some only 2-byte sized and aligned), and add a builder for USC control streams like we did for PPP updates to make that change manageable. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	35d5558fa5	asahi/genxml: Overflow up to words when packing So we can pack things that aren't 4-byte sized. Note this doesn't help with alignment. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	22d3756207	asahi: Consolidate magic numbers for USC controls Aka "pipeline" states. It's another command/control stream. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	09cc736c42	asahi: Identify shared memory fields For compute kernels, this encodes how much workgroup-local memory is used ("shared memory" or "threadgroup memory" or "local memory"). This memory is partitioned by the hardware. For fragment shaders, this... encodes exactly the same thing. There is no traditional tilebuffer in AGX, instead local memory is interpreted as an imageblock, where each workgroup is a tile. This is a nifty design. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	2fbe1ae09c	asahi: Identify spill buffer histogram Histogram of sizes of the spill buffer, with logarithmic bucket sizes (relative to the amount spilled from the perspective of a single thread). Pretty funny. Also mark a few unknowns that are nonzero when spilling is used. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:37 -04:00
Alyssa Rosenzweig	adfd213241	asahi: Decode IOGPU compute header Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	a9c26df462	asahi: Identify IOGPU compute header Much simpler than the graphics one. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	58d138334d	asahi: Shuffle IOGPU structs We need the header to be common between gfx and compute, but everything else seems to be different. Shuffle so we can decode compute without any terrible hacks. I don't know the exact layout and don't care: the layout of the fields here is all software defined in macOS, even though the values are defined by hardware (or firmware in a few cases). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	287a0d4f40	asahi: Decode CDM commands separate from VDM This gets correct handling of CDM stream link/terminate, which are encoded in a slightly different way from VDM. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	4e8a586fd3	asahi: Identify CDM block types Same enum as PowerVR CDM, annoyingly different from the VDM block types. Split out the stream link / terminate structs (both observed with Metal for copious amounts of compute), in preparation for decoding "properly". Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	1400733320	asahi: Identify ZLS Control word from PowerVR We're into the cr.xml file now, which is the blob that gets passed through the kernel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	b0f8639382	asahi: Assert cache line alignment on Z/S buffers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18623>	2022-09-18 10:34:25 -04:00
Alyssa Rosenzweig	89e0f54422	agx: Don't use nir_find_variable_with_driver_location io_semantics is the preferred alternative. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18525>	2022-09-13 16:04:29 +00:00
Alyssa Rosenzweig	0883f0b302	agx: Lower txs to a descriptor crawl There's no native txs instruction... but we can emulate one :-) This is heavy on shader ALU, but in the production driver, it'll all be hoisted up to the preamble shader and so it shouldn't matter much. This keeps the driver itself simple and low overhead, with a completely obvious generalization to bindless. Passes dEQP-GLES3.functional.shaders.texture_functions.texturesize.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18525>	2022-09-13 16:04:29 +00:00
Alyssa Rosenzweig	bc4f418cb4	agx: Implement load_global(_constant) Found in compute shaders, maps to a subset of device_load, and will be used for some lowerings soon. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18525>	2022-09-13 16:04:29 +00:00
Alyssa Rosenzweig	965cc62bdd	agx: Implement txd Handles all cases except for cube maps, which don't seem to work properly, so those are lowered. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18525>	2022-09-13 16:04:29 +00:00
Alyssa Rosenzweig	7a4e0a4d35	agx: Implement texture offsets and comparators Texture offsets and shadow comparison values get grouped into a vector passed by register. Comparison values are provided as-is (fp32). Texture offsets are packed into nibbles, but we can do this on the CPU, as nonconstant offsets are forbidden in GLSL at least. They're also forbidden in Vulkan/SPIR-V without ImageGatherExtended/ shaderImageGatherExtended. I'm happy kicking the NIR lowering can down the line, this commit is complicated enough already. Passes dEQP-GLES3.functional.shaders.texture_functions.texture.* and dEQP-GLES3.functional.shaders.texture_functions.textureoffset.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18525>	2022-09-13 16:04:29 +00:00
Alyssa Rosenzweig	4f85a7be8c	agx: Make p_combine take a dynamic src count For larger vectors. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18525>	2022-09-13 16:04:28 +00:00
Alyssa Rosenzweig	ef31dceee8	agx,asahi: Implement nir_intrinsic_load_texture_base_agx Save off what we pass to BIND_TEXTURE. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18525>	2022-09-13 16:04:28 +00:00
Alyssa Rosenzweig	a7ddb8ebf7	asahi: Handle Stream Link VDM commands Jumps in the command streams, allowing us to chain ("link") command buffers. Naming is from PowerVR, which contains an identical command. PowerVR's has conditional jumps and function call support, it's likely that AGX inherited this too but I haven't tested that. (Those might be useful for conditional rendering and secondary command buffers respectively?) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	6f5c8d0e24	asahi: Express VDM commands according to PowerVR Piles of unknown bits go away, as we find they're either "field present" bits or block types. And yep, the block type enum lines up between AGX and RGX. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	80d8273705	asahi: Annotate VDM/CDM commands as per PVR Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	22f6efde02	asahi: Dirty track everything Now that we have fine grained state emit code, let's use it to reduce driver overhead. Dirty tracking is delicate: while this seems to work, I've also added an ASAHI_MESA_DEBUG=dirty option in debug builds to disable the optimizations here for future debug. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	942bda7f2d	asahi: Match PPP data structures with PowerVR Looking at PowerVR's PPP definitions in tree in Mesa (src/imagination/csbgen/), we find that AGX's "tagged" data structures are actually sequences of state items prefixed by a header specifying which state follows. Rather than hardcoding the sequences in which Apple's driver chooses to bundle state, we need the XML to be flexible enough to encode or decode any valid combination of state. That means reworking the XML. While doing so, we find a number of fields that are identical between RGX and AGX, and fix the names while at it (for example, the W Clamp floating point). Names are from the PowerVR code in Mesa where sensible. Once we've reworked the XML, we need to rework the decoder. Instead of reading tags and printing the combined state packets, the decoder now must unpack the header and print the individual state items specified by the header, with slightly more complicated bounds checking. Finally, state emission in the driver becomes much more flexible. To prove the flexibility actually works, we now emit all PPP state (except for viewport and scissor state) as a single PPP update. This works. After this we can move onto more interesting arrangements of state for lower driver overhead. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	baadc1ec13	asahi: Don't use lower_wpos_pntc Instead we can flip point coords with the object type. That means fewer instructions without shader variants. Thanks, PowerVR ^_^ Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	f7ef5eefdd	asahi: Identify object type field via PowerVR src/imagination/csbgen/rogue_ppp.xml STATE_ISPA bits 28. Looks like that got split into two structs in AGX (with info duplicated?) but yeah I have a lot to work with here. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	d93878f77a	asahi: Split RASTERIZER into constituent words As done in the PowerVR driver. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18421>	2022-09-06 21:01:28 +00:00
Alyssa Rosenzweig	02babc834a	asahi: Identify stencil test enable There are a pair of flags controlling the stencil test. One enables stencil testing in general, the other enables two-sided stencil. Compare the identical "twosided" flag in src/imagination/csbgen/rogue_ppp.xml's STATE_ISPCTL structure, at the samebit offset even. Evidently this word of the "Rasterizer" is, in fact, a derivative of STATE_ISPCTL. Fixes dEQP-GLES2.functional.fragment_ops.depth_stencil.* dEQP-GLES2.functional.fragment_ops.interaction.basic_shader.* dEQP-GLES2.functional.fragment_ops.random.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	b891d60efa	asahi: Fix depth/stencil buffers There are a bunch of bits we need to set right to get depth/stencil loads/stores working, including with independent settings for each. The kernel "helps" us here. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	66f1164976	asahi: Add 1D and 1D Array enums To finish out the enum. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	2bdb8ba3ce	asahi: Correct SET_SHADER_EXTENDED disambig bit This is still a guess, but a considerably firmer one as it now corrects handles the clear pipelines emitted by Metal as well as the regular vertex/fragment shader, and gets rid of the preshader special cases seen there. Fixes decode of clear pipeline's preshaders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	210f4aff1e	asahi: Identify and use first level field of texture As we recently discovered, the layout of level L of a mipmapped 2D image of size WxH is /not/ the same as the layout of a non-mipmapped 2D image of size minify(W, L) x minify(H, L). The difference occurs due to subtleties of the "power of two" miptrees which can force a level to use a larger tile size than it would have required at root level. To handle this quirk correctly, the driver must not implement texture views with address arithmetic -- it must supply instead the base width/height of a texture and use first/last level fields on the texture descriptor to map it. Similar issues occur when writing a particular level of a mipmapped texture, which was handled correctly in the colour case but not the Z/S case. Fixes dEQP-GLES2.functional.texture.mipmap.cube.generate.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	1d72d3feb6	asahi: Fix "stride" for tiled textures It doesn't exist, but there's a count of mip levels for writeable image descs. Setting that appropriately matters at high mip levels. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	4442be1155	asahi: Fix nonmipmapped array textures pot_level can be greater than the number of levels actually included -- don't overallocate. Fix the issue and add a representative unit test. Fixes: dEQP-GLES2.functional.texture.size.cube.512x512_rgb888 Fixes: `6ff75da8aa` ("ail: Introduce image layout module") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	a41d732784	asahi: Fix depth for cube maps For cube maps, depth=1 in the hardware (but 6 in Gallium). Likewise for cube map arrays, depth=n in the hardware (but 6n in Gallium). We need to divide to compensate. This will be relevant for cube map arrays in the future -- add the dimension XML for cube map arrays too. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	e66a901bc8	asahi: Relax assert in decoder Seen == 0x8 with >4 render targets. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	9542f95864	asahi: Trim garbage at end of set shader Unfortunately the actual size of this data structure is unclear. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00
Alyssa Rosenzweig	fb7860ed24	asahi: Handle empty fragment shaders When an empty fragment shader is used with Metal, the stop command is still included but this special bit is set, suppressing tilebuffer access. Failing to do so but using empty shaders for u_blitter depth clears causes Glitch Lina: https://twitter.com/LinaAsahi/status/1537869064793575424 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18380>	2022-09-04 18:05:31 +00:00

1 2 3 4 5 ...

421 commits