fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-02-23 22:40:34 +01:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	e765ec21ec	asahi: Implement custom border colours Implement custom border colours, as required by OpenGL's CLAMP_TO_BORDER and Vulkan with customBorderColor. This uses an extended sampler descriptor, which has space for the custom border values. The trouble is that the border must be packed into an internal interchange format that depends on the original format in a complex way. That said, we're not solving NP-complete problems here, and it passes the tests (dEQP-GLES31.functional.texture.border_clamp.* and piglit texwrap). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:37:02 -05:00
Alyssa Rosenzweig	507ca71f3e	agx/decode: Handle extended samplers These include a border colour field. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:38 -05:00
Alyssa Rosenzweig	afce5be659	agx/decode: Add a data parameter to stateful So we can handle extended samplers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:38 -05:00
Alyssa Rosenzweig	10eaa4a2ec	asahi: Add XML for custom border colours These use extended sampler descriptors. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20570>	2023-02-04 10:32:24 -05:00
Timur Kristóf	3a819bd22e	ac/nir/ngg: Include culled primitives in query. Vulkan spec 18.8. Primitives Generated Queries: When a generated primitive query for a vertex stream is active, the primitives-generated count is incremented every time a primitive emitted to that stream reaches the transform feedback stage, whether or not transform feedback is active. We can see the order of stages in chapter 27 Fixed-Function Vertex Post-Processing, which shows that the transform feedback stage is before rasterization (and therefore culling). Conclusion is that culled primitives should be included in the primitives generated query. This commit makes sure to emit the primitives generated query code before culling and uses the input primitive count passed to the current wave instead of the exec mask after culling. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21037>	2023-02-04 11:53:07 +01:00
Alyssa Rosenzweig	221311e1e9	agx: Handle constant-offset in address matching Match iadd(x, #y). The format shift will get constant-folded away and, if y is sufficiently small, the constant will be inlined by the AGX backend optimizer. This gets rid of piles of 64-bit arithmetic from lowering UBOs. It probably doesn't matter for perf since that's happening in preamble shaders but it is noisy. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21108>	2023-02-04 08:41:37 +00:00
Alyssa Rosenzweig	c3f7abaaef	agx: Fix storing to varying arrays The offset is in vec4s, not words (unlike the component). This doesn't matter right now since we get everything lowered (offset -> 0) but it will come up if we implement clip distances natively (instead of lowering in FS). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21097>	2023-02-04 08:28:43 +00:00
Alyssa Rosenzweig	897c47aa1c	docs/asahi: Document clip distance varyings These implement gl_ClipDistance in hardware, avoiding the fragment shader lowering. Unfortunately, they can't be disabled on a per-plane basis and they can't be interpolated, so using them for OpenGL would still require a bunch of extra lowering steps. Still, we should document the hardware and the caveats. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21097>	2023-02-04 08:28:43 +00:00
Alyssa Rosenzweig	13b25a6114	asahi: Don't use 16-bit inputs to 32-bit st_tile The hardware doesn't extend in this case, we need to extend for it. This fixes 32-bit render target formats with lower_mediump_io. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21082>	2023-02-04 08:14:32 +00:00
Alyssa Rosenzweig	6b0322d441	agx: Keep varyings forwarded to texture as fp32 This works around bugs in a LOT of applications, since fp16 texture coordinates are almost never appropriate even though it's a valid implementation of the GLES spec. It also doesn't seem to matter for perf. Code from the Bifrost compiler which implements the same workaround for slightly different reasons. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21082>	2023-02-04 08:14:32 +00:00
Alyssa Rosenzweig	5678fbe010	asahi: Merge fragment control XML Same struct specified twice and merged in the hw. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	50e61e251b	asahi: Remove redundant tri merge disable bit Cargoculted from Metal. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	6ee38e2635	asahi: DRY dirty tracking conditions Ella did this in agxv and it made a lot more sense than the copypasta I did. Should get copypropped to similar code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	98b2657b9e	asahi: Implement nontrivial rasterizer discard For vertex shaders with side effects, as seen with transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	64ae63c41f	asahi: Prefer blit-based texture transfer This speeds up glReadPixels. Instead of reading from the write-combined framebuffer and converting colours on the CPU, this blits on the GPU to a writeback staging resource with the colour conversion for free, and memcpies from the writeback staging resource on the CPU. In general, due to textures being write combined and tiled/compressed by default by staging resources being linear writeback, blit-based texture transfer should win out (you were going to blit anyway), particularly when format conversion is involved 33% reduction in wall clock time for grim at 4K. No change in deqp-gles2 runtime. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	0a5c3764c7	asahi: Make STAGING resources linear As intended by the flag. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	e7b97899ac	asahi: Use writeback when it looks beneficial When playing the My Little Pony theme song at 1080p on T8103, with mpv's GPU compositing but software decoding, CPU usage drops from 200% to 50% due to proper caching of the staging resource. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Asahi Lina	a88aa3e835	asahi: Refuse to transfer out-of-bounds mip levels Fixes ail asserts on a pile of dEQP3 tests. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	3706da1d1a	agx: Support uniform registers as LODs This will avoid regressing moves when we lower sampler LOD bias. Corresponding disassembler change: https://github.com/dougallj/applegpu/pull/22 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20833>	2023-02-04 07:33:08 +00:00
Alyssa Rosenzweig	231561d53a	asahi: Correct alignment for USC Uniform packets We only need 4 byte alignment, not 8 bytes. This isn't a big difference in practice, but it probably reduces padding in some cases. More importantly, it corrects our XML to match what the hardware actually does, which is great. (There is exactly enough room for a 40-bit address with 4 byte alignment.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	e4cb64c0e2	asahi/nir_lower_sysvals: Split large ranges It is our responsibility to ensure uniform ranges don't exceed 64 uniforms. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	b0f1964771	asahi: Strengthen agx_usc_uniform contract Check the size explicitly, instead of just implicitly in the GenXML pack: it is the responsibility of the caller to split up larger uploads. While this is nominally more complicated, agx_usc_uniform is called in the draw hot path whereas the actual splitting decision can usually be done at compile-time. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	ea38709345	asahi: Fix encoding of uniform size Only 6-bits, with zero=64 like a groups() encoding. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	79a7c6e3bd	asahi: Set layout->mipmapped_z for 3D textures There's a corner case where 3D textures have extra padding compared to 2D arrays. We need to communicate that to ail. Fixes dEQP-GLES3.functional.texture.specification.texstorage3d.size.3d_32x16x64_4_levels. That test now uses the same layout as Metal. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Alyssa Rosenzweig	9b2dc92228	ail: Test 63x63 cube map This has a subtle interaction with page-aligned layers. Written while debugging dEQP-GLES3.functional.texture.filtering.cube.combinations.nearest_nearest_repeat_clamp Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Alyssa Rosenzweig	294351ff77	ail: Test mipmapped_z behaviour The mipmapped_z = true case is checked against Metal, the false case is smoke testing the old behaviour (which is still used for 2D arrays). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Alyssa Rosenzweig	c2bf66ab87	ail: Add layout->mipmapped_z input For 3D images, the full miptree depends on the depth of the image, in contrast to 2D arrays. We need to account for this to calculate the correct layer strides. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Sergi Blanch Torne	60d7e15a7e	ci: disable Collabora's LAVA lab for maintance This is to inform you of some planned downtime in the LAVA lab as follows: Start: 2023-02-04 06:00 GMT End: 2023-02-06 12:00 GMT Signed-off-by: Sergi Blanch Torne <sergi.blanch.torne@collabora.com> Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21119>	2023-02-04 00:21:05 -03:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Ian Romanick	024122c069	nir/builder: Handle f2b conversions specially in nir_type_convert No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Ian Romanick	b265020b82	nir/builder: Eliminate nir_f2b helper (and use of nir_f2b32 helper) There were only two users. Replace each with nir_fneu instead. This is now a squash of what was two separate commits. nir_lower_pstipple_block is called after nir_lower_bool_to_int32, so nir_fneu32 has to be used here or there will be regresssions in stipple tests on llvmpipe. v2: Rebase on !20869. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Suggested-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Mike Blumenkrantz	7b0d000342	zink: add back VK_DESCRIPTOR_BINDING_PARTIALLY_BOUND_BIT for bindless this was accidentally lost in refactor Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
Mike Blumenkrantz	e67bdf47d4	zink: handle missing line rasterization modes with ds3 it's annoying to validate this at runtime since it has to happen during draw, but storing the "usable" ds3 mode separately from the pipeline state should be a reasonable enough compromise for perf here...hopefully Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
Mike Blumenkrantz	813bb9e442	zink: cache and reuse dummy inputattachment for fbfetch apparently an actual null descriptor is illegal here, and it's wasted cpu anyway, so just cache the dummy surface on init and use that data when fbfetch isn't active but the layout requires it Fixes: `7ab5c5d36d` ("zink: use EXT_descriptor_buffer with ZINK_DESCRIPTORS=db") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
Mike Blumenkrantz	abf63b7c68	zink: fix more cases of heap/memtype suballocator mismatch suballocation must happen based on the memtype, so also add some asserts to ensure the slab bos are always what the caller expects Fixes: `f6d3a5755f` ("zink: zink_heap isn't 1-to-1 with memoryTypeIndex") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
Mike Blumenkrantz	e1e4ddcf10	zink: free descriptor buffer maps on batch state destroy Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
SoroushIMG	4f8ba2b9aa	zink: fix sparse residency query and minLOD feature checks cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21013>	2023-02-03 20:05:23 +00:00
Yiwei Zhang	86c6484fba	venus: lazily query and cache gralloc front rendering usage When skiavk is the default system ui renderer, venus icd gets preloaded into Zygote. However, Zygote access to render node is normally denied by selinux except for legacy bootanimation purpose. This change fixes venus icd loading to avoid invoking cros gralloc driver loading by moving the perform op outside, so that we still get the memory footprint win. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21107>	2023-02-03 19:33:18 +00:00
Emma Anholt	de5b67ef2c	ci/llvmpipe: Drop skip of InteractionFunctionCalls2. This one is down to <5 seconds here these days. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	2eb07304e3	ci/swrast: Drop skips for tests whose perf had been fixed. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	907b0a01b7	gallivm: Do the same codegen improvement for constant-index array loads. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	cf47154300	gallivm: Fix codegen performance for constant-index register array stores. Instead of generating num_components*simdwidth scattered stores, if there's no indirect then we can just look up the pointer to the base_offset and do a simd store there. dEQP-VK.subgroups.ballot_broadcast.compute.subgroupbroadcast_i64vec4 goes from 30s to ~2s. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	833a74351c	gallivm: Fix the type of array nir_registers. This now matches how they get dereffed by get_soa_array_offsets() -- each array element has num_components vecs inside of it, rather than each components has an array in it. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	a5d360550e	gallivm: Enable GALLIVM_DEBUG (mostly) on non-DEBUG builds. This is what let me do the performance work in my recent gallivm MRs. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21086>	2023-02-03 18:21:49 +00:00
Emma Anholt	947c60fa2f	llvmpipe: Enable LP_DEBUG on normal builds. I don't typically include DEBUG because it sometimes has expensive debug code, but these options are not that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21086>	2023-02-03 18:21:49 +00:00
Dylan Baker	fd9b50aa1c	meson: combine checks for linker --gc-sections support We first do an incomplete check for whether the linker supports --gc-sections, then potentially add C and C++ arguments assuming that it works, then later do a complete check to see if it actually works and use --gc-sections. This means we can end up putting functions and data in separate sections when we can't gc them. Combine the checks, do less work, and be more accurate. fixes: `f51ce21e4e` ("meson: Drop adding -Wl,--gc-sections to project c/cpp arguments.") Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21083>	2023-02-03 17:48:58 +00:00
Alyssa Rosenzweig	7f98a9ba2b	panfrost: Implement GL_EXT_render_snorm on Bifrost+ It turns out it's really easy. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20684>	2023-02-03 17:21:34 +00:00
Emma Anholt	b6bd904019	ci/lvp: Drop the subgroupbroadcast skips. These have the same runtime as the others in the group, and with these optimizations they no longer time out. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:42 -08:00
Emma Anholt	70be21e7c6	gallivm: Use first active invocation in some image/ssbo accesses. These should be looking at that rather than blindly using invocation 0 (which may be junk when in control flow). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:40 -08:00
Emma Anholt	8c2493d041	gallivm: Use cttz instead of a loop for first_active_invocation(). This should be way faster to compile by not spamming so many loops at LLVM, and faster to execute if LLVM didn't figure out what that loop meant. It looks vector reduce ops aren't really a thing, just a convenience in the IR. We should be able to do better by counting zeroes in the exec_mask != 0 result. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:37 -08:00

1 2 3 4 5 ...

166189 commits