fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-27 03:38:12 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	6ee38e2635	asahi: DRY dirty tracking conditions Ella did this in agxv and it made a lot more sense than the copypasta I did. Should get copypropped to similar code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	98b2657b9e	asahi: Implement nontrivial rasterizer discard For vertex shaders with side effects, as seen with transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21081>	2023-02-04 07:58:42 +00:00
Alyssa Rosenzweig	64ae63c41f	asahi: Prefer blit-based texture transfer This speeds up glReadPixels. Instead of reading from the write-combined framebuffer and converting colours on the CPU, this blits on the GPU to a writeback staging resource with the colour conversion for free, and memcpies from the writeback staging resource on the CPU. In general, due to textures being write combined and tiled/compressed by default by staging resources being linear writeback, blit-based texture transfer should win out (you were going to blit anyway), particularly when format conversion is involved 33% reduction in wall clock time for grim at 4K. No change in deqp-gles2 runtime. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	0a5c3764c7	asahi: Make STAGING resources linear As intended by the flag. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	e7b97899ac	asahi: Use writeback when it looks beneficial When playing the My Little Pony theme song at 1080p on T8103, with mpv's GPU compositing but software decoding, CPU usage drops from 200% to 50% due to proper caching of the staging resource. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Asahi Lina	a88aa3e835	asahi: Refuse to transfer out-of-bounds mip levels Fixes ail asserts on a pile of dEQP3 tests. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21063>	2023-02-04 07:45:12 +00:00
Alyssa Rosenzweig	231561d53a	asahi: Correct alignment for USC Uniform packets We only need 4 byte alignment, not 8 bytes. This isn't a big difference in practice, but it probably reduces padding in some cases. More importantly, it corrects our XML to match what the hardware actually does, which is great. (There is exactly enough room for a 40-bit address with 4 byte alignment.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	e4cb64c0e2	asahi/nir_lower_sysvals: Split large ranges It is our responsibility to ensure uniform ranges don't exceed 64 uniforms. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21118>	2023-02-04 07:19:29 +00:00
Alyssa Rosenzweig	79a7c6e3bd	asahi: Set layout->mipmapped_z for 3D textures There's a corner case where 3D textures have extra padding compared to 2D arrays. We need to communicate that to ail. Fixes dEQP-GLES3.functional.texture.specification.texstorage3d.size.3d_32x16x64_4_levels. That test now uses the same layout as Metal. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21114>	2023-02-04 07:04:49 +00:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Ian Romanick	b265020b82	nir/builder: Eliminate nir_f2b helper (and use of nir_f2b32 helper) There were only two users. Replace each with nir_fneu instead. This is now a squash of what was two separate commits. nir_lower_pstipple_block is called after nir_lower_bool_to_int32, so nir_fneu32 has to be used here or there will be regresssions in stipple tests on llvmpipe. v2: Rebase on !20869. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Suggested-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Mike Blumenkrantz	7b0d000342	zink: add back VK_DESCRIPTOR_BINDING_PARTIALLY_BOUND_BIT for bindless this was accidentally lost in refactor Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
Mike Blumenkrantz	e67bdf47d4	zink: handle missing line rasterization modes with ds3 it's annoying to validate this at runtime since it has to happen during draw, but storing the "usable" ds3 mode separately from the pipeline state should be a reasonable enough compromise for perf here...hopefully Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
Mike Blumenkrantz	813bb9e442	zink: cache and reuse dummy inputattachment for fbfetch apparently an actual null descriptor is illegal here, and it's wasted cpu anyway, so just cache the dummy surface on init and use that data when fbfetch isn't active but the layout requires it Fixes: `7ab5c5d36d` ("zink: use EXT_descriptor_buffer with ZINK_DESCRIPTORS=db") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
Mike Blumenkrantz	abf63b7c68	zink: fix more cases of heap/memtype suballocator mismatch suballocation must happen based on the memtype, so also add some asserts to ensure the slab bos are always what the caller expects Fixes: `f6d3a5755f` ("zink: zink_heap isn't 1-to-1 with memoryTypeIndex") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
Mike Blumenkrantz	e1e4ddcf10	zink: free descriptor buffer maps on batch state destroy Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21100>	2023-02-03 21:59:07 +00:00
SoroushIMG	4f8ba2b9aa	zink: fix sparse residency query and minLOD feature checks cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21013>	2023-02-03 20:05:23 +00:00
Emma Anholt	de5b67ef2c	ci/llvmpipe: Drop skip of InteractionFunctionCalls2. This one is down to <5 seconds here these days. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	2eb07304e3	ci/swrast: Drop skips for tests whose perf had been fixed. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	907b0a01b7	gallivm: Do the same codegen improvement for constant-index array loads. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	cf47154300	gallivm: Fix codegen performance for constant-index register array stores. Instead of generating num_components*simdwidth scattered stores, if there's no indirect then we can just look up the pointer to the base_offset and do a simd store there. dEQP-VK.subgroups.ballot_broadcast.compute.subgroupbroadcast_i64vec4 goes from 30s to ~2s. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	833a74351c	gallivm: Fix the type of array nir_registers. This now matches how they get dereffed by get_soa_array_offsets() -- each array element has num_components vecs inside of it, rather than each components has an array in it. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21084>	2023-02-03 19:01:59 +00:00
Emma Anholt	a5d360550e	gallivm: Enable GALLIVM_DEBUG (mostly) on non-DEBUG builds. This is what let me do the performance work in my recent gallivm MRs. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21086>	2023-02-03 18:21:49 +00:00
Emma Anholt	947c60fa2f	llvmpipe: Enable LP_DEBUG on normal builds. I don't typically include DEBUG because it sometimes has expensive debug code, but these options are not that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21086>	2023-02-03 18:21:49 +00:00
Emma Anholt	b6bd904019	ci/lvp: Drop the subgroupbroadcast skips. These have the same runtime as the others in the group, and with these optimizations they no longer time out. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:42 -08:00
Emma Anholt	70be21e7c6	gallivm: Use first active invocation in some image/ssbo accesses. These should be looking at that rather than blindly using invocation 0 (which may be junk when in control flow). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:40 -08:00
Emma Anholt	8c2493d041	gallivm: Use cttz instead of a loop for first_active_invocation(). This should be way faster to compile by not spamming so many loops at LLVM, and faster to execute if LLVM didn't figure out what that loop meant. It looks vector reduce ops aren't really a thing, just a convenience in the IR. We should be able to do better by counting zeroes in the exec_mask != 0 result. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:37 -08:00
Emma Anholt	c11fa55f6d	gallivm: Return 0 first_active_invocation when we know that up front. 46 -> 30 seconds on dEQP-VK.subgroups.ballot_broadcast.compute.subgroupbroadcast_i16vec4 by not spamming LLVM with so many loops. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:35 -08:00
Emma Anholt	dc7c518abe	gallivm: Refactor out a shared "get the first active invocation" loop. Dynamic texture indices had a similar "find an active channel" loop, though it happened to use the last active channel rather than the first. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:32 -08:00
Emma Anholt	0b0246706e	gallivm: Optimize emit_read_invocation's first-invocation loop. We don't need to deref invoc inside -- invoc is uniform in active channels, so we can find our first active invocation in the loop, and then dereference invocation once outside. 50 -> 46 seconds on dEQP-VK.subgroups.ballot_broadcast.compute.subgroupbroadcast_i16vec4 Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21001>	2023-02-03 08:51:12 -08:00
Alyssa Rosenzweig	d73f72120a	asahi: Lower texcoords late This uses the new pass to lower tex coordinates late, which gets us one step closer to preprocessing NIR at CSO create time instead of variant create time. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21065>	2023-02-03 15:03:06 +00:00
Alyssa Rosenzweig	6908a0dece	asahi: Run nir_lower_fragcolor during preprocessing This pass needs to run early (because it depends on early I/O), but it doesn't actually need the shader key. Why not? If we overestimate the number of render targets, extra store_output intrinsics will be generated, but they will be deleted by AGX tilebuffer lowering later. Note we'll probably want something smarter than this for fragment epilogues in the future to avoid piles of unnecessary moves. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21065>	2023-02-03 15:03:06 +00:00
David Heidelberg	002707ff09	ci/lavapipe: use dxvk for the traces Since the job is manual, this stayed overlooked. Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20875>	2023-02-03 13:48:51 +00:00
David Heidelberg	3bc1bf7eea	ci: uprev piglit (etag md5 checksumming support) Support for FDO etag http header. Includes line-smooth-stipple test improvements. Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Guilherme Gallo <guilherme.gallo@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20875>	2023-02-03 13:48:51 +00:00
Qiang Yu	f6b194b648	nir,ac/llvm,aco,radv,radeonsi: remove nir_export_vertex_amd Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:44 +00:00
Qiang Yu	80506be31b	ac/nir/ngg,radv,radeonsi: nogs use ac_nir_export_(position\|parameter) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:44 +00:00
Qiang Yu	7c41cdb81f	ac/nir,radv,radeonsi: gs copy shader use ac_nir_export_(position\|parameter) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:44 +00:00
Qiang Yu	7308637bb4	ac/nir,radv,radeonsi: legacy vs use ac_nir_export_(position\|parameter) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:44 +00:00
Qiang Yu	df8c93a9f3	radeonsi: set nr_pos_exports outside of llvm translation This can save an abi interface when we share position export code with RADV. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:43 +00:00
Qiang Yu	048d4de5e5	radeonsi: remove the extra handling for VS/TES primitive id We have moved si_nir_assign_param_offsets before output lowering pass, so there won't be primitive id store output when VS/TES here. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:43 +00:00
Qiang Yu	59135678cf	radeonsi: update outputs written nir info We may remove some outputs when si_nir_kill_outputs and ac_nir_optimize_outputs, so update the outputs written info for output lower pass to skip manipulating these outputs. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:43 +00:00
Qiang Yu	dcccd94faf	radeonsi: clamp vertex color in legacy gs instead of gs copy shader gs copy shader is going to emit nir_export_amd directly so this vertex color clamp pass which apply to nir_store_output will not work. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:43 +00:00
Qiang Yu	601ad9e0a9	amd,radeonsi: implement nir_load_force_vrs_rates_amd in driver abi Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20691>	2023-02-03 12:27:43 +00:00
Yonggang Luo	b1a33789b8	util: Implement util_iround with lrintf unconditionally Because the place that called util_iround are always ensured that INT_MIN <= f <= INT_MAX Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19978>	2023-02-03 04:00:17 +00:00
Mike Blumenkrantz	e82369d06b	zink: enable bindless texture with ZINK_DESCRIPTORS=db Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21085>	2023-02-03 02:12:33 +00:00
Mike Blumenkrantz	99ba529fee	zink: implement descriptor buffer handling of bindless texture pretty straightforward, just lazily allocating the context-based db and then writing updates to it on-demand Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21085>	2023-02-03 02:12:33 +00:00
Mike Blumenkrantz	6b49dec675	zink: add a flag to indicate whether a descriptor buffer is bound Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21085>	2023-02-03 02:12:33 +00:00
Mike Blumenkrantz	f81a4e904c	zink: break out descriptor binding into separate function Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21085>	2023-02-03 02:12:33 +00:00
Mike Blumenkrantz	362b8792e7	zink: set VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT on compute pipelines same as gfx Fixes: `7ab5c5d36d` ("zink: use EXT_descriptor_buffer with ZINK_DESCRIPTORS=db") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21085>	2023-02-03 02:12:33 +00:00
Mike Blumenkrantz	e471b4360d	zink: skip updating descriptor buffer sets that aren't active this is a no-op and illegal Fixes: `7ab5c5d36d` ("zink: use EXT_descriptor_buffer with ZINK_DESCRIPTORS=db") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21085>	2023-02-03 02:12:33 +00:00

1 2 3 4 5 ...

58399 commits