fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-31 09:50:08 +01:00

Author	SHA1	Message	Date
Faith Ekstrand	212e07a70e	compiler/rust: Add a unit test for the memstream abstraction Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31718>	2024-10-17 18:59:02 +00:00
Faith Ekstrand	ec24156b31	compiler/rust: Enable unit tests Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31718>	2024-10-17 18:59:01 +00:00
Connor Abbott	016ce14ac7	tu: Implement VK_EXT_host_image_copy Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Connor Abbott	e77a2f1cc3	tu: Add a flag for cached non-coherent BOs We will have to flush/invalidate the memory backing an image in the driver when copying it to/from the host if it's cached and not coherent. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Connor Abbott	7a5a33e0e3	freedreno/fdl: Add tiling/untiling implementation for a6xx/a7xx This implements copies to/from the standard tiling (aka TILE6_3), similar to isl_tiled_memcpy for intel. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Connor Abbott	66bdb50736	tu: Gather UBWC config Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Connor Abbott	3df6c19a22	virtio/drm: Update header Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Connor Abbott	927968e266	freedreno: Add default UBWC config values These values are programmed by the kernel and not determined by the hardware, but provide a default value that should match what drm/msm programs for older kernels that can't report it. kgsl has always supported returning the highest_bank_bit, although it hardcodes some of the other parameters so we have to follow what it does instead of using this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Connor Abbott	f67b64ae6c	freedreno/fdl: Add UBWC config struct This will be used for the tiled memcpy implementation, but we add this part of the API first so that subsequent commits can embed it in turnip and set it up. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Connor Abbott	8a6f051a13	freedreno/a6xx: Remove dead fd6_get_ubwc_blockwidth() call Unused since `a9057d4`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Connor Abbott	a266360dff	freedreno/fdl: Extend 2bpp UBWC special case to 1bpp Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26578>	2024-10-17 18:17:18 +00:00
Zan Dobersek	b44480e86a	zink: fix bo_export caching When creating and caching the bo_export object for a given zink_bo, the screen file descriptor was used. Since no buffer object's file descriptor would match that, bo_export objects were continuously added to the exports list and no bo_export was effectively picked from the cache. Using the buffer object's file descriptor avoids that. Signed-off-by: Zan Dobersek <zdobersek@igalia.com> Fixes: `b0fe621459` ("zink: add back kms handling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31715>	2024-10-17 17:27:20 +00:00
Aleksi Sapon	cfdb653f1c	llvmpipe: update traces for aniso filtering fix Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31562>	2024-10-17 16:47:22 +00:00
Aleksi Sapon	e7a851e6cf	softpipe: Fix anisotropic sampling aliasing bug "Backport" of the llvmpip fix. Nearest sampling was being done using coordinates on texel boundaries, which caused aliasing bugs. Shift coordinates by half a texel to correct this. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31562>	2024-10-17 16:47:22 +00:00
Aleksi Sapon	5947e3d760	llvmpipe: Fix pmin calculation Based on the original code from sp_tex_sample.c, this was supposed to be a comparison with pmax2, not pmin2. This mostly seemed to result in anisotropic filtering turning on to "maximum" at any value of max_aniso > 1. Most apparent when runing the texfilt test in demos. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31562>	2024-10-17 16:47:22 +00:00
Aleksi Sapon	313115f98b	llvmpipe: Fix anisotropic sampling aliasing bug Nearest sampling was being done using coordinates on texel boundaries, which caused aliasing bugs. Shift coordinates by half a texel to correct this. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31562>	2024-10-17 16:47:21 +00:00
Connor Abbott	e9bb906a32	tu: Implement VK_EXT_pipeline_robustness Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31687>	2024-10-17 16:16:05 +00:00
Connor Abbott	c323848b0b	ir3, tu: Plumb through support for per-shader robustness We need to pass through the robust_modes flag to nir_opt_vectorize based on a flag set when compiling the shader, not globally in the compiler, for VK_EXT_pipeline_robustness. Refactor the ir3 compiler interface to add an ir3_shader_nir_options struct that can be passed around to the appropriate places, and wire it up in turnip to the shader key. The shader key replaces the old mechanism of hashing in the compiler options. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31687>	2024-10-17 16:16:05 +00:00
Anil Hiranniah	3d066e5ef1	panfrost: Fix a memory leak in the CSF backend The geometry BO should be released in csf_cleanup_context(). Fixes: `447075eeee` ("panfrost: Add support for the CSF job frontend") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31705>	2024-10-17 15:03:44 +00:00
Boris Brezillon	27bde761a7	panvk: Fix the hierarchy_mask selection Always enable the level covering the whole FB, and disable the finest levels if we don't have enough to cover everything. This is suboptimal for small primitives, since it might force primitives to be walked multiple times even if they don't cover the the tile being processed. On the other hand, it's hard to guess the draw pattern, so it's probably good enough for now. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31682>	2024-10-17 14:29:57 +00:00
Iago Toral Quiroga	ad111aed29	v3dv: fix leak during device initialization Fixes: `188f1c6cbe` ('v3dv: rewrite device identification') Reviewed-by: Karmjit Mahil <karmjit.mahil@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31703>	2024-10-17 13:27:05 +00:00
Juan A. Suarez Romero	e4301621a2	v3d/ci: add OpenCL failures Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31710>	2024-10-17 12:35:33 +00:00
Lionel Landwerlin	ea2bbe3271	anv: use stage mask to deduce cs/pb-stall requirements When flushing the render target cache for future operations, we need a stall at pixel scoreboard. We likely didn't see any issue until now because a change in render target added the pb-stall. When using a 2 compute shaders with the following pattern : vkCmdDispatch() vkCmdPipelineBarrier() ImageBarrier with (src\|dst)AccessMask=0 & identical layout vkCmdDispatch() we should ensure that the first dispatch is completed before executing the second one, otherwise they can race to on resource accesses. This fixes failures in some new CTS tests. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31676>	2024-10-17 11:55:33 +00:00
Georg Lehmann	aabadb30fc	aco/print_ir: use parse_depctr_wait Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31132>	2024-10-17 11:16:16 +00:00
Georg Lehmann	ced7a01954	aco/statistics: update branch issue cycles Foz-DB Navi31: Totals from 14319 (18.04% of 79395) affected shaders: Instrs: 20064495 -> 20062876 (-0.01%) CodeSize: 105334568 -> 105327704 (-0.01%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31132>	2024-10-17 11:16:16 +00:00
Georg Lehmann	ec11cfc69d	aco/insert_delay_alu: do not delay lane mask fast forwarding The delay actually hurts performance in this case. Foz-DB Navi31: Totals from 30340 (38.21% of 79395) affected shaders: Instrs: 30778999 -> 30726605 (-0.17%); split: -0.17%, +0.00% CodeSize: 162380180 -> 162170808 (-0.13%); split: -0.13%, +0.00% Latency: 228185562 -> 228186976 (+0.00%); split: -0.00%, +0.00% InvThroughput: 39001151 -> 39000897 (-0.00%); split: -0.00%, +0.00% Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31132>	2024-10-17 11:16:16 +00:00
Georg Lehmann	e4889fd4b5	aco/insert_delay_alu: consider more implicit waits Foz-DB Navi31: Totals from 37961 (47.81% of 79395) affected shaders: Instrs: 34175286 -> 33978599 (-0.58%) CodeSize: 180059352 -> 179190076 (-0.48%); split: -0.48%, +0.00% Latency: 259826196 -> 259798474 (-0.01%); split: -0.01%, +0.00% InvThroughput: 42792700 -> 42789298 (-0.01%); split: -0.01%, +0.00% Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31132>	2024-10-17 11:16:16 +00:00
Georg Lehmann	840b5841d3	aco: do not track ALU delay across jumps This assumes that the best case jump latency is higher than the worst case ALU latency. Foz-DB Navi31: Totals from 17720 (22.32% of 79395) affected shaders: Instrs: 26009663 -> 25929989 (-0.31%); split: -0.31%, +0.00% CodeSize: 136571496 -> 136254420 (-0.23%); split: -0.23%, +0.00% Latency: 215731308 -> 215722059 (-0.00%); split: -0.01%, +0.00% InvThroughput: 36534197 -> 36532070 (-0.01%); split: -0.01%, +0.00% Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31132>	2024-10-17 11:16:16 +00:00
Georg Lehmann	977f435f4c	aco/ir: add function to parse depctr waits No Foz-DB changes on Navi31. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31132>	2024-10-17 11:16:16 +00:00
Mike Blumenkrantz	a061c80629	zink: further improve image usage detection there was some confusion over exactly where ici->usage should be set, since the value must be set when doing all the format checks but then also it was re-set again later to a potentially different value based on an unchecked return now get_image_usage() is set_image_usage() with a more consistent policy around exactly where the usage is set this code still sucks tho Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31686>	2024-10-17 10:38:28 +00:00
Georg Lehmann	dbf63a0788	nir: remove nir_op_is_derivative Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	f9d2aad7a3	nir: remove alu ddx/ddy Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	bf0d1a42b4	nir: remove uses_fddx_fddy Unused and the code didn't even do what the comment said. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	cba575f4df	nir: always emit ddx intrinsics Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	5205501e2f	mesa/prog_to_nir: use derivative builder Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	41cce70584	spirv: remove alu fddx/fddy from comment Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	403e2393e3	ir3: remove alu fddx/fddy check Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	6cb6bc7133	elk: remove alu fddx/fddy check Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	1371a8fe2b	nir/opt_move_discards_to_top: handle ddx/ddy intrinsics Fixes: `daa97bb41a` ("amd: switch to derivative intrinsics") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Marek Olšák	948f94b8c5	nir/opt_varyings: pack TCS inputs with cross-invocation access together Unigine Heaven has a TCS that reads pos.xyz and tescoord.w from all invocations in every invocation. By putting those two in the same vec4, AMD hw can reduce the amount of shared memory that is allocated for those inputs from 2 vec4s to 1 vec4. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31670>	2024-10-17 03:30:07 +00:00
Marek Olšák	8e93907b7c	nir/opt_varyings: assign locations of no_varying IO for TCS outputs only Skip the code for other shader stages because it doesn't do anything there. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31670>	2024-10-17 03:30:07 +00:00
Daniel Almeida	01586bc57e	util: u_memstream: add tests Now we have a few extra methods and things are diverging a bit between Linux and Windows. Add a few unit tests to make sure this works. Signed-off-by: Daniel Almeida <daniel.almeida@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30594>	2024-10-17 02:50:21 +00:00
Daniel Almeida	279f38918f	nak: memstream: move into common code Move the memstream code into common code. Other Rust code interfacing with FILE pointers will find the memstream abstraction useful. Most notably, pinning is actually enforced this time with PhantomPinned. Add a .clang-format from a sibling dir (i.e.: compiler/nir) while we're at it to keep things tidy. Signed-off-by: Daniel Almeida <daniel.almeida@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30594>	2024-10-17 02:50:21 +00:00
Daniel Almeida	3136d1c8c6	util: memstream: add fflush support Flush support is needed by the Rust code, which will switch from its own memstream to u_memstream in a future patch. Signed-off-by: Daniel Almeida <daniel.almeida@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30594>	2024-10-17 02:50:20 +00:00
Connor Abbott	e6c5dcd295	tu: Expose VK_KHR_dynamic_rendering_local_read Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31261>	2024-10-17 00:30:45 +00:00
Connor Abbott	a99600322c	tu: Track possible feedback loops for dynamic renderpasses With DRLR, we unfortunately don't get notified about when there are feedback loops. We also can't detect ahead of time the case where an input attachment is read before written. This means we need to try and guess in the driver when that happens based on which input attachments we read. There are three main things we have to do with feedback loops: - Forcing flushing between primitives in sysmem to avoid problems with compressed textures before a7xx. - Forcing late depth test if it's a depth feedback loop. - Invalidating UCHE because we may read blit results. The way this is implemented is currently a bit conservative, following radv. However the impact of the flushing should be mitigated by switching to GMEM whenever we can, which we already do. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31261>	2024-10-17 00:30:44 +00:00
Connor Abbott	ef693c5785	tu: Fix flushes for feedback_invalidate case We also have to wait for the blits to land, and WFI so that everything finishes before any draws. Noticed when adding the equivalent thing for dynamic renderpasses. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31261>	2024-10-17 00:30:44 +00:00
Connor Abbott	cad2ca74d9	tu: Make input attachments always contain a real descriptor We need this for read-only input attachments. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31261>	2024-10-17 00:30:44 +00:00
Connor Abbott	beb513ad78	tu: Support dynamic input attachments Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31261>	2024-10-17 00:30:44 +00:00
Connor Abbott	d50eef5b06	tu: Support color attachment remapping Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31261>	2024-10-17 00:30:44 +00:00

... 95 96 97 98 99 ...

201327 commits