fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 22:20:14 +01:00

Author	SHA1	Message	Date
Seán de Búrca	ea6c57cfcb	rusticl: cap max alloc size to i32 to avoid overflowing gallium Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details v2: remove stray comment Reviewed-by: Karol Herbst <kherbst@redhat.com> Cc: stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34166>	2025-04-11 09:54:07 +00:00
Seán de Búrca	5b1088220e	rusticl: correct calculation of maximum allocation size This verifies that the requested allocation doesn't exceed the maximum in cases where the size passed to `clSVMAlloc()` isn't a multiple of the provided alignment. It also clamps the maximum allocation to `i32::MAX`, which prevents overflowing `pipe_box`'s `width` field. Both of these changes prevent possible undefined behavior on 32-bit systems due to violation of `Layout` prerequisites. v2: use safe layout creation for maintainability, add a few comments v3: use Layout utils for aligned size calc, split out max alloc changes v4: use `checked_compare()` for alloc/size comparison Reviewed-by: Karol Herbst <kherbst@redhat.com> Cc: stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34166>	2025-04-11 09:54:06 +00:00
Erik Faye-Lund	7c88a52350	panvk: enable sampledImageIntegerSampleCounts for all MSAA formats Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details There's no good reason not to support this, it should just work... Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Erik Faye-Lund	1729409141	panvk: enable 8x and 16x msaa when supported This uses the helpers from the previous patch to calculate how many attachments and MRT buffers we have space for. In the case where we can support more MSAA samples for smaller formats, we also add support for that. The flaking test seems to be due to a CTS issue, see this ticket for details: https://gitlab.khronos.org/Tracker/vk-gl-cts/-/issues/5651 Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Erik Faye-Lund	329568b5eb	panfrost: add color-attachment and msaa helpers In order to enable higher MSAA modes, we're going to have to perform some calculations on how to budget the (sometimes) limited tile-buffer space. Due to limited tilebuffer space, we need to prioritize a bit here. First, we reserve space for 4x MSAA for all formats. Then we try to fit 8 color attachments into the tile-buffer. And then finally, we calculate how many extra multi-sample buffers we can fit into the rest. The reason we reserve 4x MSAA first, is that this is required by all Vulkan versions. It also prevents us from regressing existing features. Then we try to pick 8 color attachments next, because that's required by Vulkan 1.4 as well as Vulkan Roadmap 2024 and D3D12. Vulkan Roadmap 2022 requires 7 as well. This adds helpers that implements this, which can be used by both the Gallium and the Vulkan driver. It's really benefitial if both of these drivers prioritize the same way here. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Erik Faye-Lund	20acee81ac	panfrost: disable tile-pipelining when needed On v5, as well as v7 onwards, we can disable pipelining in order to fit more data into the tile-memory. This is important in order to support multiple, large color buffers with high MSAA sample counts. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Boris Brezillon	77d38bb7ca	panfrost: also consider z/s for tile-size We also have a budget for the tile size for depth-buffers. It's currently hard to trigger issues with this than for color-buffers, but this becomes important when we support larger MSAA counts. We also need to take a bit of care for stencil-only attachments, because they also count against a limit here. We really only care about the sample counts here, because the stencil buffer budget is always a quarter of the depth-buffer budget, and always uses a single byte per sample. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Erik Faye-Lund	9ec6197a0b	panfrost: allocate tile-buffer for dummy render-targets There's two limitations we have to cater to: 1. The HW needs at least one render-target. We can disable write-back for it, but it needs to allocate tile-buffer space for it. 2. The HW can't have "holes" in the render-targets. In both of those cases, we already set up dummy RGBA8 UNORM as the format, and disable write-back. But we forgot to take this into account when calculating the tile buffer allocation. This makes what we program the HW to do consistent, meaning we don't end up smashing the tile-buffer space. We might be able to do something better by adjusting how we program these buffers, but let's leave that for later. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Erik Faye-Lund	af87aa5ee4	panfrost: properly align value This matches the rest of this file. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Erik Faye-Lund	c2ddafb935	panvk/ci: move timeouts to crash These seems to crash on CI, not timeout. And the stencil.samples_1 variant is already present in the fails file, so let's remove the duplicate. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Erik Faye-Lund	c99a3298d9	panfrost: correct tile-buffer size for some v7 GPUs These GPUs had their tilebuffer sizes listed at twice their actual values. While that still works, it ends up disabling pipelining in some cases. This gives a significant performance hit, compared to using the correct values. But, it turns out to be hard or impossible to trigger at the moment, due to the limited number of MSAA samples we support. Once that changes, this is a lot easier to trigger, so let's fix it up. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Erik Faye-Lund	b4ebffa1aa	panfrost: fixup typo in 16x sample-pattern This is an n-queen pattern, where no two values should be on the same row or column. But this and the second to last element has the same y component, and neither has the negative one. Let's fix this up by setting the first value to the negative value. This matches the D3D 16x sample pattern. Fixes: `a61fb62966` ("panfrost: Upload sample positions on device init") Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33925>	2025-04-11 08:01:08 +00:00
Valentine Burley	b49eaf0966	ci/lava: Consolidate piglit trace job definitions Clean up LAVA job definitions. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34424>	2025-04-11 07:05:07 +00:00
Valentine Burley	87d58ea57a	ci/piglit: Consolidate HWCI_TEST_SCRIPT for piglit traces The HWCI_TEST_SCRIPT variable was always getting overwritten for these definitions. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34424>	2025-04-11 07:05:06 +00:00
Valentine Burley	1aeedddbb6	ci/piglit: Drop redundant PIGLIT_PROFILES variable PIGLIT_PROFILES was only used with the piglit-runner.sh script, which no jobs were using anymore. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34424>	2025-04-11 07:05:06 +00:00
Valentine Burley	09f86df938	intel/ci: Convert iris-kbl-piglit to deqp-runner suite This was the last job using the piglit-runner.sh script. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34424>	2025-04-11 07:05:06 +00:00
Benjamin Lee	a02b6e6bef	panvk: don't advertise VK_KHR_float_controls2 on bifrost Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This extension requires vulkan 1.1. Fixes dEQP-VK.api.info.extension_core_versions.extension_core_versions on bifrost. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Fixes: `22fa3e88dd` ("panvk: advertise VK_KHR_float_controls2") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34463>	2025-04-10 20:12:21 +00:00
Konstantin Seurer	b218c45973	radv: Handle nir_intrinsic_printf Makes it possible to use printf statements inside glsl meta shaders. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34208>	2025-04-10 19:31:37 +00:00
Konstantin Seurer	ba001626ac	nir: Turn the format string index into a const index It is already expected to be constant. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34208>	2025-04-10 19:31:37 +00:00
Konstantin Seurer	d21926bc04	spirv: Emit code for NonSemantic.DebugPrintf if supported This can be useful for debugging code in situations where VVL cannot be used. (DGC, meta shaders) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34208>	2025-04-10 19:31:37 +00:00
Benjamin Lee	28e2d9088c	panvk: enable uniformAndStorageBuffer8BitAccess and storagePushConstant8 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details 8-bit loads are already supported by bi_emit_load_ubo and bi_emit_load_push_constant, so the only necessary changes were fixing swizzle lowering issues uncovered by these CTS tests. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33416>	2025-04-10 10:16:02 -07:00
Benjamin Lee	b683a59438	panfrost/va: valhall-specific swizzle lowering For most swizzled instructions that are different between valhall and bifrost, valhall allows more values than bifrost does, so we can avoid some unnecessary lowering. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33416>	2025-04-10 10:15:46 -07:00
Benjamin Lee	168c96816a	panfrost: use bifrost instruction table for bi_lower_swizzle Fixes two known issues: - We did not lower invalid swizzles for IADD.v4s8, triggered in the CTS by enabling uniformAndStorageBuffer8BitAccess and storageBuffer8BitAccess in panvk. - We did not lower invalid swizzles for IMUL.v4i8, triggered by dEQP-VK.spirv_assembly.instruction.compute.mul_extended.(un)signed_8bit on bifrost. The old logic was missing several other instructions, so there may be additional bugs that we don't know about. There are no cases where the new behavior will keep swizzles that would have been lowered previously, so this change should not introduce any new bugs with valhall. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33416>	2025-04-10 10:15:46 -07:00
Benjamin Lee	f513ddfc1c	panfrost/va: use 'lane' modifier for MKVEC.v2i8 Previously we were using 'swizzle', with special handling in va_pack. This does not work if we want to use va_src_info to determine allowed swizzles in bi_lower_swizzle. The allowed set of swizzle values for 'lane' is correct for this instruction. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33416>	2025-04-10 10:15:46 -07:00
Benjamin Lee	f2c8267962	panfrost: add bi_swizzle values for unused swizzles This makes codegen using bifrost/ISA.xml swizzle values simpler because we don't need to special-case the values that we don't emit. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33416>	2025-04-10 10:15:46 -07:00
Benjamin Lee	4abd75c7d1	panfrost: define bi_swizzle alias values Primary reason to do this is to make codegen using the swizzle names in bifrost/ISA.xml simpler. A secondary benefit is that dependent code can now use the swizzle name that matches the context, making things a little more readable. We may want to consider giving widens separate values later, so that va_lower_constants and bi_opt_constant_fold can fold them correctly, but I don't know of current bugs caused by this. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33416>	2025-04-10 10:15:46 -07:00
Lionel Landwerlin	06ad9a25e5	brw: fix Wa_22013689345 emission Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details 2 problems : - not detecting null destination correctly - applied too late using SHADER_OPCODE_MEMORY_FENCE, when lowering already happened Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34319>	2025-04-10 16:44:28 +00:00
Benjamin Lee	22fa3e88dd	panvk: advertise VK_KHR_float_controls2 This is all supported by the common nir code, no changes needed on our end. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Benjamin Lee	7612dc4713	panvk: advertise VK_KHR_shader_float_controls Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Benjamin Lee	95056fa75a	panvk/va: don't advertise independent denorm behavior Valhall supports all combinations of ftz/preserve denorm behavior between FP16 and FP32 except FP16=ftz, FP32=preserve. Because of this, we can't advertise independent denorm behavior. Even with INDEPENDENCE_NONE, it is still possible for shaders to set denorm behavior for one size and leave the other size unspecified. Previously we were defaulting to preserve for any unspecified size, but with FP16=ftz, we need to default unspecified FP32 to preserve. When advertising INDEPENDENCE_NONE, the CTS checks that the shaderDenormFlushToZeroFloat* and shaderDenormPreserveFloat* features are equal for all sizes, so we need to advertise the same supported denorm behavior for FP64 even though we don't support FP64 at all. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Benjamin Lee	b6406c179b	pan/bi: implement denorm behavior float controls On bifrost independent float controls are implementable, just potentially expensive because it requires scheduling FP16 and FP32 instructions in separate clauses. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Benjamin Lee	9737c1fa15	pan/bi: ignore ftz mode when scheduling int instructions This allows more efficient scheduling by putting a 16-bit int instruction in the same clause as a 32-bit float instruction even when the 16-bit and 32-bit float controls are different. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Benjamin Lee	08765d53c9	pan/bi: refactor bi_instr_ftz to allow dontcare FTZ states The current behavior is identical, but we can express that some instructions may be packed in either FTZ and no-FTZ clauses in the future. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Benjamin Lee	5bb85e965e	pan/va: preserve signed zero in f32->f16 conversions Using 'FADD.f32 x, +0' for f32->f16 conversions strips signed zero, which we can't do if we advertise shaderSignedZeroInfNanPreserveFloat16. Adding -0 instead preserves the original sign. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Fixes: `b63ef74e73` ("pan/bi: Stop using V2F32_TO_V2F16 on Valhall") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Benjamin Lee	239c6b833a	panfrost: implement float controls rounding mode Many float instructions do not have a rounding mode modifier, but all of the operations that are listed as requiring correct rounding in the vulkan spec are supported in hardware. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Benjamin Lee	6f68649400	pan/va: add roundmode modifier to additional instructions These are needed to implement VK_KHR_shader_float_controls rounding mode. Signed-off-by: Benjamin Lee <benjamin.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33660>	2025-04-10 16:21:09 +00:00
Lars-Ivar Hesselberg Simonsen	20c0d169e4	vk/sync: Fix execution only barriers With vkCmdPipelineBarrier, it's possible to specify a barrier with pipeline stages but without any memory barriers. These might not be practical, but are legal Vulkan code. Barriers like this are currently ignored in mesa, as we only convert barriers with passed memory barriers into vkCmdPipelineBarrier2. This commit adds handling of execution only barriers by converting them into a memory barrier without access masks. Fixes: `97f0a4494b` ("vulkan: implement legacy entrypoints on top of VK_KHR_synchronization2") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34187>	2025-04-10 15:28:22 +00:00
Ludvig Lindau	6393ebbdbb	panvk: Get flush_id once per submit Get flush_id once per command buffer in the submit and use it for all subqueues instead of getting a new flush_id for every subqueue. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34448>	2025-04-10 15:00:57 +00:00
Tapani Pälli	30d78dc942	mesa: various fixes for ClearTexImage/ClearTexSubImage Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fixes some upcoming CTS tests for texture clears. * some drivers will attempt to issue clears with zero range and hit asserts/crashes (spec clarification for negative values) * fix error thrown with negative values to match spec * fix cases for clearing generic compressed formats * fix negative case of using color format while having depth/stencil internalformat and vice versa Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34428>	2025-04-10 14:32:56 +00:00
Tapani Pälli	3bc016bb6c	mesa: clamp texbuf query size to MAX_TEXTURE_BUFFER_SIZE Fixes upcoming CTS test checking for clamping. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34428>	2025-04-10 14:32:56 +00:00
Boris Brezillon	24b1aa6c28	panvk/csf: Optimize read-only tile buffer access When the color/input attachment map is known at compile time, we can determine the set of read-only render targets and replace .wait by .wait_resource flows, in order to avoid read-after-read serialization. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:54 +00:00
Boris Brezillon	4f4ac56145	pan/va: Support relaxed waits on read-only render targets On Valhall we can optimize lower waits, which waits for both readers and writers, into resource_waits which only wait for writers, allowing threads accessing read-only resources to execute concurrently. Let's use that on LD_TILE instructions so we can optmize the read-only case. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	fbb2805575	panvk: Advertise KHR_dynamic_rendering_local_read support Now that we support local reads we can safely advertise KHR_dynamic_rendering_local_read. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	7a2b23b0bd	panvk: Skip BY_REGION barriers if we're in a render pass If we are in a render pass, the intra-draw synchronization happens through the FPK parameters, shader waits and draw dependencies, so we can safely skip the barrier in that case. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	bfd5ddbf32	panvk: Optimize input attachment loads when we can When we know the input attachment is also an active color attachment we can load the value from the tile buffer instead of going back to the texture. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	20275d6521	pan/bi: Introduce two intrinsics to support input attachment remapping In order to dynamically load the content of the tile buffer, we need to know the target (color, depth or stencil) and the conversion to apply. Let's define the load_input_attachment_{target,conv}_pan intrinsics so we can dissociate the logic lowering input attachment loads into load_converted_output_pan, and the part optimizing the shader when input attachment map is passed at compile time. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	244995e4af	panvk: Support color attachment remapping We take the color attachment remapping into account when emitting blend descriptors, and we make sure we re-emit those when this color attachment map is dirty. We also need to take the remapping into account when checking the render targets written by the fragment shader, hence the addition of a color_attachment_written_mask() helper. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	9d5d03bf78	panvk/jm: Move cmd_prepare_draw_sysvals() out of the layer loop The only sysval that changes is the layer_id, so let's call cmd_prepare_draw_sysvals() outside of the layer loop, and manually update the sysval there. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	fe21da08ed	pan/earlyzs: Support the shader ZS read-only case and its optimization on v10+ We are about to allow ZS tile buffer reads in panvk in order to support VK_KHR_dynamic_rendering_local_read, and this requires dealing with a new case in the early ZS logic. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Boris Brezillon	d2cd5ca609	panvk: Generate the earlyzs LUT at shader creation time Do what the gallium driver does and generate the LUT when creating the shader to avoid regenerating this LUT in the draw path. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00

... 21 22 23 24 25 ...

190082 commits