fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 02:18:10 +02:00

Author	SHA1	Message	Date
Olivia Lee	e9ca69b807	panvk/csf: take merged workgroups into account for divergence Merging workgroups affects divergence analysis, since subgroups can now contain extra threads from other workgroups. We already have divergence analysis flags to handle this case, but since the compiler options memory is static, we need to define an entirely separate option set for merged vs non-merged workgroups. In gallium, we don't have to switch options because opengl requires uniformity over the entire dispatch in application shaders. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	c42e124a66	pan/va: don't merge workgroups when subgroups are used Vulkan guarantees that all subgroup invocations will be part of the same workgroup, so we need to disable merging workgroups for shaders where the subgroup layout is observable. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	a0f6c6d84d	pan/va: move allow_merging_workgroups decision to drivers In panvk, we will need to decide whether we are merging workgroups early in shader compilation, before calling nir_lower_non_uniform_access. This is because nonuniform lowering introduces new subgroup intrinsics which would otherwise inhibit workgroup merging, and because the set of instructions that need to be lowered may be different with merged workgroups. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	1f75299ebb	pan/va: weaken barrier requirements for allow_merging_workgroups The only requirement for barriers is that the hardware doesn't support allow_merging_workgroups with actual BARRIER instructions. We only emit these for workgroup execution barriers though, so are safe to merge workgroups when the shader uses memory barriers or subgroup execution barriers. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Caterina Shablia <caterina.shablia@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Lars-Ivar Hesselberg Simonsen	affcc7fe54	pan/va/isa: Src for X16_TO* takes lane, not swizzle Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details While the swizzle code was producing the correct encoding, the disassembly was slightly weird and swz_16 required an extra argument that was always "false". Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40865>	2026-04-09 11:23:36 +00:00
Christian Gmeiner	41b34334a6	panvk: Advertise VK_EXT_attachment_feedback_loop_dynamic_state The Vulkan runtime provides the dynamic state infrastructure via vk_common_CmdSetAttachmentFeedbackLoopEnableEXT(). This builds on the attachment feedback loop layout support. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40498>	2026-04-09 12:30:21 +02:00
Christian Gmeiner	a2d9d2b5f8	panvk: Advertise VK_EXT_attachment_feedback_loop_layout PanVK treats image layouts as no-ops and already disables Forward Pixel Kill when the same render target is both read and written. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40498>	2026-04-09 12:30:19 +02:00
Lars-Ivar Hesselberg Simonsen	6cdc3cc1d2	pan: Add support for 64 bit gpu_id While not currently required, it will be for future GPUs. Also cleans up gpu_id as parameter to some functions that didn't use it. Reviewed-by: Aksel Hjerpbakk <aksel.hjerpbakk@arm.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40610>	2026-04-09 09:49:20 +00:00
Lars-Ivar Hesselberg Simonsen	f181cc5bca	pan/model: Redo gpu_prod_id in the model The current implementation is a bit awkward and becomes tricky when adding support for 64 bit gpu_ids. Rather than keeping a mask of bits in gpu_id to compare with the stored gpu_prod_id value, rely on macro functions for fetching the information required from gpu_id and creating the comparison value. Reviewed-by: Aksel Hjerpbakk <aksel.hjerpbakk@arm.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40610>	2026-04-09 09:49:20 +00:00
Lars-Ivar Hesselberg Simonsen	1f0370616a	pan: Centralize preload registers Rather than having preload registers hardcoded over multiple files, gather them in one place with an enum abstraction. This should simplify updates to the preload registers. Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40643>	2026-04-08 20:30:32 +00:00
Ryan Zhang	eaf68f744b	panvk/csf: rework IR descriptor handling for tiler OOM Replace the old partial IR state snapshotting with prebuilt per-pass IR descriptor buffers and a queue-attached scratch FBD. 1. emit FBD+DBD+RTDs for each IR-pass+layer 2. store it into some side-band GPU buffer that's passed around through the queue context 3. in the execption handler, copy FBD+DBD+RTDs from the IR desc buffer to some space that's attached to the queue context and not the cmdbuf Fixes: `46f611c9` ("panvk: Also use resolve shaders for Z/S") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Ryan zhang <ryan.zhang@nxp.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40625>	2026-04-08 11:33:15 +00:00
Jakob Sinclair	5e2c951ba6	pan: add sigil to SSA values for debug printing Make it easier to distinguish between SSA values and constant values when debug printing BIR. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40711>	2026-04-08 07:41:56 +00:00
Jakob Sinclair	a24364e418	pan: move discard/kill_ssa flag after index for debug prints Match the style of the disassembler when debug printing BIR. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40711>	2026-04-08 07:41:56 +00:00
Jakob Sinclair	c3d56e9f82	pan: improve debug printing of multiple registers This makes printing of BIR in SSA-form more similar to NIR and after register allocation, it shows consecutive registers for operands reading/writing to more than one register. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40711>	2026-04-08 07:41:56 +00:00
Faith Ekstrand	0d5cae97b7	pan/bi: Vectorize 8-bit ops up to v4i8 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:25 +00:00
Faith Ekstrand	15d5675e8e	pan/bi: Pack 8-bit vec2s We used to splat out 8-bit vec2s to 16-bit by repeating both 8-bit halves twice with the B0011 swizzle. I think the original idea here was that 16-bit swizzles were more widely available in the hardware and that this would make swizzling things easier. The problem is that nothing actually knows that the value is half-repeated like this so nothing knows it can upgrade a swizzle from B0022 to B0123 (H01). So instead we get a bunch of B0022 swizzles, which nothing supports. We can shave a lot of instructions if we just stop trying to be so clever and instead repeat the whole thing with a B0101 swizzle. The only real issue here is that v2[fiu]8_to_v2[fiu]16 needs a B0011 swizzle, which we have to apply on-the-fly. Fortunately, any swizzle can be composed with B0011. Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:25 +00:00
Faith Ekstrand	db8cb73b34	pan/bi: Add bytewise copy propagation This adds a new bytewise copy propagation pass which chews through MKVEC and SWZ instructions. The word-based copy propagation pass only existed to chew through SPLIT/COLLECT but MKVEC is COLLECT for bytes and we had nothing to help with that. This is actually two passes in one: Byte propagation and swizzle propagation. Any time we see a MKVEC, we look at its sources only as bytes and chase individual bytes back, through other MKVEC and SWZ, to their generating instruction and make the MKVEC only consume the original bytes. If the MKVEC happens to construct something that's just a swizzle of another def (this is fairly common), we record that as well. The idea here is that a lot of MKVEC just consume other MKVEC and we can get rid of the intermediate ones or even the whole chain if it just ends up being a swizzle in the end. For SWZ instructions, we first look at them like a MKVEC of the individual bytes they consume. If that doesn't yield a single swizzled word, we then crawl through the words table, just accumulating swizzles. This gives us the best (closest to the generating instructions) coherent word. We could also replace SWZ with MKVEC and just do byte propagation but MKVEC is often 2 instructions whereas SWZ is often one (or folded into a source) so this is probably the better balance. Finally, we not only replace the MKVEC and SWZ instructions but we also attempt to propagate swizzles into individual ALU op sources. For v4i8 ops, this often fails since the full generality isn't always available but for fp16, we can almost always fold the swizzle into the consuming instruction. Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:25 +00:00
Faith Ekstrand	a4e9002660	pan/bi: Emit MKVEC directly Now that we have bi_lower_mkvec_swz(), there's no need to be so careful in the NIR -> bi translation. We can just emit MKVEC and move on. The lowering pass will sort out the detaisl. Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:25 +00:00
Faith Ekstrand	b9e33c7897	pan/bi: Stop lowering swizzles on mkvec and swz The new lowering can handle all the swizzle cases and is generally better at it than swizzle lowering. Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:25 +00:00
Faith Ekstrand	ed83d46d4e	pan/bi: Always use SWZ.v4i8 in bi_lower_swizzle() Now that we lower it, there's no advantage to one over the other at the time this pass runs. Also, the is_8bit check was technically wrong since it checks destination sizes, not source sizes. It's a lot safer to just use SWZ.v4i8 and let the lowering pass do the right thing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:25 +00:00
Faith Ekstrand	bc7053a976	pan/bi: Add a lowering pass for MKVEC and SWZ Instead of trying very carefully in the bifrost emit code to only generate valid MKVEC for the target hardware, this adds a lowering pass which is capable of lowering any MKVEC or SWZ we can throw at it. Even if the swizzle isn't supported or if it's a MKVEC.v4i8 on Valhall, we'll lower it to something that does work on that platform. This frees up the rest of the compiler so we can add and modify MKVEC and SWZ at-will and never have to worry about hardware generation details. Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:24 +00:00
Faith Ekstrand	0edceaf383	pan/bi: Add a bi_op_supports_swizzle() helper Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:24 +00:00
Faith Ekstrand	a8879daf9c	pan/bi: Add a bi_try_compose_swizzles() helper Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:24 +00:00
Faith Ekstrand	3b728cb613	pan/bi: Add a bi_swizzle_from_byte_channels() helper Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:24 +00:00
Faith Ekstrand	4912bda122	pan/bi: Return void from bi_swizzle_to_byte_channels() Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:24 +00:00
Faith Ekstrand	e637130794	pan/bi: Use bi_half() for texture MS indices It feeds into a v2i16 so it needs to be 16-bit. Fixes: `ae79f6765a` ("pan/bi: Emit Valhall texture instructions") Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:23 +00:00
Faith Ekstrand	77f9cbd0c2	pan/bi: Compose swizzles in bi_half() and bi_byte() At least bi_half() has the decency to assert if the swizzle isn't BI_SWIZZLE_H01 to start with but bi_byte() did an irrelevant assert and then overwrote the swizzle with BI_SWIZZLE_B<lane> regardless of what was there before. In a lot of cases, this doesn't matter but we use both in translating NIR to BI on things that may have already been swizzled so we need to do the composition. Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:23 +00:00
Faith Ekstrand	342e9ac7e8	pan/bi: Add a bi_swizzle_from_half() helper Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:23 +00:00
Faith Ekstrand	05c5e52054	pan/bi/ra: Allow offsets on tied sources The only real requirement here is that the destination offset is zero and that the destination is big enough to hold the source. The source offset doesn't matter. Fixes: `bc17288697` ("pan/bi: Lower split/collect before RA") Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:23 +00:00
Faith Ekstrand	538b5c411e	pan/bi: Delete a few instruction encodings The non-trivial non-replicate swizzles on IADD.v4x8 and ISUB.v4x8 are either documented wrong or broken in hardware. Instead of swizzling b0101 and b2323, they swizzle b0011 and b2233 on G52. This is either a hardware bug or an issue with documentation. In either case, it's probably best not to trust it. Those swizzles aren't all that useful anyway. We also weren't using any of them before (or they'd have broken) so this isn't a performance regression. Cc: mesa-stable Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:23 +00:00
Faith Ekstrand	3fffcf4338	pan/bi: Support more swizzle aliases in the bifrost pack code Fixes: `82328a5245` ("pan/bi: Generate instruction packer for new IR") Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40720>	2026-04-06 21:39:23 +00:00
Valentine Burley	8444723c6a	pan/ci: Document recent flakes and timeouts Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details These have blocked Marge multiple times recently. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40779>	2026-04-03 14:25:39 +00:00
Lorenzo Rossi	a45a7bbf44	pan/compiler: Split bifrost_nir.c from bifrost_compile.c Before this, everything was in the giant bifrost_compile.c file, now preprocess, optimize and postproces are in their own "small" bifrost_nir.c. I also removed some dead functions and moved the passes closer to their usage, (ex, passes only used in preprocess are now just before preprocess). Otherwise it's all the same code we had before. Signed-off-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40717>	2026-04-01 18:14:32 +00:00
Lorenzo Rossi	7d8b2c4128	pan/compiler: Split bi_debug.c from bifrost_compile.c Signed-off-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40717>	2026-04-01 18:14:32 +00:00
Lorenzo Rossi	f19b9eddb6	pan/compiler: Replace bi_lower_ldexp16 with algebraic pass Signed-off-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40717>	2026-04-01 18:14:31 +00:00
Christian Gmeiner	6333bf359b	panvk: Advertise VK_KHR_shader_untyped_pointers on v9+ Enable the VK_KHR_shader_untyped_pointers extension and the shaderUntypedPointers feature for Valhall and newer (v9+). Bifrost (v6/v7) has issues with 8-bit vector loads through untyped pointers combined with 16-bit storage, so restrict the extension to architectures where it's fully functional. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40457>	2026-03-31 11:57:10 +00:00
Christian Gmeiner	37c8ff5416	panvk: Lower memcpy derefs To make sure all memcpy derefs are lowered before explicit IO lowering. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40457>	2026-03-31 11:57:10 +00:00
Samuel Pitoiset	c4e3380187	nir,treewide: add nir_image_intrinsic_type We have 4 image intrinsic variants now. This enum is useful for nir_rewrite_image_intrinsic() and it will be used by other NIR passes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40709>	2026-03-31 09:10:27 +00:00
Samuel Pitoiset	9d059a60f5	nir: introduce nir_descriptor_type for Vulkan like descriptors This removes a Vulkan dependency in NIR core. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40670>	2026-03-31 07:16:20 +00:00
Samuel Pitoiset	ee7c6e3752	treewide: cleanup non-existent descriptor types from nir_intrinsic_desc_type() The only possible values are: - VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER - VK_DESCRIPTOR_TYPE_STORAGE_BUFFER - VK_DESCRIPTOR_TYPE_ACCELERATION_STRUCTURE_KHR Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40670>	2026-03-31 07:16:20 +00:00
Ryan Mckeever	9c9f5f69fd	panvk: enable fragmentStoresAndAtomics for Bifrost Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39207>	2026-03-31 02:18:24 +00:00
Ryan Mckeever	cf8a63c578	panvk: lower multisampled images before nir_lower_descriptors This will allow image_deref_size intrinsics generated in pan_nir_lower_image_ms to be lowered in nir_lower_descriptors. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39207>	2026-03-31 02:18:24 +00:00
Yiwei Zhang	a2e42eff52	ci/panvk: update expectations with new flakes Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40682>	2026-03-28 20:16:09 +00:00
Yiwei Zhang	73c9d35644	panvk: hide swapchainMaintenance1 behind WSI guard Fixes: `9ec387efb1` ("panvk: advertise wsi maintenance extensions") Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40682>	2026-03-28 20:16:09 +00:00
Olivia Lee	8d5ba04e65	panvk/csf: use different resource registers for precomp vs user dispatch Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This allows us to avoid dirtying all of the state for user compute dispatches when we run a precomp shader. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37970>	2026-03-28 03:53:41 +00:00
Lorenzo Rossi	c0e0591999	pan/compiler: Replace frag_coord_zw_pan with var_special_pan Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Just a bit cleaner, and we can unify point size too. Signed-off-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40677>	2026-03-27 19:23:02 +00:00
Lorenzo Rossi	5be2b03b88	pan/compiler: Add bound assert on emit_split_i32 This could've saved me a lot of time debugging stack corruption. Signed-off-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40677>	2026-03-27 19:23:02 +00:00
Christian Gmeiner	32ca98a26e	panvk: Advertise VK_EXT_shader_atomic_float Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Expose float32 atomic exchange support for buffer, shared, and image operations on all architectures. The existing axchg instruction is type-agnostic, so no compiler changes are needed. Image atomics are already lowered to global atomics via nir_lower_image_atomics_to_global. Also add R32_FLOAT to the STORAGE_IMAGE_ATOMIC format feature flag so image atomic operations are accepted for r32f images. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40506>	2026-03-26 21:28:49 +00:00
Lorenzo Rossi	245d54397d	pan/compiler: Make lower_vs_outputs write needs_extended_fifo Signed-off-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40537>	2026-03-26 20:53:21 +00:00
Lorenzo Rossi	445a22acbd	pan/compiler: Group outputs in lower_vs_outputs Previously bifrost_nir_lower_shader_output grouped outputs in separate if blocks and made a best-effort attempt to group them together. This also assumed that pan_nir_lower_store_component wrote each output only once and that nir_lower_io_vars_to_temporaries pulled them out of any control flow. Now all of these are handled by the new pan_nir_lower_vs_outputs pass that handles write masks, control flow, per_view and grouping for IDVS. This makes the overall dependencies much simpler, ensures that the stores are grouped in the same ifs and should be more robust. Signed-off-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40537>	2026-03-26 20:53:21 +00:00

1 2 3 4 5 ...

7650 commits