fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 07:08:04 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	9d82888383	brw: lower 16-bit mulh Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40877>	2026-04-10 09:16:42 +00:00
Daniel Schürmann	8cb8c710fb	aco: remove remaining occurences of block_kind_continue It has no purpose anymore. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40628>	2026-04-10 08:51:39 +00:00
Daniel Schürmann	74661ccec2	aco/lower_branches: remove handling of block_kind_continue Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40628>	2026-04-10 08:51:39 +00:00
Daniel Schürmann	7f0709cff5	aco/opt_value_numbering: remove handling of block_kind_continue Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40628>	2026-04-10 08:51:39 +00:00
Daniel Schürmann	a8c4b9f100	aco/lower_phis: remove handling of block_kind_continue Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40628>	2026-04-10 08:51:39 +00:00
Daniel Schürmann	16396f2ce6	aco/insert_exec_mask: remove handling of loop continues Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40628>	2026-04-10 08:51:39 +00:00
Daniel Schürmann	495c7271a3	aco/isel: remove handling of nir_jump_continue Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40628>	2026-04-10 08:51:39 +00:00
Daniel Schürmann	5e89be331f	aco/lower_branches: Fix try_rotate_latch_block() Found by inspection. Fixes: `97f095f6e0` ('aco/lower_branches: Add try_rotate_latch_block() optimization') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40628>	2026-04-10 08:51:39 +00:00
Daniel Schürmann	60b3e5b3f0	aco/lower_branches: Don't remove branches which jump over loops Entering a loop with empty exec mask might lead to not be able to execute the break condition and lead to infinite loops. Totals from 81 (0.04% of 202440) affected shaders: (Navi48) Instrs: 3040566 -> 3040716 (+0.00%) CodeSize: 17506768 -> 17507188 (+0.00%) Latency: 16342966 -> 16345166 (+0.01%) InvThroughput: 3112932 -> 3113286 (+0.01%) Branches: 82229 -> 82365 (+0.17%) Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40628>	2026-04-10 08:51:39 +00:00
Julia Zhang	373498bf7e	radv/amdgpu: handle DISCARDABLE flag in get_flags_from_fd Map the kernel alloc_flag AMDGPU_GEM_CREATE_DISCARDABLE to RADEON_FLAG_DISCARDABLE in function radv_amdgpu_bo_get_flags_from_fd. Signed-off-by: Julia Zhang <Julia.Zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40879>	2026-04-10 08:24:56 +00:00
Tapani Pälli	c5bfa688b4	intel/dev: update mesa_defs.json from workaround database This updates 14024997852 with BMG and brings in media WA 16021867713. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40881>	2026-04-10 09:50:41 +03:00
Marek Olšák	a7c63ae6fa	amd: switch to new packet definitions for all packets Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The new definitions have their numbers offset by 1 (e.g. S_580 -> S_581). The remaining old definitions are adjusted to match that. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40588>	2026-04-10 03:42:45 +00:00
Marek Olšák	30f8bbd97b	amd/packets: add disable_wr_confirm alias to dis_wc Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40588>	2026-04-10 03:42:45 +00:00
Marek Olšák	e281b7b653	amd/packets: remove the underscore between opcode number and word index, use %x we are more used to this format Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40588>	2026-04-10 03:42:45 +00:00
Marek Olšák	2aa9ec5018	amd/packets: fix the size of 1-bit bitfields Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/15137 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40588>	2026-04-10 03:42:44 +00:00
Alexander Koskovich	f560760b27	freedreno/common: add support for the Adreno 810 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Add support for the Adreno 810 found on the SM7635 (milos). Signed-off-by: Alexander Koskovich <akoskovich@pm.me> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40613>	2026-04-10 01:24:59 +00:00
Faith Ekstrand	432a298f67	pan/bi: Vectorize more conversions Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:08:40 -04:00
Faith Ekstrand	37dcfcc6d0	pan/bi: Handle vector 16-bit extract_[ui]8 The old implementation only worked for 16-bit because we assumed scalar so we could stomp the whole destination as if it was 32-bit. This version works for v2i16. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:08:40 -04:00
Faith Ekstrand	91e6507665	nir: Add a nir_alu_src_comp_as_uint() helper Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:08:40 -04:00
Faith Ekstrand	567bc7a8df	pan/bi: Simplify extract_i8 handling Now that bi_byte() does the right thing, we can just use it and not worry about the rest. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:08:40 -04:00
Faith Ekstrand	a88e724b6e	pan/nir: Use minimum-width constants instead of scalar Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:08:39 -04:00
Faith Ekstrand	f138d13672	pan/nir: Stop being so conservative about phi scalarizing Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:08:25 -04:00
Faith Ekstrand	eca0575069	pan/nir: Stop doing manual optimization after resize_varying_io We call bi_optimize_nir() a few lines later. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:07:53 -04:00
Faith Ekstrand	c8f61b6b0e	pan/bi: Handle arbitrary size constants This isn't hard to do and it gives us a lot more flexibility in what NIR we can consume. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:05:21 -04:00
Faith Ekstrand	0e5626d717	pan/bi: Use nir_src_is_zero() Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:05:20 -04:00
Faith Ekstrand	fb347b8458	nir: Add a couple is_zero() helpers Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:05:20 -04:00
Faith Ekstrand	fd5c6d1223	pan/bi: Support all the swizzles in the packer Add asserts this time that we don't miss any and that the buckets actually match the enum in bifrost/compiler.h. Fixes: `82328a5245` ("pan/bi: Generate instruction packer for new IR") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:05:20 -04:00
Faith Ekstrand	ab285efd1b	pan/bi: Add BI_SWIZZLE_NONE Fixes: `82328a5245` ("pan/bi: Generate instruction packer for new IR") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:05:20 -04:00
Faith Ekstrand	48b2e6b551	pan/bi: Delete BI_SWIZZLE_1123 It appears nowhere so I don't know why we have it in the enum. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:05:20 -04:00
Kenneth Graunke	0b99c88337	nir, brw: lower scratch in NIR Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This will let us share a common scratch swizzling between brw and jay. Changes by Ken: - Use an immediate SIMD width when known so we don't need to re-lower - Switch to load_simd_width_intel because it may not match info->api_subgroup_size on Vulkan without VK_EXT_subgroup_size_control - Stop using DWord Scattered Write messages for scratch. These take an offset in DWords, and our offsets are now always in bytes. This also means that we no longer create MEMORY_OPCODE_* IR with inconsistent units of either bytes or dwords. Yikes. We use byte scattered messages now. fossil-db stats on Battlemage: Instrs: 500477504 -> 500450056 (-0.01%); split: -0.01%, +0.00% CodeSize: 7807432368 -> 7806786192 (-0.01%); split: -0.01%, +0.00% Cycle count: 62404008370 -> 62398437734 (-0.01%); split: -0.01%, +0.00% Fill count: 546690 -> 546695 (+0.00%); split: -0.00%, +0.00% Max live registers: 141257956 -> 141258100 (+0.00%); split: -0.00%, +0.00% Non SSA regs after NIR: 72350283 -> 72336544 (-0.02%) Totals from 99 (0.01% of 1581969) affected shaders: Instrs: 366593 -> 339145 (-7.49%); split: -7.58%, +0.09% CodeSize: 6425936 -> 5779760 (-10.06%); split: -10.06%, +0.00% Cycle count: 2412009876 -> 2406439240 (-0.23%); split: -0.26%, +0.03% Fill count: 19675 -> 19680 (+0.03%); split: -0.02%, +0.04% Max live registers: 17600 -> 17744 (+0.82%); split: -0.09%, +0.91% Non SSA regs after NIR: 37894 -> 24155 (-36.26%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40843>	2026-04-09 21:02:16 +00:00
Alyssa Rosenzweig	140616d26a	brw: scalarize even 64-bit scratch access No, I don't know how this worked before, thanks for asking. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40843>	2026-04-09 21:02:16 +00:00
Alyssa Rosenzweig	15b11635a2	brw: Move intel_nir_opt_peephole_imul32x16 later in compilation (Split by Ken out of a patch authored by Alyssa.) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40843>	2026-04-09 21:02:16 +00:00
Kenneth Graunke	e5598166b0	brw: Have brw_nir_apply_key call brw_nir_lower_simd for all stages brw_nir_apply_key typically knows the dispatch width (it's fixed for geometry stages, and we clone the NIR for compute and mesh shaders). For compute/mesh, this was the very next thing called. For the others, if we know the width, there's no reason not to lower it. Scratch lowering will start using load_simd_width_intel soon, so we need it to work in all stages. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40843>	2026-04-09 21:02:16 +00:00
Kenneth Graunke	765d74eebe	brw: Set nir->info.{min,max}_subgroup_size in brw_nir_apply_key This records the actual SIMD width we selected for the shader, in all cases except fragment shaders, where we don't know it yet. MR 37258 notes that "Backends can update [these fields] when they make new decisions about the subgroup size" - which is what we now do. Note that nir->info.api_subgroup_size may be different than min/max subgroup size on Vulkan prior to SPV1.6/VK_EXT_subgroup_size_control, so we do not alter that. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40843>	2026-04-09 21:02:16 +00:00
Kenneth Graunke	d7d2d7aceb	brw: Support load_simd_width_intel for fragment shaders This lets us emit NIR code based on the SIMD size. For non-fragment stages, we'll replace it with a constant and optimize, but for FS, we delay it until the backend. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40843>	2026-04-09 21:02:16 +00:00
Kenneth Graunke	cac9f670d1	intel/compiler: Use nir_static_workgroup_size helper Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40843>	2026-04-09 21:02:16 +00:00
Connor Abbott	82b3db7e06	tu: Enable multiviewGeometryShader Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40153>	2026-04-09 20:34:58 +00:00
Connor Abbott	73ab56fd2e	tu: Lower maxMultiviewViewCount to 6 With multiview, the HW has to dispatch at least ViewCount * 6 fibers per primitive, since there are up to 6 VS threads per primitive. The HW can launch multiple GS waves per VS wave but one VS wave must contain the entire primitive. With ViewCount = 16 there are 96 fibers per primitive, which is more than we can launch in one wave. To fix this, lower the maximum view count. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40153>	2026-04-09 20:34:58 +00:00
Connor Abbott	3433e53da7	tu: Fill GS/DS ViewID register fields Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40153>	2026-04-09 20:34:58 +00:00
Connor Abbott	f497e3913b	tu: Adjust multiview lowering for GS When there is a GS, run multiview lowering for the VS and multiply the per-primitive varying stride by the view count since all outputs are now per-view. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40153>	2026-04-09 20:34:58 +00:00
Connor Abbott	be84cb6211	ir3: Support multiview in GS lowering With GS+multiview, the VS will loop over each view in the shader while each GS invocation only corresponds to a single view. Varyings for each view will be stored next to each other in local memory. Implement view index calculations when lowering VS outputs/GS inputs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40153>	2026-04-09 20:34:58 +00:00
Connor Abbott	bc72ef2ee9	ir3: Implement ViewIndex for GS For GS, the ViewIndex is passed through from the DS/VS in a similar manner to PrimID. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40153>	2026-04-09 20:34:58 +00:00
Connor Abbott	2d4bb4cdc6	freedreno: Name GS/DS ViewID register fields Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40153>	2026-04-09 20:34:58 +00:00
Olivia Lee	31ddfe26eb	panfrost: don't try to emit varying shader stats on v12+ Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details On v12+, IDVS no longer has separate position and varying variants, so we only need to emit stats for one binary. Attempting to emit stats for the nonexistent varying shader breaks shader-db. Fixes: `7819b103fa` ("pan/bi: Add support for IDVS2 on Avalon") Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40810>	2026-04-09 18:21:12 +00:00
Olivia Lee	43b85b151b	panvk/csf: enable allow_merging_workgroups when possible Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Now that all of the additional cases are handled, we can hook up the allow_merging_workgroups flag in panvk. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Caterina Shablia <caterina.shablia@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	a5a3036972	panvk/csf: lower divergent values introduced by merged workgroups Mali does not support divergent operands in some cases, and we are already using lower_non_uniform_access to handle this for descriptor indexing. We can extend this to handle merged workgroups by just tagging every intrinsic as nonuniform and then letting divergence analysis sort out which ones can actually be nonuniform in opt_non_uniform_access. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	e9ca69b807	panvk/csf: take merged workgroups into account for divergence Merging workgroups affects divergence analysis, since subgroups can now contain extra threads from other workgroups. We already have divergence analysis flags to handle this case, but since the compiler options memory is static, we need to define an entirely separate option set for merged vs non-merged workgroups. In gallium, we don't have to switch options because opengl requires uniformity over the entire dispatch in application shaders. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	c42e124a66	pan/va: don't merge workgroups when subgroups are used Vulkan guarantees that all subgroup invocations will be part of the same workgroup, so we need to disable merging workgroups for shaders where the subgroup layout is observable. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	a0f6c6d84d	pan/va: move allow_merging_workgroups decision to drivers In panvk, we will need to decide whether we are merging workgroups early in shader compilation, before calling nir_lower_non_uniform_access. This is because nonuniform lowering introduces new subgroup intrinsics which would otherwise inhibit workgroup merging, and because the set of instructions that need to be lowered may be different with merged workgroups. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	1f75299ebb	pan/va: weaken barrier requirements for allow_merging_workgroups The only requirement for barriers is that the hardware doesn't support allow_merging_workgroups with actual BARRIER instructions. We only emit these for workgroup execution barriers though, so are safe to merge workgroups when the shader uses memory barriers or subgroup execution barriers. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Caterina Shablia <caterina.shablia@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00

1 2 3 4 5 ...

220928 commits