fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-26 23:18:12 +02:00

Author	SHA1	Message	Date
Rhys Perry	463e3643f2	nir: add and use block predecessor helpers Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40242>	2026-04-08 15:06:32 +00:00
Samuel Pitoiset	457eac7d66	nir: add new variable modes for the resource/sampler heap pointers These two new variable modes are used to relax restrictions on deref casts through because it's possible to cast different modes from the heap pointers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40768>	2026-04-07 18:55:49 +00:00
Job Noorman	d56d35aa76	nir/opt_varyings_bulk: add data parameter to optimize callback Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40651>	2026-04-03 08:18:08 +02:00
Marek Olšák	92cf9af827	nir: factor out nir_system_value_from_instr from nir_opt_varyings Acked-by: Pierre-Eric Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40664>	2026-04-02 14:38:56 +00:00
Samuel Pitoiset	c4e3380187	nir,treewide: add nir_image_intrinsic_type We have 4 image intrinsic variants now. This enum is useful for nir_rewrite_image_intrinsic() and it will be used by other NIR passes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40709>	2026-03-31 09:10:27 +00:00
Samuel Pitoiset	9d059a60f5	nir: introduce nir_descriptor_type for Vulkan like descriptors This removes a Vulkan dependency in NIR core. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40670>	2026-03-31 07:16:20 +00:00
Karol Herbst	b7ca34db13	nir: unvendor ac_nir_lower_sin_cos So we can use it for Nvidia. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40541>	2026-03-31 01:47:31 +02:00
Georg Lehmann	1b6ed1b34e	nir,radv: lower shadow compare gather to 16bit Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The output is 1.0 or 0.0 anyway, so there are no precision issues. For hardware that has v_fma_mix_f32, the inserted conversions should be free in most cases. Foz-DB Navi21: Totals from 1393 (0.68% of 205005) affected shaders: MaxWaves: 40612 -> 40660 (+0.12%) Instrs: 571239 -> 570266 (-0.17%); split: -0.19%, +0.02% CodeSize: 2933912 -> 2979304 (+1.55%); split: -0.00%, +1.55% VGPRs: 50504 -> 50256 (-0.49%) Latency: 9883143 -> 9879335 (-0.04%); split: -0.05%, +0.01% InvThroughput: 2591073 -> 2570721 (-0.79%); split: -0.79%, +0.00% VClause: 11600 -> 11551 (-0.42%); split: -0.43%, +0.01% SClause: 26644 -> 26641 (-0.01%) Copies: 31434 -> 30556 (-2.79%); split: -3.14%, +0.34% PreVGPRs: 41762 -> 41509 (-0.61%) VALU: 405533 -> 404655 (-0.22%); split: -0.24%, +0.03% SALU: 55576 -> 55575 (-0.00%); split: -0.02%, +0.02% Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40685>	2026-03-30 18:54:22 +00:00
Konstantin Seurer	b127c11be9	spirv,nir: Preserve more information about the descriptor type Descriptor heap mappings need the information to selectively apply mappings (descriptor type masks). Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40649>	2026-03-30 06:51:25 +00:00
Samuel Pitoiset	df515cfb5b	nir: make nir_variable::descriptor_set a 32-bit variable With descriptor heap there is no limit. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40649>	2026-03-30 06:51:25 +00:00
Lionel Landwerlin	302194a566	nir: improve deref_instr_get_variable So we can get through all the casting inserted by heaps. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40649>	2026-03-30 06:51:23 +00:00
Faith Ekstrand	e7e601f113	nir: Add tex sources for descriptor heaps We also add a new boolean which indicates that the texture op uses an embedded sampler. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40649>	2026-03-30 06:51:22 +00:00
Faith Ekstrand	f117b81435	nir: Add intrinsics for descriptor heaps Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40649>	2026-03-30 06:51:22 +00:00
Kenneth Graunke	0e143ae663	nir: Add nir_texop_resinfo_intel This is a combination of txs and query_levels in a single vec4 result. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40451>	2026-03-29 12:53:09 +00:00
Faith Ekstrand	60acd4da12	nir: Support primitive_id in lower_sysvals_to_varyings Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40512>	2026-03-25 03:11:56 +00:00
Daniel Schürmann	4ca0eb9f54	nir: validate that loop continue statements always link to continue constructs Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39942>	2026-03-21 07:42:55 +00:00
Connor Abbott	22a061fb91	nir: Use better calculation for alpha-to-coverage mask The old calculation depended on the sample count, and gave subpar results for 8x MSAA with standard sample locations. The new calculation is based on the Intel pass, with some changing of the constants so that the sample count is always proportional to alpha for 2xMSAA and 4xMSAA and the addition of rotating the sample mask based on the pixel. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39335>	2026-03-20 18:09:48 +00:00
Georg Lehmann	1626df7a90	nir: rework nir_alu_src_is_trivial_ssa to take an alu src Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40399>	2026-03-20 08:50:41 +00:00
Faith Ekstrand	f2f792996d	Revert "nir: Add a type parameter to nir_lower_point_size()" This reverts commit `6ee4ea5ea3`. Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38681>	2026-03-12 22:59:13 +00:00
Mike Blumenkrantz	e604a8f617	nir: fix nir_is_io_compact for mesh shaders cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37408>	2026-03-12 22:02:57 +00:00
Georg Lehmann	0d747eee88	nir: add no_signed_zero flag to io semantics Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40323>	2026-03-11 16:47:15 +00:00
Faith Ekstrand	5de5987678	nir,panfrost: Move lower_bool_to_bitsize to panfrost It's the only driver that uses the pass so it may as well go there. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40307>	2026-03-10 20:54:44 +00:00
Georg Lehmann	452025f75e	nir: add free bits in nir_io_semantics for future use Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40299>	2026-03-10 07:46:22 +00:00
Georg Lehmann	a25f00eaed	nir: merge xfb and xfb2 into one 64bit intrinsic index Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40299>	2026-03-10 07:46:22 +00:00
Georg Lehmann	abfd6a4df9	nir: don't assume indicies are always 32bit when accessing them as raw data Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40299>	2026-03-10 07:46:20 +00:00
Georg Lehmann	7c217e540c	nir: add a pass to optimize fp_math_ctrl Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40098>	2026-03-07 08:16:27 +01:00
Emma Anholt	2ec8ecd7de	nir: Do NIR_DEBUG=print under a lock. With most Vulkan engines doing multithreaded compiles, NIR_DEBUG=print has been a frustrating racy mess. Take a lock when we're doing per-pass printing, so that the output is coherent. This unfortunately single-threads the compiler process itself in that case, but when you're NIR_DEBUG=printing, that's probably not a big deal. An assert is introduced to make sure that nobody nests NIR_PASS() in a way that would break printing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40126>	2026-03-06 19:50:38 +00:00
Alyssa Rosenzweig	8a450fb0ff	nir/lower_subgroups: generalize vote lowering We currently have code to lower quad votes to a ballot. The same idea works for subgroup votes. Generalize the quad vote code and use it to lower vote_all/vote_eq for backends setting a new lower_vote option. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40074>	2026-02-25 17:29:29 +00:00
Alyssa Rosenzweig	8fb1d65426	nir: add nir_get_io_data_src This complements our existing nir_get_io_index_src helper. Most, but annoyingly not all, stores put their data source in source 0. Having a helper for this lets us reduce special casing in a bunch of random places. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39939>	2026-02-19 14:47:11 +00:00
Rhys Perry	f44de53586	nir: only set fp_math_ctrl if meaningful Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39809>	2026-02-18 14:04:22 +00:00
Marek Olšák	61a96be494	nir/lower_non_uniform_access: add an option not to lower tex & image queries AMD can do non-uniform queries. The RADV change will be in a separate commit. NFC for drivers. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39743>	2026-02-16 12:59:36 +00:00
Sagar Ghuge	1fb8435b77	nir: Add nir_resource_intel_internal entry Will use the load/store_ssbo with nir_resource_intel_internal later in this series. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35160>	2026-02-12 16:45:22 +00:00
Georg Lehmann	aa78083477	nir: make alu fp_math_ctrl helpers const No Foz-DB changes. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39641>	2026-02-10 18:42:03 +00:00
Georg Lehmann	63d199a01e	nir: remove special fp_math_ctrl rules All opcodes should now respect the nan/inf/sz preserving flags. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39641>	2026-02-10 18:42:02 +00:00
Daniel Schürmann	4ca7ee7bd7	nir/opt_load_store_vectorize: Allow to vectorize at most one entry of each type across blocks The idea is to initialize the vectorization table with one entry from the previous blocks if it's the same for all predecessors. In order to not speculatively load out-of-bounds, backends need to set a new bounds_checked_modes option indicating variable modes for which per-component bounds checks are supported. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39373>	2026-02-06 10:16:50 +00:00
Timothy Arceri	0410377b63	nir: make nir_add_inlinable_uniforms() private Hasn't been used externally since `e93592dc62` Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39664>	2026-02-05 23:19:28 +00:00
Timothy Arceri	257875034d	nir: make nir_collect_src_uniforms() private Hasn't been used externally since `e93592dc62` Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39664>	2026-02-05 23:19:28 +00:00
Karol Herbst	4add3959e9	nir: add BASE to nvidia memory intrinsics Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39525>	2026-02-03 22:23:50 +00:00
Karol Herbst	e779538ad2	nir: add nvidia IO intrinsics Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39525>	2026-02-03 22:23:50 +00:00
Georg Lehmann	575affaf48	nir/search: gather union of all fp_math_ctrl Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39616>	2026-01-31 15:30:25 +00:00
Kenneth Graunke	b844082017	nir: Add a round_up_components callback to load/store vectorization By default, load/store vectorization uses nir_round_up_components() to round up loads and possibly writemasked stores to the next valid NIR vector width. However, some backends may not support load/stores at all sizes. For example, older Intel supports only power-of-two vector widths. Newer Intel also supports vec2 and vec3, but not vec5/6/7. By providing a callback, backends can request promotion to their next supported memory load/store vector width. The existing "should we vectorize?" callback should continue to return false for unsupported vector widths (i.e. beyond the maximum supported). With this new callback, they do not need to say "no" to vectorization that would normally produce an unsupported count (e.g. vec5/6/7) but instead request that the component count be rounded up appropriately. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39250>	2026-01-27 16:08:36 +00:00
Kenneth Graunke	e23a83b786	nir: Add load/store vectorizer option for rounding up masked stores This adds a new option, round_up_store_components, which rounds up the number of components for stores that support writemasking to the next valid vector size. For example, vec4+vec2 stores would round up from 6 components (which wouldn't be supported) to a full supportable vec8 store, relying on writemasking to ensure the correct pieces are written. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39250>	2026-01-27 16:08:36 +00:00
Emma Anholt	e922c2cabc	nir,spirv: Add support for SPV_QCOM_image_processing. Initial work was done by Mark Collins, which I significantly rewrote. Signed-off-by: Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38559>	2026-01-27 02:00:40 +00:00
Connor Abbott	3208cc80b1	nir: Move is is_compact() out of unlower_io_to_vars Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39328>	2026-01-21 20:54:13 +00:00
Daniel Schürmann	598928d7e7	nir/loop_analyze: determine whether all control flow gets eliminated upon loop unrolling Totals from 17 (0.02% of 79839) affected shaders: (Navi48) MaxWaves: 241 -> 243 (+0.83%); split: +5.81%, -4.98% Instrs: 44198 -> 43786 (-0.93%); split: -8.19%, +7.26% CodeSize: 230284 -> 226900 (-1.47%); split: -10.55%, +9.08% VGPRs: 2152 -> 2524 (+17.29%); split: -3.90%, +21.19% Scratch: 718848 -> 0 (-inf%) Latency: 128977 -> 145720 (+12.98%); split: -2.12%, +15.10% InvThroughput: 206804 -> 254250 (+22.94%); split: -0.32%, +23.27% VClause: 1296 -> 1309 (+1.00%); split: -28.09%, +29.09% SClause: 835 -> 833 (-0.24%) Copies: 6284 -> 3630 (-42.23%); split: -44.51%, +2.28% Branches: 1003 -> 961 (-4.19%) PreSGPRs: 1003 -> 996 (-0.70%); split: -1.20%, +0.50% PreVGPRs: 1510 -> 2130 (+41.06%) VALU: 23577 -> 24309 (+3.10%); split: -6.26%, +9.37% SALU: 5875 -> 5688 (-3.18%); split: -6.26%, +3.08% VMEM: 3679 -> 3001 (-18.43%); split: -33.27%, +14.84% SMEM: 1632 -> 1631 (-0.06%) VOPD: 23 -> 24 (+4.35%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38659>	2026-01-21 14:20:06 +00:00
Alyssa Rosenzweig	4e59199cbb	nir: add nir_is_shared_access helper This is helpful to identify shared mem access for writing more generic code operating on nir intrinsics. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39219>	2026-01-09 20:51:12 +00:00
Georg Lehmann	a706769a0b	nir: move exact bit to nir_fp_math_control Unifies nir per instruction float control. In the future this can be split into contract/reassoc/transform like SPIR-V. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (except SPIR-V) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39103>	2026-01-07 09:40:57 +00:00
Georg Lehmann	eb4737a1dd	nir: add nir_alu_instr_is_exact helper Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39103>	2026-01-07 09:40:57 +00:00
Georg Lehmann	b70294b91f	nir: document signed zero, inf, nan preserve flags Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39103>	2026-01-07 09:40:56 +00:00
Timur Kristóf	2ecb7a9e18	nir: Add pass to lower workgroup size Lowers a shader to use a smaller workgroup to do the same work, while it will still appear as a bigger workgroup to applications. To achieve this, the pass augments the CF of the shader so that each real subgroup will execute two or more logical subgroups. A logical subgroup represents what the application can observe as a subgroup. The size of a logical subgroup is the same as a real subgroup. Only one logical subgroup may be executed per real subgroup at the same time. This ensures that all subgroup operations keep working and the subgroup invocation ID stays the same. - When the CF contains barriers, we can't just repeat the code and we need to augment each CF node individually so that they are aware of logical subgroups. - In case parts of the CF don't contain any barriers, we can simply repeat and predicate that CF for each logical subgroup. It is technically not necessary to implement this strategy, but in practice it helps reduce the amount of branches in the shader and therefore improves compile times. The pass is mainly intended for working around HW limitations, for example when the HW has an upper limit on the workgroup size or doesn't support workgroups at all, but the API requires a certain minimum. Notes: - Only applicable to shader stages that use workgroups - Hits an assertion when called on smaller workgroups - Always flattens workgroup size to 1D - Creates local variables - Does not change subgroup size - Variable workgroup size not supported yet, maybe later Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Anna Maniscalco <anna.maniscalco2000@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37985>	2026-01-02 13:33:54 -06:00

1 2 3 4 5 ...

1710 commits