fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 07:08:04 +02:00

Author	SHA1	Message	Date
Olivia Lee	31ddfe26eb	panfrost: don't try to emit varying shader stats on v12+ On v12+, IDVS no longer has separate position and varying variants, so we only need to emit stats for one binary. Attempting to emit stats for the nonexistent varying shader breaks shader-db. Fixes: `7819b103fa` ("pan/bi: Add support for IDVS2 on Avalon") Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40810>	2026-04-09 18:21:12 +00:00
Olivia Lee	43b85b151b	panvk/csf: enable allow_merging_workgroups when possible Now that all of the additional cases are handled, we can hook up the allow_merging_workgroups flag in panvk. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Caterina Shablia <caterina.shablia@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	a5a3036972	panvk/csf: lower divergent values introduced by merged workgroups Mali does not support divergent operands in some cases, and we are already using lower_non_uniform_access to handle this for descriptor indexing. We can extend this to handle merged workgroups by just tagging every intrinsic as nonuniform and then letting divergence analysis sort out which ones can actually be nonuniform in opt_non_uniform_access. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	e9ca69b807	panvk/csf: take merged workgroups into account for divergence Merging workgroups affects divergence analysis, since subgroups can now contain extra threads from other workgroups. We already have divergence analysis flags to handle this case, but since the compiler options memory is static, we need to define an entirely separate option set for merged vs non-merged workgroups. In gallium, we don't have to switch options because opengl requires uniformity over the entire dispatch in application shaders. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	c42e124a66	pan/va: don't merge workgroups when subgroups are used Vulkan guarantees that all subgroup invocations will be part of the same workgroup, so we need to disable merging workgroups for shaders where the subgroup layout is observable. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	a0f6c6d84d	pan/va: move allow_merging_workgroups decision to drivers In panvk, we will need to decide whether we are merging workgroups early in shader compilation, before calling nir_lower_non_uniform_access. This is because nonuniform lowering introduces new subgroup intrinsics which would otherwise inhibit workgroup merging, and because the set of instructions that need to be lowered may be different with merged workgroups. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Olivia Lee	1f75299ebb	pan/va: weaken barrier requirements for allow_merging_workgroups The only requirement for barriers is that the hardware doesn't support allow_merging_workgroups with actual BARRIER instructions. We only emit these for workgroup execution barriers though, so are safe to merge workgroups when the shader uses memory barriers or subgroup execution barriers. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Caterina Shablia <caterina.shablia@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38586>	2026-04-09 17:53:46 +00:00
Dhruv Mark Collins	46aac5abaf	tu: Only emit preempt optimization ambles when active This avoids unnecessarily emitting the switch back/away ambles when they aren't actually used due to preemption optimization being disabled. This alleviates unnecessary overhead when not running with the mitigation for kernel drivers which support it. Signed-off-by: Dhruv Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40852>	2026-04-09 17:29:18 +00:00
Dhruv Mark Collins	18437c7a65	tu: Disable features using performance counter for KGSL KGSL doesn't support reading of performance counters by writing to the selector registers directly from a userspace CS, instead these requests need to be routed via the KGSL uAPI for perf counters. Certain Turnip features which use performance counters such as KHR_performance_query as well as preempt-optimize mode in autotune are now explicitly disabled to reflect this. Signed-off-by: Dhruv Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40852>	2026-04-09 17:29:18 +00:00
Samuel Pitoiset	63d55d84a3	radv: replace remaining occurrences of VK_ACCESS_xxx Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40845>	2026-04-09 15:58:06 +00:00
Samuel Pitoiset	f33757416b	radv/meta: remove an outdated comment in vkCmdClearAttachments() This is no longer true since "ae84d41d483 - radv/meta: Rework saving/restoring state". Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40845>	2026-04-09 15:58:06 +00:00
Mike Blumenkrantz	166c68914b	zink: use EXT_primitive_restart_index Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40784>	2026-04-09 15:37:29 +00:00
Roland Scheidegger	76cdbcb96a	llvmpipe: disable denorms in compute shaders on x86/sse For consistency with other shader stages (required by d3d, neither GL nor Vulkan really care). A bit awkward since we don't want to disable them for things like rusticl, which we should be able to distinguish with shader type. Note that to satisfy d3d requirements, disabling denorms in general is not sufficient, due to d3d requiring them to be disabled for single precision opcodes, but enabled for double precision ones, and x86 can't switch that individually (hence will need per-instruction tracking and switching inside the shader). Reviewed-by: Brian Paul <brian.paul@broadcom.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40787>	2026-04-09 15:10:00 +00:00
Roland Scheidegger	c20106f1e4	llvmpipe: don't rely on cpu denorms for float to smallfloat conversion Similar to what we already do for smallfloats to floats, handle denorms and normals separately with bit manipulation stuff rather than rely on a rescale mul which depends on cpu denorms. This is a bit more complex, but on the upside we don't need to track fpstate for denorms anymore in llvmpipe backend. (With modern x86 cpus this is essentially only really relevant for r11g11b10 float format, since f16 formats are using f16c instructions.) Reviewed-by: Brian Paul <brian.paul@broadcom.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40787>	2026-04-09 15:10:00 +00:00
Roland Scheidegger	2ba62d1502	llvmpipe: get rid of unused code in float to small float code We really don't want to use the slightly faster but slightly incorrect (wrt NaN preservation etc.) code. Reviewed-by: Brian Paul <brian.paul@broadcom.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40787>	2026-04-09 15:10:00 +00:00
Olle Lögdahl	c69da756d1	aco/isel: added test-case for iterative cf visitor isel.cf.deep_traversal is a new ACO test that verifies that the iterative nir cf visitor allows arbitrary depth. A depth of 10000 would cause a stack overflow on x86-64 linux (4096 kB stack) for the old recursive code. This test is by default not enabled. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40364>	2026-04-09 13:46:23 +00:00
Olle Lögdahl	aa49d69ea0	aco/isel: use iterative visitor during traversal When iterating control-flow recursively, we always run the risk of causing a stack overflow if the control-flow depth is too large. This patch resolves this by visiting control-flow nodes in an iterative way, managing an explicit stack on the heap. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40364>	2026-04-09 13:46:23 +00:00
Daniel Schürmann	37e2deab74	aco/isel: Remove if_context* parameter from begin_if() / end_if() helper functions We can transparently create the context inside the functions, now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40364>	2026-04-09 13:46:23 +00:00
Daniel Schürmann	53836320a9	aco/isel: Remove loop_context* parameter from begin_loop() / end_loop() helper functions We can transparently create the context inside the functions, now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40364>	2026-04-09 13:46:22 +00:00
Olle Lögdahl	5c1dea7ee4	aco/isel: move if_context and loop_context to heap if_context and loop_context are large structs and may cause stack overflows during CF traversal. This fix moves them to the heap. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40364>	2026-04-09 13:46:22 +00:00
Collabora's Gfx CI Team	909e0026d8	Uprev VVL to cb2acdf7f49053406770ae73cbb315229a9131eb `adfdda5b66...cb2acdf7f4` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40618>	2026-04-09 12:59:00 +00:00
Janne Grunau	564b061981	hk: Increase maxFragmentCombinedOutputResources to HK_MAX_DESCRIPTORS Backport-to: 26.0 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/15249 for hk Signed-off-by: Janne Grunau <j@jannau.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40859>	2026-04-09 12:35:40 +00:00
Icenowy Zheng	9b44def4e9	pvr: set has_usc_alu_roundingmode_rne for all B-series Rogue cores All B-series Rogue cores seem to have USC rounding mode as RTE instead of RTZ. Set the has_usc_alu_roundingmode_rne feature flag for them (currently only BXS-4-64 has it set). Verified via testing on BXM-4-64 (36.52.104.182) by fixing CTS tests dEQP-VK.spirv_assembly.instruction..float_controls.fp32.input_args. , and via proprietary driver vulkaninfo result on BXE-2-32 (36.29.52.182), BXE-4-32 (36.50.54.182) and BXM-4-64 (36.56.104.183) (checking shaderRoundingModeRT?Float32 properties). Fixes: `1db1038a61` ("pvr: add device info for BXM-4-64 (36.56.104.183)") Fixes: `e60e0c96ba` ("pvr: add device info for BXE-2-32 (36.29.52.182)") Fixes: `2743363a57` ("pvr: add device info for BXM-4-64 (36.52.104.182)") Fixes: `ea28791d40` ("pvr: add device info for BXE-4-32 (36.50.54.182)") Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40794>	2026-04-09 12:18:59 +00:00
Lars-Ivar Hesselberg Simonsen	affcc7fe54	pan/va/isa: Src for X16_TO* takes lane, not swizzle While the swizzle code was producing the correct encoding, the disassembly was slightly weird and swz_16 required an extra argument that was always "false". Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40865>	2026-04-09 11:23:36 +00:00
Christian Gmeiner	41b34334a6	panvk: Advertise VK_EXT_attachment_feedback_loop_dynamic_state The Vulkan runtime provides the dynamic state infrastructure via vk_common_CmdSetAttachmentFeedbackLoopEnableEXT(). This builds on the attachment feedback loop layout support. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40498>	2026-04-09 12:30:21 +02:00
Christian Gmeiner	a2d9d2b5f8	panvk: Advertise VK_EXT_attachment_feedback_loop_layout PanVK treats image layouts as no-ops and already disables Forward Pixel Kill when the same render target is both read and written. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40498>	2026-04-09 12:30:19 +02:00
Lars-Ivar Hesselberg Simonsen	6cdc3cc1d2	pan: Add support for 64 bit gpu_id While not currently required, it will be for future GPUs. Also cleans up gpu_id as parameter to some functions that didn't use it. Reviewed-by: Aksel Hjerpbakk <aksel.hjerpbakk@arm.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40610>	2026-04-09 09:49:20 +00:00
Lars-Ivar Hesselberg Simonsen	f181cc5bca	pan/model: Redo gpu_prod_id in the model The current implementation is a bit awkward and becomes tricky when adding support for 64 bit gpu_ids. Rather than keeping a mask of bits in gpu_id to compare with the stored gpu_prod_id value, rely on macro functions for fetching the information required from gpu_id and creating the comparison value. Reviewed-by: Aksel Hjerpbakk <aksel.hjerpbakk@arm.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40610>	2026-04-09 09:49:20 +00:00
Caius-Moldovan-img	37bdb2b792	pco: Move part of legalization after register allocation Signed-off-by: Caius Moldovan <caius.moldovan@imgtec.com> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40724>	2026-04-09 09:29:21 +00:00
Caius-Moldovan-img	75dc7ef8fd	pco: Add pseudo instruction fencing for DITR and DITRP Signed-off-by: Caius Moldovan <caius.moldovan@imgtec.com> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40724>	2026-04-09 09:29:21 +00:00
Caius-Moldovan-img	092106d742	pco: Move DITR and DITRP fencing from translation to legalization Signed-off-by: Caius Moldovan <caius.moldovan@imgtec.com> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40724>	2026-04-09 09:29:21 +00:00
David Rosca	cc78a5dd4b	radeonsi/video: Fix setting decode surface format for single plane formats multi_plane_format is only valid when num_planes > 1. Fixes: `26979becec` ("radeonsi/video: Add video decoder using ac_video_dec") Reviewed-by: Benjamin Cheng <benjamin.cheng@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40811>	2026-04-09 08:16:05 +00:00
Georg Lehmann	44a061a034	aco/spill: fix mixed lds+scratch spill/reload We shouldn't increment the scratch offset while accessing LDS. Fixes: `133ef9f94b` ("aco: spill VGPRs to LDS if it doesn't further limit occupancy") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40855>	2026-04-09 07:51:52 +00:00
Tapani Pälli	3ab9145393	intel/compiler: implement dummy mov for Wa_18035690555 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37804>	2026-04-09 07:30:01 +00:00
Tapani Pälli	4bb68d7474	intel/compiler: expose inferred_exec_pipe from scoreboarding Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37804>	2026-04-09 07:30:01 +00:00
Sagar Ghuge	2bf520340d	intel/compiler: Remove unused brw_nir_memclear_global helper This is dead code; we can remove it for now. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenz.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40801>	2026-04-09 05:06:05 +00:00
Silvio Vilerino	4cac78bb3c	mediafoundation: MFTRegisterWorkQueue/MFTUnregisterWorkQueue to validate null param instead of crash Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40860>	2026-04-08 14:48:54 -07:00
Silvio Vilerino	6d1b209d0c	d3d12: Check queues are registered before unregistering in unregister_work_queue Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40860>	2026-04-08 14:48:33 -07:00
Lars-Ivar Hesselberg Simonsen	1f0370616a	pan: Centralize preload registers Rather than having preload registers hardcoded over multiple files, gather them in one place with an enum abstraction. This should simplify updates to the preload registers. Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40643>	2026-04-08 20:30:32 +00:00
José Roberto de Souza	1e052f0bb5	intel/brw: Remove unused functions to get data port message type Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40832>	2026-04-08 17:44:52 +00:00
José Roberto de Souza	667a58ab38	anv: Use helper to get anv_address in emit_simple_shader_dispatch() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40832>	2026-04-08 17:44:52 +00:00
José Roberto de Souza	a69e02d97c	anv: Make use of anv_shader_get_scratch_surf() in genX_cmd_compute.c genX_cmd_compute.c has 2 places with code very similar to anv_shader_get_scratch_surf(), but we could not use this function without changing its parameters. Now it takes the shader stage and the total_scratch instead of anv_shader, because cmd_buffer_trace_rays() doesn't have a shader. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40832>	2026-04-08 17:44:52 +00:00
José Roberto de Souza	fd420e80e2	anv: Rename and share get_scratch_surf() with other files We will need to call get_scratch_surf() from other files, so here removing the static and adding it to anv_private.h. No changes in behavior expected here. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40832>	2026-04-08 17:44:51 +00:00
Samuel Pitoiset	5f1c22037b	radv: advertise VK_EXT_primitive_restart_index Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40775>	2026-04-08 17:19:48 +00:00
Samuel Pitoiset	c9b2bd385b	radv: implement VK_EXT_primitive_restart_index Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40775>	2026-04-08 17:19:48 +00:00
Samuel Pitoiset	8f33896822	radv: pre-compute the primitive restart index Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40775>	2026-04-08 17:19:48 +00:00
Kenneth Graunke	b391f2d888	anv: Use nir_lower_memory_model This replaces NIR_MEMORY_MAKE_{AVALIABLE,VISIBLE} with COHERENT. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40833>	2026-04-08 16:07:35 +00:00
Kenneth Graunke	0c0fea1cdb	nir: Increase tex opcode bits from 5 to 6 in nir_instr_set We are already at our limit of 31 texture opcodes, and cannot add any more without expanding the opcode hashing in nir_instr_set. Thankfully, it's at 29 bits, so adding one here is possible still. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40833>	2026-04-08 16:07:35 +00:00
Alyssa Rosenzweig	73701c305e	brw: wire up MACL New on Xe2, this instruction enables faster 32x32 integer multiply at the cost of extra accumulator usage. Add it to the opcode list for future use. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40833>	2026-04-08 16:07:35 +00:00
Rhys Perry	01516746eb	nir: use a u_dynarray for block predecessors A set is large and expensive to iterate. This is faster (overall fossilize-replay difference): Difference at 95.0% confidence -250 +/- 28.9257 -2.04849% +/- 0.235211% (Student's t, pooled s = 34.1626) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40242>	2026-04-08 15:06:34 +00:00

220885 commits
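
Note on aa49d69ea0 / c69da756d1 (aco/isel iterative visitor): those entries describe replacing recursive control-flow traversal with an explicit stack kept on the heap, so that deeply nested control flow (the test uses a depth of 10000) cannot overflow the call stack. The sketch below illustrates that general technique only. It is not ACO code: `CFNode`, `build_nested_ifs`, and `visit_iterative` are hypothetical names invented for this illustration, and Python is used purely for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class CFNode:
    """Hypothetical control-flow node: a block, an if, or a loop."""
    kind: str
    children: list = field(default_factory=list)


def build_nested_ifs(depth):
    """Wrap a single block in `depth` nested 'if' nodes, without recursion."""
    node = CFNode("block")
    for _ in range(depth):
        node = CFNode("if", [node])
    return node


def visit_iterative(root):
    """Visit every node in pre-order using an explicit, heap-allocated stack.

    Returns (number of nodes visited, deepest nesting level seen).
    """
    visited = 0
    max_depth = 0
    stack = [(root, 1)]  # pending work items: (node, nesting depth)
    while stack:
        node, depth = stack.pop()
        visited += 1
        max_depth = max(max_depth, depth)
        # Push children in reverse so they are popped in source order.
        for child in reversed(node.children):
            stack.append((child, depth + 1))
    return visited, max_depth


# Depth 10000 mirrors the figure quoted in the commit message.
visited, max_depth = visit_iterative(build_nested_ifs(10000))
```

A naive recursive visitor over the same tree would need one call frame per nesting level, which is the failure mode the commit messages describe for the old code.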