fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 23:58:10 +02:00

Author	SHA1	Message	Date
Connor Abbott	e234dcf62c	vtn: Fix vtn_mediump_downconvert_value() for transposed matrices We forgot to set the actual value. This meant that whenever we actually needed to use the transposed matrix we would immediately segfault. Cc: mesa-stable (cherry picked from commit `048d2a0c68`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40488>	2026-03-17 18:59:22 +01:00
Mike Blumenkrantz	c0a931e338	mesa/st: fix unlower_io_to_vars to work with mesh shaders cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/15034 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/15040 Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `3dbb7e896d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40488>	2026-03-17 13:11:18 +01:00
Mike Blumenkrantz	3dea1bd33d	nir: fix nir_is_io_compact for mesh shaders cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `e604a8f617`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40488>	2026-03-17 13:11:15 +01:00
Faith Ekstrand	9c2b19219a	nir/lower_bool_to_bitsize: Make all bN_csel sources match Previously, we assumed that the selector for bcsel could be whatever, regardless of the bit sizes of the data and we'd just fix it in the back-end. This works okay for scalars but falls over the moment we vectorize because all our vector handling assumes bit sizes match. Since matching bit sizes is what the hardware wants anyway, it's better to do the right thing in NIR and hope copy-propagation can fold in conversions if needed. Unfortunately, copy prop isn't that smart yet so this does hurt a bit: Instrs: 1193679 -> 1198086 (+0.37%); split: -0.06%, +0.43% CodeSize: 11915136 -> 11950592 (+0.30%); split: -0.05%, +0.34% Full: 160985 -> 160941 (-0.03%); split: -0.04%, +0.01% Estimated normalized CVT cycles: 4456.938557000181 -> 4480.876069000186 (+0.54%); split: -0.13%, +0.67% Estimated normalized SFU cycles: 6350.9375 -> 6392.21875 (+0.65%) Estimated normalized Load/Store cycles: 205773.0 -> 205795.0 (+0.01%) Maximum number of threads: 12864 -> 12863 (-0.01%) Number of spill instructions: 22487 -> 22489 (+0.01%) Number of fill instructions: 52179 -> 52219 (+0.08%) Hurt shaders: google-meet-clvk/BgBlur google-meet-clvk/Relight parallel-rdp/small_subgroup parallel-rdp/small_uber_subgroup The proper solution here is to teach copy-prop about this stuff so that it can propagate swizzles into ALU ops when they're supported: https://gitlab.freedesktop.org/panfrost/mesa/-/issues/265 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14945 Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> (cherry picked from commit `3fd471dca5`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:12 +01:00
Georg Lehmann	8f6c3dcc90	nir/opt_algebraic: fix frsq clamp pattern This is not NaN correct. And also make the pattern 32bit only because the constant is hard coded FLT_MAX. Fixes: `780b5c1037` ("nir/algebraic: Simplify some Inf and NaN avoidance code") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `ab773fc5d4`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Caio Oliveira	8355670805	nir: Fix constant folding for iadd_sat Use INT_MIN instead of INT_MAX for underflow. Fixes: `cc4b50b023` ("nir/opcodes: use u_overflow to fix incorrect checks") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pelloux@gmail.com> (cherry picked from commit `da57fbfb07`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:11 +01:00
Caio Oliveira	b2a34da82f	spirv: Fix spec constant to handle Select for non-native floats There was an assumption that if the instruction had non-native float as a source, the first source would have such type. This doesn't hold for Select, and the code failed in two ways - The boolean source of Select was being converted to the non-native float type. - The loop that resolves the bit-size for unsized operands would trip at `assert(i == 0)` because Select has more than one source. Re-organize the code to track the types of the sources independently, and fix both issues above. Fixes: `90e1b12890` ("spirv: Add bfloat16 support to SpecConstantOp") Fixes: `51d3c4c889` ("spirv: support float8 spec constant op") Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> (cherry picked from commit `6affcb43a7`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Caio Oliveira	4588b025c8	spirv: Pull constant source fixup to the existing loop Backport-to: 26.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> (cherry picked from commit `b0c3b20bff`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Caio Oliveira	0775d0f1b5	spirv: Refactor ALU opcode translation to take bit sizes Only used by Convert operations, so just pass 0 from callers that are not Convert and clarify that in the code. Backport-to: 26.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> (cherry picked from commit `1c3c987d5c`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Timothy Arceri	a66a9280fb	glsl: add workaround for MDK2 HD Allows a shader to compile that uses an embedded struct declaration which are not allowed in glsl 1.20+ Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14986 (cherry picked from commit `f109bfc3f1`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Rhys Perry	1d66a995ce	nir/range_analysis: set deleted key If (uintptr_t)&deleted_key is small enough, inserting entries into the hash table might not work correctly. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 26.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `c0079e09ca`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:10 +01:00
Karol Herbst	d29063d4f2	nir: fix nir_round_int_to_float for fp16 fp16 has quite the limited value range and with bigger integers nir_round_int_to_float might return Inf where it shouldn't depending on the rounding mode. Fixes conversions half_rt[npz]_(u)?(int\|long) CL CTS tests. Cc: mesa-stable Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Rob Clark <rob.clark@oss.qualcomm.com> (cherry picked from commit `e1ed7de274`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Karol Herbst	3d8ff40d58	nir: fix nir_alu_type_range_contains_type_range for fp16 to int The special value "Inf" doesn't fit into an int and therefore we have to clamp regardless of whether all the other values would fit. And because f2u32 and f2u64 define out-of-range conversions as UB in nir, we need to clamp. This change should have no effect for non saturating conversions. Fixes "conversions long_sat_*half" CL CTS tests Cc: mesa-stable Suggested-by: Rob Clark <rob.clark@oss.qualcomm.com> Reviewed-by: Rob Clark <rob.clark@oss.qualcomm.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> (cherry picked from commit `8e8fb2ebaa`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Timothy Arceri	734e53c96b	glsl: relax precision matching on unused uniforms ES `0886be09` ("glsl: Allow precision mismatch on dead data with GLSL ES 1.00") allowed precision mismatches on uniforms, however if you lower precision on 16-bit consts, then this error triggers instead. So here we relax the type matching and just make sure we match int vs float. Fixes: `0886be09` ("glsl: Allow precision mismatch on dead data with GLSL ES 1.00") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5337 Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `73bc604128`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40359>	2026-03-11 23:21:09 +01:00
Faith Ekstrand	3a92074d8c	nir/gather_info: Add support for panfrost tile load/store intrinsics Fixes: `6fc1030e4f` ("nir: Add some new panfrost fragment shader intrinsics") Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> (cherry picked from commit `88ad8bc75d`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:23 +01:00
Ian Romanick	45ce75f3bc	nir: Use STACK_ARRAY instead of NIR_VLA The number of fields comes from the shader, so it could be a value large enough that using alloca would be problematic. Fixes: `c11833ab24` ("nir,spirv: Rework function calls") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `9017d37e84`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Ian Romanick	978fd42b4b	spirv: Use STACK_ARRAY instead of NIR_VLA The number of fields comes from the shader, so it could be a value large enough that using alloca would be problematic. Fixes: `2a023f30a6` ("nir/spirv: Add basic support for types") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (cherry picked from commit `3da828d2dd`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Daniel Schürmann	7fda785505	nir/clone: Fix cloning indirect call instructions Fixes: `bb40284f76` ('nir: Add indirect calls') (cherry picked from commit `88b4221519`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40092>	2026-02-25 14:22:20 +01:00
Karol Herbst	acdbdcc53b	vtn: set default fp_math_ctrl values for kernels The kernel capabilty has the `FPFastMathMode` decoration, but not the `FPFastMathDefault` execution mode, so a SPIR-V module not using `SPV_KHR_float_controls2` has no way of setting any defaults. Fixes: `9da2d21804` ("vtn: implement default fp_math_ctrl without using execution mode") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Tested-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (cherry picked from commit `faf3a93e8f`) [Eric: adjusted commit because of missing `46a617884e`, as suggested by the author at https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39790#note_3325830] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Karol Herbst	dc8a39037b	vtn/opencl: flush denorms for cbrt() libclc doesn't so we have to. fixes math_brutefore cbrt on Iris. Co-authored-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> (cherry picked from commit `af954427bf`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Faith Ekstrand	6eded1a7d0	nir/lower_bool_to_bit_size: Use the correct num_components for conversions There's a nice little comment here saying we use the same write mask (an out of date term in NIR) and swizzle but we're no longer actually doing that. Depending on nir_builder magic, we may actually generate a scalar when we really want a vector. The fix is to use more builder helpers and just eat the potential copy. Fixes: `3180656bbc` ("nir: don't use nir_build_alu() with incomplete sources") Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> (cherry picked from commit `711b3358a8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:48 +00:00
Alyssa Rosenzweig	cf716d4586	nir: disable fast-math for lowering conversions the lowerings for e.g. f2f16_rtp have carefully written sequences using Infinity. nir_opt_algebraic will stomp right through this. `feq x, inf` without an exact flag is basically always a bug. Disable fast math here. Fixes OpenCL CTS test_half on Iris. Cc: mesa-stable Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> (cherry picked from commit `91550d0709`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:47 +00:00
Iago Toral Quiroga	6e899b3eba	nir/opt_vectorize_load_store: allow sizes unaligned with high offset for loads This was added specifically for vectorized stores, so allow for loads. Without this, the pass will fail to vectorize 2 consecutive 16-bit loads into a single 32-bit load. Fixes: `2ed79f80ba` ("nir/load_store_vectorize: Skip new bit-sizes that are unaligned with high_offset") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> (cherry picked from commit `f6a2d14008`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:47 +00:00
Vinson Lee	a871c42e39	compiler/clc: Fix const correctness in libclc_add_generic_variants Fix compiler error: ../src/compiler/clc/nir_load_libclc.c:266:13: error: initializing 'char ' with an expression of type 'const char ' discards qualifiers [-Werror,-Wincompatible-pointer-types-discards-qualifiers] 266 \| char U3AS1 = strstr(func->name, "U3AS1"); \| ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~ glibc now provides C23-style type-generic string functions. strstr returns const char when passed a const char * argument. Update U3AS1 declaration to const since it's only used for offset calculation. Fixes: `4a08ee7ecf` ("spirv/libclc: Add generic versions of arithmetic functions") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Karol Herbst <kherbst@redhat.com> (cherry picked from commit `85fd63068e`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39828>	2026-02-11 14:54:47 +00:00
Karol Herbst	79f909808c	clc: fix compile compatability with LLVM-22 See `d090311aa7` Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39374> (cherry picked from commit `dc03f94e07`)	2026-02-04 18:39:34 +01:00
Karol Herbst	ca428e3b3c	nir: fix nir_fixup_is_exported for LLVM-22 Starting with LLVM-22 we won't see the kernel wrapper anymore, and this is a trivial fix to get around this. See: `5458eb2511` Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39374> (cherry picked from commit `24d20df3d6`)	2026-02-04 18:39:34 +01:00
Karol Herbst	84566763c2	clc: enable generic address space and seq_cst and device scope atomic features This is going to be required with LLVM-22. See `423bdb2bf2` Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39374> (cherry picked from commit `6eda573a8a`)	2026-02-04 18:39:33 +01:00
Karol Herbst	05c679d37b	clc: support some atomic and generic address space features Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39374> (cherry picked from commit `01e1392139`)	2026-02-04 18:39:33 +01:00
Karol Herbst	c6f8d2ef92	clc: reorder headers to fix compilation errors due to UNUSED Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39374> (cherry picked from commit `7f9a7ed553`)	2026-02-04 18:39:33 +01:00
Georg Lehmann	1f5f2cc952	nir/opt_algebraic: use correct syntax to create exact fsat Fixes: `3b06824e4c` ("nir/opt_algebraic: optimize some post peephole select patterns") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39586> (cherry picked from commit `d8ef28671d`)	2026-02-04 18:39:33 +01:00
Iván Briano	cb8a069e24	brw: fix local_invocation_index with quad derivaties on mesh/task shaders For mesh/task shaders, the thread payload provides a local invocation index, but it's always linear so it doesn't give the correct value when quad derivatives are in use. The lowering pass where all of this is done correctly for compute shaders assumes load_local_invocation_index will be lowered in the backend for mesh/task, calculates the values for the quads correctly but then avoid replacing the original intrinsic and we remain with the wrong results. Add an intel specific intrinsic and always lower the generic one to that (or whatever else was calculated) to avoid ambiguities and fix the value for quad derivatives. Fixes future CTS tests using mesh/task shaders under: dEQP-VK.spirv_assembly.instruction.compute.compute_shader_derivatives.* Fixes: `d89bfb1ff7` ("intel/brw: Reorganize lowering of LocalID/Index to handle Mesh/Task") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39276> (cherry picked from commit `5b48805b42`)	2026-01-28 16:17:59 +01:00
Eric Engestrom	e68f96eb1f	nir/meson: fix cpp_args of nir_opt_algebraic_pattern_tests Fixes: `4c30c44b75` ("nir: Generate unit tests for nir_opt_algebraic") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39550> (cherry picked from commit `d12e3454e6`)	2026-01-28 16:17:59 +01:00
Lionel Landwerlin	a19e949824	brw: move coarse_z computation to NIR So that we can print it easily with debug printfs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	98194dfa0b	nir: add intrinsics for Z calculation in shaders with FSR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	12be2a580c	nir/compiler_options: add nir_load_pixel_coord And use it for nir_printf_fmt_at_px(). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:50 +00:00
Daniel Schürmann	89b9fcb5e7	nir/opt_load_store_vectorize: delay aliasing test in try_vectorize_shared2() Checking for aliasing can be very expensive. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38659>	2026-01-21 14:20:06 +00:00
Daniel Schürmann	598928d7e7	nir/loop_analyze: determine whether all control flow gets eliminated upon loop unrolling Totals from 17 (0.02% of 79839) affected shaders: (Navi48) MaxWaves: 241 -> 243 (+0.83%); split: +5.81%, -4.98% Instrs: 44198 -> 43786 (-0.93%); split: -8.19%, +7.26% CodeSize: 230284 -> 226900 (-1.47%); split: -10.55%, +9.08% VGPRs: 2152 -> 2524 (+17.29%); split: -3.90%, +21.19% Scratch: 718848 -> 0 (-inf%) Latency: 128977 -> 145720 (+12.98%); split: -2.12%, +15.10% InvThroughput: 206804 -> 254250 (+22.94%); split: -0.32%, +23.27% VClause: 1296 -> 1309 (+1.00%); split: -28.09%, +29.09% SClause: 835 -> 833 (-0.24%) Copies: 6284 -> 3630 (-42.23%); split: -44.51%, +2.28% Branches: 1003 -> 961 (-4.19%) PreSGPRs: 1003 -> 996 (-0.70%); split: -1.20%, +0.50% PreVGPRs: 1510 -> 2130 (+41.06%) VALU: 23577 -> 24309 (+3.10%); split: -6.26%, +9.37% SALU: 5875 -> 5688 (-3.18%); split: -6.26%, +3.08% VMEM: 3679 -> 3001 (-18.43%); split: -33.27%, +14.84% SMEM: 1632 -> 1631 (-0.06%) VOPD: 23 -> 24 (+4.35%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38659>	2026-01-21 14:20:06 +00:00
Daniel Schürmann	4997d8fb1b	nir/loop_analyze: determine for all ALU whether it can be constant-folded Totals from 16 (0.02% of 79839) affected shaders: (Navi48) MaxWaves: 512 -> 464 (-9.38%) Instrs: 11821 -> 17205 (+45.55%) CodeSize: 60536 -> 86644 (+43.13%) VGPRs: 732 -> 804 (+9.84%) Latency: 68411 -> 39349 (-42.48%) InvThroughput: 14217 -> 9306 (-34.54%) VClause: 223 -> 302 (+35.43%) SClause: 262 -> 317 (+20.99%) Copies: 961 -> 696 (-27.58%); split: -39.23%, +11.65% Branches: 182 -> 158 (-13.19%); split: -29.67%, +16.48% PreSGPRs: 1210 -> 945 (-21.90%); split: -29.42%, +7.52% PreVGPRs: 647 -> 633 (-2.16%) VALU: 5112 -> 10857 (+112.38%) SALU: 3215 -> 2335 (-27.37%); split: -30.67%, +3.30% VMEM: 228 -> 349 (+53.07%) SMEM: 567 -> 549 (-3.17%); split: -3.70%, +0.53% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38659>	2026-01-21 14:20:06 +00:00
Natalie Vock	30f6eacfad	radv/rt: Call ahit/isec shaders Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39314>	2026-01-20 21:49:55 +00:00
Icenowy Zheng	b61dbc98fd	nir/algebraic: fix Python-3.10-incompatible syntax Using a string literal enclosed with the same type of quotation marks with the outer f-string isn't supported on Python 3.10, which is currently still with security maintainance. This leads to syntax error when building Mesa with Python 3.10. Fix this by alternating these string literals' quotation mark to '' (as the outer f-string uses ""). Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14673 Reviewed-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39372>	2026-01-20 11:14:41 +00:00
Faith Ekstrand	2313bec66e	nir: Expose the guts of nir_lower_blend as builder helpers Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:14 +00:00
Faith Ekstrand	d2c2d798f8	nir/lower_blend: Optimize trivial logic op cases There's no point in going to/from UNORM if we're just going to copy or throw away the source. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:14 +00:00
Faith Ekstrand	68d22b5a2a	nir/lower_blend: Move the format to nir_lower_blend_rt Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:14 +00:00
Faith Ekstrand	d6556a580f	nir,pan: Add and implement a new store_tile_pan intrinsic Like we just did with load_tile_pan, this maps directly to ST_TILE in the hardware. This is more versatile and lets us do more of our lowering in NIR. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:13 +00:00
Faith Ekstrand	11b6cd2f2c	nir,pan: Rework the pafrost tile load intrinsic Instead of making it explicitly about outputs, this switchies it to being a NIR version of LD_TILE. It means we have to do a bit of work in NIR and add a builder helper but the end result is something much more versatile. Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:13 +00:00
Faith Ekstrand	4189865347	nir: panfrost tile loads are always divergent Each lane refers to a different pixel. Cc: mesa-stable Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:13 +00:00
Georg Lehmann	70a951c3f3	nir/search: allow inexact patterns if denorms have to be flushed Patterns should ensure that they flush denorms with fcanonicalize. Removing in between denorm flushing when fusing operations is explicitly allowed unless those optimizations are generally disallowed by other floating point math control flags. Foz-DB Navi21: Totals from 291 (0.35% of 82377) affected shaders: Instrs: 138347 -> 137773 (-0.41%) CodeSize: 751460 -> 748516 (-0.39%) Latency: 1686466 -> 1686226 (-0.01%); split: -0.02%, +0.01% InvThroughput: 270847 -> 269963 (-0.33%) VClause: 2023 -> 2022 (-0.05%) SClause: 5271 -> 5260 (-0.21%); split: -0.25%, +0.04% Copies: 8929 -> 8912 (-0.19%) VALU: 87108 -> 86552 (-0.64%) SALU: 23460 -> 23443 (-0.07%) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:29 +00:00
Georg Lehmann	442daeb54a	nir/opt_algebraic: use fcanonicalize Mostly optimizations, some minor fixes but I don't think they are worth backporting. Foz-DB Navi21: Totals from 7570 (9.21% of 82151) affected shaders: MaxWaves: 204288 -> 204476 (+0.09%); split: +0.09%, -0.00% Instrs: 4511439 -> 4500261 (-0.25%); split: -0.25%, +0.00% CodeSize: 23727088 -> 23644388 (-0.35%); split: -0.35%, +0.00% VGPRs: 290944 -> 290616 (-0.11%); split: -0.12%, +0.01% SpillSGPRs: 1256 -> 1251 (-0.40%) Latency: 16738072 -> 16726717 (-0.07%); split: -0.10%, +0.04% InvThroughput: 3736856 -> 3716631 (-0.54%); split: -0.55%, +0.01% VClause: 66150 -> 66156 (+0.01%); split: -0.05%, +0.06% SClause: 93644 -> 93631 (-0.01%); split: -0.02%, +0.01% Copies: 448816 -> 458584 (+2.18%); split: -0.05%, +2.22% Branches: 139817 -> 139775 (-0.03%); split: -0.03%, +0.00% PreSGPRs: 321922 -> 321900 (-0.01%); split: -0.01%, +0.00% PreVGPRs: 239709 -> 238856 (-0.36%); split: -0.39%, +0.03% VALU: 2595164 -> 2584250 (-0.42%); split: -0.43%, +0.01% SALU: 839038 -> 838965 (-0.01%); split: -0.02%, +0.01% VMEM: 137584 -> 137583 (-0.00%) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:29 +00:00
Rhys Perry	625afb0d29	nir: add fcanonicalize v2(Georg Lehmann): Always remove fcanonicalize if denorms must be neither flushed nor preserved. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:29 +00:00
Georg Lehmann	43d998df84	nir: document that both input and output denorms have to be flushed This allows us to remove a * 1.0 or a - 0.0 if is_only_used_as_float. We already rely on that. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:28 +00:00
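
Two of the fixes listed above are small enough to sketch outside of Mesa. The block below is a minimal illustration and is not taken from the Mesa tree: `iadd_sat` and `describe` are hypothetical helpers written for this note. The first mirrors the rule stated in `8355670805` (a saturating signed add clamps to INT_MIN on underflow, not INT_MAX), and the second shows the quoting style described in `b61dbc98fd`, which parses on Python 3.10.

```python
# Hypothetical helpers for illustration only; none of this is Mesa code.

def iadd_sat(a: int, b: int, bits: int = 32) -> int:
    """Saturating signed add for a two's-complement integer of `bits` bits."""
    int_min = -(1 << (bits - 1))
    int_max = (1 << (bits - 1)) - 1
    total = a + b  # Python ints do not wrap, so the exact sum is available
    if total > int_max:
        return int_max  # overflow clamps to the largest representable value
    if total < int_min:
        return int_min  # underflow clamps to the smallest value, not int_max
    return total


def describe(op: dict) -> str:
    """Format an opcode description with quoting that Python 3.10 accepts."""
    # The inner literals use '' because the outer f-string uses "".
    # Reusing "" inside the braces is only accepted from Python 3.12 onward.
    return f"{op['name']}@{op['bits']}"


if __name__ == "__main__":
    print(iadd_sat(-2**31, -1))     # -2147483648
    print(iadd_sat(2**31 - 1, 1))   # 2147483647
    print(describe({"name": "iadd_sat", "bits": 32}))  # iadd_sat@32
```

Computing the exact sum first and clamping afterwards keeps the two saturation directions visibly separate, which is the distinction the constant-folding fix is about.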

11584 commits