fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 22:08:10 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	b0f8c22682	nir/opt_sink: sink agx backfacing helps an elden ring shader: Totals from 1 (0.03% of 3206) affected shaders: Instrs: 4146 -> 4141 (-0.12%) CodeSize: 27374 -> 27334 (-0.15%) ALU: 2554 -> 2549 (-0.20%) FSCIB: 2554 -> 2549 (-0.20%) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35559>	2025-06-20 16:09:28 +00:00
Alyssa Rosenzweig	3eeba6efdd	nir/opt_preamble: hoist reorderable SSBO loads on AGX basically asahi version of `2f93137308` ("nir/opt_preamble: Handle load_global_ir3"). elden ring: Totals from 2409 (75.14% of 3206) affected shaders: MaxWaves: 2068416 -> 2081984 (+0.66%); split: +0.74%, -0.08% Instrs: 2439078 -> 1849792 (-24.16%) CodeSize: 15570886 -> 12196612 (-21.67%) Spills: 11623 -> 11678 (+0.47%); split: -0.63%, +1.10% Fills: 9815 -> 9762 (-0.54%); split: -1.37%, +0.83% Scratch: 31200 -> 31328 (+0.41%); split: -0.23%, +0.64% ALU: 1154256 -> 1038680 (-10.01%); split: -10.22%, +0.21% FSCIB: 1154256 -> 1038479 (-10.03%); split: -10.24%, +0.21% IC: 248318 -> 237344 (-4.42%); split: -4.47%, +0.05% GPRs: 266323 -> 254621 (-4.39%); split: -4.72%, +0.33% Uniforms: 639557 -> 794085 (+24.16%); split: -0.07%, +24.23% Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reacted-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35559>	2025-06-20 16:09:28 +00:00
Mary Guillemard	2c455c2d81	libcl: Add more UINT_MAX variants Needed by panvk's minmax CL code. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35639>	2025-06-20 10:11:52 +00:00
Emma Anholt	6808ccf23a	mesa: Retire the OptimizeForAOS code. It's been unused for years. Back on the brw driver, it was used for a ~1% win on a VS-heavy workload, because you could hide instruction latency using a DP4 sequence instead of MADs for doing the MVP transform. Given that r300 and i915 don't seem to want it, and nobody has successfully ported it for crocus (see https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14277), just garbage collect the code at this point. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35617>	2025-06-19 19:23:53 +00:00
Faith Ekstrand	9f9cde04ec	nir: Add a new load_input_attachment_coord intrinsic This hoists all the annoyance of figuring out the current pixel's input attachment coordinates to the driver. The pass still deals with all the annoyance of turning an image instruciton into a texture instruction but it gives the driver more control over the position. For most drivers, this will be something like ivec3(int(gl_FragCoord.xy), gl_Layer) or similar, some drivers need something more nuanced. Turnip, for instance, needs unscaled coordinates for some attachments and NVK doesn't really want gl_Layer or gl_ViewIndex for the layer. It's better to just have a new system value that drivers can make what they want. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35551>	2025-06-19 02:14:04 +00:00
Faith Ekstrand	2c13e1e655	nir/lower_input_attachments: Don't ignore tex coordinates The SPIR-V spec is pretty clear that coordinates on subpass attachments are relative to the current pixel. They're required to be zero but we should stay consistent with ourselves (we already do this for image intrinsics) and with the spec. Fixes: `84b08971fb` ("nir/lower_input_attachments: lower nir_texop_fragment_{mask}_fetch") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35551>	2025-06-19 02:14:04 +00:00
Faith Ekstrand	9a52b9372c	nir/lower_input_attachments: Stop assuming tex src indices There's nothing in NIR which guarantees that the deref is the first source or that the coordinate is the second. Use nir_tex_instr_src_index() to get the actual indices. Fixes: `84b08971fb` ("nir/lower_input_attachments: lower nir_texop_fragment_{mask}_fetch") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35551>	2025-06-19 02:14:03 +00:00
Emma Anholt	908bfb2ac9	nir: Add support for load_frag_coord_zw to nir_opt_fragdepth. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:36 +00:00
Emma Anholt	3b28604be2	nir: Make pixel_coord/frag_coord_zw be peephole-able sysvals. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:36 +00:00
Emma Anholt	8fa6d473e7	nir: Add SYSTEM_VALUE_FRAG_COORD_Z/W. Intel's going to use these. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:36 +00:00
Emma Anholt	7db62e6dad	nir: Split nir_load_frag_coord_zw to separate z/w intrinsics. This will be a win for Intel for tracking which payload values need to be set up. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:36 +00:00
Job Noorman	78f62d6d6d	nir: remove unused global_atomic(_swap)_ir3 intrinsics Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details ir3 switched to using the generic ones. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33797>	2025-06-18 19:06:33 +00:00
Job Noorman	2490ecf5fc	ir3: ingest global addresses as 64b values from NIR There are currently two places where we have to handle values that are logically 64b: 64b atomics and 64b global addresses. For the former, we ingest the values as 64b from NIR, while the latter uses 2x32b values. This commit makes things more consistent by using 64b NIR values for global addresses as well. Of course, we could also go the other way around and use 2x32b values everywhere, which would make things consistent as well. Given that ir3 doesn't actually have 64b registers, and 64b values are represented by collected 2x32b registers, this could actually make more sense. In the end, both methods are mostly equivalent and it probably doesn't matter too much one way or the other. However, the reason I have a slight preference for ingesting things as 64b is that it allows us to use more of the generic NIR intrinsics, which use 1-component values for 64b addresses or atomic values. This commit already makes global_atomic(_swap)_ir3 obsolete and I'm planning to create generic intrinsics to support ldg.a/stg.a as well. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33797>	2025-06-18 19:06:32 +00:00
Georg Lehmann	e9c886c331	nir/opt_intrinsic: fix inclusive scan rewrite with multiple uses Modifying the iterated list is a footgun, so just create a new instruction. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13364 Fixes: `5c70a55bf3` ("nir/opt_intrinsics: optimize (exclusive_scan(op, a) op a) to inclusive scan") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35577>	2025-06-18 18:18:15 +00:00
Karol Herbst	cf3b16f7af	clc: support cl_khr_extended_bit_ops Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35448>	2025-06-18 10:13:44 +00:00
Karol Herbst	1a5b5a883d	vtn: mark BitInstructions cap as supported It simply enables certain Shader only instructions for Kernels. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35448>	2025-06-18 10:13:44 +00:00
Rhys Perry	ea0670dfb5	nir: simplify nir_addition_might_overflow nir_unsigned_upper_bound is good enough that this isn't needed anymore. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35514>	2025-06-17 13:28:00 +00:00
Rhys Perry	f3b7ac730c	nir/uub: improve ior/ixor with constant sources No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35514>	2025-06-17 13:28:00 +00:00
Rhys Perry	ae6ad8977b	nir/uub: improve iand with constant sources fossil-db (navi21): Totals from 9 (0.01% of 79653) affected shaders: Instrs: 11878 -> 11868 (-0.08%) CodeSize: 61572 -> 61508 (-0.10%) Latency: 44585 -> 44581 (-0.01%); split: -0.02%, +0.01% InvThroughput: 9697 -> 9660 (-0.38%) VALU: 8889 -> 8876 (-0.15%) SALU: 1339 -> 1342 (+0.22%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35514>	2025-06-17 13:27:59 +00:00
Rhys Perry	8ee5440073	nir/uub: improve ishl/imul with constant sources fossil-db (navi21): Totals from 1 (0.00% of 79653) affected shaders: Instrs: 1339 -> 1338 (-0.07%) CodeSize: 7244 -> 7240 (-0.06%) Latency: 19827 -> 19822 (-0.03%) InvThroughput: 9913 -> 9911 (-0.02%) SALU: 419 -> 418 (-0.24%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35514>	2025-06-17 13:27:59 +00:00
Olivia Lee	88ac602cc2	panvk: implement shaderInputAttachmentArrayNonUniformIndexing Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35408>	2025-06-13 19:02:19 +00:00
Christian Gmeiner	b30b87c096	nir/inline_uniforms: Convert to use nir_shader_intrinsics_pass(..) Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35463>	2025-06-12 22:35:48 +02:00
Marek Olšák	fa2e7c3dfd	nir: return progress from nir_group_loads, nir_inline_uniforms so that NIR_PASS is usable Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:37 +00:00
Marek Olšák	0cbcb72869	nir/opt_vectorize_io: work around a 16-bit IO bug for RADV If nir_opt_vectorize_io isn't called, 16-bit IO is broken. This is a workaround to keep RADV working and consume incorrect NIR while other drivers consume correct NIR. Hopefully this will be removed ASAP. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:37 +00:00
Marek Olšák	6e9e9c9f0c	nir: add shader_info::prev_stage_has_xfb We will use it to skip lowering mediump IO to 16 bits if the previous shader has XFB and mediump XFB doesn't work. Drivers can decide not to lower mediump IO to 16 bits if nir->xfb_info != NULL for the pre-rast stage and if nir->info.prev_stage_has_xfb is true for the FS stage. That way, drivers can lower mediump to 16 bits for all cases except XFB. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:37 +00:00
Marek Olšák	ebbcec76b0	glsl/spirv: link XFB before prelink_lowering Hopefully this doesn't break it (we may even lack tests), but we need to know in prelink_lowering whether XFB is enabled or not. The next commit that adds shader_info::prev_stage_uses_xfb depends on this. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:37 +00:00
Marek Olšák	b636e5ca66	nir: add nir_clear_mediump_io_flag Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:36 +00:00
Marek Olšák	13005d5e4e	nir/xfb_info: don't merge incompatible XFB outputs to fix mediump Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:36 +00:00
Marek Olšák	118c0e6991	nir/opt_vectorize_io: fix vectorizing 16-bit XFB Tested with mediump. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:36 +00:00
Marek Olšák	caddd67b8c	nir/opt_vectorize_io: don't vectorize 16-bit IO to vec8 - it's illegal NIR represents low bits of 16-bit IO as a separate vec4, and high bits as another separate vec4. There is no representation that allows vec8. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:35 +00:00
Marek Olšák	1f80ff5550	nir/opt_vectorize_io: convert bool merge_low_high_16_to_32 to an enum refactoring for the next commit Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:35 +00:00
Marek Olšák	6270136b7d	nir/opt_varyings: set prev_stage/next_stage if they are NONE and validate them Doing it here ensures that any linked shader will have the correct values there. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:34 +00:00
Marek Olšák	e3d122ed7b	nir/opt_varyings: completely exclude mediump from type changes It broke mediump XFB, which needs the correct type for the up-conversion. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:34 +00:00
Marek Olšák	cf26760218	glsl: set prev/next_stage according to the new definition Keep MESA_SHADER_NONE if there is no previous/next shader. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:34 +00:00
Marek Olšák	aba7b0831c	nir: add shader_info::prev_stage When lowering mediump to 16 bits, this will allow drivers to enable the lowering only for certain pairs of stages, e.g. a driver can lower mediump for VS->FS, but not GS->FS. This could also be useful for other things. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35315>	2025-06-12 19:35:33 +00:00
Georg Lehmann	ad80b554f4	spirv: use feq for OpIsInf Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This effectively reverts `fcca6a83cd` because feq was clarified to be ordered when used with exact and without fast math flags. It's common for HW to only have free abs for floating point instructions. Foz-DB Navi21: Totals from 63 (0.08% of 80065) affected shaders: Instrs: 337027 -> 336667 (-0.11%); split: -0.12%, +0.02% CodeSize: 1846752 -> 1845000 (-0.09%); split: -0.13%, +0.03% Latency: 3401087 -> 3400633 (-0.01%); split: -0.04%, +0.03% InvThroughput: 847299 -> 845939 (-0.16%); split: -0.19%, +0.03% VClause: 7693 -> 7694 (+0.01%) Copies: 45175 -> 45240 (+0.14%); split: -0.12%, +0.27% PreSGPRs: 3555 -> 3553 (-0.06%) PreVGPRs: 4565 -> 4564 (-0.02%) VALU: 225473 -> 225245 (-0.10%); split: -0.13%, +0.03% SALU: 44735 -> 44625 (-0.25%) Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35437>	2025-06-11 18:34:21 +00:00
Rob Clark	cd4f6caa0d	vtn: Handle non-32b tex dests With cl_khr_fp16 we can get texture instructions w/ f16 dest. Not all drivers handle this, so convert to 32b dest and insert alu conversion to the requested type. Drivers that can handle f16 texture loads would fold away the extra conversion with nir_opt_16bit_tex_image. Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35470>	2025-06-11 17:48:10 +00:00
Karol Herbst	a482ec7f05	clc: fix DiagnosticOptions related build failure with llvm-21 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13257 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35399>	2025-06-10 13:16:29 +02:00
Karol Herbst	392ad203eb	clc: use new createTargetMachine overload with llvm-21 The old one is deprecated, so let's move and silence the warning. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35399>	2025-06-10 13:16:16 +02:00
Mel Henning	42ba492b88	compiler/rust/bitset: BitSetStream takes Key type This was an oversight when BitSet was parameterized on a key type. BitSetStream needs to also take a key type to prevent users from mixing different key types in binary operators. Constraining this makes BitSet usage more type safe. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35328>	2025-06-09 21:49:29 +00:00
Marek Olšák	f2c48652da	nir: add shader_info::tess::tcs_outputs_read_by_tes Gather no_varying for AMD. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	a59464b6e3	radv,radeonsi: precompute and pass TCS per-vertex output stride via a user SGPR It's a stride of 1 output, which isn't 16. It's 16 * num_threads, aligned to 256. tcs_offchip_layout has 5 unused bits, so let's use them. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	534b282573	ac/nir/tess: adjust memory layout of TCS outputs to have aligned store offsets There is a comment that explains it. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:38 +00:00
Mel Henning	5f0e4a7605	nak,nir: Stop using std::mem::zeroed() We can replace all of these with safe alternatives if we ask bindgen for implementations of Default. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35390>	2025-06-06 18:58:35 +00:00
Mel Henning	d15b5fadbb	nir/divergence_analysis: Update LCSSA comment Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35271>	2025-06-06 18:15:05 +00:00
Lionel Landwerlin	49def5ca9d	spirv: bump headers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35382>	2025-06-06 14:38:17 +00:00
Karol Herbst	33fb1eca3e	nir/scale_fdiv: handle fp16 fdiv Not strictly scaling, but we upcast fo fp32, do the fdiv there and cast back again. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34053>	2025-06-05 13:17:27 +00:00
Karol Herbst	aa5a981b83	vtn/opencl: support fp16 builtins If we can't find an appropiate builtin in the libclc library, we add our own wrapper at runtime executing the op in fp32 space. Libclc has variying support for fp16 opcodes and with a libclc prior llvm-19 it does not work as good as with the newer one. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34053>	2025-06-05 13:17:27 +00:00
Karol Herbst	ca01635075	clc: support fp16 spec constants Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34053>	2025-06-05 13:17:27 +00:00
Mike Blumenkrantz	208450fc57	nir/lower_to_scalar: fix opt_varying with output reads no_varying cannot be used to eliminate stores on locations which may be subsequently read Fixes: `0058989357` ("nir/lower_io_to_scalar: don't create output stores that have no effect") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35325>	2025-06-04 18:21:16 +00:00

1 2 3 4 5 ...

10611 commits