fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-02-24 00:50:30 +01:00

Author	SHA1	Message	Date
Marek Olšák	f2c48652da	nir: add shader_info::tess::tcs_outputs_read_by_tes Gather no_varying for AMD. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	a59464b6e3	radv,radeonsi: precompute and pass TCS per-vertex output stride via a user SGPR It's a stride of 1 output, which isn't 16. It's 16 * num_threads, aligned to 256. tcs_offchip_layout has 5 unused bits, so let's use them. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	534b282573	ac/nir/tess: adjust memory layout of TCS outputs to have aligned store offsets There is a comment that explains it. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:38 +00:00
Mel Henning	d15b5fadbb	nir/divergence_analysis: Update LCSSA comment Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35271>	2025-06-06 18:15:05 +00:00
Karol Herbst	33fb1eca3e	nir/scale_fdiv: handle fp16 fdiv Not strictly scaling, but we upcast to fp32, do the fdiv there and cast back again. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34053>	2025-06-05 13:17:27 +00:00
Mike Blumenkrantz	208450fc57	nir/lower_to_scalar: fix opt_varying with output reads no_varying cannot be used to eliminate stores on locations which may be subsequently read Fixes: `0058989357` ("nir/lower_io_to_scalar: don't create output stores that have no effect") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35325>	2025-06-04 18:21:16 +00:00
Marek Olšák	c3034fa82c	amd: replace most u_bit_consecutive* with BITFIELD_MASK/RANGE Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35346>	2025-06-04 17:46:38 +00:00
Lionel Landwerlin	978933c015	nir/opt_algebraic: extend lowering for (i|u)bitfield_extract Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35334>	2025-06-04 16:28:39 +00:00
Georg Lehmann	1c4070f3e9	nir/opt_if: limit rewrite_uniform_uses iand recursion https://github.com/doitsujin/dxvk/issues/4970 has a shader where unrolled loops caused large iand chains and if we don't limit this we won't finish compiling in reasonable time. Cc: mesa-stable Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35312>	2025-06-04 10:49:05 +00:00
Georg Lehmann	eaeaf9554d	nir/opt_if: don't replace constant uses with other uniform values If constant folding wasn't run, this could replace constant uses with different constants. Additionally, it could create worse code for "if (subgroupXor(1) == 1)". Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13281 Cc: mesa-stable Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35312>	2025-06-04 10:49:05 +00:00
Samuel Pitoiset	226b0e28db	nir: generalize bitfield insert/extract sizes Original patch from Alyssa Rosenzweig Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35209>	2025-06-04 09:37:53 +00:00
Caio Oliveira	542836afe5	intel: Don't require dpas_intel src2 to match destination With upcoming configurations, the number of elements in the src2 slice might not match the destination. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35301>	2025-06-03 21:31:23 +00:00
Rhys Perry	dd45bf5bce	nir/load_store_vectorize: stabilize entry sort I think this was unlikely to cause issues, even if the qsort() implementation is unstable. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35255>	2025-06-03 09:45:01 +00:00
Rhys Perry	397920c16e	nir: fix left shift of negative value in ibfe constant folding Fixes "left shift of negative value -128" with parallel_rdp/00f93a9497dfbb3b and UBSan. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35255>	2025-06-03 09:45:01 +00:00
Rhys Perry	78aae4b1ba	nir: fix signed overflow in pack_half_2x16 constant folding Without this cast, the left shift is promoted to 'int'. Fixes "left shift of 50432 by 16 places cannot be represented in type 'int'" with horizon_zero_dawn/001064f580f8e3be and UBSan. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35255>	2025-06-03 09:45:01 +00:00
Rhys Perry	6852538ba0	nir: fix unpack_unorm_2x16/unpack_snorm_2x16 constant folding Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 25.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35255>	2025-06-03 09:45:01 +00:00
Marek Olšák	bf2ed20eb9	nir: remove unused nir_io_semantics::invariant Acked-by: Alyssa on IRC Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	44fcda9631	nir/opt_clip_cull_const: support GS Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	6677d087c0	nir/xfb_info: add new fields to describe 16-bit XFB better for drivers that need this information Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	7b70b419b5	nir: always index SSA defs before printing This makes the output more readable. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	cf94ae8544	nir: change the type of shader_info::patch_* fields to 32 bits Patch outputs only use 32 bits. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Jesse Natalie	f0dde6ca7f	nir_gather_output_deps: Fix incorrect enum in switch Cc: mesa-stable Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35247>	2025-05-30 17:04:18 +00:00
Lionel Landwerlin	f0e18c475b	intel: remove GRL/intel-clc Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35227>	2025-05-29 20:17:13 +00:00
Samuel Pitoiset	cecf6675be	nir/lower_int64: add bitfield_extract lowering This will be used by RADV for ACO/LLVM. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35187>	2025-05-29 08:45:40 +02:00
Alyssa Rosenzweig	d696b19dd0	nir/lower_int64: add bitfield_reverse lowering now that we can represent 64-bit bitfield_reverse in NIR, we need a lowering for it as well. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35198>	2025-05-28 16:29:30 +00:00
Alyssa Rosenzweig	c3fb0645d8	nir/lower_alu: compact bitcount lowering while in the area. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35198>	2025-05-28 16:29:30 +00:00
Alyssa Rosenzweig	759dc70bde	nir: generalize bitfield_reverse bit size No reason we can't reverse other bit sizes, we just need to generalize the constant folding & bit size lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35198>	2025-05-28 16:29:30 +00:00
Marek Olšák	35c76bc7f7	nir/tcs_info: use range analysis to determine the range of tess levels Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35195>	2025-05-28 06:46:56 +00:00
Marek Olšák	24c3f30e4a	nir/tcs_info: gather which patch outputs are only read/written by invoc 0 Tested thoroughly by a shader test. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35195>	2025-05-28 06:46:56 +00:00
Marek Olšák	a3632d7d88	nir/tcs_info: gather for all patch outputs whether they're written by all invocs This substantially rewrites the pass. It also makes it easier to read. Tested thoroughly by a shader test. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35195>	2025-05-28 06:46:56 +00:00
Lorenzo Rossi	2c0d0bad01	nak: Remove unused intrinsic image_load_raw_nv Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34975>	2025-05-28 01:47:19 +00:00
Lorenzo Rossi	5fbcdd6e32	nir,nak: Add NV-specific image intrinsics Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34975>	2025-05-28 01:47:19 +00:00
Lorenzo Rossi	47f6c74b71	nir,nak: Add KeplerB shared atomics intrinsics and lowering Kepler cards do not support shared atomic operations directly, but they have special ldslk and stsul that can implement mutex locks on addresses. Shared atomics can be lowered into operations in mutexes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35028>	2025-05-26 16:29:05 +00:00
Qiang Yu	6f2a1e19da	nir/opt_varyings: fix mesh shader miss promote varying to flat We still allow mesh shaders to promote constant outputs to flat, but a mesh shader, like a geometry shader, may store varyings for multiple vertices in a single thread. So a mesh shader may store different constant values to different vertices in a single thread, and we should not promote this case to flat. I'm not using shader_info.mesh.ms_cross_invocation_output_access because OpenGL does not require IO to have explicit locations, so when nir_shader_gather_info is called in the OpenGL GLSL compiler to compute ms_cross_invocation_output_access, some implicit outputs have location -1, which leaves ms_cross_invocation_output_access unset for them. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13134 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35081>	2025-05-26 02:07:50 +00:00
Eric Engestrom	162f1f5566	delete xa leftovers Fixes: `3be2c47db2` ("delete the XA frontend") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35136>	2025-05-23 18:54:04 +00:00
Mike Blumenkrantz	00aaef9f12	delete gallium-nine farewell, old friend Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Axel Davy <davyaxel0@gmail.com> Acked-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34887>	2025-05-23 13:43:37 -04:00
Karol Herbst	bc444f6d26	nir: fix use-after-free on function parameter names Fixes: `3da8444be5` ("nir: add names to function parameters") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35098>	2025-05-22 22:59:08 +00:00
Patrick Lerda	1186c73c6b	r600: implement gs indirect load_per_vertex_input This functionality is useful with the software fp64 implementation. It allows running the remaining tests. Note: the same tests do not generate this indirect access on cayman which has the hardware fp64 implementation enabled. This change was tested on cypress, palm and barts. Here are the tests fixed: spec/arb_gpu_shader_fp64/execution/gs-isnan-dvec: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-array-copy: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-dmat4: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-dmat4-row-major: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-double-array-const-index: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-double-array-variable-index: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-double-bool-double: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-double-uniform-array-direct-indirect: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-doubles-float-mixed: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-dvec4-uniform-array-direct-indirect: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-nested-struct: fail pass Signed-off-by: Patrick Lerda <patrick9876@free.fr> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34926>	2025-05-19 12:07:37 +00:00
Ian Romanick	37ee91679a	nir/algebraic: Generalize an existing bfi(a, 0, ...) pattern No shader-db changes on any Intel platform. fossil-db: All Intel platforms had similar results. (Lunar Lake shown) Totals: Instrs: 210561118 -> 210560921 (-0.00%) Send messages: 10979615 -> 10979613 (-0.00%) Cycle count: 31576352808 -> 31576347218 (-0.00%); split: -0.00%, +0.00% Max live registers: 66068161 -> 66068157 (-0.00%) Non SSA regs after NIR: 60230775 -> 60230949 (+0.00%) Totals from 180 (0.03% of 707082) affected shaders: Instrs: 68035 -> 67838 (-0.29%) Send messages: 3190 -> 3188 (-0.06%) Cycle count: 3979496 -> 3973906 (-0.14%); split: -0.14%, +0.00% Max live registers: 11812 -> 11808 (-0.03%) Non SSA regs after NIR: 18878 -> 19052 (+0.92%) Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34905>	2025-05-16 14:49:25 -07:00
Ian Romanick	464955bbdd	nir/algebraic: Optimize some open-coded extract_i8 These were initially observed in Hogwarts Legacy while working on something else entirely. Two compute shaders in that app are helped for spills and fills. On Skylake, one of the shaders benefits from this change, and the other is hurt pretty significantly. About 40 vertex shaders in Shadow of the Tomb Raider were helped for instructions. v2: Use ~0xff instead of 0xffffff00 to ensure the patterns will work properly with all bit sizes. Noticed by Georg. v3: No, really, fix the various errors to ensure the patterns will work properly with all bit sizes. Noticed by Georg. No shader-db changes on any Intel platform. fossil-db: Lunar Lake, Meteor Lake, and DG2 had similar results. (Lunar Lake) Totals: Instrs: 210566294 -> 210561118 (-0.00%) Cycle count: 31582309052 -> 31576352808 (-0.02%); split: -0.02%, +0.00% Spill count: 519300 -> 519280 (-0.00%) Fill count: 625181 -> 625161 (-0.00%) Scratch Memory Size: 36289536 -> 36281344 (-0.02%) Max live registers: 66068413 -> 66068161 (-0.00%) Non SSA regs after NIR: 60230773 -> 60230775 (+0.00%) Totals from 1662 (0.24% of 707082) affected shaders: Instrs: 635064 -> 629888 (-0.82%) Cycle count: 36549632 -> 30593388 (-16.30%); split: -16.43%, +0.14% Spill count: 246 -> 226 (-8.13%) Fill count: 280 -> 260 (-7.14%) Scratch Memory Size: 16384 -> 8192 (-50.00%) Max live registers: 178491 -> 178239 (-0.14%) Non SSA regs after NIR: 169552 -> 169554 (+0.00%) Tiger Lake Totals: Instrs: 238544730 -> 238539407 (-0.00%) Cycle count: 23679446097 -> 23673238578 (-0.03%); split: -0.03%, +0.00% Max live registers: 42494925 -> 42494799 (-0.00%) Non SSA regs after NIR: 63639071 -> 63639074 (+0.00%) Totals from 1662 (0.21% of 802704) affected shaders: Instrs: 626604 -> 621281 (-0.85%) Cycle count: 26444363 -> 20236844 (-23.47%); split: -23.50%, +0.02% Max live registers: 95405 -> 95279 (-0.13%) Non SSA regs after NIR: 181150 -> 181153 (+0.00%) Ice Lake Totals: Instrs: 238855310 -> 238826534 (-0.01%) Cycle count: 24952257277 -> 24944589398 (-0.03%); split: -0.03%, +0.00% Spill count: 575510 -> 575117 (-0.07%) Fill count: 713007 -> 708632 (-0.61%) Max live registers: 42499556 -> 42499432 (-0.00%) Non SSA regs after NIR: 64388747 -> 64388750 (+0.00%) Totals from 1662 (0.21% of 805149) affected shaders: Instrs: 926887 -> 898111 (-3.10%) Cycle count: 67025583 -> 59357704 (-11.44%); split: -11.45%, +0.01% Spill count: 5168 -> 4775 (-7.60%) Fill count: 32883 -> 28508 (-13.30%) Max live registers: 95614 -> 95490 (-0.13%) Non SSA regs after NIR: 181150 -> 181153 (+0.00%) Skylake Totals: Instrs: 161904416 -> 161895239 (-0.01%); split: -0.01%, +0.00% Cycle count: 20098067714 -> 20090767583 (-0.04%); split: -0.04%, +0.00% Spill count: 525546 -> 525789 (+0.05%); split: -0.04%, +0.09% Fill count: 603369 -> 602276 (-0.18%); split: -0.28%, +0.10% Max live registers: 33895714 -> 33895590 (-0.00%) Non SSA regs after NIR: 57348729 -> 57348730 (+0.00%) Totals from 1655 (0.25% of 653734) affected shaders: Instrs: 769979 -> 760802 (-1.19%); split: -1.83%, +0.64% Cycle count: 51365416 -> 44065285 (-14.21%); split: -14.22%, +0.01% Spill count: 4186 -> 4429 (+5.81%); split: -4.90%, +10.70% Fill count: 16356 -> 15263 (-6.68%); split: -10.50%, +3.82% Max live registers: 95115 -> 94991 (-0.13%) Non SSA regs after NIR: 180797 -> 180798 (+0.00%) Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34905>	2025-05-16 14:49:05 -07:00
Kenneth Graunke	deb1d47155	nir: Add a new optimization for acquire/release atomics & barriers Some shaders contain back-to-back atomic accesses in SPIR-V with AcquireRelease semantics. In NIR, we translate these to a release memory barrier, the atomic, then an acquire memory barrier. This results in a lot of unnecessary memory barriers in the middle of the sequence of atomics: 0. Release barrier 1. Atomic 2. Acquire barrier 3. Release barrier 4. Atomic 5. Acquire barrier 6. Release barrier 7. Atomic 8. Acquire barrier In the absence of loads/stores, and when the atomic destinations are unused, these barriers in-between atomics shouldn't be required. This optimization pass would drop them (lines 2-3 and 5-6 above) while leaving the first and last barriers (0 and 8), so the sequence remains synchronized against other access elsewhere in the program. One common example where this occurs is a sequence of min and max atomics to clamp a certain memory location's value within a range. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33504>	2025-05-16 00:29:13 +00:00
Marek Olšák	deda05e2b7	nir: move nir_lower_color_inputs into radeonsi it's the only user Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34492>	2025-05-14 20:19:17 +00:00
Alyssa Rosenzweig	52cc6c101f	nir/lower_printf: fix vectors with nir_printf_fmt for specifiers like %v4f, we need to store the whole vector. u_printf can already handle this from OpenCL, we just need to match that here. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34909>	2025-05-14 11:29:08 -04:00
Marek Olšák	069fdc6f71	nir: handle mov and bcsel in nir_def_bits_used Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	e080833478	nir: handle iand/ior opcodes recursively in nir_def_bits_used Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	a78ed8b8e8	nir: handle extract opcodes recursively in nir_def_bits_used Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	e38a0b9a05	nir: handle u2u/i2i recursively in nir_def_bits_used to get the number of bits actually used by the uses. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	15369a792a	nir: handle mul24 in nir_def_bits_used Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	7e7ef7b8b7	nir: handle bit shifts by constants in nir_def_bits_used useful for open-coded bitfield extracts that are not using ubfe/ibfe Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	7d24a9b649	nir: handle ibfe/ubfe in nir_def_bits_used it will be used by radeonsi Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
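
The barrier elimination described in commit deb1d47155 can be illustrated on a flat list of operations. This is only a toy model under stated assumptions, not the NIR pass itself: the `op_kind` enum and the `drop_redundant_barriers` function are hypothetical names invented for this sketch, and the model assumes that every atomic's result is unused, as the commit message requires. An acquire/release pair is dropped only when it sits directly between two atomics, so any intervening load or store keeps the barriers in place.

```c
#include <stddef.h>

/* Hypothetical operation kinds for a toy instruction stream. */
enum op_kind { OP_RELEASE, OP_ATOMIC, OP_ACQUIRE, OP_LOAD, OP_STORE };

/* Removes every acquire barrier + release barrier pair that sits directly
 * between two atomics. The array is compacted in place and the new length
 * is returned. Assumes the results of all atomics are unused. */
size_t drop_redundant_barriers(enum op_kind *ops, size_t n)
{
    size_t out = 0; /* number of operations kept so far */
    size_t i = 0;   /* read cursor */

    while (i < n) {
        int between_atomics =
            out > 0 && ops[out - 1] == OP_ATOMIC && /* last kept op is an atomic */
            i + 2 < n &&
            ops[i] == OP_ACQUIRE &&
            ops[i + 1] == OP_RELEASE &&
            ops[i + 2] == OP_ATOMIC;                /* next op is an atomic */

        if (between_atomics)
            i += 2;                 /* skip the acquire/release pair */
        else
            ops[out++] = ops[i++];  /* keep everything else */
    }
    return out;
}
```

Applied to the nine-operation sequence from the commit message, this keeps the first release barrier, the three atomics, and the final acquire barrier.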

6227 commits
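
Commit 78aae4b1ba above notes that, without a cast, the left shift in the pack_half_2x16 constant folding is promoted to 'int'. The sketch below shows the general C pitfall and the usual remedy. It is not the Mesa code: `pack_2x16` is a hypothetical helper, and the explanation assumes a platform where `int` is 32 bits wide.

```c
#include <stdint.h>

/* Hypothetical helper: packs two 16-bit values into one 32-bit word.
 *
 * A uint16_t operand is promoted to int before shifting. With a 32-bit int,
 * shifting a value of 0x8000 or more left by 16 cannot be represented in
 * int, which is undefined behavior and is what UBSan reports. Casting to
 * uint32_t first makes the shift well defined. */
uint32_t pack_2x16(uint16_t lo, uint16_t hi)
{
    return (uint32_t)lo | ((uint32_t)hi << 16);
}
```

The value 50432 quoted in the commit message is 0xC500, so it has the top bit of the 16-bit half set and triggers the overflow when shifted as an `int`.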