fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 22:28:06 +02:00

Author	SHA1	Message	Date
Rhys Perry	397920c16e	nir: fix left shift of negative value in ibfe constant folding Fixes "left shift of negative value -128" with parallel_rdp/00f93a9497dfbb3b and UBSan. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35255>	2025-06-03 09:45:01 +00:00
Rhys Perry	78aae4b1ba	nir: fix signed overflow in pack_half_2x16 constant folding Without this cast, the left shift is promoted to 'int'. Fixes "left shift of 50432 by 16 places cannot be represented in type 'int'" with horizon_zero_dawn/001064f580f8e3be and UBSan. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35255>	2025-06-03 09:45:01 +00:00
Rhys Perry	6852538ba0	nir: fix unpack_unorm_2x16/unpack_snorm_2x16 constant folding Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 25.0 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35255>	2025-06-03 09:45:01 +00:00
Marek Olšák	bf2ed20eb9	nir: remove unused nir_io_semantics::invariant Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Acked-by: Alyssa on IRC Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	44fcda9631	nir/opt_clip_cull_const: support GS Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	6677d087c0	nir/xfb_info: add new fields to describe 16-bit XFB better for drivers that need this information Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	7b70b419b5	nir: always index SSA defs before printing This makes the output more readable. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Marek Olšák	cf94ae8544	nir: change the type of shader_info::patch_* fields to 32 bits Patch outputs only use 32 bits. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35256>	2025-06-02 23:08:58 +00:00
Jesse Natalie	f0dde6ca7f	nir_gather_output_deps: Fix incorrect enum in switch Cc: mesa-stable Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35247>	2025-05-30 17:04:18 +00:00
Lionel Landwerlin	f0e18c475b	intel: remove GRL/intel-clc Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35227>	2025-05-29 20:17:13 +00:00
Samuel Pitoiset	cecf6675be	nir/lower_int64: add bitfield_extract lowering This will be used by RADV for ACO/LLVM. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35187>	2025-05-29 08:45:40 +02:00
Alyssa Rosenzweig	d696b19dd0	nir/lower_int64: add bitfield_reverse lowering now that we can represent 64-bit bitfield_reverse in NIR, we need a lowering for it as well. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35198>	2025-05-28 16:29:30 +00:00
Alyssa Rosenzweig	c3fb0645d8	nir/lower_alu: compact bitcount lowering while in the area. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35198>	2025-05-28 16:29:30 +00:00
Alyssa Rosenzweig	759dc70bde	nir: generalize bitfield_reverse bit size No reason we can't reverse other bit sizes, we just need to generalize the constant folding & bit size lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35198>	2025-05-28 16:29:30 +00:00
Marek Olšák	35c76bc7f7	nir/tcs_info: use range analysis to determine the range of tess levels Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35195>	2025-05-28 06:46:56 +00:00
Marek Olšák	24c3f30e4a	nir/tcs_info: gather which patch outputs are only read/written by invoc 0 Tested thoroughly by a shader test. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35195>	2025-05-28 06:46:56 +00:00
Marek Olšák	a3632d7d88	nir/tcs_info: gather for all patch outputs whether they're written by all invocs This substantially rewrites the pass. It also makes it easier to read. Tested thoroughly by a shader test. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35195>	2025-05-28 06:46:56 +00:00
Lorenzo Rossi	2c0d0bad01	nak: Remove unused intrinsic image_load_raw_nv Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34975>	2025-05-28 01:47:19 +00:00
Lorenzo Rossi	5fbcdd6e32	nir,nak: Add NV-specific image intrinsics Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34975>	2025-05-28 01:47:19 +00:00
Lorenzo Rossi	47f6c74b71	nir,nak: Add KeplerB shared atomics intrinsics and lowering Kepler cards do not support shared atomic operations directly, but they have special ldslk and stsul that can implement mutex locks on addresses. Shared atomics can be lowered into operations in mutexes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35028>	2025-05-26 16:29:05 +00:00
Qiang Yu	6f2a1e19da	nir/opt_varyings: fix mesh shader miss promote varying to flat Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We still allow mesh shader promote constant output to flat, but mesh shader like geometry shader may store multi vertices' varying in a single thread. So mesh shader may store different constant values to different vertices in a single thread, we should not promote this case to flat. I'm not using shader_info.mesh.ms_cross_invocation_output_access because OpenGL does not require IO to have explicit location, so when nir_shader_gather_info is called in OpenGL GLSL compiler to compute ms_cross_invocation_output_access, some implicit output has -1 location which causes ms_cross_invocation_output_access unset for it. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13134 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35081>	2025-05-26 02:07:50 +00:00
Eric Engestrom	162f1f5566	delete xa leftovers Fixes: `3be2c47db2` ("delete the XA frontend") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35136>	2025-05-23 18:54:04 +00:00
Mike Blumenkrantz	00aaef9f12	delete gallium-nine Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details farewell, old friend Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Axel Davy <davyaxel0@gmail.com> Acked-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34887>	2025-05-23 13:43:37 -04:00
Karol Herbst	bc444f6d26	nir: fix use-after-free on function parameter names Fixes: `3da8444be5` ("nir: add names to function parameters") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35098>	2025-05-22 22:59:08 +00:00
Patrick Lerda	1186c73c6b	r600: implement gs indirect load_per_vertex_input This functionality is useful with the software fp64 implementation. It allows running the remaining tests. Note: the same tests do not generate this indirect access on cayman which has the hardware fp64 implementation enabled. This change was tested on cypress, palm and barts. Here are the tests fixed: spec/arb_gpu_shader_fp64/execution/gs-isnan-dvec: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-array-copy: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-dmat4: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-dmat4-row-major: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-double-array-const-index: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-double-array-variable-index: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-double-bool-double: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-double-uniform-array-direct-indirect: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-doubles-float-mixed: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-dvec4-uniform-array-direct-indirect: fail pass spec/arb_gpu_shader_fp64/uniform_buffers/gs-nested-struct: fail pass Signed-off-by: Patrick Lerda <patrick9876@free.fr> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34926>	2025-05-19 12:07:37 +00:00
Ian Romanick	37ee91679a	nir/algebraic: Generalize an existing bfi(a, 0, ...) pattern No shader-db changes on any Intel platform. fossil-db: All Intel platforms had similar results. (Lunar Lake shown) Totals: Instrs: 210561118 -> 210560921 (-0.00%) Send messages: 10979615 -> 10979613 (-0.00%) Cycle count: 31576352808 -> 31576347218 (-0.00%); split: -0.00%, +0.00% Max live registers: 66068161 -> 66068157 (-0.00%) Non SSA regs after NIR: 60230775 -> 60230949 (+0.00%) Totals from 180 (0.03% of 707082) affected shaders: Instrs: 68035 -> 67838 (-0.29%) Send messages: 3190 -> 3188 (-0.06%) Cycle count: 3979496 -> 3973906 (-0.14%); split: -0.14%, +0.00% Max live registers: 11812 -> 11808 (-0.03%) Non SSA regs after NIR: 18878 -> 19052 (+0.92%) Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34905>	2025-05-16 14:49:25 -07:00
Ian Romanick	464955bbdd	nir/algebraic: Optimize some open-coded extract_i8 These were initially observed in Hogwarts Legacy while working on something else entirely. Two compute shaders in that app are helped for spills and fills. On Skylake, one of the shaders benefits from this change, and the other is hurt pretty significantly. About 40 vertex shaders in Shadow of the Tomb Raider were helped for instructions. v2: Use ~0xff instead of 0xffffff00 to ensure the patterns will work properly with all bit sizes. Noticed by Georg. v3: No, really, fix the various errors to ensure the patterns will work properly with all bit sizes. Noticed by Georg. No shader-db changes on any Intel platform. fossil-db: Lunar Lake, Meteor Lake, and DG2 had similar results. (Lunar Lake) Totals: Instrs: 210566294 -> 210561118 (-0.00%) Cycle count: 31582309052 -> 31576352808 (-0.02%); split: -0.02%, +0.00% Spill count: 519300 -> 519280 (-0.00%) Fill count: 625181 -> 625161 (-0.00%) Scratch Memory Size: 36289536 -> 36281344 (-0.02%) Max live registers: 66068413 -> 66068161 (-0.00%) Non SSA regs after NIR: 60230773 -> 60230775 (+0.00%) Totals from 1662 (0.24% of 707082) affected shaders: Instrs: 635064 -> 629888 (-0.82%) Cycle count: 36549632 -> 30593388 (-16.30%); split: -16.43%, +0.14% Spill count: 246 -> 226 (-8.13%) Fill count: 280 -> 260 (-7.14%) Scratch Memory Size: 16384 -> 8192 (-50.00%) Max live registers: 178491 -> 178239 (-0.14%) Non SSA regs after NIR: 169552 -> 169554 (+0.00%) Tiger Lake Totals: Instrs: 238544730 -> 238539407 (-0.00%) Cycle count: 23679446097 -> 23673238578 (-0.03%); split: -0.03%, +0.00% Max live registers: 42494925 -> 42494799 (-0.00%) Non SSA regs after NIR: 63639071 -> 63639074 (+0.00%) Totals from 1662 (0.21% of 802704) affected shaders: Instrs: 626604 -> 621281 (-0.85%) Cycle count: 26444363 -> 20236844 (-23.47%); split: -23.50%, +0.02% Max live registers: 95405 -> 95279 (-0.13%) Non SSA regs after NIR: 181150 -> 181153 (+0.00%) Ice Lake Totals: Instrs: 238855310 -> 238826534 (-0.01%) Cycle count: 24952257277 -> 24944589398 (-0.03%); split: -0.03%, +0.00% Spill count: 575510 -> 575117 (-0.07%) Fill count: 713007 -> 708632 (-0.61%) Max live registers: 42499556 -> 42499432 (-0.00%) Non SSA regs after NIR: 64388747 -> 64388750 (+0.00%) Totals from 1662 (0.21% of 805149) affected shaders: Instrs: 926887 -> 898111 (-3.10%) Cycle count: 67025583 -> 59357704 (-11.44%); split: -11.45%, +0.01% Spill count: 5168 -> 4775 (-7.60%) Fill count: 32883 -> 28508 (-13.30%) Max live registers: 95614 -> 95490 (-0.13%) Non SSA regs after NIR: 181150 -> 181153 (+0.00%) Skylake Totals: Instrs: 161904416 -> 161895239 (-0.01%); split: -0.01%, +0.00% Cycle count: 20098067714 -> 20090767583 (-0.04%); split: -0.04%, +0.00% Spill count: 525546 -> 525789 (+0.05%); split: -0.04%, +0.09% Fill count: 603369 -> 602276 (-0.18%); split: -0.28%, +0.10% Max live registers: 33895714 -> 33895590 (-0.00%) Non SSA regs after NIR: 57348729 -> 57348730 (+0.00%) Totals from 1655 (0.25% of 653734) affected shaders: Instrs: 769979 -> 760802 (-1.19%); split: -1.83%, +0.64% Cycle count: 51365416 -> 44065285 (-14.21%); split: -14.22%, +0.01% Spill count: 4186 -> 4429 (+5.81%); split: -4.90%, +10.70% Fill count: 16356 -> 15263 (-6.68%); split: -10.50%, +3.82% Max live registers: 95115 -> 94991 (-0.13%) Non SSA regs after NIR: 180797 -> 180798 (+0.00%) Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34905>	2025-05-16 14:49:05 -07:00
Kenneth Graunke	deb1d47155	nir: Add a new optimization for acquire/release atomics & barriers Some shaders contain back-to-back atomic accesses in SPIR-V with AcquireRelease semantics. In NIR, we translate these to a release memory barrier, the atomic, then an acquire memory barrier. This results in a lot of unnecessary memory barriers in the middle of the sequence of atomics: 0. Release barrier 1. Atomic 2. Acquire barrier 3. Release barrier 4. Atomic 5. Acquire barrier 6. Release barrier 7. Atomic 8. Acquire barrier In the absence of loads/stores, and when the atomic destinations are unused, these barriers in-between atomics shouldn't be required. This optimization pass would drop them (lines 2-3 and 5-6 above) while leaving the first and last barriers (0 and 8), so the sequence remains synchronized against other access elsewhere in the program. One common example where this occurs is a sequence of min and max atomics to clamp a certain memory location's value within a range. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33504>	2025-05-16 00:29:13 +00:00
Marek Olšák	deda05e2b7	nir: move nir_lower_color_inputs into radeonsi it's the only user Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34492>	2025-05-14 20:19:17 +00:00
Alyssa Rosenzweig	52cc6c101f	nir/lower_printf: fix vectors with nir_printf_fmt for specifiers like %v4f, we need to store the whole vector. u_printf can already handle this from OpenCL, we just need to match that here. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34909>	2025-05-14 11:29:08 -04:00
Marek Olšák	069fdc6f71	nir: handle mov and bcsel in nir_def_bits_used Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	e080833478	nir: handle iand/ior opcodes recursively in nir_def_bits_used Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	a78ed8b8e8	nir: handle extract opcodes recursively in nir_def_bits_used Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	e38a0b9a05	nir: handle u2u/i2i recursively in nir_def_bits_used to get the number of bits actually used by the uses. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	15369a792a	nir: handle mul24 in nir_def_bits_used Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	7e7ef7b8b7	nir: handle bit shifts by constants in nir_def_bits_used useful for open-coded bitfield extracts that are not using ubfe/ibfe Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Marek Olšák	7d24a9b649	nir: handle ibfe/ubfe in nir_def_bits_used it will be used by radeonsi Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34489>	2025-05-13 15:38:37 +00:00
Danylo Piliaiev	4bc060ea11	nir: Add option to not lower gl_InstanceIndex Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Emma Anholt <anholt@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34902>	2025-05-13 11:35:39 +00:00
Georg Lehmann	0a30611c10	nir/opt_algebraic: some bitfield_select optimizations Foz-DB Navi21: Totals from 47 (0.06% of 79789) affected shaders: Instrs: 69536 -> 69363 (-0.25%) CodeSize: 370624 -> 369388 (-0.33%) Latency: 383505 -> 383298 (-0.05%) InvThroughput: 72924 -> 72727 (-0.27%) PreSGPRs: 2618 -> 2610 (-0.31%) VALU: 43261 -> 43091 (-0.39%) SALU: 13065 -> 13063 (-0.02%) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34739>	2025-05-13 10:59:09 +00:00
Marek Olšák	a1ee6d6730	nir: fix gathering color interp modes in nir_lower_color_inputs Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fixes: `709ebd82` ("amd: expose nir_io_mix_convergent_flat_with_interpolated") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12800 Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34942>	2025-05-13 00:05:37 -04:00
Konstantin Seurer	5926b63f66	nir: Print struct type declarations Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26267>	2025-05-12 18:28:50 +00:00
Konstantin Seurer	5981b5bb7e	nir/print: Use get_name for types This avoids an awkward " " if the struct name is missing. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26267>	2025-05-12 18:28:50 +00:00
Konstantin Seurer	d21311504b	nir/print: Add a get_name helper get_name works for any identifier, not just variables. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26267>	2025-05-12 18:28:50 +00:00
Lorenzo Rossi	ee4cff7603	nvk: nak: Add OpViLd support Kepler and earlier GPUs do not support the ISBERD instruction but have a different VILD (Vertex Indirect Load) instruction that provides less functionality. This commit adds support for the op in nak and nir, needed for the upcoming encoder commit. Signed-off-by: Lorenzo Rossi <snowycoder@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34329>	2025-05-12 16:41:48 +00:00
Karol Herbst	f0fa2209a8	nir: add nir_opt_algebraic_integer_promotion This handles basic operations where clang promotes integers to 32 bits according to the C99 spec in OpenCL C source code. This is its own opt_algerbraic pass, because we don't wanna fight with nir_lower_bit_size. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34641>	2025-05-12 09:29:20 +00:00
Marek Olšák	dbef8f1791	nir/opt_vectorize_io: fix a failure when vectorizing different bit sizes Fixes: `2514999c9c` - nir: add nir_opt_vectorize_io, vectorizing lowered IO Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13085 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34897>	2025-05-09 20:50:57 +00:00
Daniel Schürmann	3dab7b0a45	nir/tests: add tests for nir_move_terminate_out_of_loops Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33479>	2025-05-09 17:20:29 +00:00
Daniel Schürmann	c59356e6a5	nir: add option to move terminate{_if} out of loops Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33479>	2025-05-09 17:20:29 +00:00
Sil Vilerino	150fa795fe	nir: Only build nir headers for mediafoundation/d3d12-no-graphics paired build Reviewed-by: <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34845>	2025-05-09 16:34:00 +00:00
Georg Lehmann	ba63263f32	nir: add bfdot2_bfadd and use it for lowering bfdot if supported Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34768>	2025-05-09 11:20:26 +00:00

1 2 3 4 5 ...

6214 commits