fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 11:48:05 +02:00

Author	SHA1	Message	Date
Natalie Vock	57f796752d	nir/deref: Elide loads/stores from deref cast of undef These can never be meaningful. DOOM: The Dark Ages also relies on this. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40799>	2026-04-15 08:42:12 +00:00
Samuel Pitoiset	0b016f4bff	nir: add new system values for descriptor heap RT traversal inputs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39483>	2026-04-14 10:10:22 +00:00
Georg Lehmann	d2d86b83fb	nir/opt_reassociate: use nir_fp_no_reassoc instead of exact Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40872>	2026-04-12 17:10:27 +00:00
Georg Lehmann	6549b993e6	nir/algebraic: actually seperate contract and inexact Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40872>	2026-04-12 17:10:27 +00:00
Georg Lehmann	ed50cf29f3	spirv: map float control2 to fine grained nir flags instead of exact Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40872>	2026-04-12 17:10:27 +00:00
Georg Lehmann	a42ac8dec6	nir: split exact bit into no_contract/reassoc/transform Just like float control2. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40872>	2026-04-12 17:10:27 +00:00
Job Noorman	b8437e30bf	nir/lower_atomics: add support for bindless_image_atomic Initial data is loaded via bindless_image_load, atomic swap via bindless_image_atomic_swap. Signed-off-by: Job Noorman <jnoorman@igalia.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39932>	2026-04-11 19:46:13 +00:00
Alyssa Rosenzweig	4356ad1bf5	nir: add pixel_coord_intel This is a 2x16 bitpacked version of load_pixel_coord which maps directly to the hardware value and is much easier for Jay to consume due to the sadness that is true 16-bit on Intel. Jay will lower to this internally. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40835>	2026-04-10 18:21:21 +00:00
Alyssa Rosenzweig	bd6d210386	nir: add shuffle_intel Jay will use this to lower & optimize subgroup shuffles. This is closer to how Intel hardware works but still much higher level than the hardware primitive. This gets us NIR optimizations on the multiply however. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40835>	2026-04-10 18:21:21 +00:00
Alyssa Rosenzweig	b840b178af	nir: add Intel RT write intrinsic This exposes the underlying render target write message directly, which Jay will use to lower RT writes in NIR. I'm still on the fence about what exactly this should look like but this is good enough for GLES3.0 (so, multiple render targets but not necessarily dual source blending). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40835>	2026-04-10 18:21:21 +00:00
Alyssa Rosenzweig	566047222e	nir: add frag_coord_w_rcp intrinsic This maps directly to what Intel's thread payload gives us, allowing us to optimize out frcp's in some cases. Jay will use this. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40835>	2026-04-10 18:21:21 +00:00
Kenneth Graunke	09089fdd13	nir: Add nir_texop_sparse_residency[_txf]_intel operations These lowered versions map to what Jay can deal with. The hardware is more flexible but we're not due to data model restrictions. We choose to lower to get us off the ground, we can revisit later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40835>	2026-04-10 18:21:21 +00:00
Faith Ekstrand	91e6507665	nir: Add a nir_alu_src_comp_as_uint() helper Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:08:40 -04:00
Faith Ekstrand	fb347b8458	nir: Add a couple is_zero() helpers Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40769>	2026-04-09 18:05:20 -04:00
Kenneth Graunke	0b99c88337	nir, brw: lower scratch in NIR Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This will let us share a common scratch swizzling between brw and jay. Changes by Ken: - Use an immediate SIMD width when known so we don't need to re-lower - Switch to load_simd_width_intel because it may not match info->api_subgroup_size on Vulkan without VK_EXT_subgroup_size_control - Stop using DWord Scattered Write messages for scratch. These take an offset in DWords, and our offsets are now always in bytes. This also means that we no longer create MEMORY_OPCODE_* IR with inconsistent units of either bytes or dwords. Yikes. We use byte scattered messages now. fossil-db stats on Battlemage: Instrs: 500477504 -> 500450056 (-0.01%); split: -0.01%, +0.00% CodeSize: 7807432368 -> 7806786192 (-0.01%); split: -0.01%, +0.00% Cycle count: 62404008370 -> 62398437734 (-0.01%); split: -0.01%, +0.00% Fill count: 546690 -> 546695 (+0.00%); split: -0.00%, +0.00% Max live registers: 141257956 -> 141258100 (+0.00%); split: -0.00%, +0.00% Non SSA regs after NIR: 72350283 -> 72336544 (-0.02%) Totals from 99 (0.01% of 1581969) affected shaders: Instrs: 366593 -> 339145 (-7.49%); split: -7.58%, +0.09% CodeSize: 6425936 -> 5779760 (-10.06%); split: -10.06%, +0.00% Cycle count: 2412009876 -> 2406439240 (-0.23%); split: -0.26%, +0.03% Fill count: 19675 -> 19680 (+0.03%); split: -0.02%, +0.04% Max live registers: 17600 -> 17744 (+0.82%); split: -0.09%, +0.91% Non SSA regs after NIR: 37894 -> 24155 (-36.26%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40843>	2026-04-09 21:02:16 +00:00
Kenneth Graunke	0c0fea1cdb	nir: Increase tex opcode bits from 5 to 6 in nir_instr_set We are already at our limit of 31 texture opcodes, and cannot add any more without expanding the opcode hashing in nir_instr_set. Thankfully, it's at 29 bits, so adding one here is possible still. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40833>	2026-04-08 16:07:35 +00:00
Rhys Perry	01516746eb	nir: use a u_dynarray for block predecessors Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details A set is large and expensive to iterate. This is faster (overall fossilize-replay difference): Difference at 95.0% confidence -250 +/- 28.9257 -2.04849% +/- 0.235211% (Student's t, pooled s = 34.1626) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40242>	2026-04-08 15:06:34 +00:00
Rhys Perry	99c5f48da9	nir/cf: don't remove block predecessors while iterating This would have been fine for a set, but we're going to use a dynarray for the predecessors in the future. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40242>	2026-04-08 15:06:33 +00:00
Rhys Perry	ea148c9136	nir: add nir_loop_has_back_edge helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40242>	2026-04-08 15:06:32 +00:00
Rhys Perry	463e3643f2	nir: add and use block predecessor helpers Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40242>	2026-04-08 15:06:32 +00:00
Samuel Pitoiset	b402b98347	nir: allow heap image intrinsics in nir_rewrite_image_intrinsic() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40842>	2026-04-08 13:56:04 +00:00
Samuel Pitoiset	a41724e923	nir: remove nir_intrinsic_global_addr_to_descriptor It's no longer emitted by vk_nir_lower_descriptor_heaps(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40826>	2026-04-08 09:46:01 +00:00
Georg Lehmann	0066328cf1	nir/opt_algebraic: create more 64bit bit test Foz-DB GFX1201: Totals from 2 (0.00% of 205032) affected shaders: Instrs: 3429 -> 3425 (-0.12%) CodeSize: 19580 -> 19568 (-0.06%) Latency: 13629 -> 13628 (-0.01%); split: -0.02%, +0.01% InvThroughput: 1853 -> 1847 (-0.32%) Copies: 235 -> 237 (+0.85%) VALU: 1901 -> 1898 (-0.16%) SALU: 381 -> 380 (-0.26%) VOPD: 307 -> 309 (+0.65%) Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40705>	2026-04-08 08:44:20 +00:00
Rhys Perry	54515bf8d1	nir/propagate_invariant: handle images Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40740>	2026-04-08 07:10:26 +00:00
Rhys Perry	4d65a7d16e	nir/propagate_invariant: be more conservative with aliasing variables No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40740>	2026-04-08 07:10:26 +00:00
Rhys Perry	310c8a0a17	nir/propagate_invariant: be more conservative with NULL variables If we can't determine what variable is accessed, we should assume it could be any. This might make things worse with Vulkan since it does vulkan_resource_index+load_vulkan_descriptor, but I don't think it matters much. SSBO stores are rarely used in vertex shaders. fossil-db (navi21): Totals from 1 (0.00% of 202427) affected shaders: Instrs: 442 -> 445 (+0.68%) Latency: 2038 -> 2043 (+0.25%) InvThroughput: 432 -> 437 (+1.16%) VALU: 295 -> 298 (+1.02%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40740>	2026-04-08 07:10:26 +00:00
Rhys Perry	b6f9a9092a	nir/propagate_invariant: include derefs In case an array index is calculated using floating point. fossil-db (navi21): Totals from 1339 (0.66% of 202427) affected shaders: MaxWaves: 30368 -> 30396 (+0.09%) Instrs: 805468 -> 817035 (+1.44%); split: -0.00%, +1.44% CodeSize: 4084432 -> 4130548 (+1.13%); split: -0.01%, +1.14% VGPRs: 62000 -> 61952 (-0.08%) Latency: 3885607 -> 3903915 (+0.47%); split: -0.01%, +0.48% InvThroughput: 726350 -> 742176 (+2.18%); split: -0.03%, +2.21% VClause: 22187 -> 22676 (+2.20%); split: -0.14%, +2.34% SClause: 14663 -> 14616 (-0.32%); split: -0.42%, +0.10% Copies: 65726 -> 65840 (+0.17%); split: -0.09%, +0.26% Branches: 21075 -> 21076 (+0.00%); split: -0.02%, +0.03% PreVGPRs: 48970 -> 48941 (-0.06%) VALU: 454510 -> 466082 (+2.55%); split: -0.01%, +2.55% SALU: 130543 -> 130521 (-0.02%); split: -0.03%, +0.01% VMEM: 43876 -> 43896 (+0.05%) Most of these changes are elden_ring shaders which can no longer optimize f2u32(u2f32()). Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40740>	2026-04-08 07:10:26 +00:00
Rhys Perry	75ccfd8fb2	nir/propagate_invariant: set fp_math_ctrl for intrinsics No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40740>	2026-04-08 07:10:26 +00:00
Rhys Perry	20e85604ad	nir/propagate_invariant: include intrinsics No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40740>	2026-04-08 07:10:26 +00:00
Rhys Perry	b30c0d8264	nir/algebraic: optimize exact f2u32(fmul(unpack_norm)) fossil-db (navi21): Totals from 16 (0.01% of 202427) affected shaders: Instrs: 17730 -> 17226 (-2.84%) CodeSize: 97500 -> 95708 (-1.84%) InvThroughput: 44437 -> 44419 (-0.04%) Copies: 1502 -> 1446 (-3.73%) VALU: 9973 -> 9525 (-4.49%) SALU: 3509 -> 3453 (-1.60%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40740>	2026-04-08 07:10:26 +00:00
Rhys Perry	f52dace6e8	nir/tests: fix NaN/inf checks in skip_test() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40740>	2026-04-08 07:10:26 +00:00
Samuel Pitoiset	33676d8296	spirv: mark all resources as non-uniform by default with descriptor heap It's required by descriptor heap. There is already a NIR pass that optimizes non-uniform access, so this should be mostly safe. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40768>	2026-04-07 18:55:49 +00:00
Samuel Pitoiset	74aa40f6ed	nir: remove resource/sampler heap ptrs sysvals They are no longer used. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40768>	2026-04-07 18:55:49 +00:00
Samuel Pitoiset	fb96f85d19	spirv: implement SpvOpUntypedImageTexelPointerEXT Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40768>	2026-04-07 18:55:49 +00:00
Samuel Pitoiset	7088621874	spirv: emit nir_intrinsic_image_heap when resource/sampler ptrs are used This seems better because there are no variables at all. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40768>	2026-04-07 18:55:49 +00:00
Samuel Pitoiset	f1afce673b	spirv: set the image format for image intrinsics This isn't required for deref instructions because it's possible to get the image format back from the variable but it will be useful for descriptor heap. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40768>	2026-04-07 18:55:49 +00:00
Samuel Pitoiset	7dbef85365	spirv: change the resource/sampler builtins variable mode Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40768>	2026-04-07 18:55:49 +00:00
Samuel Pitoiset	457eac7d66	nir: add new variable modes for the resource/sampler heap pointers These two new variable modes are used to relax restrictions on deref casts through because it's possible to cast different modes from the heap pointers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40768>	2026-04-07 18:55:49 +00:00
Daniel Schürmann	daa3d5292f	nir/opt_if: allow undef instructions on ELSE side for if-simplification Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Totals from 72 (0.04% of 202440) affected shaders: (Navi48) Instrs: 213900 -> 213873 (-0.01%); split: -0.03%, +0.02% CodeSize: 1215012 -> 1214924 (-0.01%); split: -0.01%, +0.01% Latency: 4458993 -> 4458679 (-0.01%) Copies: 18840 -> 18816 (-0.13%) Branches: 5044 -> 5043 (-0.02%) VALU: 116547 -> 116529 (-0.02%) SALU: 28686 -> 28669 (-0.06%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40738>	2026-04-07 17:28:11 +00:00
Daniel Schürmann	1394f79517	nir/opt_if: allow load_const instructions on ELSE-side for if-simplifaction Totals from 1974 (0.98% of 202440) affected shaders: (Navi48) Instrs: 6438920 -> 6437055 (-0.03%); split: -0.06%, +0.03% CodeSize: 35080732 -> 35075136 (-0.02%); split: -0.04%, +0.02% SpillSGPRs: 2106 -> 2122 (+0.76%) Latency: 52684517 -> 52677236 (-0.01%); split: -0.02%, +0.01% InvThroughput: 8977644 -> 8976740 (-0.01%); split: -0.01%, +0.00% VClause: 124447 -> 124444 (-0.00%) SClause: 117561 -> 117560 (-0.00%); split: -0.00%, +0.00% Copies: 413450 -> 410708 (-0.66%); split: -0.67%, +0.01% Branches: 136429 -> 136169 (-0.19%); split: -0.20%, +0.01% PreSGPRs: 114813 -> 114918 (+0.09%); split: -0.01%, +0.10% PreVGPRs: 108142 -> 108145 (+0.00%); split: -0.00%, +0.00% VALU: 3275624 -> 3274927 (-0.02%); split: -0.03%, +0.00% SALU: 1166159 -> 1165039 (-0.10%); split: -0.17%, +0.07% VOPD: 333456 -> 333183 (-0.08%); split: +0.02%, -0.10% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40738>	2026-04-07 17:28:11 +00:00
Job Noorman	cc6eec79c2	nir/opt_uniform_subgroup: fix ballot_bit_count components Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details ballot_bit_count_reduce expects the ballot to have 4 components causing validation failures on targets where 1 < ballot_components < 4. Fix this by padding the ballot to 4 components. Signed-off-by: Job Noorman <jnoorman@igalia.com> Fixes: `ae66bd1c00` ("nir/opt_uniform_subgroup: use ballot_bit_count") Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40792>	2026-04-04 17:48:34 +00:00
Georg Lehmann	8730c039bf	nir/opt_algebraic: move some lower_lerp patterns Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details If we rely on the pattern order here, we don't need to duplicate per bit size. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40730>	2026-04-04 10:44:09 +00:00
Georg Lehmann	f192fe99eb	nir/opt_algebraic: update open coded flerp(..., b2f(c)) to bcsel patterns We remove 1.0 - b2f(a) since `6a662a59b7`. Foz-DB Navi48: Totals from 200 (0.10% of 205032) affected shaders: Instrs: 410309 -> 409750 (-0.14%); split: -0.18%, +0.05% CodeSize: 2140424 -> 2136956 (-0.16%); split: -0.21%, +0.05% Latency: 5834394 -> 5834042 (-0.01%); split: -0.02%, +0.01% InvThroughput: 906879 -> 906374 (-0.06%); split: -0.06%, +0.01% VClause: 8247 -> 8244 (-0.04%) SClause: 7721 -> 7723 (+0.03%); split: -0.03%, +0.05% Copies: 20515 -> 20487 (-0.14%); split: -0.29%, +0.16% PreVGPRs: 14510 -> 14481 (-0.20%) VALU: 228703 -> 228235 (-0.20%); split: -0.28%, +0.07% SALU: 62832 -> 62914 (+0.13%); split: -0.18%, +0.31% VOPD: 929 -> 927 (-0.22%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40730>	2026-04-04 10:44:09 +00:00
Georg Lehmann	fc19ce6c17	nir/opt_load_skip_helpers: don't skip helpers for store_scratch data Scratch stores store data for helper lanes that might be used later by an instruction that cares about helpers, or even by control flow. Fixes: `a65009e808` ("nir: Add a nir_opt_tex_skip_helpers optimization") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/14965 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40770>	2026-04-04 09:41:00 +00:00
Lionel Landwerlin	30fda4488f	nir: divergence analysis support for image_heap_load_param_intel Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `22b16d54ab` ("nir: add heap variant of load_param_intel") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40777>	2026-04-03 07:49:24 +00:00
Job Noorman	adfc1086c9	nir/recompute_io_bases: fix num_slots for per_view outputs per_view outputs use one slot per enabled view. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40651>	2026-04-03 08:18:08 +02:00
Job Noorman	a72704d0fb	nir/gather_info: clear interpolation qualifiers before gathering Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Fixes: `66740d9c91` ("nir: gather interpolation qualifiers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40651>	2026-04-03 08:18:08 +02:00
Job Noorman	2403e88a76	nir/gather_info: gather per_view info Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40651>	2026-04-03 08:18:08 +02:00
Job Noorman	273fd18b89	nir/opt_varyings: fix alu def cloning nir_builder_alu_instr_finish_and_insert initialized the def's bit_size and num_components so we should set them afterwards. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Fixes: `c66967b5cb` ("nir: add nir_opt_varyings, new pass optimizing and compacting varyings") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40651>	2026-04-03 08:18:08 +02:00
Job Noorman	d56d35aa76	nir/opt_varyings_bulk: add data parameter to optimize callback Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40651>	2026-04-03 08:18:08 +02:00

1 2 3 4 5 ...

12031 commits