fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 00:48:07 +02:00

Author	SHA1	Message	Date
Konstantin Seurer	2a4b1ea69b	nir/opt_ray_queries: Cleanup and return if functions is not singular Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37283>	2025-09-12 19:11:02 +00:00
David Rosca	4b54277d2e	Remove VDPAU VDPAU only supports X11 and GL interop. There is no Wayland or Vulkan interop support. The API has limitations that make it impossible to correctly decode certain streams. Application support is also very limited, and VAAPI is always a better choice over VDPAU. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36632>	2025-09-10 12:33:57 +00:00
Georg Lehmann	08b58c3fac	nir/lower_subgroups: remove lower_fp64 option This was incorrect (it also lowered int64 reductions/scans), and the only user can just use the general callback to precisely only lower what it wants. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37164>	2025-09-09 11:09:22 +00:00
Georg Lehmann	687510495f	nir: remove subgroup size related nir_shader_compiler_options members This was added with the goal to eventually replace the per pass subgroup/ballot size options, but that won't work because some backends don't have a fixed subgroup size across the compilation process. It was also mostly added to hack around mesa state tracker behavior, and we have a better solution there now. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37164>	2025-09-09 11:09:22 +00:00
Georg Lehmann	9bc14a0047	nir/lower_subgroup: optimize reduce/scans with unknown subgroup size We skip iterations with ifs. These can be optimized away after the subgroup size is known. Every driver should do that because applications depend on it anyway. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37164>	2025-09-09 11:09:21 +00:00
Rhys Perry	c59a85d406	nir/load_store_vectorize: remove offset check in try_vectorize_shared2 This doesn't seem useful anymore. fossil-db (gfx1201): Totals from 111 (0.14% of 79839) affected shaders: Instrs: 152356 -> 151883 (-0.31%); split: -0.35%, +0.04% CodeSize: 808484 -> 805584 (-0.36%); split: -0.39%, +0.04% VGPRs: 7880 -> 7844 (-0.46%); split: -0.91%, +0.46% Latency: 4121366 -> 4120648 (-0.02%); split: -0.04%, +0.02% InvThroughput: 814622 -> 815362 (+0.09%); split: -0.02%, +0.11% VClause: 3066 -> 3065 (-0.03%); split: -0.10%, +0.07% SClause: 2594 -> 2593 (-0.04%) Copies: 9412 -> 9447 (+0.37%); split: -0.47%, +0.84% PreSGPRs: 4012 -> 4026 (+0.35%) PreVGPRs: 4025 -> 4070 (+1.12%); split: -0.22%, +1.34% VALU: 80457 -> 81039 (+0.72%); split: -0.08%, +0.80% SALU: 16542 -> 16528 (-0.08%); split: -0.10%, +0.02% VOPD: 39 -> 44 (+12.82%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36370>	2025-09-09 10:11:52 +00:00
Rhys Perry	0f364aded3	nir/opt_offsets: improve shared2 optimization Combine additions too, instead of just constant offsets. fossil-db (gfx1201): Totals from 97 (0.12% of 79839) affected shaders: Instrs: 145269 -> 144886 (-0.26%); split: -0.27%, +0.01% CodeSize: 762184 -> 759556 (-0.34%); split: -0.36%, +0.01% VGPRs: 5812 -> 5764 (-0.83%) Latency: 4050681 -> 4050528 (-0.00%); split: -0.01%, +0.00% InvThroughput: 617458 -> 617181 (-0.04%); split: -0.05%, +0.00% Copies: 8719 -> 8672 (-0.54%); split: -0.70%, +0.16% PreVGPRs: 3558 -> 3543 (-0.42%); split: -0.59%, +0.17% VALU: 77793 -> 77462 (-0.43%); split: -0.44%, +0.01% SALU: 17028 -> 17009 (-0.11%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36370>	2025-09-09 10:11:51 +00:00
Rhys Perry	c10e495182	nir/opt_offsets: fix progress determination with offsets that add to zero If the offset is iadd(iadd(iadd(a, 1), b), -1), try_extract_const_addition will create a dead iadd(a, b) and claim that it didn't modify the shader. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36370>	2025-09-09 10:11:50 +00:00
Rhys Perry	9aad852af8	nir/opt_offsets: report progress if NUW is set Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36370>	2025-09-09 10:11:50 +00:00
Mel Henning	eba08245a8	treewide: Spell indices correctly LOLed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36184>	2025-09-08 23:03:13 +00:00
Mel Henning	17876a00af	nir: Add a faster lowest common ancestor algorithm On a fossil from the blender 4.5.0 vulkan backend, this improves compile times in nak by about 17%. Compile time of other shaders improves by a more modest 1.2%. No stat changes on shader-db. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36184>	2025-09-08 23:03:13 +00:00
Mel Henning	cd06366ca2	nir/phi_builder: Adjust valid_metadata assert so we can add more metadata to nir_metadata_control_flow. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36184>	2025-09-08 23:03:13 +00:00
Mel Henning	ee8d448241	nir: Don't require nir_metadata_control_flow We're about to add to nir_metadata_control_flow, and we don't want passes to require the new metadata. Via coccinelle: @@ expression e1; @@ - nir_metadata_require(e1, nir_metadata_control_flow) + nir_metadata_require(e1, nir_metadata_block_index | nir_metadata_dominance) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36184>	2025-09-08 23:03:13 +00:00
Daniel Schürmann	a53190a426	nir/load_store_vectorize: hoist base addr instead of subtracting Totals from 3130 (3.92% of 79839) affected shaders: (Navi48) Instrs: 2634316 -> 2633652 (-0.03%); split: -0.06%, +0.04% CodeSize: 13999784 -> 13996888 (-0.02%); split: -0.05%, +0.03% SpillSGPRs: 1771 -> 1778 (+0.40%) Latency: 27233464 -> 27230934 (-0.01%); split: -0.02%, +0.01% InvThroughput: 4234587 -> 4234550 (-0.00%); split: -0.00%, +0.00% VClause: 54684 -> 54689 (+0.01%) SClause: 62743 -> 62912 (+0.27%); split: -0.08%, +0.35% Copies: 162594 -> 163729 (+0.70%); split: -0.22%, +0.91% PreSGPRs: 146909 -> 146914 (+0.00%); split: -0.01%, +0.01% VALU: 1558771 -> 1558778 (+0.00%) SALU: 337715 -> 338168 (+0.13%); split: -0.30%, +0.44% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37163>	2025-09-08 09:56:04 +00:00
Rhys Perry	cfba417316	nir/load_store_vectorize: optimize accesses with u2u64(ishl.nuw(iadd)) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37163>	2025-09-08 09:56:04 +00:00
Rhys Perry	4bc4322150	nir/load_store_vectorize: call nir_def_num_lsb_zero less Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37163>	2025-09-08 09:56:03 +00:00
Rhys Perry	491b7e851f	nir/load_store_vectorize: refactor entry key creation Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37163>	2025-09-08 09:56:03 +00:00
Rhys Perry	8888c2471b	nir/load_store_vectorize: refactor offset parsing Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37163>	2025-09-08 09:56:03 +00:00
Daniel Schürmann	acb47d2c78	nir/load_store_vectorize: also parse offsets through u2u64 if additions don't wrap around Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37163>	2025-09-08 09:56:03 +00:00
Timothy Arceri	8b1d48cf0b	nir: move nir_lower_point_size_mov() to st Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37037>	2025-09-07 23:13:24 +00:00
Timothy Arceri	450419c3f4	nir: move nir_lower_alpha_test() to the st This is gl specific and a following fix will add more gl specific params so here we move it to the st to avoid filling nir.h with more junk. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37037>	2025-09-07 23:13:23 +00:00
Timothy Arceri	8417f4a8eb	nir: move nir_lower_drawpixels() to the state tracker This is gl specific and a following fix will add more gl specific params so here we move it to the st to avoid filling nir.h with more junk. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37037>	2025-09-07 23:13:22 +00:00
Daniel Schürmann	c78f1d516c	nir/algebraic: add pattern for (a << #b) * #c => a * (#c << #b) Totals from 2545 (3.19% of 79839) affected shaders: (Navi48) Instrs: 6371003 -> 6364130 (-0.11%); split: -0.12%, +0.01% CodeSize: 33827548 -> 33812244 (-0.05%); split: -0.06%, +0.01% Latency: 47451755 -> 47430108 (-0.05%); split: -0.05%, +0.00% InvThroughput: 10442450 -> 10437159 (-0.05%); split: -0.05%, +0.00% SClause: 159829 -> 159874 (+0.03%); split: -0.01%, +0.04% Copies: 500725 -> 500721 (-0.00%); split: -0.01%, +0.01% PreSGPRs: 110482 -> 110478 (-0.00%); split: -0.00%, +0.00% PreVGPRs: 147289 -> 147287 (-0.00%); split: -0.00%, +0.00% VALU: 3456135 -> 3454241 (-0.05%); split: -0.06%, +0.01% SALU: 925982 -> 923616 (-0.26%) VOPD: 1243 -> 1212 (-2.49%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37173>	2025-09-06 10:18:42 +00:00
Christoph Pillmayer	f81f3c85e2	nir/opt_algebraic: Convert a + b + a to b + 2a This allows fusing into one FMA later. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37113>	2025-09-05 11:39:51 +00:00
Lionel Landwerlin	afea98593e	nir: add a new intrinsic for load dynamic tessellation config Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34872>	2025-09-05 07:46:15 +00:00
Rob Clark	d5a8233598	nir/lower-amul: Comment fix Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37063>	2025-09-04 15:21:38 +00:00
Rob Clark	55d77749ed	nir/lower-amul: Fix crash with unused SSBO Since https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12175 we should be able to rely on driver_location for both UBOs and SSBOs. Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37063>	2025-09-04 15:21:38 +00:00
Georg Lehmann	796f0847a6	nir/lower_subgroups: recursively lower ballot scans This should be better for backends that have le/lt mask intrinsics. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:04:00 +00:00
Georg Lehmann	2725eaf9a2	nir/lower_subgroups: change filter to intrinsic callback Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:04:00 +00:00
Georg Lehmann	d14897b2f7	nir/lower_subgroups: don't use get_max_subgroup_size for lowering boolean rotates The lowering won't work with an unknown subgroup size, and we correctly assert that at the top of the function. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:03:59 +00:00
Georg Lehmann	f8633511be	nir: make ballot find_lsb/msb/bit_count 32bit only The lowering is 32bit only too. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:03:58 +00:00
Georg Lehmann	b8db8f877d	nir: make ballot_bitfield_extract 1bit only Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:03:57 +00:00
Georg Lehmann	83326af899	nir/builder: add nir_inverse_ballot_imm Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:03:56 +00:00
Georg Lehmann	ef8c364d3d	nir: make inverse_ballot 1bit only Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:03:56 +00:00
Simon Perretta	880098158d	nir/nir_lower_calls_to_builtins: trivially handle IA64 mangled functions Using __attribute__((overloadable)) when declaring nir ops with variable-width params in clc results in their symbol names being (IA64) mangled; this change enables the mangled names to be handled when later lowering the calls. Signed-off-by: Simon Perretta <simon.perretta@imgtec.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36873>	2025-09-02 16:04:19 +00:00
Robert Mader	1772380307	nir: Fixup 10/12 bit SW decoder YCbCr formats The highest possible values that can be represented with 16/12/10 bits are 65535/4095/1023, not 65536/4096/1024. In order to ensure 1023 maps to 65535 in the Sx10 case we thus need to multiply by 65535 / 1023 ~= 64.06158 instead of 64. Fixes: `a166d7609f` ("gles: Add support for 10/12/16 bit SW decoder YCbCr formats") Suggested-by: Benjamin Otte <otte@redhat.com> Signed-off-by: Robert Mader <robert.mader@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37077>	2025-09-02 09:08:51 +00:00
Job Noorman	e78bd88a06	nir/opt_offsets: add callback to set need_nuw per intrinsic Whether need_nuw is used is currently decided in two different ways: - globally through the allow_offset_wrap option; - per intrinsic but hard-coded in opt_offsets. Make this more flexible by creating a callback that is called per intrinsic. This will allow backends to decide, on a per-intrinsic basis, whether need_nuw is needed. Note that the main use case for ir3 is to add support for opt_offsets for global memory accesses. Other intrinsics don't need need_nuw but global memory accesses do. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37114>	2025-09-01 11:25:07 +00:00
Job Noorman	bc03086320	nir/opt_offsets: rename max_offset_data to cb_data We want to add more callbacks and pass the same data. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37114>	2025-09-01 11:25:07 +00:00
Rhys Perry	2d0f93631c	nir/divergence: make smem load_global_amd uniform Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37101>	2025-08-30 14:55:13 -04:00
Marek Olšák	25294f3dd4	nir/opt_move_to_top: handle load_global_amd with ACCESS_SMEM_AMD to match the behavior of load_smem_amd Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37101>	2025-08-30 14:55:13 -04:00
Marek Olšák	48050dbef6	nir/opt_sink: handle load_global_amd Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37101>	2025-08-30 14:55:13 -04:00
Marek Olšák	219fcd4b32	nir/opt_call: handle load_global(_amd) with SPECULATE as rematerializable Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37101>	2025-08-30 14:55:13 -04:00
Faith Ekstrand	26e32417b9	nir: Add an option to make lower_phis_to_regs_block() less clever Right now it tries to place reg_write instructions as far up the predecessor chain as possible. This is useful for a bunch of the passes that call it since it ensures they don't get placed in dead blocks or in single successors and things like that. But it screws up NAK's control flow lowering so we need the option to turn it off and make the pass place the reg_write instructions in the most obvious place possible. Fixes: `b013d54e4f` ("nak/lower_cf: Flag phis as convergent when possible") Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36914>	2025-08-29 01:24:56 +00:00
Dave Airlie	c38170452d	nir: add nir_intrinsic_cmat_load_shared_nv This maps to NAK's OpLdsm Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36363>	2025-08-28 16:09:07 +02:00
Georg Lehmann	3b06824e4c	nir/opt_algebraic: optimize some post peephole select patterns Foz-DB GFX1201: Totals from 208 (0.26% of 80287) affected shaders: Instrs: 427684 -> 426834 (-0.20%); split: -0.22%, +0.02% CodeSize: 2232616 -> 2228816 (-0.17%); split: -0.20%, +0.03% Latency: 3993934 -> 3992726 (-0.03%); split: -0.04%, +0.01% InvThroughput: 569055 -> 568622 (-0.08%); split: -0.09%, +0.01% SClause: 12932 -> 12927 (-0.04%) Copies: 22567 -> 22604 (+0.16%); split: -0.47%, +0.63% Branches: 7671 -> 7658 (-0.17%) VALU: 222047 -> 221625 (-0.19%) SALU: 83954 -> 83815 (-0.17%); split: -0.29%, +0.13% Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36938>	2025-08-27 09:45:19 +00:00
Georg Lehmann	395893e16b	nir/peephole_select: allows more lowered io Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36938>	2025-08-27 09:45:19 +00:00
Georg Lehmann	e270a7480b	nir/lower_io: fix boolean output stores Stores don't have a definition, we have to check the bit size of the source. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13762 Fixes: `c217ee8d35` ("nir: Insert b2b1s around booleans in nir_lower_to") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36966>	2025-08-27 08:46:34 +00:00
Georg Lehmann	047b95a8c3	nir/shrink_vec_array_vars: detect zero init shared memory using constant initializer More consistent. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36956>	2025-08-27 06:37:41 +00:00
Georg Lehmann	edc5bea61e	nir/shrink_vec_array_vars: update constant initializer after shrinking Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13751 Fixes: `c7df3b4f64` ("nir/shrink_vec_array_vars: allow nir_var_mem_shared") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36956>	2025-08-27 06:37:41 +00:00
Georg Lehmann	d0f4b535fe	nir: constant fold txd with 0 ddx/ddy to txl Foz-DB GFX1201: Totals from 34 (0.04% of 80287) affected shaders: Instrs: 3111158 -> 3111076 (-0.00%) CodeSize: 16345020 -> 16344908 (-0.00%); split: -0.00%, +0.00% Latency: 15378053 -> 15378063 (+0.00%); split: -0.00%, +0.00% InvThroughput: 2940485 -> 2940477 (-0.00%); split: -0.00%, +0.00% VClause: 79940 -> 79941 (+0.00%) Copies: 228205 -> 228159 (-0.02%) VALU: 1730040 -> 1729994 (-0.00%) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36967>	2025-08-26 06:19:43 +00:00
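
The scale-factor arithmetic described in commit 1772380307 (10/12 bit SW decoder YCbCr formats) can be checked with a small sketch. This is only an illustration of the numbers quoted in that commit message, not the Mesa code: the function names `expand_to_16bit` and `expand_by_shift` are hypothetical, and the real lowering presumably operates on shader values rather than Python integers.

```python
def expand_by_shift(value: int, bits: int) -> int:
    """Old-style scaling: multiply by a power of two (64 for 10-bit input)."""
    return value << (16 - bits)


def expand_to_16bit(value: int, bits: int) -> int:
    """Rescale so the largest `bits`-wide value maps to 65535.

    Equivalent to multiplying by 65535 / (2**bits - 1), which is about
    64.06158 for 10-bit input, then rounding to the nearest integer.
    """
    max_in = (1 << bits) - 1    # 1023 for 10 bits, 4095 for 12 bits
    max_out = (1 << 16) - 1     # 65535
    # Integer arithmetic with round-to-nearest avoids floating-point error.
    return (value * max_out + max_in // 2) // max_in


if __name__ == "__main__":
    # Multiplying by 64 leaves full-scale 10-bit input short of 65535.
    print(expand_by_shift(1023, 10))   # 65472
    # The corrected factor reaches the 16-bit maximum exactly.
    print(expand_to_16bit(1023, 10))   # 65535
```

With a factor of exactly 64, a full-scale 10-bit sample lands at 65472, so the brightest input would never reach the top of the 16-bit range; the corrected factor closes that gap.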


6607 commits