fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-20 14:20:36 +02:00

Author	SHA1	Message	Date
Konstantin Seurer	034f58c7e3	nir: Ignore ray query ranges that don't start with rq_initialize Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Handles is a rare edge case where the ray query is used "before" there is a rq_initialize. cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38850>	2025-12-11 15:56:29 +00:00
Konstantin Seurer	5e03d09eb5	nir: Fix typo in nir_opt_ray_query_ranges Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38850>	2025-12-11 15:56:28 +00:00
Emma Anholt	1a2d0d3f31	nir: Optimistically unroll loops using induction var as a sample id. On the assumption that nobody will use a sample id greater than the sample count, have loop unrolling guess based on the driver's max sample count. This unrolls a simple resolve shader with a uniform max samples on ir3 to: value = vec4(0); if (max_samples > 0) { value += txf_ms(coord, 0); if (max_samples > 1 { value += txf_ms(coord, 1); if (max_samples > 2){ value += txf_ms(coord, 2); if (max_samples > 3) { value += txf_ms(coord, 3); for (i = 4; i < max_samples; i++) value += txf_ms(coord, i); } } } } ... This is only worth a 1% win on our microbenchmark as-is, but if we could flatten those ifs out and pull the fadds out to the end, avoiding syncs per load would be a big win. This seems like a first step. I've taken a shot at updating drivers to set the value, and tried to leave notes in places that drivers might update, and want to follow up with updating the compiler option. This affects over half the DX11 apps in shader-db-private. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38585>	2025-12-11 14:26:11 +00:00
Emma Anholt	10ba7675c8	nir/uub: Use an optional max_samples from drivers for sample counts. This triggers some unrolling in Fallout 4, GTAV, and Rocky Planet in my shader-db. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38585>	2025-12-11 14:26:11 +00:00
Emma Anholt	dc30e1a128	nir/loop_analyze: Use nir_unsigned_upper_bound for loop trip limits. This triggers some unrolling in Monster Hunter World, Total War: Warhammer, and Planet Zoo. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38585>	2025-12-11 14:26:10 +00:00
Mel Henning	2fab8fc297	nir: Use instr_clone in rematerialize_deref_in_block Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The previous implementation seems to predate nir_instr_clone() and duplicates a lot of the deref cloning code. This also makes the pass preserve deref->arr.in_bounds correctly. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38856>	2025-12-10 22:07:45 +00:00
Mel Henning	dc44c0f32b	treewide: Use nir_deref_instr_is_arr() Via coccinelle and some manual fixups. @@ expression e1; @@ - e1->deref_type == nir_deref_type_array \|\| e1->deref_type == nir_deref_type_ptr_as_array + nir_deref_instr_is_arr(e1) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38856>	2025-12-10 22:07:45 +00:00
Mel Henning	263a82f49b	nir: Add nir_deref_instr_is_arr() helper Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38856>	2025-12-10 22:07:44 +00:00
Marek Olšák	9a2f1be814	nir: add FRAG_RESULT_DUAL_SRC_BLEND and an option to use it This is potentially nicer for some drivers. AMD drivers will use it. mesa_frag_result_get_color_index will be used often. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38604>	2025-12-10 19:16:46 +00:00
Georg Lehmann	621465e417	nir/opt_uniform_subgroup: handle more trivial shuffles/votes Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38867>	2025-12-10 13:32:08 +00:00
Georg Lehmann	e648e551c1	nir/opt_uniform_subgroup: wire up mbcnt_amd path Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38867>	2025-12-10 13:32:08 +00:00
Georg Lehmann	5778436e99	nir/opt_uniform_subgroup: use nir_shader_intrinsics_pass Nothing here needs the recursion of the full lower_instructions pass. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38867>	2025-12-10 13:32:08 +00:00
Georg Lehmann	5f28bb72a7	nir/divergence_analysis: fix swizzle_amd without fetch inactive Fixes: `ad5be40303` ("nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38867>	2025-12-10 13:32:08 +00:00
Georg Lehmann	1fc38d8539	nir/opt_uniform_subgroup: fix swizzle_amd without fetch_inactive Fixes: `ad5be40303` ("nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38867>	2025-12-10 13:32:08 +00:00
Georg Lehmann	e11d7f06d0	nir/opt_uniform_subgroup: don't try to optimize non trivial clustered reduce Fixes: `535caaf3e0` ("nir: Optimize uniform iadd, fadd, and ixor reduction operations") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38867>	2025-12-10 13:32:08 +00:00
Marek Olšák	0c400fbed9	nir: give nir_lower_clip_cull_distance_array_vars a better name also rename the file Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38465>	2025-12-10 05:16:34 +00:00
Marek Olšák	74995eb64d	nir: split gathering array sizes from nir_lower_clip_cull_distance_array_vars nir_lower_clip_cull_distance_array_vars was sneakily updating shader_info::clip/cull_distance_array_size. This moves the gathering into a new function nir_gather_clip_cull_distance_sizes_from_vars. v2: remove assertions that prevented nir_lower_clip_cull_distance_array_vars from being used with non-compact arrays Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38465>	2025-12-10 05:16:34 +00:00
Marek Olšák	bdcb7bc674	nir/gather_info: clear clip/cull_distance_array_size if the IO is not present Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38465>	2025-12-10 05:16:33 +00:00
Alyssa Rosenzweig	5ced623fdf	nir: print nir_tex_instr::backend_flags if present I was wondering where this was disappearing to. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38789>	2025-12-09 20:44:15 +00:00
Arcady Goldmints-Orlov	68bb5d9e49	kk: enable shaderClipDistance Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Since Metal doesn't pass clip distance into the fragment shader, we have to do it ourselves. The CLIP_DIST0/1 varying slots are used to represent the user-defined varyings we use to pass them from vertex to fragment and a new intrinsic is added to represent the write to the built-in clip_distance variable. Since the CLIP_DIST0/1 varying slots are not affected by opt_varyings, there can be potential interface mismatches so the machinery in msl_iomap.c is refactored to allow them to be output as a series of scalars rather than vectors. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38839>	2025-12-08 23:09:53 -05:00
Connor Abbott	ad84ae2719	tu: Implement VK_QCOM_subpass_shader_resolve Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38451>	2025-12-08 20:44:46 +00:00
Connor Abbott	bd821b9a17	nir, tu: Add and use load_frag_coord_gmem_ir3 We used load_frag_coord_unscaled_ir3 for loading the fragment coord for input attachments in GMEM, where the normal scaling for gl_FragCoord shouldn't be used. However with custom resolve a different scaling will apply to attachments in GMEM. Separate "unscaled" from "gmem" and rename the NIR options, in preparation for this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38451>	2025-12-08 20:44:45 +00:00
Yiwei Zhang	2de8981351	nir: suppress clang warnings for cooperative matrix lowering This suppresses below compile warnings: - warning: variable 'idx' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38835>	2025-12-08 19:36:05 +00:00
Georg Lehmann	7f6bd8b003	nir/peephole_select: allow mbcnt_amd Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It's just alu, so handle it like alu. Foz-DB Navi21: Totals from 3 (0.00% of 97591) affected shaders: Instrs: 433 -> 426 (-1.62%) CodeSize: 2408 -> 2388 (-0.83%) Latency: 7520 -> 7925 (+5.39%) InvThroughput: 857 -> 1009 (+17.74%) Copies: 55 -> 43 (-21.82%) Branches: 21 -> 17 (-19.05%) SALU: 79 -> 76 (-3.80%) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38828>	2025-12-08 10:52:48 +00:00
Georg Lehmann	005cc4110c	nir/peephole_select: allow ballot We can allow collapsing control flow around ballot if we update the ballot condition like we do for discards. ballot_relaxed needs no condition update, as the result bits are undefined for inactive invocations. Foz-DB Navi21: Totals from 27 (0.03% of 97591) affected shaders: Instrs: 2554506 -> 2554469 (-0.00%); split: -0.00%, +0.00% CodeSize: 13765636 -> 13765684 (+0.00%); split: -0.00%, +0.00% Latency: 14186667 -> 14186861 (+0.00%); split: -0.00%, +0.00% InvThroughput: 3542516 -> 3542595 (+0.00%); split: -0.00%, +0.00% SClause: 52038 -> 52030 (-0.02%) Copies: 209410 -> 208763 (-0.31%) Branches: 83716 -> 83399 (-0.38%) PreSGPRs: 2372 -> 2386 (+0.59%); split: -0.17%, +0.76% VALU: 1701458 -> 1701482 (+0.00%) SALU: 369884 -> 370107 (+0.06%); split: -0.00%, +0.07% SMEM: 67643 -> 67634 (-0.01%) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38828>	2025-12-08 10:52:48 +00:00
Georg Lehmann	077b654cc7	nir: don't sink alu that uses ballot(true) Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Don't sink alu that uses ballot(true), as that can a local system value and moving the alu then requires a new mov in the old location. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38829>	2025-12-08 09:07:54 +00:00
Marek Olšák	a051d4ee6b	nir/lower_io_vars: don't insert output stores for unrelated streams before emits Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Before every emit_vertex(stream_id = n), we would insert stores for all outputs, including outputs that are not meant for that stream. Those stores would end up having no effect while potentially reducing performance. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38100>	2025-12-06 02:27:46 +00:00
Arcady Goldmints-Orlov	0df8aa940c	nir: Use nir_shader_intrinsics_pass in nir_lower_io_to_scalar Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38816>	2025-12-05 22:30:22 +00:00
Emma Anholt	66b157095c	nir/shader_bisect: Allow passing in a --lo / --hi to continue a run. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Sometimes you fumble an answer, and would like to not restart from the beginning (or just want to see the behavior of the script late in the run if you're debugging it). Pass in the last bad range, and you can keep going. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38760>	2025-12-04 22:47:25 +00:00
Emma Anholt	4287bb761e	nir/shader_bisect: Fix C code printing after review feedback changes. When I added in the printed-shader and env var value both being tracked in shaders[], it broke the C printing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38760>	2025-12-04 22:47:25 +00:00
Karol Herbst	a255e2ca56	nir: add ACCESS to shared_uniform_block_intel intel_nir_blockify_uniform_loads simply overwrites the intrinsic for load_shared, which leads to messed up indicies, e.g: "base=0, access=volatile, align_mul=4, align_offset=0 became: "base=0, align_mul=4, align_offset=4" Fixes: `0dd09a292b` ("nir: add ACCESS_ATOMIC") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38801>	2025-12-04 10:01:52 +00:00
Connor Abbott	d5498240ac	spirv: Remove view_index_is_input Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The last user was removed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38793>	2025-12-03 22:52:29 +00:00
Marek Olšák	e14f8ee0e4	nir/has_divergent_loop: require divergence metadata, check all function impls Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details instead of forcing callers to call nir_divergence_analysis Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38597>	2025-12-03 20:14:18 +00:00
Ian Romanick	92e609f4fe	glsl: Move flrp lowering out of the loop Other lower_flrp Intel platforms had similar shader-db changes. Lunar Lake total instructions in shared programs: 17131619 -> 17131182 (<.01%) instructions in affected programs: 59924 -> 59487 (-0.73%) helped: 255 / HURT: 9 total loops in shared programs: 5336 -> 5334 (-0.04%) loops in affected programs: 4 -> 2 (-50.00%) helped: 2 / HURT: 0 total cycles in shared programs: 888274988 -> 888269628 (<.01%) cycles in affected programs: 1753370 -> 1748010 (-0.31%) helped: 182 / HURT: 94 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12526>	2025-12-02 21:28:05 +00:00
Ian Romanick	4bbc29373a	nir/lower_flrp: Check and set shader_info::flrp_lowered No shader-db or fossil-db changes on any Intel platform. v2: Return early if lowering_mask is zero. If the first call to nir_lower_flrp has a lowering_mask of zero, later calls with non-zero masks would not do any lowering. lp_bld_nir.c has this issue. Suggested-by: Alyssa Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12526>	2025-12-02 21:28:05 +00:00
Qiang Yu	2f6a034528	glsl: support barrier() for task and mesh shader Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It was ignored when translating glsl to nir. Fixes: `d52452a486` ("glsl: allow barrier builtin functions for mesh shader") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38692>	2025-12-01 02:33:00 +00:00
Marek Olšák	9294448fe1	nir/recompute_io_bases: report progress only if anything was changed Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details also preserve all metadata because it doesn't add/remove any instructions Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38599>	2025-11-29 05:00:40 +00:00
Marek Olšák	e6499fa73e	nir/recompute_io_bases: move color input bases after all other inputs This is related to the FS prolog. It should have no effect on other drivers. v2: make it optional via io_options Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38599>	2025-11-29 05:00:40 +00:00
Marek Olšák	18a338066b	nir/recompute_io_bases: don't use safe iterators the pass doesn't remove anything Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38599>	2025-11-29 05:00:40 +00:00
Faith Ekstrand	4711e5954e	nir: Always use sysvals in lower_input_attachments() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The last holdouts of the var options are gone so we can just emit the system values. This is overall simpler as it confines all the sysval to var logic to nir_lower_sysvals_to_varyings(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38562>	2025-11-29 00:50:34 +00:00
Faith Ekstrand	82280a7e86	nir: Support sysval intrinsics in lower_sysvals_to_varyings() Since this is a downgrade path for drivers, it's useful to support both forms of these common sysvals. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38562>	2025-11-29 00:50:32 +00:00
Faith Ekstrand	0c36c39103	spirv: Emit SYSTEM_VALUE_LAYER_ID for fragment shaders We have nir_lower_sysvals_to_varyings() so we can just have that lower it for the drivers who don't want a sysval. Most have to support the sysval version anyway for various lowering so making them all have to support both is pretty annoying. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38562>	2025-11-29 00:50:32 +00:00
Faith Ekstrand	701a9c269e	nir: Add LAYER_ID and VIEW_INDEX to nir_lower_sysvals_to_varyings() Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38562>	2025-11-29 00:50:31 +00:00
Marek Olšák	fa0bea5ff8	nir: remove nir_io_add_const_offset_to_base Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details nir_opt_constant_folding does it now. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Marek Olšák	726bbb352e	nir/opt_constant_folding: add nir_io_add_const_offset_to_base behavior We almost always call both passes next to each other. The code is copied from nir_io_add_const_offset_to_base. No changes. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Marek Olšák	9a56672f56	nir: add shader_info::disable_input/output_offset_src_constant_folding and set it where needed to prevent nir_opt_constant_folding from breaking those drivers. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Marek Olšák	7330bca9db	nir: handle load_fs_input_interp_deltas in nir_is_input_load for nir_opt_constant_folding Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:37 +00:00
Marek Olšák	ffcbbeb54a	nir/validate: don't require offset src to be 0 if constant nir_opt_constant_folding does the folding, so this can be non-zero before that. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:36 +00:00
Georg Lehmann	653716b745	nir/opt_algebraic: create more bit test Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Helps hackends with has_bit_test more (i.e. ACO), but it shouldn't hurt others either. Foz-DB Navi21: Totals from 1138 (1.17% of 97591) affected shaders: Instrs: 5478747 -> 5476055 (-0.05%); split: -0.05%, +0.00% CodeSize: 29850188 -> 29853140 (+0.01%); split: -0.04%, +0.05% SpillSGPRs: 1406 -> 1401 (-0.36%) Latency: 42324245 -> 42325921 (+0.00%); split: -0.01%, +0.01% InvThroughput: 11396940 -> 11394048 (-0.03%); split: -0.04%, +0.01% VClause: 142294 -> 142309 (+0.01%); split: -0.00%, +0.01% SClause: 124412 -> 124411 (-0.00%); split: -0.00%, +0.00% Copies: 572696 -> 572749 (+0.01%); split: -0.02%, +0.03% Branches: 199932 -> 199929 (-0.00%) PreSGPRs: 73372 -> 74970 (+2.18%) PreVGPRs: 79514 -> 79511 (-0.00%) VALU: 3628764 -> 3625744 (-0.08%); split: -0.08%, +0.00% SALU: 818258 -> 818475 (+0.03%); split: -0.03%, +0.06% Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38700>	2025-11-28 13:25:24 +00:00
Aleksi Sapon	cef4102548	nir, vk: fix MSVC unused variable warning Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38663>	2025-11-28 01:52:12 +00:00

1 2 3 4 5 ...

11424 commits