fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 02:28:07 +02:00

Author	SHA1	Message	Date
Arcady Goldmints-Orlov	68bb5d9e49	kk: enable shaderClipDistance Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Since Metal doesn't pass clip distance into the fragment shader, we have to do it ourselves. The CLIP_DIST0/1 varying slots are used to represent the user-defined varyings we use to pass them from vertex to fragment and a new intrinsic is added to represent the write to the built-in clip_distance variable. Since the CLIP_DIST0/1 varying slots are not affected by opt_varyings, there can be potential interface mismatches so the machinery in msl_iomap.c is refactored to allow them to be output as a series of scalars rather than vectors. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38839>	2025-12-08 23:09:53 -05:00
Connor Abbott	ad84ae2719	tu: Implement VK_QCOM_subpass_shader_resolve Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38451>	2025-12-08 20:44:46 +00:00
Connor Abbott	bd821b9a17	nir, tu: Add and use load_frag_coord_gmem_ir3 We used load_frag_coord_unscaled_ir3 for loading the fragment coord for input attachments in GMEM, where the normal scaling for gl_FragCoord shouldn't be used. However with custom resolve a different scaling will apply to attachments in GMEM. Separate "unscaled" from "gmem" and rename the NIR options, in preparation for this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38451>	2025-12-08 20:44:45 +00:00
Yiwei Zhang	2de8981351	nir: suppress clang warnings for cooperative matrix lowering This suppresses below compile warnings: - warning: variable 'idx' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38835>	2025-12-08 19:36:05 +00:00
Georg Lehmann	7f6bd8b003	nir/peephole_select: allow mbcnt_amd Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It's just alu, so handle it like alu. Foz-DB Navi21: Totals from 3 (0.00% of 97591) affected shaders: Instrs: 433 -> 426 (-1.62%) CodeSize: 2408 -> 2388 (-0.83%) Latency: 7520 -> 7925 (+5.39%) InvThroughput: 857 -> 1009 (+17.74%) Copies: 55 -> 43 (-21.82%) Branches: 21 -> 17 (-19.05%) SALU: 79 -> 76 (-3.80%) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38828>	2025-12-08 10:52:48 +00:00
Georg Lehmann	005cc4110c	nir/peephole_select: allow ballot We can allow collapsing control flow around ballot if we update the ballot condition like we do for discards. ballot_relaxed needs no condition update, as the result bits are undefined for inactive invocations. Foz-DB Navi21: Totals from 27 (0.03% of 97591) affected shaders: Instrs: 2554506 -> 2554469 (-0.00%); split: -0.00%, +0.00% CodeSize: 13765636 -> 13765684 (+0.00%); split: -0.00%, +0.00% Latency: 14186667 -> 14186861 (+0.00%); split: -0.00%, +0.00% InvThroughput: 3542516 -> 3542595 (+0.00%); split: -0.00%, +0.00% SClause: 52038 -> 52030 (-0.02%) Copies: 209410 -> 208763 (-0.31%) Branches: 83716 -> 83399 (-0.38%) PreSGPRs: 2372 -> 2386 (+0.59%); split: -0.17%, +0.76% VALU: 1701458 -> 1701482 (+0.00%) SALU: 369884 -> 370107 (+0.06%); split: -0.00%, +0.07% SMEM: 67643 -> 67634 (-0.01%) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38828>	2025-12-08 10:52:48 +00:00
Georg Lehmann	077b654cc7	nir: don't sink alu that uses ballot(true) Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Don't sink alu that uses ballot(true), as that can a local system value and moving the alu then requires a new mov in the old location. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38829>	2025-12-08 09:07:54 +00:00
Marek Olšák	a051d4ee6b	nir/lower_io_vars: don't insert output stores for unrelated streams before emits Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Before every emit_vertex(stream_id = n), we would insert stores for all outputs, including outputs that are not meant for that stream. Those stores would end up having no effect while potentially reducing performance. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38100>	2025-12-06 02:27:46 +00:00
Arcady Goldmints-Orlov	0df8aa940c	nir: Use nir_shader_intrinsics_pass in nir_lower_io_to_scalar Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38816>	2025-12-05 22:30:22 +00:00
Emma Anholt	66b157095c	nir/shader_bisect: Allow passing in a --lo / --hi to continue a run. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Sometimes you fumble an answer, and would like to not restart from the beginning (or just want to see the behavior of the script late in the run if you're debugging it). Pass in the last bad range, and you can keep going. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38760>	2025-12-04 22:47:25 +00:00
Emma Anholt	4287bb761e	nir/shader_bisect: Fix C code printing after review feedback changes. When I added in the printed-shader and env var value both being tracked in shaders[], it broke the C printing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38760>	2025-12-04 22:47:25 +00:00
Karol Herbst	a255e2ca56	nir: add ACCESS to shared_uniform_block_intel intel_nir_blockify_uniform_loads simply overwrites the intrinsic for load_shared, which leads to messed up indicies, e.g: "base=0, access=volatile, align_mul=4, align_offset=0 became: "base=0, align_mul=4, align_offset=4" Fixes: `0dd09a292b` ("nir: add ACCESS_ATOMIC") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38801>	2025-12-04 10:01:52 +00:00
Connor Abbott	d5498240ac	spirv: Remove view_index_is_input Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The last user was removed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38793>	2025-12-03 22:52:29 +00:00
Marek Olšák	e14f8ee0e4	nir/has_divergent_loop: require divergence metadata, check all function impls Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details instead of forcing callers to call nir_divergence_analysis Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38597>	2025-12-03 20:14:18 +00:00
Ian Romanick	92e609f4fe	glsl: Move flrp lowering out of the loop Other lower_flrp Intel platforms had similar shader-db changes. Lunar Lake total instructions in shared programs: 17131619 -> 17131182 (<.01%) instructions in affected programs: 59924 -> 59487 (-0.73%) helped: 255 / HURT: 9 total loops in shared programs: 5336 -> 5334 (-0.04%) loops in affected programs: 4 -> 2 (-50.00%) helped: 2 / HURT: 0 total cycles in shared programs: 888274988 -> 888269628 (<.01%) cycles in affected programs: 1753370 -> 1748010 (-0.31%) helped: 182 / HURT: 94 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12526>	2025-12-02 21:28:05 +00:00
Ian Romanick	4bbc29373a	nir/lower_flrp: Check and set shader_info::flrp_lowered No shader-db or fossil-db changes on any Intel platform. v2: Return early if lowering_mask is zero. If the first call to nir_lower_flrp has a lowering_mask of zero, later calls with non-zero masks would not do any lowering. lp_bld_nir.c has this issue. Suggested-by: Alyssa Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12526>	2025-12-02 21:28:05 +00:00
Qiang Yu	2f6a034528	glsl: support barrier() for task and mesh shader Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It was ignored when translating glsl to nir. Fixes: `d52452a486` ("glsl: allow barrier builtin functions for mesh shader") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38692>	2025-12-01 02:33:00 +00:00
Marek Olšák	9294448fe1	nir/recompute_io_bases: report progress only if anything was changed Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details also preserve all metadata because it doesn't add/remove any instructions Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38599>	2025-11-29 05:00:40 +00:00
Marek Olšák	e6499fa73e	nir/recompute_io_bases: move color input bases after all other inputs This is related to the FS prolog. It should have no effect on other drivers. v2: make it optional via io_options Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38599>	2025-11-29 05:00:40 +00:00
Marek Olšák	18a338066b	nir/recompute_io_bases: don't use safe iterators the pass doesn't remove anything Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38599>	2025-11-29 05:00:40 +00:00
Faith Ekstrand	4711e5954e	nir: Always use sysvals in lower_input_attachments() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The last holdouts of the var options are gone so we can just emit the system values. This is overall simpler as it confines all the sysval to var logic to nir_lower_sysvals_to_varyings(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38562>	2025-11-29 00:50:34 +00:00
Faith Ekstrand	82280a7e86	nir: Support sysval intrinsics in lower_sysvals_to_varyings() Since this is a downgrade path for drivers, it's useful to support both forms of these common sysvals. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38562>	2025-11-29 00:50:32 +00:00
Faith Ekstrand	0c36c39103	spirv: Emit SYSTEM_VALUE_LAYER_ID for fragment shaders We have nir_lower_sysvals_to_varyings() so we can just have that lower it for the drivers who don't want a sysval. Most have to support the sysval version anyway for various lowering so making them all have to support both is pretty annoying. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38562>	2025-11-29 00:50:32 +00:00
Faith Ekstrand	701a9c269e	nir: Add LAYER_ID and VIEW_INDEX to nir_lower_sysvals_to_varyings() Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38562>	2025-11-29 00:50:31 +00:00
Marek Olšák	fa0bea5ff8	nir: remove nir_io_add_const_offset_to_base Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details nir_opt_constant_folding does it now. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Marek Olšák	726bbb352e	nir/opt_constant_folding: add nir_io_add_const_offset_to_base behavior We almost always call both passes next to each other. The code is copied from nir_io_add_const_offset_to_base. No changes. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Marek Olšák	9a56672f56	nir: add shader_info::disable_input/output_offset_src_constant_folding and set it where needed to prevent nir_opt_constant_folding from breaking those drivers. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Marek Olšák	7330bca9db	nir: handle load_fs_input_interp_deltas in nir_is_input_load for nir_opt_constant_folding Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:37 +00:00
Marek Olšák	ffcbbeb54a	nir/validate: don't require offset src to be 0 if constant nir_opt_constant_folding does the folding, so this can be non-zero before that. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:36 +00:00
Georg Lehmann	653716b745	nir/opt_algebraic: create more bit test Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Helps hackends with has_bit_test more (i.e. ACO), but it shouldn't hurt others either. Foz-DB Navi21: Totals from 1138 (1.17% of 97591) affected shaders: Instrs: 5478747 -> 5476055 (-0.05%); split: -0.05%, +0.00% CodeSize: 29850188 -> 29853140 (+0.01%); split: -0.04%, +0.05% SpillSGPRs: 1406 -> 1401 (-0.36%) Latency: 42324245 -> 42325921 (+0.00%); split: -0.01%, +0.01% InvThroughput: 11396940 -> 11394048 (-0.03%); split: -0.04%, +0.01% VClause: 142294 -> 142309 (+0.01%); split: -0.00%, +0.01% SClause: 124412 -> 124411 (-0.00%); split: -0.00%, +0.00% Copies: 572696 -> 572749 (+0.01%); split: -0.02%, +0.03% Branches: 199932 -> 199929 (-0.00%) PreSGPRs: 73372 -> 74970 (+2.18%) PreVGPRs: 79514 -> 79511 (-0.00%) VALU: 3628764 -> 3625744 (-0.08%); split: -0.08%, +0.00% SALU: 818258 -> 818475 (+0.03%); split: -0.03%, +0.06% Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38700>	2025-11-28 13:25:24 +00:00
Aleksi Sapon	cef4102548	nir, vk: fix MSVC unused variable warning Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38663>	2025-11-28 01:52:12 +00:00
Tapani Pälli	95938823f4	compiler/glsl: validate input blocks with opaque/booleans Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Commit adds a check for booleans/opaque types inside interfaces, there is existing check for "regular varyings". Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14338 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38613>	2025-11-27 17:40:15 +00:00
Marek Olšák	eea5959a22	nir/lower_io_passes: call nir_opt_undef to eliminate undef output stores If we do it here, we won't have to call nir_recompute_io_bases later again. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38598>	2025-11-26 16:23:49 -05:00
Karol Herbst	626c6b35f0	nak: add Movm Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37998>	2025-11-26 14:09:37 +00:00
Karol Herbst	c4f07f3d79	nir: mark cmat_load_shared_nv as CAN_ELIMINATE It's just a special load shared and has no side effects. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37998>	2025-11-26 14:09:35 +00:00
Alyssa Rosenzweig	1574a71438	nir/lower_wrmasks: clean up & deprecate pass Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The usual pass modernization with the twist that I don't want new drivers actually using it (-: Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38533>	2025-11-26 03:20:39 +00:00
Alyssa Rosenzweig	2c2dd835af	nir/lower_wrmasks: drop callback All drivers use the same callback and it is unlikely that new drivers will use this pass since it has better replacements today (lower_mem_bit_sizes for memory, and it never worked for I/O). This should discourage as much. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38533>	2025-11-26 03:20:39 +00:00
Alyssa Rosenzweig	5515160b55	nir/lower_wrmasks: drop support for I/O nir_lower_wrmasks as-is is broken for semantic I/O, since semantic I/O is slot based and nir_lower_wrmasks is purely byte-based. No drivers use it as such, and no drivers should. Remove the support so people don't think it works. This came up in !38482. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38533>	2025-11-26 03:20:39 +00:00
Aitor Camacho	bdaff0b457	kk: Handle memory coherency for textures and buffers Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details M1 chips are more restrictive than M2 and above. We need to enforce memory coherency when needed through "coherent" for buffer memory and "memory_coherence_device" for textures. Without these the memory operations are not visible to other threads. Reviewed-by: Arcady Goldmints-Orlov <arcady@lunarg.com> Signed-off-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38595>	2025-11-26 02:26:21 +00:00
Faith Ekstrand	fcb107accb	poly: Fetch the index size from a sysval On asahi, we can still specialize based on the shader key and get everything folded. But this gives drivers the option to make it dynamic if they wish. Co-authored-by: Mary Guillemard <mary.guillemard@collabora.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:23 +00:00
Faith Ekstrand	05aaa7df65	nir: Improve comments for a couple poly intrinsics Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:22 +00:00
Faith Ekstrand	05723bfa35	poly,asahi: Fetch directly from poly_vertex_state::output_buffer in GS We have access to the poly_vertex_state from the GS so we might as well use it. Asahi uses a single poly_vertex_state for VS and TCS and just assumes the tessellator stalls before we update it for TCS. If a driver wants to use two separate poly_vertex_state buffers, it will be the driver's responsibility to make the system values return the right one. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:19 +00:00
Faith Ekstrand	89fbb9cf84	poly,asahi: Move vertex_output_buffer to poly_vertex_param Instead of having the vertex output buffer be a system value and something the driver needs to manage, put it in poly_vertex_param. We already need to have it somewhere GPU-writable so we can write it from indirect setup kernels. Instead of manually allocating 8B all over the place just to hold this one pointer, stick it in poly_vertex_param. This also lets us get rid of a NIR intrinsic. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:18 +00:00
Faith Ekstrand	f36465d574	poly,asahi: Rename poly_ia_state to poly_vertex_params We're about to put more than just input assembly data in there so the name will make a lot more sense. Also, add a comment to make it more clear that this buffer applys to both VS and TES. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38404>	2025-11-25 23:20:16 +00:00
Kenneth Graunke	96d331766a	brw: Generalize read_attribute_payload_intel to handle more cases We were using this for indirect loads of the shader input thread payload, but there's no reason we can't use it for constant access too. In this case we can just MOV from the ATTR file directly without a special opcode that turns into MOV_INDIRECT later. We also allow it to load multiple components now. This is useful for say, returning vec4 pushed inputs. And, we allow it in more stages than just the fragment stage. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:59 +00:00
Kenneth Graunke	792762617a	brw: Rename read_attribute_payload_intel to load_attribute_payload_intel We're going to change the intrinsic to a load(...) which puts "load" in the name. Also, it's just more consistent with our usual terminology. We also rename the corresponding backend opcode so they remain matched. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:58 +00:00
Kenneth Graunke	f1ab64ad74	nir: add new intrinsics to load/store from URB on intel We add several new intrinsics for accessing URB handles: - load_urb_output_handle_intel - load_urb_input_handle_intel - load_urb_input_handle_intel_indexed The latter is used by stages like TCS and GS where each input control point has a unique handle. The index is which ICP to read from. The others are for most stages, where all inputs or outputs are accessed via a single handle. Then we have URB load and store operations, split for Xe2+ (URB via LSC) and earlier (HDC OWord messages): - load_urb_vec4_intel - load_urb_lsc_intel - store_urb_vec4_intel - store_urb_lsc_intel The legacy vec4 variants take a handle and a 128-bit OWord offset as sources. Additionally, stores take a set of channel enables to mask off and avoid writing vec4 components. We don't use the WRITE_MASK const-index as our channel enables are not required to be constant. The Xe2+ LSC variants are simpler. Handles are byte offsets into the URB memory region, and offsets are expressed in bytes. So we simply add them into a single "address" source. We don't support writemasks here, as they aren't really necessary with the better addressability. (Plus, the store_cmask operations work significantly differently than the previous HDC OWord messages). We will lower disjoint writemasks to multiple stores. Based on earlier code by Lionel Landwerlin. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:54 +00:00
Marek Olšák	5ee9a76058	nir: fix a typo in NIR_PASS_ASSERT_NO_PROGRESS for non-debug builds Fixes: `4e834b4321` - nir: add NIR_PASS_ASSERT_NO_PROGRESS Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38608>	2025-11-25 21:38:50 +00:00
Lionel Landwerlin	4b9aa9dc91	nir/lower_printf: fix missing singleton add If we're using the singleton, we need to add to it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38638>	2025-11-25 14:18:42 +00:00
Lionel Landwerlin	d24633023f	nir/lower_printf: fix array alignment The pointer arithmetic doesn't need a 4byte alignment, otherwise everything is broken. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38638>	2025-11-25 14:18:42 +00:00

1 2 3 4 5 ...

11405 commits