fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 00:18:09 +02:00

Author	SHA1	Message	Date
Georg Lehmann	17e597093d	radv: eliminate unused FS output channels For formats that don't have all color channels, there is no reason to output all of them. Games often write to R only or RGB formats with non trivial remaining channels. Foz-DB Navi21: Totals from 10270 (10.55% of 97347) affected shaders: MaxWaves: 249166 -> 250950 (+0.72%); split: +0.73%, -0.01% Instrs: 8442016 -> 8354715 (-1.03%); split: -1.05%, +0.01% CodeSize: 45939644 -> 45487156 (-0.98%); split: -1.01%, +0.02% VGPRs: 472584 -> 463784 (-1.86%); split: -1.98%, +0.12% SpillSGPRs: 1502 -> 1448 (-3.60%) LDS: 6024192 -> 6011904 (-0.20%) Inputs: 42463 -> 41773 (-1.62%) Outputs: 24601 -> 23955 (-2.63%) Latency: 78011745 -> 77653907 (-0.46%); split: -0.56%, +0.10% InvThroughput: 19767826 -> 19274046 (-2.50%); split: -2.53%, +0.03% VClause: 177891 -> 176681 (-0.68%); split: -0.80%, +0.12% SClause: 236784 -> 235324 (-0.62%); split: -0.72%, +0.10% Copies: 621048 -> 616096 (-0.80%); split: -1.03%, +0.23% Branches: 202608 -> 201811 (-0.39%); split: -0.44%, +0.05% PreSGPRs: 441032 -> 437698 (-0.76%); split: -0.77%, +0.01% PreVGPRs: 378067 -> 369564 (-2.25%); split: -2.26%, +0.01% VALU: 5906415 -> 5833179 (-1.24%); split: -1.25%, +0.01% SALU: 973428 -> 968088 (-0.55%); split: -0.61%, +0.06% VMEM: 298277 -> 296504 (-0.59%); split: -0.61%, +0.01% SMEM: 402244 -> 399612 (-0.65%); split: -0.71%, +0.06% Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38853>	2025-12-12 17:00:51 +00:00
Georg Lehmann	a1fbf91ff2	radv/nir: fix radv_nir_remap_color_attachment progress And switch to SPDX header. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38853>	2025-12-12 17:00:51 +00:00
Georg Lehmann	0fe8250bf4	radv: optimize known front_face_fsign too Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Foz-DB Navi21: Totals from 1941 (1.99% of 97581) affected shaders: MaxWaves: 44196 -> 44612 (+0.94%); split: +0.97%, -0.03% Instrs: 1553182 -> 1548823 (-0.28%); split: -0.36%, +0.08% CodeSize: 8261308 -> 8261496 (+0.00%); split: -0.17%, +0.18% VGPRs: 98488 -> 97968 (-0.53%); split: -0.56%, +0.03% SpillSGPRs: 1288 -> 1347 (+4.58%) Latency: 19136399 -> 19094748 (-0.22%); split: -0.38%, +0.16% InvThroughput: 5424693 -> 5409469 (-0.28%); split: -0.32%, +0.04% VClause: 29941 -> 29943 (+0.01%); split: -0.26%, +0.27% SClause: 39922 -> 39972 (+0.13%); split: -1.02%, +1.14% Copies: 109736 -> 109684 (-0.05%); split: -1.45%, +1.40% Branches: 24523 -> 24499 (-0.10%); split: -0.12%, +0.02% PreSGPRs: 99206 -> 99191 (-0.02%); split: -0.02%, +0.00% PreVGPRs: 79019 -> 78240 (-0.99%); split: -1.00%, +0.02% VALU: 1145088 -> 1140731 (-0.38%); split: -0.44%, +0.06% SALU: 164035 -> 164077 (+0.03%); split: -0.48%, +0.51% SMEM: 80668 -> 80658 (-0.01%) We used to call this pass before front_face_fsign is created but that has changed. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38906>	2025-12-12 08:24:38 +00:00
Konstantin Seurer	85e8f815e0	radv/nir: Use fmt_idx correctly Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38462>	2025-12-11 16:26:01 +00:00
Marek Olšák	308da55f1a	radv,radeonsi: use FRAG_RESULT_DUAL_SRC_BLEND this is slightly nicer Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38604>	2025-12-10 19:16:46 +00:00
Natalie Vock	f692ac76ca	radv/rt: Use traversal vars for object origin/direction in ahit/isec Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We already did the work of transforming the ray data, no need to do it multiple times. Should theoretically be a lot better. However, none of the fossils appear to use object-space ray data in anyhit/intersection shaders. :( Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38809>	2025-12-08 18:29:49 +00:00
Natalie Vock	a488203e85	radv/rt: Refactor and split radv_nir_rt_shader.c This splits up radv_nir_rt_shader.c into several parts. The first part is all ray traversal lowering for RT pipelines, located at radv_nir_rt_traversal_shader.c. It implements building the traversal loop, including inlined any-hit/intersection shaders (optionally as a completely separate shader). The second part is lowering for individual RT stages (right now, monolithic vs. CPS-style separate compilation). Each lowering technique lives in its own file (radv_nir_rt_stage_{monolithic,cps}.c). Code shared between RT lowering techniques (shader inlining helpers and storage lowering passes) gets moved into radv_nir_rt_stage_common.c. One header, radv_nir_rt_stage.h, is the public interface for RT pipeline stage lowering. Functions exposed to users (really just radv_pipeline_rt.c) go there. The header for internal shared helpers is radv_nir_rt_stage_common.c. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38809>	2025-12-08 18:29:49 +00:00
Marek Olšák	fa0bea5ff8	nir: remove nir_io_add_const_offset_to_base Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details nir_opt_constant_folding does it now. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Dave Airlie	3eef0c0245	radv: add support for cooperative matrix per element operations. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36992>	2025-11-22 13:16:20 +10:00
Anna Maniscalco	3e01031f10	radv: consistently use the value in bytes for esgs_itemsize Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Previosuly this value was in bytes for vs/tes and in dwords for gs. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38514>	2025-11-20 16:45:37 +00:00
Marek Olšák	9e339f4b32	nir: rename nir_lower_indirect_derefs -> nir_lower_indirect_derefs_to_if_else_trees This describes better what it does. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38471>	2025-11-20 05:42:11 +00:00
Dave Airlie	ad25196d35	radv: add support for cooperative matrix reductions. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This add support to the lowering the reduction operations. Thanks to Georg Lehmann for a lot of the ideas and optimising in this. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38389>	2025-11-17 23:33:59 +00:00
Marek Olšák	e372365cf4	nir: rename nir_copy_prop -> nir_opt_copy_prop Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38411>	2025-11-15 02:16:38 +00:00
Konstantin Seurer	de32f9275f	treewide: add & use parent instr helpers We add a bunch of new helpers to avoid the need to touch >parent_instr, including the full set of: * nir_def_is_* * nir_def_as__or_null nir_def_as_* [assumes the right instr type] * nir_src_is_* * nir_src_as_* * nir_scalar_is_* * nir_scalar_as_* Plus nir_def_instr() where there's no more suitable helper. Also an existing helper is renamed to unify all the names, while we're churning the tree: * nir_src_as_alu_instr -> nir_src_as_alu ..and then we port the tree to use the helpers as much as possible, using nir_def_instr() where that does not work. Acked-by: Marek Olšák <maraeo@gmail.com> --- To eliminate nir_def::parent_instr we need to churn the tree anyway, so I'm taking this opportunity to clean up a lot of NIR patterns. Co-authored-by: Konstantin Seurer <konstantin.seurer@gmail.com> Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38313>	2025-11-12 21:22:13 +00:00
Samuel Pitoiset	0dba538643	radv/meta: fuse depth/stencil aspects copy with the GFX path Depth/stencil copies on graphics are twice as fast now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38139>	2025-11-12 07:35:33 +00:00
Samuel Pitoiset	9d3dd174b8	radv/meta: rework radv_meta_nir_texel_fetch_build_func This add a binding parameter that will be used for fused depth/stencil copies. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38139>	2025-11-12 07:35:33 +00:00
Faith Ekstrand	35cdddf632	nir: Simplify assign_io_var_locations() The size and stage parameters are left-overs from history. Originally, the function acted on a list and so it needed an explicit stage and size output. Now that it takes a NIR shader and a mode, we can just take the stage from the shader and set num_(in\|out)puts. The one caller that actually used the explicit output parameter was turnip. However, given that the helper sorts and re-numbers all the I/O variables, it's not like changing num_(in\|out)puts instead of writing it to some other location is that big of a deal. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38297>	2025-11-07 16:29:56 +00:00
Daniel Schürmann	e1bcbbf3dd	radv/shader_info: rename gs_ring_info -> legacy_gs_info and use union with ngg_info Reduces the size of radv_shader_info from 784 bytes to 760 bytes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37931>	2025-11-03 13:38:38 +01:00
Marek Olšák	9125e34372	amd: lower get_ssbo_size in ac_nir_lower_resinfo The code for lowering get_ssbo_size will be different in future chips, so do it in common code to reduce duplication in the future. Lower get_ssbo_size to ssbo_descriptor_amd + nir_channel. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38097>	2025-11-02 01:42:07 +00:00
Daniel Schürmann	ef9ecc4058	nir: add nir_imul_nuw() and nir_imul_imm_nuw() helpers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37195>	2025-10-30 19:28:06 +00:00
Marek Olšák	966cb36722	amd: constify struct radeon_surf Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38093>	2025-10-29 12:50:44 +00:00
Rhys Perry	241bbb4977	radv: remove NIR_PASS in insert_rt_case When NIR_DEBUG=serialize or NIR_DEBUG=clone is used, NIR_PASS recreates nir_function_impl and nir_variable objects, causing use-after-free since insert_rt_case() keeps pointers to those in local variables and var_remap. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37573>	2025-10-23 10:44:38 +00:00
Rhys Perry	55018df38d	radv: stop rallocing objects which don't belong to the shader under it This doesn't work with NIR_DEBUG=serialize or NIR_DEBUG=clone. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37573>	2025-10-23 10:44:38 +00:00
Rhys Perry	762a803aa7	radv: remove NIR_PASS in radv_nir_lower_rt_abi When NIR_DEBUG=serialize or NIR_DEBUG=clone is used, NIR_PASS recreates nir_function_impl and nir_variable objects, causing use-after-free since radv_nir_lower_rt_abi() keeps pointers to those in local variables. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37573>	2025-10-23 10:44:38 +00:00
Eric Engestrom	4ab65cdaa4	docs: update/fix vk spec urls Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37993>	2025-10-22 09:23:34 +02:00
Georg Lehmann	654bd74c60	treewide: use nir_store_global alias of nir_build_store_global Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37959>	2025-10-21 12:37:58 +02:00
Georg Lehmann	9e41a7c139	treewide: use nir_load_global alias of nir_build_load_global Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37959>	2025-10-21 12:37:58 +02:00
Daniel Schürmann	9b1a635bb3	amd/common: merge radv_nir_opt_access_speculate() into ac_nir_flag_smem_for_loads() One shader is negatively affected, but we save 2 entire iterations over every shader. This effect is also mitigated with the next commits. Totals from 1 (0.00% of 79839) affected shaders: (Navi48) Instrs: 947 -> 958 (+1.16%) CodeSize: 4728 -> 4732 (+0.08%) Latency: 20678 -> 20723 (+0.22%) InvThroughput: 2697 -> 2698 (+0.04%) SClause: 26 -> 27 (+3.85%) Copies: 139 -> 145 (+4.32%) Branches: 46 -> 47 (+2.17%) VALU: 460 -> 463 (+0.65%) SALU: 201 -> 204 (+1.49%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37843>	2025-10-14 16:33:12 +00:00
Daniel Schürmann	9553e56c67	radv: use load_global instead of load_global_amd for load_sample_positions_amd For consistency. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37843>	2025-10-14 16:33:10 +00:00
Samuel Pitoiset	a47952d495	radv: upload and emit dynamic descriptors separately from push constants Dynamic descriptors are rarely used and this will allow to do more optimizations for push constants, like gathering the size from shaders themselves instead of using the pipeline layout. fossils-db (GFX1201): Totals from 21740 (27.30% of 79646) affected shaders: Instrs: 11186407 -> 11192061 (+0.05%); split: -0.05%, +0.10% CodeSize: 59842068 -> 59864412 (+0.04%); split: -0.04%, +0.08% Latency: 56333136 -> 56325208 (-0.01%); split: -0.03%, +0.02% InvThroughput: 8576452 -> 8576516 (+0.00%); split: -0.00%, +0.00% SClause: 279186 -> 279713 (+0.19%); split: -0.06%, +0.25% Copies: 577854 -> 581735 (+0.67%); split: -0.28%, +0.95% PreSGPRs: 867163 -> 866409 (-0.09%) SALU: 1391187 -> 1395055 (+0.28%); split: -0.12%, +0.39% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37768>	2025-10-14 15:34:43 +00:00
Samuel Pitoiset	bc32286e5b	radv: declare a new user SGPR for dynamic descriptors To move them out of push constants. fossils-db (GFX1201): Totals from 20700 (25.99% of 79646) affected shaders: Instrs: 14375624 -> 14370051 (-0.04%); split: -0.07%, +0.03% CodeSize: 76746128 -> 76723772 (-0.03%); split: -0.05%, +0.02% Latency: 74103586 -> 74113651 (+0.01%); split: -0.01%, +0.02% InvThroughput: 11908817 -> 11908798 (-0.00%); split: -0.00%, +0.00% VClause: 249605 -> 249607 (+0.00%); split: -0.00%, +0.00% SClause: 337914 -> 337772 (-0.04%); split: -0.08%, +0.04% Copies: 843585 -> 839233 (-0.52%); split: -0.62%, +0.10% PreSGPRs: 836283 -> 837260 (+0.12%) SALU: 1790713 -> 1786374 (-0.24%); split: -0.29%, +0.05% Co-authored-by: Konstantin Seurer <konstantin.seurer@gmail.com> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37768>	2025-10-14 15:34:43 +00:00
Samuel Pitoiset	876e6a3bfe	radv/rt: fix memory leak in lower_rt_instructions_monolithic() Found with ASAN. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37844>	2025-10-14 06:54:02 +00:00
Samuel Pitoiset	08dbab0600	radv: rename shader arg descriptor_sets to descriptors It's more generic and descriptor heaps will use it too. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37786>	2025-10-10 13:22:03 +00:00
Samuel Pitoiset	609ae4e647	radv: rename indirect_descriptor_sets to indirect_descriptors With descriptor heap the driver will also have to emit indirect descriptor heaps in some cases. Rename couple of things to make them more generic. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37786>	2025-10-10 13:22:03 +00:00
Samuel Pitoiset	08ddf2f878	radv: lower embedded/immutable samplers earlier Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Lowering them earlier right after VTN would allow us to implement embedded samplers for descriptor heap properly for merged shaders. Non-immediate samplers are still lowered in radv_nir_apply_pipeline_layout because they require shader arguments. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37688>	2025-10-07 09:25:28 +00:00
Georg Lehmann	cf30742a66	radv,aco: don't end monolithic ray tracing with unconditional terminate The terminate requires more code and blocks us from deallocating VGPRs early. Foz-DB Navi31: Totals from 63 (0.08% of 80273) affected shaders: Instrs: 3372702 -> 3372467 (-0.01%) CodeSize: 17441676 -> 17440736 (-0.01%) Latency: 19763447 -> 19763288 (-0.00%) InvThroughput: 3860502 -> 3860478 (-0.00%) Branches: 96204 -> 96141 (-0.07%) SALU: 406648 -> 406549 (-0.02%) Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37542>	2025-09-25 15:35:55 +00:00
Rhys Perry	591b498e1f	radv: fix progress reporting in lower_rt_derefs Only create nir_load_rt_arg_scratch_offset_amd if needed. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35069>	2025-09-24 08:20:27 +00:00
Marek Olšák	bbab69d343	radv: fix load_smem alignment radv_cmd_buffer_upload_alloc_aligned is used with alignment=0, which guarantees that the alignment is at least 4. Fixes: `9e16ed7a13` - ac/nir: switch nir_load_smem_amd uses to ac_nir_load_smem wrapper Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37345>	2025-09-19 21:08:25 -04:00
Georg Lehmann	a2d3cbac2a	radv: determine subgroup/wave size early This means we can actually implement varying subgroup size correctly. It also means that we implement the implicit SPIR-V 1.6 full subgroups requirement in compute shaders with cswave32/rtwave32. In the future it will also allow more optimizations that use the subgroup size. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> The only somewhat complex case here is GFX10 geometry shaders, if gewave32 is used. We then only know the subgroup size when is_ngg is decided, as legacy GS doesn't support wave32. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37294>	2025-09-14 13:21:21 +00:00
Georg Lehmann	4143f0725a	radv/nir/lower_cmat: clean up GFX11 ACC->B convert Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37213>	2025-09-09 06:08:55 +00:00
Georg Lehmann	5c0ebcdaef	radv/nir/lower_cmat: clean up gfx12 transpose Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37213>	2025-09-09 06:08:55 +00:00
Georg Lehmann	2da7b4bd0a	radv/nir/lower_cmat: add shuffle_xor_imm helper Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37213>	2025-09-09 06:08:54 +00:00
Christian Gmeiner	1492de1bc3	radv: re-format using clang-format No manual changes here, this is simply running $ ninja -C build/ clang-format Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37226>	2025-09-09 05:48:56 +00:00
Samuel Pitoiset	8e4d5743d2	radv: move debug related drirc to radv_drirc::debug Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37145>	2025-09-05 05:56:17 +00:00
Georg Lehmann	83326af899	nir/builder: add nir_inverse_ballot_imm Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:03:56 +00:00
Georg Lehmann	ef8c364d3d	nir: make inverse_ballot 1bit only Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37178>	2025-09-04 14:03:56 +00:00
Samuel Pitoiset	decf9af472	radv/rt: only use one user SGPR for the traversal shader addr All shaders are allocated in the 32-bit addr space. To avoid an issue with alignment, and also for future work, there is an unused user SGPR. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37133>	2025-09-03 05:53:41 +00:00
Daniel Schürmann	fcf8899c9e	radv/rt: use ACCESS_CAN_REORDER when loading SBT entries Totals from 56 (0.07% of 79839) affected shaders: (Navi48) Instrs: 2790220 -> 2790130 (-0.00%); split: -0.00%, +0.00% CodeSize: 14704952 -> 14704292 (-0.00%) Latency: 13994383 -> 13953444 (-0.29%); split: -0.29%, +0.00% InvThroughput: 2717973 -> 2710748 (-0.27%); split: -0.27%, +0.00% VClause: 68783 -> 68687 (-0.14%) SClause: 51910 -> 52007 (+0.19%) Copies: 223192 -> 223190 (-0.00%); split: -0.01%, +0.01% VALU: 1557513 -> 1557451 (-0.00%); split: -0.00%, +0.00% VMEM: 118789 -> 118692 (-0.08%) SMEM: 66498 -> 66595 (+0.15%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36933>	2025-09-02 19:07:30 +00:00
Samuel Pitoiset	bc9a020dd3	radv: rename NGG culling user SGPRs Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37022>	2025-09-01 08:52:55 +00:00
Marek Olšák	9e16ed7a13	ac/nir: switch nir_load_smem_amd uses to ac_nir_load_smem wrapper Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details ac_nir_load_smem will use load_global_amd Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37101>	2025-08-30 15:04:32 -04:00

1 2 3 4 5 ...

378 commits