fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 15:18:09 +02:00

Author	SHA1	Message	Date
Marek Olšák	4f2e2e10bc	ac/nir: vectorize streamout stores for legacy pipeline optimally Walk the whole vertex stride thanks to XFB info sorted by offset, gather individual components from same or different outputs, and once we have gathered 4, store them as vec4. It also removes the COHERENT flag from VMEM stores because NGG streamout doesn't use it either and I don't think it's needed. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32686>	2025-01-09 20:47:16 +00:00
Marek Olšák	e399f3bed9	ac/nir: sort xfb info to facilitate vectorization of xfb stores xfb stores are not vectorized properly, leading to generating random soup of b32, b64, b96, and b128 stores. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32686>	2025-01-09 20:47:16 +00:00
Samuel Pitoiset	f09f31d093	ac/nir: fix a comment typo in load_subgroup_id_lowered() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32940>	2025-01-09 08:02:19 +00:00
Samuel Pitoiset	44ba856089	ac/nir: fix lowering subgroup ID for compute shaders on GFX12 This is lowered in backend compilers (LLVM or ACO) because it needs to access ttmp registers which aren't exposed to NIR. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32940>	2025-01-09 08:02:19 +00:00
Marek Olšák	7fbca998b1	amd: optimize atomics before lowering intrinsics ac_nir_lower_intrinsics_to_args will lower most system values. I have to keep the divergence analysis in ACO, otherwise it goes haywire. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:56 +00:00
Marek Olšák	5dd9171765	ac/nir: set upper ranges for range analysis while lowering system values Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	0d5b03f2b9	ac/nir: split local_invocation_ids to 3 separate VGPR inputs so that we can set the upper range per VGPR. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	65d241c947	ac/nir: set arg_upper_bound_u32 for vs_rel_patch_id Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	1d9fbe5387	ac/nir: add helper ac_nir_load_arg_upper_bound Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	cfeaa45dc6	ac/nir: clean up ac_nir_lower_indirect_derefs IO variables can't occur here anymore. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	ae22da2ff8	ac/nir: lower more loads in ac_nir_lower_intrinsics_to_args instead of drivers Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	ceb6f8fc32	amd: lower load_tess_rel_patch_id/primitive_id/tess_coord and overwrite.. in NIR The overwrite instruction complicates it a little, which is why these intrinsics are lowered together. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	61bfb4fa06	amd: lower load_subgroup_invocation in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	e69f47faee	amd: lower load_local_invocation_index in NIR This is the last intrinsic that needed the LS VGPR bug workaround in ACO and ac_nir_to_llvm. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	433ca6ba38	ac/nir: extract a load_subgroup_id lowered helper this will be used in the next commit Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	342dcbdc8b	amd: lower load_vertex_id/instance_id and overwrite_vs_arguments in NIR 2 things complicate this: - overwrite_vs_arguments_amd - the LS VGPR bug workaround Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	66dd70adc5	amd: lower load_gs_wave_id_amd in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	923f59c971	amd: lower load_barycentric_at_offset in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	16ab05fad1	amd: lower load_barycentric_pixel/centroid/sample in NIR radeonsi needs to preserve interp_mode in the arg load. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	7e83f6ca8b	amd: lower load_front_face in NIR radeonsi must do this after si_lower_nir_abi, which optimizes front_face, but doesn't lower it. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	6ad5225b2a	amd: lower load_frag_shading_rate in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	6d2e29ff6e	amd: lower load_sample_pos in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	110e474b4f	amd: lower load_sample_id in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	684c8da553	amd: lower load_invocation_id in NIR ACO can't look for it because it's lowered there. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	d281240c57	amd: lower load_first_vertex/base_instance/draw_id/view_index in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	0d372b043b	amd: lower load_local_invocation_id in NIR This is based on ACO. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	13cb5c7b72	amd: lower load_frag_coord in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	58cb155068	amd: lower load_pixel_coord in NIR Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Marek Olšák	85c3b5159a	ac/nir: handle disabled PS VGPRs in ac_nir_load_arg_at_offset Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32782>	2025-01-02 17:36:55 +00:00
Timur Kristóf	de2cb4a7d3	ac/nir: Only store params to attribute ring that are varying. On GFX11+, varying outputs from the last pre-rasterization stage are implemented by storing the outputs to the so-called attribute ring. Make sure to only store them when necessary. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32640>	2024-12-28 10:31:41 -06:00
Timur Kristóf	13234a8a8a	ac/nir: Only export parameters when they are actually varying. In AMD terminology, varying outputs are implemented by parameter export instructions on GFX6-10.3 GPUs. Only emit those when actually necessary. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32640>	2024-12-28 10:31:38 -06:00
Timur Kristóf	4d6c00944b	ac/nir: Only export positions when they are really system values. In AMD terminology, a system value is implemented by position export instructions. Make sure to only emit those when they are needed. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32640>	2024-12-28 10:31:36 -06:00
Timur Kristóf	f5981e8c0b	ac/nir: Split GS output usage masks to varying and sysval masks. To keep track which output is used for what purpose. Note that this commit just adds the capability to track this separately in ac/nir. The drivers will need to be updated in the future to take advantage of this. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32640>	2024-12-28 10:31:33 -06:00
Timur Kristóf	92464109e3	ac/nir: Mark when pre-rast output is used as varying or sysval. In this commit, just collect the info. It will be taken into use by subsequent commits. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32640>	2024-12-28 10:31:29 -06:00
Timur Kristóf	f38680aa1c	ac/nir: Introduce ac_nir_store_parameters_to_attr_ring. This function is going to be used for storing parameter outputs to the attribute ring, instead of the current implementation. It is going to be taken into use in the following commits. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32640>	2024-12-28 10:31:06 -06:00
Timur Kristóf	c4b45f1ec8	ac/nir: Pass ac_nir_prerast_out to ac_nir_export_position. In a subsequent commit, ac_nir_export_position will start using other fields from ac_nir_prerast_out. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32640>	2024-12-28 10:31:04 -06:00
Timur Kristóf	3d291a98c4	ac/nir: Pass ac_nir_prerast_out to ac_nir_export_parameters. In a subsequent commit, ac_nir_export_parameters will start using other fields from ac_nir_prerast_out. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32640>	2024-12-28 10:31:01 -06:00
Qiang Yu	5f601361ed	ac/nir: lower access for shared and scratch memory OpenCL may load and store vec16 data, while ACO only support <=32byte. Radeonsi is going to use ac_nir_lower_mem_access_bit_sizes() for lowering these access. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32781>	2024-12-27 01:58:38 +00:00
Rhys Perry	033e76a82a	ac/nir: have ac_nir_lower_mem_access_bit_sizes preserve >128 bit SMEM Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32408>	2024-12-09 16:56:29 +00:00
Marek Olšák	d8468d5463	amd,zink: remove options.varying_estimate_instr_cost callbacks They are a maintainenance burden since they would need changes to support more instruction types that nir_opt_varyings will be able to move between shaders, and they are almost identical to default_varying_estimate_instr_cost, so just use that. The cost threshold is adjusted for AMD because default_varying_estimate_instr_cost is slightly different. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32424>	2024-12-04 13:40:41 +00:00
Rhys Perry	7d4cc04156	radv,ac/nir: split global access using nir_lower_mem_access_bit_sizes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Rhys Perry	8fdc5d7f9f	radv,ac/nir: lower sub-dword loads using nir_lower_mem_access_bit_sizes fossil-db (navi21): Totals from 427 (0.54% of 79395) affected shaders: Instrs: `2939637` -> 2937224 (-0.08%); split: -0.08%, +0.00% CodeSize: 15982272 -> 15969880 (-0.08%); split: -0.08%, +0.00% Latency: 21128645 -> 21125738 (-0.01%); split: -0.04%, +0.03% InvThroughput: 5626811 -> 5626220 (-0.01%); split: -0.03%, +0.02% SClause: 65771 -> 65731 (-0.06%); split: -0.07%, +0.00% Copies: 243247 -> 242917 (-0.14%); split: -0.14%, +0.01% Branches: 100089 -> 100085 (-0.00%) PreSGPRs: 17879 -> 18118 (+1.34%) VALU: 1899641 -> 1899278 (-0.02%) SALU: 468508 -> 466469 (-0.44%) SMEM: 84305 -> 84291 (-0.02%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Rhys Perry	d3ae1842a2	aco,ac/nir: flag loads to use smem in NIR This pass will be re-used later. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Timur Kristóf	79df320463	ac/nir: Move varying cost functions from radeonsi to common code. This code will be shared between RADV and RadeonSI. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28676>	2024-09-06 09:16:21 +00:00
Timur Kristóf	f317311bad	ac/nir: Shorten the name of ac_nir_calc_io_offset_mapped. The other variant of this function doesn't exist anymore, so there is no ambiguity. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29812>	2024-08-08 16:55:02 +00:00
Timur Kristóf	d43466e917	ac/nir: Remove ac_nir_calc_io_offset function. This function is not used anymore, because none of the callers rely on driver locations (intrinsic base) anymore. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29812>	2024-08-08 16:55:02 +00:00
Timur Kristóf	b162c7962f	ac/nir: Add helper for I/O location mapping. Map I/O locations based on a prefix sum (for linked shaders), or based on the provided callback. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29812>	2024-08-08 16:55:02 +00:00
Zan Dobersek	7fd5f76393	nir/lower_vars_to_scratch: calculate threshold-limited variable size separately ir3's lowering of variables to scratch memory has to treat 8-bit values as 16-bit ones when comparing such value's size against the given threshold since those values are handled through 16-bit half-registers. But those values can still use natural 8-bit size and alignment for storing inside scratch memory. nir_lower_vars_to_scratch now accepts two size-and-alignment functions, one used for calculating the variable size and the other for calculating the size and alignment needed for storing inside scratch memory. Non-ir3 uses of this pass can just duplicate the currently-used function. ir3 provides a separate variable-size function that special-cases 8-bit types. Signed-off-by: Zan Dobersek <zdobersek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29875>	2024-08-07 14:32:28 +00:00
Alyssa Rosenzweig	da752ed7c1	treewide: use nir_def_replace sometimes Two Coccinelle patches here. Didn't catch nearly as much as I would've liked but it's a start. Coccinelle patch: @@ expression intr, repl; @@ -nir_def_rewrite_uses(&intr->def, repl); -nir_instr_remove(&intr->instr); +nir_def_replace(&intr->def, repl); Coccinelle patch: @@ identifier intr; expression instr, repl; @@ nir_intrinsic_instr *intr = nir_instr_as_intrinsic(instr); ... -nir_def_rewrite_uses(&intr->def, repl); -nir_instr_remove(instr); +nir_def_replace(&intr->def, repl); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Juan A. Suarez Romero <jasuarez@igalia.com> [broadcom] Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> [lima] Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> [etna] Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com> [r300] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29817>	2024-06-21 15:36:56 +00:00
Alyssa Rosenzweig	15257b65c6	treewide: use nir_metadata_control_flow Via Coccinelle patch: @@ @@ -nir_metadata_block_index \| nir_metadata_dominance +nir_metadata_control_flow ...plus some manual fixups for call sites missed by coccinelle. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Juan A. Suarez Romero <jasuarez@igalia.com> [broadcom] Acked-by: Vasily Khoruzhick <anarsoul@gmail.com> [lima] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29745>	2024-06-17 16:28:14 -04:00

1 2 3

122 commits