fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 08:28:16 +02:00

Author	SHA1	Message	Date
Marek Olšák	fa5e07d5f7	ac/nir/tess: write TCS patch outputs to memory as vec4 stores at the end This moves per-patch output VMEM stores to the end of the shader where they execute only once. They are skipped if the whole workgroup discards all patches. If tcs_vertices_out == 1, per-patch output VMEM stores use the same lanes as per-vertex output VMEM stores, which are aligned to 4 or 8 lanes to get cached bandwidth for the stores. Previously, per-patch outputs were stored to memory for every store_output intrinsic in TCS. Additionally, LDS is no longer allocated for per-patch outputs that are only written and read by invocation 0, or they are written by all invocations but not read, and don't have indirect indexing. This reduces LDS usage and LDS traffic. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	9d9cfd89da	ac/nir/tess: compute the number of remapped VRAM outputs in common code This unifies it for both drivers. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	ea70060826	ac/nir/tess: stop using tes_inputs_read / tes_patch_inputs read for TCS & TES use ac_nir_tess_io_info instead Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	42445e271e	radv,radeonsi: use ac_nir_tess_io_info for LDS size computation Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	c678844ccb	ac/nir/tess: move LDS and VMEM output masks into a new info structure This will replace LDS and VMEM output size computations in drivers. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	9c16228359	ac/nir/tess: write TCS per-vertex outputs to memory as vec4 stores at the end This improves write throughput for TCS outputs. It follows the same idea as attribute stores in hw GS. The improvement is easily measurable with a microbenchmark. It also has the advantage that multiple output stores to the same address don't result in multiple memory stores. Each output components gets only one memory store at the end of the shader. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34780>	2025-06-07 16:29:39 +00:00
Marek Olšák	be8977811b	ac/nir: remove shader_info parameter from ac_nir_compute_tess_wg_info Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544>	2025-04-19 22:55:00 -04:00
Marek Olšák	0f97dc707d	ac/nir/cull: rename skip_viewport_culling -> skip_viewport_state_culling Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34016>	2025-04-07 19:44:22 +00:00
Marek Olšák	ce716d009f	ac/nir/cull: cull small prims using a point-triangle intersection test Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This is based on Timur Kristof's code, but there are a lot of differences. The idea is that it doesn't just compute an intersection between a point and a triangle. It computes the distance between a point and a triangle and it does so in screen space. It accurately takes the subpixel precision of the rasterizer into account, so that it works optimally at all resolutions, all MSAA modes, and all quant modes. The distance computation is only approximated because it only considers the infinite lines going through triangle edges. However, it seems to be more than sufficient in practice because the existing rounding-based small prim culling compensates for it. The performance improvement is up to 10% in some geometry-bound tests, though targeted microbenchmarks can show a lot more than that. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33361>	2025-04-01 16:12:22 +00:00
Georg Lehmann	09ff1c28d8	ac/nir/lower_ps_late: consider dcc decompression for null exports Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33835>	2025-03-07 15:00:37 +00:00
Marek Olšák	d2141e6751	ac/nir/ngg: add an option to skip viewport-based culling We can do W and face culling when we have multiple viewports, but not frustum and small prim culling because those are dependent on the viewport. When a shader writes the viewport index, the new option allows skipping viewport-based culling while keeping W and face culling. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33482>	2025-03-06 21:10:48 +00:00
Timur Kristóf	b8797180e9	ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_mesh. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	cd01e17e81	ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_gs. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	25adf353cc	ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_nogs. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	fad58a99e8	ac/nir: Add bool return value to ac_nir_lower_legacy_gs. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	d8ad068968	ac/nir: Add bool return value to ac_nir_lower_legacy_vs. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	407aedeff8	ac/nir: Add bool return value to ac_nir_lower_mesh_inputs_to_mem. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	9e7609b0ff	ac/nir: Add bool return value to ac_nir_lower_task_outputs_to_mem. And fixup its NIR counterparts too. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	65645f6841	ac/nir: Add bool return value to ac_nir_lower_gs_inputs_to_mem. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	c593110f5f	ac/nir: Add bool return value to ac_nir_lower_es_outputs_to_mem. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	6e9ede61c4	ac/nir: Add bool return value to ac_nir_lower_tes_inputs_to_mem. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	6e78aef0e9	ac/nir: Add bool return value to ac_nir_lower_hs_outputs_to_mem. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	bb3f33014d	ac/nir: Add bool return value to ac_nir_lower_hs_inputs_to_mem. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	0438cc0afb	ac/nir: Add bool return value to ac_nir_lower_ls_outputs_to_mem. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33609>	2025-02-22 08:54:17 +01:00
Timur Kristóf	305944def9	ac/nir: Don't include nir.h in headers anymore. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33439>	2025-02-12 22:33:07 +01:00
Rhys Perry	f034aa9cd3	radv: don't use bit_sizes_int to skip nir_lower_bit_size Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29242>	2025-02-07 13:52:57 +00:00
Timur Kristóf	f7305f776e	ac/nir/ngg: Pass radeon_info to mesh shader lowering. Same idea as the VS/TES and GS lowering: Make shader compilation decisions based on the features of the current GPU instead of ad-hoc deciding according to GFX level. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:46 +00:00
Timur Kristóf	b8204c8df9	ac/nir/ngg: Remove gfx_level and family from NGG lowering options. They can be read from radeon_info. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:46 +00:00
Timur Kristóf	e76361d626	ac/nir/ngg: Add radeon_info to NGG lowering options. The intention is to have all the HW features affecting shader compilation in one place, instead of ad-hoc decisions in the code based on the GFX level and chip class. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33218>	2025-01-30 15:26:45 +00:00
Marek Olšák	71e95b373b	radeonsi: remove si_shader_info code that is no longer needed A lot of this info is now derived from shader variant NIR. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32910>	2025-01-29 07:19:56 +00:00
Marek Olšák	f7e3689fe1	ac/nir: lower sample_pos to load_sample_positions_amd when frag_coord is center Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	eddd063420	ac/nir: cosmetic stuff for ac_nir_lower_ps Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	247c0593eb	ac/nir: eliminate sample_mask_in without MSAA in ac_nir_lower_ps_early Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	e57b52ff6c	ac/nir: optimize frag_coord <-> pixel_coord in ac_nir_lower_ps_early Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	33134f9503	ac/nir: optimize barycentric_at_sample(sample_id) in ac_lower_ps_early Replace it with barycentric_sample. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:26 -05:00
Marek Olšák	43f6b2655e	ac/nir: simplify force_*_center_interp options in ac_nir_lower_ps_early This only indicates whether MSAA is disabled. Having a separate option for each sysval is better for the PS prolog, but not for monolithic compilation. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	3bccfa72cc	ac/nir: simplify force_*_sample_interp options in ac_nir_lower_ps_early The only thing we need here is whether sample shading is enabled and how many samples. Having a separate option for each sysval is better for the PS prolog, but not for monolithic compilation. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	38d3fc7a6a	ac/nir: return progress from ac_nir_lower_ps_late This changes the creation of barycentric coordinate variables to on-demand. There is also some reordering of export code to return progress. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	231009be65	ac/nir: return progress from ac_nir_lower_ps_early This changes the creation of barycentric coordinate variables to on-demand. Everything else is ready to return progress. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	234b416ffb	ac/nir: lower fbfetch_output in ac_nir_lower_ps_early so that we can gather shader_info after this and all system values that this adds will be gathered. shader_info won't be gathered after si_nir_lower_abi, which is why we have to lower fbfetch_output here. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	1570387aaa	ac/nir: lower barycentric_at_offset/sample in ac_nir_lower_ps_early i.e. before future linking optimizations and shader_info gathering They are lowered together because one depends on the other. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Marek Olšák	580304350b	ac/nir: optimize front_face in ac_nir_lower_ps_early i.e. before future linking optimizations and shader_info gathering Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33024>	2025-01-25 12:20:25 -05:00
Timur Kristóf	50035f0316	ac/nir: Move all ac_nir_* files to a new folder. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32966>	2025-01-14 13:46:30 +01:00

43 commits