mesa/src/amd/common
Marek Olšák 52c41f25de ac/nir/tess: don't allocate LDS for HS inputs that are passed via VGPRs
Right now we don't allocate LDS for HS inputs when all HS inputs are passed
via VGPRs.

This changes it to skip allocating exactly the HS inputs passed via VGPRs
by reducing the inputs_read mask to remove holes.

radeonsi changes to the LDS allocation will be in a different MR.

Reviewed-by: Timur Kristóf <timur.kristof@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30962>
2024-09-05 19:54:29 +00:00
..
ac_binary.c amd: add initial common code for gfx12 2024-05-11 22:14:05 -04:00
ac_binary.h ac,radeonsi,winsyses: switch to SPDX-License-Identifier: MIT 2023-05-24 21:48:19 +00:00
ac_cmdbuf.c ac,radeonsi,radv: add common GFX preambles 2024-08-27 14:14:57 +00:00
ac_cmdbuf.h ac,radeonsi,radv: add common GFX preambles 2024-08-27 14:14:57 +00:00
ac_debug.c amd: add initial common code for gfx12 2024-05-11 22:14:05 -04:00
ac_debug.h ac/debug,radv: Read UMR wave dumps into memory before parsing 2024-04-25 10:20:03 +00:00
ac_descriptors.c amd/common: add MIN_LOD for texture descriptors on GFX12 2024-06-06 10:15:10 +00:00
ac_descriptors.h ac,radv,radeonsi: add a function to build texture descriptors 2024-06-06 10:15:10 +00:00
ac_drm_fourcc.h ac/surface: add GFX12 256B tile mode for video 2024-06-11 12:29:11 -04:00
ac_formats.c ac,radeonsi: add ac_is_reduction_mode_supported() 2024-07-10 07:57:42 +00:00
ac_formats.h ac,radeonsi: add ac_is_reduction_mode_supported() 2024-07-10 07:57:42 +00:00
ac_gather_context_rolls.c ac: Improve context roll readability 2024-03-19 16:08:14 +00:00
ac_gpu_info.c ac: add ac_gfx103_get_cu_mask_ps() 2024-08-27 14:14:57 +00:00
ac_gpu_info.h ac: add ac_gfx103_get_cu_mask_ps() 2024-08-27 14:14:57 +00:00
ac_hw_stage.h amd: Move ac_hw_stage to its own file 2023-07-03 21:12:45 +00:00
ac_ib_parser.c ac/parse_ib: Replace the parameter list with ac_ib_parser 2024-03-19 16:08:13 +00:00
ac_msgpack.c ac/msgpack: make fixstrs a const char 2023-08-22 11:33:10 +00:00
ac_msgpack.h ac/msgpack: make fixstrs a const char 2023-08-22 11:33:10 +00:00
ac_nir.c ac/nir: Shorten the name of ac_nir_calc_io_offset_mapped. 2024-08-08 16:55:02 +00:00
ac_nir.h ac/nir/tess: don't allocate LDS for HS inputs that are passed via VGPRs 2024-09-05 19:54:29 +00:00
ac_nir_cull.c ac/nir: Move some helpers to new file. 2024-05-02 12:04:53 +02:00
ac_nir_helpers.h ac/nir: Shorten the name of ac_nir_calc_io_offset_mapped. 2024-08-08 16:55:02 +00:00
ac_nir_lower_esgs_io_to_mem.c ac/nir: Shorten the name of ac_nir_calc_io_offset_mapped. 2024-08-08 16:55:02 +00:00
ac_nir_lower_global_access.c treewide: use nir_metadata_control_flow 2024-06-17 16:28:14 -04:00
ac_nir_lower_image_opcodes_cdna.c ac/nir/cdna: don't use image_descriptor intrinsics if the src is a descriptor 2024-06-25 10:09:08 +00:00
ac_nir_lower_ngg.c ac/nir: remove sleeps from gfx12 streamout code 2024-07-13 01:32:48 +00:00
ac_nir_lower_ps.c treewide: use nir_def_replace sometimes 2024-06-21 15:36:56 +00:00
ac_nir_lower_resinfo.c treewide: use nir_metadata_control_flow 2024-06-17 16:28:14 -04:00
ac_nir_lower_subdword_loads.c treewide: use nir_metadata_control_flow 2024-06-17 16:28:14 -04:00
ac_nir_lower_taskmesh_io_to_mem.c nir: add ACCESS_CP_GE_COHERENT_AMD 2024-04-30 17:17:25 +00:00
ac_nir_lower_tess_io_to_mem.c ac/nir/tess: don't allocate LDS for HS inputs that are passed via VGPRs 2024-09-05 19:54:29 +00:00
ac_nir_lower_tex.c amd: switch to derivative intrinsics 2024-08-08 15:26:07 +00:00
ac_nir_meta.h ac/nir: import the clear/copy_buffer compute shader from radeonsi 2024-08-10 02:14:44 +00:00
ac_nir_meta_cs_blit.c ac/nir: adjust gfx11 tuning for the compute blit 2024-08-10 02:14:44 +00:00
ac_nir_meta_cs_clear_copy_buffer.c ac/nir: adjust performance-related decisions for clear/copy_buffer shader 2024-08-10 02:14:44 +00:00
ac_nir_meta_ps_resolve.c ac/nir: import the MSAA resolving pixel shader from radeonsi 2024-06-08 05:48:11 +00:00
ac_nir_opt_outputs.c treewide: use nir_metadata_control_flow 2024-06-17 16:28:14 -04:00
ac_parse_ib.c ac/parse_ib: dump PKT3_DISPATCH_{TASKMESH_GFX,TASKMESH_DIRECT_ACE} 2024-06-25 09:20:48 +00:00
ac_perfcounter.c ac/perfcounter: remove a bogus assert to fix an assertion failure on gfx11 2023-09-30 22:23:04 +00:00
ac_perfcounter.h ac/perfcounter: compute the number of global instances of TCP,SQ,GL1C and GL2C 2023-09-14 14:17:19 +00:00
ac_pm4.c amd: use a valid size for ac_pm4_state allocation 2024-07-22 10:09:34 +00:00
ac_pm4.h ac,radeonsi import PM4 state from RadeonSI 2024-06-06 20:26:47 +00:00
ac_rgp.c ac/rgp: assume GFX11_5 use the same SQTT/RGP versions as GFX11 2024-07-17 16:25:19 +00:00
ac_rgp.h ac/rgp: update dumping queue event records to the capture 2023-11-13 08:53:09 +00:00
ac_rgp_elf_object_pack.c amd: Use align64 instead of ALIGN for 64 bit value parameter 2024-01-03 22:02:17 +00:00
ac_rtld.c ac/llvm: implement WA in nir to llvm 2024-06-20 13:14:33 +00:00
ac_rtld.h ac/llvm: implement WA in nir to llvm 2024-06-20 13:14:33 +00:00
ac_shader_args.c nir: change "user_data_amd" sysval from 4 to 8 components 2024-04-13 16:45:08 +00:00
ac_shader_args.h radv/rt: Track ray_launch_size reads 2024-05-28 12:23:45 +00:00
ac_shader_util.c ac: do not lower some ops in nir_lower_packing 2024-08-30 05:46:51 +00:00
ac_shader_util.h ac: stop using radeon_info for ac_get_hw_cache_flags 2024-06-07 13:22:43 +00:00
ac_shadowed_regs.c amd: add initial common code for gfx12 2024-05-11 22:14:05 -04:00
ac_shadowed_regs.h amd: add a new helper that prints all non-shadowed regs 2023-06-17 23:42:21 +00:00
ac_spm.c amd/common: add missing stuff for gfx11.5 2023-10-24 01:07:12 +00:00
ac_spm.h ac/spm: add support for GFX11 2023-09-25 09:05:58 +02:00
ac_sqtt.c ac/sqtt: make VA helpers static 2024-06-14 10:32:17 +02:00
ac_sqtt.h ac/sqtt: make VA helpers static 2024-06-14 10:32:17 +02:00
ac_surface.c amd: update addrlib 2024-08-16 21:44:32 +00:00
ac_surface.h ac: add gfx12 DCC shared code 2024-06-06 01:01:46 +00:00
ac_surface_meta_address_test.c ac/surface/tests: add gfx12 tests 2024-05-11 22:14:06 -04:00
ac_surface_modifier_test.c ac/surface/tests: add the test for ADDR3_256B_2D 2024-06-11 12:29:11 -04:00
ac_surface_test_common.h ac/surface/tests: add gfx12 tests 2024-05-11 22:14:06 -04:00
ac_uvd_dec.h ac,radeonsi,winsyses: switch to SPDX-License-Identifier: MIT 2023-05-24 21:48:19 +00:00
ac_vcn.h radv/video: add event support for VCN4 2024-08-26 22:19:09 +00:00
ac_vcn_av1_default.h ac/radeonsi: add av1 defaults header file from radeonsi 2023-06-16 05:53:44 +00:00
ac_vcn_dec.c ac/radv/radeon: move film grain init to common code. 2024-06-19 20:51:53 +00:00
ac_vcn_dec.h ac/radv/radeon: move film grain init to common code. 2024-06-19 20:51:53 +00:00
ac_vcn_enc.h radeonsi/vcn: Rename transform_skip_disabled and remove hardcoded value for VCN5 2024-08-30 01:17:22 +00:00
ac_vcn_enc_av1_default_cdf.h ac,radeonsi: move vcn enc av1 default cdf file to common 2023-09-14 07:51:24 +00:00
amd_family.c amd: add GFX v11.5.2 support 2024-07-02 12:05:23 +00:00
amd_family.h amd: add GFX v11.5.2 support 2024-07-02 12:05:23 +00:00
amd_kernel_code_t.h amd/common: add AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 property 2023-08-31 20:30:03 +00:00
gfx10_format_table.h amd/common: only pass gfx_level to ac_get_gfx10_format_table() 2024-05-22 08:31:39 +00:00
gfx10_format_table.py ac,radeonsi,winsyses: switch to SPDX-License-Identifier: MIT 2023-05-24 21:48:19 +00:00
meson.build ac/nir: import the clear/copy_buffer compute shader from radeonsi 2024-08-10 02:14:44 +00:00
sid.h amd: add gfx12 register definitions 2024-05-11 22:14:05 -04:00
sid_tables.py ac,radeonsi,winsyses: switch to SPDX-License-Identifier: MIT 2023-05-24 21:48:19 +00:00