fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-02 10:08:08 +02:00

Author	SHA1	Message	Date
Iago Toral Quiroga	4483cd24af	broadcom/compiler: sink uniform loads total instructions in shared programs: 13014428 -> 13000420 (-0.11%) instructions in affected programs: 743624 -> 729616 (-1.88%) helped: 1392 HURT: 611 total threads in shared programs: 415858 -> 415874 (<.01%) threads in affected programs: 16 -> 32 (100.00%) helped: 8 HURT: 0 total uniforms in shared programs: 3720410 -> 3711652 (-0.24%) uniforms in affected programs: 113442 -> 104684 (-7.72%) helped: 635 HURT: 29 total max-temps in shared programs: 2154268 -> 2144876 (-0.44%) max-temps in affected programs: 61279 -> 51887 (-15.33%) helped: 1124 HURT: 187 total spills in shared programs: 4002 -> 3870 (-3.30%) spills in affected programs: 265 -> 133 (-49.81%) helped: 6 HURT: 0 total fills in shared programs: 5788 -> 5560 (-3.94%) fills in affected programs: 603 -> 375 (-37.81%) helped: 6 HURT: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	e228642cf5	broadcom/compiler: move constants before their first user For us they are basically uniforms too so we want to make their lifespans short to facilitate allocating them to accumulators. total instructions in shared programs: 13043585 -> 13015385 (-0.22%) instructions in affected programs: 8326040 -> 8297840 (-0.34%) helped: 24939 HURT: 19894 total threads in shared programs: 415860 -> 415858 (<.01%) threads in affected programs: 4 -> 2 (-50.00%) helped: 0 HURT: 1 total uniforms in shared programs: 3721953 -> 3720451 (-0.04%) uniforms in affected programs: 96134 -> 94632 (-1.56%) helped: 744 HURT: 435 total max-temps in shared programs: 2173431 -> 2154260 (-0.88%) max-temps in affected programs: 264598 -> 245427 (-7.25%) helped: 10858 HURT: 841 total spills in shared programs: 4005 -> 4010 (0.12%) spills in affected programs: 700 -> 705 (0.71%) helped: 5 HURT: 10 total fills in shared programs: 5801 -> 5817 (0.28%) fills in affected programs: 1346 -> 1362 (1.19%) helped: 6 HURT: 11 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	a1998a9f43	broadcom/compiler: disallow TMU spills if max tmu spills is 0 If we are compiling with a strategy that does not allow TMU spills we should not allow spilling anything that is not a uniform. Otherwise the RA cost/benefit algorithm may choose to spill a temp that is not uniform and that will cause us to immediately fail the strategy and fallback to the next one, even if we could've instead chosen to spill more uniforms to compile the program successfully with that strategy. Some relevant shader-db stats: total instructions in shared programs: 13040711 -> 13043585 (0.02%) instructions in affected programs: 234238 -> 237112 (1.23%) helped: 73 HURT: 172 total threads in shared programs: 415664 -> 415860 (0.05%) threads in affected programs: 196 -> 392 (100.00%) helped: 98 HURT: 0 total uniforms in shared programs: 3717266 -> 3721953 (0.13%) uniforms in affected programs: 12831 -> 17518 (36.53%) helped: 6 HURT: 100 total max-temps in shared programs: 2174177 -> 2173431 (-0.03%) max-temps in affected programs: 4597 -> 3851 (-16.23%) helped: 79 HURT: 21 total spills in shared programs: 4010 -> 4005 (-0.12%) spills in affected programs: 55 -> 50 (-9.09%) helped: 5 HURT: 0 total fills in shared programs: 5820 -> 5801 (-0.33%) fills in affected programs: 186 -> 167 (-10.22%) helped: 5 HURT: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	cbb4d0dded	broadcom/compiler: increase cost of TMU spills to 10 Our cost was 5 which matches the number of instructions we have to add for a TMU spill (a fill is 4 instructions). Uniform spills on the other hand add an extra instruction for each fill and remove one instruction for the spill itself. These have a cost of 1. Therefore, if we have a single spill+fill, we end up with +9 instructions if it is a TMU spill and +0 instructions with a uniform spill, so making the former only 5 times more costly is probably not a good idea, and this is without even considering the added latency of the TMU accesses. Relevant shader-db changes show this causes as a marginal instruction count increase in a few shaders but better thread counts and lower TMU spilling overall: total instructions in shared programs: 13037315 -> 13040711 (0.03%) instructions in affected programs: 370106 -> 373502 (0.92%) helped: 187 HURT: 321 total threads in shared programs: 415090 -> 415664 (0.14%) threads in affected programs: 574 -> 1148 (100.00%) helped: 287 HURT: 0 total uniforms in shared programs: 3706674 -> 3717266 (0.29%) uniforms in affected programs: 63075 -> 73667 (16.79%) helped: 40 HURT: 395 total max-temps in shared programs: 2176080 -> 2174177 (-0.09%) max-temps in affected programs: 15838 -> 13935 (-12.02%) helped: 316 HURT: 34 total spills in shared programs: 4247 -> 4010 (-5.58%) spills in affected programs: 2599 -> 2362 (-9.12%) helped: 107 HURT: 14 total fills in shared programs: 6121 -> 5820 (-4.92%) fills in affected programs: 3622 -> 3321 (-8.31%) helped: 108 HURT: 13 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Marek Olšák	a02dd17cb3	radeonsi: fix an assertion failure with register shadowing The problem is that dirty_states must be 0 for any state that is NULL in "queued". This code was flagging dirty_states for such states because it was only looking at "emitted". It should have been looking at "queued". Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	0f96948dfa	radeonsi: fix register shadowing after the pm4 state size was decreased Fixes: `946bd90a09` "radeonsi: decrease the size of si_pm4_state::pm4 except for cs_preamble_state" Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	66e20d2bf7	ac: add an environment variable that parses IBs in files Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	3394f0ae14	ac: define PKT3_ATOMIC_MEM Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	ff9e4409c1	ac: parse SET_SH_REG_INDEX packet Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	0cae7a59c0	ac/llvm: update LLVM processor names for gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	87d83f4103	ci: add point coord failures to d3d12 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:56 +00:00
Marek Olšák	5ca7c20cf7	st/mesa: do nir_lower_io() for inputs & outputs with transform feedback info Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:56 +00:00
Marek Olšák	ee4c5b1699	gallium/aux: add helper nir_gather_stream_output_info Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	2a708efec3	gallium/util: add util_dump_stream_output_info Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	1dcd1eac6a	nir: pass nir_shader into nir_recompute_io_bases instead of func_impl Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	606811bded	nir: add nir_print_xfb_info Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	ad68a1ee5a	nir: add nir_gather_xfb_info_from_intrinsics for lowered IO Drivers will use this. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	d4c051b047	nir: add nir_lower_io_passes() with new transform feedback moved from radeonsi without the vectorization, which won't be needed for now. We will lower IO in st/mesa instead of radeonsi to get the transform feedback info into store instructions. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	3528dcdfa1	nir: add nir_io_semantics::no_varying, no_sysval_output, and helpers This is for drivers that have separate store instructions for varyings, system value outputs (such as clip distances), and transform feedback. The flags tell the driver not to store the output to those locations. This will be used by radeonsi initially, and then maybe by a new linker. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	548b2d47b2	nir: scalarize transform feedback info in nir_lower_io_to_scalar Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	cc5505088b	nir: add shader_info::xfb_strides NIR now fully contains pipe_stream_output_info in shader_info and IO intrinsics if lower_io_variables is true. radeonsi will not use pipe_stream_output_info after this, and other drivers are free to follow that. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	4636fa7f38	nir: add transform feedback info into nir_intrinsic_store_output This will allow compaction of transform feedback varyings because they are no longer tied to varying slots with this information. It will also make transform feedback info available to all NIR passes after IO is lowered. It's meant to replace pipe_stream_output_info. Other intrinsics are not used with transform feedback. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	2c6e41bfe1	nir: fix nir_io_semantics::gs_streams in nir_lower_io_to_scalar gs_streams is relative to the component. Also clear the high bits. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	73ef225fc2	nir: validate write_mask for all intrinsics that have it Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	dd733fa52e	radeonsi: fix broken VK-GL buffer interop Fixes: `ad9b5ac0a1` - radeonsi: more fixes for si_buffer_from_winsys_buffer for GL-VK interop Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6063 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Tested-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15124>	2022-03-01 20:51:04 +00:00
Nanley Chery	c1a7d520f3	anv: Disable aux if the explicit modifier lacks it For dmabuf imports, configure the primary surface without support for compression if the modifier doesn't specify it. This helps to create VkImages with memory requirements that are compatible with the buffers apps provide. Suggested-by: Philip Langdale <philipl@overt.org> Cc: 22.0 <mesa-stable> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5940 Tested-by: Philip Langdale <philipl@overt.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15181>	2022-03-01 20:05:50 +00:00
Nanley Chery	cada519482	anv: Refactor anv_image_init_from_create_info Use a variable to store the anv_image_create_info struct. We'll modify it for a bug fix in the next patch. Cc: 22.0 <mesa-stable> Tested-by: Philip Langdale <philipl@overt.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15181>	2022-03-01 20:05:50 +00:00
Nanley Chery	8d2b7e558b	anv: Change a parameter of the implicit layout fn Replace the create_info parameter with isl_extra_usage_flags to more closely match the parameters of explicit layout function. Tested-by: Philip Langdale <philipl@overt.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15181>	2022-03-01 20:05:50 +00:00
Alyssa Rosenzweig	c3eee6327c	pan/va: Add missing copyright notice Minor. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	eda00fd39d	pan/bi: Extract INSTRUCTION_CASE macro Useful across multiple optimization tests. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	ffde1f359b	pan/bi: Adapt bi_lower_branch for Valhall Disable the Bifrost optimization; it's not portable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	f3937d9874	pan/bi: Trade off registers/threads on Valhall It's only v6 that's missing this feature. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	7637502c8d	pan/bi: Add BI_SUBGROUP_SUBGROUP16 option Valhall uses 16-wide warps. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	ec9c1f8fa6	pan/bi: Wire Valhall disassembler into compiler Useful when we grow Valhall support (soon!) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	31e991d801	pan/bi: Support standalone Valhall disassembly $ bifrost_compiler disasm --gpu=G78 foo.bin Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	600f689a98	pan/bi: Allow CSE of preloaded registers Needed to CSE `LEA_VARY` in varying shaders on Valhall. No shader-db changes on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	3154df232b	pan/bi: Use a progress loop for constant folding Needed to fold the dependent patterns produced by texture instructions during NIR->Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	e5582710f3	pan/bi: Mark NOP as having no destinations More accurate and more convenient. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	2604c65174	panfrost: Unify barrier+helper handling These are unified in the hardware, so let's unify them in pan_shader_info. Hoisting this logic to pan_shader.c avoids the need to duplicate this logic for Midgard/Bifrost (RSD packing) and Valhall (SPD packing). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	30d0c2e390	panfrost: Set texel_interleave on Valhall Instead of specifying the tiling on the texture descriptor, Valhall specifies it on the plane descriptor. There is a new flag on the texture descriptor specifying only whether the planes are interleaved or not. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	407bda4d8c	panfrost: Adapt estimate_texture_payload_size to Valhall The plane descriptor is larger than earlier surface descriptors, so we need to be somewhat careful here. This removes a memory micro-optimization in the interest of simplifying the code. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	469a36d071	panfrost: Don't emit compression tags on Valhall Unnecessary. To avoid even more #if/#endif soup, merge the v4, v5-v8, and v9 paths together -- by returning 0 as the compression tag on v4 or v9. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	087b63cb07	panfrost: Allow uploading fragment SPDs SPDs don't have the state dependence that fragment RSDs do. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	e42b0c68f4	panfrost: Don't pack blend constants with blend shaders It's probably harmless, but it is logically meaningless. The DDK doesn't do it, I don't see a reason for us to, either. In theory this should be a small overhead win. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	111f5af303	panfrost: Generalize some is_bifrost users Valhall would want these too. Regretting the is_bifrost check at all.. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	36a2b8d039	panfrost: Add PAN_MESA_DEBUG=dump option To dump all graphics memory via the new pandecode_dump_mappings function(), since for Valhall I have to do this often enough to warrant a dynamic flag. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	28743a5556	panfrost: Rename prepare_rsd->prepare_shader This hook will be repurposed on Valhall to prepare the Shader Program Descriptor, which takes the role of the RSD. Rename to avoid confusion. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	631c01fc42	panfrost: Add an enum for Valhall resource tables Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	f3c971e0fe	panfrost: Make Divisor E an integer on v9 For consistency with previous architecture's XML files. Logically this is an 1-bit unsigned integer, not a boolean. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	b19afaf307	panfrost: Clarify contains descriptor? bit Influences cache prefetching. I don't see a good reason to put anything other than descriptors inside shader resources, meaning always setting this bit is appropriate (at least for GLES). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00

1 2 3 4 5 ...

150687 commits