fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 02:18:10 +02:00

Author	SHA1	Message	Date
Rhys Perry	7e54fea373	aco: fix assembler.gfx11.vinterp test This was missed. I guess CI doesn't have a recent enough LLVM for these tests. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17710>	2022-09-30 20:57:02 +00:00
Rhys Perry	4544490df0	aco: limit hard clauses to 63 instructions See https://reviews.llvm.org/D127391 fossil-db (gfx1100): Totals from 4 (0.00% of 161689) affected shaders: Latency: 24545 -> 24539 (-0.02%) InvThroughput: 102867 -> 102835 (-0.03%) fossil-db (navi10): Totals from 4 (0.00% of 161220) affected shaders: Latency: 25969 -> 25959 (-0.04%) InvThroughput: 112917 -> 112869 (-0.04%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17710>	2022-09-30 20:57:02 +00:00
Rhys Perry	7cecc81683	aco/gfx11: fix s_waitcnt printing Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17710>	2022-09-30 20:57:02 +00:00
Rhys Perry	2cdb3e4b6b	aco: add VMEMtoScalarWriteHazard tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18270>	2022-09-30 11:44:38 +00:00
Rhys Perry	826ed52174	aco/tests: add GFX11 assembly tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:57 +00:00
Rhys Perry	48c8c25e68	aco: omit read-only memory_sync_info when printing Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:57 +00:00
Rhys Perry	aadb7aef01	aco: add VINTERP instruction format Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:56 +00:00
Timur Kristóf	a8dd07518c	aco/optimizer_postRA: Fix logical control flow handling. Change reset_block() so it only considers the logical predecessors for VGPRs. Relevant for some optimizations across loops. This commit fixes an assertion failure which was triggered by Zink in a piglit test. Fossil DB stats unaffected on Navi 21. Fixes: `2e56e23420` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18488>	2022-09-21 16:56:57 +00:00
Timur Kristóf	5e80edfa78	aco/tests: Add post-RA SCC no-compare tests cases with control flow. - scc_nocmp_across_cf: passes - scc_nocmp_across_cf_partially_overwritten: fails (fixed later) Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18488>	2022-09-21 16:56:56 +00:00
Timur Kristóf	d4b3f81d94	aco/tests: Add post-RA DPP test cases with control flow. These are intended to make sure that the post-RA optimizer works correctly across control flow. The new tests emit a divergent if-else branch (with full logical+linear CFG). - dpp_across_cf: Simple case of DPP optimizable across control flow. Should pass. - dpp_across_cf_overwritten: Similar case but the DPP source register is overwritten in CF. This shows a bug so the test fails now (will be fixed). Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18488>	2022-09-21 16:56:56 +00:00
Timur Kristóf	d7cd49d54b	aco/tests: Add post-RA optimizer testcase for partially overwritten VCC. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18488>	2022-09-21 16:56:56 +00:00
Rhys Perry	061b8bfd29	aco/ra: rework fixed operands This moves all fixed operands at once, so they don't interfere with one another. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17493>	2022-09-01 11:22:46 +00:00
Rhys Perry	efcbccaf0e	aco/ra: handle empty def_reg interval in get_regs_for_copies If def_reg is empty, then def_reg.lo() may be lower than bounds.lo() if we're moving VGPRs and info.bounds will be invalid. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17493>	2022-09-01 11:22:46 +00:00
Rhys Perry	fb13ed6ff0	aco: fix long-jump version of discard early exit It isn't safe to modify the exec mask before the discard block, and the definition interferes with GFX11 NOP insertion. Just use s[0:1] instead, since we won't be using it. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18125>	2022-08-25 16:10:53 +00:00
Yonggang Luo	2af3b6756a	amd/compiler: Fixes warning [-Wunused-variable] in test_optimizer_postRA.cpp Warning message: ../src/amd/compiler/tests/test_optimizer_postRA.cpp:137:13: warning: unused variable 'reg_s1' [-Wunused-variable] Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18205>	2022-08-23 14:14:52 +00:00
Yonggang Luo	4a607c2df4	amd/compiler: Fixes warning [-Wunused-variable] in test_to_hw_instr.cpp Warning message: ../src/amd/compiler/tests/test_to_hw_instr.cpp:793:12: warning: unused variable 'reg_s1' [-Wunused-variable] Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18205>	2022-08-23 14:14:52 +00:00
Eric Engestrom	013b022924	aco: drop unused variable Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18193>	2022-08-22 23:05:20 +00:00
Timur Kristóf	1762e6b540	aco: Improve SCC nocompare optimization when SCC is clobbered. When SCC is clobbered between s_cmp and its operand's writer, the current optimization that eliminates s_cmp won't kick in. However, when s_cmp is the only user of its operand temporary, it is possible to "pull down" the instruction that wrote the operand. Fossil DB stats on Navi 21: Totals from 63302 (46.92% of 134906) affected shaders: CodeSize: 176689272 -> 176418332 (-0.15%) Instrs: 33552237 -> 33484502 (-0.20%) Latency: 205847485 -> 205816205 (-0.02%); split: -0.02%, +0.00% InvThroughput: 34321285 -> 34319908 (-0.00%); split: -0.00%, +0.00% Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16266>	2022-08-20 15:27:40 +00:00
Rhys Perry	dd105f7c1e	aco: fix assembly of vopc_sdwa writing exec We would assemble an instruction writing vcc instead. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Fixes: `5ffc73896f` ("aco/assembler: Fix v_cmpx with SDWA.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18077>	2022-08-16 17:31:33 +00:00
Rhys Perry	d55c4180d5	aco/tests: add vop3p constant combine tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16296>	2022-07-05 16:39:56 +00:00
Rhys Perry	9739c07d9e	aco: fix single-alignbyte do_pack_2x16() path with fp inline constants We were using a 16-bit inline constant with a 32-bit instruction and the test would have created "v1: %_:v[0] = v_alignbyte_b32 0.5, %_:v[1][16:32], 2" instead. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16296>	2022-07-05 16:39:56 +00:00
Rhys Perry	5d8f5615d0	aco: ignore precise flag when optimizing integer clamps Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16296>	2022-07-05 16:39:56 +00:00
Rhys Perry	33e7ba2e3e	aco: update SMEM offset workaround for LLVM 15 This isn't needed since LLVM 15's b0ccf38b018. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-By: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6663 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17036>	2022-06-16 00:47:51 +00:00
Rhys Perry	982cc9bcf5	aco/tests: update for GFX11's removal of SDWA Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Rhys Perry	e68e6c75ca	aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Rhys Perry	dae1629778	aco: disable sdwa on gfx11 Instead of SDWA v_mov_b32/v_xor_b32, we can use a combination of v_add_u16/v_sub_u16 (add/sub swap, similar to xor swap) and v_perm_b32 with a literal. I don't know yet if GFX11 adds any new instructions which makes this easier, but this approach should have full functionality. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Rhys Perry	d51dd7527b	aco/tests: fix gfx11 variants printed as gfx12 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Rhys Perry	c8bde76a42	aco/tests: disable regalloc.subdword_alloc.reuse_16bit_operands on GFX11 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Rhys Perry	4513cb8d41	aco: only add/subtract low bits of program addresses fossil-db (Sienna Cichlid): Totals from 4007 (2.47% of 162293) affected shaders: Instrs: 3733239 -> 3728018 (-0.14%) CodeSize: 20770340 -> 20749456 (-0.10%) Latency: 46883958 -> 46872764 (-0.02%); split: -0.02%, +0.00% InvThroughput: 10550392 -> 10548698 (-0.02%); split: -0.02%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16460>	2022-05-23 11:52:54 +00:00
Rhys Perry	69d1f4186a	aco/tests: add test for p_constaddr with a non-zero offset Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16460>	2022-05-23 11:52:54 +00:00
Rhys Perry	bd8f8dda8c	aco: fix p_constaddr with a non-zero offset Seems this broke a while ago and we never noticed. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `0af7ff49fd` ("aco: lower p_constaddr into separate instructions earlier") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16460>	2022-05-23 11:52:54 +00:00
Marek Olšák	2443054932	amd: rename fishes to Navi21, Navi22, Navi23, Navi24, and Rembrandt Reviewed-by: Mihai Preda <mhpreda@gmail.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16604>	2022-05-19 11:55:50 +00:00
Marek Olšák	39800f0fa3	amd: change chip_class naming to "enum amd_gfx_level gfx_level" This aligns the naming with PAL. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pellou-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16469>	2022-05-13 14:56:22 -04:00
Samuel Pitoiset	0cb1b12ec0	aco: recognize GFX11 in few places Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16369>	2022-05-12 15:46:20 +00:00
Dave Airlie	04c07a2413	aco/radv: convert to aco shader info at the radv level. This removes the radv shader info type from aco completely. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16342>	2022-05-11 19:07:11 +00:00
Jason Ekstrand	1b8a43a0ba	util: Remove util_cpu_detect util_cpu_detect is an anti-pattern: it relies on callers high up in the call chain initializing a local implementation detail. As a real example, I added: ...a Mali compiler unit test ...that called bi_imm_f16() to construct an FP16 immediate ...that calls _mesa_float_to_half internally ...that calls util_get_cpu_caps internally, but only on x86_64! ...that relies on util_cpu_detect having been called before. As a consequence, this unit test: ...crashes on x86_64 with USE_X86_64_ASM set ...passes on every other architecture ...works on my local arm64 workstation and on my test board ...failed CI which runs on x86_64 ...needed to have a random util_cpu_detect() call sprinkled in. This is a bad design decision. It pollutes the tree with magic, it causes mysterious CI failures especially for non-x86_64 developers, and it is not justified by a micro-optimization. Instead, let's call util_cpu_detect directly from util_get_cpu_caps, avoiding the footgun where it fails to be called. This cleans up Mesa's design, simplifies the tree, and avoids a class of a (possibly platform-specific) failures. To mitigate the added overhead, wrap it all in a (fast) atomic load check and declare the whole thing as ATTRIBUTE_CONST so the compiler will CSE calls to util_cpu_detect. Co-authored-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15580>	2022-04-20 18:44:35 +00:00
Rhys Perry	63e40adf8c	aco: fix disassembly of SMEM with both SGPR and constant offset Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15890>	2022-04-14 20:58:36 +00:00
Daniel Schürmann	2fe005a3fe	aco: remove occurences of VCC hint Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Rhys Perry	5b4e41e4db	aco: don't use v_mad_mix on GFX9 if 16-bit denormals must be preserved This probably effectively disables the v_mad_mix optimization on GFX9. fossil-db (Vega): Totals from 11545 (7.15% of 161366) affected shaders: MaxWaves: 43025 -> 42780 (-0.57%); split: +0.06%, -0.63% Instrs: 18571635 -> 18734201 (+0.88%); split: -0.00%, +0.88% CodeSize: 96483568 -> 96611012 (+0.13%); split: -0.11%, +0.24% SGPRs: 1079056 -> 1077616 (-0.13%); split: -0.14%, +0.01% VGPRs: 819248 -> 821868 (+0.32%); split: -0.04%, +0.36% SpillSGPRs: 13313 -> 12464 (-6.38%) Latency: 293804093 -> 295046122 (+0.42%); split: -0.09%, +0.51% InvThroughput: 110002239 -> 110994978 (+0.90%); split: -0.03%, +0.93% VClause: 342458 -> 342596 (+0.04%); split: -0.12%, +0.16% SClause: 648566 -> 648046 (-0.08%); split: -0.12%, +0.04% Copies: 1728225 -> 1726679 (-0.09%); split: -0.66%, +0.57% Branches: 552973 -> 552963 (-0.00%); split: -0.02%, +0.02% PreSGPRs: 862360 -> 856820 (-0.64%); split: -0.69%, +0.05% PreVGPRs: 773689 -> 776818 (+0.40%); split: -0.02%, +0.42% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6178 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15718>	2022-04-04 19:27:12 +00:00
Daniel Schürmann	b98a9dcc36	aco/optimizer: fix call to can_use_opsel() in apply_insert() The definition index is -1. Fixes: `54292e99c7` ('aco: optimize 32-bit extracts and inserts using SDWA ') Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15551>	2022-03-25 22:02:50 +00:00
Rhys Perry	177b54ebe9	aco/tests: add v_fma_mix tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Daniel Schürmann	70aea6b41a	aco/ra: refactor collect_vars() to return a sorted vector The vector of IDs is sorted with decreasing sizes, and by increasing assigned registers. This decouples register assingment from ssa IDs. Totals from 12694 (9.41% of 134913) affected shaders: (GFX10.3) VGPRs: 757864 -> 757848 (-0.00%); split: -0.00%, +0.00% CodeSize: 72350540 -> 72348688 (-0.00%); split: -0.02%, +0.02% MaxWaves: 237018 -> 237020 (+0.00%); split: +0.00%, -0.00% Instrs: 13545494 -> 13544699 (-0.01%); split: -0.03%, +0.02% Latency: 148539203 -> 148533292 (-0.00%); split: -0.01%, +0.00% InvThroughput: 30319086 -> 30320382 (+0.00%); split: -0.01%, +0.01% VClause: 326875 -> 327028 (+0.05%); split: -0.05%, +0.09% SClause: 479833 -> 479837 (+0.00%); split: -0.00%, +0.00% Copies: 862152 -> 860914 (-0.14%); split: -0.43%, +0.28% Branches: 317775 -> 317777 (+0.00%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11526>	2022-03-14 08:32:10 +00:00
Rhys Perry	c3070773f8	aco/tests: add test for branch definition RA Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	5e3b8eeac4	aco: add test for optimizations with casts Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Rhys Perry	27f1f5537d	aco/tests: implement sub-dword program inputs Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Rhys Perry	e86b88f85b	aco/tests: add a bunch more building helpers Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Daniel Schürmann	8a78706643	nir: refactor nir_opt_move This patch is a rewrite of nir_opt_move. Differently from the previous version, each instruction is checked if it can be moved downwards and then inserted before the first user of the definition. The advantage is that less insert operations are performed, the original order is kept if two movable instructions have the same first user, and instructions without user in the same block are moved towards the end. v2: Only return true if an instruction really changed the position. Don't care for discards, this will be handled by another MR. v3: fix self-referring phis and update according to nir_can_move_instr(). v4: use nir_can_move_instr() and nir_instr_ssa_def() v5: deduplicate some code Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3657>	2022-01-12 13:41:54 +00:00
Tatsuyuki Ishi	da0412e55b	aco: support DPP8 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13971>	2021-12-31 20:56:39 +00:00
Rhys Perry	6afba80534	aco: don't create DPP instructions with SGPR operands Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `2e6834d4f6` ("aco: combine DPP into VALU before RA") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13976>	2021-11-30 20:11:48 +00:00
Tony Wasserka	b70e551a51	aco/tests: Assert that the requested IR is actually provided In particular, assembly will not be provided if no disassembler is available for the given GPU architecture. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11319>	2021-10-01 10:40:18 +02:00


208 commits