fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 22:00:13 +01:00

Author	SHA1	Message	Date
Timur Kristóf	db5c3f170f	aco: Emulate Wave64 bpermute on GFX11. Similar to emit_gfx10_wave64_bpermute, but uses the new v_permlane64_b32 instruction to swap data between wave halves. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20293>	2022-12-14 13:54:04 +00:00
Timur Kristóf	853e76f007	aco: Stylistic changes to emit_gfx10_wave64_bpermute. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20293>	2022-12-14 13:54:04 +00:00
Timur Kristóf	640e801651	aco: Split opcodes for GFX6 and GFX10 emulated bpermute. Different sequences are emitted for these, so it makes sense to have different opcodes too. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20293>	2022-12-14 13:54:04 +00:00
Bas Nieuwenhuizen	89663828ea	aco: Don't use v_lshrrev_b64 for moves on GFX11. Looking at VOPD things, shifts are not very likely to get dual issued but plain moves are. Looking at RDNA2 v_lshrrev_b64 are half the perf of v_mov_b32 (but you need twice as many moves), so on GFX11 this likely reaches the threshold where moves are faster. Totals from 68400 (50.70% of 134906) affected shaders: CodeSize: 275489516 -> 275459536 (-0.01%); split: -0.01%, +0.00% Instrs: 51775474 -> 51991286 (+0.42%) Latency: 589884847 -> 589066439 (-0.14%); split: -0.15%, +0.01% InvThroughput: 127154986 -> 126037619 (-0.88%); split: -0.88%, +0.00% Copies: 3756157 -> 3976193 (+5.86%) Branches: 1259604 -> 1260072 (+0.04%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19633>	2022-12-02 13:25:57 +00:00
Rhys Perry	9b6ab40b3b	aco: improve do_pack_2x16() with zero constants We can skip the v_or_b32 or use an instruction smaller than v_alignbyte_b32. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19933>	2022-12-01 21:43:28 +00:00
Rhys Perry	ce5838599d	aco/gfx11: use v_cvt_i32_i16/v_cvt_u32_u16 fossil-db (gfx1100): Totals from 52753 (39.07% of 135032) affected shaders: CodeSize: 153603860 -> 153163384 (-0.29%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19933>	2022-12-01 21:43:28 +00:00
Samuel Pitoiset	ce11c06429	aco: fix emitting DEALLOC_VGPRS in the discard block It should be emitted right before s_endpgm. Cc: 22.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19931>	2022-11-22 19:52:04 +00:00
Samuel Pitoiset	bb90d29660	aco: add p_dual_src_export_gfx11 for dual source blending on GFX11 Dual source blending must be in strict WQM mode. Cc: 22.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19643>	2022-11-16 18:35:10 +00:00
Daniel Schürmann	efc0835787	aco: move statistics enum to aco_shader_info.h to make it accessible from the driver. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19721>	2022-11-15 16:34:07 +00:00
Samuel Pitoiset	369c9b6425	aco: fix p_interp_gfx11 to not overwrite SCC s_wqm_b64 clobbers SCC. Found this while working on dual source blending. Fixes: `6113ee650a` ("aco/gfx11: fix FS input loads in quad-divergent control flow") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19747>	2022-11-15 15:57:31 +00:00
Georg Lehmann	9746ddf1d6	aco: Use s_pack_ll_b32_b16 for scalar zero extend. Foz-DB Navi21: Totals from 2403 (1.78% of 134913) affected shaders: CodeSize: 25329156 -> 25311244 (-0.07%) Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19413>	2022-11-01 18:59:53 +00:00
Rhys Perry	6113ee650a	aco/gfx11: fix FS input loads in quad-divergent control flow This is not ideal and it would be great to somehow make it better some day. fossil-db (gfx1100): Totals from 5208 (3.86% of 135032) affected shaders: MaxWaves: 127058 -> 126962 (-0.08%); split: +0.01%, -0.09% Instrs: 3983440 -> 4072736 (+2.24%); split: -0.00%, +2.24% CodeSize: 21872468 -> 22230852 (+1.64%); split: -0.00%, +1.64% VGPRs: 206688 -> 206984 (+0.14%); split: -0.05%, +0.20% Latency: 37447383 -> 37491197 (+0.12%); split: -0.05%, +0.17% InvThroughput: 6421955 -> 6422348 (+0.01%); split: -0.03%, +0.03% VClause: 71579 -> 71545 (-0.05%); split: -0.09%, +0.04% SClause: 148289 -> 147146 (-0.77%); split: -0.84%, +0.07% Copies: 259011 -> 258084 (-0.36%); split: -0.61%, +0.25% Branches: 101366 -> 101314 (-0.05%); split: -0.10%, +0.05% PreSGPRs: 223482 -> 223460 (-0.01%); split: -0.21%, +0.20% PreVGPRs: 184448 -> 184744 (+0.16%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19370>	2022-11-01 12:42:43 +00:00
Rhys Perry	3da4fe9c6d	aco: fix typo in branch lowering Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `aadb7aef01` ("aco: add VINTERP instruction format") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19370>	2022-11-01 12:42:43 +00:00
Rhys Perry	f32dde2902	aco: swap v_perm_b32 operands I misread the ISA doc and got the order wrong. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `dae1629778` ("aco: disable sdwa on gfx11") Fixes: `e68e6c75ca` ("aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19223>	2022-10-24 10:52:56 +00:00
Timur Kristóf	d8639b7a80	aco: Allow explicitly removing jumps on GFX10+ when beneficial. "Removing jumps" in ACO means skipping the jump instruction at the beginning of a divergent branch (but still modify exec). ACO already supports implicitly removing jumps when it decides that executing a branch with empty exec mask is more beneficial than a jump. This commit adds the possibility to use this explicitly through nir_selection_control. ACO will respect this setting and remove the branch instructions when this is specified, unless it decides that this would cause bugs (eg. exp instruction). There are two cases that benefit from the new change: 1. When the application requests to "flatten" a branch (ie. remove control flow), we now respect that. 2. When the compiler stack determines that a divergent branch is always taken. v2 by Georg Lehmann: fixed applying sel_ctrl to else blocks Fossil DB stats on Navi 21: Totals from 13 (0.01% of 134906) affected shaders: CodeSize: 136616 -> 136496 (-0.09%) Instrs: 26196 -> 26166 (-0.11%) Latency: 417928 -> 417889 (-0.01%) Branches: 1241 -> 1211 (-2.42%) Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-By: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17921>	2022-10-11 15:42:54 +00:00
Rhys Perry	2930317cea	aco/gfx11: deallocate VGPRs at the end of the shader fossil-db (gfx1100): Totals from 65987 (40.81% of 161689) affected shaders: Instrs: 57123207 -> 57199947 (+0.13%) CodeSize: 308402500 -> 308709460 (+0.10%) Latency: 680527139 -> 680527160 (+0.00%) InvThroughput: 131620026 -> 131620045 (+0.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17710>	2022-09-30 20:57:02 +00:00
Rhys Perry	aadb7aef01	aco: add VINTERP instruction format Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:56 +00:00
Rhys Perry	55cd74d468	aco: add LDSDIR instruction format Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:56 +00:00
Timur Kristóf	16c14663e5	aco: Fix p_init_scratch for task shaders. Fixes: `d2d94b62f2` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18339>	2022-09-01 09:10:47 +00:00
Daniel Schürmann	3d6ea4f666	aco: use std::vector::reserve() more often This removes the majority of vector re-allocations. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18105>	2022-08-30 16:03:26 +00:00
Rhys Perry	fb13ed6ff0	aco: fix long-jump version of discard early exit It isn't safe to modify the exec mask before the discard block, and the definition interferes with GFX11 NOP insertion. Just use s[0:1] instead, since we won't be using it. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18125>	2022-08-25 16:10:53 +00:00
Samuel Pitoiset	8bdcc20815	aco: add new pseudo instruction p_jump_to_epilog The first operand of this new pseudo-instruction is a 64-bit SGPR for the continue PC, followed by a variable list of fixed VGPRS for the color exports which are the PS epilog inputs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17485>	2022-07-18 18:40:02 +00:00
Rhys Perry	d2d94b62f2	aco: initialize scratch base registers on GFX9-GFX10.3 fossil-db (navi21): Totals from 1142 (0.70% of 162293) affected shaders: Instrs: 271636 -> 271974 (+0.12%) CodeSize: 1532020 -> 1533792 (+0.12%) Latency: 7484066 -> 7485698 (+0.02%) InvThroughput: 4048824 -> 4049579 (+0.02%) SClause: 4171 -> 4212 (+0.98%) PreSGPRs: 11203 -> 12276 (+9.58%) fossil-db (vega10): Totals from 3327 (2.06% of 161355) affected shaders: Instrs: 257413 -> 257601 (+0.07%) CodeSize: 1424244 -> 1425372 (+0.08%) Latency: 8598402 -> 8600466 (+0.02%) InvThroughput: 7906335 -> 7908234 (+0.02%) SClause: 4932 -> 4973 (+0.83%) PreSGPRs: 22010 -> 25405 (+15.42%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	84b404d34d	aco: don't use 32-bit fp inline constants for fp16 vop3p literals If we're applying the literal 0x3f800000 to a fp16 vop3p instruction, we shouldn't use the 1.0 inline constant, because the hardware will use the 16-bit 1.0: 0x00003c00. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16296>	2022-07-05 16:39:56 +00:00
Rhys Perry	9739c07d9e	aco: fix single-alignbyte do_pack_2x16() path with fp inline constants We were using a 16-bit inline constant with a 32-bit instruction and the test would have created "v1: %_:v[0] = v_alignbyte_b32 0.5, %_:v[1][16:32], 2" instead. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16296>	2022-07-05 16:39:56 +00:00
Rhys Perry	e68e6c75ca	aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Rhys Perry	dae1629778	aco: disable sdwa on gfx11 Instead of SDWA v_mov_b32/v_xor_b32, we can use a combination of v_add_u16/v_sub_u16 (add/sub swap, similar to xor swap) and v_perm_b32 with a literal. I don't know yet if GFX11 adds any new instructions which makes this easier, but this approach should have full functionality. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Rhys Perry	9b60a67841	aco: clarify a portion of do_pack_2x16 This confused me a bit when I first saw it. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Rhys Perry	4513cb8d41	aco: only add/subtract low bits of program addresses fossil-db (Sienna Cichlid): Totals from 4007 (2.47% of 162293) affected shaders: Instrs: 3733239 -> 3728018 (-0.14%) CodeSize: 20770340 -> 20749456 (-0.10%) Latency: 46883958 -> 46872764 (-0.02%); split: -0.02%, +0.00% InvThroughput: 10550392 -> 10548698 (-0.02%); split: -0.02%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16460>	2022-05-23 11:52:54 +00:00
Rhys Perry	bd8f8dda8c	aco: fix p_constaddr with a non-zero offset Seems this broke a while ago and we never noticed. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `0af7ff49fd` ("aco: lower p_constaddr into separate instructions earlier") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16460>	2022-05-23 11:52:54 +00:00
Marek Olšák	39800f0fa3	amd: change chip_class naming to "enum amd_gfx_level gfx_level" This aligns the naming with PAL. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16469>	2022-05-13 14:56:22 -04:00
Samuel Pitoiset	bc8da20dda	aco: export MRT0 instead of NULL on GFX11 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16369>	2022-05-12 15:46:20 +00:00
Rhys Perry	d28b6b6856	aco: rework removal of jumps over branches Only allow this in situations where we know it's safe. In particular, this stops removal of unconditional branches like with block_kind_continue_or_break. Fixes dEQP-VK.graphicsfuzz.fragcoord-control-flow hang. fossil-db (Sienna Cichlid): Totals from 34 (0.02% of 162293) affected shaders: Instrs: 84115 -> 84178 (+0.07%); split: -0.00%, +0.08% CodeSize: 463372 -> 463624 (+0.05%); split: -0.00%, +0.06% Latency: 3467316 -> 3467652 (+0.01%) InvThroughput: 3085493 -> 3085578 (+0.00%) Branches: 3221 -> 3284 (+1.96%); split: -0.03%, +1.99% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `f030b75b7d` ("aco: relax condition to remove branches in case of few instructions") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15214>	2022-03-04 12:32:36 +00:00
Daniel Schürmann	f030b75b7d	aco: relax condition to remove branches in case of few instructions This patch relaxes the conditions under which we remove branch instructions. Totals from 27246 (20.20% of 134913) affected shaders: (GFX10.3) CodeSize: 193413312 -> 192924928 (-0.25%) Instrs: 36146788 -> 36024692 (-0.34%) Latency: 528374112 -> 528469044 (+0.02%); split: -0.01%, +0.02% InvThroughput: 106198759 -> 106216583 (+0.02%); split: -0.00%, +0.02% Branches: 1040640 -> 918543 (-11.73%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8647>	2022-02-25 15:38:08 +00:00
Daniel Schürmann	08b8500dfb	aco: remove block_kind_discard This case doesn't seem to happen in practice. No need to micro-optimize it. This patch merges instruction selection for discard/discard_if. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14805>	2022-02-08 16:16:07 +00:00
Timur Kristóf	f14023666c	aco: Allow p_extract to have different definition and operand sizes. Makes p_extract more flexible and prepares it to be usable for other use cases. No Fossil DB changes. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11560>	2021-09-28 17:59:27 +00:00
Rhys Perry	8d50385bbd	aco: implement linear vgpr copies Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12172>	2021-09-17 14:36:03 +00:00
Rhys Perry	e2b77e6cd4	aco: add and use RegClass::resize helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12172>	2021-09-17 14:36:03 +00:00
Rhys Perry	f241bd3749	aco: don't coalesce constant copies into non-power-of-two sizes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12702>	2021-09-03 14:01:27 +01:00
Daniel Schürmann	8bd7e2392b	aco: preserve subdword RC when lowering p_insert/p_extract Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12640>	2021-09-02 20:39:17 +02:00
Daniel Schürmann	0988f7b9ba	aco: remove explicit dst_preserve flag Instead, we can rely on the fact that subdword definitions must preserve the unused bits while dword definitions either pad or sign-extend. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12640>	2021-09-02 20:39:17 +02:00
Daniel Schürmann	9e3ff06c38	aco: rewrite SDWA selector This commit introduces a new struct SubdwordSel in order to ease and clean up the usage of SDWA selections. This includes removing the distinction between register-allocated and fixed SDWA selections. Instead, SDWA selections can now also access the high bits of subdword variables. Alignment and sizes are validated accordingly. Size, offset and sign_extend can be evaluated via helper methods. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12640>	2021-09-02 20:39:17 +02:00
Daniel Schürmann	cc4682ed47	aco: fix p_insert lowering with 16bit sources The previous lowering only wrote a single byte. Fixes: `2f94353735` ('aco: add p_extract/p_insert') Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12640>	2021-09-02 20:39:17 +02:00
Tony Wasserka	66e51dc474	aco: Remove use of deprecated Operand constructors This migration was done with libclang-based automatic tooling, which performed these replacements: * Operand(uint8_t) -> Operand::c8 * Operand(uint16_t) -> Operand::c16 * Operand(uint32_t, false) -> Operand::c32 * Operand(uint32_t, bool) -> Operand::c32_or_c64 * Operand(uint64_t) -> Operand::c64 * Operand(0) -> Operand::zero(num_bytes) Casts that were previously used for constructor selection have automatically been removed (e.g. Operand((uint16_t)1) -> Operand::c16(1)). Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11653>	2021-07-13 17:43:26 +00:00
Daniel Schürmann	98136bda05	aco: fix self-intersecting register swaps Splitting self-intersecting register swaps into 3 sections was unnecessary and only worked because the middle section was always empty for full dword swaps. No fossil-db changes. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11823>	2021-07-13 09:14:43 +02:00
Daniel Schürmann	1e2639026f	aco: Format. Manually adjusted some comments for more intuitive line breaks. Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11258>	2021-07-12 21:27:31 +00:00
Daniel Schürmann	0eea0e55ad	aco: add 'common/' and 'llvm/' prefix to #includes Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11271>	2021-07-12 12:09:31 +00:00
Daniel Schürmann	3f9e986d33	aco: add missing Licenses and remove Authors from files Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11271>	2021-07-12 12:09:31 +00:00
Daniel Schürmann	59fdaa1985	aco: reorder and cleanup #includes Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11271>	2021-07-12 12:09:31 +00:00
Rhys Perry	2f94353735	aco: add p_extract/p_insert These will let us make the SDWA optimizer much simpler than if we were to recognize combinations of shift/and/bfe. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:42 +00:00
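
Commit `84b404d34d` in the table above notes that the 32-bit literal 0x3f800000 (1.0 as fp32) must not be replaced by the 1.0 inline constant on an fp16 vop3p instruction, because the hardware would then use the 16-bit 1.0, 0x00003c00. The sketch below only illustrates why the two bit patterns differ. It is not ACO code: the helper names `f32_bits` and `f32_to_f16_bits_simple` are hypothetical, it assumes `float` is an IEEE-754 binary32, and the conversion handles only normal values that are exactly representable in fp16.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper: returns the raw bit pattern of a 32-bit float.
// Assumes float is IEEE-754 binary32.
inline uint32_t f32_bits(float f)
{
   uint32_t u;
   std::memcpy(&u, &f, sizeof u);
   return u;
}

// Hypothetical, deliberately simplified fp32 -> fp16 conversion.
// Only valid for normal values whose fp16 exponent field stays in 1..30;
// it truncates the mantissa and ignores denormals, infinities and NaNs.
inline uint16_t f32_to_f16_bits_simple(float f)
{
   const uint32_t u = f32_bits(f);
   const uint32_t sign = (u >> 16) & 0x8000u;  // move sign bit 31 to bit 15
   const int32_t exp =
      static_cast<int32_t>((u >> 23) & 0xffu) - 127 + 15; // rebias 127 -> 15
   const uint32_t mant = (u >> 13) & 0x3ffu;   // keep the top 10 mantissa bits
   return static_cast<uint16_t>(sign | (static_cast<uint32_t>(exp) << 10) | mant);
}
```

With these helpers, `f32_bits(1.0f)` and `f32_to_f16_bits_simple(1.0f)` give the two encodings named in the commit message. The same numeric value has a different bit pattern at each width, so an inline constant chosen for one width cannot stand in for a literal of the other.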

143 commits
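
Commit `dae1629778` in the listing describes replacing SDWA-based swaps on GFX11 with a combination of v_add_u16/v_sub_u16, calling it an "add/sub swap, similar to xor swap". A minimal sketch of that arithmetic idea on plain 16-bit integers follows. It is not the instruction sequence ACO emits: the helper names `add_sub_swap_u16` and `swapped_u16` are hypothetical, and, as with an xor swap, the two operands must be distinct objects.

```cpp
#include <cstdint>
#include <utility>

// Hypothetical helper, not ACO code: swaps two 16-bit values without a
// temporary, relying on wrap-around (mod 2^16) addition and subtraction.
// The two references must not alias the same object.
inline void add_sub_swap_u16(uint16_t& a, uint16_t& b)
{
   a = static_cast<uint16_t>(a + b); // a = a0 + b0 (mod 2^16)
   b = static_cast<uint16_t>(a - b); // b = a0
   a = static_cast<uint16_t>(a - b); // a = b0
}

// Convenience wrapper that returns the swapped pair by value.
inline std::pair<uint16_t, uint16_t> swapped_u16(uint16_t a, uint16_t b)
{
   add_sub_swap_u16(a, b);
   return {a, b};
}
```

The casts matter in C++ because `uint16_t` operands are promoted to `int` before the arithmetic. Narrowing back to `uint16_t` restores the modulo-2^16 behaviour that a 16-bit add or subtract would have, so the swap still works when the sum overflows 16 bits.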