fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 15:30:14 +01:00

Author	SHA1	Message	Date
Samuel Pitoiset	fdc212bd7b	aco: create a new builder variant for ds_add_rtn This instruction can use 1 definition and 3 operands. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19317>	2022-10-31 13:48:39 +00:00
Samuel Pitoiset	c481978ac2	aco: split the sendmsg enumeration into sendmsg_rtn Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19267>	2022-10-25 20:23:07 +02:00
Rhys Perry	6407d783ea	aco: update sendmsg enum from LLVM Add GFX11 enums and some new ones that apparently existed before. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17710>	2022-09-30 20:57:02 +00:00
Rhys Perry	826ed52174	aco/tests: add GFX11 assembly tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:57 +00:00
Rhys Perry	aadb7aef01	aco: add VINTERP instruction format Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:56 +00:00
Rhys Perry	55cd74d468	aco: add LDSDIR instruction format Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:56 +00:00
Rhys Perry	7a1b522148	aco: rename Interp_instruction to VINTRP_instruction These is clearer since GFX11 adds another interpolation format. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17333>	2022-09-26 14:49:56 +00:00
Rhys Perry	931a456db1	aco: improve support for scratch_* instructions Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17079>	2022-07-08 14:49:03 +00:00
Rhys Perry	dae1629778	aco: disable sdwa on gfx11 Instead of SDWA v_mov_b32/v_xor_b32, we can use a combination of v_add_u16/v_sub_u16 (add/sub swap, similar to xor swap) and v_perm_b32 with a literal. I don't know yet if GFX11 adds any new instructions which makes this easier, but this approach should have full functionality. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16595>	2022-05-31 18:07:34 +00:00
Marek Olšák	39800f0fa3	amd: change chip_class naming to "enum amd_gfx_level gfx_level" This aligns the naming with PAL. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Pierre-Eric Pellou-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16469>	2022-05-13 14:56:22 -04:00
Daniel Schürmann	d703a0e808	aco: remove register hints entirely Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Daniel Schürmann	2fe005a3fe	aco: remove occurences of VCC hint Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Tatsuyuki Ishi	da0412e55b	aco: support DPP8 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13971>	2021-12-31 20:56:39 +00:00
Rhys Perry	1988a78430	aco: fix vadd32() when b is neither a constant nor temporary This will be useful for compiling vertex shader prologs, where we basically use ACO as an assembler. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11717>	2021-10-13 05:13:10 +00:00
Tony Wasserka	76554419b3	aco: Remove use of deprecated Operand constructors in aco_builder.h Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11653>	2021-07-13 17:43:26 +00:00
Daniel Schürmann	59fdaa1985	aco: reorder and cleanup #includes Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11271>	2021-07-12 12:09:31 +00:00
Rhys Perry	c768d7d8f2	aco/tests: add SDWA tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:43 +00:00
Rhys Perry	298d400e5c	aco/tests: add test for NSAToVMEMBug Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9187>	2021-03-17 12:31:05 +00:00
Rhys Perry	441ead5fb3	aco: remove Format::{VOP3A,VOP3B} These are really the same as Format::VOP3. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8595>	2021-01-22 14:12:32 +00:00
Daniel Schürmann	5ad52ac906	aco: create helpers to emit vop3p instructions Also make get_alu_src() capable to return unswizzled multi-component SGPR sources. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6680>	2021-01-13 17:46:56 +00:00
Rhys Perry	382f50ad2c	aco: implement sparse texture fetches Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7775>	2021-01-08 14:27:07 +00:00
Samuel Pitoiset	be600b009a	aco: add a new Operand flag to indicate that is 24-bit To indicate that the upper 8-bits are always 0 to optimize more MADs. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7673>	2020-11-23 18:34:40 +00:00
Rhys Perry	02c5519e6c	aco: try harder to not create v_mul_lo_u32 fossil-db (Vega): Totals from 4 (0.00% of 137413) affected shaders: CodeSize: 13708 -> 13716 (+0.06%) Instrs: 2742 -> 2744 (+0.07%) Cycles: 24348 -> 24236 (-0.46%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5390>	2020-11-20 19:50:31 +00:00
Rhys Perry	8ca23bcf39	aco: copy constant to sgpr in Builder::v_mul_imm() No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5390>	2020-11-20 19:50:31 +00:00
Rhys Perry	70d665d981	aco: don't create v_mov_b32 in v_mul_imm() We switched to p_parallelcopy for everything else. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5390>	2020-11-20 19:50:31 +00:00
Tony Wasserka	2bb8874320	aco: Fix -Wshadow warnings Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7430>	2020-11-20 09:29:19 +00:00
Samuel Pitoiset	0ea763a727	aco: add a new Operand flag to indicate that is 16-bit To indicate that the upper 16-bits are always 0 and that optimizing v_mad_u32_u16 to v_mul_u32_u24 is valid. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7425>	2020-11-12 12:32:26 +00:00
Rhys Perry	ef95ba8cdd	aco: implement some 16-bit arithmetic instead of lowering fossil-db (parallel-rdp, Navi): Totals from 210 (30.75% of 683) affected shaders: SGPRs: 9704 -> 10248 (+5.61%) VGPRs: 5884 -> 5368 (-8.77%) CodeSize: 1155564 -> 1098752 (-4.92%) Instrs: 199927 -> 189940 (-5.00%) Cycles: 20438392 -> 19860124 (-2.83%) v2: use divergence analysis to determine which instructions to lower. Co-Authored-by: Daniel Schürmann <daniel@schuermann.dev> Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4791>	2020-11-04 11:50:37 +00:00
Rhys Perry	e54c111c45	aco: always use p_parallelcopy for pre-RA copies Most fossil-db changes are because literals are applied earlier (in label_instruction), so use counts are more accurate and more literals are applied. fossil-db (Navi): Totals from 79551 (57.89% of 137413) affected shaders: SGPRs: 4549610 -> 4542802 (-0.15%); split: -0.19%, +0.04% VGPRs: 3326764 -> 3324172 (-0.08%); split: -0.10%, +0.03% SpillSGPRs: 38886 -> 34562 (-11.12%); split: -11.14%, +0.02% CodeSize: 240143456 -> 240001008 (-0.06%); split: -0.11%, +0.05% MaxWaves: 1078919 -> `1079281` (+0.03%); split: +0.04%, -0.01% Instrs: 46627073 -> 46528490 (-0.21%); split: -0.22%, +0.01% fossil-db (Polaris): Totals from 98463 (70.90% of 138881) affected shaders: SGPRs: 5164689 -> 5164353 (-0.01%); split: -0.02%, +0.01% VGPRs: 3920936 -> 3921856 (+0.02%); split: -0.00%, +0.03% SpillSGPRs: 56298 -> 52259 (-7.17%); split: -7.22%, +0.04% CodeSize: 258680092 -> 258692712 (+0.00%); split: -0.02%, +0.03% MaxWaves: 620863 -> 620823 (-0.01%); split: +0.00%, -0.01% Instrs: 50776289 -> 50757577 (-0.04%); split: -0.04%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	74e2e9b682	aco: don't use bld.copy() in handle_operands() No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	1a652244e4	aco: implement 16-bit literals We can copy any value into a 16-bit subregister with a 3 dword v_pack_b32_f16 on GFX10 or a v_and_b32+v_or_b32 on GFX9. Because the generated code can depend on the register assignment and to improve constant propagation, Builder::copy creates a p_create_vector in the case of sub-dword literals. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7111>	2020-10-15 11:33:42 +00:00
Rhys Perry	bf77f539ee	aco: optimize more uniform reductions/scans Uniform atomic optimization will create these. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:20 +00:00
Timur Kristóf	ecfabfd606	aco: Add wave-specific opcode for s_lshl and s_flbit. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Rhys Perry	ec2185c598	aco: keep track of temporaries' regclasses in the Program A future change will switch the liveness sets to bit vectors, which don't contain regclass information. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6733>	2020-09-21 13:47:28 +00:00
Rhys Perry	ae6330d955	aco/tests: add test for GFX10 0x3f bug Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6212>	2020-08-26 13:26:58 +00:00
Rhys Perry	fe2dc41258	aco: create long jumps When the branch offset can't be encoded, we have to use s_setpc_b64. Fixes hang in RPCS3 vertex ubershader. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3231 Cc: 20.2 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6212>	2020-08-26 13:26:58 +00:00
Rhys Perry	156fd58cda	aco: reserve 2 sgprs for each branch We'll need two sgprs for the possibility of a long jump. fossil-db (Navi): Totals from 10197 (7.50% of 135946) affected shaders: SGPRs: 946268 -> 946468 (+0.02%) VGPRs: 705884 -> 707956 (+0.29%); split: -0.00%, +0.30% SpillSGPRs: 31485 -> 36212 (+15.01%); split: -0.04%, +15.05% CodeSize: 88296484 -> 88384604 (+0.10%); split: -0.01%, +0.11% MaxWaves: 81379 -> 81171 (-0.26%) Instrs: 17219111 -> 17231682 (+0.07%); split: -0.03%, +0.10% Cycles: 1594875900 -> 1596450136 (+0.10%); split: -0.05%, +0.15% VMEM: 1687263 -> 1689080 (+0.11%); split: +0.14%, -0.03% SMEM: 657726 -> 660262 (+0.39%); split: +0.61%, -0.22% VClause: 294806 -> 294638 (-0.06%); split: -0.08%, +0.02% SClause: 556702 -> 556210 (-0.09%); split: -0.12%, +0.03% Copies: 1466323 -> 1469349 (+0.21%); split: -0.57%, +0.78% Branches: 619793 -> 618556 (-0.20%); split: -0.28%, +0.08% PreSGPRs: 806364 -> 811477 (+0.63%); split: -0.14%, +0.77% PreVGPRs: 655845 -> 657174 (+0.20%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Cc: 20.2 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6212>	2020-08-26 13:26:58 +00:00
Rhys Perry	fc9f502a5b	aco: fix regclass checks when fixing to vcc/exec with Builder Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Cc: 20.2 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6212>	2020-08-26 13:26:58 +00:00
Rhys Perry	e6366f9094	aco: add framework for unit testing And add some "tests" to test and document currently unused features of the framework. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3521>	2020-07-30 16:13:08 +00:00
Rhys Perry	2694a34aa2	aco: add NUW flag This (combined with a pass to actually set the corresponding NIR flags) should help fix a lot of the regressions from the SMEM addition combining change. fossil-db (Navi): Totals from 12 (0.01% of 135946) affected shaders: CodeSize: 12376 -> 12304 (-0.58%) Instrs: 2436 -> 2422 (-0.57%) VMEM: 1105 -> 1096 (-0.81%) SClause: 133 -> 130 (-2.26%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2720>	2020-07-21 18:25:35 +00:00
Rhys Perry	d169f09e37	aco: be more careful combining additions that could wrap into loads/stores SMEM does the addition with 64-bits, not 32. So if the original code relied on wrapping around (for example, for subtraction), it would break. Apparently swizzled MUBUF accesses also have issues with combining additions that could overflow. Normal MUBUF accesses seem fine. fossil-db (Navi): Totals from 27219 (20.02% of 135946) affected shaders: CodeSize: 128303256 -> 131062756 (+2.15%); split: -0.00%, +2.15% Instrs: 24818911 -> 25280558 (+1.86%); split: -0.01%, +1.87% VMEM: 162311926 -> 177226874 (+9.19%); split: +9.36%, -0.17% SMEM: 18182559 -> 20218734 (+11.20%); split: +11.53%, -0.34% VClause: 423635 -> 424398 (+0.18%); split: -0.02%, +0.20% SClause: 865384 -> 1104986 (+27.69%); split: -0.00%, +27.69% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2748 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2720>	2020-07-21 18:25:35 +00:00
Rhys Perry	d377fbf95d	aco: optimize some masked swizzles to DPP Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5695>	2020-07-13 14:11:50 +00:00
Rhys Perry	3d6f67950d	aco: improve 8/16-bit constants fossil-db (Navi, fp16 enabled): Totals from 1 (0.00% of 127638) affected shaders: CodeSize: 4540 -> 4388 (-3.35%) Instrs: 861 -> 830 (-3.60%) Cycles: 3444 -> 3320 (-3.60%) VMEM: 489 -> 465 (-4.91%) SMEM: 107 -> 110 (+2.80%) SClause: 31 -> 30 (-3.23%) Copies: 58 -> 54 (-6.90%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5245>	2020-06-15 18:24:22 +00:00
Rhys Perry	1b6a319c15	aco: add and set precise flag No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5245>	2020-06-15 18:24:22 +00:00
Samuel Pitoiset	f31c9b4edf	aco: fix subdword copies on GFX6-GFX7 SDWA is only GFX8+. Use v_mov_b32 since the upper 16 bits don't matter. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5227>	2020-06-03 19:48:42 +02:00
Timur Kristóf	14a5021aff	aco/gfx10: Refactor of GFX10 wave64 bpermute. The emulated GFX10 wave64 bpermute no longer needs a linear_vgpr, so we don't consider it a reduction anymore. Additionally, the code is slightly reorganized in preparation for the GFX6 emulated bpermute. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5223>	2020-06-02 21:12:12 +00:00
Rhys Perry	2ab45f41e0	aco: implement sub-dword swaps Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4469>	2020-04-22 13:25:17 +00:00
Daniel Schürmann	ca38c1f1f1	aco: add builder function for subdword copy() Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-By: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4002>	2020-04-03 23:13:15 +01:00
Rhys Perry	215df21dea	aco: fix carry-out size for wave32 v_add_co_u32_e64 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-By: Timur Kristóf <timur.kristof@gmail.com> Fixes: `e0bcefc3a0` ('aco/wave32: Use lane mask regclass for exec/vcc.') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3902>	2020-03-03 18:31:06 +00:00
Daniel Schürmann	71440ba0f5	aco: reorder VMEM operands in ACO IR For all VMEM instructions, the resource constant is now in operands[0]. For MIMG instructions, the sampler shares operands[1] with write data in case this instruction writes memory. Moving the VADDR to be the last operand for MIMG is the first step to support Navi NSA encoding. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3602>	2020-01-29 18:45:23 +00:00

1 2

61 commits