fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 00:38:06 +02:00

Author	SHA1	Message	Date
Timur Kristóf	09b9e52c0d	aco/ngg: Export a zero-area triangle when primitive count is 0. This is a workaround for a bug in Navi 1x NGG HW. Very rarely, the Navi 1x PA can hang when an NGG workgroup exports 0 total primitives. According to AMD, we always need this workaround when it is possible that the number of primitives is 0. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7232>	2020-10-28 21:55:47 +01:00
Timur Kristóf	73449f9a62	aco: Add a few assertions about LDS usage. This is to make sure we don't compile a shader which doesn't fit the available LDS space. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7232>	2020-10-28 21:47:22 +01:00
Timur Kristóf	b6654adc0e	aco: Make emitting reduction instructions a bit more convenient. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7232>	2020-10-28 21:47:22 +01:00
Timur Kristóf	8d6246205a	aco: Add some validation for PSEUDO_REDUCTION instructions. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7232>	2020-10-28 21:47:22 +01:00
Timur Kristóf	260f9c503a	aco/ngg: Put shader query reduction operand into a VGPR. The p_reduce instruction only works if this operand is in a VGPR, and otherwise gets lowered to incorrect code. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7232>	2020-10-28 21:47:22 +01:00
Timur Kristóf	9757c3cb6b	aco: Assert that workgroup barriers are not used inappropriately. Example: It is possible for some NGG GS waves to have 0 ES and/or GS invocations, and in that case having an s_barrier inside divergent control flow can very possibly hang the GPU. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7232>	2020-10-28 21:47:19 +01:00
Rhys Perry	ecdcf22d5d	aco: switch aco_print_asm to a FILE * Streams are really stateful and (IMO) difficult to read for non-trivial usage. This is also more consistent with NIR and the rest of ACO. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7166>	2020-10-28 17:32:32 +00:00
Rhys Perry	a293fad4ef	aco: refactor repeated instruction disassembly This seems simpler to me. It should also work correctly when repeated instructions cross blocks. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7166>	2020-10-28 17:32:32 +00:00
Rhys Perry	ed2449d55b	aco: move individual instruction disassembly to its own helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7166>	2020-10-28 17:32:32 +00:00
Rhys Perry	483657de32	aco: use mubuf helper in select_gs_copy_shader Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6103>	2020-10-28 14:59:49 +00:00
Rhys Perry	ec7ecfe9cb	aco: use control flow creation helpers in select_gs_copy_shader Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6103>	2020-10-28 14:59:49 +00:00
Rhys Perry	57d977a23f	aco: round bytes_written to dwords if larger than 4 bytes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7276>	2020-10-28 10:56:27 +00:00
Rhys Perry	41839d38cf	aco: default to a definition size of 32 For non-arithmetic opcodes such as buffer_load_dword and buffer_load_short, default to a definition size of 32. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7276>	2020-10-28 10:56:27 +00:00
Daniel Schürmann	543f50789a	aco: implement nir_op_unpack_[64/32]_* Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6527>	2020-10-28 10:14:26 +00:00
Rhys Perry	26e53e3afa	aco: ignore the ACO-inserted continue in create_continue_phis() Otherwise, for loops without continue_or_break, create_continue_phis() always returns an undef operand. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `638cbc21a1` ("aco: handle when ACO adds new continue edges") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2848 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7148>	2020-10-27 19:53:38 +00:00
Rhys Perry	437995bb70	aco: remove all-undef phi opt This doesn't look like it would create correct IR for 8/16-bit phis and doesn't seem to help anything. If we ever want to do this, it's probably better done in nir_opt_remove_phis(). No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	70ff262cda	aco: use v_mov_b32_sdwa for some 16-bit constants Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	b882598ee1	aco: remove some unused optimizations These are unused now that we almost always use p_parallelcopy for simple copies. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	d20a752c0d	aco: use Builder::copy more fossil-db (Navi): Totals from 6973 (5.07% of 137413) affected shaders: SGPRs: 381768 -> 381776 (+0.00%) VGPRs: 306092 -> 306096 (+0.00%); split: -0.00%, +0.00% CodeSize: 24440844 -> 24421196 (-0.08%); split: -0.09%, +0.01% MaxWaves: 86581 -> 86583 (+0.00%) Instrs: 4682161 -> 4679578 (-0.06%); split: -0.06%, +0.00% Cycles: 68793116 -> 68261648 (-0.77%); split: -0.83%, +0.05% fossil-db (Polaris): Totals from 8154 (5.87% of 138881) affected shaders: VGPRs: 338916 -> 338920 (+0.00%); split: -0.00%, +0.00% CodeSize: 23540428 -> 23540488 (+0.00%); split: -0.00%, +0.00% MaxWaves: 49090 -> 49091 (+0.00%) Instrs: 4576085 -> 4576101 (+0.00%); split: -0.00%, +0.00% Cycles: 51720704 -> 51720888 (+0.00%); split: -0.00%, +0.00% Most of the Navi cycle/instruction changes are from 8/16-bit parallel-rdp shaders. They appear to be improved because the p_create_vector from lower_subdword_phis() was blocking constant propagation. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	e54c111c45	aco: always use p_parallelcopy for pre-RA copies Most fossil-db changes are because literals are applied earlier (in label_instruction), so use counts are more accurate and more literals are applied. fossil-db (Navi): Totals from 79551 (57.89% of 137413) affected shaders: SGPRs: 4549610 -> 4542802 (-0.15%); split: -0.19%, +0.04% VGPRs: 3326764 -> 3324172 (-0.08%); split: -0.10%, +0.03% SpillSGPRs: 38886 -> 34562 (-11.12%); split: -11.14%, +0.02% CodeSize: 240143456 -> 240001008 (-0.06%); split: -0.11%, +0.05% MaxWaves: 1078919 -> `1079281` (+0.03%); split: +0.04%, -0.01% Instrs: 46627073 -> 46528490 (-0.21%); split: -0.22%, +0.01% fossil-db (Polaris): Totals from 98463 (70.90% of 138881) affected shaders: SGPRs: 5164689 -> 5164353 (-0.01%); split: -0.02%, +0.01% VGPRs: 3920936 -> 3921856 (+0.02%); split: -0.00%, +0.03% SpillSGPRs: 56298 -> 52259 (-7.17%); split: -7.22%, +0.04% CodeSize: 258680092 -> 258692712 (+0.00%); split: -0.02%, +0.03% MaxWaves: 620863 -> 620823 (-0.01%); split: +0.00%, -0.01% Instrs: 50776289 -> 50757577 (-0.04%); split: -0.04%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	6db5fbf9f2	aco: allow literals on sub-dword p_parallelcopy Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	74e2e9b682	aco: don't use bld.copy() in handle_operands() No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	a834d9ef86	aco: expand vectors passed as copy operands Most copies which hit this case use p_create_vector, but in the future p_parallelcopy will be used instead. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	e092f34dfa	aco: copy-propgate through p_create_vector during value numbering fossil-db (Navi): Totals from 182 (0.13% of 137413) affected shaders: SGPRs: 9304 -> 9312 (+0.09%) VGPRs: 7636 -> 7620 (-0.21%); split: -0.26%, +0.05% CodeSize: 733516 -> 733092 (-0.06%); split: -0.07%, +0.01% MaxWaves: 2478 -> 2479 (+0.04%) Instrs: 139664 -> 139561 (-0.07%); split: -0.09%, +0.02% Cycles: 3215104 -> 3214080 (-0.03%); split: -0.04%, +0.01% fossil-db (Polaris): Totals from 161 (0.12% of 138881) affected shaders: VGPRs: 5608 -> 5596 (-0.21%); split: -0.29%, +0.07% CodeSize: 605336 -> 605120 (-0.04%); split: -0.05%, +0.02% Instrs: 117957 -> 117902 (-0.05%); split: -0.07%, +0.02% Cycles: 3105008 -> 3103876 (-0.04%); split: -0.04%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	0f31fa1b64	aco: skip value numbering of copies Instead, copy-propagate through and remove them. This improves value numbering in this situation: a = ... b = copy a c = copy a use(b) use(c) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	72b307a338	aco: don't do divergent break+discard If the shader does: loop { if (divergent) discard else a() b() } then a()'s block will dominate b()'s block in the logical CFG, but not the linear CFG. This will cause value numbering to try to combine SLAU from a() and b(). This didn't happen with break/continue because sanitize_if() would move a() out of the branch. Using sanitize_if() to fix this doesn't look easy, because discards are not control flow instructions in NIR. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
Rhys Perry	d4503a9020	aco: update phi_map in add_subdword_operand() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `56345b8c61` ("aco: allow reading/writing upper halves/bytes when possible") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7216>	2020-10-27 15:24:38 +00:00
James Park	23fb54bf7f	aco: Clean up some C++ usages Iterate over maps by reference to avoid copies. Replace find/insert with insert to avoid double search. Use range-based for loop, avoiding copies by reference. Delete comment. Erase by iterator instead of key to avoid repeat search. Iterators unneeded to modify unwaited_instrs. Use range-based for loop. Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7285>	2020-10-27 14:57:16 +00:00
Daniel Schürmann	cb12879401	aco: fix GFX8 16-bit packing def.physReg() was uninitialized. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `d96f387e7a` ('aco: improve code sequences for 16bit packing') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7334>	2020-10-27 12:56:14 +01:00
Rhys Perry	27ce5d921e	aco: remove isel_context::allocated Now that we have Program::temp_rc, we can replace it with the first temporary id allocated for NIR's ssa defs. No fossil-db changes on Navi. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7067>	2020-10-26 15:14:32 +00:00
Daniel Schürmann	cf083f1d02	aco: use do_pack() for self-intersecting operations. This improves the code for GFX8+, but is slightly worse for GFX6_7. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7189>	2020-10-26 12:21:13 +00:00
Daniel Schürmann	d96f387e7a	aco: improve code sequences for 16bit packing This includes using alignbyte for GFX6 and GFX7, and 32-bit instructions for GFX8. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7189>	2020-10-26 12:21:13 +00:00
Daniel Schürmann	40bfb08828	aco: refactor GFX6_7 subdword copy lowering The new code uses alignbyte which leads to shorter code and preserves the operand's registers. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7189>	2020-10-26 12:21:13 +00:00
Samuel Pitoiset	4e2fe34aa9	aco: fix determining if LOD is zero for nir_texop_txf/nir_texop_txs txf/txs expects LOD to be a 32-bit unsigned integer while other texture operations expects a float. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3668 Fixes: `93c8ebfa78` ("aco: Initial commit of independent AMD compiler") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7256>	2020-10-22 11:30:43 +00:00
Samuel Pitoiset	eb6877d3af	radv,aco: fix use of texop_samples_identical in the resolve meta path The return value of this texture intrinsic should be a NIR 1-bit bool. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7236>	2020-10-21 13:06:53 +02:00
Tony Wasserka	fd038132de	aco/isel: Miscellaneous cleanups using the new Stage API Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7094>	2020-10-21 09:49:38 +00:00
Tony Wasserka	34bc9477de	aco: Clean up symbol names and comments related to NGG Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7094>	2020-10-21 09:49:38 +00:00
Tony Wasserka	86c227c10c	aco: Use strong typing to model SW<->HW stage mappings Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7094>	2020-10-21 09:49:38 +00:00
Bas Nieuwenhuizen	76421667ec	aco: Add VK_KHR_shader_terminate_invocation support. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7226>	2020-10-20 22:53:08 +00:00
Samuel Pitoiset	4ca1030774	radv: move all NIR pass outside of ACO This has several advantages: - it generates roughly the same NIR for both compiler backends (this might help for debugging purposes) - it might allow to move around some NIR pass to improve compile time - it might help for RadeonSI support - it improves fossils-db stats for RADV/LLVM (this shouldn't matter much but it's a win for free) fossil-db (Navi/LLVM): Totals from 80732 (59.18% of 136420) affected shaders: SGPRs: 5390036 -> 5382843 (-0.13%); split: -3.38%, +3.24% VGPRs: 3910932 -> 3890320 (-0.53%); split: -2.38%, +1.85% SpillSGPRs: 319212 -> 283149 (-11.30%); split: -17.69%, +6.39% SpillVGPRs: 14668 -> 14324 (-2.35%); split: -7.53%, +5.18% CodeSize: 265360860 -> 267572132 (+0.83%); split: -0.47%, +1.30% Scratch: 5338112 -> 6134784 (+14.92%); split: -2.65%, +17.57% MaxWaves: 1077230 -> 1086902 (+0.90%); split: +2.79%, -1.90% No fossils-db changes on RADV/ACO. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7077>	2020-10-20 10:21:39 +00:00
Timur Kristóf	d8435c1628	aco/ngg: Add assertion to make sure we always know the vertex count. Just a sanity check to avoid hangs caused by missing this in the future. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7213>	2020-10-20 07:11:29 +00:00
James Park	af8d488ea5	util,ac,aco,radv: Cross-platform memstream API POSIX memstream is not available on Windows. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7143>	2020-10-19 03:37:42 -07:00
Rhys Perry	fdb65b8b23	aco: add missing SCC clobber in get_buffer_size Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `fcd6d83245` ("aco: fix imageSize()/textureSize() with large buffers on GFX8") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7162>	2020-10-15 21:11:45 +00:00
Rhys Perry	d75d12f507	aco: don't use v_pack_b32_f16 if 16-bit input denormals are flushed Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7111>	2020-10-15 11:33:42 +00:00
Rhys Perry	d4b3e869ee	aco: propagate literals into sub-dword pseudo instructions on GFX9+ Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7111>	2020-10-15 11:33:42 +00:00
Rhys Perry	1a652244e4	aco: implement 16-bit literals We can copy any value into a 16-bit subregister with a 3 dword v_pack_b32_f16 on GFX10 or a v_and_b32+v_or_b32 on GFX9. Because the generated code can depend on the register assignment and to improve constant propagation, Builder::copy creates a p_create_vector in the case of sub-dword literals. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7111>	2020-10-15 11:33:42 +00:00
Tony Wasserka	d5a72319d6	aco/isel: Remove now unused VS-related code from create_null_export Also replaced a hardcoded constant with the appropriate register macro. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7102>	2020-10-14 16:22:51 +00:00
Tony Wasserka	c22c702f35	aco/isel: Remove some dead code exported_pos was always initialized to true (due to the is_pos argument of the first export_vs_varying call being true), so none of this code has any effect. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7102>	2020-10-14 16:22:51 +00:00
Tony Wasserka	bf51b11c04	aco/isel: Always export position data from VS/NGG AMD ISA docs explicitly require this for VS, and this likely extends to NGG too. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3615 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7102>	2020-10-14 16:22:51 +00:00
Daniel Schürmann	f29c81f863	aco: use VOP2 for v_cvt_pkrtz_f16_f32 if possible This patch also does a slight rework of export_fs_mrt_color() to avoid setting of enabled channels which are not used. Totals from 52404 (38.38% of 136546) affected shaders (NAVI): SGPRs: 3097443 -> 3097435 (-0.00%) CodeSize: 189151600 -> 188546200 (-0.32%) Instrs: 36445061 -> 36445104 (+0.00%); split: -0.00%, +0.00% Cycles: 1739388020 -> 1739388192 (+0.00%); split: -0.00%, +0.00% VMEM: 21071501 -> 21071665 (+0.00%); split: +0.00%, -0.00% SMEM: 3470983 -> 3470982 (-0.00%); split: +0.00%, -0.00% PreSGPRs: 2058965 -> 2058962 (-0.00%) PreVGPRs: 1860294 -> 1860295 (+0.00%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6777>	2020-10-14 15:31:38 +00:00

1 2 3 4 5 ...

1028 commits