mirror of
https://gitlab.freedesktop.org/mesa/mesa.git
synced 2026-05-18 07:18:06 +02:00
This should be one clause (all of the instructions load from the same vertex buffer)
s_clause 0x2 ; bfa10002
tbuffer_load_format_xyzw v[8:11], v5, s[4:7], 0 format:[BUF_FMT_8_8_8_8_UNORM] idxen offset:36 ; e9c32024 80010805
tbuffer_load_format_xyzw v[12:15], v5, s[4:7], 0 format:[BUF_FMT_8_8_8_8_UNORM] idxen offset:16 ; e9c32010 80010c05
tbuffer_load_format_xyzw v[16:19], v5, s[4:7], 0 format:[BUF_FMT_8_8_8_8_UNORM] idxen offset:12 ; e9c3200c 80011005
s_clause 0x2 ; bfa10002
buffer_load_dwordx3 v[20:22], v5, s[4:7], 0 idxen ; e03c2000 80011405
buffer_load_dwordx3 v[23:25], v5, s[4:7], 0 idxen offset:20 ; e03c2014 80011705
buffer_load_dwordx4 v[28:31], v5, s[4:7], 0 idxen offset:48 ; e0382030 80011c05
tbuffer_load_format_xy v[0:1], v5, s[4:7], 0 format:[BUF_FMT_8_8_UNORM] idxen offset:32 ; e8712020 80010005
Foz-DB Navi21:
Totals from 5624 (7.08% of 79395) affected shaders:
MaxWaves: 149894 -> 149898 (+0.00%)
Instrs: 3032697 -> 3034853 (+0.07%); split: -0.05%, +0.12%
CodeSize: 15907852 -> 15915752 (+0.05%); split: -0.05%, +0.10%
VGPRs: 216248 -> 216144 (-0.05%)
Latency: 10955137 -> 11008760 (+0.49%); split: -0.22%, +0.70%
InvThroughput: 2032857 -> 2033916 (+0.05%); split: -0.03%, +0.08%
VClause: 50120 -> 41778 (-16.64%); split: -16.66%, +0.02%
SClause: 62034 -> 62004 (-0.05%); split: -0.33%, +0.29%
Copies: 253836 -> 254505 (+0.26%); split: -0.17%, +0.43%
VALU: 1621606 ->
|
||
|---|---|---|
| .. | ||
| check_output.py | ||
| framework.h | ||
| glsl_scraper.py | ||
| helpers.cpp | ||
| helpers.h | ||
| main.cpp | ||
| meson.build | ||
| README.md | ||
| test_assembler.cpp | ||
| test_builder.cpp | ||
| test_d3d11_derivs.cpp | ||
| test_hard_clause.cpp | ||
| test_insert_nops.cpp | ||
| test_insert_waitcnt.cpp | ||
| test_isel.cpp | ||
| test_lower_subdword.cpp | ||
| test_optimizer.cpp | ||
| test_optimizer_postRA.cpp | ||
| test_reduce_assign.cpp | ||
| test_regalloc.cpp | ||
| test_scheduler.cpp | ||
| test_sdwa.cpp | ||
| test_tests.cpp | ||
| test_to_hw_instr.cpp | ||
Tests are wrapped in a BEGIN_TEST/END_TEST and write data to the output file pointer. Tests have checks against the output. They are single line comments prefixed with certain characters:
!fails the test if the current line does not match the pattern>>skips to the first line which matches the pattern, or fails the test if there is none;executes python code to extend the pattern syntax by inserting functions into the variable dictionary, fail the test, insert more checks or consume characters from the output
Before this prefix, there can be a ~ to only perform the check for certain
variants (a regex directly following the ~ is used).
Pattern Syntax
Patterns can define variables which can be accessed in both python code and the pattern itself. These are useful for readability or dealing with unstable identifiers in the output. Variable identifiers are sequences of digits, ascii letters or _ (though they cannot start with a digit).
\can be used to match the following literal character without interpreting it.- Most characters expect the same characters in the output.
- A sequence of spaces in the pattern expects a sequence of spaces or tabs in the output.
- A
#in the pattern expects an unsigned integer in the output. The#can be followed by an identifier to store the integer in a variable. - A
$in the pattern stores the output until the first whitespace character into a variable. - A
%in the pattern followed by an identifier is the same as a#but it expects a%before the integer in the output. It basically matches a ACO temporary. - A
@calls a variable as a function. It can be followed by an argument string wrapped in(and).
Functions
s64,s96,s128,v2,v3, etc, expand to a pattern which matches a disassembled instruction's definition or operand. It later checks that the size and alignment is what's expected.match_funcexpands to a sequence of$and inserts functions with expand to the extracted outputsearch_reconsumes the rest of the line and fails the test if the pattern is not found