fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-29 16:18:20 +02:00

Author	SHA1	Message	Date
David Heidelberg	68215332a8	build: pass licensing information in SPDX form Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Dylan Baker <dylan.c.baker@intel.com> Acked-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29972>	2024-06-29 12:42:49 -07:00
Karmjit Mahil	9164ea7032	freedreno/isa: Fix isaspec map for a3xx-ld When LDP uses a negative offset (which is valid), since `struct ir3_register` uses `{i,u}nt32_t` for the immediate values, using `extract_reg_uim()` wasn't sign extending negative immediate values. Addresses: ``` src/freedreno/isa/encode.h:84: pack_field: Assertion '!(( val & ~BITFIELD64_MASK(1 + high - low)) && (~val & ~BITFIELD64_MASK(1 + high - low)))' failed. ``` seen in https://gitlab.freedesktop.org/mesa/mesa/-/issues/11153 . Signed-off-by: Karmjit Mahil <karmjit.mahil@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29768>	2024-06-19 12:38:53 +00:00
Danylo Piliaiev	e9c764c825	freedreno/ir3: mova has special meaning for (r) flag It prevents the hazard when in the following case: ldc.1.k.imm c[a1.x], 0, 1 (ss)mova1 a1.x, 8 The correct way is: ldc.1.k.imm c[a1.x], 0, 1 (ss)mova1 a1.x, (r)8 Without it ldc may use a1.x which is set after ldc. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27462>	2024-06-18 16:52:31 +00:00
Job Noorman	759a4679a3	ir3: add encoding of ldib/stib offsets Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28664>	2024-06-14 17:12:59 +00:00
Job Noorman	455ebcccfb	ir3: add encoding for isam.v isam.v is a version of isam that can load multiple components from IBOs. It uses some bits that are used for different purposes in other tex instructions: - bit 50 (.v): .s elsewhere - bit 53 (indicates whether an immediate offset is used): .p elsewhere - bit 18 (.1d when not set, has to be set for .v): 0 elsewhere For this reason, the bitset hierarchy for cat5 had to be reordered a bit. The immediate offset is encoded as an extra (immed) source register and an instruction flag (to be able to make the distinction between offset zero and no offset, although this might not be useful). This also adds a flag for the .1d field. Since this bit is active-low, this flag has inverted semantics: setting it will make .1d inactive. Note that some existing disassembler tests for isam had to be updated because the bit is never set and this is now disassembled as .1d. This matches the blob's disassembler. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28664>	2024-06-14 17:12:59 +00:00
Connor Abbott	736570b74d	ir3: Add support for ldc.u This will be important for using shared registers as much as possible. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22075>	2024-04-26 12:55:13 +00:00
Job Noorman	2288ef916c	ir3: model predt/predf without sources We used to model predt/predf as taking a predicate register source. The blob disassembler shows them taking a label argument. However, it seems that both are incorrect: the condition is always taken from p0.x and I have not been able to construct a test case where the label makes any difference. This patch changes predt/predf to not take any arguments and adds documentation about how predicated execution works. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27982>	2024-04-23 19:18:29 +00:00
Job Noorman	c37e9c1e29	ir3-disasm: add option to disassemble hex number Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28457>	2024-04-04 19:37:25 +00:00
Job Noorman	7eeb781c8b	ir3-disasm: add options to specify GPU by chip ID or name Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28457>	2024-04-04 19:37:25 +00:00
Job Noorman	86468ab8af	ir3-disasm: remove unused #includes Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28457>	2024-04-04 19:37:25 +00:00
Job Noorman	b9d2dd0788	ir3-disasm: run clang-format Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28457>	2024-04-04 19:37:25 +00:00
Christian Gmeiner	e0ca29e7a3	isaspec: deocde: Remove generic functions from public interface This will switch everyone to the isa specific functions. Fixes the output of etnaviv's pre_instr_cb callback if freedreno and etnaviv are built at the same time. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28176>	2024-03-21 07:51:18 +00:00
Christian Gmeiner	3f2295d99b	isaspec: decode: Add libisaspec Create a static library that just contains isa_print(..). We need to do this step to make lto happy. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28176>	2024-03-21 07:51:18 +00:00
Christian Gmeiner	f396899983	freedreno/isa: Rework meson dependency for libir3decode Any component that links against libir3decode should not need to take care if the generated isa files exists. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28176>	2024-03-21 07:51:18 +00:00
Christian Gmeiner	381d19d138	isaspec: encode: Constify encode.type Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27965>	2024-03-05 07:29:08 +00:00
Job Noorman	49b2fbe2f0	ir3: remove comp1/2 from cat0 Just take the component values from the source registers. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>	2024-03-01 13:45:10 +00:00
Job Noorman	a720eef12d	ir3: remove OPC_B and brtype from cat0 We currently have a bit of a confusing situation where we have both opcodes for the different branches (OPC_BR, OPC_BRAA,...) and branch types which are supposed to be used with OPC_B (BRANCH_PLAIN, BRANCH_AND,...). However, not every kind of branch has a corresponding type. For example, getone is represented by OPC_GETONE instead of a branch type. This patch proposes to get rid of the branch types and use opcodes everywhere. I think this makes the representation of branches more consistent. It also removes the need for the encoder to translate branch types into opcodes. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>	2024-03-01 13:45:10 +00:00
Danylo Piliaiev	bc6b847017	ir3: Add ldg.k instruction Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26934>	2024-02-12 22:05:13 +00:00
Danylo Piliaiev	94af08421b	ir3: Fix values of #wrmask not being compatible with ir3 parser IR3 parser expects wrmask values to be in xyzw order. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25661>	2023-10-11 18:35:32 +00:00
Danylo Piliaiev	99457286c9	ir3/a7xx: Add ccinv instruction _Presumably_ invalidates workgroup-wide cache for image/buffer data access. So while "fence" is enough to synchronize data access inside a workgroup, for cross-workgroup synchronization we have to invalidate that cache. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23217>	2023-09-05 16:19:30 +00:00
Danylo Piliaiev	5f89ce8799	ir3/a7xx: Don't multiply global mem instruction's offset by 4 a7xx global memory instructions don't have implied shift. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23217>	2023-09-05 16:19:29 +00:00
Danylo Piliaiev	5d0d5108d7	ir3/a7xx: cat5 mode1 has swapped tex/samp ids Though blob is not seen to even use mode1 on a740, it uses S2EN variant instead. Fixes: dEQP-VK.binding_model.descriptor_buffer.multiple.* dEQP-VK.binding_model.descriptor_buffer.embedded_imm_samplers.* dEQP-VK.pipeline.monolithic.descriptor_limits.compute_shader.* Adapted from Jonathan Marek's changes. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23217>	2023-09-05 16:19:29 +00:00
Connor Abbott	6aabdb7a57	ir3: Parse (eq) flag Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24433>	2023-08-10 10:09:27 +00:00
Yonggang Luo	3b731d92d9	freedreno: decouple compiler and vulkan driver from gallium Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23438>	2023-08-03 07:29:36 +00:00
Connor Abbott	2faf344f03	isaspec: Rename isa_decode() to isa_disasm() This actually disassembles the binary, and we will add a function that actually decodes it to the same structure that the encoder uses. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23949>	2023-07-28 18:41:58 +00:00
Connor Abbott	26cce0a133	isaspec: Add callback after decoding an instruction This will be used by afuc for printing register decodings in a comment. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23949>	2023-07-28 18:41:58 +00:00
Danylo Piliaiev	4dd15177d0	ir3: documents (ss) flag for cat7 instructions Blob produces "lock" instructions with (ss), so our past guess that cat7 supports (ss) is true. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21498>	2023-04-27 21:06:47 +00:00
Mark Collins	85c6c9068a	ir3/a7xx: Add definitions for (last) src GPR attribute A new attribute on source GPRs reflecting if a certain usage of a value is the last usage of it was added in A7xx. This is seemingly a performance hint and doesn't affect anything when not applied. Signed-off-by: Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21498>	2023-04-27 21:06:47 +00:00
Danylo Piliaiev	1613d767c1	ir3/a7xx: Document "alias" instruction Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21498>	2023-04-27 21:06:47 +00:00
Danylo Piliaiev	b909eda0b3	ir3: Document that stc has higher DST upper bound than we defined Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21498>	2023-04-27 21:06:47 +00:00
Danylo Piliaiev	11b2c54a9a	ir3/a7xx: Add STSC definition STore Shared Const - loads SIZE dwords from HLSQ_SHARED_CONSTS_IMM starting from HLSQ_SHARED_CONSTS_IMM[SRC] and writing them to c[DST] Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21498>	2023-04-27 21:06:47 +00:00
Danylo Piliaiev	80f878b103	ir3/a7xx: Add new form of stg.a/ldg.a addressing The new stg.a/ldg.a addressing form supersedes the a6xx's one. The new form is: ldg.a.f32 r4.y, g[c0.z+r4.y+2], 4 There is no shift, compared to the a6xx form: ldg.a.f32 r4.y, g[r0.z+(r4.y)<<2], 4 Also on a7xx the first src is allowed to be both const and gpr. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21498>	2023-04-27 21:06:46 +00:00
Danylo Piliaiev	3b0daf29e5	ir3/a7xx: Add new lock/unlock CS instructions Seen at the end of every compute shader: %shader_assembly% lock unlock end Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21498>	2023-04-27 21:06:46 +00:00
Danylo Piliaiev	52ee3943eb	ir3/a7xx: NOPs may have some no-op bits set [00000001x_00000000x] nop ; dontcare bits in nop: 0000000100000000 [00000002x_00000000x] nop ; dontcare bits in nop: 0000000200000000 Doesn't seem to make them different from ordinary nops. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21498>	2023-04-27 21:06:46 +00:00
Danylo Piliaiev	e6f5480180	ir3: Add cat7 sleep instruction Has short and long variants; the long one seems to be ~20 times longer. The exact difference between it and a bunch of nops is unknown. The emission of this instruction was not observed in the wild. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14419>	2023-02-21 19:59:14 +00:00
Danylo Piliaiev	121e4ca87d	ir3: Add cat5/cat7 cache related instructions - tcinv - Likely Texture Cache Invalidate (unverified) - icinv - Mostly sure that it is Instruction Cache Invalidate - dccln - Data Cache Clean - dcinv - Data Cache Invalidate - dcflu - Data Cache Flush The emission of these instructions was not observed in the wild. TODO: find out the difference between .shr and .all modes of dccln, dcinv, dcflu. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14419>	2023-02-21 19:59:14 +00:00
Amber	228d812a0c	ir3, isaspec: add raw instruction to assembler/disassembler. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20789>	2023-01-26 14:26:11 +00:00
Jason Ekstrand	e8945a8ce6	isaspec: Stop depending on glue headers and out-of-folder C files The way the isaspec decoder used to work was that it would generate a header and a C file, each with ISA-specific stuff in it. Then that would get built together with a stand-alone decode.c file which lives in the isaspec folder, not the driver's folder. In order for decode.c to find the ISA-specific headers, it would also generate a glue header which had to be named isaspec-isa.h. This effectively meant that you can't have multiple isaspec definitions in the same folder. To solve this, we do it the other way around and make the generated header and C files include the stand-alone files. This is a bit awkward because it means including a C file from another C file but it's better for the build system. Acked-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20525>	2023-01-05 18:21:02 +00:00
Jason Ekstrand	4953a8db25	isaspec: Use argparse This also cleans up some of our python script execution conventions and handles mako errors better. Copied a bit from vk_entrypoints_gen.py. Acked-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20525>	2023-01-05 18:21:02 +00:00
Christian Gmeiner	912d0383b4	isaspec: Move isa_decode(..) declaration The implementation of isa_decode(..) is already part of isaspec. So let's move the function declaration and some related structs to src/isaspec. Also make the header C++ safe. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18403>	2022-09-03 19:26:04 +00:00
Connor Abbott	acba08b58f	ir3: Implement and document ldc.k Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	944f4e6f8a	ir3: Better assemble/disassemble stc Add in the type, even though it turns out to not be that useful. Add in support for assembling it. Add some notes based on computerator experiments. And add support for the indirect a1.x mode that's needed for storing c64.x and later. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Ilia Mirkin	65c4b6a4c6	freedreno/ir3: document GETINFO's x/y results The zw were already known, but throw them in here too. I'm not extremely happy with the description of "y", feels like there's a simpler explanation there, but I couldn't find it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14672>	2022-02-21 02:09:19 +00:00
Ilia Mirkin	3d41414d26	freedreno/ir3: split up load/store/atomic by generation Some bits are slightly different on a4xx. Use the encodings that work. Perhaps these can be combined at some point if we get a proper understanding of what they mean. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:11 -05:00
Ilia Mirkin	b91b036322	isaspec: add gen-based leaf bitset separation This is necessary for some ops which have slightly different encoding on a4xx/a5xx, but are otherwise identical. This helps keeping the compiler from having to worry about these details and creating separate ops. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:07 -05:00
Emma Anholt	3d5ee08c15	freedreno/isaspec: Add missing dep of encode.py/decode.py calls on isa.py Fixes: #5921 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14725>	2022-02-02 11:21:56 -08:00
Danylo Piliaiev	c1d5c318bc	ir3: New cat3 instructions * shrm - (src2 >> src1) & src3 * shlm - (src2 << src1) & src3 * shrg - (src2 >> src1) | src3 * shlg - (src2 << src1) | src3 * andg - (src2 & src1) | src3 * dp2acc - dot product of two {i,u}8vec2 packed into SRC1 and SRC2, added to 32b SRC3 * dp4acc - dot product of two {i,u}8vec4 packed into SRC1 and SRC2, added to 32b SRC3 * wmm - vec4(x_1, x_2, x_3, x_4) * (y_1 + y_2 + y_3 + y_4), which is duplicated (1 << (SRC3 / 32)) times starting from DST register * wmm.accu - same as wmm but result is added to DST registers, however the first reg in each vec4 result is overwritten instead of accumulating. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13986>	2022-01-10 13:20:39 +02:00
Vinson Lee	1d6f6f9102	ir3: Make shift operand 64-bit. Fix defect reported by Coverity Scan. Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) overflow_before_widen: Potentially overflowing expression 2 << W with type int (32 bits, signed) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type uint64_t (64 bits, unsigned). Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14258>	2021-12-22 01:19:46 +00:00
Danylo Piliaiev	d1c49901df	ir3: Add gen4 new subgroup instructions * getlast.w8 #4 - Perform jump for the first (CLUSTER_SIZE-1) fibers in a subgroup * brcst.active.w8 - necessary to implement arithmetic subgroup operations with prefix sum. * quad_shuffle.brcst - subgroupQuadBroadcast * quad_shuffle.horiz - subgroupQuadSwapHorizontal * quad_shuffle.vert - subgroupQuadSwapVertical * quad_shuffle.diag - subgroupQuadSwapDiagonal * getfiberid - gl_SubgroupID Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	5d5b1fc472	freedreno/ir3: add a6xx global atomics and separate atomic opcodes Separating atomic opcodes makes it possible to express a6xx global atomics which take iova in SRC1. They would be needed by VK_KHR_buffer_device_address. The change also makes it easier to distinguish atomics in conditions. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8717>	2021-11-23 18:26:37 +00:00
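
Note on commit 1d6f6f9102 ("ir3: Make shift operand 64-bit"): the Coverity OVERFLOW_BEFORE_WIDEN defect quoted in that message can be illustrated with a minimal sketch. The helper names below are hypothetical and are not taken from the Mesa tree, and the narrow variant uses uint32_t rather than the signed int from the report so that the demonstration stays well-defined (overflow in a signed shift is undefined behaviour in C).

```c
#include <stdint.h>

/* Hypothetical helper: the shift happens in 32-bit arithmetic and the
 * result is widened afterwards, so bits shifted past bit 31 are lost.
 * Valid for w < 32. */
static uint64_t shift_then_widen(unsigned w)
{
   uint32_t narrow = (uint32_t)2 << w; /* wraps modulo 2^32 */
   return narrow;                      /* widening comes too late */
}

/* Hypothetical helper: the operand is made 64-bit before shifting,
 * which is the kind of fix the commit title describes. Valid for w < 64. */
static uint64_t widen_then_shift(unsigned w)
{
   return UINT64_C(2) << w;
}
```

For small shift counts the two helpers agree; for w = 31 the first one wraps to 0 while the second yields 2^32.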


102 commits