fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 19:58:19 +02:00

Author	SHA1	Message	Date
Eric Anholt	00910e3057	broadcom/vc5: Don't annotate dumps with stale live intervals. As you're debugging register allocation, you may have changed the intervals and not recomputed yet. Just skip the dump in that case.	2018-03-19 16:44:20 -07:00
Eric Anholt	facc3c6f58	broadcom/vc5: Add support for register spilling. Our register spilling support is nice to have since vc4 couldn't at all, but we're still very restricted due to needing to not spill during a TMU operation, or during the last segment of the program (which would be nice to spill a value of, when there's a long-lived value being passed through with little modification from the start to the end). We could do better by emitting unspills for the last-segment values just before the last thrsw, since the last segment is probably not the maximum interference area. Fixes GTF uniform_buffer_object_arrays_of_all_valid_basic_types and 3 others.	2018-03-19 16:44:06 -07:00
Eric Anholt	271fc58ba1	broadcom/vc5: Remove redundant last_inst lookup. The point was to get the MOV, which the MOV_dest already returned.	2018-03-19 16:42:59 -07:00
Eric Anholt	34dc64f627	broadcom/vc5: On QPU pack error, dump the instruction and return cleanly. This is nice for debugging when you've made a bad instruction.	2018-03-19 16:42:59 -07:00
Eric Anholt	d721348dcd	broadcom/vc5: Add cursors to the compiler infrastructure, like NIR's. This will let me do lowering late in compilation using the same instruction builder as we use in nir_to_vir.	2018-03-19 16:42:59 -07:00
Eric Anholt	c81d681742	broadcom/vc5: Move the umul macro to a header. Anywhere we want to multiply, we probably want this.	2018-03-19 16:42:59 -07:00
Eric Anholt	9e28c18cd1	broadcom/vc5: Correct the arg count of TIDX/EIDX.	2018-03-19 16:42:59 -07:00
Eric Anholt	55bf298333	broadcom/vc5: Re-do live variables after removing thrsws. Otherwise our start/ends ips won't line up with the actual instructions.	2018-03-19 16:42:59 -07:00
Eric Anholt	c3a504f470	broadcom/vc5: Add a QPU helper for instructions using the TLB. This will be used for detecting last thread segment in register spilling.	2018-03-19 16:42:59 -07:00
Eric Anholt	09c4dd1971	broadcom/vc5: Introduce v3d_qpu_reads_vpm()/v3d_qpu_writes_vpm(). These helpers will be used in register spilling to determine where to add a last thrsw if needed, and might help refactor QPU scheduling.	2018-03-19 16:42:59 -07:00
Eric Anholt	407f21ef1b	broadcom/vc5: The ldvpm signal also a case of using the VPM. The QPU scheduling code calling this function already separately checked this signal.	2018-03-19 16:42:59 -07:00
Eric Anholt	4760040c09	broadcom/vc5: Extract v3d_qpu_writes_tmu() helper. This will be reused in register spilling.	2018-03-19 16:42:59 -07:00
Timothy Arceri	a050ea60ee	nir: add lower_ldexp to nir compiler options Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-02-28 09:23:49 +11:00
Eric Anholt	e29988c908	broadcom/vc5: Fix "hardwrae" typo in a field name in XML.	2018-02-05 13:53:38 +00:00
Eric Anholt	8bb000f460	broadcom/vc5: Try to merge more than 2 QPU instructions together. Obviously it would be good to have an ADD and a MUL and a signal together, but we can even potentially have multiple signals merged, as well. total instructions in shared programs: 100423 -> 97874 (-2.54%) instructions in affected programs: 78812 -> 76263 (-3.23%)	2018-02-05 09:29:37 +00:00
Eric Anholt	dc78643ace	broadcom/vc5: Remove no-op MOVs after register allocation. We emit some MOVs to track lifetimes of payload registers, but we don't need there to be actual MOV instructions for them. total instructions in shared programs: 101045 -> 100423 (-0.62%) instructions in affected programs: 37083 -> 36461 (-1.68%)	2018-02-05 09:29:37 +00:00
Eric Anholt	f3978a7380	broadcom/vc5: Add missing shader-db instruction counting. I must have misplaced it in the instruction packing rework.	2018-02-05 09:29:37 +00:00
Eric Anholt	353b42ccc7	broadcom/vc5: Fix a segfault on mix of booleans. We don't have a src1 to look up if the compare instruction is "i2b".	2018-02-01 11:02:29 -08:00
Timothy Arceri	9a2e085680	nir: add lower_all_io_to_temps flag This will be used for freedreno and vc4 which require all inputs and outputs to be copied to temps. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-31 09:14:08 +11:00
Eric Anholt	71c7e9bea1	broadcom/vc5: Enable CLIF dumping of V3D 4.2.	2018-01-27 19:04:21 +11:00
Eric Anholt	91f899cbc1	broadcom/vc5: Update the compiler for V3D 4.2.	2018-01-27 19:04:21 +11:00
Eric Anholt	f2e41daac5	broadcom/vc5: Update QPU instruction pack/unpack for v4.2. After the 4.1 spec, 4.2 retroactively renamed patchid to barrierid because it's used for other barriers in compute.	2018-01-27 19:03:55 +11:00
Eric Anholt	96d3e8f134	broadcom/vc5: Add XML for V3D 4.2.	2018-01-27 18:57:58 +11:00
Eric Anholt	b026063b16	broadcom/vc5: Fix a race between XML codegen build and CLIF build.	2018-01-27 18:57:58 +11:00
Eric Anholt	de60ea4432	Android: Attempt to fix broadcom build after vc5 changes.	2018-01-27 18:03:58 +11:00
Dylan Baker	436ed65d38	autotools: include meson build files in tarball This adds the meson.build, meson_options.txt, and a few scripts that are used exclusively by the meson build. v2: - Remove accidentally included changes needed to test make dist with LLVM > 3.9 Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Acked-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-19 16:30:51 -08:00
Emil Velikov	393cf04fa4	broadcom: add missing headers to the tarball Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-18 11:21:35 +00:00
Eric Anholt	5bc0b63799	broadcom/vc5: Use MSF to ignore discards/non-dispatched channels in loops. Prevents potential infinite loops when a non-dispatched or discarded channel never triggers the loop break condition.	2018-01-12 21:58:24 -08:00
Eric Anholt	762dd52951	broadcom/vc5: Use XOR instead of SUB for execute flags comparisons. I think this should be equivalent other than power, and it's the kind of comparison we use for nir_op_ieq.	2018-01-12 21:58:18 -08:00
Eric Anholt	8e4cba9d92	broadcom/vc5: Also check the update flags for avoiding DCE. I was trying to do a NULL-destination UF, and it got removed.	2018-01-12 21:58:11 -08:00
Eric Anholt	aa77a9cf5a	broadcom/vc5: Rename V3D 3.x Flat Shade Action to match v4.x naming. Now that the actions are reused for centroid and nonperspective, give them a more generic name.	2018-01-12 21:57:45 -08:00
Eric Anholt	368bab43fd	broadcom/vc5: Add support for loading varyings in V3D 4.1. The LDVARY signal now writes an arbitrary register, so I took out the magic src register file and replaced it with an instruction with LDVARY set so we have somewhere to hang a QFILE_TEMP destination for register allocation.	2018-01-12 21:57:21 -08:00
Eric Anholt	5aaea3c4a0	broadcom/vc5: Add compiler support for V3D 4.x texturing.	2018-01-12 21:56:57 -08:00
Eric Anholt	028f6b327c	broadcom/vc5: Add the new TMU write addresses for V3D 4.x (and r5rep). The V3D 3.x series of TMU writes with meaning depending on the texture type is replaced with writes to specific registers for each texture argument semantic.	2018-01-12 21:56:48 -08:00
Eric Anholt	42a35da96d	broadcom/vc5: Move V3D 3.3 texturing to a separate file. V3D 4.x texturing changes enough that #ifdefs would just make a mess of it.	2018-01-12 21:56:37 -08:00
Eric Anholt	acf30e4916	broadcom/vc5: Move V3D 3.3 VPM write setup to a separate file. For V4.1 texturing, I need the V4.1 XML, so the main compiler needs to stop including V3.3 XML.	2018-01-12 21:56:24 -08:00
Eric Anholt	34898c8c45	broadcom/vc5: Add support for V3D 4.1 CLIF dumping.	2018-01-12 21:55:49 -08:00
Eric Anholt	409696b76e	broadcom/vc5: Move the body of CLIF dumping to a per-version file. I want the library's entrypoints to still be unversioned, but the actual packet dumping needs to be per-version.	2018-01-12 21:55:38 -08:00
Eric Anholt	90269ba353	broadcom/vc5: Use THRSW to enable multi-threaded shaders. This is a major performance boost on all of V3D, but is required on V3D 4.x where shaders are always either 2- or 4-threaded.	2018-01-12 21:55:30 -08:00
Eric Anholt	86a12b4d5a	broadcom/vc5: Properly schedule the thread-end THRSW. This fills in the delay slots of thread end as much as we can (other than being cautious about potential TLBZ writes). In the process, I moved the thread end THRSW instruction creation to the scheduler. Once we start emitting THRSWs in the shader, we need to schedule the thread-end one differently from other THRSWs, so having it in there makes that easy.	2018-01-12 21:55:23 -08:00
Eric Anholt	a075bb6726	broadcom/vc5: Implement GFXH-1684 workaround. Apparently the VPM writes need to be flushed out before we end the shader.	2018-01-12 21:55:15 -08:00
Eric Anholt	f50d39ab49	broadcom/vc5: Add a test for .ifb in ADD ops. I had a .ifb being decoded weird in sampid, so this is to check that .ifb is fine.	2018-01-12 21:54:57 -08:00
Eric Anholt	267f13dbee	broadcom/vc5: Add the new tesselation opcodes in V3D 4.1.	2018-01-12 21:54:50 -08:00
Eric Anholt	edbd817c30	broadcom/vc5: Use a physical-reg-only register class for LDVPM. This is needed for LDVPM on V3D 4.x, but will also be needed for keeping values out of the accumulators across THRSW.	2018-01-12 21:54:42 -08:00
Eric Anholt	22a02f3e34	broadcom/vc5: Use the new LDVPM/STVPM opcodes on V3D 4.1. Now, instead of a magic write register for VPM stores we have an instruction to do them (which means no packing of other ALU ops into it), with the ability to reorder the VPM stores due to the offset being baked into the instruction. VPM loads also gain the ability to be reordered by packing the row into the A argument. They also no longer write to the r3 accumulator, and instead must be stored to a physical register.	2018-01-12 21:54:33 -08:00
Eric Anholt	55f8a01aca	broadcom/vc5: Drop dead VC5_QPU_* defines from qpu_instr.c. I had all the packing code in this file at one point, but these defines now live in qpu_pack.c.	2018-01-12 21:54:27 -08:00
Eric Anholt	2bd378647b	broadcom/vc5: Add support for QPU pack/unpack/disasm of small immediates.	2018-01-12 21:54:18 -08:00
Eric Anholt	c81cc767e4	broadcom/vc5: Drop signal bit #defines. Signals are more complicated than that, and tables ended up being better.	2018-01-12 21:53:53 -08:00
Eric Anholt	dfee62eed3	broadcom/vc5: Add support for V3Dv4 signal bits. The WRTMUC replaces the implicit uniform loads in the first two texture instructions. LDVPM disappears in favor of an ALU op. LDVARY, LDTMU, LDTLB, and LDUNIF*RF now write to arbitrary registers, which required passing the devinfo through to a few more functions.	2018-01-12 21:53:45 -08:00
Eric Anholt	81ec2ba229	broadcom/vc5: Fix pack/unpack of vfmul input unpack flags.	2018-01-12 21:53:38 -08:00

1 2 3 4

153 commits