fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-27 01:28:12 +02:00

Author	SHA1	Message	Date
Rob Clark	faf276b4c8	freedreno/ir3/ra: split building regs/classes and conflicts Split out the construction of registers and classes (which is the same on all gens) from setting up conflicts. Prep to re-work how we setup conflicts on a6xx+ which merged half/full register file. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	90f7d12236	freedreno/ir3/ra: pick higher numbered scalars in first pass Since we are re-assigning the scalars anyways in the second pass, assign them to the highest free reg in the first pass (rather than lowest) to allow packing vecN regs as low as possible. Note this required some changes specifically for tex instructions with a single component writemask that is not necessarily .x, as previously these would get assigned in the first RA pass, and since they are still scalar, we'd end up w/ some r47.* and other similarly way-to-high assignments after the 2nd pass. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	1da90ca9bf	freedreno/ir3/ra: compute register target from liveranges Using the output of the first pass isn't ideal, as it can bake in the losses from fragmentation which the scalar pass is intended to fill in. This gets worse when we start using "vectorish" instructions, due to higher use of vecN values. Instead, we can just use the outputs of the liveness analysis to get a more accurate # of maximum live values at any point. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	d2cc92c747	freedreno/ir3/ra: fix array liveranges Fixes: `1b658533e1` ("freedreno/ir3: extend liverange of arrays") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	6347c2ea89	freedreno/ir3/ra: add def/use iterators Decouple the messy logic of figuring out vreg names defined/used by an instruction from the logic of what to do about it by introducing iterators. There is still some array vs ssa special casing in ra_block_compute_live_ranges(), but less than before. And this will avoid introducing a second copy of the def/use logic in a following patch which uses the liveranges to calculate the maximum # of live values (which is the optimal target for max physical register window to round-robin within). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	bf0aa7ed90	freedreno/ir3/ra: drop extending output live-ranges This is no longer needed as we create meta:collect instructions in the end block, which achieves the same result. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	0e7d24b532	freedreno/ir3/ra: add helper to map name to array For vreg names that refer to arrays rather than SSA values, this is the counterpart to name_to_instr(). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	d99d358389	freedreno/ir3/ra: fix target register calculation Account for the # of regs an instruction writes, and fix an off-by-one. (We are about to replace this with calculating the register target using the live-ranges, but in debugging that it was useful to assert() if it chose a higher target.) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	d20a06e401	freedreno/ir3/ra: add helper to map name to instruction Extract out a helper from the select_reg callback. And include all the instructions in the hashtable, not just SFU. This will be useful in the following commits. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	29992a039e	freedreno/ir3/ra: split-up Split out regset and shared header, since the RA pass is already getting large-ish. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	6da53911c1	freedreno/ir3/ra: add debug option for RA debug msgs Similar to the debug switch for sched debug msgs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	142f2d4551	freedreno/ir3: convert debug bitfield to BITFIELD_BIT() (Little more verbose than the kernel's BIT()) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	3d0905582a	freedreno/ir3: reformat disasm output In particular, make sure we see all the shader-db stats. The format (order) is the sameish, except split across multiple lines to make it easier to read. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	afdb8e3907	freedreno/ir3: fix bogus register footprint with tess/gs When we have a tess or gs stage, VS outputs aren't normal varyings, so regid is r63.x.. we shouldn't extend our register footprint to 64! Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	1b4b455739	freedreno/ir3: remove unused helper Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	c6a8792753	freedreno/ir3: add bary_ij as src for meta:tex_prefetch This way RA doesn't have to special case it in use/def accounting.. This gets rid of an extra level of split/collect, which shouldn't be needed. And interferes with scheduler trying to put tex-prefetches after inputs but before other instructions. (Otherwise it would have to figure out which split/collects need to go before the tex-prefetch) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	a0de0db0e4	freedreno/ir3: small cleanup and comments Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Rob Clark	7d9a794f35	freedreno/a6xx: register update No functional change, and this register isn't used in userspace. Just syncing from envytools tree to eliminate the delta. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4272>	2020-03-27 22:41:36 +00:00
Marek Olšák	7164674500	util: don't include p_defines.h and u_pointer.h from gallium It's a mess, but this is what I arrived at. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4324>	2020-03-27 21:00:10 +00:00
Hyunjun Ko	1ee2ad584c	freedreno/ir3: enable nir_opt_loop_unroll on a6xx If precision lowering happens at GLSL IR, loop_analysis at IR doesn't work as expected since it can't handle things like: "(expression bool < (expression float16_t f2fmp (var_ref ndx) ) (constant float16_t (1.000000)) )" So we'd rather do this optimization at the NIR stage. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3885>	2020-03-24 23:21:21 +00:00
Neil Roberts	61f7a1dfc5	freedreno/ir3: Lower bools to bitsize Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3885>	2020-03-24 23:21:21 +00:00
Jonathan Marek	04509dae7f	turnip: implement depth clamp Signed-off-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4293> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4293>	2020-03-24 13:01:44 +00:00
Jonathan Marek	afe27d5345	turnip: fix znear clipping Vulkan clips znear at 0 instead of -1. Fixes dEQP-VK.draw.inverted_depth_ranges.nodepthclamp_* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4293>	2020-03-24 13:01:44 +00:00
Jonathan Marek	07a8100aed	freedreno/registers: more GRAS_CL_CNTL bits, Z_CLAMP Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4293>	2020-03-24 13:01:44 +00:00
Eduardo Lima Mitev	55b0a676fd	turnip: Instance can be NULL resolving 'GetInstanceProcAddr' entry point Using turnip driver without a vulkan loader is currently broken because the entry point resolver is expecting a valid instance when resolving 'vkGetInstanceProcAddr' through vk_icdGetInstanceProcAddr(). Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4257> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4257>	2020-03-21 18:56:07 +01:00
Eric Anholt	5b57aa79e2	freedreno: Switch to exposing only half-integer pixel centers. This is what the HW provides us. If we need integer pixel centers, we want the state tracker to do the lowering pass so that it gets to optimize on the subtract. This is also the shader instructions that the blob is doing on GLES, and is what Vulkan wants too, as was noted in MR !4172. shader-db on a630: total instructions in shared programs: 186689 -> 186168 (-0.28%) total nops in shared programs: 66253 -> 66139 (-0.17%) total non-nops in shared programs: 120436 -> 120029 (-0.34%) total dwords in shared programs: 292192 -> 291168 (-0.35%) total last-baryf in shared programs: 4810 -> 4734 (-1.58%) total full in shared programs: 10176 -> 10195 (0.19%) total constlen in shared programs: 54589 -> 54575 (-0.03%) total sstall in shared programs: 24582 -> 24802 (0.89%) total (ss) in shared programs: 3921 -> 3925 (0.10%) total (sy) in shared programs: 1934 -> 1923 (-0.57%) Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4223> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4223>	2020-03-19 21:35:49 +00:00
Hyunjun Ko	a6625b15a4	turnip: Do gathering xfb info after nir_remove_dead_variables So we could align stream outputs correctly even if unused in/outs are removed. Fixes: dEQP-VK.transform_feedback.fuzz.random_vertex.scalar_types.* dEQP-VK.transform_feedback.fuzz.random_vertex.vector_types.* Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4207> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4207>	2020-03-18 09:47:04 +00:00
Hyunjun Ko	c11a2bc202	turnip: Fix wrong assignment of xfb output's offset. Should be divided by 4 so we could calculate the offset correctly in tu6_setup_streamout. Fixes: `2a1d6b81ed` Related: `374406a7c4` Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4207>	2020-03-18 09:47:04 +00:00
Connor Abbott	3ff437abb3	tu: Fix border color with compute shaders I wasn't able to find any CTS tests that used compute shaders with samplers and set a border color, so I hacked one of the tests included with amber: https://gist.github.com/cwabbott0/e72f0ed8259b84ed6bf3920c68fefee6 The register was found via looking at dumps of the Vulkan blob, and setting it fixes this test. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4204> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4204>	2020-03-17 09:40:26 +00:00
Connor Abbott	3349fe9a26	tu: Rewrite border color handling Emit a single table of all possible Vulkan border colors up front, and then index into it using the Vulkan enum directly. In fact this seems to be the entire point of separating out border colors in the first place. In addition to being simpler and having less CPU overhead, and fixing cases where more than one sampler uses border color, this paves the way for bindless samplers because the existing approach isn't great for bindless. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4200> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4200>	2020-03-16 13:17:54 +00:00
Hyunjun Ko	1896b44aee	turnip: Add tu6_control struct. Follow the way that freedreno is doing so that we could see the whole layout of the scratch buffer. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3942> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3942>	2020-03-12 03:10:17 +00:00
Hyunjun Ko	e4f1697b54	turnip: Enable VK_EXT_transform_feedback Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3942>	2020-03-12 03:10:16 +00:00
Hyunjun Ko	4a45c84672	turnip: Implement an empty function vkCmdDrawIndirectByteCountEXT TODO. We should implement this since indirect draw is enabled. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3942>	2020-03-12 03:10:16 +00:00
Hyunjun Ko	9ff1959ca5	turnip: Implement stream-out emit and vkApis for transform feedback 1. Implement vkCmdBindTransformFeedbackBuffersEXT, vkCmdBeginTransformFeedbackEXT and vkCmdEndTransformFeedbackEXT. - Not handling counter buffers yet. 2. Implement streamout emit function, mostly taken from fd6_emit.c v2. Replace emit_pkt4 funcs with emit_regs. v3. Don't copy the state of stream-output from tu_pipeline. v4. Set zero to VPC_SO_CNTL/VPC_SO_BUF_CNTL in tu6_init_hw. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3942>	2020-03-12 03:10:16 +00:00
Hyunjun Ko	374406a7c4	turnip: Setup stream-output when linking program Mostly taken from fd6_program.c. v2. Note that it forces to use full VS instead of binning pass VS if there's stream output as the binning pass VS will have outputs on other than position/psize stripped out, which is the same as freedreno. v3. fix indentation. v4. Use register index instead of location when setup streamout. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3942>	2020-03-12 03:10:16 +00:00
Hyunjun Ko	82fdb13c25	turnip: Define structs for transform feedback Define new structures for streamout buffers and state. Most members of the state struct are taken from freedreno driver. v2. Use IR3_MAX_SO_* and avoid using magic values. v3. Remove the state of stream-output in tu_cmd_state and use one in tu_pipeline and split out reset and enabled fields. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3942>	2020-03-12 03:10:16 +00:00
Hyunjun Ko	2a1d6b81ed	turnip: Gather information for transform feedback - Add one member to the existing ir3_stream_output so that we could assign location information from nir_xfb_info, rather than defining new struct. - Redefine maximum of so buffers, streams and outputs, which will be used for turnip. - Also enable caps for transform feedback for spirv_to_nir. v2. Remove redefined maximums and use IR3_MAX_SO_* and add IR3_MAX_SO_STREAMS. v3. Remove the newly added location field so that we could keep aligned with 32 bytes. Instead we create an array mapping between the location and consecutive index, which is what the GL driver is doing. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3942>	2020-03-12 03:10:16 +00:00
John Stultz	67aae8f98f	freedreno: Add ir3_cf.c and ir3_delay.c to Makefile.sources This patch adds missing ir3_cf.c and ir3_delay.c files to the Makefile.sources file to address build issues seen when trying to build mesa/master on AOSP Signed-off-by: John Stultz <john.stultz@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4141>	2020-03-11 02:52:58 +00:00
Rob Clark	ba03e308b6	freedreno/fdperf: set locale Set locale to get numbers printed w/ commas.. much easier to read that way. Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4119> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4119>	2020-03-10 16:52:02 +00:00
Rob Clark	30dd059925	freedreno/computerator: add performance counter support Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4119>	2020-03-10 16:52:02 +00:00
Rob Clark	3c96e25de7	freedreno/ir3: try to avoid syncs Update postsched to be better aware of where costly (ss) syncs would result. Sometimes it is better to allow a nop or two, to avoid a sync quickly after an SFU. Signed-off-by: Rob Clark <robdclark@chromium.org> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	cc82521de4	freedreno/ir3: round-robin RA In the second (scalar pass) use the information about # of registers used in the first pass as the target max, and round-robin within that range. This generally gives the post-RA sched pass more opportunities to re-order instructions to remove nop's. Also, we can be a bit clever when assigning dest registers for SFU instructions, by picking the register used for its src (if available and already assigned). This avoids some (ss) syncs caused by write after read hazards. (Ie. the SFU instruction will read its own src before writing dest.) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	b2b349096f	freedreno/ir3: track register usage in first RA pass We'll use the feedback from the first pass to select a target register usage in the second pass. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	9ae93be8fb	freedreno/ir3: fix has_latency_to_hide Also count tex-prefetch instructions. And only let the no-latency rule kick in for frag shaders. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	b6eb11295a	freedreno/ir3: split out has_latency_to_hide() Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	752b9985be	freedreno/ir3: add simplified stall estimation Doesn't take into account stalls that result from a register written in a different block, etc. But this should be more useful than just using number of (ss)'s by trying to estimate how costly a given sync is. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	64ae2ef8bb	freedreno/ir3: remove extra nops inserted in scheduler They were inserting a nop between back to back SFU instructions. But that doesn't actually appear to be required. And they get stripped out later anyways before legalize. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	ad2ff7a278	freedreno/computerator: add hrsq/hlog2/hexp2 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Rob Clark	4a8e4c18d2	freedreno/ir3: also lower lowp frag outputs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4071>	2020-03-10 16:01:39 +00:00
Kristian H. Kristensen	b75a166e68	freedreno/ir3: Don't fold conversions into sign Not supported. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3929>	2020-03-09 16:31:07 +00:00

875 commits