fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 13:48:06 +02:00

Author	SHA1	Message	Date
Eric Engestrom	8fa577340c	panvk: use updated tokens from vk.xml Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17342>	2022-07-12 15:53:11 +00:00
Alyssa Rosenzweig	e0e2294f47	panfrost/ci: Disable T760 jobs These keep timing out due to abusive jobs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17433>	2022-07-08 21:33:19 +00:00
Alyssa Rosenzweig	35a80418a1	panfrost/ci: Disable 0ad trace on T860 The last few frames of the trace are expensive (in terms of GPU time) and are close to hitting the timeout. With the next commit, they do hit the timeout due to using a larger batch. Nevertheless the next commit should be an overall perf improvement on average, so this remove to unblock CI. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17112>	2022-07-08 01:50:03 +00:00
Icecream95	91d9a34925	pan/decode: Change indent when decoding resources Make the separation between entries in the resource table more obvious. Increase the indent by two levels to keep descriptors distinct from the resource entry itself. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Icecream95	e05889c8c9	pan/decode: Use tag bits for resource entry count Fixes crashes when decoding the blob, which sometimes uses fewer than 9 entries. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Icecream95	f7da4eade4	pan/decode: fflush buffers after dumping and before aborts Otherwise trace files or other files being written (dEQP TestResults?) might be truncated. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Icecream95	bcd85a74e8	pan/va: Use the _safe iterator when adding blend shader calls Otherwise the list 'next' changing will cause the assertion in list_for_each_entry to be hit. This was not hit before because list_assert is defined for debug builds but not debugoptimized. Fixes: `5067a26f44` ("pan/bi: Use flow control lowering on Valhall") Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17371>	2022-07-08 01:30:23 +00:00
Alyssa Rosenzweig	fbe430fae9	panfrost: Move bifrost_lanes_per_warp to common Whereas the compiler needs to know the warp size for lowering divergent indirects, the driver needs to know it to report the subgroup size. Move the Bifrost-specific helper to common and add the trivial implementation for Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17265>	2022-07-08 01:14:55 +00:00
Alyssa Rosenzweig	6f3eea5ddb	panfrost: Separate core ID range from core count To query the core count, the hardware has a SHADERS_PRESENT register containing a mask of shader cores connected. The core count equals the number of 1-bits, regardless of placement. This value is useful for public consumption (like in clinfo). However, internally we are interested in the range of core IDs. We usually query core count to determine how many cores to allocate various per-core buffers for (performance counters, occlusion queries, and the stack). In each case, the hardware writes at the index of its core ID, so we have to allocate enough for entire range of core IDs. If the core mask is discontiguous, this necessarily overallocates. Rename the existing core_count to core_id_range, better reflecting its definition and purpose, and repurpose core_count for the actual core count. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17265>	2022-07-08 01:14:55 +00:00
Alyssa Rosenzweig	5aa740bc8e	pan/bi: Implement f2f16{_rtz, _rtne} Float conversions with explicit rounding modes are required for OpenCL, as well as for Vulkan with the VK_KHR_16bit_storage extension (mandatory in Vulkan 1.1). Since the hardware conversion instructions allow configuring the round mode, this is easy to support :-) Fixes test_half.vstore_half_rtz. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17262>	2022-07-08 00:57:18 +00:00
Alyssa Rosenzweig	5f599fdef6	pan/va: Add missing <roundmode/> to V2F32_TO_V2F16 So we can implement f2f16_rtz. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17262>	2022-07-08 00:57:18 +00:00
Alyssa Rosenzweig	9bd7570e96	pan/bi: Fix unpack_32_2x16 definition This got messed up when scalarizing the IR. Fix the definition of the opcode to return (instead of break, asserting out) and to respect the swizzle (instead of failing validation). Noticed when bringing up OpenCL on Valhall. Fixes: `5febeae58e` ("pan/bi: Emit collect and split") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17222>	2022-07-07 23:16:39 +00:00
Jason Ekstrand	642283a2c1	panfrost,asahi: Use util_sign_extend for unpacking Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17214>	2022-07-06 11:23:18 +00:00
Alyssa Rosenzweig	154929d731	pan/va: Handle terminal barriers If a shader ends with a workgroup barrier, it must wait for slot #7 at the end to finish the barrier. After inserting flow control, we get: BARRIER NOP.wait NOP.end Currently, the flow control pass assumes that .end implies all other control flow, and will merge this down to BARRIER.end However, this is incorrect. Slot #7 is no longer waited on. In theory, this cannot affect the correctness of the shader. In practice, the hardware checks that all barriers are reached. Terminating without waiting on slot #7 first raises an INSTR_BARRIER_FAULT. We need to weaken the flow control merging slightly to avoid this incorrect merge, instead emitting: BARRIER.wait NOP.end Of course, all of these cases are inefficient: terminal barriers shouldn't be emitted in the first place. I wrote out an optimization for this. We can merge it if we find a workload that it actually helps. Fixes test_half.vstore_half. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17264>	2022-07-05 14:48:09 +00:00
Jason Ekstrand	d06335ed76	vulkan: Depend on vk_pipeline_layout in vk_cmd_enqueue Now that we have a common pipeline layout with reference counting, we don't need these driver hooks for reference counting anymore. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17286>	2022-06-29 20:31:58 +00:00
Jason Ekstrand	73eecffabd	panvk: Use the vk_pipeline_layout base struct Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17286>	2022-06-29 20:31:58 +00:00
Jason Ekstrand	f66f37a99e	panvk: Use the vk_descriptor_set_layout base struct Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17286>	2022-06-29 20:31:58 +00:00
David Heidelberg	6c4cc0abc6	ci: traces: switch to brotli compressed traces virgl: Also drop old pre-trim glxgears trace (cached). Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17280>	2022-06-29 00:58:28 +00:00
Jason Ekstrand	b8882718b7	panvk: Use the new border color helpers Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15359>	2022-06-23 00:01:41 +00:00
Alyssa Rosenzweig	3fedf22b60	pan/bi: Tune lower_vars_to_scratch Increase the threshold to lower indirect indexing of arrays to scratch memory all the way up to 256 bytes, which was the lowest power-of-two threshold for which enabling the pass on Mali-G57 was a win in shaderdb. It's difficult to tell what threshold is optimal here. The shader-db stats are based on a rough cycle model that assumes a 16:1 ratio between CVT and load/store on Valhall, and a 24:1 ratio between arithmetic and load/store on Bifrost. Those ratios are at most rules of thumb, as the number of cycles required by a load/store instruction will vary tremendously based on caching and the memory controller. However, they may well be lower bounds (if those are the upper bounds on instruction issuing in the Mali shader cores). As such, a large threshold seems well motivated. shader-db results on Mali-G52 follow, results on Mali-G57 were similar. Note the shader that's hurt for spills/fills is helped for load/store overall. cycles helped: 129 -> 98 (-24.03%) (spills: 17 -> 20 (17.65%); fills: 34 -> 40 (17.65%)) ldst helped: 129 -> 98 (-24.03%) (spills: 17 -> 20 (17.65%); fills: 34 -> 40 (17.65%)) total instructions in shared programs: 2415410 -> 2415372 (<.01%) instructions in affected programs: 1041 -> 1003 (-3.65%) helped: 3 HURT: 0 helped stats (abs) min: 2.0 max: 31.0 x̄: 12.67 x̃: 5 helped stats (rel) min: 2.08% max: 6.02% x̄: 3.90% x̃: 3.60% total tuples in shared programs: 1928558 -> 1928527 (<.01%) tuples in affected programs: 826 -> 795 (-3.75%) helped: 2 HURT: 1 helped stats (abs) min: 6.0 max: 26.0 x̄: 16.00 x̃: 16 helped stats (rel) min: 3.72% max: 9.68% x̄: 6.70% x̃: 6.70% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 1.54% max: 1.54% x̄: 1.54% x̃: 1.54% total clauses in shared programs: 355013 -> 354981 (<.01%) clauses in affected programs: 220 -> 188 (-14.55%) helped: 3 HURT: 0 helped stats (abs) min: 2.0 max: 27.0 x̄: 10.67 x̃: 3 helped stats (rel) min: 13.99% max: 21.43% x̄: 16.93% x̃: 15.38% total cycles in shared programs: 166610.27 -> 166574.90 (-0.02%) cycles in affected programs: 138 -> 102.62 (-25.63%) helped: 3 HURT: 0 helped stats (abs) min: 0.4583330000000001 max: 31.0 x̄: 11.79 x̃: 3 helped stats (rel) min: 15.28% max: 65.28% x̄: 34.86% x̃: 24.03% total arith in shared programs: 73690.13 -> 73690.58 (<.01%) arith in affected programs: 29.71 -> 30.17 (1.54%) helped: 1 HURT: 2 helped stats (abs) min: 0.0833339999999998 max: 0.0833339999999998 x̄: 0.08 x̃: 0 helped stats (rel) min: 3.85% max: 3.85% x̄: 3.85% x̃: 3.85% HURT stats (abs) min: 0.125 max: 0.4166659999999993 x̄: 0.27 x̃: 0 HURT stats (rel) min: 1.66% max: 5.17% x̄: 3.42% x̃: 3.42% total ldst in shared programs: 135611 -> 135571 (-0.03%) ldst in affected programs: 138 -> 98 (-28.99%) helped: 3 HURT: 0 helped stats (abs) min: 3.0 max: 31.0 x̄: 13.33 x̃: 6 helped stats (rel) min: 24.03% max: 100.00% x̄: 74.68% x̃: 100.00% total quadwords in shared programs: 1674599 -> 1674523 (<.01%) quadwords in affected programs: 838 -> 762 (-9.07%) helped: 3 HURT: 0 helped stats (abs) min: 2.0 max: 65.0 x̄: 25.33 x̃: 9 helped stats (rel) min: 3.39% max: 15.00% x̄: 9.14% x̃: 9.04% total spills in shared programs: 37 -> 40 (8.11%) spills in affected programs: 17 -> 20 (17.65%) helped: 0 HURT: 1 total fills in shared programs: 190 -> 196 (3.16%) fills in affected programs: 34 -> 40 (17.65%) helped: 0 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	fd021a618f	pan/va: Replace MKVEC.v4i8 with MKVEC.v2i8 This is the instruction that the hardware actually supports. Do the rename, use the more specific accurate model in the IR, and rework the Valhall texturing code to emit MKVEC.v2i8 instead of MKVEC.v4i8. Will fix: dEQP-GLES31.functional.texture.gather.offset_dynamic.implementation_offset.* Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	c570693c19	pan/va: Pack MKVEC.v2i8 byte lanes They are in a different place, but the encoding is otherwise as usual. This will be required for texture gathers with dynamic offsets. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	10301885ab	pan/bi: Constant fold MKVEC.v2i8 Constant MKVEC.v2i8 will be generated during texturing on Valhall, just like constant MKVEC.v4i8 is currently generated. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	2833d0472a	pan/bi: Model MKVEC.v2i8 Valhall does not have Bifrost's 4-source MKVEC.v4i8. Instead, it has a (somewhat limtied) 3-source MKVEC.v2i8. The full MKVEC.v4i8 may be lowered to a pair of MKVEC.v2i8 instructions. For good code quality on both Bifrost and Valhall, we need to model both instructions in their full generality. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	6792b15971	pan/bi: Remove FRSCALE from IR It's just LDEXP in different clothing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	21bedd2c97	pan/va: Rename RSCALE to LDEXP This avoids needless variation from Bifrost. While at it, fix the opcode definition: there are no abs/neg/swizzle modifiers on the signed integer source, and there's no clamp. However, there are round and infinity modes, like on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	0da28ee2c7	pan/va: Implement sample positions FAU packing This will fix: dEQP-GLES31.functional.shaders.multisample_interpolation.interpolate_at_offset.at_sample_position.default_framebuffer Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	9dd0bc92b5	pan/va: Lower FADD_RSCALE.f32 to FMA_RSCALE.f32 We generate FADD_RSCALE.f32 in our sample variables implementations. Valhall doesn't have a dedicated FADD_RSCALE.f32 implementation, it should be aliased to FMA_RSCALE.f32. Handle that alias in isel lowering. This will fix: dEQP-GLES31.functional.shaders.multisample_interpolation.interpolate_at_offset.* Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	1a882ecdab	pan/bi: Align accesses with packed TLS When lowering vars to scratch, we need to be careful with alignment on Valhall, where packed TLS access must not straddle a 16-byte boundary. Fixes regressions when enabling indirect access to temps on Valhall. Fixes: `6761dbf891` ("panfrost: Use packed TLS on Valhall") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	5ee1179c94	pan/bi: Fix LD_BUFFER.i16 definition This was missing the message, breaking UBO-to-push and who-knows-what-else, when enabling fp16 const buffers. Fixes: `3dc2095b07` ("pan/bi: Model LD_BUFFER instructions") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17101>	2022-06-21 22:42:34 +00:00
Alyssa Rosenzweig	40accfd3b7	pan/va: Unit test va_mark_last This pass is super easy to unit test, so we have no excuse not to test thoroughly. va_mark_last only inserts annotations in a shader without any annotations, so our test cases are simply annotated shaders. The CASE macro just has to compare the case against the case with the annotations stripped and added back with va_mark_last. In retrospect, I should have used that technique for the flow control insertion tests too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	4b7e337b45	pan/va: Mark last register reads On Valhall, register reads may be marked as "last" [1]. Setting the last flag promises the hardware that the value of the register is no longer required. This may enable hardware optimizations. In particular, it may permit the hardware to avoid register file writes if a write to the marked register is still in the forwarding buffer. This may improve power efficiency. In principle, this is trivial: run liveness analysis and mark killed sources, like we would in an SSA-based register allocator. In practice, there are a few wrinkles to avoid hazards around staging registers and 64-bit register pairs, requiring some additional data flow analysis and fix ups. However, nothing here is particularly "hard", and all the ideas are already in use for the Bifrost scheduler and the Bifrost/Valhall scoreboard analyses. [1] In Mesa's compiler, this is called discard for historical reasons. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	d4377e1255	pan/va: Use validate_register_pair for BLEND pack Instead of open-coding. Noticed by inspection. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	b48933d641	pan/va: Include BLEND for va_swap_12 This helps "contain the crazy" and avoids special casing BLEND in compiler passes. The Valhall instruction is roughly the same as its Bifrost counterpart, as long as we fix up the source order (as we already do for bitwise operations) everything works out. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	738a1572d2	pan/va: Move va_flow_is_wait_or_none to common We want to use this helper in the "mark last" pass too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	1b29a99b7b	pan/va: Add header guards to valhall_enums.h Otherwise we can't #include in multiple places. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	c5a8736552	pan/bi: Constify bi_is_staging_src argument Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	2075bff4e8	pan/bi: Mark bi_postra_liveness_ins as MUST_CHECK Post-RA liveness relies on the caller updating the live variable with the results of bi_postra_liveness_ins. It is not automatic, as with regular liveness. This means ignoring the result of bi_postra_liveness_ins is surely an error. Mark it as MUST_CHECK to catch that error at compile time. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	43d00c2971	pan/va: Unit test barrier handling Add a unit test for the quirk discovered in the previos commit, because this will cause flakes (instead of fails) if we get it wrong. Better have a deterministic fail mode. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	8c6b9b9c92	pan/va: Workaround quirk of barrier handling For some unknown reason, waiting for general slots (at least for memory stores) doesn't work properly on a BARRIER instruction. We need to wait for all general slots right before issuing the BARRIER in addition to the general wait on the BARRIER itself. I don't know if this is a hardware bug or some hideous gate-saving quirk, but I observe the Mali-G78 DDK using the same workaround, which implies this really is necessary. Fixes rare flakes in: dEQP-GLES31.functional.compute.shared_var.work_group_size.float_128_1_1 Note that the flakes from that test are extremely timing dependent. Without this change, that test is racy but we almost always win the race. Reproducing the issue reliably requires high system load (e.g. running the CTS in the background) and simultaneously running that test a large number of times. Minimal shader-db impact. In particular, no cycle count regressions. total instructions in shared programs: 2699419 -> 2699458 (<.01%) instructions in affected programs: 22014 -> 22053 (0.18%) helped: 2 HURT: 25 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.12% max: 0.12% x̄: 0.12% x̃: 0.12% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.64 x̃: 1 HURT stats (rel) min: 0.07% max: 2.82% x̄: 0.69% x̃: 0.49% 95% mean confidence interval for instructions value: 1.01 1.87 95% mean confidence interval for instructions %-change: 0.38% 0.88% Instructions are HURT. total cvt in shared programs: 14468.81 -> 14469.42 (<.01%) cvt in affected programs: 221.33 -> 221.94 (0.28%) helped: 2 HURT: 25 helped stats (abs) min: 0.015625 max: 0.015625 x̄: 0.02 x̃: 0 helped stats (rel) min: 0.18% max: 0.18% x̄: 0.18% x̃: 0.18% HURT stats (abs) min: 0.015625 max: 0.046875 x̄: 0.03 x̃: 0 HURT stats (rel) min: 0.10% max: 4.44% x̄: 1.06% x̃: 0.79% 95% mean confidence interval for cvt value: 0.02 0.03 95% mean confidence interval for cvt %-change: 0.57% 1.36% Cvt are HURT. total quadwords in shared programs: 1462496 -> 1462528 (<.01%) quadwords in affected programs: 4632 -> 4664 (0.69%) helped: 0 HURT: 4 HURT stats (abs) min: 8.0 max: 8.0 x̄: 8.00 x̃: 8 HURT stats (rel) min: 0.35% max: 7.69% x̄: 4.03% x̃: 4.03% 95% mean confidence interval for quadwords value: 8.00 8.00 95% mean confidence interval for quadwords %-change: -2.71% 10.76% Inconclusive result (%-change mean confidence interval includes 0). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	7fa545528d	pan/va: Simplify insert flow tests Test cases for insert flow are necessarily the reference test cases with the NOPs stripped out. That means we don't need to duplicate the test bodies. Deduplicate. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	35fcf8d3d7	pan/va: Move VA_NUM_GENERAL_SLOTS to common This definition is a hardware property. It's not specific to the flow control insertion pass, so move it to common code where other passes can use it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17091>	2022-06-21 22:19:59 +00:00
Alyssa Rosenzweig	b6a30b72ab	panfrost: Implement provoking vertices on Valhall Starting with Valhall, the provoking vertex state is specified per-framebuffer (batch) instead of per-draw. We use the pan_tristate infrastructure to translate between desktop OpenGL's per-draw semantics to Valhall's per-framebuffer semantic. This is notably not required for GLES or Vulkan. If the provoking vertex is unset when the tiler context is generated, it could be set (incompatibly) later in the batch, and the tiler context's provoking vertex field would no longer match the framebuffer's. That would violate a hardware invariant. To ensure that doesn't happen, we make sure to set provoking vertexes before generating the tiler context so it can't change after. Fixes arb-provoking-vertex-render on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17068>	2022-06-20 18:38:16 +00:00
Icecream95	ec70291da9	panfrost: Stop using sparse_array for batch BOs Iterating over a util_sparse_array is very expensive; replace this with a standard dynarray. Using the sparse 'nodearray' datastructure instead was tested, but found to be slower in some cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16988>	2022-06-14 23:44:02 +00:00
David Heidelberg	96f0944a69	ci/panfrost: add Blender, Warzone2100, Freedoom and Unvanquished traces Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16909>	2022-06-14 11:52:45 +00:00
Alyssa Rosenzweig	9bdd0854ea	panvk: Use common CmdBeginRenderPass The runtime already handles this. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16950>	2022-06-13 17:27:36 +00:00
Alyssa Rosenzweig	f00e0bfd8a	panvk: Simplify depth clear preload condition Easier to understand and equivalent in practice. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16950>	2022-06-13 17:27:36 +00:00
Alyssa Rosenzweig	ec2bf34d97	panvk: Fix stencil clears of combined ZS images If we have a combined Z/S image, the image has depth, so we proceed down the depth path, which does not set clear.s even though there's also a stencil component. Unify the control flow to fix this. Fixes (among others): dEQP-VK.api.image_clearing.core.clear_depth_stencil_image.single_layer.d24_unorm_s8_uint_multiple_subresourcerange Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16950>	2022-06-13 17:27:36 +00:00
Alyssa Rosenzweig	588ee38351	panvk: Clear Z/S attachments without a shader Rather than generating shaders to clear depth and stencil attachments, run the rasterizer without a shader and configure the depth/stencil hardware to do the clear. These settings are known to be efficient on Valhall, presumably the depth/stencil pipeline on Bifrost is similar enough that it is also the efficient way there. It's certainly much simpler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16950>	2022-06-13 17:27:36 +00:00
Alyssa Rosenzweig	03c34a8887	panvk: Remove unused pushmaps These were removed in an earlier series containing `ae77c207e0` ("panvk: Use push constants for copy shaders"), but the unused variables hung around. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16950>	2022-06-13 17:27:36 +00:00

1 2 3 4 5 ...

4222 commits