fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 02:18:10 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	771d23584a	pan/midgard: Remove util/ra support It's now unused, in favour of LCRA. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-13 15:27:56 +00:00
Alyssa Rosenzweig	e343f2ceb9	pan/midgard: Integrate LCRA Pretty routine, we do have a hack to force swizzle alignment for !32-bit for until we implement !32-bit the right way. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-13 15:27:56 +00:00
Alyssa Rosenzweig	66ad64d73d	pan/midgard: Implement linearly-constrained register allocation Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-13 15:27:56 +00:00
Alyssa Rosenzweig	fd81916ee5	pan/midgard: Add blend shader selection bits for MRT This is less complicated than previously thought. Note we have no way of specifying the work register count for blend shaders; it must be strictly less than the work register count of the corresponding fragment shader (which is fine since we force the fragment shader to report a count of 16 with a blend shader as a major hack until we get register pressure down for blend shaders). TODO: pandecode the flags. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-13 15:27:56 +00:00
Alyssa Rosenzweig	3295edaadf	pan/midgard: Pack load/store masks While most load/store operations on 32-bit/vec4 intriniscally, some are not and have special type-size-dependent semantics for the mask. We need to convert into this native format. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-11-11 15:23:44 +00:00
Alyssa Rosenzweig	843874c7c3	pan/midgard: Implement nir_intrinsic_load_output_u8_as_fp16_pan We can use the native Midgard ops for this, depending what chip we're on. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-11-11 15:23:44 +00:00
Alyssa Rosenzweig	5885b64e42	pan/midgard: Identify ld_color_buffer_u8_as_fp16* There are two versions of this opcode, depending what version of the ISA you're using. I'm not sure if there's a semantic difference; I think there might be some slight subtleties but it's too early to know at this stage. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-11-11 15:23:44 +00:00
Alyssa Rosenzweig	5f768eda43	pan/midgard: Switch base for vertex texturing on T720 There aren't texture pipeline registers anymore; instead, space is shared with work and ldst registers for output and input respectively. We need to shift the base registers to represent this correctly. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-08 06:45:03 +00:00
Alyssa Rosenzweig	ac14facf7a	pan/midgard: Pass shader stage to disassembler Vertex texturing behaves differently from fragment texturing on some GPUs. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-08 06:45:03 +00:00
Alyssa Rosenzweig	515941202d	pan/midgard: Disassemble half-steps correctly The meaning of some bits shifts; we need to account for this to print swizzles sanely. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-08 06:45:03 +00:00
Alyssa Rosenzweig	ec2af6bc97	pan/midgard: Fix printing of half-registers in texture ops We were using old style half-registers; let's update that to be consistent, preparing us for more disassmbler changes in this area. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-08 06:45:03 +00:00
Tomeu Vizoso	072207bc18	panfrost: Pipe the GPU ID into compiler and disassembler Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-11-07 08:48:45 +00:00
Tomeu Vizoso	94e6d17043	panfrost: Print the right zero field Copy paste error. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reported-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2019-11-06 18:13:16 +01:00
Tomeu Vizoso	8e1ae5fa14	panfrost: Decode blend shaders for SFBD Also set MALI_HAS_BLEND_SHADER as needed. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-06 16:18:46 +01:00
Tomeu Vizoso	9447a84f69	panfrost: Rework format encoding on SFBD Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-06 16:18:46 +01:00
Tomeu Vizoso	23fe7cd2d6	panfrost: Add checksum fields to SFBD descriptor During tests on T720, these fields were discovered. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-06 16:17:13 +01:00
Alyssa Rosenzweig	12d071024b	pan/midgard: Extend default_phys_reg to !32-bit We can pass through a size. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	762623381d	pan/midgard: Extend swizzle packing for vec4/16-bit We would like to pack not just xyzw swizzles but also efgh swizzles. This should work for vec4/16-bit. More work will be needed to pack swizzles for vec8/16-bit and even more work for 8-bit, of course. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	bf5508f7b9	pan/midgard: Extend offset_swizzle to non-32-bit We take a size parameter; use it. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	f538981384	pan/midgard: offset_swizzle doesn't need dstsize This argument should be omitted. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	9eac9389fb	pan/midgard: Add bizarre corner case Someone really needs to look into this. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	4ae4d82e21	pan/midgard: Compute bundle interference Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	45ac8ea8bd	pan/midgard: Fix quadword_count handling Spilling can mess with this considerably. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	0a77dd3203	pan/midgard: Validate tags when branching Midgard prefetches instructions based on tag (ALU, LD/ST, texture * size). To do so, the shader descriptor specifies the tag of the first instruction, all instructions specify the tag of the next linear instruction is, and all branches explicitly specify the tag of the branch target. If you mess this up, you get an INSTR_TYPE_MISMATCH, which unambiguously refers to this problem, but it's still annoying to try to work out all the branch targets in your head to debug. Instead, let's track the tags of various blocks over time, so we can automatically validate tags of branch targets, to make INSTR_TYPE_MISMATCH issues immediately obvious in a disassembly. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Boris Brezillon	28440820ef	panfrost: MALI_DEPTH_TEST is actually MALI_DEPTH_WRITEMASK MALI_DEPTH_TEST should only be set when depth->writemask is true, not when the depth test is enabled. Let's rename the flag and patch panfrost_bind_depth_stencil_state() to do the right thing. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 16:14:09 +01:00
Alyssa Rosenzweig	c3a46e7644	pan/midgard: Eliminate blank_alu_src We don't need it in practice, so this is some more cleanup. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 01:01:47 +00:00
Alyssa Rosenzweig	70072a20e0	pan/midgard: Refactor swizzles Rather than having hw-specific swizzles encoded directly in the instructions, have a unified swizzle arary so we can manipulate swizzles generically. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 01:01:47 +00:00
Alyssa Rosenzweig	e7fd14ca8a	pan/midgard: Add a dummy source for loads We want symmetry between loads and stores, so we add a dummy source. So we get, e.g. st_int4 _, val, arg_1, arg_2 ld_int4 dest, _, arg_1, arg_2 Semantically, this dummy source represents the data itself, as if the load is simply a move. That means it has a swizzle that acts as a source. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 01:01:47 +00:00
Alyssa Rosenzweig	b5938be51d	pan/midgard: Remove OP_IS_STORE_VARY Unused. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 01:01:46 +00:00
Robert Foss	f140467b5b	android: Add panfrost support to build scripts Currently the Android build system doesn't expose the panfrost driver. This patch enables the panfrost driver to be build on for the Android platform. Signed-off-by: Robert Foss <robert.foss@collabora.com> Reviewed-By: Rohan Garg <rohan.garg@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-31 10:03:54 +01:00
Alyssa Rosenzweig	44971b84b7	panfrost: Remove unused definitions in mali-job.h Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-29 13:02:53 +00:00
Alyssa Rosenzweig	fa14cdf6e4	panfrost: Cleanup _shader_upper -> shader I don't believe this is actually a tagged pointer; warn if it is. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-29 13:02:53 +00:00
Alyssa Rosenzweig	f98e9a2771	pan/midgard: Express allocated registers as offsets Rather than supplying a mask/swizzle to compose with the original, just supply the offset of the allocated register so we can directly offset the mask/swizzle, without resorting to composition. This is simpler, cleaner, and will generalize to non-32-bit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-25 08:45:39 -04:00
Alyssa Rosenzweig	c1d36eb115	pan/midgard: Expose more typesize manipulation routines These internal mir.c routines will help the RA. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-25 08:45:39 -04:00
Alyssa Rosenzweig	9bba182840	pan/midgard: Add mir_set_bytemask helper Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-25 08:45:39 -04:00
Rhys Perry	8b98d0954e	nir/lower_idiv: add new llvm-based path v2: make variable names snake_case v2: minor cleanups in emit_udiv() v2: fix Panfrost build failure v3: use an enum instead of a boolean flag in nir_lower_idiv()'s signature v4: remove nir_op_urcp v5: drop nv50 path v5: rebase v6: add back nv50 path v6: add comment for nir_lower_idiv_path enum v7: rename _nv50/_llvm to _fast/_precise v8: fix etnaviv build failure Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-10-21 18:49:46 +00:00
Alyssa Rosenzweig	b8c4fb235e	pan/midgard: Implement SIMD-aware dead code elimination We would like to eliminate not just entire dead instructions, but also dead components, which increases scheduler flexibility (since some vector instructions can become scalar after eliminating dead components). This also will allow better RA in the future. Results are meh. total instructions in shared programs: 3453 -> 3451 (-0.06%) instructions in affected programs: 60 -> 58 (-3.33%) helped: 2 HURT: 0 total bundles in shared programs: 1826 -> 1824 (-0.11%) bundles in affected programs: 33 -> 31 (-6.06%) helped: 2 HURT: 0 total quadwords in shared programs: 3144 -> 3144 (0.00%) quadwords in affected programs: 0 -> 0 helped: 0 HURT: 0 total registers in shared programs: 321 -> 321 (0.00%) registers in affected programs: 45 -> 45 (0.00%) helped: 11 HURT: 11 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 16.67% max: 50.00% x̄: 39.70% x̃: 50.00% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% 95% mean confidence interval for registers value: -0.45 0.45 95% mean confidence interval for registers %-change: -1.87% 62.18% Inconclusive result (value mean confidence interval includes 0). total threads in shared programs: 445 -> 447 (0.45%) threads in affected programs: 2 -> 4 (100.00%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	6c4b97011b	pan/midgard: Create dependency graph bytewise This allows for vec16 dependencies in the scheduler, not that we have any yet (thankfully). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	825f11e739	pan/midgard: Handle nontrivial masks in texture RA The texture instruction has a mask we need to take into account. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	d1d3411ba5	pan/midgard: Implement per-byte liveness tracking Now that we have notion of byte masks, liveness tracking can be updated to reflect this extra granularity without loss of correctness. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	43fd730fc4	pan/midgard: Simplify mir_bytemask_of_read_components There are easy ways to iterate sources! Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	e9202ff3cb	pan/midgard: Report byte masks for read components Read component masks don't have a particular type associated, since the type of the ALU operation may not match the type of the operands in question. So let's generate byte masks instead, and update the rest of the compiler to use byte masks when analyzing reads. Preparation for mixed types. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	d079631248	pan/midgard: Add helpers for manipulating byte masks There are essentially two formats of masks in play beginning with this commit: masks per-channel and masks per-byte. The former make sense within a given fixed-size instruction; the latter are typesize-independent. It turns out you need the latter to meaningfully manipulate instructions containing multiple sizes (which is quite possible with ALU operations). Similarly, we have mir_srcsize. We calculate the size of the source by analyzing the size of the instruction itself and stepping down if there is a half-modifier. Finally, we have mir_round_bytemask_down, for when we want to take a byte mask and "round it down" to a given component size, so that we can use it as a component mask. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	e981b69484	pan/midgard: Implement OP_IS_STORE with table ..rather than open-coding. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	8e31b14858	pan/midgard: Tableize load/store ops This will allow us to encode properties about the load/store ops like we do for ALU ops. We include now properties about whether we have a store, and if there are special cases on the load/store op. We also tag each instruction by its natural size... this is probably not totally right, but it's a start. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	5952add9a9	pan/midgard: Factor out mir_get_alu_src This helper is used in a bunch of places ... might as well make that common. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	f77ea9798d	pan/midgard/disasm: Fix printing 8-bit/16-bit masks The trick is realizing even with a destination override, the masks are encoded in the same mode as the instruction itself, rather than stepping down. The override means that the smaller type is used, but the mask is parsed as if it were the higher type. Overriding down is down by printed by blinding doing this. Overriding up can be thought of as printing in the upper size, but shifting the alphabet to use the upper half, i.e. shifting xyzw to become abcd. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	d49fdca229	pan/midgard: Identify 64-bit atomic opcodes They are symmetric to their 32-bit counterparts, just shifted. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-10-20 12:02:31 +00:00
Alyssa Rosenzweig	6601570ead	pan/midgard: Debug mir_insert_instruction_after_scheduled Add some comments explaining what's going on in a more natural flow in order to solve the actual bug. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Fixes: `2d914ebe81` ("pan/midgard: Fix memory corruption in register spilling")	2019-10-20 12:02:31 +00:00
Erik Faye-Lund	2da792d398	panfrost: do not report alpha-test as supported This triggers lowering in the state-tracker, which makes things a bit simpler. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-10-17 10:41:36 +02:00

1 2 3 4 5 ...

443 commits