fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 07:38:14 +02:00

Author	SHA1	Message	Date
Iago Toral Quiroga	79dee14cc2	broadcom/compiler: don't move ldvary earlier if current instruction has ldunif If we did, we would have the instruction coming right after ldvary write to the same implicit destination as ldvary at the same time. We prevent this when merging instructions, but we should make sure we prevent this when we move ldvary around for pipelining too. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13921>	2021-11-23 10:52:24 +00:00
Iago Toral Quiroga	7fec4f4135	broadcom/compiler: fix scoreboard locking checks According to the spec the hardware locks the scoreboard on the first or last thread switch (selected via shader state) and any TLB accesses executed before this are not synchronized by hardware. This change updates the logic to ensure we respect this requirement and that we don't assume that the lock is acquired automatically on the first TLB access, which is not valid at least since V3D 4.1+. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13910>	2021-11-22 12:53:43 +00:00
Iago Toral Quiroga	bd7584c16b	broadcom/compiler: don't allow RF writes from signals after thrend Writes to physical registers are not allowed after thread end. We were checking this for ALU writes, but we need to check it for signal writes too. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13910>	2021-11-22 12:53:43 +00:00
Juan A. Suarez Romero	457dbb81f5	broadcom/compiler: apply constant folding on early GS lowering This solves a case where a NIR geometry shader was storing the output in a non-constant: vec4 32 ssa_1 = load_const (0xc0800000 /* -4.000000 /, 0xc1100000 / -9.000000 /, 0x40400000 / 3.000000 /, 0x40e00000 / 7.000000 /) vec1 32 ssa_7 = load_const (0x00000000 / 0.000000 /) vec1 32 ssa_8 = load_const (0x00000001 / 0.000000 /) vec1 32 ssa_9 = iadd ssa_7, ssa_8 vec1 32 ssa_19 = mov ssa_1.x intrinsic store_output (ssa_19, ssa_9) (1, 1, 0, 160, 288) / base=1 / / wrmask=x / / component=0 / / src_type=float32 / / location=32 slots=2 gs_streams(x=0 y=0 z=0 w=0) / When lowering the VPM output we check if the destination (ssa_9 in this case) is a constant to add to the VPM offset. We run a constant folding optimization in an earlier VS lowering, and we should do the same for GS. This fixes multiple dEQP-VK.pipeline.interface_matching. failures. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13884>	2021-11-22 09:32:50 +00:00
Juan A. Suarez Romero	7b21635057	broadcom/compiler: handle array of structs in GS/FS inputs While fragment and geometry shader were handling structs as inputs, they weren't doing for it arrays of structures. This fixes multiple dEQP-VK.pipeline.interface_matching.* failures and assertions. v2: - Fix style (Iago). Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13884>	2021-11-22 09:32:50 +00:00
Iago Toral Quiroga	5e536c97a9	broadcom/compiler: fix early fragment tests setup When early fragment tests are mandated by the shader, we must use the Z value produced by the FEP even if there are elements that would typically require late fragment tests (such as discards, sample to coverage, etc). This change means we also need to be a bit more careful when we promote shaders to use early fragment tests so we don't promote anything with discards for example. Fixes: dEQP-VK.fragment_operations.early_fragment.discard_early_fragment_tests_depth dEQP-VK.fragment_operations.early_fragment.discard_early_fragment_tests_stencil Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13837>	2021-11-18 07:39:32 +00:00
Connor Abbott	508f917d8c	util/dag: Make edge data a uintptr_t Nobody was actually using it as a pointer, and I'm going to introduce a shared function which relies on it not being a pointer so let's fix this once and for all. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Iago Toral Quiroga	0cb58f80d2	v3d: use V3D_MAX_DRAW_BUFFERS instead of hardcoded constant Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13775>	2021-11-12 11:04:07 +00:00
Iago Toral Quiroga	3a95e25e84	v3dv,v3d: don't store swizzle pointer in shader/pipeline keys We had been storing pointers to a driver owned swizzle table rather than storing the actual swizzle value in various shader and pipeline keys on both GL and Vulkan drivers. This doesn't look very robust, particularly since we also compute sha1 hashes from these values and we may store these hashes to disk (for the disk cache). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13738>	2021-11-10 11:24:26 +00:00
Iago Toral Quiroga	aa5a0e1dad	broadcom/compiler: copy packing when converting add to mul Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13675>	2021-11-04 13:57:39 +00:00
Iago Toral Quiroga	a794bdf953	broadcom/compiler: check that sig packing is valid when pipelining ldvary Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13641>	2021-11-03 10:49:06 +00:00
Iago Toral Quiroga	6b9bd3f038	broadcom/compiler: make opt passes set current block Typically, optimization passes go through all the blocks in a shader and make adjustments on the fly, so we always want them to update the current block or the current block pointer will become outdated. Also, we don't need to keep track of the previous current block pointer to restore it, since optimization passes run after we have completed conversion to VIR, and therefore, anything that comes after that should always set the current block before emitting code. Fixes debug assert crashes when running shader-db: vir.c:1888: try_opt_ldunif: Assertion `found \|\| &c->cur_block->instructions == c->cursor.link' failed Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13625>	2021-11-02 11:17:01 +00:00
Iago Toral Quiroga	3fbd6662b7	broadcom/compiler: rework simultaneous peripheral access checks This was not quite correct in that our checks for the allowed cases were not checking that there were no other peripheral access other than the ones allowed. For example, we allowed wrtmuc signal and TMU write other than TMUC, and we also allowed TMU read and VPM read/write. But we cannot allow wrtmuc with TMU write other than TMUC and at the same time a VPM write for example, so we can't just check if we have a combination of allowed peripherals, we still need to check that those are the only ones in use by the combined instructions. Another example is that even if we allow a TMU write (other than TMUC) with a wrtmuc signal, the resulting instruction must still have just one TMU write other than TMUC, but we were allowing the merge if one instruction signaled wrtmuc and the other wrote to tmu other than tmuc without testing if the combined result would have 2 tmu writes. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13527>	2021-10-27 06:03:12 +00:00
Iago Toral Quiroga	1561d0126a	broadcom/compiler: fix assert that current instruction must be in current block This was not considering the possibility that the driver has called nir_before_block() or nir_after_block() to update the cursor, in which case the cursor link points to the instruction list header and not to an actual instruction. Fixes incorrect debug-assert crash in: dEQP-VK.graphicsfuzz.cov-increment-vector-component-with-matrix-copy Fixes: `265515fa62` ("broadcom/compiler: check instruction belongs to current block") Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13467>	2021-10-22 05:39:05 +00:00
Iago Toral Quiroga	75bd37dc6a	broadcom/compiler: disallow tsy barrier in thrsw delay slots A TSY barrier becomes effective at the point of the next thread switch, so if we have one coming after a previous thread switch we need to be careful not to emit it in its delay slots, or we would be effectively moving the barrier earlier than intended. Fixes simulator assert crash in: dEQP-VK.graphicsfuzz.two-for-loops-with-barrier-function Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13468>	2021-10-21 12:40:00 +02:00
Alejandro Piñeiro	d50be41f8f	broadcom/compiler: remove unused macro and function definition Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13444>	2021-10-20 10:08:27 +00:00
Alejandro Piñeiro	9e41c42ed4	broadcom/compiler: remove qpu_acc helper It is really small, and used just twice, so we just call qpu_magic. We also update how it is used: * QFILE_NULL is an undef so we can just load anything. Previously we were using accumulator 0, but there isn't any real reason to use an accumulator for this. Using reg 0. * QFILE_LOAD_IMM: it seems that we don't use at all right now, so let's add an assert Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13008>	2021-09-24 08:46:06 +00:00
Alejandro Piñeiro	193898c8b0	broadcom/compiler: remove commented out vir_LOAD_IMM methods It has been commented several years now. Let's remove it to reduce the noise. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13008>	2021-09-24 08:46:06 +00:00
Juan A. Suarez Romero	d220d8cb51	broadcom/compiler: add V3D_DEBUG_NO_LOOP_UNROLL debug option Disables loop unrolling. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12803>	2021-09-13 08:51:54 +00:00
Ella-0	53ae5c3aae	v3d/compiler: Handle point_coord_upper_left Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12524>	2021-09-12 21:01:11 +00:00
Juan A. Suarez Romero	c98ddc778a	broadcom/compiler: force a last thrsw for spilling As we don't know if we are going to have spilling or not, emit always a last thrsw at the end of the shader. If later we don't have spillings and we don't need that last thrsw, we remove it and switch back to the previous one. This way we ensure all the spilling happens always before the last thrsw. v2 (Juan): - Rework the code to force a last thrsw and remove later if no spilling v3: - Merge functionality inside vir_emit_last_thrsw (Iago) - Add vir_restore_last_thrsw (Juan) v4 (Iago): - Fix/add new comments - Rename variables/parameters v5 (Iago): - Fix comments - Add assertion Cc: mesa-stable Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4760 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12322>	2021-09-10 09:18:05 +00:00
Juan A. Suarez Romero	53c8b4c093	broadcom: make vir_emit_last_thrsw() private This function is only used in v3d_nir_to_vir(), so make it private. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12322>	2021-09-10 09:18:05 +00:00
Juan A. Suarez Romero	265515fa62	broadcom/compiler: check instruction belongs to current block Check in the ldunif optimization if the current instruction belongs to current block. These avoids again searching the instruction when current block is not correctly set, as it happened in https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12339 and in https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12221. v2: - Remove extra blank line (Iago) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12358>	2021-09-06 10:38:06 +00:00
Iago Toral Quiroga	3ef2ca9cbf	broadcom/compiler: don't enable early fragment tests if shader writes Z We had an optimization to auto-enable early fragment tests when a shader didn't have side effects, but of course, we cannot do that this if the shader writes Z, as in that case the fragment tests need to use the value written from the shader. Also, if the shader enables early fragment tests, then any shader Z writes should be ignored. Fixes: dEQP-VK.spirv_assembly.instruction.graphics.early_fragment.* Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12736>	2021-09-06 12:01:43 +02:00
Vinson Lee	0a4c4f4459	broadcom/compiler: Fix qpu.flags.muf typo. Fix defect reported by Coverity Scan. Same on both sides (CONSTANT_EXPRESSION_RESULT) pointless_expression: The expression inst->qpu.flags.auf != V3D_QPU_UF_NONE \|\| inst->qpu.flags.auf != V3D_QPU_UF_NONE does not accomplish anything because it evaluates to either of its identical operands, inst->qpu.flags.auf != V3D_QPU_UF_NONE. Fixes: `3f2c54a27f` ("broadcom/compiler: rewrite partial update liveness tracking") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12385>	2021-08-24 08:30:59 +00:00
Juan A. Suarez Romero	2a86d51960	broadcom/compiler: set current block on incrementing unifa When incrementing unifa address in DCE optimization, ensure that we setup correctly the current block, so the ldfunif optimization is also executed correctly. This fixes dEQP-VK.graphicsfuzz.cov-struct-float-array-mix-uniform-vectors heap-buffer overflow with address sanitizer enabled. v2 (Iago): - Save and restore current block Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12339>	2021-08-12 12:33:46 +00:00
Iago Toral Quiroga	3f2c54a27f	broadcom/compiler: rewrite partial update liveness tracking The code we had for this was a work in progress and not finished. Also, it was geared towards partial writes caused by output packing (i.e. fp16) and was ignoring partial updates caused by conditional writes, which are far more common in our case. This change provides an implementation for tracking conditional writes that works in tandem with the previous spill change to narrow liveness for their spills. Fixes register allocation failures in: dEQP-VK.graphicsfuzz.spv-stable-maze-flatten-copy-composite We also gain one shader from shader-db: total instructions in shared programs: 13339969 -> 13338584 (-0.01%) instructions in affected programs: 185520 -> 184135 (-0.75%) helped: 375 HURT: 130 Instructions are helped. total threads in shared programs: 412038 -> 412040 (<.01%) threads in affected programs: 2 -> 4 (100.00%) helped: 1 HURT: 0 total uniforms in shared programs: 3746581 -> 3746585 (<.01%) uniforms in affected programs: 49 -> 53 (8.16%) helped: 0 HURT: 1 total max-temps in shared programs: 2359960 -> 2359947 (<.01%) max-temps in affected programs: 289 -> 276 (-4.50%) helped: 7 HURT: 0 Max-temps are helped. total sfu-stalls in shared programs: 34351 -> 34359 (0.02%) sfu-stalls in affected programs: 218 -> 226 (3.67%) helped: 35 HURT: 37 Inconclusive result (value mean confidence interval includes 0). total inst-and-stalls in shared programs: 13374320 -> 13372943 (-0.01%) inst-and-stalls in affected programs: 186653 -> 185276 (-0.74%) helped: 373 HURT: 132 Inst-and-stalls are helped. LOST: 0 GAINED: 1 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12278>	2021-08-10 08:47:40 +00:00
Iago Toral Quiroga	c335c03ae2	broadcom/compiler: make spills of conditional writes also conditional A spill of a conditional write generates code like this: mov.ifa t5000, 0 mov tmud, t5000 nop t5001; ldunif (0x00008100 / 0.000000) add tmua, t11, t5001 Here, we are spilling t5000, which has a conditional write, and we produce an inconditional spill for it. This implicitly means that our spill requires a correct value for all channels of t5000. If we do a conditional spill, then we emit: mov.ifa t5000, 0 mov tmud.ifa, t5000 nop t5001; ldunif (0x00008100 / 0.000000) add tmua.ifa, t11, t5001 Which only uses channels of t5000 that have been written by the instruction being spilled. By doing the latter, we can then narrow down the liveness for t5000 more effectively, as we can use this to detect that the block only reads (in the tmud instruction) the values that have been written previously in the same block (in the mov instruction). This means that values in other channels are not used, and therefore, we don't need them to be alive at the start of the block. This means that if this is the only write of t5000 in this block, we can consider that the block completely defines t5000. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12278>	2021-08-10 08:47:40 +00:00
Iago Toral Quiroga	314eb97dcb	broadcom/compiler: Flags are per-thread state in V3D 4.2+ This means they survive a thread switch, so we can remove redundant flag setups across thread switches. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12278>	2021-08-10 08:47:40 +00:00
Iago Toral Quiroga	b727eaac3c	broadcom/compiler: add a vir_get_cond helper Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12278>	2021-08-10 08:47:40 +00:00
Juan A. Suarez Romero	d0e83b6174	broadcom/compiler: change current block on setting spill base The spill base setting instructions (which includes some uniforms) are added in the entry block, not in the current block. When ldunif optimization is applied, the cursor is pointing to instructions in the entry block, but the current block is a different one. This leads to a heap-buffer-overflow when going through the list of instructions (detected by the address sanitizer). Thus change the current block to entry block, and restore it after the setup is done. This fixes dEQP-VK.ssbo.readonly.layout.single_struct.single_buffer.std430_instance_array_comp_access_store_cols with address sanitizer enabled. v2: - Set current block instead of disabling ldunif optimization (Iago) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12221>	2021-08-09 13:15:24 +00:00
Timothy Arceri	a9ed4538ab	nir: add indirect loop unrolling to compiler options This is where it should be rather than having to pass it into the optimisation pass every time. It also allows us to call the loop analysis pass without having to duplicate these options which we will do later in this series. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12064>	2021-08-03 10:54:50 +00:00
Iago Toral Quiroga	d5acae3206	broadcom/compiler: implement nir_intrinsic_load_view_index This is used for multiview's gl_ViewIndex built-in. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12034>	2021-07-27 07:31:31 +00:00
Juan A. Suarez Romero	dc40157888	broadcom/compiler: emit TMU flush before a jump Like in the case of emitting a block, process pending TMU operations before a jump is executed. Fixes dEQP-VK.graphicsfuzz.stable-binarysearch-tree-nested-if-and-conditional. Fixes: `197090a3fc` ("broadcom/compiler: implement pipelining for general TMU operations") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11971>	2021-07-20 10:15:21 +00:00
Iago Toral Quiroga	940725a7d9	broadcom/compiler: implement gl_PrimitiveID in FS without a GS OpenGL ES 3.1 specifies that a geometry shader can write to gl_PrimitiveID, which can then be read by a fragment shader. OpenGL ES 3.2 additionally adds the capacity for the fragment shader to read gl_PrimitiveID even if there is no geometry shader. This commit adds support for this feature, which is also implicitly expected by the geometry shader feature in Vulkan 1.0. Fixes: dEQP-VK.pipeline.framebuffer_attachment.no_attachments dEQP-VK.pipeline.framebuffer_attachment.no_attachments_ms Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11874>	2021-07-14 12:05:56 +00:00
Thomas H.P. Andersen	cf05a7e66f	broadcom/compiler: fix add vs. mul Spotted by a compile warning Fixes: `7f61ff7b4d` ("broadcom/compiler: Merge instructions more efficiently") Reviewed-by: Iago Torral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11764>	2021-07-12 11:54:03 +00:00
Thomas H.P. Andersen	458801e2c3	broadcom/compiler: use correct flag enum They have the same value, so no functional change Reviewed-by: Iago Torral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11764>	2021-07-12 11:54:03 +00:00
Iago Toral Quiroga	ee11e9183d	broadcom/compiler: don't ignore constant offset on per-vertex input loads Fixes: dEQP-VK.clipping.user_defined.clip_distance.vert_geom.{5,6,7,8} Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11783>	2021-07-12 08:35:56 +02:00
Iago Toral Quiroga	e1a24a0047	broadcom/compiler: handle compact input arrays for geometry shaders Clip distance arrays will come as compact array variables, so we need to handle them as such, like we did for vertex inputs. Fixes: dEQP-VK.clipping.user_defined.clip_distance.vert_geom.{1,2,3,4} Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11783>	2021-07-12 08:35:56 +02:00
Iago Toral Quiroga	353f0a180f	broadcom/compiler: create a helper for computing VPM config This code is the same across drivers. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11783>	2021-07-12 08:35:55 +02:00
Iago Toral Quiroga	2733a17b14	broadcom/compiler: track if geometry shaders write gl_PointSize Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11783>	2021-07-12 08:35:55 +02:00
Iago Toral Quiroga	8fada5cb21	broadcom/compiler: use nir_sort_variables_with_modes Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11624>	2021-06-29 10:11:58 +00:00
Iago Toral Quiroga	10313b03b5	broadcom/compiler: track if a compute shader uses subgroup functionality Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	5081de07f7	broadcom/compiler: add a set_a_flags_for_subgroup helper We will need this in the future to implement more subgroup operations, so make this code available in a helper. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	b9f510087d	broadcom/compiler: add a ntq_emit_cond_to_bool helper Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	53341e44ad	broadcom/compiler: implement more subgroup intrinsics Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	87fa5908b3	broadcom/compiler: add FLAFIRST and FLNAFIRST opcodes We will at least need the former to implement subgroupElect() Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	a9ad04f17d	broadcom/compiler: lower nir_intrinsic_load_num_subgroups The number of subgroups is the local workgroup size divided by the dispatch width. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	30dec8b414	broadcom/compiler: implement nir_intrinsic_load_subgroup_id correctly For some reason, this was implemented with the bulk of the compute shader enablement, but this intrinsic is specific to subgroups and thus was not really used. Also, its implementation was not correct, since it was returning the element index within the subgroup, not the subgroup index itself, which is the index of the batch in the dispatch. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	21b0a4c80c	v3dv: don't lower vulkan resource index result to scalar The intrinsic produces a vec2, so let's honor that and avoid the weird lowering to scalar and later reconstruction to vec2 when we find load vulkan descriptor intrinsics. It fixes tests like this (which require that we expose KHR_spirv_1_4): dEQP-VK.spirv_assembly.instruction.spirv1p4.opptrequal.null_comparisons_ssbo_equal that otherwise produce bad code that tries to access a vec2 from the result of that intrinsic, leading to NIR validation errors. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11257>	2021-06-10 05:47:29 +00:00

1 2 3 4 5 ...

572 commits