fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 13:38:19 +02:00

Author	SHA1	Message	Date
Iago Toral Quiroga	3a95e25e84	v3dv,v3d: don't store swizzle pointer in shader/pipeline keys We had been storing pointers to a driver owned swizzle table rather than storing the actual swizzle value in various shader and pipeline keys on both GL and Vulkan drivers. This doesn't look very robust, particularly since we also compute sha1 hashes from these values and we may store these hashes to disk (for the disk cache). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13738>	2021-11-10 11:24:26 +00:00
Iago Toral Quiroga	aa5a0e1dad	broadcom/compiler: copy packing when converting add to mul Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13675>	2021-11-04 13:57:39 +00:00
Iago Toral Quiroga	a794bdf953	broadcom/compiler: check that sig packing is valid when pipelining ldvary Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13641>	2021-11-03 10:49:06 +00:00
Iago Toral Quiroga	6b9bd3f038	broadcom/compiler: make opt passes set current block Typically, optimization passes go through all the blocks in a shader and make adjustments on the fly, so we always want them to update the current block or the current block pointer will become outdated. Also, we don't need to keep track of the previous current block pointer to restore it, since optimization passes run after we have completed conversion to VIR, and therefore, anything that comes after that should always set the current block before emitting code. Fixes debug assert crashes when running shader-db: vir.c:1888: try_opt_ldunif: Assertion `found \|\| &c->cur_block->instructions == c->cursor.link' failed Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13625>	2021-11-02 11:17:01 +00:00
Iago Toral Quiroga	3fbd6662b7	broadcom/compiler: rework simultaneous peripheral access checks This was not quite correct in that our checks for the allowed cases were not checking that there were no other peripheral access other than the ones allowed. For example, we allowed wrtmuc signal and TMU write other than TMUC, and we also allowed TMU read and VPM read/write. But we cannot allow wrtmuc with TMU write other than TMUC and at the same time a VPM write for example, so we can't just check if we have a combination of allowed peripherals, we still need to check that those are the only ones in use by the combined instructions. Another example is that even if we allow a TMU write (other than TMUC) with a wrtmuc signal, the resulting instruction must still have just one TMU write other than TMUC, but we were allowing the merge if one instruction signaled wrtmuc and the other wrote to tmu other than tmuc without testing if the combined result would have 2 tmu writes. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13527>	2021-10-27 06:03:12 +00:00
Iago Toral Quiroga	1561d0126a	broadcom/compiler: fix assert that current instruction must be in current block This was not considering the possibility that the driver has called nir_before_block() or nir_after_block() to update the cursor, in which case the cursor link points to the instruction list header and not to an actual instruction. Fixes incorrect debug-assert crash in: dEQP-VK.graphicsfuzz.cov-increment-vector-component-with-matrix-copy Fixes: `265515fa62` ("broadcom/compiler: check instruction belongs to current block") Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13467>	2021-10-22 05:39:05 +00:00
Iago Toral Quiroga	75bd37dc6a	broadcom/compiler: disallow tsy barrier in thrsw delay slots A TSY barrier becomes effective at the point of the next thread switch, so if we have one coming after a previous thread switch we need to be careful not to emit it in its delay slots, or we would be effectively moving the barrier earlier than intended. Fixes simulator assert crash in: dEQP-VK.graphicsfuzz.two-for-loops-with-barrier-function Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13468>	2021-10-21 12:40:00 +02:00
Alejandro Piñeiro	d50be41f8f	broadcom/compiler: remove unused macro and function definition Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13444>	2021-10-20 10:08:27 +00:00
Alejandro Piñeiro	9e41c42ed4	broadcom/compiler: remove qpu_acc helper It is really small, and used just twice, so we just call qpu_magic. We also update how it is used: * QFILE_NULL is an undef so we can just load anything. Previously we were using accumulator 0, but there isn't any real reason to use an accumulator for this. Using reg 0. * QFILE_LOAD_IMM: it seems that we don't use at all right now, so let's add an assert Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13008>	2021-09-24 08:46:06 +00:00
Alejandro Piñeiro	193898c8b0	broadcom/compiler: remove commented out vir_LOAD_IMM methods It has been commented several years now. Let's remove it to reduce the noise. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13008>	2021-09-24 08:46:06 +00:00
Juan A. Suarez Romero	d220d8cb51	broadcom/compiler: add V3D_DEBUG_NO_LOOP_UNROLL debug option Disables loop unrolling. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12803>	2021-09-13 08:51:54 +00:00
Ella-0	53ae5c3aae	v3d/compiler: Handle point_coord_upper_left Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12524>	2021-09-12 21:01:11 +00:00
Juan A. Suarez Romero	c98ddc778a	broadcom/compiler: force a last thrsw for spilling As we don't know if we are going to have spilling or not, emit always a last thrsw at the end of the shader. If later we don't have spillings and we don't need that last thrsw, we remove it and switch back to the previous one. This way we ensure all the spilling happens always before the last thrsw. v2 (Juan): - Rework the code to force a last thrsw and remove later if no spilling v3: - Merge functionality inside vir_emit_last_thrsw (Iago) - Add vir_restore_last_thrsw (Juan) v4 (Iago): - Fix/add new comments - Rename variables/parameters v5 (Iago): - Fix comments - Add assertion Cc: mesa-stable Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4760 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12322>	2021-09-10 09:18:05 +00:00
Juan A. Suarez Romero	53c8b4c093	broadcom: make vir_emit_last_thrsw() private This function is only used in v3d_nir_to_vir(), so make it private. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12322>	2021-09-10 09:18:05 +00:00
Juan A. Suarez Romero	265515fa62	broadcom/compiler: check instruction belongs to current block Check in the ldunif optimization if the current instruction belongs to current block. These avoids again searching the instruction when current block is not correctly set, as it happened in https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12339 and in https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12221. v2: - Remove extra blank line (Iago) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12358>	2021-09-06 10:38:06 +00:00
Iago Toral Quiroga	3ef2ca9cbf	broadcom/compiler: don't enable early fragment tests if shader writes Z We had an optimization to auto-enable early fragment tests when a shader didn't have side effects, but of course, we cannot do that this if the shader writes Z, as in that case the fragment tests need to use the value written from the shader. Also, if the shader enables early fragment tests, then any shader Z writes should be ignored. Fixes: dEQP-VK.spirv_assembly.instruction.graphics.early_fragment.* Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12736>	2021-09-06 12:01:43 +02:00
Vinson Lee	0a4c4f4459	broadcom/compiler: Fix qpu.flags.muf typo. Fix defect reported by Coverity Scan. Same on both sides (CONSTANT_EXPRESSION_RESULT) pointless_expression: The expression inst->qpu.flags.auf != V3D_QPU_UF_NONE \|\| inst->qpu.flags.auf != V3D_QPU_UF_NONE does not accomplish anything because it evaluates to either of its identical operands, inst->qpu.flags.auf != V3D_QPU_UF_NONE. Fixes: `3f2c54a27f` ("broadcom/compiler: rewrite partial update liveness tracking") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12385>	2021-08-24 08:30:59 +00:00
Juan A. Suarez Romero	2a86d51960	broadcom/compiler: set current block on incrementing unifa When incrementing unifa address in DCE optimization, ensure that we setup correctly the current block, so the ldfunif optimization is also executed correctly. This fixes dEQP-VK.graphicsfuzz.cov-struct-float-array-mix-uniform-vectors heap-buffer overflow with address sanitizer enabled. v2 (Iago): - Save and restore current block Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12339>	2021-08-12 12:33:46 +00:00
Iago Toral Quiroga	3f2c54a27f	broadcom/compiler: rewrite partial update liveness tracking The code we had for this was a work in progress and not finished. Also, it was geared towards partial writes caused by output packing (i.e. fp16) and was ignoring partial updates caused by conditional writes, which are far more common in our case. This change provides an implementation for tracking conditional writes that works in tandem with the previous spill change to narrow liveness for their spills. Fixes register allocation failures in: dEQP-VK.graphicsfuzz.spv-stable-maze-flatten-copy-composite We also gain one shader from shader-db: total instructions in shared programs: 13339969 -> 13338584 (-0.01%) instructions in affected programs: 185520 -> 184135 (-0.75%) helped: 375 HURT: 130 Instructions are helped. total threads in shared programs: 412038 -> 412040 (<.01%) threads in affected programs: 2 -> 4 (100.00%) helped: 1 HURT: 0 total uniforms in shared programs: 3746581 -> 3746585 (<.01%) uniforms in affected programs: 49 -> 53 (8.16%) helped: 0 HURT: 1 total max-temps in shared programs: 2359960 -> 2359947 (<.01%) max-temps in affected programs: 289 -> 276 (-4.50%) helped: 7 HURT: 0 Max-temps are helped. total sfu-stalls in shared programs: 34351 -> 34359 (0.02%) sfu-stalls in affected programs: 218 -> 226 (3.67%) helped: 35 HURT: 37 Inconclusive result (value mean confidence interval includes 0). total inst-and-stalls in shared programs: 13374320 -> 13372943 (-0.01%) inst-and-stalls in affected programs: 186653 -> 185276 (-0.74%) helped: 373 HURT: 132 Inst-and-stalls are helped. LOST: 0 GAINED: 1 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12278>	2021-08-10 08:47:40 +00:00
Iago Toral Quiroga	c335c03ae2	broadcom/compiler: make spills of conditional writes also conditional A spill of a conditional write generates code like this: mov.ifa t5000, 0 mov tmud, t5000 nop t5001; ldunif (0x00008100 / 0.000000) add tmua, t11, t5001 Here, we are spilling t5000, which has a conditional write, and we produce an inconditional spill for it. This implicitly means that our spill requires a correct value for all channels of t5000. If we do a conditional spill, then we emit: mov.ifa t5000, 0 mov tmud.ifa, t5000 nop t5001; ldunif (0x00008100 / 0.000000) add tmua.ifa, t11, t5001 Which only uses channels of t5000 that have been written by the instruction being spilled. By doing the latter, we can then narrow down the liveness for t5000 more effectively, as we can use this to detect that the block only reads (in the tmud instruction) the values that have been written previously in the same block (in the mov instruction). This means that values in other channels are not used, and therefore, we don't need them to be alive at the start of the block. This means that if this is the only write of t5000 in this block, we can consider that the block completely defines t5000. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12278>	2021-08-10 08:47:40 +00:00
Iago Toral Quiroga	314eb97dcb	broadcom/compiler: Flags are per-thread state in V3D 4.2+ This means they survive a thread switch, so we can remove redundant flag setups across thread switches. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12278>	2021-08-10 08:47:40 +00:00
Iago Toral Quiroga	b727eaac3c	broadcom/compiler: add a vir_get_cond helper Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12278>	2021-08-10 08:47:40 +00:00
Juan A. Suarez Romero	d0e83b6174	broadcom/compiler: change current block on setting spill base The spill base setting instructions (which includes some uniforms) are added in the entry block, not in the current block. When ldunif optimization is applied, the cursor is pointing to instructions in the entry block, but the current block is a different one. This leads to a heap-buffer-overflow when going through the list of instructions (detected by the address sanitizer). Thus change the current block to entry block, and restore it after the setup is done. This fixes dEQP-VK.ssbo.readonly.layout.single_struct.single_buffer.std430_instance_array_comp_access_store_cols with address sanitizer enabled. v2: - Set current block instead of disabling ldunif optimization (Iago) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12221>	2021-08-09 13:15:24 +00:00
Timothy Arceri	a9ed4538ab	nir: add indirect loop unrolling to compiler options This is where it should be rather than having to pass it into the optimisation pass every time. It also allows us to call the loop analysis pass without having to duplicate these options which we will do later in this series. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12064>	2021-08-03 10:54:50 +00:00
Iago Toral Quiroga	d5acae3206	broadcom/compiler: implement nir_intrinsic_load_view_index This is used for multiview's gl_ViewIndex built-in. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12034>	2021-07-27 07:31:31 +00:00
Juan A. Suarez Romero	dc40157888	broadcom/compiler: emit TMU flush before a jump Like in the case of emitting a block, process pending TMU operations before a jump is executed. Fixes dEQP-VK.graphicsfuzz.stable-binarysearch-tree-nested-if-and-conditional. Fixes: `197090a3fc` ("broadcom/compiler: implement pipelining for general TMU operations") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11971>	2021-07-20 10:15:21 +00:00
Iago Toral Quiroga	940725a7d9	broadcom/compiler: implement gl_PrimitiveID in FS without a GS OpenGL ES 3.1 specifies that a geometry shader can write to gl_PrimitiveID, which can then be read by a fragment shader. OpenGL ES 3.2 additionally adds the capacity for the fragment shader to read gl_PrimitiveID even if there is no geometry shader. This commit adds support for this feature, which is also implicitly expected by the geometry shader feature in Vulkan 1.0. Fixes: dEQP-VK.pipeline.framebuffer_attachment.no_attachments dEQP-VK.pipeline.framebuffer_attachment.no_attachments_ms Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11874>	2021-07-14 12:05:56 +00:00
Thomas H.P. Andersen	cf05a7e66f	broadcom/compiler: fix add vs. mul Spotted by a compile warning Fixes: `7f61ff7b4d` ("broadcom/compiler: Merge instructions more efficiently") Reviewed-by: Iago Torral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11764>	2021-07-12 11:54:03 +00:00
Thomas H.P. Andersen	458801e2c3	broadcom/compiler: use correct flag enum They have the same value, so no functional change Reviewed-by: Iago Torral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11764>	2021-07-12 11:54:03 +00:00
Iago Toral Quiroga	ee11e9183d	broadcom/compiler: don't ignore constant offset on per-vertex input loads Fixes: dEQP-VK.clipping.user_defined.clip_distance.vert_geom.{5,6,7,8} Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11783>	2021-07-12 08:35:56 +02:00
Iago Toral Quiroga	e1a24a0047	broadcom/compiler: handle compact input arrays for geometry shaders Clip distance arrays will come as compact array variables, so we need to handle them as such, like we did for vertex inputs. Fixes: dEQP-VK.clipping.user_defined.clip_distance.vert_geom.{1,2,3,4} Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11783>	2021-07-12 08:35:56 +02:00
Iago Toral Quiroga	353f0a180f	broadcom/compiler: create a helper for computing VPM config This code is the same across drivers. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11783>	2021-07-12 08:35:55 +02:00
Iago Toral Quiroga	2733a17b14	broadcom/compiler: track if geometry shaders write gl_PointSize Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11783>	2021-07-12 08:35:55 +02:00
Iago Toral Quiroga	8fada5cb21	broadcom/compiler: use nir_sort_variables_with_modes Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11624>	2021-06-29 10:11:58 +00:00
Iago Toral Quiroga	10313b03b5	broadcom/compiler: track if a compute shader uses subgroup functionality Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	5081de07f7	broadcom/compiler: add a set_a_flags_for_subgroup helper We will need this in the future to implement more subgroup operations, so make this code available in a helper. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	b9f510087d	broadcom/compiler: add a ntq_emit_cond_to_bool helper Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	53341e44ad	broadcom/compiler: implement more subgroup intrinsics Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	87fa5908b3	broadcom/compiler: add FLAFIRST and FLNAFIRST opcodes We will at least need the former to implement subgroupElect() Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	a9ad04f17d	broadcom/compiler: lower nir_intrinsic_load_num_subgroups The number of subgroups is the local workgroup size divided by the dispatch width. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	30dec8b414	broadcom/compiler: implement nir_intrinsic_load_subgroup_id correctly For some reason, this was implemented with the bulk of the compute shader enablement, but this intrinsic is specific to subgroups and thus was not really used. Also, its implementation was not correct, since it was returning the element index within the subgroup, not the subgroup index itself, which is the index of the batch in the dispatch. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11620>	2021-06-29 08:43:06 +02:00
Iago Toral Quiroga	21b0a4c80c	v3dv: don't lower vulkan resource index result to scalar The intrinsic produces a vec2, so let's honor that and avoid the weird lowering to scalar and later reconstruction to vec2 when we find load vulkan descriptor intrinsics. It fixes tests like this (which require that we expose KHR_spirv_1_4): dEQP-VK.spirv_assembly.instruction.spirv1p4.opptrequal.null_comparisons_ssbo_equal that otherwise produce bad code that tries to access a vec2 from the result of that intrinsic, leading to NIR validation errors. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11257>	2021-06-10 05:47:29 +00:00
Caio Marcelo de Oliveira Filho	8af6766062	nir: Move workgroup_size and workgroup_variable_size into common shader_info Move it out the "cs" sub-struct, since these will be used for other shader stages in the future. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11225>	2021-06-08 09:23:55 -07:00
Caio Marcelo de Oliveira Filho	c8a7bd0dc8	nir: Rename WORK_GROUP (and similar) to WORKGROUP Be consistent with other usages in Vulkan and SPIR-V, and the recently added workgroup_size field. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Caio Marcelo de Oliveira Filho	430d2206da	compiler: Rename local_size to workgroup_size Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11190>	2021-06-07 22:34:42 +00:00
Eric Anholt	ec3bc5da74	v3d: Use the ra_alloc_contig_reg_class() function to speed up RA. It means we don't need to do the n^2 loop over the regs to set up the pq values, nor do we need the register conflicts lists. Acked-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9437>	2021-06-04 19:08:57 +00:00
Eric Anholt	95d41a3525	ra: Use struct ra_class in the public API. All these unsigned ints are awful to keep track of. Use pointers so we get some type checking. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9437>	2021-06-04 19:08:57 +00:00
Alejandro Piñeiro	4e9f1261ee	broadcom/compiler: use proper type field for atomic operations We were using the num_components to infer it, but in the end it is VEC2 for CMPXCHG and 32BIT for anything else. This doesn't affect any test with the real hw, but fixes an assert with the last version of the simulator. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11039>	2021-06-01 12:22:22 +02:00
Iago Toral Quiroga	f07c797e93	v3dv: implement vkCmdDispatchBase This was added with VK_KHR_device_group and allows users to specify a base offset that will be automatically added to gl_WorkGroupID. Unfortunately, V3D doesn't support this natively, so we need to add the base to the workgroup id generated by hardware manually. For this, we inject add instructions that source from a QUNIFORM that will retrieve the actual dispatch base from the compute job when it is dispatched. Since a compute shader can be dispatched with CmdDispatch and/or CmdDispatchBase, we always need to add these additional add instructions and use a base of (0,0,0) for regular dispatches. Since we don't support any version of OpenGL with this dispatch base functionality we can avoid the extra instructions there. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11037>	2021-05-31 09:06:18 +00:00
Iago Toral Quiroga	39c41169ba	broadcom/compiler: consider RT component size when lowering logic ops in Vulkan In Vulkan we configure our integer RTs to clamp automatically, so with logic operations we need to be careful and avoid overflows by discarding any bits that won't fit in the RT component size. Fixes remaining CTS test failures in: dEQP-VK.pipeline.logic_op.* Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10801>	2021-05-18 11:28:17 +00:00

1 2 3 4 5 ...

564 commits