From:
defined[\s]*\([\s]*PIPE_(OS|ARCH|CC)_([0-9A-Z_]+)[\s]*\)
To:
DETECT_$1_$2
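For illustration (assuming a typical use site; the DETECT_OS_* macros
are defined in util/detect_os.h), one match of the regex is rewritten
like this:
#if defined(PIPE_OS_LINUX)   /* before */
#if DETECT_OS_LINUX          /* after  */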
Signed-off-by: Yonggang Luo <luoyonggang@gmail.com>
Reviewed-by: Marek Olšák <marek.olsak@amd.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19674>
After 'v3dv: fix debug dump on BO free' we changed the order, and this
led to the following test
dEQP-VK.api.object_management.multithreaded_per_thread_resources.device_memory_small
raising this assertion:
deqp-vk: ../src/broadcom/vulkan/v3dv_bo.c:281: v3dv_bo_alloc: Assertion `bo && bo->handle == 0' failed.
v2: Expanded comment just before the reset, explaining that we need to
do the reset before we free the BO from the kernel (Iago)
Fixes: 2c44597181 ('v3dv: fix debug dump on BO free')
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19693>
This reverts commit cb02cf464c.
There are 3 reported flakes over a period of a month, and we have been
unable to reproduce them even once. The issue clearly doesn't happen
often enough to warrant disabling our Vulkan CI, so let's restore it
while we continue to try to reproduce it on our side.
Signed-off-by: Eric Engestrom <eric@igalia.com>
Acked-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19720>
Imported BOs are not allocated by the device so we don't
update BO stats when they are imported. Therefore, we should
not be updating them when they are freed either.
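A minimal sketch of the idea, assuming the BO carries an is_import
flag and the device keeps illustrative bo_count/bo_size stats:
static void
bo_free_update_stats(struct v3dv_device *device, struct v3dv_bo *bo)
{
   /* Imported BOs never entered the stats, so don't remove them. */
   if (bo->is_import)
      return;
   device->bo_count--;
   device->bo_size -= bo->size;
}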
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19675>
This can cause us to stomp the contents of r5 before we have a chance to read
it, like this:
0x3d103186bb800000 nop ; nop ; ldvary.r0
0x3d105686bbf40000 nop ; mov rf26, r5 ; ldvary.r1
0x020000ef0000d000 bu.allna 232, r:unif (0x0000001c / 0.000000)
0x3d1096c6bbf40000 nop ; mov rf27, r5 ; ldvary.r2
Here, the MOV in the last instruction is supposed to read the r5 value
produced by ldvary.r0, but because we have inserted the bu instruction in
between, that read now happens at the same time that ldvary.r1 updates r5,
stomping the value we were supposed to read.
Fix this by disallowing injection of a branch instruction in between an ldvary
instruction and its write to the r5 register 2 instructions later.
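A minimal sketch of the constraint, with hypothetical names (the real
scheduler tracks this state differently):
/* An ldvary writes r5 two instructions after it is emitted, so refuse
 * to inject a branch while that delayed write is still in flight,
 * otherwise a later ldvary can stomp r5 before it is read. */
static bool
can_inject_branch(int instructions_since_last_ldvary)
{
   return instructions_since_last_ldvary > 2;
}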
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7062
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
cc: mesa-stable
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19616>
All defined in the baremetal-test-arm*
Reviewed-by: Eric Engestrom <eric@igalia.com>
Reviewed-by: Emma Anholt <emma@anholt.net>
Signed-off-by: David Heidelberg <david.heidelberg@collabora.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19548>
We don't need to log in anymore, but we can't use plain minio commands
now. `ci-fairy` gained an `s3cp` helper that keeps an almost identical
API.
Signed-off-by: Benjamin Tissoires <benjamin.tissoires@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19076>
We had been incorrectly assuming there was just one for all the
events; apparently the CTS never uses more than one event.
Fixes: e6884df088 ('v3dv: fix event synchronization')
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19518>
Since we now implement events on the GPU, we need to be more careful
and insert barriers to honor the dependencies provided by the API,
as well as to ensure we synchronize these with the compute
queue, since that is how we implement GPU event functionality.
Fixes: ecb01d53fd ("v3dv: refactor events")
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19458>
These leaks on device creation failure have been there before, but
were only exposed as CTS failures after the recent event refactoring.
Partially fixes:
dEQP-VK.api.device_init.create_instance_device_intentional_alloc_fail.basic
dEQP-VK.api.object_management.alloc_callback_fail.device
dEQP-VK.api.object_management.alloc_callback_fail.device_group
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Reviewed-by: Eric Engestrom <eric@igalia.com>
cc: mesa-stable
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19458>
We are initializing the device, so we know this will be NULL.
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Reviewed-by: Eric Engestrom <eric@igalia.com>
cc: mesa-stable
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19458>
Now that we implement the GPU-side event functions on the GPU we
no longer have the issue that prevented us from exposing
sync_fd.
Furthermore, new spec text has also made the problematic
behavior undefined, so the test that caused this issue,
dEQP-VK.api.external.semaphore.sync_fd.import_twice_temporary,
is incorrect and should be fixed.
It should be noted that we still keep sync_fd disabled in the
simulator, at least until the CTS tests are fixed, since the
synchronous execution model of the simulator means that in the
problematic scenario we can block the CPU on the execution
of the command buffer before we ever submit the signaling job,
still causing a deadlock.
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19313>
This replaces our current implementation, which is 100% CPU based,
with an implementation that uses compute shaders for the GPU-side
event functions. The benefit of this solution is that we no longer
need to stall on the CPU when we need to handle GPU-side event
commands.
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19313>
In Vulkan, we load descriptors via the vulkan resource index intrinsic,
which returns a vec2, of which we want component 0, which holds the
actual index. Typically, this will be cleaned up by the time we get to
emitting VIR so the index is a single scalar component, but there
are some cases where this might not be the case, so make sure we don't
assume it to be a scalar, like we do in other places.
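The general NIR-level idea, as a sketch rather than the driver's exact
code path (assume b is a nir_builder and res_index is the vec2 result
of the vulkan resource index intrinsic):
/* Explicitly take component 0, which holds the actual index, instead
 * of assuming res_index has already been reduced to a scalar. */
nir_ssa_def *index = nir_channel(&b, res_index, 0);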
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19313>
In 7f6ecb8667 we added reference counting for descriptor set layouts;
however, we didn't realize that pools created without the flag
VK_DESCRIPTOR_POOL_CREATE_FREE_DESCRIPTOR_SET_BIT don't allow freeing
individual descriptor sets and can only be reset or destroyed. Since we
only dropped references when individual descriptor sets were destroyed,
we would leak set layouts referenced from descriptor sets allocated
from these pools.
Fix that by keeping a list of all allocated descriptor sets (no matter
whether VK_DESCRIPTOR_POOL_CREATE_FREE_DESCRIPTOR_SET_BIT is present or
not) and traversing that list to drop the references when the pool is
reset or destroyed.
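A rough sketch of the approach, using Mesa's util/list.h and
hypothetical field/helper names:
/* On pool reset/destroy, walk every set allocated from the pool and
 * drop its reference on the set layout. */
list_for_each_entry_safe(struct v3dv_descriptor_set, set,
                         &pool->set_list, pool_link) {
   v3dv_descriptor_set_layout_unref(device, set->layout);
   list_del(&set->pool_link);
}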
Fixes: 7f6ecb8667 ('v3dv: add reference counting for descriptor set layouts')
Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19337>
nir_opt_gcm gets us worse shader-db stats, but that is expected. What we
want to prevent is getting worse values on spills/fills. Analyzing the
outcome with shader-db, this mostly happens with shaders that are
already complex and are already spilling/filling.
So the best option here is adding a new strategy that falls back if
we get spills/fills when using nir_opt_gcm.
It is not clear at which point in the strategy order we should disable
gcm; for now we disable it before loop unrolling.
We get a slight performance gain (on average) using nir_opt_gcm.
We don't show the shader-db stats, as they are worse, but as mentioned,
this is expected.
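A rough sketch of the fallback mechanism (helper and field names are
hypothetical; the real code iterates the driver's strategy table):
static struct v3d_compile *
compile_with_fallback(nir_shader *shader)
{
   for (unsigned i = 0; i < ARRAY_SIZE(strategies); i++) {
      struct v3d_compile *c =
         compile_with_strategy(shader, &strategies[i]);
      /* Keep the first result that doesn't spill; the last strategy
       * is the fallback and is always accepted. */
      if (c->spills == 0 || i == ARRAY_SIZE(strategies) - 1)
         return c;
      vir_compile_destroy(c);
   }
   return NULL;
}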
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17185>
That allows us to reduce the number of parameters of the method. And
after all, they were already filled from an existing strategy struct.
This will make it easier to add new fields to a strategy.
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17185>
Instead of using a custom optimize_nir method with the same purpose.
Running the fossils for the well-known v3dv applications (ue4 demos,
Quake3d, etc.) we got a somewhat inconclusive outcome in general,
although slightly worse values:
Totals:
Instrs: 265129 -> 265277 (+0.06%); split: -0.06%, +0.12%
Thread Count: 5504 -> 5506 (+0.04%)
Totals from 153 (10.23% of 1495) affected shaders:
Instrs: 84603 -> 84751 (+0.17%); split: -0.19%, +0.37%
Thread Count: 316 -> 318 (+0.63%)
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17185>
These are optimizations that we are already calling on the Vulkan
driver, as preparation for the Vulkan frontend to use v3d_optimize_nir
too.
We need to add a new parameter to v3d_optimize_nir in order to know if
we can call nir_opt_find_array_copies. As we don't track whether we
have already called nir_lower_var_copies, we explicitly call it when we
create the uncompiled shader. So instead of tracking it, we assume that
each driver (v3d/v3dv) calls it when the shader is created, and when
v3d_optimize_nir is called as part of the compile process in the
compiler, we call it with allow_copies set to false.
We exclude nir_opt_gcm on purpose, as it is a case of an optimization
that could help performance even if it hurts shader-db stats.
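A minimal sketch of how the new allow_copies parameter is meant to be
used, assuming this signature (most of the pass list is elided):
void
v3d_optimize_nir(struct v3d_compile *c, struct nir_shader *s,
                 bool allow_copies)
{
   bool progress;
   do {
      progress = false;
      /* Only look for array copies while var copies haven't been
       * lowered yet, i.e. right after the uncompiled shader was
       * created by the frontend. */
      if (allow_copies)
         NIR_PASS(progress, s, nir_opt_find_array_copies);
      NIR_PASS(progress, s, nir_lower_var_copies);
      /* ... remaining passes, some of them using c ... */
   } while (progress);
}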
shaderdb stats:
total instructions in shared programs: 11705923 -> 11705034 (<.01%)
instructions in affected programs: 88350 -> 87461 (-1.01%)
helped: 201
HURT: 80
Instructions are helped.
total threads in shared programs: 375552 -> 375558 (<.01%)
threads in affected programs: 6 -> 12 (100.00%)
helped: 3
HURT: 0
total uniforms in shared programs: 3486108 -> 3485789 (<.01%)
uniforms in affected programs: 7473 -> 7154 (-4.27%)
helped: 90
HURT: 1
Uniforms are helped.
total max-temps in shared programs: 2021860 -> 2021802 (<.01%)
max-temps in affected programs: 800 -> 742 (-7.25%)
helped: 21
HURT: 3
Max-temps are helped.
total sfu-stalls in shared programs: 19299 -> 19296 (-0.02%)
sfu-stalls in affected programs: 18 -> 15 (-16.67%)
helped: 10
HURT: 7
Inconclusive result (value mean confidence interval includes 0).
total inst-and-stalls in shared programs: 11725222 -> 11724330 (<.01%)
inst-and-stalls in affected programs: 88402 -> 87510 (-1.01%)
helped: 201
HURT: 80
Inst-and-stalls are helped.
total nops in shared programs: 269674 -> 269386 (-0.11%)
nops in affected programs: 3641 -> 3353 (-7.91%)
helped: 103
HURT: 29
Nops are helped.
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17185>
For the non-SSA case, we were trying to use reg->num_components. But
this is not the same as nir_ssa_def_components_read: it is the
number of components of the destination register. And in the 16-bit
case, even if nir_lower_tex packs the outcome, it doesn't update the
number of components, as nir_tex_instr_dest_size would still return
4, and nir validate checks that those values are the same.
So this change focuses on the last part of this comment in
nir_lower_tex:
* Note that we don't change the destination num_components, because
* nir_tex_instr_dest_size() will still return 4. The driver is just
* expected to not store the other channels, given that nothing at the
* NIR level will read them.
We just limit how many channels we would use for the f16 case.
It is also worth noting, based on the CTS and the different
applications we test, that this is a corner case.
This was detected when we experimented with enabling nir_opt_gcm for
v3d, which led to raising an assertion slightly further below with some
shader-db tests, but technically it could happen without it.
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17185>
For compute shaders, avoiding a crash with that optimization requires
doing some optimizations and lowerings beforehand. Example:
static void
lower_cs_shared(struct nir_shader *nir)
{
   NIR_PASS_V(nir, nir_lower_vars_to_explicit_types,
              nir_var_mem_shared, shared_type_info);
   NIR_PASS_V(nir, nir_lower_explicit_io,
              nir_var_mem_shared, nir_address_format_32bit_offset);
}
In the same way, other drivers (like anv) call
nir_opt_load_store_vectorize as part of their post-process NIR step.
So one option would be to move nir_opt_load_store_vectorize outside
the common v3d_optimize_nir, to a post-process NIR method.
To make things simpler, this change calls that optimization only if we
have a v3d_compiler object, that is, when each frontend has already
done its lowerings and calls the v3d_compiler to get the final
assembly (so we are already in a kind of post-process NIR step).
This avoids dEQP-VK.memory_model.shared.basic_types.3 crashing if we
start calling v3d_optimize_nir from v3dv directly.
Slight shader-db changes, but not significant.
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17185>
Even if there is a slight difference in meaning between FIXME and
TODO, at some point we agreed to use just FIXME for all pending things
to do, just to make it easier to grep for things that can be done.
And after all, one could argue that if there is something pending TO
DO, it is because it needs FIXING.
Acked-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19225>
Let the returned error bubble up.
Fixes: dEQP-VK.api.device_init.create_instance_device_intentional_alloc_fail.basic
Fixes: 591103d04d ("v3dv: don't return incompatible driver if GPU is not present")
Signed-off-by: Eric Engestrom <eric@igalia.com>
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18901>
If the pipeline was created with the creation flags
VK_PIPELINE_CREATE_CAPTURE_STATISTICS_BIT_KHR or
VK_PIPELINE_CREATE_CAPTURE_INTERNAL_REPRESENTATIONS_BIT_KHR, it is
really likely that the methods from VK_KHR_pipeline_executable_properties
that require having the QPU instructions around will be called.
Instead of reading those back from the BO where we upload them, we
just keep them around. This could require more host memory, but it
lets us avoid having to map/unmap the BO on demand (which would need
the host memory anyway), something that can be tricky if those
methods are called from different threads, and so we also avoid
adding a mutex there.
In the same way, if the pipeline was not created with those flags, we
skip collecting the data that requires the QPU instructions. Only
GetPipelineExecutableProperties is allowed to be called without any of
those flags, and it doesn't require that info.
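A minimal sketch of the idea, with assumed field names on the
pipeline stage:
const VkPipelineCreateFlags capture_flags =
   VK_PIPELINE_CREATE_CAPTURE_STATISTICS_BIT_KHR |
   VK_PIPELINE_CREATE_CAPTURE_INTERNAL_REPRESENTATIONS_BIT_KHR;
if (pipeline->flags & capture_flags) {
   /* Keep a host copy of the QPU code so the executable-properties
    * queries never need to map the BO again. */
   p_stage->qpu_insts_size = qpu_size;
   p_stage->qpu_insts = malloc(qpu_size);
   memcpy(p_stage->qpu_insts, qpu_insts, qpu_size);
}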
This fixes a race condition crash in GetPipelineExecutableProperties
when using fossilize-replay with some fossils with several shaders
and several threads, as one thread could be unmapping the BO before
another thread had stopped using it.
Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18859>