mesa/src
Georg Lehmann 13f6be262a aco/insert_exec: only restore wqm mask after control flow if necessary
The next commit will make this not free, so we should avoid it if possible.

Foz-DB Navi31:
Totals from 3933 (4.93% of 79789) affected shaders:
Instrs: 5726914 -> 5727295 (+0.01%); split: -0.00%, +0.01%
CodeSize: 31307100 -> 31308884 (+0.01%); split: -0.00%, +0.01%
SpillSGPRs: 1797 -> 1793 (-0.22%); split: -0.33%, +0.11%
Latency: 58973929 -> 58974343 (+0.00%); split: -0.00%, +0.00%
InvThroughput: 8591893 -> 8591911 (+0.00%); split: -0.00%, +0.00%
SClause: 209074 -> 209115 (+0.02%); split: -0.00%, +0.02%
Copies: 423965 -> 432420 (+1.99%)
Branches: 149976 -> 149979 (+0.00%); split: -0.00%, +0.00%
PreSGPRs: 200175 -> 200663 (+0.24%)
VALU: 3440165 -> 3440156 (-0.00%); split: -0.00%, +0.00%
SALU: 555727 -> 556143 (+0.07%); split: -0.00%, +0.08%

Fixes: b872ff6ef2 ("aco/insert_exec_mask: if applicable, use s_wqm to restore exec after divergent CF")
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34659>
2025-04-23 09:37:50 +00:00
amd aco/insert_exec: only restore wqm mask after control flow if necessary 2025-04-23 09:37:50 +00:00
android_stub
asahi nir/alpha_to_coverage: Add an intrinsic for better dithering 2025-04-23 09:03:41 +00:00
broadcom v3d/ci: move depthstencil-default_fb-drawpixels-24_8 samples=4 to flakes 2025-04-23 09:03:41 +00:00
c11 c11: use SPDX-License-Identifier header 2025-01-08 20:37:51 +00:00
compiler nir/alpha_to_coverage: Add an intrinsic for better dithering 2025-04-23 09:03:41 +00:00
drm-shim
egl EGL: legacy-x11=dri2 should support hardware driver 2025-04-17 21:37:58 +00:00
etnaviv etnaviv/ml: Fix depthwise convolutions 2025-04-22 14:58:34 +00:00
freedreno Uprev Piglit to c50d9aa54f85e0af9d72fab86c73f89356d96399 2025-04-22 21:52:29 +00:00
gallium v3d: Implement dual source blending 2025-04-23 09:03:41 +00:00
gbm meson: support building with system libgbm 2025-04-09 12:15:33 +00:00
getopt
gfxstream gfxstream: update code generator for simplified dep graph 2025-04-09 18:29:57 +00:00
glx glx: drop dead GL_LIB_NAME define 2025-04-18 12:14:11 +00:00
gtest
imagination treewide: Switch to nir_progress 2025-02-26 15:19:53 +00:00
imgui
intel anv: Add new debug flag to show shader stage 2025-04-22 23:09:26 +00:00
loader loader: Use RTLD_LOCAL not RTLD_GLOBAL 2025-04-18 07:14:56 +00:00
mapi glapi: import noop_array and public stubs earlier. 2025-04-04 00:18:28 +00:00
mesa mesa: Add CPU traces 2025-04-15 10:37:39 +00:00
microsoft microsoft/compiler: Force load_output => undef in tess_ctrl main func 2025-04-16 16:02:07 +00:00
nouveau nvk: Maxwell+ is now conformant 2025-04-22 16:54:50 +00:00
panfrost panvk: Set supportsNonZeroFirstInstance=true 2025-04-23 07:08:25 +00:00
tool perfetto/android: align datasource names with tooling expectations 2025-04-08 18:29:10 +00:00
util util/dynarray: add util_dynarray_resize_zero() 2025-04-22 15:29:34 +00:00
virtio venus: virtgpu: Require stable wire format 2025-04-19 21:02:17 +00:00
vulkan vulkan: Add vk_ir_header::dst_leaf_node_offset 2025-04-17 20:20:40 +00:00
x11 glx/egl/x11: fix x11_dri3_check_multibuffer 2025-02-17 02:50:15 +00:00
.clang-format radv: Add radv_foreach_stage to ForEachMacros again. 2025-04-11 18:01:47 +00:00
meson.build meson: support building with system libgbm 2025-04-09 12:15:33 +00:00