fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 04:58:05 +02:00

Author	SHA1	Message	Date
Paulo Zanoni	3946bb6b84	anv: extract anv_color_format_supports_drm_modifier_tiling() A 76-line chunk of code just to decide if the format is supported, let's move it to its own function. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39840>	2026-02-17 11:49:47 +00:00
Paulo Zanoni	f160d8db91	anv: unify blit_cts_workaround handling This makes the code easier to read. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39840>	2026-02-17 11:49:46 +00:00
Paulo Zanoni	5fbaef795a	anv: don't pass vk_format to anv_get_image_format_features2() It's redundant information, as it's already part of struct anv_format. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39840>	2026-02-17 11:49:46 +00:00
Paulo Zanoni	3f3e0a4774	anv: anv_get_image_format_features2() can be static No callers outside anv_formats.c. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39840>	2026-02-17 11:49:45 +00:00
Samuel Pitoiset	95c4d8d726	radv/meta: rework get_image_stride_for_96bit() and make it non-static Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39909>	2026-02-17 10:39:01 +00:00
Samuel Pitoiset	c1a507bf42	radv/meta: rename r32g32b32 to 96bit Tt's shorter. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39909>	2026-02-17 10:39:01 +00:00
Samuel Pitoiset	29ce18cb6f	radv/meta: rename some variables for btoi 96-bit shader To match push constants. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39909>	2026-02-17 10:39:01 +00:00
Samuel Pitoiset	9c90622c94	radv: remove a redundant check in radv_image_is_renderable() RADEON_SURF_NO_RENDER_TARGET is already sets for such an image. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39909>	2026-02-17 10:39:01 +00:00
Samuel Pitoiset	61b20e726f	radv/ci: mark more WSI tests as flakes on NAVI21 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39909>	2026-02-17 10:39:01 +00:00
Georg Lehmann	6a662a59b7	nir/opt_algebraic: optimize 1.0 - b2f(a) to b2f(inot(a)) Which can then be cleaned up further. Foz-DB Navi48: Totals from 4156 (3.62% of 114655) affected shaders: MaxWaves: 102580 -> 102620 (+0.04%) Instrs: 11696222 -> 11679986 (-0.14%); split: -0.16%, +0.02% CodeSize: 64452544 -> 64379204 (-0.11%); split: -0.13%, +0.02% VGPRs: 288256 -> 288172 (-0.03%) SpillSGPRs: 7290 -> 7297 (+0.10%) Latency: 160690992 -> 160643825 (-0.03%); split: -0.05%, +0.02% InvThroughput: 26869332 -> 26849963 (-0.07%); split: -0.09%, +0.02% VClause: 237078 -> 237003 (-0.03%); split: -0.04%, +0.01% SClause: 270560 -> 270564 (+0.00%); split: -0.01%, +0.01% Copies: 936165 -> 937970 (+0.19%); split: -0.07%, +0.26% Branches: 302981 -> 302992 (+0.00%); split: -0.00%, +0.00% PreSGPRs: 244967 -> 245303 (+0.14%) PreVGPRs: 232930 -> 232886 (-0.02%); split: -0.02%, +0.00% VALU: 6200283 -> 6187264 (-0.21%); split: -0.23%, +0.02% SALU: 1759176 -> 1760275 (+0.06%); split: -0.10%, +0.16% VOPD: 447502 -> 446194 (-0.29%); split: +0.14%, -0.43% Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39917>	2026-02-17 10:01:21 +00:00
Eric Engestrom	bbb2f884ad	ci: drop redundant MESA_IMAGE Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It's already set by the extended `.set-image` 8 lines further up. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39923>	2026-02-17 08:54:45 +00:00
Krzysztof Sobiecki	615e2d15d1	os: Add support for GNU/HURD compilation and use of dri swrast/llvmpipe. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39277>	2026-02-17 08:24:02 +00:00
Krzysztof Sobiecki	0662febcaf	os: Don't use PATH_MAX as it's not portable. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39277>	2026-02-17 08:24:02 +00:00
Krzysztof Sobiecki	400bec37ed	gallium/dril: Don't use gbm if there is no gbm configured Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39277>	2026-02-17 08:24:02 +00:00
Pavel Ondračka	ee80f909e4	i915/ci: update expectation missed in piglit uprev Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39930>	2026-02-17 07:31:16 +00:00
Samuel Pitoiset	7fceeff970	radv/ci: mark more WSI flakes for NAVI21 Fixes: `c332ee5dd6` ("ci/radv: Add some flakes I hit while testing WSI.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39929>	2026-02-17 08:04:03 +01:00
Timothy Arceri	b59c3ac82a	mesa/st: use same path for setting state ref locations Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details After the fix in `a6fcc2835e` we can now take the same path whether allow_st_finalize_nir_twice is set or not. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39910>	2026-02-16 23:49:41 +00:00
Kenneth Graunke	add69407c7	brw: Use memset for initializing varying/slot maps Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:38 -08:00
Kenneth Graunke	19d9e10f4d	brw: Drop VUE header values and position from wm_prog_data->inputs The FS doesn't read these from the VUE so we don't care about them. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:36 -08:00
Kenneth Graunke	5e48094d72	brw: Drop BRW_VARYING_SLOT_PAD and brw_varying_slot enum In elk, we tried to store our own "driver" enum values after Mesa's VARYING_SLOT_MAX. In brw, we eliminated all of these except for an unnecessary "BRW_VARYING_SLOT_PAD" value. This was used for empty slots, so vue_map::slot_to_varying[] could store something. This patch replaces BRW_VARYING_SLOT_PAD with -1. Our "driver" enum values overlapped with VARYING_SLOT_PATCH0, leading to unnecessary headaches. Now gl_varying_slot_name_for_stage will do the right thing for both regular and patch varyings. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:35 -08:00
Kenneth Graunke	16ab31f358	brw: Use NUM_TOTAL_VARYING_SLOTS instead of VARYING_SLOT_TESS_MAX This is a bit larger, but also clearer. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:34 -08:00
Kenneth Graunke	3dbeaf18c8	iris: Defeature native two-sided color support This drops native support for legacy GL's two-sided color feature in favor of lowering it via nir_lower_two_sided_color(). Instead of having a whole bunch of state management hassle to set up the SBE unit to swizzle between the COL and BFC VUE slots, and have it transparently deliver one or the other to the fragment shader, we simply deliver both and insert a conditional select there: (is-front-facing ? front color : back color) This also works even for > 16 varyings, where swizzling via the SBE unit isn't viable. zink, asahi, freedreno, lima, panfrost, r600, v3d, and vc4 all use this lowering rather than having native support. Only four games in our shader-db even use this feature. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:33 -08:00
Kenneth Graunke	613fe1600e	iris: Drop use of BFC(n) when it exists but COL(n) is unwritten Writing a back-face color but not a front-face color is undefined behavior. We were trying to politely work around potential application bugs, but this is not required to work, and other drivers don't do it. Drop the extra complexity. If we do find a broken application that needs this hack, then a better way to handle it is to have brw_compute_vue_map set the slot for VARYING_SLOT_BFC(n) to the slot for VARYING_SLOT_COL(n) when COL(n) is unwritten. That way, this override is handled at shader compilation time, and the run-time code can remain simple. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:31 -08:00
Kenneth Graunke	b17df36529	iris: Drop sprite coord checks from SBE_SWIZ setup The "override with a constant" handling appears to take precedence over the "override with point sprite coordinates" handling. Because we were overriding undefined inputs to <0, 0, 0, 1>, we needed to avoid this for sprite coordinates, as they aren't written by a previous stage, but shouldn't be overridden to zero. Now that we've dropped that in the previous patch, there's no need to special case sprite coordinates any longer. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:30 -08:00
Kenneth Graunke	4aaa19c3e7	iris: Use the first FS input's value for all undefined FS inputs iris (and i965 before it) tried to to politely return <0, 0, 0, 1.0> as the value of undefined FS inputs. anv, however, just returns the value of the first FS input attribute. This makes iris match anv's behavior, eliminating some overrides and simplifying the code. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:29 -08:00
Kenneth Graunke	b5c23b4785	iris: Drop SBE component overrides for layer/viewport varyings We no longer read the VUE header values in the fragment shader, instead relying on the payload fields. So there's no need to do anything with them here. (Note that OpenGL's rules for preserving exact values of layer/viewport built-ins were relaxed a while back, allowing us to use the payload fields directly. So this code might've been necessary in the past, but it isn't now.) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:24 -08:00
Janne Grunau	139106d0d8	asahi: Add OpenGL / EGL CTS CI expectations Signed-off-by: Janne Grunau <j@jannau.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39763>	2026-02-16 22:44:41 +00:00
Janne Grunau	8aad4cd495	asahi: ci: Rename asahi-g13g Vulkan CTS suite to asahi-agx2 Apple M1 and M2 GPUs are similar enough to use the same deqp-runner suite. Use "agx2" as suffix to cover GPUs implementing the AGX2 ISA. This covers at least the GPUs in all M1 and M2 SoCs. Extend the `renderer_check` to match M2 (G14x) GPUs as well. The original check already included M1 Pro/Max/Utra (G13S, G13C and G13D) erroneously. Signed-off-by: Janne Grunau <j@jannau.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39763>	2026-02-16 22:44:41 +00:00
Pavel Ondračka	1da57c5023	r300/ci: enable glx tests Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36458>	2026-02-16 22:02:34 +00:00
Kenneth Graunke	e0fc4a7c54	brw: Drop brw_compiler option from brw_no_indirect_mask() It's unused. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:49 +00:00
Kenneth Graunke	c2df854359	brw: Make a devinfo temporary in lower_mem_access_bitsizes Less typing. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:49 +00:00
Kenneth Graunke	f873cfd7a0	brw: Pass devinfo to lower_bit_size, not compiler We only need devinfo. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:48 +00:00
Kenneth Graunke	1df2158f50	brw: Delete use_bindless_sampler_offset flag No drivers use this. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:48 +00:00
Kenneth Graunke	4bdef9824a	anv, brw: Consolidate ex_bso bits to a static devinfo inline If we have extended bindless surface offset (ExBSO) support, we want to use it. Consolidate the anv_physical_device and brw_compiler bits into a single static inline that take devinfo. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:47 +00:00
Kenneth Graunke	14c64a88c7	anv: Drop an outdated comment about indirect descriptors The infrastructure was built-up, and this was updated...a while ago. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:46 +00:00
Kenneth Graunke	0a6f6bcd5b	anv: Pass devinfo to anv_shader_compute_fragment_rts, not compiler We don't need anything but devinfo. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:46 +00:00
Kenneth Graunke	06ff3a23dc	anv: Make a devinfo local in anv_shader_lower_nir Shorter to use, and also clearer where something more than devinfo is used from brw_compiler. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:44 +00:00
Kenneth Graunke	aa939db0c5	iris: Move recompile debugging to work on iris program keys iris decides to do recompiles or not based on its own program keys, not the brw or elk keys. So, it makes sense to handle the "why did we have to recompile a new variant" debugging based on those keys as well. It also unifies the code, eliminating a brw/elk split, so it's actually less code. Additionally, this was the only remaining user of the brw code, so we can delete that, resulting in even larger cleanups. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:42 +00:00
Kenneth Graunke	d013ef4c0f	brw: Make use_tcs_multi_patch a static inline taking devinfo This simplifies some iris wrapping for multiple compilers and also saves some space in the brw_compiler singleton. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:42 +00:00
Kenneth Graunke	9531c6b89e	brw: Make indirect_ubos_use_sampler a static inline bool taking devinfo Having the named field allowed us to indicate that our code conditions are referring to the specific decision about how we handle indirect UBOs, rather than some other arbitrary hardware change. Still, there's no need to store this in a singleton struct - we can easily have a static inline bool that does the devinfo check for us. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:42 +00:00
Kenneth Graunke	de03b38daa	intel/elk, hasvk: Drop indirect_ubos_use_sampler option and DP code This is always set to true for elk platforms. No need for the option. crocus also assumes that we take the sampler path. hasvk had support for both paths (leftover from when the driver still supported Gfx12). We started using HDC messages for indirect UBO access on Tigerlake (Gfx12.x) because of cache reworks that made it more viable. On all prior platforms, we used the sampler because it has additional L1/L2 caches that the dataport lacks. Additionally, Ivybridge and nearby platforms had notoriously slow L3 access in some very common cases. Note that we do use the dataport for constant-offset UBO access, since we can combine many reads into larger block loads. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39839>	2026-02-16 21:33:42 +00:00
Erik Faye-Lund	d5551eba27	pan/ci: add missing xfails from nightly run Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39718>	2026-02-16 20:17:27 +00:00
Rhys Perry	e4b8ade092	ac/nir,radv,radeonsi: flip branches to avoid waitcnts fossil-db (navi31): Totals from 5123 (6.42% of 79825) affected shaders: Instrs: 12712435 -> 12703672 (-0.07%); split: -0.12%, +0.05% CodeSize: 67068852 -> 67033244 (-0.05%); split: -0.10%, +0.05% VGPRs: 363896 -> 363956 (+0.02%) SpillSGPRs: 5035 -> 5074 (+0.77%); split: -0.83%, +1.61% Latency: 115048972 -> 111944013 (-2.70%); split: -2.89%, +0.19% InvThroughput: 19102126 -> 18696069 (-2.13%); split: -2.34%, +0.22% VClause: 258693 -> 258770 (+0.03%); split: -0.01%, +0.04% SClause: 346271 -> 346225 (-0.01%); split: -0.02%, +0.00% Copies: 1040815 -> 1042017 (+0.12%); split: -0.23%, +0.34% Branches: 332467 -> 332565 (+0.03%); split: -0.04%, +0.07% PreSGPRs: 304888 -> 304699 (-0.06%); split: -0.10%, +0.04% PreVGPRs: 296652 -> 296654 (+0.00%) VALU: 7591803 -> 7594601 (+0.04%); split: -0.01%, +0.05% SALU: 1454420 -> 1455764 (+0.09%); split: -0.24%, +0.33% VOPD: 1826 -> 1810 (-0.88%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38262>	2026-02-16 19:39:43 +00:00
Rhys Perry	c0143829f9	nir/opt_intrinsics: optimize inot(inverse_ballot(const)) No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38262>	2026-02-16 19:39:43 +00:00
Rhys Perry	f81aaee7f1	aco/ra: create vectors for affinities of split definitions For example: a = ... b = ... if { c, d = split } phi(a, c) phi(b, d) This patch will allocate 'a' and 'b' as a vector. fossil-db (navi31): Totals from 2556 (3.20% of 79825) affected shaders: MaxWaves: 59957 -> 59955 (-0.00%) Instrs: 9170941 -> 9154954 (-0.17%); split: -0.19%, +0.02% CodeSize: 48245956 -> 48182620 (-0.13%); split: -0.15%, +0.02% VGPRs: 189372 -> 189900 (+0.28%); split: -0.04%, +0.32% Latency: 85469322 -> 85262360 (-0.24%); split: -0.32%, +0.08% InvThroughput: 14515911 -> 14486970 (-0.20%); split: -0.27%, +0.07% VClause: 197980 -> 197959 (-0.01%); split: -0.02%, +0.01% Copies: 787838 -> 774288 (-1.72%); split: -1.91%, +0.19% Branches: 271810 -> 271799 (-0.00%); split: -0.01%, +0.01% VALU: 5331813 -> 5318566 (-0.25%); split: -0.28%, +0.03% SALU: 1133559 -> 1133054 (-0.04%); split: -0.05%, +0.01% VOPD: 2435 -> 2418 (-0.70%); split: +0.12%, -0.82% fossil-db (navi21): Totals from 37513 (46.99% of 79825) affected shaders: Instrs: 26734825 -> 26681225 (-0.20%); split: -0.23%, +0.03% CodeSize: 141353284 -> 141144360 (-0.15%); split: -0.17%, +0.02% VGPRs: 1556760 -> 1556384 (-0.02%); split: -0.21%, +0.18% Latency: 146201548 -> 146156473 (-0.03%); split: -0.20%, +0.17% InvThroughput: 33921803 -> 33867398 (-0.16%); split: -0.23%, +0.07% VClause: 502263 -> 502209 (-0.01%); split: -0.27%, +0.26% SClause: 593142 -> 593155 (+0.00%); split: -0.00%, +0.00% Copies: 2600995 -> 2551257 (-1.91%); split: -2.16%, +0.25% Branches: 857910 -> 857787 (-0.01%); split: -0.03%, +0.02% VALU: 15674532 -> 15625013 (-0.32%); split: -0.35%, +0.04% SALU: 4635548 -> 4634680 (-0.02%); split: -0.04%, +0.02% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38262>	2026-02-16 19:39:43 +00:00
Rhys Perry	86f0195f5c	aco/ra: prefer phi operands which don't create waitcnt fossil-db (navi31): Totals from 89 (0.11% of 79825) affected shaders: Instrs: 343443 -> 343384 (-0.02%); split: -0.10%, +0.09% CodeSize: 1792948 -> 1792668 (-0.02%); split: -0.10%, +0.08% Latency: 2656294 -> 2656490 (+0.01%); split: -0.02%, +0.02% InvThroughput: 517696 -> 517691 (-0.00%); split: -0.01%, +0.01% SClause: 9213 -> 9215 (+0.02%); split: -0.01%, +0.03% Copies: 39138 -> 39089 (-0.13%); split: -0.84%, +0.71% Branches: 10863 -> 10872 (+0.08%); split: -0.05%, +0.13% SALU: 49185 -> 49136 (-0.10%); split: -0.67%, +0.57% fossil-db (navi21): Totals from 34490 (43.21% of 79825) affected shaders: Instrs: 23005853 -> 22956529 (-0.21%); split: -0.25%, +0.04% CodeSize: 120532004 -> 120341412 (-0.16%); split: -0.19%, +0.03% VGPRs: 1396928 -> 1397520 (+0.04%); split: -0.07%, +0.11% Latency: 108740068 -> 108499644 (-0.22%); split: -0.53%, +0.30% InvThroughput: 25286526 -> 25358695 (+0.29%); split: -0.11%, +0.39% VClause: 421179 -> 421132 (-0.01%); split: -0.29%, +0.27% SClause: 446414 -> 446423 (+0.00%); split: -0.00%, +0.00% Copies: 2242236 -> 2243168 (+0.04%); split: -0.42%, +0.46% Branches: 724556 -> 724903 (+0.05%); split: -0.02%, +0.07% VALU: 13321078 -> 13321940 (+0.01%); split: -0.07%, +0.08% SALU: 4069929 -> 4070580 (+0.02%); split: -0.02%, +0.03% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38262>	2026-02-16 19:39:43 +00:00
Rhys Perry	310f588f92	aco/ra: move variables from affinity register to avoid waitcnt If we don't use this affinity register, we're likely to end up moving the temporary later. If it's a memory instruction destination, that's probably more expensive than just copying the blocking variables. fossil-db (navi31): Totals from 504 (0.63% of 79825) affected shaders: Instrs: 4108284 -> 4109026 (+0.02%); split: -0.01%, +0.03% CodeSize: 21226764 -> 21229764 (+0.01%); split: -0.01%, +0.02% Latency: 26931635 -> 26806989 (-0.46%); split: -0.47%, +0.00% InvThroughput: 8443520 -> 8439235 (-0.05%); split: -0.06%, +0.01% VClause: 99209 -> 99314 (+0.11%); split: -0.00%, +0.11% SClause: 85089 -> 85085 (-0.00%) Copies: 340323 -> 340993 (+0.20%); split: -0.06%, +0.26% Branches: 117225 -> 117209 (-0.01%); split: -0.02%, +0.00% VALU: 2421859 -> 2422529 (+0.03%); split: -0.01%, +0.04% SALU: 503465 -> 503470 (+0.00%); split: -0.00%, +0.00% fossil-db (navi21): Totals from 582 (0.73% of 79825) affected shaders: Instrs: 3714908 -> 3714990 (+0.00%); split: -0.02%, +0.02% CodeSize: 19977880 -> 19973076 (-0.02%); split: -0.04%, +0.01% VGPRs: 40480 -> 40496 (+0.04%) Latency: 26028895 -> 25772711 (-0.98%); split: -0.99%, +0.00% InvThroughput: 9827389 -> 9818194 (-0.09%); split: -0.10%, +0.01% VClause: 103702 -> 103815 (+0.11%); split: -0.02%, +0.13% SClause: 90861 -> 90857 (-0.00%) Copies: 335276 -> 335992 (+0.21%); split: -0.09%, +0.30% Branches: 123912 -> 123897 (-0.01%); split: -0.02%, +0.00% VALU: 2466032 -> 2466748 (+0.03%); split: -0.01%, +0.04% SALU: 533658 -> 533667 (+0.00%); split: -0.00%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38262>	2026-02-16 19:39:43 +00:00
Rhys Perry	681ec4cba7	aco/ra: track cost of moving variables Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38262>	2026-02-16 19:39:43 +00:00
Rhys Perry	69bc4efa37	aco/sched_ilp: improve scheduling with VMEM/DS->VALU WaW This improves scheduling with one side of a divergent branch writing to a VGPR using VMEM/DS, and the other writing using VALU. At the merge block, it will properly consider that the VGPR was written by a VMEM/DS. fossil-db (navi31): Totals from 1224 (1.53% of 79825) affected shaders: Instrs: `5264815` -> 5267604 (+0.05%); split: -0.00%, +0.06% CodeSize: 27406404 -> 27422132 (+0.06%); split: -0.00%, +0.06% Latency: 48325204 -> 48293975 (-0.06%); split: -0.09%, +0.03% InvThroughput: 8923880 -> 8919191 (-0.05%); split: -0.07%, +0.02% fossil-db (navi21): Totals from 1267 (1.59% of 79825) affected shaders: Instrs: 4628583 -> 4629190 (+0.01%); split: -0.00%, +0.01% CodeSize: 24974672 -> 24977188 (+0.01%); split: -0.00%, +0.01% Latency: 45080476 -> 44998120 (-0.18%); split: -0.20%, +0.02% InvThroughput: 12288202 -> 12269634 (-0.15%); split: -0.16%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38262>	2026-02-16 19:39:43 +00:00
Rhys Perry	88b6b6db17	aco: only consider cost of memory loads at waitcnt We don't run this code before waitcnt insertion, so this isn't necessary. This change improves accuracy in these two situations, because the waitcnt insertion pass is more aware of divergent control flow: v0 = valu if (divergent) { v0 = vmem } else { use(v0) } v0 = vmem if (divergent) { wait vmcnt(0) } else { wait vmcnt(0) } use(v0) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38262>	2026-02-16 19:39:43 +00:00

1 2 3 4 5 ...

218754 commits