fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 09:18:04 +02:00

Author	SHA1	Message	Date
Michael Cheng	f002b34576	hasvk: enable perf warning logging in release builds Enable perf warning in release builds for hasvk. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40551>	2026-03-24 21:42:33 +00:00
Michael Cheng	ebe94d4903	anv: enable perf warning logging in release builds Call process_intel_debug_variable() early in anv_CreateInstance() so the intel_debug bitset is populated, then set enable_debug_logging when INTEL_DEBUG=perf is active. This makes anv_perf_warn() messages visible in non-debug builds. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40551>	2026-03-24 21:42:33 +00:00
Michael Cheng	7d787ba8c6	vulkan/runtime: allow drivers to enable vk_log output in release builds In non-debug builds, __vk_log_impl() silently drops all messages due to two compile/link-time gates: an early return when no debug callbacks are registered, and the MESA_VK_LOG=0 guard around the mesa_log*() calls. Add vk_instance::enable_debug_logging so drivers can opt in to log output at runtime. When set, both gates are bypassed. No functional change without a driver setting the flag. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40551>	2026-03-24 21:42:33 +00:00
Tim Van Patten	1e04e7ee74	anv: Enable Vulkan 1.4 for SDK 37+ Enable Vulkan 1.4 for SDK 37+ to satisfy the VRA17 (Vulkan Requirement for Android 17). Signed-off-by: Tim Van Patten <timvp@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40593>	2026-03-24 21:15:45 +00:00
juntak0916	dd3e153a10	nvk: fix BindImageMemory2 per-bind status result The per-bind status was always being set to VK_SUCCESS instead of the actual result from nvk_bind_image_memory. Fixes: `93792b5ef2` ("nvk: Add static wrappers for image/buffer binding") Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40579>	2026-03-24 21:00:50 +00:00
Job Noorman	b2a44da9e9	ir3/legalize: don't drop sync flags on removed predt/predf When a predt/predf branch can be removed, any sync flags set on the terminator were removed as well. Fix this by copying these flags to the prede that replaces the terminator. Fixes frame instability in "Devil May Cry 5" and "Resident Evil 3". Signed-off-by: Job Noorman <jnoorman@igalia.com> Fixes: `39088571f0` ("ir3: add support for predication") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40608>	2026-03-24 20:33:31 +00:00
Mel Henning	e46f596325	nir/mem_access_bit_sizes: Handle global_bounded Fixes: `f7ad45e5fc` ("nak: support has_load_global_bounded on turing and newer") Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40577>	2026-03-24 18:55:30 +00:00
Mel Henning	f9a847114d	nir/lower_io: Add global_bounded to io_offset_src along with constant and offset variants Fixes: `f7ad45e5fc` ("nak: support has_load_global_bounded on turing and newer") Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40577>	2026-03-24 18:55:30 +00:00
Sagar Ghuge	af2d51eafa	anv: enable BTP+BTI RCC keying for some workloads We can drop RT flush and PS Scoreboard stall if state cache perf fix disabled is set to 1. If the bit is set, RCC uses the sum of Binding Table Pointer and Binding Table Index as tag in state cache instead of just Binding Table Index. On DX12 this is a performance win on all workloads we've tested. On DX11 there are a number of performance regressions. We think this is due to the fact that to avoid trashing the RCC, we need to remove all but render targets from the binding table, meaning all shader resource accesses have to go through the bindless HW heap. This leads to additional register usage due to the need to push the base offset of descriptor sets. Improvement in the compiler would likely mitigate this. This change introduces a DRIRC key we only turn on for DX12. Also platforms prior to DG2/LSC have a really small bindless heap that leads to additional register usage, so this optimization is completely disabled there. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10872 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10873 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14075 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	3054192a08	intel/dev: add state cache perf fix support xe detection Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	f43caa77c2	drm-uapi: Sync xe_drm.h Sync with: commit d93f8ea0e5ad41d661496d205fac3e2fbd9358c0 Merge: 02e778f12359 9876394f64a7 Author: Dave Airlie <airlied@redhat.com> Date: Tue Mar 17 11:27:01 2026 +1000 Merge tag 'drm-intel-next-2026-03-16' of https://gitlab.freedesktop.org/drm/i915/kernel into drm-next Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Sagar Ghuge	5391e37b6b	intel/genxml: Add new State Cache Perf Fix Disabled field This patch adds new field to COMMON_SLICE_CHICKEN3 register. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	adf18761f8	anv: rework color_aux operation tracking The current tracking seems to have issues related to MCS ambiguate that are currently hidden by the fact that we're inserting pb-stall+RT-flush on BTI changes, which we're going to remove in the next commits. The issues appear to be related to a missing pb-stall+RT-flush between MCS ambiguate and fast-clear causing failures on the following tests once BTP+BTI RCC caching is enabled : dEQP-VK.pipeline..multisample.misc.multi* dEQP-VK.pipeline..framebuffer_attachment.diff_attachments_2d_32x32_39x41_ms dEQP-VK.pipeline..framebuffer_attachment.diff_attachments_2d_32x32_48x48_ms Here we rework the tracking with a new enum to track 3 classes of operations. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	ab10ee1dd4	anv: document more stalling Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	dc79d6b13a	anv: merge null surface state packing with previous attachments Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	d1eed2239d	anv: batch rendering initialization commands Instead of : foreach color attachment transition layout fast clear slow clear do this : foreach color attachment transition layout foreach color attachment fast clear foreach color attachment slow clear Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	268c7f2a44	anv: rename variables in CmdBeginRendering Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	bbcb7c7838	anv: move depth/stencil BeginRendering handling prior to color When rendering only has depth/stencil, we need to look at the depth/stencil view size to generate a dummy null color attachments. So do that first, so we don't have to iterate color attachments once more with the final size. This change also has the nice impact of removing a BTI change flush due to the sequence moving from : - before blorp BTI-flush - color fast-clear - after blorp BTI-flush - depth fast-clear - change RT due to shader outputs (BTI-flush) - draw call to : - depth fast-clear - before blorp BTI-flush - color fast-clear - combined after blorp BTI-flush (pending) - change RT due to shader outputs (BTI-flush, combined with above) - draw call Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	7be8af1dad	anv: deal with Wa 14024015672 on the blorp path This is going to bite us a lot more when RCC BTP+BTI is enabled. In particular this test will hang pretty reliably on LNL : dEQP-VK.renderpasses.dynamic_rendering.primary_cmd_buff.suballocation.multisample_resolve.layers_3.r32g32_sfloat.samples_4_baseLayer1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f66ff97d58` ("drirc/anv: implement steps to disable RHWO for Wa_14024015672") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	2160e751e3	vulkan/runtime: fix incorrect entrypoint call for CmdCopyImageToBuffer2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `a8e49be9d9` ("vulkan/runtime: add implementation of older entrypoints using KHR_DAC") Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40587>	2026-03-24 17:47:01 +00:00
Lionel Landwerlin	1cbb8c82b7	vulkan/runtime: fix missing copy image layout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `a8e49be9d9` ("vulkan/runtime: add implementation of older entrypoints using KHR_DAC") Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40587>	2026-03-24 17:47:01 +00:00
Collabora's Gfx CI Team	a83a88df91	Uprev VVL to adfdda5b665f59aae31acf5c12c73e64a8f89553 `f020266ade...adfdda5b66` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40209>	2026-03-24 17:13:54 +00:00
Juan A. Suarez Romero	ea9e64e037	broadcom/ci: update expected results Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40605>	2026-03-24 16:47:13 +00:00
Kenneth Graunke	204af7e09f	intel/nir: Replace tg4 with txl/txb/tex when splitting texture residency textureGather() returns the four taps that would have been filtered together to produce the value that ordinary texturing operations would return. As such, it should access the same data, so we can use either interchangeably when we're only checking for residency and not returning the actual data. This allows us to mask out some unneeded registers. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:29 +00:00
Kenneth Graunke	605ef577b3	intel/nir: Generalize lower_tex_compare to split_tex_residency This splits a single texture-with-residency operation into two halves, one which returns texture data, and another which queries residency. We're currently using this only for a shadow sampling workaround, but the technique is more broadly applicable, if we ever wanted. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:29 +00:00
Kenneth Graunke	dc760104ba	intel/nir: Set new image intrinsic parameters via builder helpers A bit less code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:28 +00:00
Kenneth Graunke	9d07e85287	intel/nir: Use txf builder in intel_nir_lower_sparse Newer helpers make NIR easier to write. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:28 +00:00
Kenneth Graunke	0bbb48afb4	nir: Add is_sparse flag to texture builders This sets the is_sparse flag on the resulting nir_tex_instr and the resulting def to be one component larger. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:27 +00:00
Valentine Burley	17d38c9668	zink/ci: Move zink-tu-a618 to sc7180-trogdor-kingoftown The sc7180-trogdor-lazor-limozeen devices are having issues, so move the job to a different device with available capacity. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40566>	2026-03-24 15:22:12 +00:00
Valentine Burley	3fa7f73a83	ci: Enable ZSTD support for ZRAM in the kernel The only functional change is that the CONFIG_ZRAM_BACKEND_ZSTD and CONFIG_ZRAM_DEF_COMP_ZSTD options are enabled. Suggested by @anholt. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40566>	2026-03-24 15:22:12 +00:00
Jose Maria Casanova Crespo	89229f08bb	broadcom/common: fix V3D 7.1 TFU ICFG IFORMAT values The V3D 7.1 TFU ICFG register restructured the IFORMAT field to 3 bits (25:23) vs 4 bits on V3D 4.2. The defines were still using the V3D 4.2 encoding (11-15) which overflows the 3-bit field. Fix values to the correct 3-7 range. This was working by accident because the overflow bits land in the SVTWID field, which is not used for the affected tiling formats. Also rename SAND_128 to SAND since V3D 7.1 has a single SAND input format; the tile width is now controlled by SVTWID. Fixes: `146ceadcf4` ("v3dv: add support for TFU jobs in v71") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40540>	2026-03-24 14:55:50 +00:00
Yiwei Zhang	89b36fe180	util/list: fix formatting This trivial change is to improve readability of this header: 1. replace random tabs with spaces 2. use 3-space indent consistently across the header 3. minor clang-format fixes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40523>	2026-03-24 12:58:43 +00:00
Valentine Burley	acd7337b31	ci: Run Intel shader-db on Lunar Lake and Panther Lake Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38734>	2026-03-24 11:11:59 +00:00
Valentine Burley	9dd0f19198	ci: Drop duplicate Intel shader-db run Skylake is the default device for the Intel shim, and it's already included in the four Intel families listed below. Fixes: `183d57aa9e` ("ci: Run intel shader-db on Haswell, Broadwell, and Meteorlake") Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38734>	2026-03-24 11:11:59 +00:00
Eric Engestrom	8a20cbbf24	ci: vmware farm is offline, stop using it Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40597>	2026-03-24 10:02:41 +00:00
Tapani Pälli	735ad7cefb	anv: add required barrier for Wa_14026570320 Ensure RT is not processing rays while requesting state cache invalidate by making sure compute is done first. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13830 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40388>	2026-03-24 09:34:29 +00:00
Tapani Pälli	1cce7c79f0	anv: remove barrier special handling for RT_BTI_CHANGE This has been dead code since commit `4b2b824112`. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40388>	2026-03-24 09:34:29 +00:00
Georg Lehmann	17a9ee7152	aco/optimizer: apply dpp to v_dot before RA for gfx10.3 This is a bit unusual, as we otherwise only use the VOP2 codesize optimization opcodes in the register allocator. But unless we change the scheduler to not split v_mov_b32_dpp and v_dot, we have no other choice. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40510>	2026-03-24 09:05:40 +00:00
Samuel Pitoiset	62f1268d78	radv/ci: remove skipping mesh shader tests for NAVI31 RADV_DEBUG=nomeshshader is still the default for NAVI31, so these tests are already skipped. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40580>	2026-03-24 08:54:52 +01:00
Samuel Pitoiset	d9a0a6b1a3	radv/ci: stop skipping some memory related tests on POLARIS10 These no longer timeout. And it's a post-merge job, it shouldn't matter if the job is slightly slower anyways. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40580>	2026-03-24 08:54:52 +01:00
Samuel Pitoiset	b6e3658d8b	radv/ci: stop skipping ASTC tests with ANGLE+STONEY RADV supports ASTC emulation. It seems broken to some extent, but it's better to run the tests and mark them as expected failures anyway. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40580>	2026-03-24 08:54:51 +01:00
Icenowy Zheng	7fa0add77d	pvr: re-indent pseudocode for DDMADT behavior In pvr_pipeline_pds.c, there's a pseudocode snippet describing the behavior of the DDMADT PDS instruction, which seems to be copied from some internal document about PDS behavior. However, the pseudocode isn't properly indented; in particular, some brackets are misaligned. This blocks fluent reading of the pseudocode and may even mislead the reader. Re-indent the pseudocode using rules similar to those for C code in the driver. Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Acked-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40533>	2026-03-24 14:11:35 +08:00
Tapani Pälli	c75256b2ab	intel/compiler: move validation assert after brw_shader_debug_log When validation fails we print instructions to use INTEL_DEBUG=shaders but that will not help if we assert before dumping shader debug log. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40529>	2026-03-24 04:54:31 +00:00
Yiwei Zhang	8351c6070d	vulkan/anv: use vk_device_get_timestamp and drop vk_clock_gettime vk_clock_gettime hasn't been used by other implementations ever since venus and kk migrated over to the common implementation. It'd be better to drop that helper (or move it into anv) because it's not OS agnostic compared to the more comprehensive vk_device_get_timestamp. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40582>	2026-03-24 04:08:39 +00:00
anonymix007	8982056fa8	vulkan/runtime: Implement VK_TIME_DOMAIN_QUERY_PERFORMANCE_COUNTER_KHR This change adds win32 VK_TIME_DOMAIN_QUERY_PERFORMANCE_COUNTER_KHR support to vk_device_get_timestamp. Meanwhile, vk_clock_gettime is left untouched preparing for deprecation (anv is the only user). The latter also only has the host clock part and doesn't handle error cases in a robust manner. v2 (zzyiwei): - vk_device_get_timestamp updates - use DETECT_OS_WINDOWS - add commit messages Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40582>	2026-03-24 04:08:39 +00:00
Ian Romanick	b5e023777c	brw: Change the flags written by some CMP One frustrating thing about the CMP and CMPN instructions is that they always write the flags. Sometimes, however, it is desirable to generate the comparison result without modifying the flags. This would, theoretically, reduce false dependencies that restrict the scheduler's ability to rearrange code, create more opportunities for cmod propagation, save a kitten from a tree, and make a rainbow. Consider this sequence: cmp.ge.f0.0(8) g103<1>F g101<8,8,1>F g39<8,8,1>F cmp.nz.f0.0(8) null<1>D g81<8,8,1>D 0D (+f0.0) if(8) JIP: LABEL19 UIP: LABEL19 It would be advantageous to put the first CMP between the second CMP and the IF, but this cannot be done since the IF depends on the flags generated by the second CMP. This pass enables this rescheduling by changing the first CMP to write to a different flags register. cmp.ge.f1.0(8) g103<1>F g101<8,8,1>F g39<8,8,1>F cmp.nz.f0.0(8) null<1>D g81<8,8,1>D 0D (+f0.0) if(8) JIP: LABEL19 UIP: LABEL19 Sometimes this is also possible by using a different instruction. For example, consider cmp.l.f0.0(8) g103<1>D g101<8,8,1>D 0D This produces 0xffffffff when g101 is negative and zero otherwise. This instruction, which does not modify the flag, also produces these results: asr(8) g103<1>D g101<8,8,1>D 31D Gfx9 platforms take a hit on instructions due to the instruction added at the end of short shaders by brw_workaround_source_arf_before_eot. shader-db: Lunar Lake, Meteor Lake, DG2, Tiger Lake, and Ice Lake had similar results.
(Lunar Lake shown) total instructions in shared programs: 17089451 -> 17088766 (<.01%) instructions in affected programs: 766613 -> 765928 (-0.09%) helped: 653 / HURT: 0 total cycles in shared programs: 888832986 -> 887873068 (-0.11%) cycles in affected programs: 549441852 -> 548481934 (-0.17%) helped: 10474 / HURT: 130 LOST: 9 GAINED: 0 Skylake total instructions in shared programs: 19037976 -> 19049719 (0.06%) instructions in affected programs: 3979914 -> 3991657 (0.30%) helped: 503 / HURT: 12303 total cycles in shared programs: 867918242 -> 866930801 (-0.11%) cycles in affected programs: 512773919 -> 511786478 (-0.19%) helped: 13858 / HURT: 66 LOST: 32 GAINED: 0 fossil-db: Lunar Lake Totals: Instrs: 925023504 -> 924950382 (-0.01%); split: -0.01%, +0.00% Cycle count: 106348432916 -> 106116809009 (-0.22%); split: -0.22%, +0.00% Spill count: 3423988 -> 3423930 (-0.00%); split: -0.00%, +0.00% Fill count: 4877087 -> 4876960 (-0.00%); split: -0.01%, +0.00% Max dispatch width: 49087552 -> 49078448 (-0.02%); split: +0.00%, -0.02% Totals from 1099332 (54.44% of 2019443) affected shaders: Instrs: 742670473 -> 742597351 (-0.01%); split: -0.01%, +0.00% Cycle count: 100455549635 -> 100223925728 (-0.23%); split: -0.23%, +0.00% Spill count: 3384366 -> 3384308 (-0.00%); split: -0.00%, +0.00% Fill count: 4837434 -> 4837307 (-0.00%); split: -0.01%, +0.00% Max dispatch width: 26725152 -> 26716048 (-0.03%); split: +0.00%, -0.03% Meteor Lake and DG2 had similar results. 
(Meteor Lake shown) Totals: Instrs: 997603774 -> 997529238 (-0.01%); split: -0.01%, +0.00% Cycle count: 93904012762 -> 93646730006 (-0.27%); split: -0.28%, +0.00% Spill count: 3710155 -> 3710125 (-0.00%); split: -0.00%, +0.00% Fill count: 5032908 -> 5032819 (-0.00%); split: -0.01%, +0.00% Max dispatch width: 37929640 -> 37811560 (-0.31%) Totals from 1334920 (58.52% of 2281134) affected shaders: Instrs: 817377787 -> 817303251 (-0.01%); split: -0.01%, +0.00% Cycle count: 88468851658 -> 88211568902 (-0.29%); split: -0.29%, +0.00% Spill count: 3663353 -> 3663323 (-0.00%); split: -0.00%, +0.00% Fill count: 4991629 -> 4991540 (-0.00%); split: -0.01%, +0.00% Max dispatch width: 20245832 -> 20127752 (-0.58%) Tiger Lake and Ice Lake had similar results. (Tiger Lake shown) Totals: Instrs: 1013433769 -> 1013363273 (-0.01%); split: -0.01%, +0.00% Cycle count: 85766921182 -> 85509316620 (-0.30%); split: -0.31%, +0.00% Spill count: 3903923 -> 3903944 (+0.00%); split: -0.00%, +0.00% Fill count: 6801983 -> 6801948 (-0.00%); split: -0.00%, +0.00% Max dispatch width: 37896320 -> 37805320 (-0.24%); split: +0.00%, -0.24% Totals from 1333814 (58.54% of 2278396) affected shaders: Instrs: 830200531 -> 830130035 (-0.01%); split: -0.01%, +0.00% Cycle count: 80746184101 -> 80488579539 (-0.32%); split: -0.32%, +0.01% Spill count: 3855771 -> 3855792 (+0.00%); split: -0.00%, +0.00% Fill count: 6755513 -> 6755478 (-0.00%); split: -0.00%, +0.00% Max dispatch width: 20301456 -> 20210456 (-0.45%); split: +0.00%, -0.45% Skylake Totals: Instrs: 519389758 -> 519874108 (+0.09%); split: -0.00%, +0.10% Cycle count: 57932316132 -> 57789433956 (-0.25%); split: -0.25%, +0.00% Spill count: 636741 -> 636715 (-0.00%); split: -0.01%, +0.00% Fill count: 860470 -> 860357 (-0.01%); split: -0.02%, +0.00% Max dispatch width: 32527800 -> 32481792 (-0.14%); split: +0.00%, -0.14% Totals from 1080380 (62.25% of 1735462) affected shaders: Instrs: 411976399 -> 412460749 (+0.12%); split: -0.00%, +0.12% Cycle count: 
54291447615 -> 54148565439 (-0.26%); split: -0.27%, +0.00% Spill count: 602993 -> 602967 (-0.00%); split: -0.01%, +0.00% Fill count: 734459 -> 734346 (-0.02%); split: -0.02%, +0.00% Max dispatch width: 18626096 -> 18580088 (-0.25%); split: +0.00%, -0.25% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38978>	2026-03-24 01:31:26 +00:00
Ian Romanick	31de96d321	brw/lower_regioning: Allow integer conversions in SEL The Bspec says that SEL sources and destination can be any mix of B, W, and *D. We should allow those. Specifically, without this change, this instruction sel.sat.l(8) v548:UD, v899:D, 255d gets unnecessarily split into two instructions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38978>	2026-03-24 01:31:26 +00:00
Ian Romanick	dff1e8ae28	brw: Handle scalars and swizzles correctly in is_const_zero v2: Massive simplification based on feedback from Ken. Fixes: `96cde9cc01` ("intel/fs: Emit better code for bfi(..., 0)") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38978>	2026-03-24 01:31:25 +00:00
Ian Romanick	985ace332b	brw/algebraic: Allow mixed types in saturate constant folding Prevents assertion failures in func.shader-ballot.basic.q0 and other tests starting with "nir/algebraic: Optimize some b2f of integer comparison". Vector immediates, bfloat, and 8-bit floats are still not supported. v2: Almost complete re-write based on suggestions from Ken. v3: Don't retype() on a brw_imm_f value. Fixes: `f8e54d02f7` ("intel/compiler: Relax mixed type restriction for saturating immediates") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38978>	2026-03-24 01:31:25 +00:00
Marek Olšák	dee99b38c5	radeonsi: fix an assertion failure for sampler descriptor loads with LLVM Reviewed-by: Pierre-Eric Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40589>	2026-03-24 01:05:29 +00:00
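
The message for b5e023777c above notes that `cmp.l` against zero and `asr` by 31 produce the same dword: 0xffffffff for a negative source and zero otherwise. The following is a minimal C sketch of that arithmetic identity only, not Mesa code; the helper names `cmp_lt_zero_mask`, `asr31`, and `check_equivalence` are hypothetical and introduced here for illustration. Because right-shifting a negative signed value is implementation-defined in C, the arithmetic shift is modeled by replicating the sign bit by hand.

```c
#include <assert.h>
#include <stdint.h>

/* Models the result of "cmp.l dst, src, 0" on a signed dword:
 * all ones when src is negative, zero otherwise. (Hypothetical helper.) */
static uint32_t cmp_lt_zero_mask(int32_t x)
{
   return x < 0 ? UINT32_C(0xffffffff) : UINT32_C(0);
}

/* Models "asr dst, src, 31": an arithmetic shift by 31 fills the whole
 * dword with the sign bit. Built portably from an unsigned shift, since
 * ">>" on a negative int32_t is implementation-defined in C. */
static uint32_t asr31(int32_t x)
{
   uint32_t sign = (uint32_t)x >> 31; /* 0 or 1 */
   return UINT32_C(0) - sign;         /* 0 or 0xffffffff (wraps mod 2^32) */
}

/* Checks that both models agree on a handful of boundary values. */
static void check_equivalence(void)
{
   static const int32_t samples[] = {
      INT32_MIN, -2, -1, 0, 1, 2, INT32_MAX
   };

   for (unsigned i = 0; i < sizeof(samples) / sizeof(samples[0]); i++)
      assert(cmp_lt_zero_mask(samples[i]) == asr31(samples[i]));
}
```

Calling `check_equivalence()` from a test harness exercises the boundary cases. The sketch shows only why the two results match; the flag-register behaviour that motivates the commit is not modeled.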

220683 commits