fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 02:58:06 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	9d059a60f5	nir: introduce nir_descriptor_type for Vulkan like descriptors This removes a Vulkan dependency in NIR core. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40670>	2026-03-31 07:16:20 +00:00
Samuel Pitoiset	ee7c6e3752	treewide: cleanup non-existent descriptor types from nir_intrinsic_desc_type() The only possible values are: - VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER - VK_DESCRIPTOR_TYPE_STORAGE_BUFFER - VK_DESCRIPTOR_TYPE_ACCELERATION_STRUCTURE_KHR Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40670>	2026-03-31 07:16:20 +00:00
José Roberto de Souza	889cf429ee	anv: Fix placed address mmap with slab bo The current implementation adjusts the mmap() parameters to make it work, but that causes us to map more addresses than the application asked for, which could cause us to overwrite other application mmaps(). So here we export the slab parent as a dma-buf, then do the mmap with almost no adjustment; the only change is the offset, which needs to include the difference between the bo address and the slab bo parent address. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40441>	2026-03-30 13:59:27 +00:00
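The offset adjustment described in commit 889cf429ee can be written as one line of arithmetic. This is a minimal sketch of that arithmetic only; the function and parameter names are hypothetical and are not taken from the anv source.

```python
def slab_mmap_offset(requested_offset, bo_address, slab_parent_address):
    """Offset to pass to mmap() on the exported slab parent (hypothetical helper).

    The sub-allocated bo lives inside its slab parent, so the offset must
    include the distance between the bo address and the parent address.
    """
    delta = bo_address - slab_parent_address
    if delta < 0:
        # Assumption: a slab child never starts before its parent.
        raise ValueError("bo address lies before its slab parent")
    return requested_offset + delta


# Example: the bo starts 0x10000 bytes into its parent, and the
# application asks to map from offset 0x1000 within the bo.
example_offset = slab_mmap_offset(0x1000, 0x20000, 0x10000)
```

The length and the requested address stay as the application supplied them, which matches the commit's point that nothing beyond the offset is adjusted.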
Lionel Landwerlin	fdc1fae740	isl: speedup buffer fills by dropping swizzle programming In vkoverhead ubo/ssbo tests, this is about 15/20% improvement. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40697>	2026-03-30 12:05:28 +00:00
Tapani Pälli	3160fbb6ec	anv: use mi_set_autostrip_state for autostrip control Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40344>	2026-03-30 11:02:27 +00:00
Tapani Pälli	bafa1120ce	genxml/mi: add additional bit to FF_MODE and autostrip helper This provides bit and common code to control autostrip state. Requirement for this is coming from Wa_14026781792. We are writing register for Wa_14024997852 and since this register is nonmaskable the bit needs to be written always. Helper takes care to touch only required bits. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40344>	2026-03-30 11:02:27 +00:00
Faith Ekstrand	4d56fa661f	vulkan: Rename some VK_EXT_descriptor_buffer properties Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40649>	2026-03-30 06:51:26 +00:00
Kenneth Graunke	ca3cabd2f8	brw: Use nir_texop_resinfo_intel for query_levels and txs This eliminates the need to special case query_levels. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40451>	2026-03-29 12:53:10 +00:00
Natalie Vock	6f80027447	vulkan: Rename {encode,update}_bind_pipeline to {encode,update}_prepare Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39985>	2026-03-28 16:12:09 +01:00
Lionel Landwerlin	fa523aedd0	brw: fence SLM writes between workgroups On LSC platforms the SLM writes are unfenced between workgroups. This means a workgroup W1 finishing might have uncompleted SLM writes. Another workgroup W2 dispatched after W1 which gets allocated an overlapping SLM location might have writes that race with the previous W1 operations. The solution to this is to fence all write operations (store & atomics) of a workgroup before ending the threads. We do this by emitting a single SLM fence either at the end of the shader or, if there is only a single unfenced write, at the end of that block. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13924 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40430>	2026-03-26 22:38:55 +00:00
Georg Lehmann	eef0fa22e0	brw: preserve fp_math_ctrl when lowering cmat alu Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40630>	2026-03-26 13:15:50 +00:00
Casey Bowman	db34a92c48	intel/tools: Add xe3p format for intel_monitor The kernel uses an updated buffer format for xe3p gpus when EU stall sampling, so this updates intel_monitor to use the correct formatting, leaving room for any future formatting updates. This also addresses an issue with not packing the formatted structure with the correct macro, which led to incorrect offsets being used for parsing the buffer. BSpec: 79847 v2: Add BSpec reference number, suggested by Lionel Signed-off-by: Casey Bowman <casey.g.bowman@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40622>	2026-03-26 07:31:09 +00:00
Yonggang Luo	d067d6e163	vulkan/anv:Remove unused anv_clock_gettime Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40595>	2026-03-25 09:23:33 +00:00
Faith Ekstrand	3ea2e51c8b	treewide: Enable lowering of primitive ID in a bunch of Vulkan drivers Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40512>	2026-03-25 03:11:56 +00:00
Michael Cheng	f002b34576	hasvk: enable perf warning logging in release builds Enable perf warning in release builds for hasvk. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40551>	2026-03-24 21:42:33 +00:00
Michael Cheng	ebe94d4903	anv: enable perf warning logging in release builds Call process_intel_debug_variable() early in anv_CreateInstance() so the intel_debug bitset is populated, then set enable_debug_logging when INTEL_DEBUG=perf is active. This makes anv_perf_warn() messages visible in non-debug builds. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40551>	2026-03-24 21:42:33 +00:00
Tim Van Patten	1e04e7ee74	anv: Enable Vulkan 1.4 for SDK 37+ Enable Vulkan 1.4 for SDK 37+ to satisfy the VRA17 (Vulkan Requirement for Android 17). Signed-off-by: Tim Van Patten <timvp@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40593>	2026-03-24 21:15:45 +00:00
Sagar Ghuge	af2d51eafa	anv: enable BTP+BTI RCC keying for some workloads We can drop RT flush and PS Scoreboard stall if state cache perf fix disabled is set to 1. If the bit is set, RCC uses the sum of Binding Table Pointer and Binding Table Index as tag in state cache instead of just Binding Table Index. On DX12 this is a performance win on all workloads we've tested. On DX11 there are a number of performance regressions. We think this is due to the fact that to avoid trashing the RCC, we need to remove all but render targets from the binding table, meaning all shader resource accesses have to go through the bindless HW heap. This leads to additional register usage due to the need to push the base offset of descriptor sets. Improvement in the compiler would likely mitigate this. This change introduces a DRIRC key we only turn on for DX12. Also platforms prior to DG2/LSC have a really small bindless heap that leads to additional register usage, so this optimization is completely disabled there. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10872 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10873 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14075 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
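The tag change described in commit af2d51eafa can be illustrated with a toy model. This is only a sketch of the keying idea: the function names and the addresses are hypothetical, and nothing here reflects the real RCC tag width or hardware behaviour.

```python
def tag_bti_only(btp, bti):
    """Toy tag that ignores the binding table pointer (index-only keying)."""
    return bti


def tag_btp_plus_bti(btp, bti):
    """Toy tag using the sum of binding table pointer and index."""
    return btp + bti


# Two binding tables at different (made-up) addresses, same index in each.
table_a, table_b = 0x1000, 0x2000
index = 0

collides_bti_only = tag_bti_only(table_a, index) == tag_bti_only(table_b, index)
collides_btp_bti = tag_btp_plus_bti(table_a, index) == tag_btp_plus_bti(table_b, index)
```

In this model, index-only keying cannot tell entry 0 of one binding table from entry 0 of another, while the summed tag can. That is presumably why switching binding tables no longer needs the RT flush and PS scoreboard stall once the bit is set.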
Lionel Landwerlin	3054192a08	intel/dev: add state cache perf fix support xe detection Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Sagar Ghuge	5391e37b6b	intel/genxml: Add new State Cache Perf Fix Disabled field This patch adds new field to COMMON_SLICE_CHICKEN3 register. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	adf18761f8	anv: rework color_aux operation tracking The current tracking seems to have issues related to MCS ambiguate that are currently hidden by the fact that we're inserting pb-stall+RT-flush on BTI changes, which we're going to remove in the next commits. The issues appear to be related to a missing pb-stall+RT-flush between MCS ambiguate and fast-clear causing failures on the following tests once BTP+BTI RCC caching is enabled : dEQP-VK.pipeline..multisample.misc.multi* dEQP-VK.pipeline..framebuffer_attachment.diff_attachments_2d_32x32_39x41_ms dEQP-VK.pipeline..framebuffer_attachment.diff_attachments_2d_32x32_48x48_ms Here we rework the tracking with a new enum to track 3 classes of operations. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	ab10ee1dd4	anv: document more stalling Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	dc79d6b13a	anv: merge null surface state packing with previous attachments Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	d1eed2239d	anv: batch rendering initialization commands Instead of : foreach color attachment transition layout fast clear slow clear do this : foreach color attachment transition layout foreach color attachment fast clear foreach color attachment slow clear Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
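The loop restructuring described in commit d1eed2239d can be sketched as two orderings of the same operations. This is an illustrative model only; the function names are hypothetical and the strings stand in for the real command emission.

```python
PHASES = ("transition layout", "fast clear", "slow clear")


def per_attachment_order(attachments):
    """Old shape: one loop, all three phases per color attachment."""
    return [(phase, a) for a in attachments for phase in PHASES]


def batched_order(attachments):
    """New shape: one loop per phase, each covering every color attachment."""
    return [(phase, a) for phase in PHASES for a in attachments]


def phase_switches(ops):
    """Count how often consecutive operations belong to different phases."""
    return sum(1 for prev, cur in zip(ops, ops[1:]) if prev[0] != cur[0])


attachments = ["color0", "color1"]
old_ops = per_attachment_order(attachments)
new_ops = batched_order(attachments)
```

Both orderings contain the same operations. The batched one groups operations of the same kind together, so `phase_switches` drops from 5 to 2 for two attachments.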
Lionel Landwerlin	268c7f2a44	anv: rename variables in CmdBeginRendering Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	bbcb7c7838	anv: move depth/stencil BeginRendering handling prior to color When rendering only has depth/stencil, we need to look at the depth/stencil view size to generate a dummy null color attachments. So do that first, so we don't have to iterate color attachments once more with the final size. This change also has the nice impact of removing a BTI change flush due to the sequence moving from : - before blorp BTI-flush - color fast-clear - after blorp BTI-flush - depth fast-clear - change RT due to shader outputs (BTI-flush) - draw call to : - depth fast-clear - before blorp BTI-flush - color fast-clear - combined after blorp BTI-flush (pending) - change RT due to shader outputs (BTI-flush, combined with above) - draw call Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Lionel Landwerlin	7be8af1dad	anv: deal with Wa 14024015672 on the blorp path This is going to bite us a lot more when RCC BTP+BTI is enabled. In particular this test will hang pretty reliably on LNL : dEQP-VK.renderpasses.dynamic_rendering.primary_cmd_buff.suballocation.multisample_resolve.layers_3.r32g32_sfloat.samples_4_baseLayer1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f66ff97d58` ("drirc/anv: implement steps to disable RHWO for Wa_14024015672") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39982>	2026-03-24 18:17:42 +00:00
Kenneth Graunke	204af7e09f	intel/nir: Replace tg4 with txl/txb/tex when splitting texture residency textureGather() returns the four taps that would have been filtered together to produce the value that ordinary texturing operations would return. As such, it should access the same data, so we can use either interchangeably when we're only checking for residency and not returning the actual data. This allows us to mask out some unneeded registers. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:29 +00:00
Kenneth Graunke	605ef577b3	intel/nir: Generalize lower_tex_compare to split_tex_residency This splits a single texture-with-residency operation into two halves, one which returns texture data, and another which queries residency. We're currently using this only for a shadow sampling workaround, but the technique is more broadly applicable, if we ever wanted. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:29 +00:00
Kenneth Graunke	dc760104ba	intel/nir: Set new image intrinsic parameters via builder helpers A bit less code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:28 +00:00
Kenneth Graunke	9d07e85287	intel/nir: Use txf builder in intel_nir_lower_sparse Newer helpers make NIR easier to write. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40590>	2026-03-24 16:06:28 +00:00
Tapani Pälli	735ad7cefb	anv: add required barrier for Wa_14026570320 Ensure RT is not processing rays while requesting state cache invalidate by making sure compute is done first. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13830 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40388>	2026-03-24 09:34:29 +00:00
Tapani Pälli	1cce7c79f0	anv: remove barrier special handling for RT_BTI_CHANGE This has been dead code since commit `4b2b824112`. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40388>	2026-03-24 09:34:29 +00:00
Tapani Pälli	c75256b2ab	intel/compiler: move validation assert after brw_shader_debug_log When validation fails we print instructions to use INTEL_DEBUG=shaders but that will not help if we assert before dumping shader debug log. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40529>	2026-03-24 04:54:31 +00:00
Yiwei Zhang	8351c6070d	vulkan/anv: use vk_device_get_timestamp and drop vk_clock_gettime vk_clock_gettime hasn't been used by other implementations ever since venus and kk migrated over to the common implementation. It'd be better to drop that helper (or move it into anv) because it's not OS agnostic compared to the more comprehensive vk_device_get_timestamp. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40582>	2026-03-24 04:08:39 +00:00
Ian Romanick	b5e023777c	brw: Change the flags written by some CMP One frustrating thing about the CMP and CMPN instructions is that they always write the flags. Sometimes, however, it is desirable to generate the comparison result without modifying the flags. This would, theoretically, reduce false dependencies that restrict the scheduler's ability to rearrange code, create more opportunities for cmod propagation, save a kitten from a tree, and make a rainbow. Consider this sequence: cmp.ge.f0.0(8) g103<1>F g101<8,8,1>F g39<8,8,1>F cmp.nz.f0.0(8) null<1>D g81<8,8,1>D 0D (+f0.0) if(8) JIP: LABEL19 UIP: LABEL19 It would be advantageous to put the first CMP between the second CMP and the IF, but this cannot be done since the IF depends on the flags generated by the second CMP. This pass enables this rescheduling by changing the first CMP to write to a different flags register. cmp.ge.f1.0(8) g103<1>F g101<8,8,1>F g39<8,8,1>F cmp.nz.f0.0(8) null<1>D g81<8,8,1>D 0D (+f0.0) if(8) JIP: LABEL19 UIP: LABEL19 Sometimes this is also possible by using a different instruction. For example, consider cmp.l.f0.0(8) g103<1>D g101<8,8,1>D 0D This produces 0xffffffff when g101 is negative and zero otherwise. This instruction, which does not modify the flag, also produces these results: asr(8) g103<1>D g101<8,8,1>D 31D Gfx9 platforms take a hit on instructions due to the instruction added at the end of short shaders by brw_workaround_source_arf_before_eot. shader-db: Lunar Lake, Meteor Lake, DG2, Tiger Lake, and Ice Lake had similar results.
(Lunar Lake shown) total instructions in shared programs: 17089451 -> 17088766 (<.01%) instructions in affected programs: 766613 -> 765928 (-0.09%) helped: 653 / HURT: 0 total cycles in shared programs: 888832986 -> 887873068 (-0.11%) cycles in affected programs: 549441852 -> 548481934 (-0.17%) helped: 10474 / HURT: 130 LOST: 9 GAINED: 0 Skylake total instructions in shared programs: 19037976 -> 19049719 (0.06%) instructions in affected programs: 3979914 -> 3991657 (0.30%) helped: 503 / HURT: 12303 total cycles in shared programs: 867918242 -> 866930801 (-0.11%) cycles in affected programs: 512773919 -> 511786478 (-0.19%) helped: 13858 / HURT: 66 LOST: 32 GAINED: 0 fossil-db: Lunar Lake Totals: Instrs: 925023504 -> 924950382 (-0.01%); split: -0.01%, +0.00% Cycle count: 106348432916 -> 106116809009 (-0.22%); split: -0.22%, +0.00% Spill count: 3423988 -> 3423930 (-0.00%); split: -0.00%, +0.00% Fill count: 4877087 -> 4876960 (-0.00%); split: -0.01%, +0.00% Max dispatch width: 49087552 -> 49078448 (-0.02%); split: +0.00%, -0.02% Totals from 1099332 (54.44% of 2019443) affected shaders: Instrs: 742670473 -> 742597351 (-0.01%); split: -0.01%, +0.00% Cycle count: 100455549635 -> 100223925728 (-0.23%); split: -0.23%, +0.00% Spill count: 3384366 -> 3384308 (-0.00%); split: -0.00%, +0.00% Fill count: 4837434 -> 4837307 (-0.00%); split: -0.01%, +0.00% Max dispatch width: 26725152 -> 26716048 (-0.03%); split: +0.00%, -0.03% Meteor Lake and DG2 had similar results. 
(Meteor Lake shown) Totals: Instrs: 997603774 -> 997529238 (-0.01%); split: -0.01%, +0.00% Cycle count: 93904012762 -> 93646730006 (-0.27%); split: -0.28%, +0.00% Spill count: 3710155 -> 3710125 (-0.00%); split: -0.00%, +0.00% Fill count: 5032908 -> 5032819 (-0.00%); split: -0.01%, +0.00% Max dispatch width: 37929640 -> 37811560 (-0.31%) Totals from 1334920 (58.52% of 2281134) affected shaders: Instrs: 817377787 -> 817303251 (-0.01%); split: -0.01%, +0.00% Cycle count: 88468851658 -> 88211568902 (-0.29%); split: -0.29%, +0.00% Spill count: 3663353 -> 3663323 (-0.00%); split: -0.00%, +0.00% Fill count: 4991629 -> 4991540 (-0.00%); split: -0.01%, +0.00% Max dispatch width: 20245832 -> 20127752 (-0.58%) Tiger Lake and Ice Lake had similar results. (Tiger Lake shown) Totals: Instrs: 1013433769 -> 1013363273 (-0.01%); split: -0.01%, +0.00% Cycle count: 85766921182 -> 85509316620 (-0.30%); split: -0.31%, +0.00% Spill count: 3903923 -> 3903944 (+0.00%); split: -0.00%, +0.00% Fill count: 6801983 -> 6801948 (-0.00%); split: -0.00%, +0.00% Max dispatch width: 37896320 -> 37805320 (-0.24%); split: +0.00%, -0.24% Totals from 1333814 (58.54% of 2278396) affected shaders: Instrs: 830200531 -> 830130035 (-0.01%); split: -0.01%, +0.00% Cycle count: 80746184101 -> 80488579539 (-0.32%); split: -0.32%, +0.01% Spill count: 3855771 -> 3855792 (+0.00%); split: -0.00%, +0.00% Fill count: 6755513 -> 6755478 (-0.00%); split: -0.00%, +0.00% Max dispatch width: 20301456 -> 20210456 (-0.45%); split: +0.00%, -0.45% Skylake Totals: Instrs: 519389758 -> 519874108 (+0.09%); split: -0.00%, +0.10% Cycle count: 57932316132 -> 57789433956 (-0.25%); split: -0.25%, +0.00% Spill count: 636741 -> 636715 (-0.00%); split: -0.01%, +0.00% Fill count: 860470 -> 860357 (-0.01%); split: -0.02%, +0.00% Max dispatch width: 32527800 -> 32481792 (-0.14%); split: +0.00%, -0.14% Totals from 1080380 (62.25% of 1735462) affected shaders: Instrs: 411976399 -> 412460749 (+0.12%); split: -0.00%, +0.12% Cycle count: 
54291447615 -> 54148565439 (-0.26%); split: -0.27%, +0.00% Spill count: 602993 -> 602967 (-0.00%); split: -0.01%, +0.00% Fill count: 734459 -> 734346 (-0.02%); split: -0.02%, +0.00% Max dispatch width: 18626096 -> 18580088 (-0.25%); split: +0.00%, -0.25% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38978>	2026-03-24 01:31:26 +00:00
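The `cmp.l` / `asr` equivalence stated in commit b5e023777c can be checked with a small model of 32-bit signed lanes. This is a minimal sketch: the function names are hypothetical and are not part of the brw compiler, and only the single-lane result value is modelled, not the flag register.

```python
MASK32 = 0xFFFFFFFF


def to_signed32(x):
    """Interpret the low 32 bits of x as a signed (D-typed) value."""
    x &= MASK32
    return x - (1 << 32) if x & 0x80000000 else x


def cmp_l_zero(src):
    """Result value of cmp.l dst:D src:D 0D: all ones if src < 0, else zero."""
    return MASK32 if to_signed32(src) < 0 else 0


def asr_31(src):
    """Result value of asr dst:D src:D 31D: arithmetic shift right by 31."""
    return (to_signed32(src) >> 31) & MASK32


samples = [0, 1, -1, 0x7FFFFFFF, -0x80000000, 12345, -12345]
all_equal = all(cmp_l_zero(v) == asr_31(v) for v in samples)
```

The arithmetic shift replicates the sign bit into every position, so negative inputs give 0xffffffff and non-negative inputs give zero, matching the comparison result without a flag write.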
Ian Romanick	31de96d321	brw/lower_regioning: Allow integer conversions in SEL The Bspec says that SEL sources and destination can be any mix of B, W, and *D. We should allow those. Specifically, without this change, this instruction sel.sat.l(8) v548:UD, v899:D, 255d gets unnecessarily split into two instructions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38978>	2026-03-24 01:31:26 +00:00
Ian Romanick	dff1e8ae28	brw: Handle scalars and swizzles correctly in is_const_zero v2: Massive simplification based on feedback from Ken. Fixes: `96cde9cc01` ("intel/fs: Emit better code for bfi(..., 0)") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38978>	2026-03-24 01:31:25 +00:00
Ian Romanick	985ace332b	brw/algebraic: Allow mixed types in saturate constant folding Prevents assertion failures in func.shader-ballot.basic.q0 and other tests starting with "nir/algebraic: Optimize some b2f of integer comparison". Vector immediates, bfloat, and 8-bit floats are still not supported. v2: Almost complete re-write based on suggestions from Ken. v3: Don't retype() on a brw_imm_f value. Fixes: `f8e54d02f7` ("intel/compiler: Relax mixed type restriction for saturating immediates") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38978>	2026-03-24 01:31:25 +00:00
José Roberto de Souza	c0f1689e11	anv: Fix invalid resource barrier signal stage Simulator is crashing when receiving GPGPU + Pixel as resource barrier signal stage, which according to the spec is invalid. So here we replace the pixel stage with color, over-synchronizing a bit but keeping it functional. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14641 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40516>	2026-03-23 16:30:39 +00:00
José Roberto de Souza	347e82c718	anv: Always have a valid Resource barrier::Wait stage set Simulator hangs if a resource barrier has wait stage = None. The HW seems to ignore it, but something bad could be happening internally. So here I'm making sure the wait stage is set to TOP when it is None. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40516>	2026-03-23 16:30:39 +00:00
Lionel Landwerlin	3a503b4898	anv: limit aux disabling on concurrent images to pre-Xe2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40141>	2026-03-23 15:13:02 +00:00
Marek Olšák	fa5175023b	Final rename of sha1 names to blake3 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:28 +00:00
Marek Olšák	ae9ea27e0d	Rename _sha1 names to _blake3 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:28 +00:00
Marek Olšák	102d41799b	Rename more sha and sha1 names to blake3 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:28 +00:00
Marek Olšák	d4831aaf5f	Rename sha1_* and sha_* names to blake3_* Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:28 +00:00
Marek Olšák	0877be34f5	Rename SHA1_* names to BLAKE3_* Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:28 +00:00
Marek Olšák	c0ac992a2a	Remove mesa-sha1.h Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:27 +00:00
Marek Olšák	53c64973e8	Inline _mesa_sha1_compute/format, remove the other unused ones _mesa_sha1_format has a few remaining uses, so it's moved to build_id.c, which is its last user. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:27 +00:00
Marek Olšák	699f9d7066	Inline _mesa_sha1_init/update/final functions Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:27 +00:00

15736 commits