fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 09:18:04 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	3237666fc4	radv: run the pass to fixup SMEM loads with NULL PRT pages Right after global access are lowered. fossils-db (NAVI21): Totals from 37734 (33.68% of 112041) affected shaders: MaxWaves: 898021 -> 897961 (-0.01%); split: +0.00%, -0.01% Instrs: 34145252 -> 34267942 (+0.36%); split: -0.00%, +0.36% CodeSize: 182360344 -> 182943952 (+0.32%); split: -0.00%, +0.32% VGPRs: 1796672 -> 1796816 (+0.01%); split: -0.00%, +0.01% SpillSGPRs: 13708 -> 13964 (+1.87%); split: -0.28%, +2.15% Latency: 442451029 -> 442827188 (+0.09%); split: -0.02%, +0.10% InvThroughput: 105259490 -> 105287803 (+0.03%); split: -0.01%, +0.03% VClause: 672269 -> 672252 (-0.00%); split: -0.12%, +0.12% SClause: 847133 -> 847677 (+0.06%); split: -0.35%, +0.41% Copies: 2974422 -> 2979443 (+0.17%); split: -0.35%, +0.52% Branches: 860896 -> 861639 (+0.09%); split: -0.00%, +0.09% PreSGPRs: 1677701 -> 1682387 (+0.28%); split: -0.01%, +0.29% VALU: 22386780 -> 22386984 (+0.00%); split: -0.01%, +0.01% SALU: 5282218 -> 5406460 (+2.35%); split: -0.01%, +2.36% fossils-db (POLARIS10): Totals from 15054 (21.74% of 69255) affected shaders: MaxWaves: 87688 -> 87689 (+0.00%); split: +0.01%, -0.00% Instrs: 12542117 -> 12596734 (+0.44%); split: -0.00%, +0.44% CodeSize: 65209280 -> 65458732 (+0.38%); split: -0.00%, +0.39% SGPRs: 1149639 -> 1149975 (+0.03%); split: -0.24%, +0.27% VGPRs: 749928 -> 749956 (+0.00%); split: -0.02%, +0.02% SpillSGPRs: 11139 -> 11413 (+2.46%); split: -0.29%, +2.75% Latency: 169204114 -> 169533989 (+0.19%); split: -0.01%, +0.21% InvThroughput: 88091947 -> 88185872 (+0.11%); split: -0.01%, +0.11% VClause: 280519 -> 280318 (-0.07%); split: -0.18%, +0.10% SClause: 343474 -> 344686 (+0.35%); split: -0.32%, +0.67% Copies: 1529440 -> 1530545 (+0.07%); split: -0.30%, +0.38% Branches: 286849 -> 286856 (+0.00%); split: -0.01%, +0.01% PreSGPRs: 661815 -> 663239 (+0.22%); split: -0.02%, +0.23% VALU: 8758472 -> 8759214 (+0.01%); split: -0.01%, +0.01% SALU: 1775129 -> 1829513 (+3.06%); split: -0.02%, +3.08% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38698>	2026-04-30 09:29:43 +00:00
Samuel Pitoiset	a4668733e5	ac/nir: add a pass to fixup SMEM loads with NULL PRT pages Only global/SSBO SMEM loads are considered because for UBOs the "LOW" VA will be set in descriptors. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38698>	2026-04-30 09:29:43 +00:00
Samuel Pitoiset	60b406e233	ac/gpu_info: query the PRT workaround control bit from libdrm libdrm splits the HIGH address space in two equal parts for GPUs that are affected by the SMEM loads with NULL PRT page. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38698>	2026-04-30 09:29:43 +00:00
Samuel Pitoiset	978605fd06	ac/gpu_info: add has_smem_with_null_prt_bug Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38698>	2026-04-30 09:29:43 +00:00
Samuel Pitoiset	ecfda339ca	ac/gpu_info: store more addr space info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38698>	2026-04-30 09:29:43 +00:00
Lionel Landwerlin	5a462d77ff	anv: remove a bunch of KHR alias uses Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41233>	2026-04-30 09:04:01 +00:00
Lionel Landwerlin	4c7948ec0d	anv: stop using queue priority KHR aliases Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41233>	2026-04-30 09:04:01 +00:00
Lionel Landwerlin	dad8f65611	anv: fix null pointer access Reproduces with dEQP-VK.pipeline.no_queues.pipeline_binary.compute Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `595889018a` ("anv: implement VK_KHR_maintenance9") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41233>	2026-04-30 09:04:01 +00:00
xueyuli2	da7ed1c576	amd/virtio: fix bo use-after-free race condition in amdvgpu_bo_free In amdvgpu_bo_free(), when the reference count drops to 0, vdrm_flush() is called before removing the bo from the handle_to_vbo hash table. Since vdrm_flush() is a time-consuming operation and is executed outside of the handle_to_vbo_mutex lock, another thread calling amdvgpu_bo_import() can concurrently find this bo in the hash table, increment its refcount, and attempt to use it. Once vdrm_flush() finishes, amdvgpu_bo_free() proceeds to remove the bo and call free(), leaving the importing thread with a dangling pointer, which leads to a use-after-free or double free crash. To fix this race condition, we must remove the bo from the hash table under the lock first. After the bo is safely unlinked and the lock is released, we can then perform the time-consuming vdrm_flush() and the actual memory release. Signed-off-by: zhaqian <zhaqian@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41146>	2026-04-30 08:41:50 +00:00
Icenowy Zheng	9960637b26	pvr: record deferred RTA clears for secondary cmdbuf subcmds Currently the code handling deferred RTA clears cannot handle them for secondary command buffers within render passes, because the code immediately configures the transfer command for the deferred clear operation, but the specific attachment image view isn't known when recording secondary command buffers to be executed inside render passes. Add code to record parameters for deferred RTA clears in secondary command buffers when the attachment is unknown, and bind the recorded clears to the attachment's image view when executing the secondary command buffer inside a render pass. Fixes many dynamic rendering random tests. Backport-to: 26.0 Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Luigi Santivetti <luigi.santivetti@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40838>	2026-04-30 08:23:27 +00:00
Icenowy Zheng	ecd4e93456	pvr: add deferred RTA clear command to list after checking it's not NULL The code that adds deferred RTA clear transfer commands checks whether the newly allocated transfer command is NULL. However the list_addtail call is before the check, which means that the check does not prevent NULL dereference. Reorder the code to ensure no NULL transfer commands would ever be added to the deferred clear list. In addition, pvr_transfer_cmd_alloc() has already set the command buffer's error status when it returns NULL, so it's not needed to set it again. Fixes: `2eabbbe57d` ("pvr: use linked list to back deferred clears") Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Luigi Santivetti <luigi.santivetti@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40838>	2026-04-30 08:23:27 +00:00
Icenowy Zheng	40e0f0f933	pvr: properly handle deferred RTA clears for 2D array view of 3D image For 2D array views of 3D images, the layer of the view corresponds to the depth (instead of the layer, which should be always 0) of the image. Fix the code emitting deferred RTA clears to set the depth instead of the layer of the image to clear. Fixes the flakiness of `dEQP-VK.renderpasses.renderpass. remaining_array_layers.multi_layer_fb.`. Fixes: `95820584d0` ("pvr: Add deferred RTA clears for cores without gs_rta_support.") Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Luigi Santivetti <luigi.santivetti@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40838>	2026-04-30 08:23:26 +00:00
Icenowy Zheng	43b22a477c	pvr: do not setup deferred RTA clear for active render targets Deferred RTA clear will happen after the current graphics subcommand is executed, which may override rendered image in the graphics subcommand. In addition, the active render targets do not need "emulated" clear -- they can be really cleared by drawing rectangles. Skip set up deferred RTA clear for active render target layers, and continue to do immediate clear for these layers. Fixes a few dynamic rendering random CTS tests, but the issue should also exist in legacy renderpasses RTA clears. Fixes: `95820584d0` ("pvr: Add deferred RTA clears for cores without gs_rta_support.") Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Luigi Santivetti <luigi.santivetti@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40838>	2026-04-30 08:23:26 +00:00
Nick Hamilton	08c13564d6	pvr: Revert don't csb emit multi-layer clear attachments without rta support While testing HW without gs_rta_support it was raised that this change had been made in error. After retesting with the change reverted the listed tests still pass. This reverts commit `d68344bffe`. Backport-to: 26.0 Reported-by: Luigi Santivetti <luigi.santivetti@imgtec.com> Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Luigi Santivetti <luigi.santivetti@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40838>	2026-04-30 08:23:26 +00:00
Samuel Pitoiset	69680f3d66	radv/ci: bump timeouts for radv-{navi21,gfx1201}-vkcts-full Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details They usually don't finish otherwise. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41282>	2026-04-30 07:56:04 +00:00
Icenowy Zheng	1027059baa	pvr: increase maxPerStageResources for new maxPerStageDescriptorStorageBuffers When maxPerStageResources is less than 128, it must be at least the sum of maxPerStageDescriptorUniformBuffers, maxPerStageDescriptorStorageBuffers, maxPerStageDescriptorSampledImages, maxPerStageDescriptorStorageImages, maxPerStageDescriptorInputAttachments and maxColorAttachments. As maxPerStageDescriptorStorageBuffers is previously increased, the value of maxPerStageResources should be increased too. This fixes regression on two limit validation tests in the Vulkan CTS -- dEQP-VK.info.device_properties and dEQP-VK.api.info. vulkan1p2_limits_validation.general . Fixes: `35f57a2739` ("pvr: increase value of maxPerStageDescriptorStorageBuffers") Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Luigi Santivetti <luigi.santivetti@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41270>	2026-04-30 07:38:53 +00:00
Marek Olšák	a3e3bf0ac2	nir/opt_dce: add shader_info::assert_inputs_not_dead Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41166>	2026-04-30 07:07:32 +00:00
Marek Olšák	7bd5856cc6	nir/opt_dce: factor out dead instruction removal into a helper Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41166>	2026-04-30 07:07:32 +00:00
Samuel Pitoiset	52669c3b5b	radv: re-organize radv_cmd_state slightly Trying to regroup states. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41152>	2026-04-30 06:18:29 +00:00
Samuel Pitoiset	730a5b725e	radv: move vertex buffer state to radv_cmd_state Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41152>	2026-04-30 06:18:29 +00:00
Samuel Pitoiset	9feb722b31	radv: move conditional rendering state to radv_cond_render_state Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41152>	2026-04-30 06:18:29 +00:00
Samuel Pitoiset	57ecb1c1ec	radv: cleanup suspending/resuming cond rendering with DGC Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41152>	2026-04-30 06:18:28 +00:00
Samuel Pitoiset	43abb73273	radv: move index buffer state to radv_index_buffer_state Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41152>	2026-04-30 06:18:28 +00:00
Samuel Pitoiset	93468750ef	radv: remove unnecessary radv_cmd_state::mesh_shading Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41152>	2026-04-30 06:18:27 +00:00
Samuel Pitoiset	5246eee299	radv: move streamout bindings to radv_streamout_state Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41152>	2026-04-30 06:18:26 +00:00
Samuel Pitoiset	02ec87672b	radv: move suspend_streamout to radv_streamout_state And rename it to suspended. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41152>	2026-04-30 06:18:25 +00:00
Karol Herbst	b2aa92b523	softfloat: make sign bit an unsigned int Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details According to ubsan shifting an int32_t by 31 bits to the left is undefined behavior. So just declare it as uint32_t. Backport-to: 26.1 Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41252>	2026-04-30 01:12:14 +00:00
Marek Olšák	7c69f31ecc	radeonsi/ci/build: also fetch video decode/encode sample for VK CTS Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41171>	2026-04-30 00:39:20 +00:00
Marek Olšák	97597d3696	radeonsi/ci: remove the fixed XFB test from fails/flakes Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41171>	2026-04-30 00:39:20 +00:00
Eric Engestrom	d433ea59eb	docs: add sha sum for 26.0.6 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41276>	2026-04-29 22:32:41 +00:00
Eric Engestrom	ba7ed5e309	docs: add release notes for 26.0.6 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41276>	2026-04-29 22:32:40 +00:00
Eric Engestrom	f9b205bfbc	docs: update calendar for 26.0.6 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41276>	2026-04-29 22:32:40 +00:00
Eric Engestrom	bac8b0bcb9	docs: update calendar for 26.1.0-rc3 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41273>	2026-04-29 21:26:20 +00:00
Danylo Piliaiev	59f626ac81	tu/u_trace: Fix explicit toggle_name not being used Fixes: `889f71f71a` ("tu: Add tracepoints for clear/copy/blit/lrz ops") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41102>	2026-04-29 20:17:35 +00:00
Danylo Piliaiev	6e8ec44287	tu/u_trace: Correct the order of tracepoints clonning for binning Otherwise we'd get tracepoints out of logical order, which doesn't matter for perfetto at the moment, but would matter with future perf warnings. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41102>	2026-04-29 20:17:35 +00:00
Danylo Piliaiev	50cc9c723c	tu/u_trace: Prevent cloning stale RB_DONE_TS results Otherwise, at best, we get results for the previous tile. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41102>	2026-04-29 20:17:35 +00:00
Danylo Piliaiev	6ac25aac23	tu/u_trace: Use correct u_trace destination in tu_clone_trace_range This fixes in-RP tracepoints not being duplicated for secondary command buffers. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41102>	2026-04-29 20:17:35 +00:00
Rohit Athavale	c5b184a02a	mediafoundation: Test compile steps v/s step , and set build flag Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/15244 Backport-to: * Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41221>	2026-04-29 19:59:50 +00:00
Emma Anholt	06ebe40ca1	tu: Set HALF_PRECISION on blits to R11G11B10. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fixes many new image_to_image tests copying to this format as of CTS 1.4.5.3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41231>	2026-04-29 18:58:45 +00:00
Icenowy Zheng	82925935d4	pvr: wait for graphics jobs in CopyQueryPoolResults The last graphics job, which might write to the occlusion query result, could still be running when vkCmdCopyQueryPoolResults is called. Additionally wait for graphics jobs before copying the results. Fixes: `24b1e3946c` ("pvr: Add support to submit occlusion query sub cmds.") Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40884>	2026-04-29 17:41:17 +00:00
Karol Herbst	528ceeb49b	rusticl: link the C++ runtime statically Apparently some applications don't have their C++ situation under control. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/14090 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41037>	2026-04-29 16:55:39 +00:00
Karol Herbst	5512680581	ci: install libstdc++-static on fedora Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41037>	2026-04-29 16:55:39 +00:00
Danylo Piliaiev	5fcde4d65d	freedreno: Fix CP_CCHE_INVALIDATE not being applied at the right point Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Apparently CP_CCHE_INVALIDATE is just a plain register write underneath, so it needs WFI before it, in order to invalidate at the right point. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41266>	2026-04-29 16:25:16 +00:00
Danylo Piliaiev	c2e78f1b22	tu: Fix CP_CCHE_INVALIDATE not being applied at the right point Apparently CP_CCHE_INVALIDATE is just a plain register write underneath, so it needs WFI before it, in order to invalidate at the right point. ``` CP_CCHE_INVALIDATE: mov $addr, 0x9881 mov $data, 0x1 waitin mov $01, $data ``` Fixes misrendering in Doom Eternal on A750. Fixes: `fb1c3f7f5d` ("tu: Implement CCHE invalidation") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41266>	2026-04-29 16:25:16 +00:00
Caio Oliveira	e1745e0bd9	brw: Fix max_dispatch_width collection for CS with variable size The intention of the original commit was to make all the shaders report the same max_dispatch_width. When CS has multiple variants, this was not happening as expected. Fixes: `2acc2f18ea` ("intel/compiler: report max dispatch width statistic") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41209>	2026-04-29 15:52:04 +00:00
Roman Stratiienko	bdbf4ed739	v3dv/android: Add deferred ANB allocation support Fixes: dEQP-VK.wsi.android.maintenance1.deferred_alloc.mailbox#basic dEQP-VK.wsi.android.maintenance1.deferred_alloc.mailbox#bind_image dEQP-VK.wsi.android.maintenance1.deferred_alloc.fifo#basic dEQP-VK.wsi.android.maintenance1.deferred_alloc.fifo#bind_image Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41235>	2026-04-29 15:31:28 +00:00
Mike Blumenkrantz	d4d7055aee	radv: add RADV_QUEUE_DISABLE env var for selectively disabling queues it is sometimes useful to test radv without certain queues disabled in order to exercise alternative codepaths. this exposes that capability Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41086>	2026-04-29 15:08:28 +00:00
Job Noorman	aaf4d77f43	ir3/shared_ra: fix live-out reload after src reload Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details When reloading live-out values along loop back-edges, we make sure to reuse the original register. However, we failed to detect cases where the spilled value got reloaded earlier for a src in a different register. Fix this by reloading the value again in the original register. Fixes a RA validation failure in Windrose. Signed-off-by: Job Noorman <jnoorman@igalia.com> Fixes: `fa22b0901a` ("ir3/ra: Add specialized shared register RA/spilling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41262>	2026-04-29 14:40:48 +00:00
Jose Maria Casanova Crespo	3a8d5aeaa1	v3dv: Expose hardware-accelerated integer dot products on V3D 7.1+ Expose VK_KHR_shader_integer_dot_product 4x8-bit packed dot products using native HW instructions v8dot and setnnmode. QPU instruction count for sdot_4x8_iadd compute shader: Before (scalar decomposition): 18 ALU cycles After (setnnmode + v8dot): 3 ALU cycles (6x) We advertise integerDotProduct4x8BitPacked*Accelerated for V3D 7.1+ Assisted-by: Claude Opus 4.6 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41255>	2026-04-29 13:21:08 +00:00
Jose Maria Casanova Crespo	8f06961bf5	broadcom/compiler: Eliminate redundant setnnmode instructions This new VIR optimization pass tracks the current NN signedness mode per block and removes duplicate setnnmode instructions. When consecutive dot products use the same signedness mode, the backend emits one setnnmode per dot product. This pass removes the redundant ones, keeping only the first. Assisted-by: Claude Opus 4.6 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41255>	2026-04-29 13:21:08 +00:00

1 2 3 4 5 ...

221833 commits