fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 15:58:05 +02:00

Author	SHA1	Message	Date
Rob Clark	0632161956	freedreno/a6xx: Hide 10_10_10_2 for opencl Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The alpha channel seems to be internally returned as f16 (up-converted to f32 is that is the dest type of the sam instruction). This expresses 1/3 and 2/3 with less precision than cl cts expects (f32). This may be a test bug. But the format is not required. Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40028>	2026-02-25 00:47:51 +00:00
Rob Clark	d487358527	rusticl: Add CL specific bind flag In some cases CL has higher precision requirements for format support. Add a PIPE_BIND_x flag so that drivers can expose formats in GL(ES) that they cannot expose in CL. Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40028>	2026-02-25 00:47:51 +00:00
Lionel Landwerlin	57c90e86bf	anv: add a drirc to control binding table block size Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39527>	2026-02-25 00:17:03 +00:00
Lionel Landwerlin	8a5ac96a67	anv: predicate BTP emissions The previous commit enable different command buffers to program the same 3DSTATE_BINDING_TABLE_POOL_ALLOC instruction even though they allocated different chunks of binding tables. Now we can just predicate this programming and skip the stalling, flushing & invalidation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39527>	2026-02-25 00:17:03 +00:00
Lionel Landwerlin	725c2a39d5	anv: enable sharing binding table pool programming We currently allocate 64KiB chunks of binding table pools for each command buffers and program the 3DSTATE_BINDING_TABLE_POOL_ALLOC instruction accordingly. But 3DSTATE_BINDING_TABLE_POINTERS_* instructions can address 2^20 bytes. So it's possible to have 2 command buffers share the same programming if they just add some offsets to their 3DSTATE_BINDING_TABLE_POINTERS_* programming and round down 3DSTATE_BINDING_TABLE_POOL_ALLOC addresses to 2^20. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39527>	2026-02-25 00:17:02 +00:00
Lionel Landwerlin	9ef47c743c	anv: move ALU registers used for mi commands Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39527>	2026-02-25 00:17:01 +00:00
Rob Clark	bfc5865a1b	gallium: Add warning about PIPE_QUERY_x's ABIness Someone somewhere decided to use PIPE_QUERY_x as the ABI between host and guest. Add a warning about this. Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40051>	2026-02-24 23:19:34 +00:00
Rob Clark	7dc6fbd5c5	gallium: Switch TIMESTAMP_RAW back to callback The original MR switched to use a float raw_timestamp_period to scale the raw timestamp outside of the gallium driver. This better matched how vulkan works. But unlike vulkan, gallium has timestamp related queries/APIs that return already scaled time, resulting in small errors if the way the scaling is done differs between driver scaling and frontend scaling. The important thing is that any error introduced by scaling must be the same error across APIs. (In particular, a f64 value cannot preciesly represent an arbitrary u64 value. OTOH the driver's scaling could be simply multiply be an integer. But differing precision errors of the two approachs causes problems when comparing between timestamps that are converted in different ways.) In some, but not all, cases this could be addressed by changing the driver to use the same scaling function, but this is not always possible (if, for ex, the scaling is done on the GPU CP). So switch back to the original approach from !39995, using a pscreen->convert_timestamp() callback, to put the control back in the hands of the driver. Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40051>	2026-02-24 23:19:34 +00:00
Urja Rannikko	c768797ab3	hash_table: fix use-after-free by reorganization of destruct callbacks Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Remove the allocation of a dummy context for the destruction callback of hash_table_u64 (on 32-bit), instead have hash_table provide callback(s) for handing destruction of the table contents. Signed-off-by: Urja Rannikko <urjaman@gmail.com> Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14521 Acked-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Achill Gilgenast <achill@achill.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39085>	2026-02-24 22:04:38 +00:00
Emma Anholt	f13f88b749	freedreno/crashdec: Print an error instead of crashing on fopen() fail. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40077>	2026-02-24 21:49:37 +00:00
Samuel Pitoiset	2eb9420061	ac/nir: fix writemask for dual source blending on GFX11+ Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This should definitely be an OR operation if MRT0 and MRT1 don't write the same channels. This also requires to set the writemask manually because when it's 0 (in case a dual-source output is missing), the intrinsic computes the mask itself with the number of components. No fossils-db changes on NAVI33. Fixes: `45d8cd037a` ("ac/nir: rewrite ac_nir_lower_ps epilog to fix dual src blending with mono PS") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14878 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39996>	2026-02-24 20:20:02 +00:00
Alyssa Rosenzweig	42c4f7935a	nir: optimize u2u32(unpack_32_2x16_split_*) Noticed while playing with pixel coord things. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40056>	2026-02-24 19:16:56 +00:00
Benjamin Otte	0b6dd167ac	lavapipe: Fix features for nonsubsampled ycbcr formats Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The Vulkan spec says about VkFormatFeatureFlagBits: If a format does not incorporate chroma downsampling (it is not a “422” or “420” format) but the implementation supports sampler Y′CBCR conversion for this format, the implementation must set VK_FORMAT_FEATURE_MIDPOINT_CHROMA_SAMPLES_BIT. Fixes: `af062126ae` Signed-off-by: Benjamin Otte <otte@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39820>	2026-02-24 18:56:10 +00:00
Rhys Perry	05586fae67	radv: combine v_mov_dpp into fma in frag_coord.z adjust This wasn't possible before because of the literal. fossil-db (vangogh): Totals from 3766 (3.36% of 112041) affected shaders: Instrs: 5646975 -> 5642967 (-0.07%); split: -0.09%, +0.02% CodeSize: 30751152 -> 30749956 (-0.00%); split: -0.02%, +0.02% VGPRs: 262640 -> 262672 (+0.01%); split: -0.01%, +0.02% Latency: 184869423 -> 184883681 (+0.01%); split: -0.02%, +0.03% InvThroughput: 43462111 -> 43456113 (-0.01%); split: -0.03%, +0.02% VClause: 93337 -> 93353 (+0.02%); split: -0.10%, +0.12% SClause: 186857 -> 186833 (-0.01%); split: -0.07%, +0.06% Copies: 390583 -> 390327 (-0.07%); split: -0.44%, +0.38% Branches: 106070 -> 106086 (+0.02%); split: -0.03%, +0.05% VALU: 4112856 -> 4108839 (-0.10%); split: -0.12%, +0.02% SALU: 593814 -> 593758 (-0.01%); split: -0.05%, +0.04% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40013>	2026-02-24 18:34:38 +00:00
Rhys Perry	437a3d5ade	radv: disable fast math for frag_coord.z adjust No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40013>	2026-02-24 18:34:37 +00:00
Caterina Shablia	5e25953690	radv: move all image checks into radv_image_need_retile There's no reason to have these checks be smeared between radv_image_need_retile and radv_retile_transition. Make radv_image_need_retile verify that the image might ever need to have its displayable DCC updated. Also, radv_image_need_retile should not care about the command buffer. We should never try to do retile transition on a command buffer that can't do compute to begin with. Make radv_retile_transition only check whether the layout we're transitioning to might involve reading the displayable DCC, and perform retiling if so. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39990>	2026-02-24 18:08:53 +00:00
Caterina Shablia	4529be010b	radv: skip retiling if transitioning away from ZERO_INITIALIZED Zeros are already a sensible value for display DCC metadata to have and correctly represent a zeroed image. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39990>	2026-02-24 18:08:53 +00:00
Nick Hamilton	14508b4c9a	pvr: Add support for fragment pass through shader On the Rogue architecture add support for using a fragment passthrough shader when there is no fragment shader present in a graphics pipeline but the sample mask is required. fix: dEQP-VK.pipeline.monolithic.empty_fs.masked_samples Backport-to: 26.0 Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Co-authored-by: Simon Perretta <simon.perretta@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40048>	2026-02-24 17:50:10 +00:00
Rob Clark	005ce4057c	freedreno+ir3: Implement CL isam mode CL expected different coord rounding, etc. Switch based on shader type. Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40027>	2026-02-24 17:22:03 +00:00
Rob Clark	ac7b457927	ir3: More COMPUTE vs KERNEL We were inconsistently handling MESA_SHADER_KERNEL, which for the most part should be treated identically to MESA_SHADER_COMPUTE. Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40027>	2026-02-24 17:22:03 +00:00
Nick Hamilton	b87d995d32	pvr: Update CI fails list after render pass fixes Backport-to: 26.0 Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40002>	2026-02-24 17:03:20 +00:00
Jarred Davies	d1f2ad17dd	pvr: Add missing support for tile buffers to SPM EOT programs Configure the EOT setup for SPM EOT programs so that the generated programs load the tile buffer into the output buffer before doing the emit Partial fix for: dEQP-VK.renderpass.*.attachment_allocation.input_output.71 Backport-to: 26.0 Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40002>	2026-02-24 17:03:20 +00:00
Nick Hamilton	0e01b9ef2d	pvr: Add missing support for preserve attachments In subpasses preserve attachments are not used by the subpass but their contents must be preserved throughout the subpass. Add a list for the preserve attachments info specified by a subpass and when determining a subpass attachments total uses check the preserve attachments list and add it uses to the total. Partial fix for: dEQP-VK.renderpass.*.attachment_allocation.input_output.71 Backport-to: 26.0 Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40002>	2026-02-24 17:03:20 +00:00
Nick Hamilton	e18670347a	pvr: Rename pvr_render_input_attachment The struct will also be used for preserve attachments in the next commit. Backport-to: 26.0 Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40002>	2026-02-24 17:03:20 +00:00
Jarred Davies	df445dc9b9	pvr: Fix allocating the required scratch buffer space for tile buffers When calculating the dwords per pixel the output registers should always be taken into account in addition to the number of tile buffers. Fixes incorrect scratch buffer space calculation when both output registers and tile buffers are emitted by a render. Partial fix for: dEQP-VK.renderpass.*.attachment_allocation.input_output.71 Fixes: `3457f8083a` ("pvr: Acquire scratch buffer on framebuffer creation.") Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40002>	2026-02-24 17:03:20 +00:00
Nick Hamilton	0640ac7e3b	pvr: Fix incorrect subpass merging optimisation The subpass merging optimisation check for when subpasses are using tile buffers was in the incorrect location. The current check is in a function called from two places but only the first of these should have been doing the optimisation check. This was incorrectly affecting the number of renders that subpass merging could avoid. Partial fix for: dEQP-VK.renderpass.*.attachment_allocation.input_output.71 Fixes: `10b6a0d567` ("pvr: Add support for generating render pass hw setup data.") Signed-off-by: Nick Hamilton <nick.hamilton@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40002>	2026-02-24 17:03:19 +00:00
Duncan Brawley	0ea39c6305	pco: Use vertex input registers in register allocation Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Add support for the use of vertex input registers as additional general purpose registers which previously was restricted to temporary registers. Use of vertex input registers as additional general purpose registers is not available for fragment shaders. Vertex input registers are similar to temporary registers. The only difference is that vertex input registers can contain pre-initialised data when the shader starts. By default, the number of vertex input registers used for register allocation is the number of vertex input registers used for their pre-initialised data rounded up to the nearest multiple of 4, as vertex input registers are allocated in blocks of 4. If PCO_DEBUG=alloc_extra_vtxins is used, a mimimum of 12 vertex input registers are available for register allocation. Signed-off-by: Duncan Brawley <duncan.brawley@imgtec.com> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39886>	2026-02-24 16:27:45 +00:00
Robert Mader	8592c177d1	lavapipe: Remove some dead code Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40030>	2026-02-24 16:12:13 +00:00
Robert Mader	0b6340fd94	lavapipe: enable dmabuf import for planar drm formats Like e.g. NV12. This just requires some minor fixes around offset handling. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40030>	2026-02-24 16:12:13 +00:00
Daivik Bhatia	7e2693f3d3	v3dv: use vk_graphics_pipeline_state for pipeline creation Refactor pipeline creation path to use the vk_graphics_pipeline_state structures provided by runtime instead of raw Vulkan CreateInfo structs. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39834>	2026-02-24 15:52:18 +00:00
Danylo Piliaiev	2b5b41bb24	tu: Fix double emission of PC_DS_CNTL due to missing break Didn't cause any issues. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39890>	2026-02-24 15:21:50 +00:00
Danylo Piliaiev	47251b2e2d	ir3: Align TCS per-patch output to 64 bytes to prevent stale reads Empirically, TCS outputs have to be aligned to 64 bytes, otherwise stale data may be read in rare cases. The exact reason is not clear, but tests and proprietary driver behavior strongly point at the need for 64 byte alignment. Fixes tesselation issues in at least "Conan Exiles" but likely in many more cases. CC: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39890>	2026-02-24 15:21:50 +00:00
Erik Faye-Lund	e111422f7f	pan/ci: correct syntax for flakes These shouldn't have the result at the end, silly me. Fixes: `6eeede8a52` ("pan/ci: add missing t720-flakes") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40065>	2026-02-24 15:02:17 +00:00
osy	3f75aa3b75	kk: enable VK_KHR_external_{fence,semaphore}_fd We only support VK_EXTERNAL_SEMAPHORE_HANDLE_TYPE_SYNC_FD_BIT with FD == -1. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39067>	2026-02-24 14:23:05 +00:00
osy	5f8a9f0aeb	vulkan: external sync for vk_sync_binary Venus uses vkImportSemaphoreFdKHR() with FD == -1 to signal a binary semaphore and vkGetSemaphoreFdKHR() to wait on a binary semaphore. Both KK and Dzn uses vk_sync_binary so supporting this special case for import/export sync file will enable Venus host support. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39067>	2026-02-24 14:23:05 +00:00
Georg Lehmann	07260dc210	nir/lower_subgroups: lower shuffles and bitwise reduce to 32bit before scalarizing Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Pack/unpack should be a lot faster than duplicating the subgroup op. No fossil-db changes, but multiple people complained about this to me. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40024>	2026-02-24 13:48:35 +00:00
Rhys Perry	613b4fe407	aco: resolve hazards before calls Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 26.0 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39825>	2026-02-24 13:20:55 +00:00
Rhys Perry	dfda890ae8	aco: reset all vgpr_used_by_vmem_ in resolve_all_gfx11 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 26.0 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39825>	2026-02-24 13:20:55 +00:00
Rhys Perry	72923ad2f0	aco: fix VALUReadSGPRHazard with s_call_b64/s_swappc_b64 This probably doesn't do anything because sgpr_read_by_valu are all set already for raytracing shaders. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39825>	2026-02-24 13:20:54 +00:00
Benjamin Cheng	2bb26573e7	radv/video: Split cdf buffer and encode ctx These resources are fundamentally different and should be treated as such. Reviewed-by: David Rosca <david.rosca@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40050>	2026-02-24 12:29:25 +00:00
Christian Gmeiner	8dc8a2eec4	panvk: Support VK_EXT_legacy_dithering Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Wire up the existing Panfrost dithering infrastructure to the Vulkan extension. The library already supports dithered formats via pan_dithered_format_from_pipe_format(), so this plumbs the VK_RENDERING_ENABLE_LEGACY_DITHERING_BIT_EXT flag through to the blend descriptor emission and color attachment internal conversion paths. Dithering is only applied to color attachments, not depth or framebuffer preloads. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39781>	2026-02-24 11:34:19 +00:00
Samuel Pitoiset	755cb6cb75	radv: fix independent sets with dynamic buffers and GPL If a set layout is missing the driver can't compute the dynamic buffer start offsets correctly. The only solution is to load these offsets from an user SGPR. To avoid adding more complexity, these offsets are re-emitted every time dynamic buffers are dirty. That shouldn't matter because the combination of dynamic buffers and independent sets is just super rare. This fixes new VKCTS coverage dEQP-VK.pipeline.pipeline_library.graphics_library.independent_sets_random.*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39988>	2026-02-24 11:12:14 +00:00
Georg Lehmann	c6e1c23faf	radv: expose VK_VALVE_shader_mixed_float_dot_product on supported hardware Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details In practice this means Vega20, Navi14, and RDNA2+. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40003>	2026-02-24 08:55:53 +00:00
Georg Lehmann	892274d20d	ac/llvm: implement mixed float dot Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40003>	2026-02-24 08:55:53 +00:00
Georg Lehmann	dd067088ef	aco: allow dpp for fp8/bf8 dot4 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40003>	2026-02-24 08:55:53 +00:00
Georg Lehmann	a033cd95a4	aco: allow modifiers for fp16 dot Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40003>	2026-02-24 08:55:53 +00:00
Georg Lehmann	3238e64d3c	aco/ra: create v_dot2c_f32_f16 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40003>	2026-02-24 08:55:53 +00:00
Georg Lehmann	237b8ca205	aco: mixed float dot product opcodes Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40003>	2026-02-24 08:55:52 +00:00
Georg Lehmann	4dc0f7d540	spirv: implement SPV_VALVE_mixed_float_dot_product Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40003>	2026-02-24 08:55:52 +00:00
Georg Lehmann	0d6fe16ce8	nir: add mixed float dot opcodes Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40003>	2026-02-24 08:55:52 +00:00

1 2 3 4 5 ...

219069 commits