fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 13:58:04 +02:00

Author	SHA1	Message	Date
Lorenzo Rossi	dc0dcc993b	nvk: implement VK_EXT_discard_rectangles Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Signed-off-by: Lorenzo Rossi <git@rossilorenzo.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33476>	2025-10-21 23:17:38 +00:00
Lorenzo Rossi	4c62e09505	vulkan: increase MESA_VK_MAX_DISCARD_RECTANGLES Turing and newer Nvidia cards can work with up to 8 discard rectangles Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Signed-off-by: Lorenzo Rossi <git@rossilorenzo.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33476>	2025-10-21 23:17:38 +00:00
Rhys Perry	b18421ae3d	amd/lower_mem_access_bit_sizes: fix shared access when bytes<bit_size/8 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This can happen with (for example) 32x2 loads with align_mul=4,align_offset=2. This patch does bit_size=min(bit_size,bytes) to prevent num_components from being 0. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `52cd5f7e69` ("ac/nir_lower_mem_access_bit_sizes: Split unsupported shared memory instructions") Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37953>	2025-10-21 22:10:34 +00:00
Rhys Perry	64ec757688	nir/lower_mem_access_bit_sizes: increase chunk limit Not sure about creating u64vec16 loads, but creating unaligned loads is possible with opt_if_rewrite_uniform_uses. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37953>	2025-10-21 22:10:34 +00:00
Rhys Perry	e89b22280f	amd/lower_mem_access_bit_sizes: be more careful with 8/16-bit scratch load Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 25.3 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37953>	2025-10-21 22:10:34 +00:00
Rhys Perry	8829fc3bd6	amd/lower_mem_access_bit_sizes: improve subdword/unaligned SMEM lowering Summary of changes: - handle unaligned 16-bit scalar loads when supported_dword=true - increases the size of 8/16/32/64-bit buffer loads which are not dword aligned, which can create less SMEM loads. - handles when "bytes" is less than "bit_size / 8" fossil-db (gfx1201): Totals from 26 (0.03% of 79839) affected shaders: Instrs: 12676 -> 12710 (+0.27%); split: -0.30%, +0.57% CodeSize: 67272 -> 67384 (+0.17%); split: -0.24%, +0.40% Latency: 44399 -> 44375 (-0.05%); split: -0.09%, +0.04% SClause: 352 -> 344 (-2.27%) SALU: 3972 -> 3992 (+0.50%) SMEM: 554 -> 528 (-4.69%) fossil-db (navi21): Totals from 6 (0.01% of 79825) affected shaders: Instrs: 2192 -> 2186 (-0.27%) CodeSize: 12188 -> 12140 (-0.39%) Latency: 10037 -> 10033 (-0.04%); split: -0.12%, +0.08% SMEM: 124 -> 118 (-4.84%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `fbf0399517` ("amd/lower_mem_access_bit_sizes: lower all SMEM instructions to supported sizes") Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37953>	2025-10-21 22:10:34 +00:00
Rhys Perry	79b2fa785d	amd/lower_mem_access_bit_sizes: don't create subdword UBO loads with LLVM These are unsupported. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14127 Fixes: `fbf0399517` ("amd/lower_mem_access_bit_sizes: lower all SMEM instructions to supported sizes") Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37953>	2025-10-21 22:10:33 +00:00
Dylan Baker	38e1a43f53	intel/mda: Fix potential underflow in printing code The actual chances of this happening seem dubious, but the cleaned up code seems nice. printf returns a value >= 0 on success, which is the number of characters it writes a return < 0 means that an error occurred, and then errno is set. Which negative value doesn't seem to be specified, but it also seems unlikely that any implementation would return `-MAX_INT`... Anyway, this is fixed by converting the generic `print_repeated` to a `print_separator` that avoids the need to do arithmetic at all by just stopping the loop at 1 instead of 0, and then printing a newline. CID: 1666497 CID: 1666256 CID: 1666531 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37746>	2025-10-21 21:55:53 +00:00
Dylan Baker	f25e59b951	intel/mda/tests: use an ASSERT on fread() Coverity is pointing out that we should check this, and in reality if this isn't what we expect the rest of the test is probably invalid anyway. CID: `1666504` CID: 1666544 CID: 1666552 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37750>	2025-10-21 14:39:18 -07:00
Mel Henning	28fbc6addb	nvk: VK_DEPENDENCY_ASYMMETRIC_EVENT_BIT_KHR This was missed in the original maintenance9 MR. Fixes the flakes in test dEQP-VK.synchronization2.op.single_queue.event.write_ssbo_compute_read_ssbo_compute.buffer_16384_maintenance9 Fixes: `7692d3c0` ("nvk: Advertise VK_KHR_maintenance9") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37964>	2025-10-21 20:57:41 +00:00
Karol Herbst	e7dca5a6ca	nak: fix MMA latencies on Ampere Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Fixes: `7a01953a39` ("nak: Add Ampere and Ada latency information") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37941>	2025-10-21 20:12:30 +00:00
Karol Herbst	cf4df97093	nak: improve fp16 latencies on Ampere Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37941>	2025-10-21 20:12:30 +00:00
Karol Herbst	85480200f8	nak: simplify SM80 HMMA latency categorization Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37941>	2025-10-21 20:12:30 +00:00
Karol Herbst	3bbf3f7826	nak: ensure deref has a ptr_stride in cmat load/store lowering With untyped pointer we might get a deref_cast with a 0 ptr_stride. But we were supposed to ignore the stride information on the pointer anyway, so let's do that properly now. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Fixes: `05dca16143` ("nak: extract nir_intrinsic_cmat_load lowering into a function") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37941>	2025-10-21 20:12:30 +00:00
Karol Herbst	f632bfc715	nak: extract cmat load/store element offset calculation Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Fixes: `05dca16143` ("nak: extract nir_intrinsic_cmat_load lowering into a function") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37941>	2025-10-21 20:12:30 +00:00
Konstantin Seurer	d423554e9e	radv/bvh: Pair compress triangles in more cases Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:55 +00:00
Konstantin Seurer	c0f332f1cb	vulkan/bvh: Add leaf.h to vk_bvh_includes Otherwise, the shader will not recompile when the file was modified. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:55 +00:00
Konstantin Seurer	020bd86d30	vulkan: Remove the vk_ir_triangle_node::id field Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:55 +00:00
Konstantin Seurer	c18a7d0e2b	radv: Emit compressed primitive nodes on GFX12 The normal encode pass writes batches to a section in build scratch memory. Those batches contain information about the internal node and the primitive nodes. The encoder is split to avoid the register pressure of the compressor and maximize occupancy. The compressor works in two passes because one pass can not guarantee that every primitive node (except) has at least two triangles. This guarantee is used to advertise a smaller acceleration structure size to the application. During compression, every invocation processes at most two triangles. Groups of 8 invocations are used to support the maximum triangle count of 16 that the hardware supports. The first step of compression is loading the triangle(s). Shared vertices are deduplicated early to avoid doing it in the compression loop. The compression loop tries to add triangles to a list of triangles until the computed node size needed for storing the triangles reaches the hardware node size. For this, each invocation first deduplicates vertices with the triangles that have already been picked. It then computes the node size of the picked triangles plus the candidate triangles of the current invocation. The invocation that computed the smallest size is added to the list. Because it may not be possible to fit every triangle into the same node, there can be multiple hardware nodes which are written in parallel for optimal performance. If there are no nodes with only one triangle, all nodes are written. If there is, compression of the batch is aborted and the index of the batch is written to build scratch memory. The second compression pass will repeat the steps above but only for those aborted batches. The nodes with only one triangle can and are now merged. It can not be determined during box node encode which triangles will be compressed together so the encoder also has to fix up the parent box node's child infos. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:55 +00:00
Konstantin Seurer	c5f9fe5e3b	radv/rra/gfx12: Properly validate geometry indices Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:54 +00:00
Konstantin Seurer	82728380a2	vulkan/bvh: Add some debug helpers Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:54 +00:00
Konstantin Seurer	639cc4d937	vulkan: Bump MAX_ENCODE_PASSES to 4 Triangle compression will be performed in two extra passes. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:54 +00:00
Konstantin Seurer	6a53aae6b2	vulkan: Add vk_ir_header::driver_internal Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:54 +00:00
Konstantin Seurer	2ee8bfefe6	radv/bvh: Add radv_first_active_invocation Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36965>	2025-10-21 19:32:53 +00:00
Yiwei Zhang	bd53bbbc57	panvk: support VK_EXT_external_memory_acquire_unmodified Upon acquiring an external image from external/foreign queue family, skip AFBC metadata invalidation if the app has explicitly requested acquireUnmodifiedMemory. This also applies to CRC which may or may not get hooked up later. Reviewed-by: John Anthony <john.anthony@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37972>	2025-10-21 19:15:58 +00:00
Konstantin Seurer	990f1868ec	vulkan/cmd_queue: Free all elements of struct arrays Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37710>	2025-10-21 19:50:47 +02:00
Konstantin Seurer	a3e77fe5d2	vulkan/cmd_queue: Fix indentation for struct array copies Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37710>	2025-10-21 19:49:54 +02:00
Faith Ekstrand	38950083ae	panvk: Fix integer dot product properties Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We already set has_[su]dot_4x8[_sat] in nir_shader_compiler_options so we're already getting the opcodes. We just need to advertise the features properly. If bifrost_compile.h is to be believed, those are all available starting at gen 9. Closes: https://gitlab.freedesktop.org/panfrost/mesa/-/issues/218 Closes: https://gitlab.freedesktop.org/panfrost/mesa/-/issues/219 Fixes: `f7f9b3d170` ("panvk: Move to vk_properties") Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37980>	2025-10-21 17:24:41 +00:00
Silvio Vilerino	d380e54422	d3d12: Fix d3d12_video_enc.cpp(4794,33): Error C4244: initializing: conversion from uint64_t to SIZE_T, possible loss of data Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:23:36 -07:00
Silvio Vilerino	44d8e999e2	mediafoundation: Also set pSyncObjectQueue = m_spStagingQueue when DX11 input sample Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:23:31 -07:00
Silvio Vilerino	d2cbbccaaa	mediafoundation: Only wait on pSyncObjectQueue for stats completion if any stat was enabled Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:23:27 -07:00
Silvio Vilerino	4f7aa40222	mediafoundation: Allocate pro-rated buffer sizes for multi-slice encoding Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:23:21 -07:00
Silvio Vilerino	b454c35318	mediafoundation: Only use sliced mode when CODECAPI_AVEncSliceGenerationMode is set, disregarding num slices configured Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:23:11 -07:00
Silvio Vilerino	71aecf4a93	mediafoundation: SliceGeneration=1: Zero copy IMFSample output with wrapped ID3D12Resource frame/slice buffers Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:23:06 -07:00
Silvio Vilerino	45e56e4c96	d3d12: Only check for GetDeviceRemovedReason in debug builds Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:23:01 -07:00
Silvio Vilerino	4e1bb2111f	d3d12: d3d12_promote_to_permanent_residency to accept res array batch Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:56 -07:00
Silvio Vilerino	07224f6d15	d3d12: Make output metadata frame buffer READBACK and use direct Map() in get_feedback Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:48 -07:00
Silvio Vilerino	e4d8a49fcd	d3d12: Only check H264 video caps if configuration changed between frames Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:40 -07:00
Silvio Vilerino	8fd82cb339	d3d12: d3d12_video_encoder_get_slice_bitstream_data use regular Map/Unmap Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:34 -07:00
Silvio Vilerino	1dc76fcaa8	d3d12: Use readback heaps for staging bitstream allocations Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:30 -07:00
Silvio Vilerino	9b131f1407	d3d12: Video Encode - Make some parameters const & instead of by value Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:24 -07:00
Silvio Vilerino	1ffefc3e32	d3d12: Use cached heap allocations for output bitstreams instead of allocating per frame Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:18 -07:00
Silvio Vilerino	adbb07e927	d3d12: Use cached heap allocations for barriers instead of allocating per frame Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:14 -07:00
Silvio Vilerino	b076cfdf22	d3d12: Remove unused d3d12_video_encoder::m_transitionsBeforeCloseCmdList Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:10 -07:00
Silvio Vilerino	6f9c49f6f5	d3d12: Only check HEVC video caps if configuration changed between frames Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:05 -07:00
Silvio Vilerino	e3ab866fea	d3d12: Only call CheckFeatureSupport(D3D12_FEATURE_FORMAT_INFO when video format changes Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:22:01 -07:00
Silvio Vilerino	ca2a1e470a	d3d12: Remove per frame allocation slice_sizes(picture->num_slice_descriptors) Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:21:54 -07:00
Silvio Vilerino	53e07e78c7	d3d12: Cache ID3D12VideoEncodeCommandList4 instance if supported Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:21:50 -07:00
Silvio Vilerino	b1ea2b06eb	d3d12: Cache ID3D12VideoEncoderHeap1 instance if supported Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:21:46 -07:00
Silvio Vilerino	a51c3b5bd0	d3d12: Cache ID3D12VideoDevice4 instance if supported Reviewed-by: Pohsiang (John) Hsu <pohhsu@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37982>	2025-10-21 09:21:41 -07:00

1 2 3 4 5 ...

213890 commits