mesa/src/panfrost/libpan
Christian Gmeiner 3d7d2115f8
panvk: Implement vkCmdFillBuffer with panlib kernels
Replace the vk_meta_fill_buffer call with direct panlib precomp
dispatches: a KERNEL(32) uint4 bulk path for 16-byte-aligned fills and a
KERNEL(32) uint32 path otherwise, each with a KERNEL(1) scalar tail for
sub-workgroup remainders.
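
The split described above can be sketched on the host side as follows. This is a minimal illustration, not the panvk code: `plan_fill`, `struct fill_plan`, and `WG_SIZE` are hypothetical names. It assumes the uint4 path is taken only when both the offset and the size are multiples of 16 bytes, and that the size is a multiple of 4 bytes, as vkCmdFillBuffer requires.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical: invocations per workgroup, mirroring KERNEL(32). */
#define WG_SIZE 32u

/* Hypothetical description of how one fill is split into dispatches. */
struct fill_plan {
    uint32_t elem_bytes;  /* 16 for the uint4 path, 4 for the uint32 path */
    uint64_t bulk_groups; /* full workgroups of WG_SIZE elements */
    uint32_t tail_elems;  /* leftover elements for the scalar tail */
};

static struct fill_plan plan_fill(uint64_t offset, uint64_t size)
{
    struct fill_plan p;

    /* Assumption: the wide path needs both offset and size 16-byte aligned. */
    bool aligned16 = (offset % 16 == 0) && (size % 16 == 0);
    p.elem_bytes = aligned16 ? 16u : 4u;

    uint64_t elems = size / p.elem_bytes;

    /* Whole workgroups go to the bulk kernel. */
    p.bulk_groups = elems / WG_SIZE;

    /* The sub-workgroup remainder goes to the one-invocation tail kernel. */
    p.tail_elems = (uint32_t)(elems % WG_SIZE);

    return p;
}
```

Under these assumptions, a 528-byte fill at offset 0 takes the 16-byte path with one bulk workgroup and one tail element, while a 520-byte fill at offset 4 takes the 4-byte path with four bulk workgroups and two tail elements.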

gpu-ratemeter vk.bufbw on Mali-G610 MC4 shows a 1.15-1.18x median
speedup across alignment classes and roughly 5x on fills <= 512 B,
thanks to the removed pipeline bind / descriptor-set setup that
vk_meta_fill_buffer pays per call.

Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com>
Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41079>
2026-04-27 08:19:20 +00:00
copy.cl panvk: Implement vkCmdFillBuffer with panlib kernels 2026-04-27 08:19:20 +00:00
draw_helper.cl panvk/csf: implement VK_EXT_primitives_generated_query primitive restart 2026-01-21 09:03:34 +00:00
draw_helper.h panvk: Parallelize min max index search on JM 2025-08-04 12:12:56 +00:00
indirect_dispatch.cl libpan: Add indirect dispatch implementation 2025-06-06 12:58:21 +00:00
libpan.h pan/clc: Build for v13 2025-04-15 13:36:07 +02:00
libpan_dgc.h panvk/csf: implement dynamic precomp dispatch size 2026-01-21 09:03:34 +00:00
libpan_shaders.h pan/clc: Build for v13 2025-04-15 13:36:07 +02:00
meson.build panvk: Implement vkCmdFillBuffer with panlib kernels 2026-04-27 08:19:20 +00:00
query_pool.cl build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00