fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-06-21 14:38:36 +02:00

Author	SHA1	Message	Date
Hans-Kristian Arntzen	779eb97cd2	wsi/x11: Add helper to find appropriate screen resources for a window. Correlate the screen roots with the xcb_window_t root. This mapping should be static. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Reviewed-and-tested-by: Mario Kleiner <mario.kleiner.de@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39551>	2026-06-19 21:07:09 +00:00
Hans-Kristian Arntzen	3778c321c4	wsi/x11: Setup screen resources on x11_connection creation. Makes it possible to query refresh rates of screens. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Reviewed-and-tested-by: Mario Kleiner <mario.kleiner.de@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39551>	2026-06-19 21:07:09 +00:00
Hans-Kristian Arntzen	4c7266a505	loader: Clear screen resources struct on init. Somewhat surprising this was not done already. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Reviewed-and-tested-by: Mario Kleiner <mario.kleiner.de@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39551>	2026-06-19 21:07:09 +00:00
Hans-Kristian Arntzen	36d77ef0f9	loader: Separate out X11 specific screen queries from dri_helper.h. Allows these helpers to be used for X11 WSI as well. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Mario Kleiner <mario.kleiner.de@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39551>	2026-06-19 21:07:09 +00:00
Faith Ekstrand	2ed09c8b11	kraid: Add OpRegIn and OpRegOut These will be implemented entirely in the register allocator as register assignment constraints (and possibly a copy in the case of OpRegOut). Only OpRegIn is implemented in the trivial RA. OpRegOut will have to wait for the real RA. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42344>	2026-06-19 20:52:05 +00:00
Faith Ekstrand	ed9c430375	kraid: Use a tuple struct for SSAValue Since it only contains an SSAValueInner, this is a little tidier. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42344>	2026-06-19 20:52:05 +00:00
Faith Ekstrand	f4be78358c	kraid: Add an SSAValue::bytes() helper This is just bits() divided by 8 but it's a bit more efficient and we want bytes often enough that we might as well have the helper. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42344>	2026-06-19 20:52:05 +00:00
Yiwei Zhang	8bc99ecb19	venus: amend roundtrip between fence submit and wait idle The last virtgpu submission can end up with a tailing fence submit racing with renderer side sync thread teardown during vkDestroyDevice. This change ensures virtgpu side fence submits are done before the idle wait fence submit via the ring. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/15672 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42327>	2026-06-19 20:29:22 +00:00
Mike Blumenkrantz	7695a2729e	zink: don't invalidate cbufs without inlined resolve this probably means the resolve couldn't be inlined and will follow separately Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42341>	2026-06-19 20:10:44 +00:00
Mike Blumenkrantz	7cdb5dbe56	zink: proactively apply transfer sync when tracking renderpasses triggering a TRANSFER_WRITE->FRAGMENT_READ barrier on rp start should (mostly) eliminate renderpass splitting from transfer op sync Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42341>	2026-06-19 20:10:44 +00:00
Mike Blumenkrantz	0cbf3baaee	zink: properly invalidate fb attachments on dontcare stores Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42341>	2026-06-19 20:10:44 +00:00
Mike Blumenkrantz	dbce242de9	zink: stop forcing barriers if previous access was write this is a redundant condition that triggers additional sync Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42341>	2026-06-19 20:10:44 +00:00
Mike Blumenkrantz	206740815b	zink: when triggering zink_blit_barriers() for src==dst, apply separate barriers combining the barriers prevents the write barrier from correctly clearing the read flags, which breaks analysis for successive read barriers cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42341>	2026-06-19 20:10:43 +00:00
Charmaine Lee	d55b2cafb3	nir_to_tgsi: fix shared memory index The index for shared memory should be set to TGSI_MEMORY_TYPE_SHARED which matches the index used in the declaration. Fixes spec@arb_compute_shader@execution@shared-atomic* with SVGA driver Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16458>	2026-06-19 19:38:16 +00:00
Thomas H.P. Andersen	805b6e75e7	nvk: add env var to allow backwards compat in dlss Adds NVK_EXPERIMENTAL=dlss_backwards_compat Allows using a SASS binary with a matching major version number, but smaller minor number than the device. Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Tested-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40686>	2026-06-19 19:01:39 +00:00
Thomas H.P. Andersen	613a246880	nvk: hide NVX_binary_import behind NVK_EXPERIMENTAL=dlss env var Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Tested-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40686>	2026-06-19 19:01:39 +00:00
Autumn Ashton	a7b3494629	nvk: Implement VK_NVX_binary_import This commit implements the subset of cubin launch functionality in VK_NVX_binary_import used by DLSS. With this, DLSS works in Control on a RTX 2060 Super (sm_75) and a RTX 4060 (sm_89). Right now, this will only work where there is compatible bytecode available for the current physical device and will return an error in vkCreateCuModuleNVX if none is present. DXVK-NVAPI and DLSS handle this error gracefully and will disable the affected features. The NVIDIA driver would do PTX -> bytecode on the fly to handle this, but we don't have PTX->NIR yet, and that is a very large undertaking not done by this MR. This doesn't fully close #12439, as we don't have PTX->NIR, but is a big step towards it. Signed-off-by: Autumn Ashton <misyl@froggi.es> Reviewed-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Tested-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40686>	2026-06-19 19:01:39 +00:00
Thomas H.P. Andersen	87004ffcae	nouveau/cubin: use libelf 64 bit instead of gelf gelf was causing problems on Android. We only need 64 bit so gelf should not be needed. Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40686>	2026-06-19 19:01:38 +00:00
Autumn Ashton	21245929fc	nouveau/cubin: Add cubin and fatbin parsers These cubin and fatbin parsers implement a subset of the functionality exposed in order to launch the modern Cuda kernels used for DLSS. Co-authored-by: Mary Guillemard <mary@mary.zone> Signed-off-by: Autumn Ashton <misyl@froggi.es> Reviewed-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Acked-by: Karol Herbst <kherbst@redhat.com> Tested-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40686>	2026-06-19 19:01:38 +00:00
Autumn Ashton	61577fd631	nvk: Add nvk_cmd_dispatch_with_root This allows making dispatch with a specifically inputted root descriptor, primarily for cubin kernel launches. Signed-off-by: Autumn Ashton <misyl@froggi.es> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Tested-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40686>	2026-06-19 19:01:38 +00:00
Autumn Ashton	0dfe4aaa24	nvk: Allow nvk_cmd_upload_qmd to take a custom root descriptor The cubin kernel launches need to use a root descriptor that's compatible with the bytecode that nvcc generates which contains block dim, grid dim and the kernel params at specific layouts which can be influenced by ELF .nv.info attributes. Thus, expose the ability to input custom root descriptors in nvk_cmd_upload_qmd. Signed-off-by: Autumn Ashton <misyl@froggi.es> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Tested-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40686>	2026-06-19 19:01:38 +00:00
Autumn Ashton	acfea9e03f	nak: Expose max_warps_per_sm Previously this was only accessible from Rust, but VK_NVX_binary_import needs to calculate this for imported cubin kernels from EIATTR_REGCOUNT. Signed-off-by: Autumn Ashton <misyl@froggi.es> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Mary Guillemard <mary@mary.zone> Reviewed-by: Karol Herbst <kherbst@redhat.com> Tested-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40686>	2026-06-19 19:01:38 +00:00
Adrián Larumbe	c95edade04	panvk: Talk directly to pankmod when binding sparse resources There's no longer need for the panvk_sparse library, or for panvk to care about whether the KMD can do native sparse mapping. Submit sparse VM bindings as a single operation and let pankmod handle the gory details. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40400>	2026-06-19 18:20:31 +00:00
Adrián Larumbe	19cf49f02f	panvk: Use pankmod instead of panthor drm interfaces in bind queues On top of that, leverage the new push/flush interface so that management of the black hole in older KMD versions can be handled by the pankmod layer. Merging of operations is now done in conjunction with buffering the latest submission, so that the very last operation can have its signal syncs assigned before being delivered to the pankmod layer. Co-developed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40400>	2026-06-19 18:20:31 +00:00
Adrián Larumbe	306539a3c7	pan/kmod: Introduce vm_op buffering and sparse mapping emulation The goal is moving the need for prebuffering when the total number of vm_bind operations isn't known in advance away from panvk, and into the pankmod layer, and also to consolidate that treatment in a single place. At the moment, both panvk_vX_bind_queue.c and panvk_sparse.c roll their own workarounds for the blackhole-mapping sparse bind mechanism. For older KMD versions with no sparse mapping support, emulate it by cyclically mapping over a dummy BO, which is allocated on demand and per VM. This behaviour is similar to that of the Panthor KMD. This moves responsibility over whether to use native KMD sparse mapping or the blackhole method into the pankmod layer, so that the sparse mapping mechanism is transparent to the Vulkan driver. Also disallow automatic VA assignment when sparse emulation is required, because relaying auto va's back to the caller is both cumbersome and unsafe, and also not a practical use case. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40400>	2026-06-19 18:20:31 +00:00
Adrián Larumbe	831bc9bb65	pan/kmod: Introduce sparse binding Register whether the underlying KMD supports sparse mappings in a device property. Add a new VM operation field that holds flags, for the time being only sparse is a valid operation modifier. Disallow sparse operations when an automatic VA is requested or when a BO is provided accidentally. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40400>	2026-06-19 18:20:31 +00:00
Adrián Larumbe	f20b7adbca	pan/kmod: Handle sync object signals in Panthor's vm_bind A future commit will want to have a binary sync object attached to a vm_bind operation or a sync operation only, so rather than creating a separate pankmod flag for it, we simply check the point (always 0). Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40400>	2026-06-19 18:20:31 +00:00
Adrián Larumbe	1dd5ea7feb	pan/kmod: Pass signal and wait syncs separately This is done in preparation of kmod support for blackhole sparse mappings. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40400>	2026-06-19 18:20:31 +00:00
Adrián Larumbe	9ded6f3d38	pan/kmod: Use kernel-reported page sizes for new VM when available Instead of hard-coding available page sizes in UM, have pan_kmod backends query the KMD when these are exposed by the kernel. This is not yet done for Panfrost, but it might be added soon. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40400>	2026-06-19 18:20:30 +00:00
Adrián Larumbe	1ca84d67ac	drm-uapi: Sync the panthor header Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Signed-off-by: Adrián Larumbe <adrian.larumbe@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40400>	2026-06-19 18:20:30 +00:00
Juan A. Suarez Romero	df96a100ae	v3dv: fix assertion on push constants Fixes a compiler warning regarding the assertion. Fixes: `6d6a3ab679` ("v3dv: asserts push constants data is valid") Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42269>	2026-06-19 18:01:40 +00:00
David Rosca	08c2bb3b31	radeonsi/mm: Set correct usage in si_dec_fill_surface Fixes: `26979becec` ("radeonsi/video: Add video decoder using ac_video_dec") Reviewed-by: Benjamin Cheng <benjamin.cheng@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42149>	2026-06-19 15:18:03 +00:00
David Rosca	6cd7dd852a	radeonsi/mm: Only setup ref surfaces with tier3 For lower tiers this adds unnecessary dependency on the ref surface. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/15630 Fixes: `26979becec` ("radeonsi/video: Add video decoder using ac_video_dec") Reviewed-by: Benjamin Cheng <benjamin.cheng@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42149>	2026-06-19 15:18:03 +00:00
David Rosca	61d2f8d0f1	radeonsi/mm: Return error when decoding H264 P/B frame with no refs The firmware expects at least one valid reference when decoding P and B frames, otherwise it may pagefault. If the app doesn't handle missing references by using dummy surfaces, error out when trying to decode such frame. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/15659 Fixes: `26979becec` ("radeonsi/video: Add video decoder using ac_video_dec") Reviewed-by: Benjamin Cheng <benjamin.cheng@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42256>	2026-06-19 14:57:39 +00:00
Matt Turner	5bb025f953	gallivm: fix small_unorm -> unorm8 fetch path on big-endian Two bugs in lp_build_fetch_rgba_aos's small_unorm fast path: - vector_justify=true shifted the loaded value into the MSB of the wider type on big-endian. The format_desc already carries big-endian-corrected channel shifts, so the extra shift broke channel extraction for sub-32-bit formats (e.g. R8G8B8, B5G5R5). - The output OR-loop packed channels assuming little-endian byte order (shift = j * width), so after bitcast to vec4-u8 on big-endian the alpha channel landed at byte[0] instead of byte[3]. The fix is simple: gather with vector_justify=false so format_desc shifts apply directly; use (3-j)*width on UTIL_ARCH_BIG_ENDIAN to match the memory layout that big-endian bitcast produces. This fixes the lp_test_format test on big-endian platforms. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42228>	2026-06-19 14:17:27 +00:00
Matt Turner	a9d70225ec	gallivm: fix lp_build_round on altivec/VSX The software fallback in lp_build_round (used when arch_rounding_available returns false, e.g. altivec with length < 4) used lp_build_iround's bias-and-truncate path, which rounds half-away-from-zero due to float32 rounding of the (a + nextafterf(0.5)) sum. This caused lp_test_arit failures for v1 and v2 vector widths on ppc64. For altivec/VSX, llvm.nearbyint lowers to vrfin (AltiVec) or xvrspic (VSX) — both single instructions that round to nearest-even — for any vector width. Use it in the else branch when has_altivec is set, preserving the lp_build_iround path for x86 pre-SSE4.1 where llvm.nearbyint would expand to scalar nearbyintf calls. Update the length==2 expected-failure condition in lp_test_arit to exclude altivec (now fixed), keeping it for other platforms that still use the software fallback. This fixes the lp_test_arit test on ppc64. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42227>	2026-06-19 13:46:11 +00:00
Juan A. Suarez Romero	07b53cd328	v3d/ci: update expected results and document failures Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42292>	2026-06-19 12:31:56 +00:00
Job Noorman	99a268c889	ir3/lower_vars_to_scratch_global: use stable sort for variables To ensure we pick variables to spill deterministically. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42168>	2026-06-19 12:00:49 +00:00
Job Noorman	3596d63338	nir/lower_vars_to_scratch_global: make callback deterministic We pass the found variables as a pointer set to the driver. Since the callback is supposed to be used for global decisions, the driver might end up picking different variables based on the (non-deterministic) iteration order of the set. Fix this by passing the variables as a util_dynarray instead. To make sure the contents of the util_dynarray don't have to be shuffled around every time the driver wants to remove a variable from it, introduce nir_variable::pass_flags that we use to create an intrusive ordered set using a util_dynarray. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42168>	2026-06-19 12:00:49 +00:00
Eric Engestrom	eec4f5712d	ci: fix the fix for perfetto download in `make-git-archive` nightly job The previous fix used `grep -P` which is not supported by the grep implementation used in this job, so replace it with `grep -E` + `cut` which is supported by that implementation. Fixes: `df3756e6dc` ("ci: fix perfetto download in `make-git-archive` nightly job") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42331>	2026-06-19 11:22:49 +00:00
Christian Gmeiner	ad4e3cad54	etnaviv: Gate 128-bit render targets on HALF_FLOAT 128-bit render targets are emulated as paired G32R32F targets. There is no integer 64-bit PE format, so the integer formats also render through G32R32F, as the blob does. The real hardware requirement is the half-float pipe that provides G32R32F, so gate on HALF_FLOAT instead of the conservative halti5 level. This enables the formats on older GPUs that have the pipe. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:24 +00:00
Christian Gmeiner	57f5acf849	etnaviv: Support split sampler for 128-bit formats on the state path 128-bit formats (RGBA32) are emulated as two stacked G32R32 planes. The bound sampler reads the RG plane and a companion sampler reads the BA plane, which etna_nir_lower_128bit(..) reassembles in the shader. Only the descriptor path set up the companion, so the state path could not sample these formats. Set up the companion on the state path too and share companion_slot(..) between both paths. The real requirement is the plane format, not the descriptors. The float plane G32R32F samples through the half-float pipe, so gate it on HALF_FLOAT and advertise GL_OES_texture_float, also on halti2 GPUs like GC3000. The integer plane G32R32I needs halti5, so keep the integer formats there. The KHR-GLES2 internalformat tests for sized RGB32F/RGBA32F need an ES3 context, so list them as expected fails on GC3000 too. Verified on GC7000 with and without ETNA_MESA_DEBUG=no_texdesc. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:24 +00:00
Christian Gmeiner	0783eaf6d6	etnaviv: Set per-RT sRGB bit on non-zero render target slots sRGB encoding was only handled through the global PE.LOGIC_OP SRGB bit, which the hardware applies to the primary render target alone. An sRGB surface bound to any other MRT slot was written as linear. Fixes dEQP-GLES3.functional.fragment_out.random.{1,17,39,64,86,93,96}. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Daniel Lang <dalang@gmx.at> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:23 +00:00
Christian Gmeiner	2e0b5f4b96	etnaviv: Update headers from rnndb Update to rnndb commit 0fd26f92cfd7 Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Daniel Lang <dalang@gmx.at> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:23 +00:00
Christian Gmeiner	37eb09ed06	etnaviv: Disable TS per render target on mixed TS modes PE.MEM_CONFIG.COLOR_TS_MODE is a single global field, so every TS-enabled color render target in a framebuffer has to share one TS mode. With CACHE128B256BPERLINE the mode is picked per resource (256B for compressible formats, 128B otherwise), so a compressible format bound next to an integer format disagrees and the odd target gets decoded in the wrong mode, reading back as the clear color. The blob keeps TS on the targets that match the global mode and disables it only on the odd one, instead of giving up TS for the whole framebuffer. Compute a per-RT TS mask once in etna_set_framebuffer_state(..), store it in etna_framebuffer_state and reuse it when arming the BLT fast clear, so the two consumers stay consistent by construction. A disabled target keeps its tile status allocated, so it recovers once a later framebuffer is compatible again. Fixes 23 dEQP-GLES3.functional.draw_buffers_indexed.random.* cases that mix integer and unorm render targets, with no regression in fbo.color or fbo.blit. Fixes: `d70531ca93` ("etnaviv: Extend etna_update_ts_config(..) for MRTs") Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Daniel Lang <dalang@gmx.at> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:23 +00:00
Christian Gmeiner	47a2f9e420	etnaviv: Advertise 128-bit color formats as renderable and samplable The 128-bit emulation now covers the clear, blit, copy and sample paths, so stop rejecting the three emulated RGBA32 formats. The format table is the remaining filter. Sampling still relies on the halti5 texture descriptors, so halti5 is the gate. Sampling RGBA32F enables GL_OES_texture_float, and with the existing half-float support also GL_ARB_texture_float, so advertise both. The KHR-GLES2 internalformat tests for sized RGB32F/RGBA32F need an ES3 context, so they fail on the ES2 driver. List them as expected fails, as other ES2 drivers do. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:23 +00:00
Christian Gmeiner	9013b56a22	etnaviv: Limit nir_lower_fragcolor(..) to advertised render targets nir_lower_fragcolor(..) expands a broadcast gl_FragColor into one store per render target. It was passed specs->num_rts, the physical HW count, but on HALTI2 only half of them are advertised (caps.max_render_targets) since the upper half is reserved for float and 128-bit format emulation companions. A broadcast shader thus wrote into the reserved slots. For a 128-bit target the clear meta shader stores to every gl_FragData and overwrote the BA companion plane filled by etna_nir_lower_128bit(..), so the clear came back with the RG half replicated into BA. Pass the advertised count instead to keep the broadcast inside the user visible range. Fixes: `928a276b78` ("etnaviv: Limit max supported render targets") Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Daniel Lang <dalang@gmx.at> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:23 +00:00
Christian Gmeiner	d75d08437b	etnaviv: rs: Support 128-bit color clears A 128-bit color level is laid out as two stacked G32R32F planes, so clear it with two 64bpp RS fills, the RG half at the level offset and the BA half at the second-plane offset. A cache flush and stall separate the two fills. etna_clear_rs(..) needs the same flush between its color and depth clears to avoid a GC600 hang, and the blob brackets every RS operation this way. The blob clears RGBA32F render targets through RS with the same plane split, verified with a cmdstream capture on a faked GC7000 rev 6204 identity. Fixes dEQP-GLES3.functional.fbo.color.repeated_clear.* for 128-bit formats on RS-only halti5 hardware. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Daniel Lang <dalang@gmx.at> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:22 +00:00
Christian Gmeiner	3d8a718181	etnaviv: Save the framebuffer without 128-bit companion slots etna_blit_save_state(..) saved the expanded framebuffer including the appended companion slots. The util_blitter restore goes through etna_set_framebuffer_state(..), which appends companions again, so every blitter round trip with a 128-bit color buffer bound grew nr_cbufs until the expansion assert fired. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Daniel Lang <dalang@gmx.at> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:22 +00:00
Christian Gmeiner	38eb315407	etnaviv: blt: Use block-layout offset for 128-bit second-plane blit The 128-bit emulation stores all RG halves in the first half of the BO and all BA halves in the second half. The sampler descriptors, the CPU upload and the BLT clear all compute the second plane as (size * depth) / 2. etna_try_blt_blit(..) advanced source and destination by layer_stride instead, an interleaved layout nothing else uses. For single-layer 2D targets both formulas coincide, so plain blits worked, but per-layer blits of a multi-layer 128-bit array texture corrupted the BA half of every layer. Use the same (size * depth) / 2 offset as the rest of the emulation. Fixes: `1f60a0397b` ("etnaviv: blt: Support 128 bit blit operations") Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Daniel Lang <dalang@gmx.at> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/42201>	2026-06-19 10:49:22 +00:00
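
Note on 5bb025f953 (gallivm big-endian fetch path): the second bug in that commit message comes down to how the per-channel shift interacts with byte order. The following is a minimal Python sketch of that one point, not the gallivm code. `pack_channels` and `as_bytes` are hypothetical helpers, and `int.to_bytes` stands in for the bitcast to vec4-u8.

```python
def pack_channels(channels, width=8, big_endian=False):
    """OR four channel values into one 32-bit word.

    Hypothetical helper for illustration only. Channel j is shifted by
    j * width for little-endian and by (3 - j) * width for big-endian,
    as described in the commit message.
    """
    word = 0
    mask = (1 << width) - 1
    for j, value in enumerate(channels):
        shift = (3 - j) * width if big_endian else j * width
        word |= (value & mask) << shift
    return word


def as_bytes(word, big_endian):
    """Model the bitcast of a 32-bit word to four bytes in host memory order."""
    return list(word.to_bytes(4, "big" if big_endian else "little"))


rgba = [0x11, 0x22, 0x33, 0xAA]

# Little-endian host, shift = j * width: bytes come out in R, G, B, A order.
le_ok = as_bytes(pack_channels(rgba, big_endian=False), big_endian=False)

# Big-endian host still using j * width: alpha lands at byte[0].
be_bug = as_bytes(pack_channels(rgba, big_endian=False), big_endian=True)

# Big-endian host using (3 - j) * width: R, G, B, A order again.
be_fixed = as_bytes(pack_channels(rgba, big_endian=True), big_endian=True)
```

With these inputs, `be_bug` starts with `0xAA` while `le_ok` and `be_fixed` both end with it, which is the byte[0] versus byte[3] difference the commit describes.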

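
Note on a9d70225ec (gallivm lp_build_round): the commit message attributes the failures to the float32 rounding of the (a + nextafterf(0.5)) sum. The sketch below reproduces that arithmetic in Python under stated assumptions. It is not the gallivm implementation. `f32`, `bias_and_truncate` and `nearest_even` are hypothetical names, float32 is emulated through `struct`, and the signed bias is an assumption about how negative inputs are handled.

```python
import math
import struct


def f32(x):
    """Round a Python float (a double) to the nearest float32 value."""
    return struct.unpack("f", struct.pack("f", x))[0]


# Largest float32 below 0.5, i.e. the value of nextafterf(0.5f, 0.0f).
HALF_MINUS_ULP = 0.5 - 2.0 ** -25


def bias_and_truncate(a):
    """Add a signed bias, round the sum to float32, truncate toward zero."""
    a = f32(a)
    # The double sum is exact for the small magnitudes used here, so the
    # only rounding step is the conversion of the sum to float32.
    biased = f32(a + math.copysign(HALF_MINUS_ULP, a))
    return math.trunc(biased)


def nearest_even(a):
    """Round to nearest, ties to even (Python's built-in round does this)."""
    return round(f32(a))


# 0.5 + 0.49999997 is a float32 tie that rounds up to 1.0 before truncation.
print(bias_and_truncate(0.5), nearest_even(0.5))  # 1 0
print(bias_and_truncate(2.5), nearest_even(2.5))  # 3 2
print(bias_and_truncate(1.5), nearest_even(1.5))  # 2 2
```

The two functions agree away from exact halves and on ties whose nearest even integer is the larger one. They differ on ties such as 0.5 and 2.5, where the bias path rounds away from zero.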

224646 commits