fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-03 03:28:09 +02:00

Author	SHA1	Message	Date
Francisco Jerez	3e0544ea2d	intel/brw/gfx12.5+: Fix IR of sub-dword atomic LSC operations. We were currently emitting logical atomic instructions with a packed destination region for sub-dword LSC atomics, along the lines of: > untyped_atomic_logical(32) dst<1>:HF, ... However, these instructions use an LSC data size D16U32, which means that the 16b data on the return payload is expanded to 32b by the LSC shared function, so we were lying to the compiler about the location of the individual channels on the return payload, its execution masking, etc. This is why the hacks that manually set the 'inst->size_written' of the instruction were required. In some cases this worked, but any non-trivial manipulation of the instruction destination by lowering or optimization passes could have led to corruption, as has been reproduced in deqp-vk during lower_simd_width() for shaders that use 16-bit atomics in SIMD32 dispatch mode. Note that LSC sub-dword reads aren't affected by this because they use raw UD destinations and specify the actual bit size of the operation datatype as the immediate SURFACE_LOGICAL_SRC_IMM_ARG, which doesn't work for atomic operations since that immediate specifies the atomic opcode. Instead, have the logical operation implement the behavior of 16-bit destinations correctly instead of silently replacing the 16-bit region with an inconsistent 32-bit region -- This is done by emitting the MOV instructions used to pack the data from the UD temporary into the packed destination from the lower_logical_sends() pass instead of from the NIR translation pass. Fixes: `43169dbbe5` ("intel/compiler: Support 16 bit float ops") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30683> (cherry picked from commit `71ca8529c5`)	2024-08-21 19:06:46 +02:00
Nanley Chery	02be7e928d	iris: Invalidate state cache for some depth fast clears We need to invalidate the state cache when updating the value in the indirect clear color so that existing surface states can pick up the new value. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30520> (cherry picked from commit `55dbc58bf4`)	2024-08-21 19:06:45 +02:00
Mike Blumenkrantz	b13248632d	st/pbo: reject vs/fs pbo ops if rowstride < width this pbo shader works by iterating over the framebuffer size and storing a value to an offset for each source pixel. if the number of pixels being written out does not correspond to fragcoord to the extent that certain source pixels are not written at all, however, then this method should not be used in order to avoid giving broken results cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30689> (cherry picked from commit `c2dcecffc5`)	2024-08-21 19:06:44 +02:00
Mike Blumenkrantz	9788cf4271	zink: bail on choose_pdev immediately if no devices are available cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30746> (cherry picked from commit `a442f67d2f`)	2024-08-21 11:07:58 +02:00
Kenneth Graunke	feb9b69220	intel/brw: Fix Xe2+ SWSB encoding/decoding for DPAS instructions SBID SET can only be used on SEND, SENDC, or DPAS instructions. The existing code was handling SET for SEND/SENDC, but was using the wrong encoding for DPAS. Add a new case to handle that and make it clear that the existing code is only for SEND/SENDC. While here, rewrite the encoder to use 2-bit binary immediates shifted up into the mode [9:8] field, rather than pre-shifted hex values. This matches the documentation better and is a little easier to follow. On the decode side, we were incorrectly decoding MATH instructions. Because they're marked is_unordered, we were hitting the SEND/SENDC decoding, which is incorrect for MATH. Fixes 22 cooperative matrix tests on Lunar Lake. Huge thanks to Paulo Zanoni for bisecting failures to one of my commits, then analyzing shaders and experimenting to discover that the failure was really an unrelated bug, just being provoked by different choices of registers. His work narrowing the problem down made it much easier to discover and fix this bug. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30705> (cherry picked from commit `d22d6d814d`)	2024-08-21 11:07:56 +02:00
Kenneth Graunke	9bf42eeb30	intel/brw: Pass opcode to brw_swsb_encode/decode We're going to need to handle encoding/decoding differently for DPAS vs. SEND/SENDC vs. other instructions. Pass the opcode so we can figure out the encodings for each type of instruction. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30705> (cherry picked from commit `89f9a6e10b`)	2024-08-21 11:07:55 +02:00
Rohan Garg	c43c32500a	anv: migrate indirect mesh draws to indirect draws on ARL+ Backport-to: 24.2 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30690> (cherry picked from commit `1f06e70bdc`)	2024-08-21 11:07:02 +02:00
Rohan Garg	2afcbcce6d	anv: dispatch indirect draws with a count buffer through the XI hardware on ARL+ ARL+ can dispatch indirect draws through the hardware. Backport-to: 24.2 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30690> (cherry picked from commit `f69c74b6d5`)	2024-08-21 11:07:02 +02:00
Rohan Garg	75deeb5f49	anv: refactor indirect draw support into it's own function ARL+ supports some form of indirect draws, instead of trying to mash support for indirect draws across various generations, let's make things cleaner by factoring out XI support into it's own function. Backport-to: 24.2 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30690> (cherry picked from commit `74cd70841d`)	2024-08-21 11:07:02 +02:00
Rohan Garg	906fd7ff48	anv,iris: prefix the argument format with XI for a upcoming refactor Backport-to: 24.2 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30690> (cherry picked from commit `c1af71c9c2`)	2024-08-21 11:07:02 +02:00
Rohan Garg	dd89329cc6	anv: program a custom byte stride on Xe2 for indirect draws Xe2 allows us to program in a custom byte stride for indirect draws Backport-to: 24.2 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30690> (cherry picked from commit `dc23db2a0d`)	2024-08-21 11:07:02 +02:00
Tapani Pälli	55670b0676	gbm: depend on libdrm indepedent of dri2 setting Suggested-by: @stefan11111 Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10585 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30716> (cherry picked from commit `35a6824e88`)	2024-08-21 11:07:01 +02:00
Jianxun Zhang	b0c2e2453d	Revert "iris: Disable PAT-based compression on depth surfaces (xe2)" This reverts commit `b6f9702cf1`. With the progress on Xe2 platforms, we are not seeing many issues caused by compression on depth buffers. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11361 Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30653> (cherry picked from commit `72925f59e6`)	2024-08-21 11:07:01 +02:00
Jianxun Zhang	563ec75366	Revert "anv: Disable PAT-based compression on depth images (xe2)" This reverts commit `6073f091bb`. With the progress on Xe2 platforms, we are not seeing many issues caused by compression on depth buffers. Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30653> (cherry picked from commit `8c623b6a7e`)	2024-08-21 11:07:01 +02:00
Samuel Pitoiset	04ba8d0f92	aco: fix bogus assert in RT prolog on GFX11+ in_scratch_offset isn't defined on GFX11+ and only useful on < GFX9. Fixes: `bd525f4282` ("aco: Fix 1D->2D dispatch conversion on <gfx9") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30717> (cherry picked from commit `aad503ecfa`)	2024-08-20 18:42:17 +02:00
José Roberto de Souza	53d99c4847	iris/gfx20: Enable depth buffer write through for multi sampled images BSpec: 56419 Backport-to: 24.2 Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29615> (cherry picked from commit `48e46c71c0`)	2024-08-20 18:42:16 +02:00
Nanley Chery	61eb11e779	iris: Add and use want_hiz_wt_for_res Backport-to: 24.2 Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29615> (cherry picked from commit `b78273c66c`)	2024-08-20 18:42:15 +02:00
José Roberto de Souza	03ecbdcbfd	anv/gfx20: Enable depth buffer write through for multi sampled images BSpec: 56419 Backport-to: 24.2 Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29615> (cherry picked from commit `12656571fd`)	2024-08-20 18:42:14 +02:00
Nanley Chery	1921b99961	anv: Add want_hiz_wt_for_image() Backport-to: 24.2 Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29615> (cherry picked from commit `ebe3eabda6`)	2024-08-20 18:42:13 +02:00
José Roberto de Souza	98c3209098	intel/isl/gfx20: Alow hierarchial depth buffer write through for multi sampled surfaces BSpec: 56419 Backport-to: 24.2 Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29615> (cherry picked from commit `2553878fba`)	2024-08-20 18:42:12 +02:00
Mike Blumenkrantz	aa3304fedf	glx/dri2: strdup driver name this is freed by the caller cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30619> (cherry picked from commit `046728f47a`)	2024-08-20 18:34:45 +02:00
Lionel Landwerlin	9817d44c27	anv: only set 3DSTATE_CLIP::MaximumVPIndex once Currently we can end up merging 2 prepacked 3DSTATE_CLIP instructions where 2 different places in the driver fill the MaximumVPIndex. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30684> (cherry picked from commit `982106e676`)	2024-08-20 17:58:07 +02:00
Lionel Landwerlin	98f760e17b	anv: fix extended buffer flags usages Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `bcc0ec8e6c` ("anv: enable KHR_maintenance5") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30714> (cherry picked from commit `9eff285a46`)	2024-08-20 17:57:16 +02:00
Lionel Landwerlin	7d562392f0	vulkan/runtime: fix GetBufferMemoryRequirements2 for maintenance4 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2649ee0724` ("vulkan/runtime: implement vkGetBufferMemoryRequirements2()") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30714> (cherry picked from commit `eacb8f85a2`)	2024-08-20 17:52:20 +02:00
GKraats	2d8ccf3134	i915g: fix count of buffers at i915_drm_batchbuffer_validate_buffers This commit contains the fix with num_of_buffers at validation-call at i915_drm_batchbuffer_validate_buffers. Cc: mesa-stable Signed-off-by: GKraats <vd.kraats@hccnet.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26769> (cherry picked from commit `0311159bed`)	2024-08-20 17:52:19 +02:00
GKraats	dc084c6716	i915g: Screen corruption with ENOBUFS caused by fence register shortage This commit solves the shortage-problem at the blit-functions by checking the number of fence-registers after updating the batch. If too many registers are used, the batch-entries and relocs for the current blit function are removed by setting batch->ptr and reloc_count to value before the blit call and calling drm_intel_gem_bo_clear_relocs. This truncated batch is flushed, and the batch is updated again for the current blit function. Cc: mesa-stable Signed-off-by: GKraats <vd.kraats@hccnet.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26769> (cherry picked from commit `ed2123158d`)	2024-08-20 17:52:18 +02:00
Faith Ekstrand	b7315f736a	nouveau/mme: Fix add64 of immediates on Fermi Fixes: `162269f049` ("nouveau/mme: Add Fermi builder") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30703> (cherry picked from commit `381be88473`)	2024-08-20 17:45:04 +02:00
Friedrich Vock	31d951fb36	aco: Fix 1D->2D dispatch conversion on <gfx9 out_args->scratch_offset and in_wg_id_x will alias on <gfx9. To avoid the conversion code reading a garbage WG ID, move the scratch/ring offset writing to the very end. Fixes: `1e354172` ("radv,aco: Convert 1D ray launches to 2D") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30707> (cherry picked from commit `bd525f4282`)	2024-08-20 17:45:03 +02:00
Rob Clark	6cac51d074	nir/opt_loop: Don't peel initial break if loop ends in break A loop that looks like: loop { do_work_1(); if (cond) { break; } else { } do_work_2(); break; } We can't pull that break ahead of do_work_1() after hoisting the initial do_work_1() out of the loop. So bail in this case. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11711 Fixes: `6b4b044739` ("nir/opt_loop: add loop peeling optimization") Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30702> (cherry picked from commit `563ec4754a`)	2024-08-20 17:45:01 +02:00
Guilherme Gallo	27c36ffddc	ci/a618: Fix zink-tu-a618-full rules We should use `.zink-turnip-collabora-manual-rules` instead of `.collabora-turnip-manual-rules`, since the former correctly reacts to the zink+turnip file changes. Fixes: `69eac6dd15` ("ci/a618: Add zink-tu-a618-full") Reported-by: Valentine Burley <valentine.burley@gmail.com> Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30698> (cherry picked from commit `8aa52ac666`)	2024-08-20 17:45:00 +02:00
Sagar Ghuge	db8d043296	intel/compiler: Fix indirect offset in GS input read for Xe2+ Make sure to take new GRF size into consideration and adjust the indirect offset according to new size so that when we do the indirect load with address register, we load right values. This helps pass the following tests: - dEQP-VK.binding_model.descriptor_buffer.mutable_descriptor.geom - dEQP-VK.ray_query.geometry_shader. Backport-to: 24.2 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30679> (cherry picked from commit `c4f2a8d984`)	2024-08-20 17:44:55 +02:00
Boris Brezillon	0d8a86f519	panvk: Adjust RGB component order for fixed-function blending Basically what `004e0eb3ab` ("panfrost: use RGB1 component ordering for R5G6B5 pixel formats") was doing in the gallium driver, but applied to panvk this time. Fixes: `004e0eb3ab` ("panfrost: use RGB1 component ordering for R5G6B5 pixel formats") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30685> (cherry picked from commit `9241af23e5`)	2024-08-16 17:53:35 +02:00
Mary Guillemard	c799cfe577	panvk: Fix NULL deref on model name when device isn't supported Instead of reproting an VK_ERROR_INCOMPATIBLE_DRIVER we were crashing as device->model was init after this error check. Tested on G57 but should work the same on all unsupported arch. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Fixes: `f7f9b3d170` ("panvk: Move to vk_properties") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30686> (cherry picked from commit `c95ef9e323`)	2024-08-16 17:52:28 +02:00
Faith Ekstrand	00cbc96dfd	nvk: Enable shader bounds checking when nullDescriptor is enabled Fixes: `c9eac89da8` ("nvk: Advertise VK_EXT_robustness2") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30663> (cherry picked from commit `85a70bbc05`)	2024-08-16 17:32:42 +02:00
Faith Ekstrand	ab14d080c7	nvk: Plumb the whole vk_pipeline_robustness_state through to nvk_ubo/ssbo_addr_format Fixes: `c9eac89da8` ("nvk: Advertise VK_EXT_robustness2") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30663> (cherry picked from commit `8445190663`)	2024-08-16 17:32:40 +02:00
Faith Ekstrand	fe366a0642	vulkan: Add null descriptor bits to vk_pipeline_robustness_state Fixes: `c9eac89da8` ("nvk: Advertise VK_EXT_robustness2") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30663> (cherry picked from commit `6ae401aa86`)	2024-08-16 17:32:39 +02:00
Matt Turner	a9ff59a485	nir: Skip opt_if_merge when next_if has block ending in a jump Similar to commit `6cef804067` ("nir/opt_if: fix opt_if_merge when destination branch has a jump"), we shouldn't combine if statements when the second if-then-else has a block that ends in a jump. This fixes a case where opt_if_merge combines if (cond) { [then-block-1] } else { [else-block-1] } if (cond) { [then-block-2] } else { [else-block-2] } where `then-block-2` or `else-block-2` ends in a jump. The phi nodes following the control flow will be incorrectly updated to have an input from a block that is not a predecessor. Fixes: `4d3f6cb973` ("nir: merge some basic consecutive ifs") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30629> (cherry picked from commit `d2e6be94ae`)	2024-08-16 17:32:38 +02:00
Connor Abbott	2d18c5b395	Revert "tu/a750: Disable HW binning when there is GS" This reverts commit `7eb6123e98`. The root cause was actually the bug fixed by the previous commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30675> (cherry picked from commit `c59be8516b`)	2024-08-16 17:32:37 +02:00
Connor Abbott	17321da8ca	ir3, tu: Use a UBO for VS primitive params on a750+ Before we were using direct CP_LOAD_STATE, which is broken with multiple back-to-back draws. This caused regressions in some DX11 traces when enabling early preamble. We still need to use indirect CP_LOAD_STATE for VS params, which are sometimes written by the CP, however for everything else we should use the new UBO path instead. Fixes: `76e417ca59` ("turnip,ir3/a750: Implement consts loading via preamble") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30675> (cherry picked from commit `850f2aab03`)	2024-08-16 17:32:17 +02:00
Connor Abbott	03350de955	tu: Fix off-by-one in UBO CP_LOAD_STATE size It's one header dword and 5 payload dwords. This was papered over by us not actually using the UBO path for one of the loads, but that's changed in the next commit. Fixes: `76e417ca59` ("turnip,ir3/a750: Implement consts loading via preamble") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30675> (cherry picked from commit `4f2b5442a6`)	2024-08-16 17:32:16 +02:00
Job Noorman	489676b3e0	ir3/legalize: handle scalar ALU WAR hazards for a0.x It turns out that mova executes on the normal pipeline, which means that users of a0.x on the scalar pipeline might cause a WAR hazard with mova. Fixes: `876c5396a7` ("ir3: Add support for "scalar ALU"") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341> (cherry picked from commit `72bb4d79dc`)	2024-08-16 17:32:12 +02:00
Job Noorman	5987d91886	ir3: fix wrong dstn used in postsched Fixes: `750e6843c0` ("ir3: Rewrite postsched dependency handling") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341> (cherry picked from commit `ddb0f5f4e6`)	2024-08-16 17:32:11 +02:00
Job Noorman	15ba4d64b5	ir3: fix clearing merge sets after shared RA After spilling during regular RA, merge sets need to be fixed up. To find all merge sets, fixup_merge_sets used ra_foreach_dst. However, after shared RA has run, shared dsts wouldn't have the IR3_REG_SSA flag set anymore leaving their merge sets lingering. This patch fixes this by using foreach_dst instead. Fixes: `fa22b0901a` ("ir3/ra: Add specialized shared register RA/spilling") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341> (cherry picked from commit `28d2a27030`)	2024-08-16 17:31:22 +02:00
Job Noorman	66dc3912bd	ir3: update merge set affinity in shared RA The preferred register for merge sets was not updated after allocating one. This caused a new merge set to be allocated for every register it contains. This patch fixes this by reusing the update function from the standard RA. Fixes: `fa22b0901a` ("ir3/ra: Add specialized shared register RA/spilling") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28341> (cherry picked from commit `9013e11d8c`)	2024-08-16 17:31:22 +02:00
Pavel Ondračka	d93b5b499c	r300: fix RGB10_A2 CONSTANT_COLOR blending Just reverse the color order the same way we do for RGBA8. Fixes: `910bac63df` Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30656> (cherry picked from commit `a1a06f386e`)	2024-08-16 17:31:21 +02:00
David Rosca	93f637bdbb	radeonsi: Don't allow DCC for encode in is_video_target_buffer_supported This accidentally allowed DCC with format conversion, which is not supported. Also disable EFC with VCN5 for now. Fixes: `40c3a53fec` ("radeonsi: Implement is_video_target_buffer_supported") Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30562> (cherry picked from commit `4b60918138`)	2024-08-16 17:31:21 +02:00
David Rosca	587a134260	frontends/va: Fix use after free with EFC This happens when the source surface is destroyed before being used in encoding operation. It also needs to disable EFC in this case. Fixes: `a7469a9ffd` ("frontends/va: Rework EFC logic") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11653 Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30562> (cherry picked from commit `79ce0e3b2f`)	2024-08-16 17:31:20 +02:00
Karol Herbst	042c23cba1	rusticl/mem: do not check against image base alignment for 1Dbuffer images The CL cap in question is only valid for 2D images created from buffer. Fixes: `20c90fed5a` ("rusticl: added") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30655> (cherry picked from commit `5d0c870c00`)	2024-08-16 17:31:20 +02:00
Sagar Ghuge	66aa172910	intel/compiler: Adjust trace ray control field on Xe2 Bspec 64643: Structure_TraceRayPayload::Trace Ray Control Bit field moved from 9-8 to 10-8 on Xe2. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30600> (cherry picked from commit `83c2524124`)	2024-08-16 17:27:13 +02:00
Sagar Ghuge	79dd1f226f	intel/compiler: Ray query requires write-back register Bspec 57508: Structure_SIMD16TraceRayMessage:: RayQuery Enable "When this bit is set in the header, Trace Ray Message behaves like a Ray Query. This message requires a write-back message indicating RayQuery for all valid Rays (SIMD lanes) have completed." If we don't pass the write-back register, somehow it was stepping on over R0 register and can mess up the scratch space accesses which could potentially lead to GPU hang. It can be noticed while running it under simulator trace. send.rta (16\|M0) null r124 r126:1 0x0 0x02000100 {$15} // wr:1+1, rd:0; simd16 trace ray R0 = 00000001 00000000 00000000 00000001 00000000 00000000 00000001 00000000 00000000 00000001 00000000 00000000 00000001 00000000 00000000 00000001 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30600> (cherry picked from commit `c3c62e493f`)	2024-08-16 17:27:08 +02:00

1 2 3 4 5 ...

178356 commits