fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 13:38:06 +02:00

Author	SHA1	Message	Date
Marcin Ślusarz	ea4ecc3e72	intel/compiler/mesh: handle const data in task & mesh programs Started showing up when nir_opt_large_constants call was moved in `88756cee8d`. Fixes dEQP-VK.mesh_shader.ext.smoke.monolithic.fullscreen_gradient* Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `88756cee8d` ("intel/compiler: Run nir_opt_large_constants before scalarizing consts") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20876> (cherry picked from commit `536a2acfc2`)	2023-01-24 15:27:06 -08:00
Samuel Pitoiset	598e985d65	radv/winsys: fix incorrect PCIID for GFX11 in the null winsys Fixes: `bbad550f3d` ("radv/winsys: fill real info for CHIP_GFX1100") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20850> (cherry picked from commit `bf3c14b8a5`)	2023-01-24 15:27:06 -08:00
Lionel Landwerlin	0a9dc20094	intel/fs: avoid cmod optimization on instruction with different write_mask I've been running into failures with tests like : dEQP-VK.robustness.robustness2.bind.notemplate.rgba32i.unroll.nonvolatile.uniform_buffer_dynamic.no_fmt_qual.len_4.samples_1.1d.frag With the load_global_const_block_intel NIR intrinsic, you can load a vec8/vec16 with a predicate. The predicate is correctly uniformized to feed into the SEND instruction's flag register. The problem is that a series of optimizations first removes the find_live_channel and then changes the broadcast into a simple MOV instruction, on the assumption that the first channel is always active if there is no control flow. This is correct. But after that the cmod optimization will remove this instruction : mov.nz.f0.0(16) null:D, vgrf16+0.0<0>:D NoMask because it seems to be equivalent to : cmp.g.f0.0(16) vgrf16:D, vgrf12:D, 63d In this case vgrf16 is the predicate to the load block SEND instruction. Since the execution mask is different between both, some of the channels of the SEND instruction end up not being loaded or loaded with the wrong predication and we end up with incorrect UBO data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20852> (cherry picked from commit `a50d2fdb46`)	2023-01-24 15:27:06 -08:00
Dylan Baker	488c900b08	.pick_status.json: Update to `12a471afac`	2023-01-24 15:27:06 -08:00
Julia Tatz	eb112a38f6	zink: correct sparse bo mem_type_idx placement VK_MEMORY_PROPERTY_DEVICE_LOCAL_BIT = 0x01 has incidentally been the correct memory type index, but isn't guaranteed to be, which is why it hasn't caused issues yet Fixes: `f9515d93` ("zink: allocate/place memory using memoryTypeIndex directly") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20264> (cherry picked from commit `c71287e70c`)	2023-01-24 15:27:06 -08:00
Julia Tatz	6f59991d29	zink: trivial renames heap_idx -> memoryTypeIndex Trivial renames to correctly identify that vulkan memory type indices aren't the same as zink heaps Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20264> (cherry picked from commit `e20e8f2243`)	2023-01-24 15:27:06 -08:00
Julia Tatz	b435372078	zink: zink_heap isn't 1-to-1 with memoryTypeIndex Clarify the relationship between zink heaps and vulkan memory type indices, and resolve the issues from mixing the two up. Closes: #7588, #7813 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20264> (cherry picked from commit `f6d3a5755f`)	2023-01-24 15:27:06 -08:00
Lionel Landwerlin	becfa703df	anv: fix preemption enable emission in gpu_memcpy This has to be before the MI_BATCH_BUFFER_END otherwise it has no effect. This also was messing around with your batch length alignment. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b9aa66d5d0` ("anv: disable preemption for 3DPRIMITIVE during streamout") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20802> (cherry picked from commit `6f02f9d108`) Conflicts: src/intel/vulkan/genX_gpu_memcpy.c	2023-01-24 15:27:06 -08:00
Erik Faye-Lund	a3c2f78c3d	radeonsi: respect smoothing_enabled When this was last changed, the smoothing_enabled flag seems to have been forgotten about, breaking line-smoothing (and probably also polygon smoothing). Fixes: `4147add280` ("radeonsi: update db_eqaa even if msaa is disabled") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20810> (cherry picked from commit `9f4f131f2e`) Conflicts: src/amd/ci/radeonsi-raven-fails.txt src/amd/ci/radeonsi-stoney-fails.txt	2023-01-24 15:27:06 -08:00
Georg Lehmann	01ff451d34	Revert "aco: Combine v_cvt_u32_f32 with insert to v_cvt_pk_u8_f32." This reverts commit `6d02054047`. v_cvt_pk_u8_f32 returns 0xff instead of v_cvt_u32_f32 & 0xff if the input is larger than 255. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8128 Cc: mesa-stable Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20829> (cherry picked from commit `e527f686ca`)	2023-01-24 15:27:05 -08:00
Mike Blumenkrantz	14ecd96183	zink: use actual swapchain object for surface comparison the outer swapchain object is persistent, which means checking it will never yield an update after the first check fixes #8122 Fixes: `b2739c9f00` ("zink: set surface->dt when updating swapchain") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20814> (cherry picked from commit `474ed4b877`)	2023-01-24 15:27:05 -08:00
Jonathan Gray	6bd3bd5ae1	egl/dri2: avoid undefined unlocks unlocks were incorrectly added to paths using dri2_egl_display() as well as those using dri2_egl_display_lock() pthread_mutex_unlock() when unlocked is documented by posix as being undefined behaviour. On OpenBSD pthread_mutex_unlock() will call abort(3) if this happens. Fixes: `f1efe037df` ("egl/dri2: Add display lock") Reviewed-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20712> (cherry picked from commit `0594b3c143`)	2023-01-24 15:27:05 -08:00
Marek Olšák	1984d994fb	glthread: handle GL_*_ARRAY in glEnable/Disable Surprisingly, the GL compatibility profile allows these in both glEnableClientState and glEnable. Fixes: `0b1dd18591` - glthread: track which vertex array attribs are enabled Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20824> (cherry picked from commit `777166cc66`)	2023-01-24 15:27:05 -08:00
Marek Olšák	9cd9c7e863	mesa: allow GL_UNSIGNED_INT64_ARB as vertex format for ARB_bindless_texture This wasn't implemented, but the spec requires it. Fixes: `1fe7b1f972` - mesa: implement ARB_bindless_texture Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20824> (cherry picked from commit `721526227c`)	2023-01-24 15:27:05 -08:00
Marek Olšák	7289901ce7	util: fix util_is_vbo_upload_ratio_too_large It was wrong. For example, if the draw vertex count was 10 and the upload vertex count was 150, u_vbuf wouldn't unroll the draw and would instead memcpy 150 vertices. This fixes that case. Fixes: `068a3bf0d7` - util: move and adjust the vertex upload heuristic equation from u_vbuf Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20824> (cherry picked from commit `4f6e785876`)	2023-01-24 15:27:05 -08:00
Marek Olšák	cb9b189166	glthread: fix an upload buffer leak Fixes: `befbd54864` - glthread: don't use atomics for refcounting to decrease overhead on AMD Zen Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20804> (cherry picked from commit `4d4995b32b`)	2023-01-24 15:27:05 -08:00
Mike Blumenkrantz	f0d643eb27	zink: don't use ds3 blend states without color attachments this is illegal and causes validation errors cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20799> (cherry picked from commit `5d44318566`)	2023-01-24 15:27:05 -08:00
Mike Blumenkrantz	0255c80d78	zink: delete need_blend_constants this is an artifact of very old code before the dynamic state was set for all graphics pipelines now the checks only cause blend constants to not be updated, which triggers bugs and validation failures cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20799> (cherry picked from commit `b4d18f2ad1`)	2023-01-24 15:27:05 -08:00
Mike Blumenkrantz	73091e4d27	zink: preserve present resources during async presentation ensure that these have a lifetime great enough to be presented fixes #7781 cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20793> (cherry picked from commit `020db79340`)	2023-01-24 15:27:05 -08:00
Dylan Baker	440050346e	.pick_status.json: Update to `5039acfd9d`	2023-01-24 15:27:05 -08:00
Rose Hudson	c1d8827f5a	radeonsi: report 0 block size for Polaris HEVC encoding makes encoded videos resemble the input again :) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7992 Fixes: `c4482a3c1a` ("radeonsi/vcn: enable multi-slice encoding") Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20714> (cherry picked from commit `e8a60633da`)	2023-01-24 15:27:05 -08:00
Tapani Pälli	c34510badf	iris: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable Field must be disabled if any render targets have integer format, additionally for Gfx12+ field must be disabled when num multisamples > 1 or forced multisample count > 1. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7892 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20671> (cherry picked from commit `247c06d419`)	2023-01-24 15:27:05 -08:00
Tapani Pälli	53e77d072b	hasvk: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable Field must be disabled if any render targets have integer format. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20671> (cherry picked from commit `58dd9d5134`)	2023-01-24 15:27:05 -08:00
Tapani Pälli	97cdeddcf3	anv: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable Field must be disabled if any render targets have integer format, additionally for Gfx12+ field must be disabled when num multisamples > 1 or forced multisample count > 1. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20671> (cherry picked from commit `9b37ef40f8`)	2023-01-24 15:27:05 -08:00
Danylo Piliaiev	e6bc0076e1	tu/kgsl: do not use kgsl_command_object::offset offset field in kgsl_command_object is NOT used by KGSL, so we should offset directly to iova. Fixes weird hangs on KGSL. E.g. fixes the hang in: dEQP-VK.memory.pipeline_barrier.transfer_dst_storage_texel_buffer.1024 cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20795> (cherry picked from commit `926f626b95`)	2023-01-24 15:27:05 -08:00
Tatsuyuki Ishi	03980525cd	radv: Fix depth-only-with-discard when epilogs are used. For a depth-only-with-discard pipeline, spi_shader_col_format needs to be fixed up to a single channel export, or otherwise discard will not work. Since col_format can change depending on the dynamic state, precompute the need for this workaround on pipeline creation and apply it when emitting prolog states. Fixes: `eb07a11b8f` ("radv: add support for compiling PS epilogs on-demand") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20704> (cherry picked from commit `1617dac6c3`)	2023-01-24 15:27:04 -08:00
Dylan Baker	89e4ad1b83	.pick_status.json: Update to `e8a60633da`	2023-01-24 15:27:04 -08:00
Paulo Zanoni	a1e3b3964c	anv: check the return value of anv_execbuf_add_bo_bitset() Because anv_execbuf_add_bo_bitset() calls anv_execbuf_add_bo(), which can fail if its memory allocations fail. I have seen dEQP tests exercising memory allocation failures during anv_execbuf_add_bo(), but I don't think the path coming from add_bo_bitset() was specifically exercised. Anyway, add the error check just in case. v2: Rebase. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703> stable: reapplied to src/intel/vulkan/anv_batch_chain.c, as it has not been moved to src/intel/vulkan/i915/ in 23.0	2023-01-24 15:27:04 -08:00
Paulo Zanoni	2d11fda237	anv: don't leave undefined values in exec->syncobj_values In anv_execbuf_add_syncobj(), we try to not create or use exec->syncobj_values if we don't need to. But when we figure we're going to need it (i.e., when timeline_value is not zero), then we create exec->syncobj_values with vk_zalloc, which means every previous value is set to zero, as it should be. This is all correct. The problem starts when we add a 16th element. In this case we double exec->syncobj_array_length and realloc the buffer by using vk_alloc and copying the old array to the new one. After that, we write the timeline_value to the array only if it's not zero, and that's the problem: since we just used vk_alloc and memcpy, we don't have any guarantees that the new array will be zero after the 16th element, and if timeline_value is zero we write nothing to that position. Once we start using exec->syncobj_values we have to commit to using it, so the "if (timeline_value)" check near the end of the function has to be changed to "if (exec->syncobj_values)", so we actually set elements after the 16th to zero when they need to be zero. Another approach to fix this would be to memset the new elements once we double syncobj_array_length. In practice, I couldn't find any application or deqp test that used more than 3 elements in exec->syncobj_array_length, and we need more than 16 elements in order to be able to reproduce the bug, so I'm not aware of any real-world bug that goes away with this patch. This issue was found while reading code. If we craft a little Vulkan program that submits a ton of timeline and binary semaphores on vkQueueSubmit, then waits for them, we get the following error without this patch: MESA: error: ../../src/intel/vulkan/anv_batch_chain.c:1910: execbuf2 failed: Invalid argument (VK_ERROR_DEVICE_LOST) v2: Rebase.
Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703> (cherry picked from commit `ad6a036a68`) Conflicts: src/intel/vulkan/i915/anv_batch_chain.c Stable: reapplied to src/intel/vulkan/anv_batch_chain.c, as this hasn't been moved in the staging branch.	2023-01-24 15:27:04 -08:00
Mike Blumenkrantz	34055e7822	zink: handle modifier nplanes queries correctly for planar formats this just returns the number of planes in the base format as a default, which matches the behavior of other drivers cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20753> (cherry picked from commit `6ff334e54a`)	2023-01-24 15:27:04 -08:00
Mike Blumenkrantz	d965740df1	zink: store drm format as internal_format for imported resources internal_format is the "real" format of a resource, and the "real" format of imported resources is the external-facing format, not the pipe format this ensures the correct format is available for internal ops, such as nplanes queries Fixes: `2e2775c11b` ("zink: fix PIPE_RESOURCE_PARAM_NPLANES with format modifier") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20753> (cherry picked from commit `072e29a22e`)	2023-01-24 15:27:04 -08:00
Samuel Pitoiset	6ad6660013	radv: fix creating BC image views when the base layer is > 0 When the base array layer of the image view is > 0, addrlib computes the offset (in HwlComputeSubResourceOffsetForSwizzlePattern) which is then added to the base VA in RADV. But if the driver doesn't reset the base array layer, the hw will compute incorrect addressing (ie. base array will be added twice). This also matches AMDVLK. This fixes a VM fault followed by a GPU hang on RDNA2 when trying to join a multiplayer game with medium settings in Halo Infinite. Fixes: `98ba1e0d81` ("radv: Fix mipmap views on GFX10+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20761> (cherry picked from commit `8d191b2cfb`)	2023-01-24 15:27:04 -08:00
Samuel Pitoiset	9590bf141f	radv: fix buffer to image copies with BC views on the graphics queue The color surface descriptor needs to be adjusted, otherwise addressing is wrong. Fixes tests performed on the graphics queue from dEQP-VK.api.copy_and_blit..image_to_buffer.2d_images.mip_copies_. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7900 Fixes: `98ba1e0d81` ("radv: Fix mipmap views on GFX10+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20761> (cherry picked from commit `18aaa373b7`)	2023-01-24 15:27:04 -08:00
Samuel Pitoiset	638dae71b0	radv: fix setting MAX_MIP for BC views MAX_MIP should always be the number of levels minus one from the hw perspective. This doesn't fix anything known. Fixes: `98ba1e0d81` ("radv: Fix mipmap views on GFX10+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20761> (cherry picked from commit `aff5fe3f94`)	2023-01-24 15:27:04 -08:00
Pierre-Eric Pelloux-Prayer	22f2a32ed6	glthread: fix glArrayElement handling This must be marshalled synchronously or the attrib pointers' content might change by the time we use them. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8068 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20748> (cherry picked from commit `ddc721e15c`)	2023-01-24 15:27:04 -08:00
Pierre-Eric Pelloux-Prayer	dc13019b78	vbo: lower VBO_SAVE_BUFFER_SIZE to avoid large VRAM usage The ideal case for performance is to have a single buffer for all display lists. The caveat is that large buffers are less likely to be freed because they're refcounted: it only takes 1 user (display list) to keep it in VRAM. This lowers VRAM usage when replaying the trace attached to !6140 from 5.5 GB to about 1.8 GB. Viewperf snx performance isn't affected. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6140 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20748> (cherry picked from commit `0f5c8c3dc3`)	2023-01-24 15:27:04 -08:00
Pierre-Eric Pelloux-Prayer	22d6d67a46	vbo: remove bogus assert grow_vertex_storage may call wrap_filled_vertex, which will trigger the assert incorrectly because the new size will be smaller than 'new_size' but it's correct because 'vertex_store->used' has been reset to 0. Fixes: `a08baaff97` ("vbo/dlist: fix indentation in vbo_save_api.c") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20748> (cherry picked from commit `491f6b138e`)	2023-01-24 15:27:04 -08:00
Lionel Landwerlin	c90f932257	nir/lower_io: fix bounds checking for 64bit_bounded_global If the offset is negative, as is the case in dEQP-VK.robustness.robustness2.bind.notemplate.r32i.unroll.volatile.storage_buffer_dynamic.readwrite.no_fmt_qual.len_256.samples_1.1d.comp, we end up passing the bounds checking condition because it's using signed integers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Jason Ekstrand <jason.ekstrand@collabora.com> Cc: mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20762> (cherry picked from commit `ff34e96701`)	2023-01-24 15:27:04 -08:00
Kenneth Graunke	e6313e8be3	intel/compiler: Drop redundant 32-bit expansion for shared float atomics We already expanded data to 32-bit a few lines earlier, so this is just redundantly doing it a second time. Fixes: `43169dbbe5` ("intel/compiler: Support 16 bit float ops") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20604> (cherry picked from commit `f7b29d7924`)	2023-01-24 15:27:04 -08:00
Lionel Landwerlin	5c92a8394b	anv: fix generated indirect draw shader stats checks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Tested-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20776> (cherry picked from commit `5ff3d4a8a2`)	2023-01-24 15:27:04 -08:00
Francisco Jerez	4e341d22c2	intel/fs/gfx12: Ensure that prior reads have executed before barrier with acquire semantics. This avoids a violation of the Vulkan memory model that was leading to intermittent failures of at least 8k test-cases of the Vulkan CTS (within the group dEQP-VK.memory_model.) on TGL and DG2 platforms. In theory the issue may be reproducible on earlier platforms like IVB and ICL, but the SYNC.ALLWR instruction is not available on those platforms so a different (likely costlier) fix will be needed. The issue occurs within the sequence we emit for a NIR memory barrier with acquire semantics requiring the synchronization of multiple caches, e.g. in pseudocode for a barrier involving the TGM and UGM caches on DG2: x <- load.ugm // Atomic read sequenced-before the barrier y <- fence.ugm z <- fence.tgm wait(y, z) w <- load.tgm // Read sequenced-after the barrier In the example we must provide the guarantee that the memory load for x is completed before the one for w, however this ordering can be reversed with the intervention of a concurrent thread, since the UGM fence will block on the prior UGM load and potentially take a long time, while the TGM fence may complete and invalidate the TGM cache immediately, so a concurrent thread could pollute the TGM cache with stale contents for the w location before the UGM load has completed, leading to an inversion of the expected memory ordering. v2: Apply the workaround regardless of whether the NIR barrier intrinsic specifies multiple storage classes or a single one, since an acquire barrier is required to order subsequent requests relative to previous atomic requests of unknown storage class not necessarily specified by the memory scope information of the intrinsic.
Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20690> (cherry picked from commit `4a2e7306dd`)	2023-01-24 15:27:04 -08:00
Maíra Canal	2dbdee9909	v3dv: remove unused clamp_to_transparent_black_border property Commit `e07c5467` ("v3dv/format: use XYZ1 swizzle for three-component formats") removes the only code that handled the clamp_to_transparent_black_border variable. Therefore, the variable can be deleted, as it is not currently being used. Fixes: `e07c5467` ("v3dv/format: use XYZ1 swizzle for three-component formats") Signed-off-by: Maíra Canal <mcanal@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20746> (cherry picked from commit `86c9bdcd9a`)	2023-01-24 15:27:03 -08:00
Emma Anholt	5510c75a4e	Revert "nouveau/ci: temporary disable gk20a-gles" This reverts commit `8a1a3a31da`. The farm should be back up, and I swear nginx startup is fixed for real this time. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20775> (cherry picked from commit `11669c96bc`)	2023-01-24 15:27:03 -08:00
Lionel Landwerlin	5c3cd5da22	nir/divergence: add missing RT intrinsic handling Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20763> (cherry picked from commit `b82d9b1a3d`)	2023-01-24 15:27:03 -08:00
Dylan Baker	291c51a2ec	.pick_status.json: Update to `75276deebc`	2023-01-19 09:42:36 -08:00
Kenneth Graunke	41648b0e3f	intel/blorp: Lower base_workgroup_id to zero We don't use a base workgroup ID for BLORP. It needs to be lowered, or else we'll assert fail when compiling the compute shader. (Note for stable: this patch doesn't fix a bug in `4abdecce22` specifically, but rather is a missing patch that needed to go along with the rest of MR 20068, on whichever branches it exists on.) Fixes: `4abdecce22` ("iris: Lower load_base_workgroup_id to zero") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20750> (cherry picked from commit `a6c6a4ad04`)	2023-01-18 11:43:56 -08:00
Erik Faye-Lund	269dba25b6	zink: fix depth-clip disable cap We use EXT_depth_clip_enable for this, not EXT_depth_clip_control, which is what depth_clip_control_missing is a proxy for. Fixes: `721f33cd0f` ("zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20740> (cherry picked from commit `c12fed1804`)	2023-01-18 11:43:55 -08:00
Samuel Pitoiset	de650de6d8	ac/nir: clear unused components before storing XFB outputs to LDS Shader variables don't always exactly match intrinsics and they might contain unused slots. Fixes a bunch of regressions with RADV_PERFTEST=ngg_streamout on RDNA2, and also fixes RDNA3 NGG streamout. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8099 Fixes: `cd22bf90e7` ("ac/nir/ngg: refine nogs outputs handling") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20735> (cherry picked from commit `84241b1f75`)	2023-01-18 11:43:55 -08:00
Rob Clark	48adf7ae1e	freedreno: Restore GL_VENDOR string We cannot change this, as it has already been communicated to app partners. Also this breaks chrome's GPU quirk matching (which in some cases is non-gpu-related, but when all you have is a hammer, everything looks like a nail). Fixes: `9c1fbc076a` ("Return 'Mesa' for GL_VENDOR for community drivers") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20757> (cherry picked from commit `6f91a5ab07`)	2023-01-18 11:43:54 -08:00
Jason Ekstrand	0b3e2cda24	gallium,util: Pull u_indices and u_primconvert back into gallium This was moved in !13741 but doing so created a link-time dependency between util and gallium which causes problems for Vulkan drivers. Meanwhile, having mesa/main depend on gallium is fine now that we don't have any classic drivers. It's a bit circular but should be harmless. Fixes: `97ba2f2fd4` ("move util/indices to core util") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8098 Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20734> (cherry picked from commit `d292cb82b8`)	2023-01-18 11:43:47 -08:00
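
Commit `c90f932257` (nir/lower_io) says a negative offset passed the bounds check because the comparison used signed integers. The sketch below illustrates that class of bug only; it is not the NIR code. The helper names (`as_u32`, `in_bounds_signed`, `in_bounds_unsigned`) and the exact form of the comparison are assumptions made for illustration, with 32-bit wraparound modelled by masking.

```python
MASK32 = 0xFFFFFFFF


def as_u32(value: int) -> int:
    """Reinterpret an integer as an unsigned 32-bit value (two's complement)."""
    return value & MASK32


def in_bounds_signed(offset: int, size: int, bound: int) -> bool:
    # Hypothetical signed check: a negative offset makes offset + size
    # small, so the access is wrongly accepted.
    return offset + size <= bound


def in_bounds_unsigned(offset: int, size: int, bound: int) -> bool:
    # Hypothetical unsigned check: the negative offset is reinterpreted as
    # a very large value, so the access is rejected. Comparing against
    # bound - size (after checking size <= bound) avoids wraparound.
    off = as_u32(offset)
    bnd = as_u32(bound)
    return size <= bnd and off <= bnd - size


# A 4-byte access at offset -4 into a 256-byte buffer.
print(in_bounds_signed(-4, 4, 256))
print(in_bounds_unsigned(-4, 4, 256))
```

With these definitions the signed variant accepts the out-of-range access and the unsigned variant rejects it, while both agree on non-negative offsets.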


165245 commits