fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 18:08:40 +02:00

Author	SHA1	Message	Date
Rhys Perry	e0e7bfadee	aco: ignore exec and literals when mitigating VALUMaskWriteHazard LLVM ignores exec and literals don't seem to work in some cases. fossil-db (navi31): Totals from 2676 (3.37% of 79395) affected shaders: Instrs: 10638979 -> 10646019 (+0.07%); split: -0.00%, +0.07% CodeSize: 55929640 -> 55959416 (+0.05%); split: -0.00%, +0.06% Latency: 107707408 -> 107712893 (+0.01%); split: -0.00%, +0.01% InvThroughput: 18119843 -> 18120442 (+0.00%); split: -0.00%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Backport-to: 24.1 Backport-to: 24.2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30818> (cherry picked from commit `ee648326d9`)	2024-08-28 20:33:17 +02:00
Lionel Landwerlin	cbee9e7a88	brw: switch mesh/task URB fence prior to EOT to GPU Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30849> (cherry picked from commit `93fba40389`)	2024-08-28 15:52:41 +02:00
Yiwei Zhang	c6187962a2	venus: workaround cacheline overflush issue on Intel JSL We observed that Venus on ANV on JSL platform has some cacheline flush issue. The overflush shows up as: 1. There're 2 threads venus bliting the feedback buffers suballocated from the same backing device memory, back to back. 2. On thread A, flushing the feedback buffer for cpu read is placed behind flushing a shader storage buffer for cpu read. 3. On thread B, flushing a different feedback buffer with the same backing device memory (different offset bound to) can kick the feedback buffer flush in (2) earlier than it should be flushed. 4. As a result, CPU polling thread for thread B results would see venus feedback buffer update earlier than shader storage buffer results being updated, breaking Venus sync primitives optimization. During investigation, a solid workaround for JSL platform is to force Venus to align up to 128 bytes for feedback buffer suballocation while the default is at 64 bytes. Cc: mesa-stable Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30879> (cherry picked from commit `7941d705c3`)	2024-08-28 15:31:32 +02:00
bbhtt	83e71e0ebd	pipe_loader_drm: Fix virtgpu_drm header path Fixes: `2ea4a59ab7` ("loader: Add better support for virtgpu nctx driver loading") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30673> (cherry picked from commit `284ad7da39`)	2024-08-28 15:31:31 +02:00
Eric Engestrom	da70827656	vc4: Add missing libvc4_neon build dependencies Duplicates the libvc4 dependencies. Fixes: `ebcb4c2156` ("meson: Enable VC4's NEON assembly support.") Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Co-authored-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30819> (cherry picked from commit `fda6f8638a`)	2024-08-28 15:31:25 +02:00
Sviatoslav Peleshko	dcfb085a6a	anv: Add full subgroups WA for the shaders with barriers in Breaking Limit When barriers are used in invalid shaders with non-uniform control flow we might get a hang. Forcing 32-wide group can help by making it more probable that barrier instruction is executed by at least one channel in each thread, and thus hang will be avoided. This shouldn't affect Xe2+, where active-thread-only barriers are used anyway. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11497 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30581> (cherry picked from commit `7e52b67801`)	2024-08-28 15:31:23 +02:00
Sviatoslav Peleshko	97fdcffb0c	anv: Release correct BO in anv_cmd_buffer_set_ray_query_buffer If p_atomic_cmpxchg doesn't set the ray_query_shadow_bos[bucket] to new_bo allocated by this thread, it returns the bucket BO allocated by the other thread and we use it. But due to a mistake, we also release that BO, not the candidate just allocated by this thread and never used again. Fixes: `5d3e4193` ("anv: enable ray queries") Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30581> (cherry picked from commit `1904fe1186`)	2024-08-28 14:26:48 +02:00
Sviatoslav Peleshko	f6432697e7	brw,elk: Fix opening flags on dumping shader binaries Truncation is needed for overwriting correctly in cases when old file is bigger than the one we want to dump (e.g. when the old one was edited inplace). Also, creation permissions are way too broad. Fixes: `4f41c44d` ("intel/compiler: Add variable to dump binaries of all compiled shaders") Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30581> (cherry picked from commit `09122e2be0`)	2024-08-28 14:26:47 +02:00
Eric Engestrom	a24002fc1a	.pick_status.json: Update to `7392e3306e`	2024-08-28 14:26:41 +02:00
David Rosca	afb9e01454	frontends/va: Fix leaks with multiple coded buffer segments The buffers can be reused, so we must only allocate added segments and free unused segments. Fixes: `be4287c3aa` ("pipe: Extend get_feedback with additional metadata") Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30779> (cherry picked from commit `1ebff2220d`)	2024-08-27 09:25:38 +02:00
Lionel Landwerlin	7d7a9cce41	nir/divergence: add missing load_constant_base_ptr Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30712> (cherry picked from commit `2158fe2ae2`)	2024-08-27 09:25:15 +02:00
Faith Ekstrand	56a5cf145e	nil,nvk: Disable modifiers for B10G11R11_UFLOAT and E5B9G9R9_UFLOAT The CTS tests fail due to precision issues (arguably a CTS bug) but it also doesn't make a lot of sense to advertise modifiers on them at all. Fixes: `cd428e01d7` ("nvk: Advertise VK_EXT_image_drm_format_modifier") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30861> (cherry picked from commit `b78a691ce2`)	2024-08-27 09:08:38 +02:00
Lionel Landwerlin	c64ba9c8d3	anv: always use workaround_address, not workaround_bo The workaround BO has some debug information at the beginning. The workaround address is placed after that. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30844> (cherry picked from commit `d8ec8acede`)	2024-08-27 09:08:29 +02:00
Dave Airlie	87ea4ab7d2	llvmpipe: make sure to duplicate the fd handle before giving out This handle is given to the user to close, so make sure to dup it first. Fixes: `d74ea2c117` ("llvmpipe: Implement dmabuf handling") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30839> (cherry picked from commit `4bf257a18f`)	2024-08-27 09:08:28 +02:00
Dave Airlie	6cd669911e	llvmpipe: handle stride properly on lvp udmabuf imports The import data comes in via the fd import, but we need to make sure to store the row stride value here. Fixes: `c44d65a467` ("lp: only map dt buffer on import from dmabuf") Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30839> (cherry picked from commit `521dc42e6c`)	2024-08-27 09:08:27 +02:00
Mike Blumenkrantz	631732e3df	dril: add zink stub ironically this was the only driver left out Fixes: `3de62b2f9a` ("gallium/dril: Compatibility stub for the legacy DRI loader interface") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30851> (cherry picked from commit `786be05df3`)	2024-08-27 09:08:26 +02:00
Valentine Burley	6acc836b91	llvmpipe: Only use udmabuf with libdrm It's possible to have the linux/udmabuf.h header but not libdrm in some setups, like under Termux. Fixes: `112063a060` ("llvmpipe: Only use udmabuf if header is found") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Signed-off-by: Valentine Burley <valentine.burley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30853> (cherry picked from commit `4cfaf10c10`)	2024-08-27 09:08:25 +02:00
Lionel Landwerlin	5eb0a7d809	anv: explicitly disable BT pool allocations at device init The default state doesn't seem well defined (or kernel driver bug maybe?). Let's just set it to disabled on platforms where we're not using it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Found-by: Chuansheng Liu <chuansheng.liu@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30841> (cherry picked from commit `1f9c40a8d1`)	2024-08-27 09:08:24 +02:00
Dave Airlie	591cfcaf79	radv/video: fix reporting video format props for encode. When encode isn't enabled, refuse the image usage, also use the correct error on the decode check. Fixes: `05cd42417f` ("radv/video: enable video encoding behind perftest flag") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30838> (cherry picked from commit `68cd36d9b4`)	2024-08-27 09:08:23 +02:00
David Heidelberg	71812c26ea	etnaviv: build dependency for the etnaviv tests Resolves failures as: ... -o src/etnaviv/isa/tests/etnaviv_disasm.p/disasm.cpp.o -c ../src/etnaviv/isa/tests/disasm.cpp In file included from ../src/etnaviv/isa/tests/disasm.cpp:12: ../src/etnaviv/isa/asm.h:15:10: fatal error: etnaviv/isa/enums.h: No such file or directory 15 \| #include "etnaviv/isa/enums.h" \| ^~~~~~~~~~~~~~~~~~~~~ Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11740 Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30829> (cherry picked from commit `8f8a51ac5c`)	2024-08-27 09:08:23 +02:00
David Heidelberg	dab7b68cbd	etnaviv: rename enums_h appropriately Needed for the follow-up change. Cc: mesa-stable Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30829> (cherry picked from commit `43bff3b9eb`)	2024-08-27 09:08:22 +02:00
Lepton Wu	e32dcad055	egl/android: Fix wrong pipe format for RGB_565 We were actually using PIPE_FORMAT_B5G6R5_UNORM for HAL_PIXEL_FORMAT_RGB_565 since Android support was added to Mesa. This restores the original behavior. Fixes: `273e54391a` ("egl/android: Remove hard-coded color-channel data") Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Signed-off-by: Lepton Wu <lepton@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30827> (cherry picked from commit `72506ac8c7`)	2024-08-27 08:56:06 +02:00
Karol Herbst	a508c236bf	vtn: ignore volatile on functions for now Not sure if we have to do something about it here, but maybe at some point we do? Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30739> (cherry picked from commit `e9d908206b`)	2024-08-27 08:56:05 +02:00
Karol Herbst	6f48c16b0f	rusticl/device: limit CL_DEVICE_IMAGE_MAX_BUFFER_SIZE more aggressively We can't exceed c_int::MAX, because the CTS casts to ints in a few places. We also need to take into account max pixel size when restricting to max_mem_alloc as this cap is pixel based, not byte based. Cc: mesa-stable Reviewed-by: @LingMan Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30739> (cherry picked from commit `eef1af8128`)	2024-08-27 08:56:03 +02:00
Timothy Arceri	54fea2435e	nir/glsl: set deref cast mode during function inlining See code comment for details. Issue: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11535 Fixes: `c6c150b4cd` ("glsl_to_nir: support conversion of opaque function params") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30338> (cherry picked from commit `d681cf96fb`)	2024-08-27 08:48:46 +02:00
Timothy Arceri	baa301d5f7	glsl: make use of new tex src deref intrinsic The bindless spec has no language requiring functions params to be defined as bindless so we need to be able to look at the values being passed to functions to decide if they are bindless or not. This intrinsic allows us to wait until function inlining is complete to make this assessment. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11535 Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30315> (cherry picked from commit `a629d829dc`)	2024-08-27 08:48:32 +02:00
Timothy Arceri	738d009a85	nir: add nir_tex_src_{sampler,texture}_deref_intrinsic To be used as a placeholder until after function inlining so we can replace function params with bindless handles if needed. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30315> (cherry picked from commit `017770ff14`)	2024-08-27 08:48:28 +02:00
Timothy Arceri	ecdc6e1cc3	nir: create validate_tex_src_texture_deref() helper Will be used in a following patch. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30315> (cherry picked from commit `ef13ff00d1`)	2024-08-27 08:48:24 +02:00
Eric Engestrom	39053d8caa	.pick_status.json: Update to `4a8f3181ba`	2024-08-27 08:48:17 +02:00
Mauro Rossi	49ee36baea	nvk: Fix regression observed on Kepler vkcube, vkgears and vkmark are crashing with the following error/segfault: $ NVK_I_WANT_A_BROKEN_VULKAN_DRIVER=1 vkcube WARNING: NVK is not a conformant Vulkan implementation, testing use only. Selected GPU 0: GeForce GT 640 (NVK GK107), type: DiscreteGpu ERROR: couldn't get DataFile for op ldc_nv Segmentation fault (core dumped) Handling nir_intrinsic_ldc_nv as per nir_intrinsic_load_ubo in Converter::getFile() allows to run vkcube, vkgears and vkmark on Nvidia GT640 Fixes: `dc99d9b2` ("nvk,nak: Switch to nir_intrinsic_ldc_nv") Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30832> (cherry picked from commit `7b32df696e`)	2024-08-25 17:09:34 +02:00
Faith Ekstrand	809fc297c3	nvk: Disable conditional rendering around CopyQueryPoolResults Fixes: `57c38a5669` ("nvk: Implement CopyQueryPoolResults with a compute shader") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30826> (cherry picked from commit `cdef36c422`)	2024-08-25 17:08:03 +02:00
Nanley Chery	e4f95a6c0e	intel/isl: Fix packing of SINT formats Prevents the next patch from failing many multisampled, signed integer rendering tests. For example: dEQP-VK.renderpass2.suballocation.multisample_resolve.r8_sint.samples_4 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30646> (cherry picked from commit `dfcd93d12f`)	2024-08-25 17:08:00 +02:00
Lionel Landwerlin	886406f4f4	anv: don't miss workaround for indirect draws Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30803> (cherry picked from commit `195c5b68ba`)	2024-08-25 17:07:59 +02:00
Lionel Landwerlin	5ed277b5d4	anv: move conditional render predicate after gfx_flush_state Following up on `f8c0a99d52` ("anv: emit conditional after gfx state flushing"), this should have been applied everywhere. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `0147908a89` ("anv: predicate emission of STATE_BASE_ADDRESS") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30803> (cherry picked from commit `f25b500af4`)	2024-08-25 17:07:57 +02:00
Lionel Landwerlin	b12ada2d96	anv: optimize CLIP::MaximumVPIndex setting Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11746 Fixes: `982106e676` ("anv: only set 3DSTATE_CLIP::MaximumVPIndex once") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30762> (cherry picked from commit `a88898a28f`)	2024-08-25 17:07:30 +02:00
Connor Abbott	c3d68212e7	tu: Treat partially-bound depth/stencil attachments as passthrough Make sure to preserve the depth or stencil components of D24S8 using the fixed codepath just added. While we're here, fix the detection of whether an attachment is bound. Fixes: `cb0f414b` ("tu: Add support for suspending and resuming renderpasses") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26154> (cherry picked from commit `812c8f6abe`)	2024-08-25 17:06:02 +02:00
Connor Abbott	61d6cbf7aa	tu: Fix passthrough D24S8 attachments We need to make sure that we don't trash a passthrough depth/stencil aspect if we need to store the whole attachment by loading it beforehand. Fixes: `cb0f414b` ("tu: Add support for suspending and resuming renderpasses") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26154> (cherry picked from commit `5377219ca0`)	2024-08-25 16:51:21 +02:00
Eric Engestrom	df2cf27bdb	.pick_status.json: Update to `81e3930ec0`	2024-08-25 16:51:11 +02:00
Ian Romanick	cbd6dc81ae	anv: Larger memory pools for huge shaders At least one ray tracing shader in cp2077 is over 4MB on Xe2. There isn't a memory pool large enough for the allocation, so the driver crashes instead. This commit adds 8MB and 16MB pools. I intend this as a stop gap fix. I would prefer to figure out why this shader is so much larger than on previous platforms. The shader in question has 3824 spills and 8625 fills. That is not good. I suspect dealing with that will also solve the problem, but that will require a bit more time. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11739 Suggested-by: Lionel Landwerlin Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30751> (cherry picked from commit `09cf9fe8ab`)	2024-08-22 10:47:36 +02:00
Ian Romanick	ee320276cd	anv: Protect against OOB access to anv_state_pool::buckets Suggested-by: Paulo Zanoni Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30751> (cherry picked from commit `0921dfa044`)	2024-08-22 10:47:35 +02:00
Mike Blumenkrantz	0f82e06741	tc: set resolve on renderpass info if blit terminates the renderpass this avoids a scenario where invalidate happens after a non-winsys blit for a renderpass and the driver skips storing framebuffer contents because the invalidate flag is set cc: mesa-stable Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30750> (cherry picked from commit `2fa52bf6e5`)	2024-08-22 10:47:34 +02:00
Mike Blumenkrantz	f79163d200	zink: don't skip cbuf store ops if resolve is set inlined resolve ops are still somehow slower than explicit ones, so the data has to be written out for the resolve cc: mesa-stable Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30750> (cherry picked from commit `38f4501a5c`)	2024-08-22 10:47:32 +02:00
Mary Guillemard	627ef26a8d	panvk: Fix viewport calculation This fix "dEQP-VK.dynamic_state..general_state.{bind_order, state_persistence, state_switch}" Fixes: `1f57aae4e4` ("panvk: Move vkCmdDraw functions to their own file") Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30754> (cherry picked from commit `a869237d50`)	2024-08-22 10:47:16 +02:00
Konstantin	b97d6c617c	radv: Handle repeated instructions when splitting disassembly cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30734> (cherry picked from commit `19d633af0b`)	2024-08-22 10:47:15 +02:00
Konstantin	db5e0d790b	radv: Handle instruction encodings > 8 bytes when splitting disassembly Choosing the wrong instruction length prevents radv_dump_annotated_shader from matching waves. cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30734> (cherry picked from commit `1cf507b806`)	2024-08-22 10:47:14 +02:00
Eric Engestrom	4565faec11	.pick_status.json: Update to `c5156257d9`	2024-08-22 10:47:10 +02:00
Rhys Perry	0850b0183b	aco: only remove branch jumping over SMEM/barrier if it's never taken SMEM might be an invalid access, and barriers are probably expensive. fossil-db (navi21): Totals from 126 (0.16% of 79395) affected shaders: Instrs: 2764965 -> 2765377 (+0.01%) CodeSize: 15155348 -> 15156788 (+0.01%) Latency: 17604293 -> 17604296 (+0.00%) Branches: 105211 -> 105623 (+0.39%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Backport-to: 24.1 Backport-to: 24.2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30321> (cherry picked from commit `c29d9f1184`)	2024-08-22 10:40:24 +02:00
Rhys Perry	c224584921	aco: split selection_control_remove into rarely_taken and never_taken No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Backport-to: 24.1 Backport-to: 24.2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30321> (cherry picked from commit `b934255510`)	2024-08-22 10:40:24 +02:00
Francisco Jerez	3e0544ea2d	intel/brw/gfx12.5+: Fix IR of sub-dword atomic LSC operations. We were currently emitting logical atomic instructions with a packed destination region for sub-dword LSC atomics, along the lines of: > untyped_atomic_logical(32) dst<1>:HF, ... However, these instructions use an LSC data size D16U32, which means that the 16b data on the return payload is expanded to 32b by the LSC shared function, so we were lying to the compiler about the location of the individual channels on the return payload, its execution masking, etc. This is why the hacks that manually set the 'inst->size_written' of the instruction were required. In some cases this worked, but any non-trivial manipulation of the instruction destination by lowering or optimization passes could have led to corruption, as has been reproduced in deqp-vk during lower_simd_width() for shaders that use 16-bit atomics in SIMD32 dispatch mode. Note that LSC sub-dword reads aren't affected by this because they use raw UD destinations and specify the actual bit size of the operation datatype as the immediate SURFACE_LOGICAL_SRC_IMM_ARG, which doesn't work for atomic operations since that immediate specifies the atomic opcode. Instead, have the logical operation implement the behavior of 16-bit destinations correctly instead of silently replacing the 16-bit region with an inconsistent 32-bit region -- This is done by emitting the MOV instructions used to pack the data from the UD temporary into the packed destination from the lower_logical_sends() pass instead of from the NIR translation pass. Fixes: `43169dbbe5` ("intel/compiler: Support 16 bit float ops") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30683> (cherry picked from commit `71ca8529c5`)	2024-08-21 19:06:46 +02:00
Nanley Chery	02be7e928d	iris: Invalidate state cache for some depth fast clears We need to invalidate the state cache when updating the value in the indirect clear color so that existing surface states can pick up the new value. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30520> (cherry picked from commit `55dbc58bf4`)	2024-08-21 19:06:45 +02:00

1 2 3 4 5 ...

192602 commits