fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 08:18:11 +02:00

Author	SHA1	Message	Date
Mike Blumenkrantz	5555783012	nir: fix slot calculations for compact variables with location_frac a variable with a component offset may span multiple slots, and this cannot be inferred from its type alone (e.g., compacted clip+cull distances) cc: mesa-stable Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24163> (cherry picked from commit `59396eefe6`)	2023-07-28 18:48:25 +01:00
Nanley Chery	bd9177f20e	intel/blorp: Ambiguate after CCS resolves on gfx7-8 ISL's state-machine of CCS_D describes full resolves as leaving the aux buffer in the pass-through state. Hardware doesn't behave this way on gfx8 however. On that platform, full resolves transition the aux buffer to the resolved state. This was verified by dumping the CCS before and after a full resolve on BDW (gfx7 is simply assumed to behave the same). Ambiguate after resolving to match driver expectations. Prevents iris from failing piglit's fcc-write-after-clear on BDW with a future patch which relies on fast-clear encodings being removed after a resolve. The avoided failure is: Testing implicit read of partial block UNORM -> SNORM Probe color at (0,1,0) Expected: 1.000000 1.000000 1.000000 1.000000 Observed: 0.000000 0.000000 0.000000 0.000000 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23676> (cherry picked from commit `1d12b29b3f`)	2023-07-28 18:48:24 +01:00
Nanley Chery	186da6f7db	anv: Don't support ASTC images with modifiers Before this change, anv_get_image_format_features2 reported support for ASTC formats with any modifier (even those not supported by anv). But, we didn't intend to support that compressed image format with modifiers. With this change, the format feature function reports no support for modifiers on ASTC-formatted images. This prevents the next patch from causing assertion failures due to unsupported modifiers. Fixes: `355f318843` ("anv: Allow transfer-only linear ASTC images") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24120> (cherry picked from commit `e50af52e3d`)	2023-07-21 18:08:11 +01:00
Lionel Landwerlin	e649aaa746	intel/fs: fix missing predicate on SEL instruction Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `d8dfd153c5` ("intel/fs: Make per-sample and coarse dispatch tri-state") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9381 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24236> (cherry picked from commit `46958bcb74`)	2023-07-21 18:08:07 +01:00
Iván Briano	8104b4a92b	anv: ensure CFE_STATE is emitted for ray tracing pipelines Fixes sporadic failures in dEQP-VK.robustness.robustness2.*.rgen Fixes: `ecb709c853` ("anv: only emit CFE_STATE when scratch space increases") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9382 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24206> (cherry picked from commit `75990e5564`)	2023-07-18 22:48:01 +01:00
Rohan Garg	7910039e51	intel/perf: add perf query support for Intel Raptorlake Fixes: `4e0eca7dc3` ("intel/dev: Add device info for RPL") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24180> (cherry picked from commit `36d4e83299`)	2023-07-18 22:44:09 +01:00
Lionel Landwerlin	7880ada0df	anv: fix utrace signaling with Xe utrace submits can either have a batch or not. When there is a batch, the utrace vk_sync is signaled by the utrace batch (because utrace does a timestamp buffer copy using its own batch). When there is no batch, the utrace vk_sync should be signaled by the application batch (no timestamp copy required, utrace can read the timestamps when the application batch has completed). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `fdea48df5e` ("anv: Implement Xe version of anv_queue_exec_locked() and queue_exec_trace()") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24085> (cherry picked from commit `a85b84ba1e`)	2023-07-18 22:38:07 +01:00
Lionel Landwerlin	e887283d70	intel/fs: disable coarse pixel shader with interpolater messages at sample Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9292 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23962> (cherry picked from commit `c26c0a36d3`)	2023-07-18 22:38:06 +01:00
Janne Grunau	8841b3ab06	st/mesa: Set gl_config.floatMode based on color_format Sets the float color component type in st_visual_to_context_mode() ensuring float color values are not clamped. Fixes dEQP-EGL.functional.wide_color.window_fp16_default_colorspace on asahi, iris and most likely every other driver having it marked as fail or flake. Closes: mesa/mesa#9276 Signed-off-by: Janne Grunau <j@jannau.net> Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23914> (cherry picked from commit `fd4d0e1cc2`)	2023-07-16 14:18:22 +01:00
José Roberto de Souza	5217a77d6d	anv: Fix compute maximum number of threads value There is no mention in spec about subtract one of the number of threads, also Iris and blorp code don't subtract. Alchemist PRMs: Volume 2a: Command Reference: Instructions: CFE_STATE: Maximum Number of Threads: Normally set to the maximum number of threads: (# EUs) * (# threads/EU) Cc: mesa-stable Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23973> (cherry picked from commit `c142736f52`)	2023-07-15 23:00:48 +01:00
Lionel Landwerlin	836d642c34	anv: fix utrace batch allocation The introduction of a workaround adding lots of MI_NOOPs broke our computation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b9aa66d5d0` ("anv: disable preemption for 3DPRIMITIVE during streamout") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23792> (cherry picked from commit `a13ac83f1b`)	2023-06-28 18:30:05 +01:00
Matt Turner	455c266ec5	anv: Only expose video decode bits with KHR_video_decode_queue This fixes dEQP-VK.api.info.format_properties.g8_b8r8_2plane_420_unorm in combination with the CTS fix from https://gerrit.khronos.org/c/vk-gl-cts/+/12191 Fixes: `9361481780` ("anv: add video format features for the one supported video output format") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8263 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23776> (cherry picked from commit `561cce32f1`)	2023-06-28 18:25:17 +01:00
Matt Turner	c6f3c29ab0	anv: Pipe anv_physical_device to anv_get_image_format_features2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23776> (cherry picked from commit `727335045d`)	2023-06-28 18:23:19 +01:00
Kenneth Graunke	dbb2b8df97	intel: Initialize FF_MODE2 on all Gfx12 platforms On Alchemist, the FF_MODE2 documentation says that we must set the FF_MODE2 timer values for GS and HS to 224. The hardware performance tuning guide also recommends setting the TDS timer to 4. On Tigerlake, i915 applies workarounds to set the GS timer to 224 (failing to do so can cause HS/DS unit hangs), and the TDS timer to 4 (for performance). It doesn't currently apply a HS timer there, and I'm not sure if it's strictly necessary, but given that Alchemist needed it, and the other two settings matched, let's assume that it ought to match as well. Unfortunately, there has been a bug in the i915 workarounds infrastructure for non-masked context registers where writing one field of the register zeroes out all the others. So, I believe the Tigerlake TDS timer value of 4 isn't being applied correctly there, though the register is also not readable on that platform which makes it hard to verify. So, this may also speed up tessellation. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9233 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23839> (cherry picked from commit `1b3669a1ed`)	2023-06-28 18:09:26 +01:00
Francisco Jerez	952380c90c	intel/gfx12.5: Enable L3 partial write merging for compressible surfaces among other cases. This enables L3 partial write merging for a number of cases that seem to be getting accidentally disabled by the kernel, which was causing a serious performance bottleneck on DG2 and MTL platforms. The "Compressible Partial Write Merge Enable", "Coherent Partial Write Merge Enable" and "Cross-Tile Partial Write Merge Enable" bits in L3SQCREG5 were expected to be enabled by default (and confusingly, they even read off as enabled if you ran 'intel_reg read 0xb158' on an idle system), but they are getting clobbered during 3D context initialization by an i915 workaround. Enabling L3 partial write merging of compressible surfaces in particular seems to increase rendering fillrate by over 3x in some cases (e.g. the "VulkanFillRate/FillRateGPU/resolution:1[0-3]/format:*/blend:0" fillrate-bound microbenchmarks). Significant improvements can also be reproduced in most real-world workloads we've tested so far, e.g. Counter Strike GO improves by ~11%, Shadow Of the Tomb Raider improves by ~5.5%, and AztecRuins-VK improves by ~6.5% on DG2-512 -- Thanks a lot to Caleb Callaway for these figures. No regressions have been observed so far. Even though this patch might strike as surprisingly simple for such a large payoff, it's the result of Felix DeGrood and I trying to root-cause the rendering performance gap of DG2 on Linux vs Windows on and off during the last year, and some of the OA statistics captured by Felix early this month were greatly helpful for me to connect the last few dots, so Felix deserves a big chunk of the credit for this work. Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23783> (cherry picked from commit `427fee3507`)	2023-06-28 18:04:06 +01:00
Hyunjun Ko	4a288b1dd5	anv/video: fix to set U/V offset correctly. Fixes: `98c58a16ef` ("anv: add initial video decode support for h264.") Closes: mesa/mesa#9227 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23819> (cherry picked from commit `23d4e21d83`)	2023-06-27 13:54:07 +01:00
Lionel Landwerlin	249c76e27b	anv: align buffers to a cache line Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9217 Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23794> (cherry picked from commit `8509ebb68a`)	2023-06-27 13:54:03 +01:00
Lionel Landwerlin	ed62199266	anv: track buffer writes from shaders for query results writes In the following sequence : - write buffer B with a shader - barrier on buffer from shader-write to transfer - vkCmdCopyQueryPoolResults to buffer B The barrier should take care of ordering things between the shader writes and vkCmdCopyQueryPoolResults. The problem is that vkCmdCopyQueryPoolResults runs on the command streamer and that is not coherent or synchronized in the same way as shaders. This change marks the barrier has potentially containing pending buffer writes for queries so that we can insert the necessary flush for vkCmdCopyQueryPoolResults later. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9013 Cc: mesa-stable Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23675> (cherry picked from commit `cab8495625`)	2023-06-24 14:09:40 +01:00
Lionel Landwerlin	b99d3dd3d9	anv: limit ANV_PIPE_RENDER_TARGET_BUFFER_WRITES to blorp operations using 3D Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23074> (cherry-picked from commit `455a13fb7f`)	2023-06-24 14:09:40 +01:00
Lionel Landwerlin	9c599dff88	anv: avoid private buffer allocations in vkGetDeviceImageMemoryRequirementsKHR The whole point of vkGetDeviceImageMemoryRequirementsKHR is to avoid creating an image so we should completely avoid any allocation like the private binding. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4075dd16ab` ("anv: implement vkGetDeviceImageMemoryRequirementsKHR") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23720> (cherry picked from commit `0e728ea7b0`)	2023-06-20 13:38:03 +01:00
Nanley Chery	de04140973	intel/blorp: Avoid 32bpc fast clear sampling issue For 32bpc formats, the ICL+ sampler fetches the raw clear color dwords used for rendering instead of the converted pixel dwords typically used for sampling. The CLEAR_COLOR struct page documents this for 128bpp formats, but not for 32bpp and 64bpp formats. In blorp_copy, map R11G11B10_FLOAT to R8G8B8A8_UINT instead of R32_UINT. This will cause the sampler to fetch the clear color pixel, allowing drivers to keep clear color support enabled during copies. If iris is forced to convert blits to copies, this patch fixes the following test on gfx12: dEQP-GLES3.functional.fbo.color.repeated_clear.blit.rbo.r11f_g11f_b10f At the moment, both iris and anv won't hit this issue outside of blorp_copy. This is due to the read/write access restrictions they currently place on texture views that reinterpret the surface format. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8964 Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23604> (cherry picked from commit `f0b6b57c77`)	2023-06-15 22:28:13 +01:00
Lionel Landwerlin	6cfd019a85	anv: fix incorrect batch for 3DSTATE_CONSTANT_ALL emission Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23074> (cherry picked from commit `d7c28e526b`)	2023-06-15 22:12:42 +01:00
Lionel Landwerlin	d6d2f3661f	anv: disable mesh/task for generated draws Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23074> (cherry picked from commit `0da39bf8ee`)	2023-06-15 22:12:42 +01:00
Dylan Baker	09a04dda8c	meson: Key whether to build batch decoder on expat Instead of on Android. Which allows an end user to turn off expat without breaking or disabling Intel support. I've additionally refactored to separate expat and xmlconfig a bit more in the root meson.build This does make expat a hard dependency for building Intel tools, despite the fact that only aubinator actually requires it. This simplifies the build for the common case, and in the event that someone wants to build the Intel tools and doesn't have libexpat, they can fall back to the meson wrap for expat instead. fixes: `75276deebc` closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8791 Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23605> (cherry picked from commit `ce07aabab1`)	2023-06-15 22:10:48 +01:00
Rohan Garg	6d48542bbc	anv: split ANV_PIPE_RENDER_TARGET_BUFFER_WRITES for finer grained flushing split ANV_PIPE_RENDER_TARGET_BUFFER_WRITES into separate CS_STALL, RT_FLUSH & TILE_FLUSH flags in order to have finer control over cache coherency. Tigerlake CS has it's own cache fetching directly from the memory controller, so we need to do a tile flush to ensure the query data is visible. This fixes test_resolve_non_issued_query_data in vkd3d on TGL. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Fixes: `3c4c18341a` ("anv: narrow flushing of the render target to buffer writes") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23500> (cherry picked from commit `d0e0ba897f`)	2023-06-15 22:09:29 +01:00
Sagar Ghuge	4ebd09e445	anv: Set CS stall bit during HIZ_CCS_WT surface fast clear It make sense to enable CS stall so that it guarantees that the fast clear will start after tile cache flush has completed. cc: mesa-stable closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9030 Fixes: `e488773b` ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments" Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23063> (cherry picked from commit `688ee02864`)	2023-06-07 18:02:01 +02:00
Kenneth Graunke	e584100f43	intel/compiler: Fix a fallthrough in components_read() for atomics In commit `284f0c9a57` I refactored the handling of the data source to just call a helper rather than special casing opcodes with 0 or 2 sources. Unfortunately, I also dropped the "else return 1", creating a fallthrough for all sources other than SURFACE_LOGICAL_SRC_ADDRESS and SURFACE_LOGICAL_SRC_DATA. The case below happened to return the correct value for all cases except SURFACE_LOGICAL_SRC_SURFACE, which has been returning 2 instead of 1 since that commit. Restore the else case. Thanks to Marcin Ślusarz for catching this. Fixes: `284f0c9a57` ("intel/compiler: Add an lsc_op_num_data_values() helper") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23347> (cherry picked from commit `2d9a3bb093`)	2023-06-07 10:58:21 +02:00
José Roberto de Souza	0fd50430ae	intel: Fix support of kernel versions without DRM_I915_QUERY_ENGINE_INFO As Matt Turner pointed out, the commit here fixed breaks in Iris and ANV in kernel versions without support for DRM_I915_QUERY_ENGINE_INFO. As compute engines are only present in gfx12 and newer, and support for DRM_I915_QUERY_ENGINE_INFO was added before any gfx12 platform, we can check for gfx version before trying to get engine info. For ANV, this is done by checking if engine_info is not NULL, like in other places in the ANV source code. Fixes: `a364f23a6c` ("intel: Make gen12 URB space reservation dependent on compute engine presence") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9099 Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Tested-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23257> (cherry picked from commit `42f707e459`)	2023-06-02 19:34:01 +01:00
Rohan Garg	7b6fc2c9cd	hasvk: enable single texel alignment Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23321> (cherry picked from commit `732db2b60c`)	2023-06-02 19:34:01 +01:00
Rohan Garg	bdc40be418	anv: fix incorrect asserts when combining CPS and per sample interpolation CPS is dynamically turned off when per sample interpolation is active. Update the asserts to reflect this. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5644011f06` ("intel/compiler: Convert wm_prog_key::persample_interp to a tri-state") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23103> (cherry picked from commit `ef2b763d9c`)	2023-06-02 19:34:01 +01:00
Lionel Landwerlin	841d63ab08	anv: fix null descriptor handling with A64 messages global load/store (or A64 messages) need the NIR bound checking which is enabled by "robust" behavior even when robust behavior is disabled. Many thanks to Christopher Snowhill for pointing out the pushed constant related issue with the initial version of this patch. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21645> (cherry picked from commit `efcda1c530`)	2023-06-02 19:34:00 +01:00
Lionel Landwerlin	0340da7c82	anv: fix push range for descriptor offsets Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `379b9bb7b0` ("anv: Support fetching descriptor addresses from push constants") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21645> (cherry picked from commit `e1ffa067d3`)	2023-06-02 19:34:00 +01:00
Rohan Garg	a913343de5	anv: enable single texel alignment Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23263> (cherry picked from commit `f6a83ec988`)	2023-06-01 16:52:36 +01:00
Francisco Jerez	a825322179	anv: Fix calculation of guardband clipping region. The existing guardband region calculation was mixing up x/y_min with x/y_max in cmd_buffer_emit_viewport(), causing the calculated viewport area to always be an empty region. Luckily intel_calculate_guardband_size() returns a non-empty but bogus guardband region in that case, so this doesn't seem to have led to conformance regressions, but the off-center guardbands could potentially impact performance in geometry-heavy rendering. Fixes: `893fa30afe` ("anv: Include scissors in viewport calculations") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23174> (cherry picked from commit `9c26a6b3bb`)	2023-06-01 16:49:22 +01:00
Lionel Landwerlin	28f89e96cd	intel/fs: fix size_read() for LOAD_PAYLOAD With Anv/Zink, the piglit test : arb_shader_storage_buffer_object-max-ssbo-size -auto -fbo fsexceed is failing validation after copy propagation : load_payload(8) vgrf15:F, vgrf1+0.12<0>:F, vgrf1+0.0<0>:F, vgrf1+0.4<0>:F, vgrf1+0.8<0>:F, vgrf1+0.12<0>:F ../src/intel/compiler/brw_fs_validate.cpp:191: A <= B failed A = inst->src[i].offset / REG_SIZE + regs_read(inst, i) = 2 B = alloc.sizes[inst->src[i].nr] = 1 In most cases it works because src[0] would be at offset 0 and so reading a full reg passes validation, but Anv/Zink started emitting slightly different code adding an offset maybe the size read 2 GRFs. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23126> (cherry picked from commit `21c7b55f6f`)	2023-05-25 14:06:13 +01:00
Lionel Landwerlin	7a6aef60b6	anv: fix push descriptor deferred surface state packing Yuzu is running into a segfault because it writes the push descriptor twice with 2 different layouts, but without a draw/dispatch in between. First vkCmdPushDescriptorSetKHR() writes descriptor 0 & 1 with a uniform buffer. We toggle the 2 first bits of anv_descriptor_set::generate_surface_states. Second vkCmdPushDescriptorSetKHR() writes descriptor 0 with uniform buffer and descriptor 1 with an image view. The first bit of anv_descriptor_set::generate_surface_states stays, but the second bit was already set before and it should now be off. When we finally flush the push descriptor, we try to generate a surface state for descriptor 1, but there is no valid buffer view for it, we access an invalid pointer and segfault. This fix resets the anv_descriptor_set::generate_surface_states when the descriptor layout changes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b49b18f0b7` ("anv: reduce BT emissions & surface state writes with push descriptors") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23156> (cherry picked from commit `cab7ba00e2`)	2023-05-25 14:06:13 +01:00
Kenneth Graunke	fb39004b72	intel/compiler: Fix 64-bit ufind_msb, find_lsb, and bit_count We only support 32-bit versions of ufind_msb, find_lsb, and bit_count, so we need to lower them via nir_lower_int64. Previously, we were failing to do so on platforms older than Icelake and let those operations fall through to nir_lower_bit_size, which used a callback to determine it should lower them for bit_size != 32. However, that pass only emulates small bit-size operations by promoting them to supported, larger bit-sizes (i.e. 16-bit using 32-bit). It doesn't support emulating larger operations (i.e. 64-bit using 32-bit). So nir_lower_bit_size would just u2u32 the 64-bit source, causing us to flat ignore half of the bits. Commit `78a195f252` (intel/compiler: Postpone most int64 lowering to brw_postprocess_nir) provoked this bug on Icelake and later as well, by moving the nir_lower_int64 handling for ufind_msb until late in compilation, allowing it to reach nir_lower_bit_size which broke it. To fix this, we always set int64 lowering for these opcodes, and also correct the nir_lower_bit_size callback to ignore 64-bit operations. Cc: mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23123> (cherry picked from commit `a2d384a5c0`)	2023-05-25 14:06:12 +01:00
José Roberto de Souza	1b4720b305	anv: Fix ANV_BO_ALLOC_NO_LOCAL_MEM flag VK_MEMORY_PROPERTY_DEVICE_LOCAL_BIT is also set in all memory types of integrated GPUs. This flag means that memory will be allocated in the most efficient place for the GPU to access, which is true in integrated GPUs. However, this was causing ANV_BO_ALLOC_WRITE_COMBINE to be set in integrated GPUs in the block right below when allocating in the non-cached memory type. But the comment only talks about lmem, so to still keep the write combine behavior for iGPUs it was used VkMemoryPropertyFlags in mmap_calc_flags(). Additionally, this was causing anv_bo.has_implicit_ccs to always be set, which could change the expected behavior of anv_BindImageMemory2() in MTL. Fixes: `fbd32a04da` ("anv: add a third memory type for LLC configuration") added a new heap Fixes: `582bf4d9f7` ("anv: flag BO for write combine when CPU visible and potentially in lmem") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22483> (cherry picked from commit `a6c5746b37`)	2023-05-25 14:06:12 +01:00
Lionel Landwerlin	d8290f5f38	anv: mark images compressed for untracked layout/access Most of the compressed writes are tracked by the driver, for instances : - blorp writes - render target writes But we don't have any tracking for storage images (which have gained compression support on DG2+). So inspect the layout transition and when we see a layout/access that can do writes outside of our driver tracking, update the image state tracking. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8946 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22988> (cherry picked from commit `1a89b1a301`)	2023-05-25 14:06:12 +01:00
Lionel Landwerlin	754e0e2176	anv: put private binding BOs into execlists Not doing so all the reads/writes go to the scratch page on i915. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f9fa09ec92` ("anv/image: Add ANV_IMAGE_MEMORY_BINDING_PRIVATE") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22957> (cherry picked from commit `7f7b2fc53a`)	2023-05-25 14:06:12 +01:00
Tapani Pälli	8fe0aea591	anv: handle missing astc for gfx125 in CreateImageView Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22818> (cherry picked from commit `b0b6811b9b`)	2023-05-25 14:06:10 +01:00
Matt Turner	f93ea427b2	intel: Disable shader cache when executing intel_clc during the build With the shader cache enabled, intel_clc attempts to write to ~/.cache. Many distributions' build systems limit file-system access, and will kill the process thus causing the build to fail. Fixes: `639665053f` ("anv/grl: Build OpenCL kernels") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22968> (cherry picked from commit `435a607909`)	2023-05-25 14:06:10 +01:00
Lionel Landwerlin	c8c7dd030f	anv: fixup workaround 16011411144 We're missing it for the memcpy with streamout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5cc4075f95` ("anv, iris: Add Wa_16011411144 for DG2") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22930> (cherry picked from commit `7381405095`)	2023-05-25 14:06:10 +01:00
Daniel Schürmann	0dff70ecb1	vulkan/pipeline_cache: don't log warnings for client-invisible caches Fixes: `d3f06cf5ce` ('vulkan/pipeline_cache: don't log warnings for internal caches') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22986> (cherry picked from commit `8bfd18b8c5`)	2023-05-25 14:06:10 +01:00
Daniel Schürmann	2510200d5e	vulkan/pipeline_cache: don't log warnings for internal caches Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22850> (cherry picked from commit `d3f06cf5ce`)	2023-05-25 14:06:10 +01:00
Tapani Pälli	9a1bc5e34a	isl: fix layout for comparing surf and view properties These asserts were checking isl_format_layout against itself, change to compare surface format layout against view format layout. Fixes: `628bfaf1c6` ("intel/isl: Add some sanity checks for compressed surfaces") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22790> (cherry picked from commit `c35d430460`)	2023-05-05 19:07:15 +01:00
Lionel Landwerlin	6e3d86c809	intel/fs: fix scheduling of HALT instructions With the following test : dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.no_out_of_bounds_load There is a : shader_start: ... <- no control flow g0 = some_alu g1 = fbl g2 = broadcast g3, g1 g4 = get_buffer_size g2 ... <- no control flow halt <- on some lanes g5 = send <surface>, g4 eliminate_find_live_channel will remove the fbl/broadcast because it assumes lane0 is active at get_buffer_size : shader_start: ... <- no control flow g0 = some_alu g4 = get_buffer_size g0 ... <- no control flow halt <- on some lanes g5 = send <surface>, g4 But then the instruction scheduler will move the get_buffer_size after the halt : shader_start: ... <- no control flow halt <- on some lanes g0 = some_alu g4 = get_buffer_size g0 g5 = send <surface>, g4 get_buffer_size pulls the surface index from lane0 in g0 which could have been turned off by the halt and we end up accessing an invalid surface handle. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20765> (cherry picked from commit `9471ffa70a`)	2023-05-05 19:07:14 +01:00
Sviatoslav Peleshko	571992af74	anv: Improve image/view usage bits verification This change makes usage bits verification closer to the Vulkan spec. i.e. VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT does not always require all formats to support all the requested usage bits. Also, VK_IMAGE_CREATE_EXTENDED_USAGE_BIT, when combined with VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT can relax the requirements for the usage supported by the original image format. v2: Removed strict verification of the format_list_info formats usage per chadversary's suggestion. Other minor style/comments tweaks. v3: Added checking of all compatible formats when VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT and VK_IMAGE_CREATE_EXTENDED_USAGE_BIT are specified, but no list of possible formats was given. v4: Add VK_IMAGE_CREATE_BLOCK_TEXEL_VIEW_COMPATIBLE_BIT handling. Cc: 22.2 <mesa-stable> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6031 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `697ed61e7c`)	2023-05-05 16:02:30 +01:00
Sviatoslav Peleshko	b189a72fe4	anv: Handle UNDEFINED format in image format list It's not invalid to have this value in the list, but the only case it is actually valid as format in the creation of an image or image view is with Android Hardware Buffers which have their format specified externally. So we can just ignore all entries with VK_FORMAT_UNDEFINED. Cc: 22.2 <mesa-stable> Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `9899151361`)	2023-05-05 16:02:23 +01:00
Sviatoslav Peleshko	1122231f95	isl: Check all channels in isl_formats_have_same_bits_per_channel Cc: 22.2 <mesa-stable> Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `0ed8a48ce9`)	2023-05-05 16:02:14 +01:00

1 2 3 4 5 ...

9446 commits