fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 08:48:07 +02:00

Author	SHA1	Message	Date
José Roberto de Souza	2c05488be1	anv: Align size of bos larger than 1MB to 64k to enable 64k pages BOs larger than 1MB don't go memory pool due the size but applications tend to use a lot of VkMemory with size larger than 1MB so to reduce the number of pages and improve performance here I'm aligning the size of BOs larger than 1MB to 64kb, this allows 64kb pages to be used at least on Xe KMD. This bring substantial perfomance benefit in exchange of a small memory waste. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	dde91cf9cb	anv: Always grow fixed address pools by 2MB in platforms that there is a performance gain MTL and newer integrated platforms has a performance gain when using transparent huge pages, because of the fixed address requirement we can't use slab for this case but we can change the initial pool size to 2MB so all allocations get the transparent huge page optimization. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	7361b3287f	anv: Remove useless if block I can't think in any case where that would be false, so lets drop it. While at it, also making some variables const. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	6f7a32ec92	anv: Add support for batch buffers in anv_slab_bo in i915 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	39bb51ab27	anv: Add support for batch buffers in anv_slab_bo in Xe KMD Because of the ANV_BO_ALLOC_CAPTURE flag, batch buffers were not allowed to use memory pool. So to workaround that here adding a new anv_bo_slab_heap heap for cached+coherent+capture buffers with the main goal to get batch buffer to memory pool but other buffers will as well. For now that will only work in Xe KMD as i915 requires more changes to support it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	a0a600ca5f	anv: Skip anv_bo_pool if memory pool is enabled The whole purpose of anv_bo_pool is to reduce the number of gem_create/destroy calls in command buffers that is something with a short life span. But slab_bo/memory pool does the same with even other benefits like doing 2MB allocations to enable THP. So here skipping the meat of anv_bo_pool_free() to directly return the bo to slab_bo. This change is also necessary because the way anv_bo_pool stores freed buffers it requires that all bos has a unique gem handle, what not true of buffer allocated by anv_slab. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	0b561f691b	anv: Add support for ANV_BO_ALLOC_DYNAMIC_VISIBLE_POOL in anv_slab_bo This flag was not supported in anv_slab_bo because it is set together with ANV_BO_ALLOC_CAPTURE and more important it has a specific VMA range. We can support it by adding a custom heap and allocating all bos in the heap with all necessary flags, but because application can also allocate those with vkAllocateMemory() here the ANV_BO_ALLOC_CAPTURE is appended to the vkAllocateMemory() path for integrated gpu and anv_slab_bo check if all the alloc_flags matches, because application could choose to allocate it in a cached but not coherent memory type for example. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	8fd4423d99	anv: Add support for ANV_BO_ALLOC_DESCRIPTOR_POOL in anv_slab_bo This flag was not supported in anv_slab_bo because it is set together with ANV_BO_ALLOC_CAPTURE and more important it has a specific VMA range. But we can easily support it by adding a custom heap with it and allocating all bos in the heap with all necessary flags. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	ea18572ff2	anv: Add support for ANV_BO_ALLOC_AUX_CCS in anv_slab_bo This changes allow us to support memory pool of bos with ANV_BO_ALLOC_AUX_CCS set. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	dabb012423	anv: Implement anv_slab_bo and enable memory pool This is implementing the functions in anv_slab_bo and actually enabling memory pool. This is heavily based on Iris memory pool implementation, the main difference is that the concept of heaps only exist in anv_slab_bo, we have function that takes the anv_bo_alloc_flags and decides what heap to place that bo. Some anv_bo_alloc_flags blocks memory pool, we can relax and remove some flags from this denied list later. This feature can be disabled in runtime by setting ANV_DISABLE_SLAB=true, this can help us to easily check if bugs are due to this feature or not. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	3bf6d42fda	anv: Add the base infrastructure to support memory pool Allocating larger buffers allows KMD/HW to enable optimizations that makes access to memory faster, also because of minimum alignment required in some cases we allocate 4k or 64k long buffers for usages that only needs a few bytes, wasting a lot of memory. Memory pool takes care of both of those things and here I'm adding the base infrastruture to implement this feature. The next patch will implement the functions in anv_slab_bo.c, spliting it in two to make review easier. The idea here is take the same approach as Iris and use pb_slab.h. In 99% of the places it will be transparent that anv_bo is actually a slab of a larger and real anv_bo, the remaning 1% of the places are handled here. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	5d8ec0ce5c	anv: Move VMA alignment requirements to its own function That will make easy to implement memory pool in the next patches as we need to calculate the VMA aligment without the KMD alignment requirement for memory pool. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	4e7ba17413	anv: Export anv_bo_is_small_heap() This function will be needs in two places in the next patches. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
Lionel Landwerlin	374ef9228b	anv: add ability to mmap at offset Jose: Added support for placed address Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
Lionel Landwerlin	1d46a663ae	anv: update Wa_22019225126 check Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34754>	2025-04-30 11:55:24 +00:00
Rohan Garg	b9fe5aad37	anv: enable VK_KHR_shader_bfloat16 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	07fa3b3785	intel: Add support for BFloat16 as cooperative matrix source Re-organize the configuration lists to make easier to include BFloat16 only for the Gfx125+ that support it, while keeping MTL supporting the "lowered" configurations from pre-Gfx125. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Tapani Pälli	ed9f135936	anv: put parenthesis to the set_sampler_size equation Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This fixes errors seen with some renderdoc captures failing to allocate descriptor sets. Fixes: `76096d04bb` ("anv: relax restriction on variable count descriptors") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34671>	2025-04-28 04:45:01 +00:00
Lionel Landwerlin	e60416b4e4	anv: use companion batch for operations with HIZ/STC_CCS destination We're currently crashing a couple of tests : dEQP-VK.pipeline.monolithic.depth.xfer_queue_layout.* deqp-vk: ../src/intel/blorp/blorp_blit.c:2935: blorp_copy: Assertion `blorp_copy_supports_blitter(batch->blorp, src_surf->surf, dst_surf->surf, src_surf->aux_usage, dst_surf->aux_usage)' failed. Tested on: dEQP-VK.api.copy_and_blit.copy_commands2.image_to_image_transfer_queue.all_formats.depth_stencil.* dEQP-VK.api.copy_and_blit.multiplanar_xfer.* dEQP-VK.pipeline.monolithic.depth.xfer_queue_layout.* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `31eeb72e45` ("blorp: Add support for blorp_copy via XY_BLOCK_COPY_BLT") Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34023>	2025-04-24 14:47:40 +00:00
Lionel Landwerlin	1f6cca0800	intel: fixup a few debugging option checks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ad328bc58d` ("intel: Switch uint64_t intel_debug to a bitset") Reviewed-by: Michael Cheng <michael.cheng@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34667>	2025-04-23 18:47:42 +00:00
Michael Cheng	3c267535ae	anv: Add new debug flag to show shader stage Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Add debug option to show current shader type being compiled within anv_shader_bin_create. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Casey Bowman <casey.g.bowman@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34596>	2025-04-22 23:09:26 +00:00
José Roberto de Souza	161c412a82	intel: Fix the MOCS values in XY_FAST_COLOR_BLT for Xe2+ Xe2 changed the MOCS field in few instructions, those now have a field for the MOCS index and other the encryption enable bit but ISL returns the combination of both aka MEMORY_OBJECT_CONTROL_STATE. To minimize changes I have added 2 macros to extract the values from the value returned by isl. From all the instructions changed Mesa only make use of two, so the other instruction will be handled in the next patch. Cc: mesa-stable Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34592>	2025-04-22 20:42:25 +00:00
Lina Versace	1bf8542490	anv: Enable VK_EXT_external_memory_acquire_unmodified Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Change-Id: If0480721f7f1fceec093e4ab7b5c9b712eb62ba1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32295>	2025-04-21 13:55:32 -07:00
Lina Versace	3613b9c4f7	anv: Fix comment about external queue transitions Not all images with DRM format modifiers use ANV_IMAGE_MEMORY_BINDING_PRIVATE. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Change-Id: Idc6bae70ec7080f96555a85dcdc0ead915b02935 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32295>	2025-04-21 13:55:27 -07:00
Lina Versace	e87a04c6c1	anv: Assert that only external images have private bindings Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Change-Id: If2f18d88d48f70a58e236080632e72afb94f5e0b Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32295>	2025-04-21 13:55:08 -07:00
Sagar Ghuge	0463e14b94	anv: Enable 64bit memory structure mode for RT Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Kevin Chuang	703f29874b	intel/bvh/debug: Adapt instance leaf dumping to support 64-bit RT Adding a boolean "enable_64b_rt" in anv_accel_struct_header for the interpret.py to properly decode anv_instance_leaf Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Kevin Chuang	cbc8af4555	intel/bvh: Compile and adapt bvh shaders separately into Xe1/2 and Xe3+ This change separate the encode, header, and copy shader into versions for Xe1/2 and Xe3+, including adding compile options and handling 64bit version of instance leaf for Xe3+. Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sagar Ghuge	5cd0f4ba2f	intel/compiler: Update MemRay data structure to 64-bit Rework: (Kevin) - Fix miss_shader_index offset - Handle hit group index Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Kevin Chuang	7b526de18f	intel/compiler/rt: Calculate barycentrics on demand This commit moves the calculation of tri_bary out of brw_nir_rt_load_mem_hit_from_addr(), and only do the calculation on demand, since unorm_float_convert can be expensive. We do this for both Xe1/2 and Xe3+ for consistency. Signed-off-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Sagar Ghuge	6deb1950a4	anv: Update RT dispatch globals to use 64bit data structure Rework (Kevin) - Fix Hit/Miss/Resume shader group table value Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kevin Chuang <kaiwenjon23@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33047>	2025-04-21 20:10:45 +00:00
Iván Briano	949d2e507d	anv: expose promoted KHR_depth_clamp_zero_one Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34614>	2025-04-18 21:31:37 +00:00
Rohan Garg	a5033c54e7	anv: use the common function for detecting a mesh shader stage Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34604>	2025-04-18 10:08:22 +00:00
Konstantin Seurer	2dee1117b7	vulkan: Add a vk_device parameter to get_encode_key Useful for selecting different encoding options based on hardware generation. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Caio Oliveira	fd0a7efb5a	spirv, nir: Delay calculation of shared_size when using explicit layout Move the calculation to nir_lower_vars_to_explicit_types(). This consolidates the check of shader_info::shared_memory_explicit_layout in a single place instead of in all drivers. This is motivated by SPV_KHR_untyped_pointers. Before that extension we had essentially two modes for shared memory variables - No layout decorations in the SPIR-V, and both internal layout and driver location was _given by the driver_. - Explicitly laid out, i.e. they are blocks, and decorated with Aliased. Because they all alias, we could assign them driver location directly to the start of the shared memory. With the untyped pointers extension, there's a third option, to be added by a later commit - Explicitly laid out, i.e. they are blocks, and NOT decorated with Aliased. Driver location is _given by the driver_. Blocks with and without Aliased can be mixed. The driver location of multiple blocks that don't alias depend on alignment that is driver-specific, which we can more easily do from the nir_lower_vars_to_explicit_types() that already has access to a function to obtain such value. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> (hk) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> (v3dv) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (anv/hasvk) Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (panvk) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (radv) Reviewed-by: Rob Clark <robdclark@gmail.com> (tu) Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34139>	2025-04-17 19:13:17 +00:00
Rohan Garg	cbc1ec4f73	anv: re enable compression for CPS surfaces on platforms other than Xe I accidentally disabled compression on CPS surfaces marked as storage or color attachment for all platforms, when this should only be limited to Xe. Fixes: 80f9b6 ('anv: CPB surfaces that are used as color attachments or for stores cannot be compressed') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34297>	2025-04-17 14:24:11 +00:00
Konstantin Seurer	cb31b5a958	clc,libcl: Clean up CL includes This patch does a couple of things to make CL integration with drivers as seamless as possible: - We pull in opencl-c.h and opencl-c-base.h to stop relying on system headers. - Parts of libcl.h are moved to new headers that are incomplete CL-safe variants of libc headers. - A couple of util headers are changed to remove now unnecessary __OPENCL_VERSION__ guards and make more headers CL safe. - Drivers now include src/compiler/libcl and use headers like macros.h,u_math.h instead of libcl.h. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33576>	2025-04-11 21:27:37 +00:00
Lionel Landwerlin	243c01c703	anv/iris: implement Wa_18040903259 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34433>	2025-04-11 13:54:35 +00:00
Lionel Landwerlin	d123aedfc7	anv: remove ALWAYS_INLINE from globally visible functions Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34433>	2025-04-11 13:54:35 +00:00
Lionel Landwerlin	938f79ed82	anv: update Wa_1607156449 to use WA infrastructure Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34433>	2025-04-11 13:54:35 +00:00
Lionel Landwerlin	e321c438dc	anv: fix self dependency computation Some upcoming changes in the runtime will make it impossible to rely on the pipeline or runtime information to know whether a fragment shader has input attachments. Instead we gather that information at compile time and store it in our shader bind_map. At runtime we check whether the fragment shader has input attachments and whether those map to the runtime depth/stencil input attachments to set the 3DSTATE_PS_EXTRA::PixelShaderKillsPixel. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `d2f7b6d5a7` ("anv: implement VK_KHR_dynamic_rendering_local_read") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32540>	2025-04-10 13:17:53 +00:00
Paulo Zanoni	fdbdfaed01	anv: add ANV_SYS_MEM_LIMIT for debugging system memory restrictions If you suspect a workload is failing because it needs more memory, you can set ANV_SYS_MEM_LIMIT=100 to give it all the memory available. This could make, for example, certain games start working (it really depends on how much RAM you have and how much the game wants). If you suspect a workload is too resource hungry, you can try to limit it with ANV_SYS_MEM_LIMIT=30 (or some other value) to see if it can deal with the more restricted environment and behave accordingly. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28513>	2025-04-09 22:48:18 +00:00
Paulo Zanoni	ec4b2ce664	anv: restore the old behavior of up to 75% of RAM for the system heap "We paid for sixteen gigs of RAM, so we gonna use the whole damn sixteen gigs of RAM!" - My Mom First, some history: The Anv 50%-or-75% rule was originally added in 2017 by `060a6434ec` ("anv: Advertise larger heap sizes"). When i915.ko started reporting memory sizes in its ioctls, it didn't impose any restrictions: 100% of SRAM was reported as available, so the restriction was in Mesa. When xe.ko was introduced, it only reported 50% of the SRAM as available through its ioctls, so commit `b571ae6e7a` ("intel: Make memory heaps consistent between KMDs") adapted the code to not take an extra 25% of the 50% that was already cut, and restricted i915.ko to 50% instead of the 50%-or-75%. In Kernel commit d2d5f6d57884 ("drm/xe: Increase the XE_PL_TT watermark"), xe.ko changed to reporting 100% of SRAM through its ioctls, so we adapted Mesa to do the right thing depending on which Kernel version was running. While this was all happening, we were discussing about which behavior was actually the best: restrict everything to 50% in order to avoid issues when many things are running in parallel, or keep the restriction only at 75% in order to allow high demanding workloads to make full use of the hardware. The way I see, if parallel applications are causing the system to run out of resources, the user always has the option to kill applications and use one thing at a time. On the other hand, if a single application needs more than 50% of the SRAM and we don't allow it in our heaps, the application will never work (unless, of course, the user patches Mesa). So in this commit we go back to allowing high-demanding applications to work by restoring the 50%-or-75% rule. This commit is especially useful in systems with integrated graphics, like LNL, where the option to upgrade RAM is not present. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28513>	2025-04-09 22:48:18 +00:00
Paulo Zanoni	02e896bc49	anv/xe: detect the newer xe.ko memory reporting model and act accordingly Kernel commit d2d5f6d57884 ("drm/xe: Increase the XE_PL_TT watermark") changed how xe.ko reportes memory: its ioctls now report 100% of the system RAM as available. Since our policy is to report 50% of the SRAM as available for the heaps, add some code to check the amount reported by xe.ko against the amount reported by the system, then act accordingly. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28513>	2025-04-09 22:48:18 +00:00
Paulo Zanoni	3db8931d4a	intel/i915: restrict the RAM size restrictions to Anv Before commit `b571ae6e7a` ("intel: Make memory heaps consistent between KMDs"), we had the following policy for reporting Sytem RAM memory sizes: - For OpenGL, we reported the total available RAM. - For Vulkan, we reported the total available RAM as: - 50% of the total RAM if the total RAM was <= 4GB, - 75% otherwise - In addition, the Memory Budget (for VK_EXT_memory_budget) is 90% of the "free" memory, which can be an extra 10% off of the 50% or 75%. When xe.ko was added, one key difference was noted: while i915.ko reported the "real" RAM memory sizes in its ioctls, xe.ko reported only 50% of the system RAM as available. Because of that (and other reasons, see this discussion on MR 28513), commit `b571ae6e7a` decided to unify the behavior by changing the Anv i915.ko rule to "always 50%" instead of "50% or 75%". This also changed the Iris rule to 50% instead of 100%. In my research, I couldn't find any reason why this restriction should also apply to Iris, so here we revert back to handling these size restrictions on Anv only. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28513>	2025-04-09 22:48:18 +00:00
Lionel Landwerlin	76096d04bb	anv: relax restriction on variable count descriptors VUID-VkDescriptorSetAllocateInfo-pSetLayouts-09380 says that : "If pSetLayouts[i] was created with an element of pBindingFlags that includes VK_DESCRIPTOR_BINDING_VARIABLE_DESCRIPTOR_COUNT_BIT, and VkDescriptorSetVariableDescriptorCountAllocateInfo is included in the pNext chain, and VkDescriptorSetVariableDescriptorCountAllocateInfo::descriptorSetCount is not zero, then VkDescriptorSetVariableDescriptorCountAllocateInfo::pDescriptorCounts[i] must be less than or equal to VkDescriptorSetLayoutBinding::descriptorCount for the corresponding binding used to create pSetLayouts[i]" But applications like are not following the spec. RADV doesn't apply that limit and allocates if there is enough space in the pool. Let's just do the same. Note that this issue got resolved with a vkd3d-proton change : `a7ac1a7d2f` But since this change is deleting more code than it adds, might as well go with it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12185 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32305>	2025-04-09 16:29:21 +03:00
Felix DeGrood	a09ddc3b77	anv: add INTEL_DEBUG=shaders-lineno Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30142>	2025-04-08 19:39:53 +00:00
Felix DeGrood	7a3de9e877	intel/brw: support for dumping shader line numbers Add support for dumping shader asm containing instruction line numbers matching offsets within instruction state pool buffer. Offsets should match values collected from eu stall sampling. This is required for match eu stall data with individual shader instructions. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30142>	2025-04-08 19:39:53 +00:00
Lionel Landwerlin	72bc74f0be	anv: add shader-hash debug option Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Emits a dummy MI_STORE_DATA_IMM with the shader hash in front of : - 3DSTATE_VS - 3DSTATE_HS - 3DSTATE_DS - 3DSTATE_HS - 3DSTATE_PS - COMPUTE_WALKER / GPGPU_WALKER Example : 0x00000000: 0x10000002: MI_STORE_DATA_IMM 0x00000000: 0x10000002 : Dword 0 DWord Length: 2 Force Write Completion Check : false Store Qword: 0 Use Global GTT: false 0x00000004: 0xffffe0c0 : Dword 1 Core Mode Enable: 0 0x00000008: 0x0000effe : Dword 2 Address: 0xeffeffffe0c0 0x0000000c: 0x126e815a : Dword 3 <------------ shader hash 0x00000010: 0x78100007 : Dword 4 Immediate Data: 309231962 0x00000000: 0x78100007: 3DSTATE_VS 0x00000000: 0x78100007 : Dword 0 DWord Length: 7 0x00000004: 0x00000000 : Dword 1 0x00000008: 0x00000000 : Dword 2 Kernel Start Pointer: 0x00000000 0x0000000c: 0x00040000 : Dword 3 Software Exception Enable: false Accesses UAV: false It'll correlate with the value emitted in the pipeline stats from fossil replay : $ grep -i 126e815a /tmp/stats.csv fossilize.aab93c5c3f965151.1.foz,GRAPHICS,de1b925dec8a8083,507378,498283,303434,vertex,8,50,4,0,1826,0,0,0,8,17,0,0x00000000126e815a,15 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34332>	2025-04-04 15:18:28 +00:00
Lionel Landwerlin	789f13359a	anv: consolidate environment variables Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34332>	2025-04-04 15:18:28 +00:00

1 2 3 4 5 ...

6260 commits