fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 15:28:18 +02:00

Author	SHA1	Message	Date
Tapani Pälli	85978ccd28	anv: route clear operations on compute to companion Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This fixes bunch of cts tests hitting issues when attempting anv_image_mcs_op with compute. Fixes: `ab9d3528dc` ("anv: fix queue check in anv_blorp_execute_on_companion on xe3") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39581>	2026-01-29 14:25:54 +00:00
Michael Cheng	4f82dfc5f5	anv: Implement RT shader group handle capture/replay Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33022>	2026-01-29 08:46:50 +00:00
Lionel Landwerlin	9e4d9d3f35	anv: fix shader heap replay addr Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Michael Cheng <michael.cheng@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33022>	2026-01-29 08:46:50 +00:00
Lionel Landwerlin	a05fc97bc9	anv/iris: add drirc to enable sampler state & compute surface state prefetch I noticed we disable the prefetch only on Gfx12.5. But surely that recommendation carries on on later platforms. It seems other drivers just disable it all the time and only have an option to force the prefetch. So implementing the same thing here. Blorp path is left untouched. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39424>	2026-01-28 13:13:40 +00:00
Iván Briano	5b48805b42	brw: fix local_invocation_index with quad derivaties on mesh/task shaders For mesh/task shaders, the thread payload provides a local invocation index, but it's always linear so it doesn't give the correct value when quad derivatives are in use. The lowering pass where all of this is done correctly for compute shaders assumes load_local_invocation_index will be lowered in the backend for mesh/task, calculates the values for the quads correctly but then avoid replacing the original intrinsic and we remain with the wrong results. Add an intel specific intrinsic and always lower the generic one to that (or whatever else was calculated) to avoid ambiguities and fix the value for quad derivatives. Fixes future CTS tests using mesh/task shaders under: dEQP-VK.spirv_assembly.instruction.compute.compute_shader_derivatives.* Fixes: `d89bfb1ff7` ("intel/brw: Reorganize lowering of LocalID/Index to handle Mesh/Task") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39276>	2026-01-27 22:28:19 +00:00
Nanley Chery	e42b2a5d70	anv: Don't partial resolve LOD1+ for non-FCV CCS We don't allow fast-clears in this case. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:54 +00:00
Nanley Chery	21d187b7f5	anv: Support fast clears on more layers On Xe2+, support multi-layer and non-zero-layer CCS fast-clears. To do this in a simple manner, drop the code which splits multi-layer clears into fast clears and slow clears. The performance CI reports no regressions nor improvements on BMG. For MCS on all platforms and for CCS on prior platforms, use a new heuristic. Instead of only allowing fast clears on the first slice/layer, do the following: For 3D images, only fast-clear if all slices are cleared. Enables fast-clearing every slice of 3D textures in: * Terminator Resistance - 480x270x128. * Ghostrunner 2 - 320x180x128. For 2D arrays, match the Xe2+ behavior and allow clearing to any layer. This is possible because we only allow fast-clearing if the clear color matches the default value. Enables fast-clearing every layer of 2D array textures in: * Assassin's Creed - 128x128, 6-layers. * Blackops 3 - 1024x1024, 6-layers. * Borderlands 3 - 128x128, 6-layers. * Cyberpunk - 1024x1024, 10-layers. * Unigine Superposition - 4K, 2-layers. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11893 Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:54 +00:00
Nanley Chery	b8f6ad9060	anv: Use variable default value for some images using CLEAR A future commit will enable clearing to more than the first layer of 2D array images. To ensure consistency for the clear color, require the ANV_FAST_CLEAR_DEFAULT_VALUE for such images if they make use of ISL_AUX_STATE_CLEAR. Also, use a non-zero default value for some image formats. I tested the majority of workloads in the performance CI. This will cause those which clear to 2D array layers to gain clears on more than just the first layer. At the moment, we still only support clearing the first layer, so there should be no change in performance. Affected games are documented in the code. Acked-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:53 +00:00
Nanley Chery	811c413f98	anv: Don't return the Xe2+ fast-clear type early Don't return early from anv_layout_to_fast_clear_type() for Xe2+. We'll need to make more use of the function for some MCS changes in later commits. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:53 +00:00
Nanley Chery	7bb7b63b96	anv: Line wrap anv_CmdClearColorImage Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:52 +00:00
Nanley Chery	390c9e3fda	anv: Inline the CCS/MCS predicated resolve functions Now we can see the MI writes performed before and after the resolves in transition_color_buffer(). Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:52 +00:00
Nanley Chery	4d8c71ab1f	anv: Delete conversion of CCS_D partial resolve Now that hasvk is the driver for supporting HSW and BDW, we no longer need to convert CCS_D partial resolves to full resolves to avoid an assert-failure in BLORP. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:51 +00:00
Nanley Chery	b1db1179c2	anv: Set compressed bit separately from fast-clear type This will make handling fast-clears on multiple layers simpler by saving us from having to pass more parameters into fast-clear state setting functions. It also allows us to set more complex fast-clear state for FCV_CCS_E without marking the image as compressed. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:50 +00:00
Nanley Chery	c054d4fe2f	anv: Support partial resolves on any level/layer Enables more support for FCV_CCS_E partial resolves if we ever need it. Also enables support for multiple layers being fast cleared and needing resolves. Support for that will arrive in several commits. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:50 +00:00
Nanley Chery	0a8ab13b9d	anv: Reset fast-clear type in transition_color_buffer() Moving the code here will simplify the task of supporting fast-clears on multiple array layers and depth slices. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:49 +00:00
Nanley Chery	ce196c9de5	anv: Fix the fast clear type for FCV writes We started allowing non-default clear colors with FCV in commit `cd8e120b97`. When rendering to an image with FCV, set the fast-clear type to ANV_FAST_CLEAR_ANY if the image properties allow such fast-clears. Fixes: `cd8e120b97` ("anv: Allow more single subresource fast-clears with FCV") Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:49 +00:00
Nanley Chery	e7854d06a5	anv: Update predicated resolve documentation * Don't mention gfx7-8 due to the hasvk split. * Account for the array of clear colors. Fixes: `0e6b132a75` ("anv: Access more colors in fast_clear_memory_range") Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:48 +00:00
Caleb Callaway	a91a636faf	driconf: LTO disable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39544>	2026-01-27 14:57:20 +00:00
Hans-Kristian Arntzen	22bd72aa58	anv: Enable VK_EXT_present_timing. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38770>	2026-01-27 11:09:51 +00:00
Hans-Kristian Arntzen	c18b14aea2	anv: Add PRESENT_STAGE_LOCAL_EXT path for calibration. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38770>	2026-01-27 11:09:50 +00:00
Francisco Jerez	349b09f8a2	anv/gfx12.5: Apply HIZ-CCS resolve TC flush on full resolves for all gfx12.5. This appears to be needed to guarantee that a resolved depth surface has no remaining fast-cleared blocks on DG2 as well as MTL. After this series this should no longer be hit in practice since we'll be doing partial resolves in most cases, but it seems sensible to keep and correct the workaround for our peace of mind to make sure that full resolves are truly resolving the main surface. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:17 +00:00
Francisco Jerez	8e1b4b62ce	anv/gfx12.5: Take advantage of partial resolves in depth layout transitions. Issue a partial resolve instead of a full resolve from transition_depth_buffer() when the final usage requires the CCS-compressed surface to provide a complete representation of the image. This significantly improves performance of applications that frequently interleave depth rendering and sampling on non-WT surfaces (e.g. MSAA surfaces). Nba2K23-trace-dx11-2160p-ultra improves performance by about 260% with this on MTL, DG2 shows a similar benefit. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:17 +00:00
Francisco Jerez	cc66f5ff1d	intel/blorp: Add support for partial resolves of HiZ-CCS surfaces. v2: Define additional enum BLORP_OP_HIZ_PARTIAL_RESOLVE to track partial resolves (Nanley). v3: Add comment regarding fall back to full resolve on Gfx12.0 (Nanley). Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:17 +00:00
Francisco Jerez	f9ce1b9c40	anv/gfx12.5+: Keep HIZ_CCS aux usage while sampling from depth surfaces. As long as the surface is in a state with valid AUX state with identity contents of the HiZ surface (E.g. in ISL_AUX_STATE_COMPRESSED_CLEAR, ISL_AUX_STATE_COMPRESSED_NO_CLEAR, ISL_AUX_STATE_RESOLVED or ISL_AUX_STATE_PASS_THROUGH states) we can keep compression enabled, which works around hardware bugs on MTL and DG2, and will be helpful to switch to partial resolves in a future commit. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:16 +00:00
Francisco Jerez	baf39d4322	anv/gfx12.5: Infer ISL_AUX_STATE_COMPRESSED_HIER_DEPTH from anv_layout_to_aux_state(). Update anv_layout_to_aux_state() to return the ISL_AUX_STATE_COMPRESSED_HIER_DEPTH state in cases where we may be rendering into a HiZ surface in non-WT aux mode, instead of ISL_AUX_STATE_COMPRESSED_CLEAR. v2: No need to handle ISL_AUX_STATE_COMPRESSED_HIER_DEPTH in anv_layout_to_fast_clear_type() since it should never be reached (Nanley). Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:16 +00:00
Francisco Jerez	157a4cc6d0	anv/gfx12.5: Resolve depth during layout transitions from ISL_AUX_STATE_COMPRESSED_HIER_DEPTH. For transitions to a state that requires the image to be fully defined by the primary+CCS surface without necessarily requiring a valid primary we have to perform a resolve if the initial state was ISL_AUX_STATE_COMPRESSED_HIER_DEPTH, which isn't fully defined by its primary+CCS surface. This full resolve will be replaced with a more efficient partial resolve in a future commit, but we have to do this up front in order to avoid breaking bisectability. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:16 +00:00
Francisco Jerez	7f1ed1e411	anv/gfx12.5: Can't fast clear multisampled Z/S with HIZ CCS WT aux usage. We can end up in this situation in cases where the application uses a layout that allows both rendering and sampling from a depth surface, since in such cases we will attempt to render with HIZ CCS WT usage as a side effect of using ISL_AUX_USAGE_HIZ_CCS_WT for all layouts that allow the image to be sampled from. Disabling fast clears for that case isn't expected to cause performance regression since before this series for HiZ CCS non-WT images transitioning to such a layout we would have issued a full resolve and used ISL_AUX_USAGE_NONE, which also doesn't support fast clears. Multisample depth images should still get fast clears after this commit in cases where the rendering and sampling is split into separate render pasess with a layout transition between them that transitions the image from a W/O layout into a R/W one -- Such transitions will be handled with a relatively cheap partial resolve in a subsequent commit. v2: Add details of additional findings about these hardware issues in comment. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> v3: Pass aspect bit consistent with layout to anv_layout_to_aux_usage() instead of defaulting to VK_IMAGE_ASPECT_DEPTH_BIT. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:15 +00:00
Francisco Jerez	02030b4b8f	anv: Use actual layout in anv_fast_clear_depth_stencil() instead of ANV_IMAGE_LAYOUT_EXPLICIT_AUX. Currently anv_fast_clear_depth_stencil() doesn't know the correct layout of the depth and stencil images, instead it uses ANV_IMAGE_LAYOUT_EXPLICIT_AUX to force the base AUX usage of each plane, which can be inconsistent with the VkImageLayout currently in use. Plumb the correct depth and stencil layouts. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:15 +00:00
Francisco Jerez	d283e44634	anv/gfx12.5: Allocate indirect color state for depth surfaces. The clear color state has to be allocated since we will be sampling from non-WT HiZ CCS depth surfaces without disabling compression. v2: Use isl_aux_usage_has_ccs() instead of open coding (Nanley). v3: Use stricter condition on Gfx12.0 to avoid allocating buffer unnecessarily (Nanley). Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:14 +00:00
Nanley Chery	c5f01414da	anv,iris: Don't fast-clear 3D + Ys on gfx12.0 BSpec 46969 (r45602) tells us that we get no fast-clears for 3D: 3D/Volumetric surfaces do not support Fast Clear operation. For Y-tiled surfaces, we work around this in BLORP with convert_rt_from_3d_to_2d(). However, that function doesn't support Ys-tiling. We could modify our surface redescription code paths to support clearing entire Ys tiles, but we choose to hold off on the added complexity until we have a use-case. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38063>	2026-01-26 21:09:05 +00:00
Nanley Chery	525077f160	anv: Query the plane in anv_can_fast_clear_color() Instead of assuming the first plane, use anv_image_aspect_to_plane(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38063>	2026-01-26 21:09:05 +00:00
Nanley Chery	07539af097	intel/isl: Drop HIZ/MCS checks in CCS support query We'll use isl_surf_supports_ccs() in a scenario in which we want to check for CCS support without creating a HIZ or MCS surface beforehand. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38063>	2026-01-26 21:09:05 +00:00
Nanley Chery	ab07c4066a	intel: Add and use ISL_SURF_USAGE_PREFER_4K_ALIGNMENT Does nothing for now. This will be used in future patch where a 64K-aligned image may be selected over a 4K-aligned one. Follows the alignment request behavior specified in VkImageAlignmentControlCreateInfoMESA. Specifically, this preference does not override attempts by ISL to enable compression. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38063>	2026-01-26 21:09:04 +00:00
Nanley Chery	103ec323e3	anv: Ensure host-transfer tilings are supported by ISL ISL's tiled-memcpy functions don't support Yf, Ys, and Tile64. Remove those tilings when creating an image which will be used with host-image copies. The identical memory layout flag is checked by tests such as: dEQP-VK.image.host_image_copy.identical_memory_layout.optimal.bc5_snorm_block dEQP-VK.image.host_image_copy.query.linear.r16_unorm Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38063>	2026-01-26 21:09:03 +00:00
Nanley Chery	0e1cc2216d	anv: Disable multisampled host transfer support We don't actually handle this case. The next patch will limit the amount of tilings used when an image is created with VK_IMAGE_USAGE_HOST_TRANSFER_BIT_EXT. This prevents zink failures on DG2 for various multisampled test cases. For example: arb_internalformat_query2-internalformat-size-checks -auto -fbo Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38063>	2026-01-26 21:09:03 +00:00
José Roberto de Souza	1bd83ba819	intel/dev: Add INTEL_DEVICE_INFO_MMAP_MODE_INVALID Adding this mmap mode makes explicit in code that PAT compressed buffers should not be mmaped. Although there is no CPU access Xe KMD uAPI still requires a cpu_caching to be set, so setting WC. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34222>	2026-01-26 15:24:55 +00:00
José Roberto de Souza	ac23454d1c	anv: Move anv_bo_get_mmap_mode() to i915 backend That function is only called from i915 backend no needed to be on common code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34222>	2026-01-26 15:24:55 +00:00
José Roberto de Souza	60e38344a0	intel/dev: Remove INTEL_DEVICE_INFO_MMAP_MODE_UC This is not used and we don't have any future plans to use it, so removing it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34222>	2026-01-26 15:24:54 +00:00
Calder Young	895ff7fe92	Revert "anv,brw: Allow multiple ray queries without spilling to a shadow stack" Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This optimization doesn't work when the ray query index isn't uniform across the subgroup, which is something the spec allows. While there are some smart ways to fix this and still avoid unnecessary spilling, its not worth investing the time until we find a realtime raytracing workload that actually needs to use multiple live ray queries for something. Fixes: `1f1de7eb` ("anv,brw: Allow multiple ray queries without spilling to a shadow stack") Acked-by: Sagar Ghuge <sagar.ghuge@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39445>	2026-01-23 21:33:55 +00:00
Tapani Pälli	f66ff97d58	drirc/anv: implement steps to disable RHWO for Wa_14024015672 Disable RHWO by default for singlesample draws and for MSAA draws if a drirc key is set (avoid perf hit if not needed). Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39404>	2026-01-23 11:10:07 +00:00
Tapani Pälli	840e6e855b	anv: add handling for Wa_14026600921 This is the Xe3 version of the earlier workaround. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39404>	2026-01-23 11:10:07 +00:00
Sagar Ghuge	6aa3b70382	anv: Mark RootNodeOffset at 256B always This commit change the BVH layout a little so that we can load the BVH offset as constant rather than reading from memory. We have to force the instance leaves pointer at the end which gets used in copy.comp shader. Totals: Instrs: 54798 -> 54728 (-0.13%) Send messages: 3854 -> 3847 (-0.18%) Cycle count: 1915106 -> 1913954 (-0.06%); split: -0.07%, +0.01% Non SSA regs after NIR: 18594 -> 18575 (-0.10%) Totals from 7 (7.37% of 95) affected shaders: Instrs: 5532 -> 5462 (-1.27%) Send messages: 367 -> 360 (-1.91%) Cycle count: 132592 -> 131440 (-0.87%); split: -1.01%, +0.14% Non SSA regs after NIR: 1989 -> 1970 (-0.96%) PERCENTAGE DELTAS Shaders Instrs Send messages Cycle count Non SSA regs after NIR q2rtx-rt-pipeline 95 -0.13% -0.18% -0.06% -0.10% -------------------------------------------------------------------------------------- All affected 7 -1.27% -1.91% -0.87% -0.96% -------------------------------------------------------------------------------------- Total 95 -0.13% -0.18% -0.06% -0.10% Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39106>	2026-01-22 23:20:04 +00:00
Arzaq Naufail Khan	dc702671d9	anv: eliminate dead code Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39400>	2026-01-21 01:21:55 +00:00
Sagar Ghuge	8e85607130	anv/rt: Drop atomic operations on opacity flags Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Each node has their own opacity bits, so we don't need to track these opacity flags at header level. This commit also fixes the instance flag. Instance flag is 8bit wide, but we were always using 4 lower bits. Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39053>	2026-01-20 22:20:28 +00:00
Sagar Ghuge	61691034ac	anv/rt: Don't always set disableOpacityCull bit Setting this bit always might hurt performance. It might forces traversal to treat all leafs always valid. Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39053>	2026-01-20 22:20:28 +00:00
Lionel Landwerlin	a7d7492f10	anv: enable debug printfs on internal shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39399>	2026-01-20 12:19:41 +00:00
Lionel Landwerlin	61b35c9d2b	anv: remove all kinds of useless info for internal shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39399>	2026-01-20 12:19:41 +00:00
José Roberto de Souza	132bcbee74	anv/hasvk: Add intel_perf_get_configuration_id() and replace intel_perf_load_configuration() usage We have no usage of the information returned by intel_perf_load_configuration(). It is only used to add a copy of the configuration so we have the metric id but we could instead get the metric id from sysfs, that is added by mdapi. Xe KMD don't have a uAPI to query the metrics configuration, so using sysfs also fixes the integration of mdapi with Xe KMD. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	5b39137ba0	anv/hasvk: Nuke register_config from anv_performance_configuration_intel There is no usage for register_config outside of anv_AcquirePerformanceConfigurationINTEL(), so we don't need to store it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
Tapani Pälli	ab9d3528dc	anv: fix queue check in anv_blorp_execute_on_companion on xe3 Fixes: dEQP-VK.api.copy_and_blit.dedicated_allocation.resolve_image.whole_copy_before_resolving_transfer.2_bit Otherwise we attempt to use blorp and hit various asserts later in: - blorp_copy_supports_blitter - blorp_xy_block_copy_blt Fixes: `61287b00f3` ("anv: Stop using RCS companion for MSAA copy/clear on Xe3+") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39346>	2026-01-18 17:19:05 +00:00

... 2 3 4 5 6 ...

7014 commits