fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-06-14 15:48:20 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	fa1564fb87	intel/brw: Recreate GS output registers after EmitVertex Geometry shaders write outputs multiple times, with EmitVertex() between them. The value of output variables becomes undefined after calling EmitVertex(), so we don't need to preserve those. This lets us recreate new registers after each EmitVertex(), assuming we aren't in control flow, allowing them to have separate live ranges. It also means that those registers are more likely to be written once, rather than having multiple writes, which can make optimization easier. This is pretty much a total hack, but it's helpful. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29624>	2024-06-08 02:18:51 -07:00
Jianxun Zhang	9654aa4c31	intel/isl: Allow multi-sample on depth aux usage (xe2) The restriction on depth aux mode is gone on Xe2 in spec. Fix: piglit arb_post_depth_coverage-multisampling -auto -fbo isl_surface_state.c:723: isl_gfx20_surf_fill_state_s: Assertion `info->surf->samples == 1' failed. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29274>	2024-06-07 21:06:37 +00:00
Vignesh Raman	cea3aeefd0	ci: add farm variable for devices in collabora farm Add farm variable for devices in the collabora farm so that the LAVA job submitter uses this in structured log files. Signed-off-by: Vignesh Raman <vignesh.raman@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29583>	2024-06-07 14:32:49 +00:00
Rohan Garg	6f6da58315	intel/compiler: fix shuffle generation on LNL There doesn't seem to be a restriction on the mentioned data types on LNL anymore. Default to a maximum exec size of SIMD16. This patch fixes dEQP-VK.subgroups.shuffle.framebuffer.* on LNL Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29504>	2024-06-07 12:51:30 +00:00
Sviatoslav Peleshko	94989b45a5	anv,driconf: Add fake non device local memory WA for Total War: Warhammer 3 Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8721 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29127>	2024-06-07 04:14:10 +00:00
Nanley Chery	de22e20294	anv: Rely more on ISL_SURF_USAGE_DISABLE_AUX_BIT In order to support CCS, ISL may upgrade a main surface from Tile4 to Tile64 with miptails disabled. To avoid using this space consuming layout when not needed, inform ISL as soon as possible that compression won't be used. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29094>	2024-06-07 00:58:41 +00:00
Nanley Chery	fc57991b66	anv: Support multiple aspects in anv_formats_ccs_e_compatible Prevents the next patch from causing the following assert failure: Test case 'dEQP-VK.ycbcr.copy.g8_b8_r8_3plane_420_unorm.g8_b8_r8_3plane_444_unorm.linear_linear_disjoint'.. deqp-vk: ../../src/intel/vulkan/anv_private.h:4962: anv_aspect_to_plane: Assertion `!(aspect & ~all_aspects)' failed. We still disable CCS for multiplane formats elsewhere. I've attempted enabling CCS for those cases but end up with failures in CI that I cannot reproduce locally. Hopefully this change gets the next person a step closer towards enabling this feature. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29094>	2024-06-07 00:58:41 +00:00
Nanley Chery	14a0f7391d	anv,hasvk: Drop anv_get_isl_format_with_usage Since `3beaaa9ae8` ("anv: drop lowered storage images code"), this function has not used the VkImageUsageFlags parameter. So, we can drop it and simplify its callers. This function isn't used in hasvk. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29094>	2024-06-07 00:58:41 +00:00
Nanley Chery	3e9dc450a6	anv: Rely on the primary surf usage to disable aux Instead of passing isl_extra_usage_flags to add_aux_surface_if_supported, use the isl_surf::usage field of the primary surface to check for ISL_SURF_USAGE_DISABLE_AUX_BIT. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29094>	2024-06-07 00:58:41 +00:00
Nanley Chery	8e96b516ca	intel/isl: Assert alignments of surface addresses In the import paths in iris, there are several cases where surface VMAs are created without relying on the calculated surface alignment. Asserting the alignments of surface addresses, should help catch any cases where we end up with the wrong alignment. This found a couple issues during development. One which required a change to existing code is that when creating uncompressed surfaces from compressed ones, ISL will sometimes increase the image alignment as a result of the new format supporting CCS. This patch adds the usage flag to disable that behavior. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29094>	2024-06-07 00:58:41 +00:00
Nanley Chery	6b969a4b43	intel/isl: Add and use multi-engine surf usage bits Add and use two new surf usage bits: * ISL_SURF_USAGE_MULTI_ENGINE_SEQ_BIT: the surface may be accessed by multiple engines, but not in parallel. * ISL_SURF_USAGE_MULTI_ENGINE_PAR_BIT: the surface may be accessed by multiple engines in parallel. Both usages are not concerned with read-after-read access patterns. Using these bits allows ISL to conditionally use Tile64 or a 64KB alignment to account for the gfx12.5 CCS WA from HSD 22015614752. Apart from the potential space savings, there are three benefits of this approach: 1) CCS can now be used with miptails (though nothing makes use of this today). 2) CCS can now be used with 3D depth/stencil surfaces in GL. 3) CCS can now be used with 3D depth/stencil surfaces in Vulkan when apps only use a single queue. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11111 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11117 Tested-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29094>	2024-06-07 00:58:41 +00:00
Nanley Chery	53440554c4	intel/isl: Add and use ISL_MAIN_TO_CCS_SIZE_RATIO_XE In iris, use the CCS scale down factor to calculate the impact of CCS on TBIMR tile sizes. Even though we fall back to a seemingly less accurate method to calculate the impact of CCS, it ends up giving the same answer, 1bpp. Anv already uses this factor, so this patch replaces the constant with this macro. There are two benefits to doing this: 1) Consistency between anv and iris. 2) Preparation for a future where we no longer use ISL surfaces to describe CCS on Xe+. In fact, in iris, we already don't create such surfaces on ACM. I considered using INTEL_AUX_MAP_MAIN_SIZE_SCALEDOWN for the calculation in both drivers, but the naming is aux-map specific and the scaledown actually exists on flat-ccs platforms as well. So, we introduce a new macro for all Xe platforms, currently only used for the specific use case of TBIMR calculations. We can add more such macros for future platforms, as needed. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28942>	2024-06-06 23:57:52 +00:00
Nanley Chery	26655a137f	intel/aux_map: Add and use INTEL_AUX_MAP_MAIN_SIZE_SCALEDOWN Introduce a macro so that drivers don't need to rely on the isl_surf struct to determine the size of the CCS buffer on gfx12. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28942>	2024-06-06 23:57:52 +00:00
Nanley Chery	4ae50eaf70	intel/aux_map: Add and use INTEL_AUX_MAP_META_ALIGNMENT_B Introduce a macro defining the alignment which aux data start addresses should have. This alignment is for the worst case of the CCS buffer being included in a dmabuf. Although a smaller alignment is possible for non-dmabuf cases on TGL, no drivers would make use of that today as they place CCS surfaces directly after tiled surfaces. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28942>	2024-06-06 23:57:52 +00:00
Nanley Chery	e27d951527	intel/aux_map: Add and use INTEL_AUX_MAP_MAIN_PITCH_SCALEDOWN Introduce a macro so that drivers don't need to rely on the isl_surf struct to determine the pitch of the CCS buffer on gfx12. This is useful during layout queries of dmabufs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28942>	2024-06-06 23:57:52 +00:00
Nanley Chery	e9653b5833	anv: Refactor modifier plane layout queries Before this patch, we special-cased the clear color plane for layout queries. This was because that plane lacks an ISL surface whereas all others have one. We plan to drop the ISL surface for CCS buffers on gfx12 in a future commit. So, in preparation, generalize the clear color plane code to work for every plane queried on a surface that uses modifiers. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28942>	2024-06-06 23:57:52 +00:00
Nanley Chery	0194290bb5	intel/isl: Add and use ISL_DRM_CC_PLANE_PITCH_B At the interfaces which query the pitch of the clear color plane in GL and Vulkan, we've been returning 64B for various reasons. Unify the rationale under a macro. The documentation for the macro is picked from anv, which reflects the most recently synchronized copy of drm_fourcc.h. See the notable changes at `8cd8f3d697`. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28942>	2024-06-06 23:57:52 +00:00
Lionel Landwerlin	62c52fb59d	anv: expose VK_MESA_image_alignment_control Our implementation is a no-op for the following reasons : - ISL always tries to go for the smallest tiling mode (see isl_surf_choose_tiling()) - In the few cases where we need to use Tile64 for compression workarounds, VK_MESA_image_alignment_control doesn't require use to disable compression - vkd3d-proton has the ability to disable compression using VK_EXT_image_compression_control, disabling Tile64 requirements and ensuring ISL can select a 4k tiling mode So vkd3d-proton should always be able to get a 4k tiling mode if it wants to. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29175>	2024-06-06 19:00:47 +00:00
Jordan Justen	e02c6663e9	intel/tools: Fix intel_dev_info --hwconfig switch Since `a42a5bf87e`, we've been closing the file descriptor immediately after loading the devinfo struct. intel_get_and_print_hwconfig_table() re-queries the hwconfig info from the device to print out all the entries, so we need to leave the fd open for this use. I moved the close() call to all paths which exit the for loop's current iteration. Ref: `a42a5bf87e` ("intel/devinfo: add an option to pick platform to print") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29549>	2024-06-06 00:41:13 +00:00
Sagar Ghuge	2dba5d484b	intel/fs: Adjust destination register size for global atomic on Xe2+ For 16-bit data type, we are padding 16-bit and using 32-bit data type, so we need to account for the padded portion while calculating the size_written. Rework: (Rohan) - Drop unnecessary fs_builder instance Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29271>	2024-06-06 00:18:37 +00:00
Sagar Ghuge	55c7b24899	intel/fs: Adjust destination register size for untyped atomic on Xe2+ For 16-bit data type, we are padding 16-bit and using 32-bit data type, so we need to account for the padded portion while calculating the size_written. Rework: (Rohan) - Drop unnecessary fs_builder instance Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29271>	2024-06-06 00:18:37 +00:00
Jordan Justen	1fa84d34ef	intel/compiler: Don't set size written in brw_lower_logical_sends.cpp Rework: (Sagar) - Drop unused variable Suggested-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29271>	2024-06-06 00:18:37 +00:00
Zach Battleman	ecfe8b0f75	intel/brw: update Wa_1805992985 to use workarounds mechanism Replaced two instances of checking version 11 with the new workaround mechanism. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29560>	2024-06-05 23:45:33 +00:00
Zach Battleman	ddaa7c4221	intel/brw: update comment to accurately reflect intended behavior Removed mention of Wa_* when referencing an intended harware behavior since version 12. This will prevent the erroneous usage of the `intel_needs_workaround` in the future. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29559>	2024-06-05 23:23:30 +00:00
Iván Briano	1c6a6349b0	intel/brw: always read LAYER/VIEWPORT from the FS payload Following on https://gitlab.freedesktop.org/mesa/mesa/-/issues/9811 the restriction that kept us from using the payload values for non-mesh cases is gone, so just use the same codepath for everything. But since we have functions that correctly read those for all gens, use those instead of the broken hack we had until now. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9796 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29448>	2024-06-05 21:52:51 +00:00
Iván Briano	3d071fe7db	intel/brw: add fetch_viewport_index function Like fetch_render_target_array_index(), it reads the values provided by the FS payload. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29448>	2024-06-05 21:52:51 +00:00
Lionel Landwerlin	816b21cd87	anv: fix pipeline flag fields Using the wrong type truncate the top bits of the pipeline flags. Currently we don't have any bit in the top bits so not fixing any bug, but in the future new extension could add some. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `688bb37552` ("anv: deal with new pipeline flags") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29553>	2024-06-05 20:30:16 +00:00
Nanley Chery	53e77cef36	intel/blorp: Allow gfx12 fast-clears without CCS surf I'd like to phase out the ISL surface representation of CCS on gfx120 in order to enable CCS without a 512B-aligned main surface pitch. Remove the dependency on CCS ISL surfaces when fast-clearing to move drivers one step towards that goal. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28536>	2024-06-05 20:08:26 +00:00
Nanley Chery	18326211c3	intel/blorp: Factor bpb into the fast-clear rect The vertical alignment of the fast-clear rectangle shrinks as the bits-per-block of the CCS format increases. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28536>	2024-06-05 20:08:26 +00:00
Tapani Pälli	e6b24221af	anv: implement WA 14018283232 WA 14018283232 indicates that we need to emit the resource barrier when the following expression toggles value : STATE_DEPTH_BOUNDS::depthboundstestenable & 3DSTATE_PS_EXTRA:: Pixel Shader Kills Pixel & 3DSTATE_PS_EXTRA:: Pixel Shader Valid Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29297>	2024-06-05 15:22:25 +00:00
Rohan Garg	01faec2709	intel/genxml: Add RESOURCE_BARRIER for xe2 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29297>	2024-06-05 15:22:25 +00:00
Lionel Landwerlin	108e79db1a	anv: factor out some more gpu_memcpy setup We want to have all the setup/workaround in a single spot. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29297>	2024-06-05 15:22:25 +00:00
Lionel Landwerlin	d98c47ccc3	anv: rewrite Wa_18019816803 tracking to be more like state Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29297>	2024-06-05 15:22:25 +00:00
Kevin Chuang	9f22b31ce8	anv: toggle meshShaderQueries based on whether we support mesh_shader or not Fixes: `4c7f51d3` ("anv: implement mesh shader queries") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29538>	2024-06-05 04:35:11 +00:00
Sagar Ghuge	415c5ad989	intel/compiler: No need to re-type the destination register For 16-bit float case handling, intermediate destination register is already 32-bit wide, we don't have to retype it to 32-bit. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29506>	2024-06-04 18:07:44 +00:00
José Roberto de Souza	1e0a0b4dd5	anv: Initialize variable to fix static analyzer warning Static analyzer is complaning that tex_src could be not initialized and then used, this should not happen as an instruction with type of 'tex' type needs to have source a texture handle. But to make static analyzer happy here just initializing it to zero. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29530>	2024-06-04 17:38:17 +00:00
Faith Ekstrand	705dc133c2	hasvk: Advertise VK_EXT_shader_replicated_composites Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29509>	2024-06-04 16:34:48 +00:00
Faith Ekstrand	a7db1e80d0	anv: Advertise VK_EXT_shader_replicated_composites Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29509>	2024-06-04 16:34:48 +00:00
Kevin Chuang	4c7f51d3b4	anv: implement mesh shader queries Mesh shader queries include mesh-primitives-generated count and task/mesh shader pipeline statistics. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29523>	2024-06-04 16:24:48 +00:00
Kevin Chuang	b69f7f625b	anv: Update pipeline statistics mask for task/mesh shader invocations Since VkQueryPipelineStatisticFlagBits is extended by two bits for task/mesh shader invocations, ANV_PIPELINE_STATISTICS_MASK should be defined conditionally based on GFX_VER. This commit modifies the mask and updates the vk_pipeline_stat_to_reg array accordingly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29523>	2024-06-04 16:24:48 +00:00
Kevin Chuang	d07321e3d8	intel/genxml: add task/mesh shader statistics registers Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29523>	2024-06-04 16:24:48 +00:00
Lionel Landwerlin	d9567b5ee4	anv: fix Gfx9 fast clears on srgb formats Only MCS surfaces are affected because SRGB format are not listed as supporting CCS compression. Fixes CTS test : dEQP-VK.api.image_clearing.core.clear_color_attachment.single_layer._srgb_sample_count_* dEQP-VK.api.image_clearing.dedicated_allocation.clear_color_attachment.single_layer.srgb This is similar to what we did in Iris in `f8961ea0` ("iris: Disable sRGB fast-clears for non-0/1 values"). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10003 Fixes: `4cfb4f7d12` ("anv: support fast color clears on vkCmdClearAttachments") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29518>	2024-06-04 16:12:32 +00:00
Kevin Chuang	3349963645	anv: Properly handle cases for different query types in copy_query_results_with_shader Like it describes in the comment section of VK_QUERY_TYPE_OCCLUSION, only occlusion and timestamps queries needs ANV_COPY_QUERY_FLAG_PARTIAL. VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT is captured by MI commands. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29493>	2024-06-01 13:05:48 +00:00
Lionel Landwerlin	724bb7fa15	brw: better model READ_ARF_REG opcode This opcode gets translated to 2 ALU instructions with dependency ALU stall. This change reproduces the FS_OPCODE_PACK_HALF_2x16_SPLIT values which is another opcode that generates 2 instructions. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29446>	2024-05-31 20:22:27 +00:00
Lionel Landwerlin	ac03cefb28	brw: limit dependencies on SR register Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29446>	2024-05-31 20:22:27 +00:00
Lionel Landwerlin	d8b78924c5	brw: use a single virtual opcode to read ARF registers In `2c65d90bc8` I forgot to add the new SHADER_OPCODE_READ_MASK_REG opcode to the list of barrier instruction in the scheduler. Let's just use a single opcode for all ARF registers that need special scoreboarding and put the register as source (nicer for the debug output). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2c65d90bc8` ("intel/brw: ensure find_live_channel don't access arch register without sync") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29446>	2024-05-31 20:22:27 +00:00
Francisco Jerez	588c725f27	intel/xe2+: Enable native 64-bit integer arithmetic. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:14:01 -07:00
Ian Romanick	7b7e5cf5d4	nir/algebraic: intel/fs: Optimize some patterns before lowering 64-bit integers v2: Add some comments explaining some of the nuance of the shift optimizations. Fix a bug in the shift count calculation of the upper 32-bits. Move the @64 from the variable to the opcode. All suggested by Jordan. No shader-db changes on any Intel platform. fossil-db: Meteor Lake and DG2 had similar results. (Meteor Lake shown) Totals: Instrs: 154507026 -> 154506576 (-0.00%) Cycle count: 17436298868 -> 17436295016 (-0.00%) Max live registers: 32635309 -> 32635297 (-0.00%) Totals from 42 (0.01% of 632575) affected shaders: Instrs: 5616 -> 5166 (-8.01%) Cycle count: 133680 -> 129828 (-2.88%) Max live registers: 1158 -> 1146 (-1.04%) No fossil-db changes on any other Intel platform. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Ian Romanick	22095c60bc	nir/algebraic: Add nir_lower_int64_options::nir_lower_iadd3_64 This allows us to not generate 64-bit iadd3 on Intel but continue generating it for NVIDIA. No shader-db or fossil-db changes. v2: Add nir_lower_iadd3_64 flag so we can continue to generate 64-bit iadd3 on NVIDIA platforms. v3: s/bit_size == 64/s == 64/. This cut-and-paste bug prevented any of the optimizations from ever occuring. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29148>	2024-05-31 09:13:23 -07:00
Georg Lehmann	dcab408a6c	nir: remove unpack_half_flush_to_zero It doesn't make sense to have two sets of opcodes for this when all backends that support the flush_to_zero variant just rely on the global floating point mode anyway. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29433>	2024-05-31 09:46:35 +00:00

1 2 3 4 5 ...

12085 commits