fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 14:48:12 +02:00

Author	SHA1	Message	Date
Rohan Garg	7b6fc2c9cd	hasvk: enable single texel alignment Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23321> (cherry picked from commit `732db2b60c`)	2023-06-02 19:34:01 +01:00
Rohan Garg	bdc40be418	anv: fix incorrect asserts when combining CPS and per sample interpolation CPS is dynamically turned off when per sample interpolation is active. Update the asserts to reflect this. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5644011f06` ("intel/compiler: Convert wm_prog_key::persample_interp to a tri-state") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23103> (cherry picked from commit `ef2b763d9c`)	2023-06-02 19:34:01 +01:00
Lionel Landwerlin	841d63ab08	anv: fix null descriptor handling with A64 messages global load/store (or A64 messages) need the NIR bound checking which is enabled by "robust" behavior even when robust behavior is disabled. Many thanks to Christopher Snowhill for pointing out the pushed constant related issue with the initial version of this patch. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21645> (cherry picked from commit `efcda1c530`)	2023-06-02 19:34:00 +01:00
Lionel Landwerlin	0340da7c82	anv: fix push range for descriptor offsets Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `379b9bb7b0` ("anv: Support fetching descriptor addresses from push constants") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21645> (cherry picked from commit `e1ffa067d3`)	2023-06-02 19:34:00 +01:00
Rohan Garg	a913343de5	anv: enable single texel alignment Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23263> (cherry picked from commit `f6a83ec988`)	2023-06-01 16:52:36 +01:00
Francisco Jerez	a825322179	anv: Fix calculation of guardband clipping region. The existing guardband region calculation was mixing up x/y_min with x/y_max in cmd_buffer_emit_viewport(), causing the calculated viewport area to always be an empty region. Luckily intel_calculate_guardband_size() returns a non-empty but bogus guardband region in that case, so this doesn't seem to have led to conformance regressions, but the off-center guardbands could potentially impact performance in geometry-heavy rendering. Fixes: `893fa30afe` ("anv: Include scissors in viewport calculations") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23174> (cherry picked from commit `9c26a6b3bb`)	2023-06-01 16:49:22 +01:00
Lionel Landwerlin	28f89e96cd	intel/fs: fix size_read() for LOAD_PAYLOAD With Anv/Zink, the piglit test arb_shader_storage_buffer_object-max-ssbo-size -auto -fbo fsexceed is failing validation after copy propagation: load_payload(8) vgrf15:F, vgrf1+0.12<0>:F, vgrf1+0.0<0>:F, vgrf1+0.4<0>:F, vgrf1+0.8<0>:F, vgrf1+0.12<0>:F ../src/intel/compiler/brw_fs_validate.cpp:191: A <= B failed A = inst->src[i].offset / REG_SIZE + regs_read(inst, i) = 2 B = alloc.sizes[inst->src[i].nr] = 1 In most cases it works because src[0] would be at offset 0 and so reading a full reg passes validation, but Anv/Zink started emitting slightly different code adding an offset, which makes the size read span 2 GRFs. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23126> (cherry picked from commit `21c7b55f6f`)	2023-05-25 14:06:13 +01:00
Lionel Landwerlin	7a6aef60b6	anv: fix push descriptor deferred surface state packing Yuzu is running into a segfault because it writes the push descriptor twice with 2 different layouts, but without a draw/dispatch in between. First vkCmdPushDescriptorSetKHR() writes descriptor 0 & 1 with a uniform buffer. We toggle the 2 first bits of anv_descriptor_set::generate_surface_states. Second vkCmdPushDescriptorSetKHR() writes descriptor 0 with uniform buffer and descriptor 1 with an image view. The first bit of anv_descriptor_set::generate_surface_states stays, but the second bit was already set before and it should now be off. When we finally flush the push descriptor, we try to generate a surface state for descriptor 1, but there is no valid buffer view for it, we access an invalid pointer and segfault. This fix resets the anv_descriptor_set::generate_surface_states when the descriptor layout changes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b49b18f0b7` ("anv: reduce BT emissions & surface state writes with push descriptors") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23156> (cherry picked from commit `cab7ba00e2`)	2023-05-25 14:06:13 +01:00
Kenneth Graunke	fb39004b72	intel/compiler: Fix 64-bit ufind_msb, find_lsb, and bit_count We only support 32-bit versions of ufind_msb, find_lsb, and bit_count, so we need to lower them via nir_lower_int64. Previously, we were failing to do so on platforms older than Icelake and let those operations fall through to nir_lower_bit_size, which used a callback to determine it should lower them for bit_size != 32. However, that pass only emulates small bit-size operations by promoting them to supported, larger bit-sizes (i.e. 16-bit using 32-bit). It doesn't support emulating larger operations (i.e. 64-bit using 32-bit). So nir_lower_bit_size would just u2u32 the 64-bit source, causing us to flat ignore half of the bits. Commit `78a195f252` (intel/compiler: Postpone most int64 lowering to brw_postprocess_nir) provoked this bug on Icelake and later as well, by moving the nir_lower_int64 handling for ufind_msb until late in compilation, allowing it to reach nir_lower_bit_size which broke it. To fix this, we always set int64 lowering for these opcodes, and also correct the nir_lower_bit_size callback to ignore 64-bit operations. Cc: mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23123> (cherry picked from commit `a2d384a5c0`)	2023-05-25 14:06:12 +01:00
José Roberto de Souza	1b4720b305	anv: Fix ANV_BO_ALLOC_NO_LOCAL_MEM flag VK_MEMORY_PROPERTY_DEVICE_LOCAL_BIT is also set in all memory types of integrated GPUs. This flag means that memory will be allocated in the most efficient place for the GPU to access, which is true in integrated GPUs. However, this was causing ANV_BO_ALLOC_WRITE_COMBINE to be set in integrated GPUs in the block right below when allocating in the non-cached memory type. But the comment only talks about lmem, so to still keep the write combine behavior for iGPUs, VkMemoryPropertyFlags is now used in mmap_calc_flags(). Additionally, this was causing anv_bo.has_implicit_ccs to always be set, which could change the expected behavior of anv_BindImageMemory2() in MTL. Fixes: `fbd32a04da` ("anv: add a third memory type for LLC configuration") added a new heap Fixes: `582bf4d9f7` ("anv: flag BO for write combine when CPU visible and potentially in lmem") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22483> (cherry picked from commit `a6c5746b37`)	2023-05-25 14:06:12 +01:00
Lionel Landwerlin	d8290f5f38	anv: mark images compressed for untracked layout/access Most of the compressed writes are tracked by the driver, for instance: - blorp writes - render target writes But we don't have any tracking for storage images (which have gained compression support on DG2+). So inspect the layout transition and when we see a layout/access that can do writes outside of our driver tracking, update the image state tracking. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8946 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22988> (cherry picked from commit `1a89b1a301`)	2023-05-25 14:06:12 +01:00
Lionel Landwerlin	754e0e2176	anv: put private binding BOs into execlists Otherwise, all the reads/writes go to the scratch page on i915. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f9fa09ec92` ("anv/image: Add ANV_IMAGE_MEMORY_BINDING_PRIVATE") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22957> (cherry picked from commit `7f7b2fc53a`)	2023-05-25 14:06:12 +01:00
Tapani Pälli	8fe0aea591	anv: handle missing astc for gfx125 in CreateImageView Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22818> (cherry picked from commit `b0b6811b9b`)	2023-05-25 14:06:10 +01:00
Matt Turner	f93ea427b2	intel: Disable shader cache when executing intel_clc during the build With the shader cache enabled, intel_clc attempts to write to ~/.cache. Many distributions' build systems limit file-system access, and will kill the process thus causing the build to fail. Fixes: `639665053f` ("anv/grl: Build OpenCL kernels") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22968> (cherry picked from commit `435a607909`)	2023-05-25 14:06:10 +01:00
Lionel Landwerlin	c8c7dd030f	anv: fixup workaround 16011411144 We're missing it for the memcpy with streamout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5cc4075f95` ("anv, iris: Add Wa_16011411144 for DG2") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22930> (cherry picked from commit `7381405095`)	2023-05-25 14:06:10 +01:00
Daniel Schürmann	0dff70ecb1	vulkan/pipeline_cache: don't log warnings for client-invisible caches Fixes: `d3f06cf5ce` ('vulkan/pipeline_cache: don't log warnings for internal caches') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22986> (cherry picked from commit `8bfd18b8c5`)	2023-05-25 14:06:10 +01:00
Daniel Schürmann	2510200d5e	vulkan/pipeline_cache: don't log warnings for internal caches Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22850> (cherry picked from commit `d3f06cf5ce`)	2023-05-25 14:06:10 +01:00
Tapani Pälli	9a1bc5e34a	isl: fix layout for comparing surf and view properties These asserts were checking isl_format_layout against itself, change to compare surface format layout against view format layout. Fixes: `628bfaf1c6` ("intel/isl: Add some sanity checks for compressed surfaces") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22790> (cherry picked from commit `c35d430460`)	2023-05-05 19:07:15 +01:00
Lionel Landwerlin	6e3d86c809	intel/fs: fix scheduling of HALT instructions With the following test : dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.no_out_of_bounds_load There is a : shader_start: ... <- no control flow g0 = some_alu g1 = fbl g2 = broadcast g3, g1 g4 = get_buffer_size g2 ... <- no control flow halt <- on some lanes g5 = send <surface>, g4 eliminate_find_live_channel will remove the fbl/broadcast because it assumes lane0 is active at get_buffer_size : shader_start: ... <- no control flow g0 = some_alu g4 = get_buffer_size g0 ... <- no control flow halt <- on some lanes g5 = send <surface>, g4 But then the instruction scheduler will move the get_buffer_size after the halt : shader_start: ... <- no control flow halt <- on some lanes g0 = some_alu g4 = get_buffer_size g0 g5 = send <surface>, g4 get_buffer_size pulls the surface index from lane0 in g0 which could have been turned off by the halt and we end up accessing an invalid surface handle. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20765> (cherry picked from commit `9471ffa70a`)	2023-05-05 19:07:14 +01:00
Sviatoslav Peleshko	571992af74	anv: Improve image/view usage bits verification This change makes usage bits verification closer to the Vulkan spec. i.e. VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT does not always require all formats to support all the requested usage bits. Also, VK_IMAGE_CREATE_EXTENDED_USAGE_BIT, when combined with VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT can relax the requirements for the usage supported by the original image format. v2: Removed strict verification of the format_list_info formats usage per chadversary's suggestion. Other minor style/comments tweaks. v3: Added checking of all compatible formats when VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT and VK_IMAGE_CREATE_EXTENDED_USAGE_BIT are specified, but no list of possible formats was given. v4: Add VK_IMAGE_CREATE_BLOCK_TEXEL_VIEW_COMPATIBLE_BIT handling. Cc: 22.2 <mesa-stable> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6031 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `697ed61e7c`)	2023-05-05 16:02:30 +01:00
Sviatoslav Peleshko	b189a72fe4	anv: Handle UNDEFINED format in image format list It's not invalid to have this value in the list, but the only case it is actually valid as format in the creation of an image or image view is with Android Hardware Buffers which have their format specified externally. So we can just ignore all entries with VK_FORMAT_UNDEFINED. Cc: 22.2 <mesa-stable> Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `9899151361`)	2023-05-05 16:02:23 +01:00
Sviatoslav Peleshko	1122231f95	isl: Check all channels in isl_formats_have_same_bits_per_channel Cc: 22.2 <mesa-stable> Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `0ed8a48ce9`)	2023-05-05 16:02:14 +01:00
Michel Dänzer	cc3914c472	vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definitions To match the declarations (and the corresponding definition in Vulkan headers). Pointed out by GCC 13, e.g.: ../src/intel/vulkan_hasvk/anv_formats.c:1589:6: error: conflicting types for 'anv_GetPhysicalDeviceSparseImageFormatProperties' due to enum/integer mismatch; have 'void(struct VkPhysicalDevice_T , VkFormat, VkImageType, uint32_t, VkImageUsageFlags, VkImageTiling, uint32_t , VkSparseImageFormatProperties )' {aka 'void(struct VkPhysicalDevice_T , VkFormat, VkImageType, unsigned int, unsigned int, VkImageTiling, unsigned int , VkSparseImageFormatProperties )'} [-Werror=enum-int-mismatch] 1589 \| void anv_GetPhysicalDeviceSparseImageFormatProperties( \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In file included from ../src/intel/vulkan_hasvk/anv_private.h:113, from ../src/intel/vulkan_hasvk/anv_formats.c:24: src/intel/vulkan_hasvk/anv_entrypoints.h:120:30: note: previous declaration of 'anv_GetPhysicalDeviceSparseImageFormatProperties' with type 'void(struct VkPhysicalDevice_T , VkFormat, VkImageType, VkSampleCountFlagBits, VkImageUsageFlags, VkImageTiling, uint32_t , VkSparseImageFormatProperties )' {aka 'void(struct VkPhysicalDevice_T , VkFormat, VkImageType, VkSampleCountFlagBits, unsigned int, VkImageTiling, unsigned int , VkSparseImageFormatProperties )'} 120 \| VKAPI_ATTR void VKAPI_CALL anv_GetPhysicalDeviceSparseImageFormatProperties(VkPhysicalDevice physicalDevice, VkFormat format, VkImageType type, VkSampleCountFlagBits samples, VkImageUsageFlags usage, VkImageTiling tiling, uint32_t* pPropertyCount, VkSparseImageFormatProperties* pProperties); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22718> (cherry picked from commit `6c7400f4e8`)	2023-05-05 15:51:26 +01:00
Lionel Landwerlin	0e6d046f2d	intel/compiler: make uses_pos_offset a tri-state This value depends on the per-sample value which can be unknown at compile time with graphics pipeline libraries. So we need to have this dynamic as well and pick the right value when generating the 3DSTATE_PS/3DSTATE_WM packet. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `d8dfd153c5` ("intel/fs: Make per-sample and coarse dispatch tri-state") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22728> (cherry picked from commit `5489033fa8`)	2023-05-05 15:34:31 +01:00
Lionel Landwerlin	46d57d5de9	intel/fs: fix per vertex input clamping Only apply the clamp in multi patch mode (where the input vertices vary between [1, 32]). The clamp NIR pass operates on lowered intrinsics so we need to call it after the inputs have been lowered. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e25e17dd0c` ("intel/fs: clamp per vertex input accesses to patchControlPoints") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8912 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22701> (cherry picked from commit `7ddc31c672`)	2023-05-01 09:02:33 +01:00
Lionel Landwerlin	ddd39dbf89	anv: fix anv_nir_lower_ubo_loads pass In order to use load_global_const_block_intel we need to ensure the 64bit address in src[0] is uniform. This is not the case in the vkd3d-proton test_bindless_cbv tests for example. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22624> (cherry picked from commit `9fb9ae5ac6`)	2023-05-01 09:02:22 +01:00
Tapani Pälli	025b6a79c6	anv: implement state cache invalidate for Wa_16013063087 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22651> (cherry picked from commit `e501b31e15`)	2023-04-26 17:37:26 +01:00
Tapani Pälli	aa48b8dc8d	anv: cleanup bitmask construction for PIPELINE_SELECT Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22651> (cherry picked from commit `72fc56aa37`)	2023-04-26 17:37:26 +01:00
Lionel Landwerlin	ba5c0f0ffd	anv: rework Wa_14017076903 to only apply with occlusion queries Fixes KHR-GL46.transform_feedback.* tests with zink+anv on DG2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c34916f841` ("anv: implement occlusion query related Wa_14017076903") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22586> (cherry picked from commit `56840e4c89`)	2023-04-26 17:37:25 +01:00
Jordan Justen	b8a7fba41f	intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops For MTL (verx10 == 125), float64 is supported, but int64 is not. Therefore we need to lower cluster broadcast using 32-bit int ops. For gfx12.5+ platforms that support int64, the register regions used by cluster broadcast aren't supported by the 64-bit pipeline. On MTL, dEQP-VK.subgroups.clustered._double and dEQP-VK.subgroups.clustered._dvec were failing to validate the compiled shader in debug mode, and reportedly gpu-hanging in release mode. With this change dEQP-VK.subgroups.clustered._double passed all 48 tests and dEQP-VK.subgroups.clustered._dvec passed all 140 tests on MTL. Rework: * Move from generator to brw_fs_lower_regioning.cpp. (Suggested by Francisco) * Apply to verx10 >= 125.. (Suggested by Francisco) Cc: 23.1 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> (v1) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22569> (cherry picked from commit `fcb72ffd0c`)	2023-04-26 17:37:25 +01:00
Lionel Landwerlin	3eeb4bedfa	isl: fix a number of errors on storage format support on Gfx9/12.5 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22302> (cherry picked from commit `d4f498a583`)	2023-04-19 14:37:56 +01:00
Tapani Pälli	0ffe4381b1	isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125 Same/similar issues are seen on MTL platform as DG2 so disable for both. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22435> (cherry picked from commit `d561bac6bb`)	2023-04-19 14:37:56 +01:00
Lionel Landwerlin	846080db5d	isl: don't set inconsistent fields for depth when using stencil only Since Gfx12+ 3DSTATE_STENCIL_BUFFER gained its own Width/Depth/Format/etc... fields. So don't set those fields but leave the address/pitch to 0. Issue found on simulation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637> (cherry picked from commit `3ca1fdc8b5`)	2023-04-19 14:37:56 +01:00
Felix DeGrood	1ec9bcd844	anv: disable reset query pools using blorp opt on MTL This optimization causes some MTL tests to run forever. Not yet sure why. Disabling optimization until we have a fix. Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22373> (cherry picked from commit `0a52002a1c`)	2023-04-16 22:38:09 +01:00
Lionel Landwerlin	4ed3b1ae6b	intel/vec4: force exec_all on float control instruction Applying the same rule as the fs backend so that generation code doesn't assert. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `daa8003e45` ("intel/fs: use nomask for setting cr0 for float controls") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22473> (cherry picked from commit `08cf224c4a`)	2023-04-16 22:37:09 +01:00
Tapani Pälli	b967cbba57	intel/compiler: use intel_needs_workaround for Wa_14012437816 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22437>	2023-04-13 07:33:50 +00:00
Tapani Pälli	ccf16693e1	intel/fs: use intel_needs_workaround for Wa_22013689345 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22437>	2023-04-13 07:33:50 +00:00
Lionel Landwerlin	66edd030ab	anv: add utrace tracking of frame boundaries Based on vkQueuePresentKHR calls. It just helps spotting the beginning/end of a frame in perfetto when apps are using 3/4 command buffers per frame. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22276>	2023-04-13 01:14:38 +00:00
Lionel Landwerlin	da6842007f	intel/ds: add a new timeline row for frames Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22276>	2023-04-13 01:14:38 +00:00
Lionel Landwerlin	68bba1539f	anv: exclude performance queries from blorp clears The query buffer contains a batch to implement the multi pass replay/accumulation of results. So we can't clear it with a memset. An optimization for later would be to move the batches to the very end of the query buffer so we can clear the query data without touching the batches. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4dc7256bf9` ("anv: reset query pools using blorp") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22421>	2023-04-13 00:44:29 +00:00
José Roberto de Souza	b1299f42ff	anv: Fix vm bind of imported buffers Imported buffers may be created in a device with different memory alignment and this can cause vm bind to fail because bo size can be smaller than the calculated vm bind range using the importer device memory alignment. So add actual_size to anv_bo; it is set to the actual size of the bo allocated by kmd for bos allocated in the current device. For other bos, the lseek or the Vulkan API size will be used. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22219>	2023-04-12 10:05:32 +00:00
Lionel Landwerlin	daa8003e45	intel/fs: use nomask for setting cr0 for float controls The instructions manipulating cr0 use the default mask on lane0. So if for some reason that lane is disabled in some of the dispatches, we can end up not executing the instructions. Fixes flakiness in dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.uniform_float_32_to_16.uniform_matrix_float_rtz_frag Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22314>	2023-04-11 11:01:31 +00:00
Lionel Landwerlin	cff71ae8ff	anv: fixup streamout write barriers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8796 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22336>	2023-04-11 09:53:10 +00:00
Daniel Schürmann	53eb3ad375	vulkan/pipeline_cache: add cache parameter to deserialize() function This allows for secondary cache lookups during deserialization. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21967>	2023-04-10 09:14:30 +00:00
Daniel Schürmann	5daff41e27	vulkan/pipeline_cache: remove vk_device from vk_pipeline_cache_object It is not necessary to store the extra pointer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21967>	2023-04-10 09:14:30 +00:00
Kenneth Graunke	98bcf650f1	intel/compiler: Use nir_dest_bit_size() for ballot bit size check There's no guarantee that this is a SSA value. Use the helper to handle both SSA values and register correctly. Otherwise we read trash when we encounter a register and make bad decisions on types, possibly leading to our destination being UQ typed when the VGRF is only 32-bit. Fixes compilation with -Dintel-clc=enabled since `7f6491b76d` (nir: Combine if_uses with instruction uses) but the bug is much older than that, circa 2017. We were just getting lucky before. Fixes: `069bf7c907` ("i965/fs: Match destination type to size for ballot") Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22374>	2023-04-07 19:28:56 -07:00
Alyssa Rosenzweig	7f6491b76d	nir: Combine if_uses with instruction uses Every nir_ssa_def is part of a chain of uses, implemented with doubly linked lists. That means each requires 2 * 64-bit = 16 bytes per def, which is memory intensive. Together they require 32 bytes per def. Not cool. To cut that memory use in half, we can combine the two linked lists into a single use list that contains both regular instruction uses and if-uses. To do this, we augment the nir_src with a boolean "is_if", and reimplement the abstract if-uses operations on top of that list. That boolean should fit into the padding already in nir_src so should not actually affect memory use, and in the future we sneak it into the bottom bit of a pointer. However, this creates a new inefficiency: now iterating over regular uses separate from if-uses is (nominally) more expensive. It turns out virtually every caller of nir_foreach_if_use(_safe) also calls nir_foreach_use(_safe) immediately before, so we rewrite most of the callers to instead call a new single `nir_foreach_use_including_if(_safe)` which predicates the logic based on `src->is_if`. This should mitigate the performance difference. There's a bit of churn, but this is largely a mechanical set of changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	4fa2924610	anv,hasvk: Use vk_features2_to_features Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22217>	2023-04-07 18:16:40 -04:00
Felix DeGrood	4dc7256bf9	anv: reset query pools using blorp Previously we used PC to set query data to 0 during CmdResetQueryPool. This was slow when clearing large query pools. Switching to blorp to clear pools is faster for large query pools. Red Dead Redemption 2: +1.5% speedup Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Lionel Landwerlin	bb49610973	anv: replace query flush before gpu copy by semaphore wait All the flushes should already have happened, we just need CS to wait for the operations to complete. Just use a MI_SEMAPHORE_WAIT to check the availability bit is set. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
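
Commit fb39004b72 above explains that 64-bit ufind_msb, find_lsb, and bit_count must be lowered to 32-bit operations, and that simply truncating the 64-bit source (the u2u32 behaviour it describes) ignores half of the bits. The following is a minimal C sketch of that idea, not Mesa's nir_lower_int64 code: every function name here is hypothetical, and the loop-based 32-bit helpers merely stand in for the natively supported 32-bit operations. The return conventions (-1 for a zero input) are an assumption chosen for the illustration.

```c
#include <stdint.h>

/* Stand-ins for the natively supported 32-bit operations (hypothetical names). */
static int32_t ufind_msb32(uint32_t v)
{
    int32_t idx = -1;               /* -1 when no bit is set (assumed convention) */
    while (v) { v >>= 1; idx++; }
    return idx;
}

static int32_t find_lsb32(uint32_t v)
{
    if (v == 0)
        return -1;                  /* -1 when no bit is set (assumed convention) */
    int32_t idx = 0;
    while (!(v & 1u)) { v >>= 1; idx++; }
    return idx;
}

static uint32_t bit_count32(uint32_t v)
{
    uint32_t n = 0;
    while (v) { v &= v - 1; n++; }  /* clear the lowest set bit each iteration */
    return n;
}

/* 64-bit versions built from the two 32-bit halves. */
static int32_t ufind_msb64(uint64_t v)
{
    uint32_t lo = (uint32_t)v, hi = (uint32_t)(v >> 32);
    return hi ? 32 + ufind_msb32(hi) : ufind_msb32(lo);
}

static int32_t find_lsb64(uint64_t v)
{
    uint32_t lo = (uint32_t)v, hi = (uint32_t)(v >> 32);
    if (lo)
        return find_lsb32(lo);
    return hi ? 32 + find_lsb32(hi) : -1;
}

static uint32_t bit_count64(uint64_t v)
{
    return bit_count32((uint32_t)v) + bit_count32((uint32_t)(v >> 32));
}

/* The failure mode: truncate to 32 bits first, dropping the high half. */
static int32_t ufind_msb64_truncating(uint64_t v)
{
    return ufind_msb32((uint32_t)v);
}
```

For an input with only bit 40 set, ufind_msb64 looks at the high half and reports 40, while the truncating variant sees a zero low half and reports that no bit is set.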

9418 commits