fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 16:58:10 +02:00

Author	SHA1	Message	Date
Daniel Schürmann	2510200d5e	vulkan/pipeline_cache: don't log warnings for internal caches Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22850> (cherry picked from commit `d3f06cf5ce`)	2023-05-25 14:06:10 +01:00
Tapani Pälli	9a1bc5e34a	isl: fix layout for comparing surf and view properties These asserts were checking isl_format_layout against itself, change to compare surface format layout against view format layout. Fixes: `628bfaf1c6` ("intel/isl: Add some sanity checks for compressed surfaces") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22790> (cherry picked from commit `c35d430460`)	2023-05-05 19:07:15 +01:00
Lionel Landwerlin	6e3d86c809	intel/fs: fix scheduling of HALT instructions With the following test : dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.no_out_of_bounds_load There is a : shader_start: ... <- no control flow g0 = some_alu g1 = fbl g2 = broadcast g3, g1 g4 = get_buffer_size g2 ... <- no control flow halt <- on some lanes g5 = send <surface>, g4 eliminate_find_live_channel will remove the fbl/broadcast because it assumes lane0 is active at get_buffer_size : shader_start: ... <- no control flow g0 = some_alu g4 = get_buffer_size g0 ... <- no control flow halt <- on some lanes g5 = send <surface>, g4 But then the instruction scheduler will move the get_buffer_size after the halt : shader_start: ... <- no control flow halt <- on some lanes g0 = some_alu g4 = get_buffer_size g0 g5 = send <surface>, g4 get_buffer_size pulls the surface index from lane0 in g0 which could have been turned off by the halt and we end up accessing an invalid surface handle. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20765> (cherry picked from commit `9471ffa70a`)	2023-05-05 19:07:14 +01:00
Sviatoslav Peleshko	571992af74	anv: Improve image/view usage bits verification This change makes usage bits verification closer to the Vulkan spec. i.e. VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT does not always require all formats to support all the requested usage bits. Also, VK_IMAGE_CREATE_EXTENDED_USAGE_BIT, when combined with VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT can relax the requirements for the usage supported by the original image format. v2: Removed strict verification of the format_list_info formats usage per chadversary's suggestion. Other minor style/comments tweaks. v3: Added checking of all compatible formats when VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT and VK_IMAGE_CREATE_EXTENDED_USAGE_BIT are specified, but no list of possible formats was given. v4: Add VK_IMAGE_CREATE_BLOCK_TEXEL_VIEW_COMPATIBLE_BIT handling. Cc: 22.2 <mesa-stable> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6031 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `697ed61e7c`)	2023-05-05 16:02:30 +01:00
Sviatoslav Peleshko	b189a72fe4	anv: Handle UNDEFINED format in image format list It's not invalid to have this value in the list, but the only case it is actually valid as format in the creation of an image or image view is with Android Hardware Buffers which have their format specified externally. So we can just ignore all entries with VK_FORMAT_UNDEFINED. Cc: 22.2 <mesa-stable> Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `9899151361`)	2023-05-05 16:02:23 +01:00
Sviatoslav Peleshko	1122231f95	isl: Check all channels in isl_formats_have_same_bits_per_channel Cc: 22.2 <mesa-stable> Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17182> (cherry picked from commit `0ed8a48ce9`)	2023-05-05 16:02:14 +01:00
Michel Dänzer	cc3914c472	vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definitions To match the declarations (and the corresponding definition in Vulkan headers). Pointed out by GCC 13, e.g.: ../src/intel/vulkan_hasvk/anv_formats.c:1589:6: error: conflicting types for 'anv_GetPhysicalDeviceSparseImageFormatProperties' due to enum/integer mismatch; have 'void(struct VkPhysicalDevice_T , VkFormat, VkImageType, uint32_t, VkImageUsageFlags, VkImageTiling, uint32_t , VkSparseImageFormatProperties )' {aka 'void(struct VkPhysicalDevice_T , VkFormat, VkImageType, unsigned int, unsigned int, VkImageTiling, unsigned int , VkSparseImageFormatProperties )'} [-Werror=enum-int-mismatch] 1589 \| void anv_GetPhysicalDeviceSparseImageFormatProperties( \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In file included from ../src/intel/vulkan_hasvk/anv_private.h:113, from ../src/intel/vulkan_hasvk/anv_formats.c:24: src/intel/vulkan_hasvk/anv_entrypoints.h:120:30: note: previous declaration of 'anv_GetPhysicalDeviceSparseImageFormatProperties' with type 'void(struct VkPhysicalDevice_T , VkFormat, VkImageType, VkSampleCountFlagBits, VkImageUsageFlags, VkImageTiling, uint32_t , VkSparseImageFormatProperties )' {aka 'void(struct VkPhysicalDevice_T , VkFormat, VkImageType, VkSampleCountFlagBits, unsigned int, VkImageTiling, unsigned int , VkSparseImageFormatProperties )'} 120 \| VKAPI_ATTR void VKAPI_CALL anv_GetPhysicalDeviceSparseImageFormatProperties(VkPhysicalDevice physicalDevice, VkFormat format, VkImageType type, VkSampleCountFlagBits samples, VkImageUsageFlags usage, VkImageTiling tiling, uint32_t* pPropertyCount, VkSparseImageFormatProperties* pProperties); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22718> (cherry picked from commit `6c7400f4e8`)	2023-05-05 15:51:26 +01:00
Lionel Landwerlin	0e6d046f2d	intel/compiler: make uses_pos_offset a tri-state This value depends on the per-sample value which can be unknown at compile time with graphics pipeline libraries. So we need to have this dynamic as well and pick the right value when generating the 3DSTATE_PS/3DSTATE_WM packet. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `d8dfd153c5` ("intel/fs: Make per-sample and coarse dispatch tri-state") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22728> (cherry picked from commit `5489033fa8`)	2023-05-05 15:34:31 +01:00
Lionel Landwerlin	46d57d5de9	intel/fs: fix per vertex input clamping Only apply the clamp in multi patch mode (where the input vertices vary between [1, 32]). The clamp NIR pass operates on lowered intrinsics so we need to call it after the inputs have been lowered. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e25e17dd0c` ("intel/fs: clamp per vertex input accesses to patchControlPoints") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8912 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22701> (cherry picked from commit `7ddc31c672`)	2023-05-01 09:02:33 +01:00
Lionel Landwerlin	ddd39dbf89	anv: fix anv_nir_lower_ubo_loads pass In order to use load_global_const_block_intel we need to ensure the 64bit address in src[0] is uniform. This is not the case in the vkd3d-proton test_bindless_cbv tests for example. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22624> (cherry picked from commit `9fb9ae5ac6`)	2023-05-01 09:02:22 +01:00
Tapani Pälli	025b6a79c6	anv: implement state cache invalidate for Wa_16013063087 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22651> (cherry picked from commit `e501b31e15`)	2023-04-26 17:37:26 +01:00
Tapani Pälli	aa48b8dc8d	anv: cleanup bitmask construction for PIPELINE_SELECT Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22651> (cherry picked from commit `72fc56aa37`)	2023-04-26 17:37:26 +01:00
Lionel Landwerlin	ba5c0f0ffd	anv: rework Wa_14017076903 to only apply with occlusion queries Fixes KHR-GL46.transform_feedback.* tests with zink+anv on DG2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c34916f841` ("anv: implement occlusion query related Wa_14017076903") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22586> (cherry picked from commit `56840e4c89`)	2023-04-26 17:37:25 +01:00
Jordan Justen	b8a7fba41f	intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops For MTL (verx10 == 125), float64 is supported, but int64 is not. Therefore we need to lower cluster broadcast using 32-bit int ops. For gfx12.5+ platforms that support int64, the register regions used by cluster broadcast aren't supported by the 64-bit pipeline. On MTL, dEQP-VK.subgroups.clustered._double and dEQP-VK.subgroups.clustered._dvec were failing to validate the compiled shader in debug mode, and reportedly gpu-hanging in release mode. With this change dEQP-VK.subgroups.clustered._double passed all 48 tests and dEQP-VK.subgroups.clustered._dvec passed all 140 tests on MTL. Rework: * Move from generator to brw_fs_lower_regioning.cpp. (Suggested by Francisco) * Apply to verx10 >= 125.. (Suggested by Francisco) Cc: 23.1 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> (v1) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22569> (cherry picked from commit `fcb72ffd0c`)	2023-04-26 17:37:25 +01:00
Lionel Landwerlin	3eeb4bedfa	isl: fix a number of errors on storage format support on Gfx9/12.5 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22302> (cherry picked from commit `d4f498a583`)	2023-04-19 14:37:56 +01:00
Tapani Pälli	0ffe4381b1	isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125 Same/similar issues are seen on MTL platform as DG2 so disable for both. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22435> (cherry picked from commit `d561bac6bb`)	2023-04-19 14:37:56 +01:00
Lionel Landwerlin	846080db5d	isl: don't set inconsistent fields for depth when using stencil only Since Gfx12+ 3DSTATE_STENCIL_BUFFER gained its own Width/Depth/Format/etc... fields. So don't set those fields but leave the address/pitch to 0. Issue found on simulation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637> (cherry picked from commit `3ca1fdc8b5`)	2023-04-19 14:37:56 +01:00
Felix DeGrood	1ec9bcd844	anv: disable reset query pools using blorp opt on MTL This optimization causes some MTL tests to run forever. Not yet sure why. Disabling optimization until we have a fix. Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22373> (cherry picked from commit `0a52002a1c`)	2023-04-16 22:38:09 +01:00
Lionel Landwerlin	4ed3b1ae6b	intel/vec4: force exec_all on float control instruction Applying the same rule as the fs backend so that generation code doesn't assert. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `daa8003e45` ("intel/fs: use nomask for setting cr0 for float controls") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22473> (cherry picked from commit `08cf224c4a`)	2023-04-16 22:37:09 +01:00
Tapani Pälli	b967cbba57	intel/compiler: use intel_needs_workaround for Wa_14012437816 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22437>	2023-04-13 07:33:50 +00:00
Tapani Pälli	ccf16693e1	intel/fs: use intel_needs_workaround for Wa_22013689345 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22437>	2023-04-13 07:33:50 +00:00
Lionel Landwerlin	66edd030ab	anv: add utrace tracking of frame boundaries Based on vkQueuePresentKHR calls. It just helps spotting the beginning end of a frame in perfetto when apps are using 3/4 command buffers per frame. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22276>	2023-04-13 01:14:38 +00:00
Lionel Landwerlin	da6842007f	intel/ds: add a new timeline row for frames Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22276>	2023-04-13 01:14:38 +00:00
Lionel Landwerlin	68bba1539f	anv: exclude performance queries from blorp clears The query buffer contains a batch to implement the multi pass replay/accumulation of results. So we can't clear it with a memset. An optimization for later would be to move the batches to the very end of the query buffer so we can clear the query data without touching the batches. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4dc7256bf9` ("anv: reset query pools using blorp") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22421>	2023-04-13 00:44:29 +00:00
José Roberto de Souza	b1299f42ff	anv: Fix vm bind of imported buffers Imported buffers may be created in a device with different memory alignment and this can cause vm bind to fail because bo size can be smaller than the calculated vm bind range using the importer device memory alignment. So here adding actual_size to anv_bo, this will be set with the actual size of the bo allocated by kmd for bos allocate in the current device. For other bo the lseek or the Vulkan API size will be used. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22219>	2023-04-12 10:05:32 +00:00
Lionel Landwerlin	daa8003e45	intel/fs: use nomask for setting cr0 for float controls The instructions manipulating cr0 use the default mask on lane0. So if for some reason that lane is disabled in some of the dispatches, we can end up not executing the instructions. Fixes flakiness in dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.uniform_float_32_to_16.uniform_matrix_float_rtz_frag Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22314>	2023-04-11 11:01:31 +00:00
Lionel Landwerlin	cff71ae8ff	anv: fixup streamout write barriers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8796 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22336>	2023-04-11 09:53:10 +00:00
Daniel Schürmann	53eb3ad375	vulkan/pipeline_cache: add cache parameter to deserialize() function This allows for secondary cache lookups during deserialization. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21967>	2023-04-10 09:14:30 +00:00
Daniel Schürmann	5daff41e27	vulkan/pipeline_cache: remove vk_device from vk_pipeline_cache_object It is not necessary to store the extra pointer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21967>	2023-04-10 09:14:30 +00:00
Kenneth Graunke	98bcf650f1	intel/compiler: Use nir_dest_bit_size() for ballot bit size check There's no guarantee that this is a SSA value. Use the helper to handle both SSA values and register correctly. Otherwise we read trash when we encounter a register and make bad decisions on types, possibly leading to our destination being UQ typed when the VGRF is only 32-bit. Fixes compilation with -Dintel-clc=enabled since `7f6491b76d` (nir: Combine if_uses with instruction uses) but the bug is much older than that, circa 2017. We were just getting lucky before. Fixes: `069bf7c907` ("i965/fs: Match destination type to size for ballot") Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22374>	2023-04-07 19:28:56 -07:00
Alyssa Rosenzweig	7f6491b76d	nir: Combine if_uses with instruction uses Every nir_ssa_def is part of a chain of uses, implemented with doubly linked lists. That means each requires 2 * 64-bit = 16 bytes per def, which is memory intensive. Together they require 32 bytes per def. Not cool. To cut that memory use in half, we can combine the two linked lists into a single use list that contains both regular instruction uses and if-uses. To do this, we augment the nir_src with a boolean "is_if", and reimplement the abstract if-uses operations on top of that list. That boolean should fit into the padding already in nir_src so should not actually affect memory use, and in the future we sneak it into the bottom bit of a pointer. However, this creates a new inefficiency: now iterating over regular uses separate from if-uses is (nominally) more expensive. It turns out virtually every caller of nir_foreach_if_use(_safe) also calls nir_foreach_use(_safe) immediately before, so we rewrite most of the callers to instead call a new single `nir_foreach_use_including_if(_safe)` which predicates the logic based on `src->is_if`. This should mitigate the performance difference. There's a bit of churn, but this is largely a mechanical set of changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	4fa2924610	anv,hasvk: Use vk_features2_to_features Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22217>	2023-04-07 18:16:40 -04:00
Felix DeGrood	4dc7256bf9	anv: reset query pools using blorp Previously we used PC to set query data to 0 during CmdResetQueryPool. This was slow when clearing large query pools. Switching to blorp to clear pools is faster for large query pools. Red Dead Redemption 2: +1.5% speedup Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Lionel Landwerlin	bb49610973	anv: replace query flush before gpu copy by semaphore wait All the flushes should already have happened, we just need CS to wait for the operations to complete. Just use a MI_SEMAPHORE_WAIT to check the availability bit is set. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Lionel Landwerlin	abc4111d19	anv: pass steam output as argument for anv_dump_pipe_bits Just if you need to change it at some point ;) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	2415d57a99	anv/blorp: add flush reasons to RT flushes Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	43f93f5043	anv/blorp: implement anv_cmd_buffer_fill_area Implemented function to fill an area at an address. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	0130a4f667	anv/blorp: support surf generation for addresses Already have support for anv_buff. Extended to support addresses. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Ian Romanick	12e11fa3e4	intel/fs: White space fixes Trivial Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Ian Romanick	6dfb7061e0	intel/fs: Preserve meta data more often in brw_nir_move_interpolation_to_top This pass rarely makes any changes, so work a little harder to preserve more meta data. On my Ice Lake laptop (using a locked CPU speed and other measures to prevent thermal throttling, etc.) using a debugoptimized build, improves performance of Vulkan CTS "deqp-vk --deqp-case='dEQP-VK.spir'" by -0.2% ± 0.1% (n = 5, pooled s = 0.431885). v2: Add some parenthesis. Suggested by Lionel. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Ian Romanick	3037603b70	intel/fs: Linked list micro optimizations in brw_nir_move_interpolation_to_top Two linked list management changes: - Use the list head sentinel as the initial cursor. It is, after all, a proper node in the list. - Iterate the list of blocks starting with the second block instead of skipping the first block in the loop. On my Ice Lake laptop (using a locked CPU speed and other measures to prevent thermal throttling, etc.) using a release build, improves performance of compiling shaders from batman_arkham_city_goty.foz by -0.24% ± 0.09% (n = 5, pooled s = 0.324106). v2: Use nir_cursor instead of direct list manipulation. Suggested by Lionel. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Ian Romanick	78ee74de4a	intel/compiler: Micro optimize regions_overlap On my Ice Lake laptop (using a locked CPU speed and other measures to prevent thermal throttling, etc.) using a release build, improves performance of compiling shaders from batman_arkham_city_goty.foz by -1.09% ± 0.084% (n = 5, pooled s = 0.354471) Reduces the size of a release build by 26k. text data bss dec hex filename 23163641 400720 231360 23795721 16b1809 before/lib64/dri/iris_dri.so 23137264 400720 231360 23769344 16ab100 after/lib64/dri/iris_dri.so Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Ian Romanick	7873edee6e	intel/fs: Use specialized version of regions_overlap in opt_copy_propagation Since one of the register must always be either VGRF or FIXED_GRF, much of regions_overlap and reg_offset can be elided. On my Ice Lake laptop (using a locked CPU speed and other measures to prevent thermal throttling, etc.) using a debugoptimized build, improves performance of Vulkan CTS "deqp-vk --deqp-case='dEQP-VK.spir'" by -0.29% ± 0.097% (n = 5, pooled s = 0.361697). Using a release build, improves performance of compiling shaders from batman_arkham_city_goty.foz by -3.3% ± 0.04% (n = 5, pooled s = 0.178312). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Ian Romanick	43cb42df7c	intel/compiler: Micro optimize inst_is_in_block This function only exists in builds with assertions, so it only matters there. On my Ice Lake laptop (using a locked CPU speed and other measures to prevent thermal throttling, etc.) using a debugoptimized build, improves performance of Vulkan CTS "deqp-vk --deqp-case='dEQP-VK.spir'" by -5.2% ± 0.16% (n = 5, pooled s = 0.657887). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Ian Romanick	d47f521ee4	intel/compiler: Use NIR_PASS instead of NIR_PASS_V Reduce debug log spam by only logging the shader if a pass made some changes. This can also elide some nir_validate calls in debug builds. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Ian Romanick	fb950a9edf	intel/compiler: Remove one overload of backend_instruction::insert_before The version that takes a list of instructions is not used. I did not do any archaeology to find out when the last user was removed. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Tapani Pälli	44053c0947	intel/common: limit the amount of SLM with Wa_14017341140 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22307>	2023-04-06 10:54:47 +00:00
Rohan Garg	e21cca78ea	anv,blorp,iris: Set PreferredSLMAllocationSize on gfx125+ Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22307>	2023-04-06 10:54:47 +00:00
Rohan Garg	3b6dbf8902	intel/genxml: Add the preferred slm size enum for gen125 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22307>	2023-04-06 10:54:46 +00:00
Anuj Phogat	606a39f9d1	intel/genxml/125: Add preferred SLM allocation size field Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22307>	2023-04-06 10:54:46 +00:00


9402 commits