fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 05:08:06 +02:00

Author	SHA1	Message	Date
Jordan Justen	fcb72ffd0c	intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops For MTL (verx10 == 125), float64 is supported, but int64 is not. Therefore we need to lower cluster broadcast using 32-bit int ops. For gfx12.5+ platforms that support int64, the register regions used by cluster broadcast aren't supported by the 64-bit pipeline. On MTL, dEQP-VK.subgroups.clustered._double and dEQP-VK.subgroups.clustered._dvec were failing to validate the compiled shader in debug mode, and reportedly gpu-hanging in release mode. With this change dEQP-VK.subgroups.clustered._double passed all 48 tests and dEQP-VK.subgroups.clustered._dvec passed all 140 tests on MTL. Rework: * Move from generator to brw_fs_lower_regioning.cpp. (Suggested by Francisco) * Apply to verx10 >= 125. (Suggested by Francisco) Cc: 23.1 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> (v1) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22569>	2023-04-20 11:41:10 -07:00
Lionel Landwerlin	2e2491b76c	anv: enable shaderStorageImageReadWithoutFormat on Gfx12.5+ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22552>	2023-04-19 06:04:52 +00:00
Michel Dänzer	73e9cf6062	anv/format: Fix GetPhysicalDeviceSparseImageFormatProperties definition To match its declaration (and the corresponding definition in Vulkan headers). Pointed out by GCC 13: ../src/intel/vulkan/anv_formats.c:1597:6: warning: conflicting types for ‘anv_GetPhysicalDeviceSparseImageFormatProperties’ due to enum/integer mismatch; have ‘void(struct VkPhysicalDevice_T *, VkFormat, VkImageType, uint32_t, VkImageUsageFlags, VkImageTiling, uint32_t *, VkSparseImageFormatProperties *)’ {aka ‘void(struct VkPhysicalDevice_T *, VkFormat, VkImageType, unsigned int, unsigned int, VkImageTiling, unsigned int *, VkSparseImageFormatProperties *)’} [-Wenum-int-mismatch] 1597 \| void anv_GetPhysicalDeviceSparseImageFormatProperties( \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In file included from ../src/intel/vulkan/anv_private.h:123, from ../src/intel/vulkan/anv_formats.c:24: src/intel/vulkan/anv_entrypoints.h:122:30: note: previous declaration of ‘anv_GetPhysicalDeviceSparseImageFormatProperties’ with type ‘void(struct VkPhysicalDevice_T *, VkFormat, VkImageType, VkSampleCountFlagBits, VkImageUsageFlags, VkImageTiling, uint32_t *, VkSparseImageFormatProperties *)’ {aka ‘void(struct VkPhysicalDevice_T *, VkFormat, VkImageType, VkSampleCountFlagBits, unsigned int, VkImageTiling, unsigned int *, VkSparseImageFormatProperties *)’} 122 \| VKAPI_ATTR void VKAPI_CALL anv_GetPhysicalDeviceSparseImageFormatProperties(VkPhysicalDevice physicalDevice, VkFormat format, VkImageType type, VkSampleCountFlagBits samples, VkImageUsageFlags usage, VkImageTiling tiling, uint32_t* pPropertyCount, VkSparseImageFormatProperties* pProperties); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22517>	2023-04-18 09:49:44 +00:00
Lionel Landwerlin	3beaaa9ae8	anv: drop lowered storage images code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22302>	2023-04-18 08:38:55 +00:00
Lionel Landwerlin	d04d701cc6	intel/nir: add options to storage image lowering Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22302>	2023-04-18 08:38:55 +00:00
Lionel Landwerlin	d4f498a583	isl: fix a number of errors on storage format support on Gfx9/12.5 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22302>	2023-04-18 08:38:55 +00:00
Tapani Pälli	d561bac6bb	isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125 Same/similar issues are seen on MTL platform as DG2 so disable for both. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22435>	2023-04-18 07:08:18 +03:00
Lionel Landwerlin	2d4fbb3025	anv: Work around the spec question about pipeline feedback vs GPL. This gives anv the same behavior as turnip in not asserting, and just not filling out feedback for those stages. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:38 +00:00
Emma Anholt	e433925789	anv: Refactor repeated pipeline creation feedback output code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Emma Anholt	647ca81654	anv: Only enable GPL if ANV_GPL=true, or if zink or DXVK are the engine. Since there are concerns that the VK_EXT_GPL implementation may have issues with mesh shading, disable it by default but give users a knob to turn it on to experiment. This doesn't automatically enable GPL use in zink, because we lack extendedDynamicState2PatchControlPoints, but it means that you only need to set ZINK_DEBUG=gpl and not both env vars. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Lionel Landwerlin	3d49cdb71e	anv: implement VK_EXT_graphics_pipeline_library Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Lionel Landwerlin	0b8a2de2a1	anv: add dynamic buffer offsets support with independent sets With independent sets, we're not able to compute immediate values for the index at which to read anv_push_constants::dynamic_offsets to get the offset of a dynamic buffer. This is because the pipeline layout may not have all the descriptor set layouts when we compile the shader. To solve that issue, we insert a layer of indirection. This reworks the dynamic buffer offset storage with a 2D array in anv_cmd_pipeline_state: dynamic_offsets[MAX_SETS][MAX_DYN_BUFFERS] When the pipeline or the dynamic buffer offsets are updated, we flatten that array into the anv_push_constants::dynamic_offsets[MAX_DYN_BUFFERS] array. For shaders compiled with independent sets, the bottom 6 bits of element X in anv_push_constants::desc_sets[] are used to specify the base offset into anv_push_constants::dynamic_offsets[] for set X. The computation in the shader is now something like: base_dyn_buffer_set_idx = anv_push_constants::desc_sets[set_idx] & 0x3f dyn_buffer_offset = anv_push_constants::dynamic_offsets[base_dyn_buffer_set_idx + dynamic_buffer_idx] It was suggested by Faith to use a different push constant buffer with dynamic_offsets prepared for each stage when using independent sets instead, but it feels easier to understand this way. And there is some room for optimization: if you are set X and you know all the sets in the range [0, X], then you can still avoid the indirection. Separate push constant allocations per stage do have a CPU cost. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Lionel Landwerlin	16c7c37718	anv: move preprocessing of NIR right before compilation For graphics pipelines, we'll need to load NIR for retained shaders. We want to avoid as much processing as possible while doing that when we're able to load ISA from cache. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Lionel Landwerlin	17e7fe9d97	anv: make input attachments available through bindless With independent sets, we cannot bake into the shader the binding table entry of input attachments anymore because that final location is affected by multiple sets. We can still access them by looking into the descriptor buffer. This change enables the image handle to be stored in the descriptor buffer. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Lionel Landwerlin	e82b05dc64	anv: move force shading rate writes checks With variable fragment shading rate, the last pre-rasterization stage is responsible for writing the shading rate value. The current check is as follows: if the fragment shader can be dispatched at variable shading rate, look for the last pre-raster stage to force the write. We change this to: if we're the last pre-raster stage, force the write. That way this works for pre-rasterization shaders compiled without a fragment shader. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Lionel Landwerlin	b2d3d818d5	anv: introduce a base graphics pipeline object Pipeline libraries and linked pipelines will inherit from this. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Lionel Landwerlin	3ca1fdc8b5	isl: don't set inconsistent fields for depth when using stencil only Since Gfx12+, 3DSTATE_STENCIL_BUFFER has its own Width/Depth/Format/etc. fields, so don't set those fields but leave the address/pitch at 0. Issue found on simulation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
José Roberto de Souza	1563210a41	intel/common: Add gt_id to intel_engine_class MTL and newer platforms on Xe kmd will have engines with gt_id != 0. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22477>	2023-04-17 14:43:06 +00:00
Lionel Landwerlin	a787728906	anv: enable blorp query reset for performance queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22480>	2023-04-15 12:25:57 +03:00
Felix DeGrood	0417cfd7a0	anv: Enable INTEL_MEASURE=cpu Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21505>	2023-04-14 21:57:50 +00:00
Felix DeGrood	55ba4575be	intel: INTEL_MEASURE cpu mode INTEL_MEASURE normally measures timing of GPU events. However, it is sometimes useful to instead measure when these gfx API calls were requested of the driver. INTEL_MEASURE cpu can be used in conjunction with other driver debug capabilities, like INTEL_DEBUG=pc for analyzing stalls/flushes, or when a debugger is attached, to track which frame you're currently on or where in the frame you are. Initial commit, without plumbing into anv/iris. "INTEL_MEASURE=cpu" will collect a cpu timestamp for each INTEL_MEASURE event instead of GPU timestamps. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21505>	2023-04-14 21:57:50 +00:00
Felix DeGrood	c45dee34aa	anv: split INTEL_MEASURE multi events Measure performance of each draw separately in multi_draw event. Previously, we measured duration of the sum of all draws launched per multi_draw. This should provide more detailed data for multi_draws. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21505>	2023-04-14 21:57:49 +00:00
Felix DeGrood	50bda45d15	anv: Add flush reason to NEEDS_END_OF_PIPE_SYNC cs_stall gets inserted if both flushes and invalidates are required. This cs_stall reason was not called out explicitly, until now. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21505>	2023-04-14 21:57:49 +00:00
Felix DeGrood	bdeb849e25	anv: Add flush reasons to raytracing flushes Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21505>	2023-04-14 21:57:49 +00:00
Felix DeGrood	9a30493ccb	anv: Add END_OF_PIPE_SYNC reporting to INTEL_DEBUG=pc Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21505>	2023-04-14 21:57:49 +00:00
Marcin Ślusarz	cf90be90ad	intel: split URB space between task and mesh proportionally to entry sizes Improves performance by 0.5-2.5% in vk_meshlet_cadscene depending on the model. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22445>	2023-04-14 15:43:50 +00:00
Eric Engestrom	e876a018e9	ci: stop removing -x11 suffix for x11 build of deqp-egl Makes it clearer which platform is being run. Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Helen Koike <helen.koike@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22450>	2023-04-14 11:28:21 +00:00
Lionel Landwerlin	08cf224c4a	intel/vec4: force exec_all on float control instruction Applying the same rule as the fs backend so that generation code doesn't assert. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `daa8003e45` ("intel/fs: use nomask for setting cr0 for float controls") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22473>	2023-04-14 10:54:01 +00:00
Felix DeGrood	0a52002a1c	anv: disable reset query pools using blorp opt on MTL This optimization causes some MTL tests to run forever. Not yet sure why. Disabling optimization until we have a fix. Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22373>	2023-04-13 19:35:34 +00:00
Tapani Pälli	b967cbba57	intel/compiler: use intel_needs_workaround for Wa_14012437816 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22437>	2023-04-13 07:33:50 +00:00
Tapani Pälli	ccf16693e1	intel/fs: use intel_needs_workaround for Wa_22013689345 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22437>	2023-04-13 07:33:50 +00:00
Lionel Landwerlin	66edd030ab	anv: add utrace tracking of frame boundaries Based on vkQueuePresentKHR calls. It just helps spotting the beginning and end of a frame in perfetto when apps are using 3/4 command buffers per frame. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22276>	2023-04-13 01:14:38 +00:00
Lionel Landwerlin	da6842007f	intel/ds: add a new timeline row for frames Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22276>	2023-04-13 01:14:38 +00:00
Lionel Landwerlin	68bba1539f	anv: exclude performance queries from blorp clears The query buffer contains a batch to implement the multi pass replay/accumulation of results. So we can't clear it with a memset. An optimization for later would be to move the batches to the very end of the query buffer so we can clear the query data without touching the batches. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4dc7256bf9` ("anv: reset query pools using blorp") Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22421>	2023-04-13 00:44:29 +00:00
José Roberto de Souza	b1299f42ff	anv: Fix vm bind of imported buffers Imported buffers may be created in a device with different memory alignment and this can cause vm bind to fail because the bo size can be smaller than the vm bind range calculated using the importer device memory alignment. So add actual_size to anv_bo; it will be set to the actual size of the bo allocated by kmd for bos allocated in the current device. For other bos, the lseek or the Vulkan API size will be used. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22219>	2023-04-12 10:05:32 +00:00
Lionel Landwerlin	daa8003e45	intel/fs: use nomask for setting cr0 for float controls The instructions manipulating cr0 use the default mask on lane0. So if for some reason that lane is disabled in some of the dispatches, we can end up not executing the instructions. Fixes flakiness in dEQP-VK.spirv_assembly.instruction.graphics.16bit_storage.uniform_float_32_to_16.uniform_matrix_float_rtz_frag Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22314>	2023-04-11 11:01:31 +00:00
Lionel Landwerlin	cff71ae8ff	anv: fixup streamout write barriers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8796 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22336>	2023-04-11 09:53:10 +00:00
Daniel Schürmann	53eb3ad375	vulkan/pipeline_cache: add cache parameter to deserialize() function This allows for secondary cache lookups during deserialization. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21967>	2023-04-10 09:14:30 +00:00
Daniel Schürmann	5daff41e27	vulkan/pipeline_cache: remove vk_device from vk_pipeline_cache_object It is not necessary to store the extra pointer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21967>	2023-04-10 09:14:30 +00:00
Kenneth Graunke	98bcf650f1	intel/compiler: Use nir_dest_bit_size() for ballot bit size check There's no guarantee that this is a SSA value. Use the helper to handle both SSA values and registers correctly. Otherwise we read trash when we encounter a register and make bad decisions on types, possibly leading to our destination being UQ typed when the VGRF is only 32-bit. Fixes compilation with -Dintel-clc=enabled since `7f6491b76d` (nir: Combine if_uses with instruction uses) but the bug is much older than that, circa 2017. We were just getting lucky before. Fixes: `069bf7c907` ("i965/fs: Match destination type to size for ballot") Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22374>	2023-04-07 19:28:56 -07:00
Alyssa Rosenzweig	7f6491b76d	nir: Combine if_uses with instruction uses Every nir_ssa_def is part of two chains of uses (regular instruction uses and if-uses), each implemented with a doubly linked list. That means each list requires 2 * 64-bit = 16 bytes per def, which is memory intensive. Together they require 32 bytes per def. Not cool. To cut that memory use in half, we can combine the two linked lists into a single use list that contains both regular instruction uses and if-uses. To do this, we augment the nir_src with a boolean "is_if", and reimplement the abstract if-uses operations on top of that list. That boolean should fit into the padding already in nir_src so should not actually affect memory use, and in the future we could sneak it into the bottom bit of a pointer. However, this creates a new inefficiency: now iterating over regular uses separate from if-uses is (nominally) more expensive. It turns out virtually every caller of nir_foreach_if_use(_safe) also calls nir_foreach_use(_safe) immediately before, so we rewrite most of the callers to instead call a new single `nir_foreach_use_including_if(_safe)` which predicates the logic based on `src->is_if`. This should mitigate the performance difference. There's a bit of churn, but this is largely a mechanical set of changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Alyssa Rosenzweig	4fa2924610	anv,hasvk: Use vk_features2_to_features Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22217>	2023-04-07 18:16:40 -04:00
Felix DeGrood	4dc7256bf9	anv: reset query pools using blorp Previously we used PC to set query data to 0 during CmdResetQueryPool. This was slow when clearing large query pools. Switching to blorp to clear pools is faster for large query pools. Red Dead Redemption 2: +1.5% speedup Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Lionel Landwerlin	bb49610973	anv: replace query flush before gpu copy by semaphore wait All the flushes should already have happened, we just need CS to wait for the operations to complete. Just use a MI_SEMAPHORE_WAIT to check the availability bit is set. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Lionel Landwerlin	abc4111d19	anv: pass steam output as argument for anv_dump_pipe_bits Just if you need to change it at some point ;) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	2415d57a99	anv/blorp: add flush reasons to RT flushes Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	43f93f5043	anv/blorp: implement anv_cmd_buffer_fill_area Implemented function to fill an area at an address. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Felix DeGrood	0130a4f667	anv/blorp: support surf generation for addresses Already have support for anv_buff. Extended to support addresses. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22178>	2023-04-07 15:51:20 +00:00
Ian Romanick	12e11fa3e4	intel/fs: White space fixes Trivial Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
Ian Romanick	6dfb7061e0	intel/fs: Preserve meta data more often in brw_nir_move_interpolation_to_top This pass rarely makes any changes, so work a little harder to preserve more meta data. On my Ice Lake laptop (using a locked CPU speed and other measures to prevent thermal throttling, etc.) using a debugoptimized build, improves performance of Vulkan CTS "deqp-vk --deqp-case='dEQP-VK.spir'" by -0.2% ± 0.1% (n = 5, pooled s = 0.431885). v2: Add some parentheses. Suggested by Lionel. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22299>	2023-04-06 19:07:50 +00:00
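
The dynamic buffer offset indirection described in commit 0b8a2de2a1 above can be illustrated with a small CPU-side sketch. This is not the driver code: the struct names, the `dyn_count` parameter, both helper functions, and the values chosen for `MAX_SETS` and `MAX_DYN_BUFFERS` are assumptions made for illustration. Only the 2D-to-flat layout and the `& 0x3f` lookup come from the commit message.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical limits; the real driver values may differ. */
#define MAX_SETS        8
#define MAX_DYN_BUFFERS 16

/* Simplified stand-in for anv_push_constants. */
struct push_constants {
    uint32_t desc_sets[MAX_SETS];              /* low 6 bits: base index */
    uint32_t dynamic_offsets[MAX_DYN_BUFFERS]; /* flattened offsets */
};

/* Simplified stand-in for the per-set storage in anv_cmd_pipeline_state. */
struct cmd_pipeline_state {
    uint32_t dynamic_offsets[MAX_SETS][MAX_DYN_BUFFERS];
};

/* Flatten the 2D array into the push constants and record, in the low
 * 6 bits of desc_sets[s], where set s starts in the flat array.
 * dyn_count[s] is the (assumed) number of dynamic buffers in set s. */
static void
flatten_dynamic_offsets(const struct cmd_pipeline_state *state,
                        const uint32_t dyn_count[MAX_SETS],
                        struct push_constants *push)
{
    uint32_t base = 0;
    for (uint32_t s = 0; s < MAX_SETS; s++) {
        assert(base + dyn_count[s] <= MAX_DYN_BUFFERS);
        /* Keep the upper bits, replace only the 6-bit base index. */
        push->desc_sets[s] = (push->desc_sets[s] & ~0x3fu) | base;
        for (uint32_t i = 0; i < dyn_count[s]; i++)
            push->dynamic_offsets[base + i] = state->dynamic_offsets[s][i];
        base += dyn_count[s];
    }
}

/* The lookup a shader compiled with independent sets would perform. */
static uint32_t
lookup_dynamic_offset(const struct push_constants *push,
                      uint32_t set_idx, uint32_t dynamic_buffer_idx)
{
    uint32_t base_dyn_buffer_set_idx = push->desc_sets[set_idx] & 0x3f;
    return push->dynamic_offsets[base_dyn_buffer_set_idx + dynamic_buffer_idx];
}
```

Because the base index is read at run time, the shader never needs to know how many dynamic buffers the lower-numbered sets contain, which is the information that is missing when a pipeline library is compiled with only part of the layout.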


9412 commits