fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 22:28:06 +02:00

Author	SHA1	Message	Date
Konstantin Seurer	f2514e75f0	radv/rt: Add monolithic raygen lowering Ray traversal is inlined to allow for constant folding and avoid spilling. Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21929>	2023-09-10 11:40:12 +00:00
Konstantin Seurer	e039e3cd76	radv/rt: Store NIR shaders separately In order to compile monolithic shaders with pipeline libraries, we need to keep the NIR around for inlining recursive stages. Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21929>	2023-09-10 11:40:12 +00:00
Konstantin Seurer	28e1e33c32	radv: Don't use the depth image view for depth bias emission If the application records a secondary command buffer that inherits a render pass without specifying a framebuffer, we should still be able to emit the depth bias state properly. Fixes: `266b2cf` ("radv: implement VK_EXT_depth_bias_control") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9588 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25018>	2023-09-08 19:26:59 +00:00
Tatsuyuki Ishi	4171d9ff84	radv/amdgpu: Use rwlock to protect access to virtual BOs. Vulkan provides no external synchronization guarantees on sparse memory objects. Use a per-BO rwlock to prevent reading data mid-update. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24806>	2023-09-08 18:53:37 +00:00
Samuel Pitoiset	64a5472ad7	radv: remove useless PIPELINE_CREATE_2_LIBRARY_BIT check for retained shaders VK_PIPELINE_CREATE_2_RETAIN_LINK_TIME_OPTIMIZATION_INFO_BIT_EXT is only allowed for pipeline libs, so VK_PIPELINE_CREATE_2_LIBRARY_BIT_KHR should also be set. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25110>	2023-09-08 16:26:40 +00:00
Samuel Pitoiset	adaf4460bd	radv: do not use pre-compiled prologs when VS is compiled separately This wouldn't work for VS+TCS or VS+GS if they are compiled separately on GFX9+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24933>	2023-09-08 12:43:29 +00:00
Samuel Pitoiset	871a383671	radv: adjust emitted prolog regs for merged shaders compiled separately It should also be the merged shader stage. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24933>	2023-09-08 12:43:28 +00:00
Samuel Pitoiset	657cabe17e	radv: adjust next stage for VS prologs and merged shaders compiled separately It should be the merged shader stage. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24933>	2023-09-08 12:43:28 +00:00
Georg Lehmann	524a894ba4	aco/gfx11: don't use bfe for local_invocation_id if the others are always 0 Foz-DB GFX1100: Totals from 4469 (3.37% of 132657) affected shaders: Instrs: 3895053 -> 3893529 (-0.04%); split: -0.04%, +0.00% CodeSize: 20244128 -> 20220952 (-0.11%); split: -0.11%, +0.00% Latency: 37864147 -> 37862227 (-0.01%); split: -0.01%, +0.00% InvThroughput: 5578100 -> 5576469 (-0.03%); split: -0.03%, +0.00% SClause: 108336 -> 108343 (+0.01%); split: -0.00%, +0.01% Copies: 275897 -> 275900 (+0.00%); split: -0.00%, +0.00% Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24514>	2023-09-08 11:28:24 +00:00
Pierre-Eric Pelloux-Prayer	84e61d606b	radv/sdma: use correct limits for gfx10.3 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24771>	2023-09-08 09:15:20 +00:00
Pierre-Eric Pelloux-Prayer	c707cb51e5	radv/sdma: use multiple commands if required Instead of failing the copy we can use multiple chunks. This codepath shouldn't really be used since the source image should usually be tiled but it still better to not fail when possible. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24771>	2023-09-08 09:15:20 +00:00
Timothy Arceri	af1528cc15	nir: replace use of nir_src_copy() Since `03b2c34793` nir_src_copy() no longer does anything useful, it will be removed in the following patch. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24986>	2023-09-08 03:01:39 +00:00
Bas Nieuwenhuizen	4303ea7b9a	radv: Use a double jump to limit nops in DGC for dynamic sequence count. Some RGP data showing that a large amount of NOPs might be a performance concern. Some data from a Granite demo repurposed as benchmark: - with max_count = 16, actual draw count 1-4, the new path is ~5% slower - with max_count = 2048, actual draw count 1-4, the new path is >2x as fast. - with max_count = 16384, actual draw count 1-4, the new path is >7x as fast. Due to the new path being slower in e.g. small cmdbuffers I added a heuristic. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25046>	2023-09-08 00:41:34 +00:00
Samuel Pitoiset	a2ead228ac	radv: avoid emitting THREAD_TRACE_MARKER for predicated draws/dispatches This confused RGP for example when DGC calls are skipped. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25060>	2023-09-07 22:51:51 +00:00
Samuel Pitoiset	51eb072eb6	radv: skip DGC calls when the indirect sequence count is zero with a predicate Starfield has a lot of empty ExecuteIndirect() calls. This optimizes them by using the indirect sequence count as predicate. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25060>	2023-09-07 22:51:51 +00:00
Martin Roukala (né Peres)	13723e3097	radv/ci: use the default kernel on vkcts-navi10 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7888 Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25095>	2023-09-07 21:02:25 +00:00
Martin Roukala (né Peres)	76ef5f82ab	radv/ci: drop the auto-reboot-on-hang for vkcts-navi10 Anecdotal evidence seems to suggest this is not happening anymore, so let's try dropping it and see how it fares! Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25095>	2023-09-07 21:02:25 +00:00
Tatsuyuki Ishi	383842fab8	radv: Fix dumping vertex descriptors with RADV_DEBUG=hang. Adding 3 words should be done before the uint32_t ** cast. This is in line with other places which uses pointer arithmetic on trace_id_ptr. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25081>	2023-09-07 11:50:32 +00:00
Tatsuyuki Ishi	0228b294e8	radv: Fix IB size for RADV_DEBUG=hang. cs->base.cdw here is the size of the last CS in the chain, but we are passing in the first CS in the chain to begin decoding. Hence, cs->ib_buffers[0].cdw is the correct size here. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25061>	2023-09-07 09:45:19 +00:00
Samuel Pitoiset	285223d0fd	radv: fix interactions with primitives generated queries and pipeline stats SAMPLE_STREAMOUTSTATS requires PIPELINESTAT_START to be enabled, otherwise the hw doesn't count anything. This fixes dEQP-VK.transform_feedback.primitives_generated_query.concurrent.pipeline_statistics_2.* on GFX8. GFX6-9 are probably also affected by this bug, but with NGG these queries are slightly different and don't use legacy streamout. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25049>	2023-09-07 08:06:40 +00:00
Samuel Pitoiset	17cd153dd0	radv: add support for DGC with SQTT Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25035>	2023-09-06 07:52:50 +00:00
Samuel Pitoiset	63e0fcfb13	radv: avoid emitting SQTT markers for DGC calls This confuses RGP. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25035>	2023-09-06 07:52:50 +00:00
Chris Spencer	9123505dde	radv/video: use correct enum value for max level IDC Signed-off-by: Chris Spencer <spencercw@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24649>	2023-09-06 05:10:33 +00:00
Marek Olšák	3040aa2e26	ac/llvm: don't convert undef to 0 because nir_opt_undef does it now TOTALS FROM AFFECTED SHADERS (29663/58918) Code Size: 39163724 -> 37842360 (-3.37 %) bytes Max Waves: 394813 -> 396334 (0.39 %) Outputs: 84616 -> 84616 (0.00 %) Patch Outputs: 0 -> 0 (0.00 %) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24059>	2023-09-06 03:24:16 +00:00
antonino	aa657247ce	vulkan/wsi: add `vk_wsi_force_swapchain_to_current_extent` driconf Add a driconf to force the swapchain size to match `VkSurfaceCapabilities2KHR::currentExtent` as a workaround for misbehaved games Fixes: `6139493ae3` ("vulkan/wsi: return VK_SUBOPTIMAL_KHR for sw/x11 on window resize") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24818>	2023-09-06 00:10:41 +00:00
Samuel Pitoiset	e80fddf81f	radv/amdgpu: do not copy the original chain link for IBs Otherwise, if a secondary CS is grown and then executed without IB2, the INDIRECT_BUFFER packet would have been copied but it shouldn't. This fixes a regression that introduced GPU hangs with gl_vk_meshlet_cadscene on RDNA2. Fixes: `df0c742543` ("radv/amdgpu: rework growing a CS with the chained IB path slightly") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24891>	2023-09-05 12:38:33 +00:00
Samuel Pitoiset	9206aeb077	radv/amdgpu: fix executing secondaries without IB2 If a secondary cmdbuf has been grown and is executed without IB2 (eg. on compute queue or when it's not allowed), the ib size ptr contains chaining info, which means the IB size was wrong. This fixes CPU crashes when running gl_vk_meshlet_cadscene. Fixes: `277b2afd70` ("radv/amdgpu: add support for executing DGC cmdbuf with RADV_DEBUG=noibs") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24891>	2023-09-05 12:38:33 +00:00
Samuel Pitoiset	4a8afc9072	radv/ci: re-enable vkd3d-polaris10-valve Like the vkcts job, this was disabled a while ago but it seems to be working well again. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25025>	2023-09-04 16:56:34 +00:00
Samuel Pitoiset	083e7d3a92	radv: fix capturing indirect dispatches with SQTT Looks like indirect dispatches require an event marker instead of an event marker with dims. That makes sense somehow given the blocks size is not known at record time with indirect dispatches. This allows RGP to report correct block sizes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24994>	2023-09-04 06:31:40 +00:00
Qiang Yu	b5eaec6c80	aco,radv,radeonsi: rename is_monolithic to merged_shader_compiled_separately Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24990>	2023-09-04 10:53:44 +08:00
Timur Kristóf	19a7d9615c	ac/nir/ngg: Extract nogs_export_vertex_params function. Just for better code readability. No functional changes. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24574>	2023-09-03 11:04:35 +00:00
Timur Kristóf	93b4f200de	ac/nir/ngg: Wait for attribute ring stores in mesh shaders. Make sure that both per-vertex and per-primitive attribute ring stores are finished before position or primitive export instructions are executed. This is necessary because we need to ensure that mesh shader waves work correctly when they have either vertex-only or primitive-only waves. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24574>	2023-09-03 11:04:35 +00:00
Timur Kristóf	0721784b78	ac/nir/ngg: Refactor mesh shader primitive export. Cleanup the code that generates the two channels of the primitive export instruction, and move storing the built-in per-primitive outputs out to match how vertex attributes work. Prepares the mesh shader lowering for a workaround that affect export instructions. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24574>	2023-09-03 11:04:35 +00:00
Timur Kristóf	edd51655f0	ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export. This is a HW bug workaround for some (all?) GFX11 chips. On these chips, rasterization can start before the attribute ring stores are finished, which can cause issues. As a workaround, wait for attribute ring stores to finish before doing the position export. Mesh shaders will be taken care of in another commit. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24574>	2023-09-03 11:04:35 +00:00
Timur Kristóf	9c096e4ace	ac/nir: Slightly refactor how pos0 exports are added when missing. Prepares for a workaround. Makes it possible for this function to not emit the pos0 export at all so that it can be emitted by a subsequent call to the function later. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24574>	2023-09-03 11:04:35 +00:00
Timur Kristóf	838d886d90	ac/nir: Add done arg to ac_nir_export_position. This prepares for a workaround where we won't need to add the done flag to the last export in this function, because it will be added in a subsequent call to the same function. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24574>	2023-09-03 11:04:35 +00:00
Georg Lehmann	2ae94b3894	aco: implement some exclusive scans with inclusive scans exclusive scan lowering uses full wave shift, for iadd/ixor it's faster to do inclusive scans and subtract/xor the thread's source. Foz-DB Navi21: Totals from 21 (0.02% of 132657) affected shaders: Instrs: 10925 -> 10727 (-1.81%) CodeSize: 58064 -> 56488 (-2.71%) Latency: 178471 -> 177928 (-0.30%) InvThroughput: 24374 -> 24145 (-0.94%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24555>	2023-09-02 11:42:22 +00:00
Georg Lehmann	bce9bba90d	nir: add nir_scalar intrinsic helpers Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24656>	2023-09-02 00:26:31 +00:00
Eric Engestrom	a8c7a2fb69	ci/amd: split the polaris10 rules into one for each farm There is now one polaris10 in each farm, so we need two rules for which one to use. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24996>	2023-09-01 19:35:33 +00:00
Georg Lehmann	cda5784eb3	aco: use v_cvt_f32_ubyte for signed casts too The extract is always positive, so signed vs unsigned conversion doesn't matter. Foz-DB GFX11: Totals from 167 (0.13% of 133461) affected shaders: Instrs: 401631 -> 401225 (-0.10%) CodeSize: 2107256 -> 2104344 (-0.14%) VGPRs: 13320 -> 13332 (+0.09%) Latency: 6468063 -> 6467241 (-0.01%) InvThroughput: 801854 -> 801653 (-0.03%) Copies: 13926 -> 13927 (+0.01%); split: -0.08%, +0.09% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24893>	2023-09-01 17:21:57 +00:00
Samuel Pitoiset	223d00fe0a	radv/ci: re-enable vkcts-polaris10-valve This was disabled a long time ago because of unknown GPU hangs during boot but it seems stable again for some reasons. This also bumps the job timeouts to make sure it will be able to finish. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24968>	2023-09-01 15:14:58 +00:00
Tapani Pälli	48a41c7700	ci: add a fix for KHR-GLES3.packed_pixels.*snorm tests Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24600>	2023-09-01 10:37:43 +00:00
Konstantin Seurer	0385dcac5c	aco/lower_to_cssa: Fix typo Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24906>	2023-09-01 07:23:33 +00:00
Konstantin Seurer	ce4c38ecae	radv: Only generate debug info if required Fixes: `51f2fa1a5e` ("radv: Break up radv_shader_nir_to_asm") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24906>	2023-09-01 07:23:33 +00:00
Konstantin Seurer	2a5d8d4cf4	aco: Unify demote and demote_if selection Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24906>	2023-09-01 07:23:33 +00:00
Konstantin Seurer	9af91edda9	aco: Use bytes() instead of size() in emit_wqm This should get most of the cases that would fail validation. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24906>	2023-09-01 07:23:33 +00:00
Konstantin Seurer	1ddf8378cb	aco/validate: Handle p_wqm like p_parallelcopy Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24906>	2023-09-01 07:23:32 +00:00
Samuel Pitoiset	e104718c9f	aco: allow separate compilation of NGG shaders Also prevent to emit a long-jump for VS as NGG without a GS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24907>	2023-09-01 06:52:40 +00:00
Samuel Pitoiset	ee8ba0f98f	aco: adjust fix_exports() for VS/TES as NGG and non-monolithic shaders Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24907>	2023-09-01 06:52:40 +00:00
Samuel Pitoiset	bfb39031f1	aco: flag blocks with long-jump as export_end for separate compilation This will allow us to adjust fix_exports(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24907>	2023-09-01 06:52:40 +00:00

1 2 3 4 5 ...

13064 commits