fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-14 05:48:04 +02:00

Author	SHA1	Message	Date
Ganesh Belgur Ramachandra	27e340c030	radv: add gfx11.7 Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40866>	2026-04-18 18:55:39 +00:00
Natalie Vock	c1a7680d93	radv/rt: Don't enable midpoint sorting Midpoint sorting is incompatible with how our traversal works. Specifically, we change tMax when a hit is committed so we can skip over BVH nodes that are guaranteed not to produce a closer hit. However, changing tMax also changes the intersection interval of box nodes with the ray, and thus, the midpoints of that interval. Stackless traversal relies on getting nodes back in the exact same order as before, and if that requirement is not met, traversal may incorrectly skip over nodes. The likely benefit of midpoint sorting does not make up for the loss of ability to skip over BVH nodes exceeding tMax, so simply disable midpoint sorting. This fixes geometry being visible behind other geometry when it shouldn't be in various applications, including Half-Life 2 RTX. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40795>	2026-04-06 11:03:04 +00:00
Daniel Schürmann	2d73dd15d6	radv/rt: add and lower loop continue construct in traversal shaders We are going to disallow continue statements without loop continue constructs. Totals from 58 (0.07% of 84369) affected shaders: (Navi21) Instrs: 2320937 -> 2318938 (-0.09%) CodeSize: 12554896 -> 12546892 (-0.06%); split: -0.06%, +0.00% SpillSGPRs: 116 -> 102 (-12.07%) SpillVGPRs: 2648 -> 2627 (-0.79%) Latency: 13440088 -> 13424694 (-0.11%) InvThroughput: 3262579 -> 3259088 (-0.11%) VClause: 69628 -> 69666 (+0.05%) SClause: 51528 -> 51519 (-0.02%) Copies: 203958 -> 203478 (-0.24%); split: -0.29%, +0.05% Branches: 70777 -> 70400 (-0.53%) PreSGPRs: 4768 -> 4660 (-2.27%) VALU: 1537105 -> 1536899 (-0.01%); split: -0.02%, +0.01% SALU: 340938 -> 339623 (-0.39%) VMEM: 120339 -> 120286 (-0.04%) Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39942>	2026-03-21 07:42:55 +00:00
Natalie Vock	b046eaf36d	radv/rt: Fix shared ray query stack on top of application LDS Since the stack pointer may wrap around the stack size in overflow cases, traversal logic calculates the real stack pointer with nir_umod_imm(b, stack, args->stack_entries * args->stack_stride). For ray queries, "stack" was initialized to "stack_base + local_invocation_idx * 4". This was completely broken, as the umod would later delete the stack base completely and overwrite the start of LDS, which belongs to the apps' shared memory. Instead, add the stack base as a constant offset in the load/store_stack callback. (This should also save 1 VALU per ray query) Also, delete radv_ray_traversal_args::stack_base since it's unused now. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40420>	2026-03-16 12:43:59 +00:00
Rhys Perry	5c3b5688a1	amd: rename ac_cu_info to ac_compiler_info Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40042>	2026-03-03 08:50:12 +00:00
Samuel Pitoiset	d7447012ef	radv/nir: use radv_physical_cache_key::bvh8 more Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40088>	2026-02-25 20:28:43 +00:00
Samuel Pitoiset	1457c39f5a	radv/nir: use radv_physical_cache_key::emulate_rt more Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40088>	2026-02-25 20:28:43 +00:00
Georg Lehmann	a87cdfc6b7	radv/nir/rt: preserve inf/nan for emulated RT intersect Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39641>	2026-02-10 18:42:02 +00:00
Natalie Vock	8baa95e4aa	radv/rt: Use subgroup invocation for stack index Workgroup == subgroup anyway, and we don't have the workgroup thread IDs in RT shaders. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39142>	2026-01-08 19:49:45 +01:00
Georg Lehmann	a706769a0b	nir: move exact bit to nir_fp_math_control Unifies nir per instruction float control. In the future this can be split into contract/reassoc/transform like SPIR-V. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (except SPIR-V) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39103>	2026-01-07 09:40:57 +00:00
Daniel Schürmann	f7c4aa48a0	ac/gpu_info: add some more flags to ac_cu_info Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38701>	2025-12-22 07:34:46 +00:00
Eric Engestrom	4ab65cdaa4	docs: update/fix vk spec urls Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37993>	2025-10-22 09:23:34 +02:00
Georg Lehmann	9e41a7c139	treewide: use nir_load_global alias of nir_build_load_global Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37959>	2025-10-21 12:37:58 +02:00
Christian Gmeiner	1492de1bc3	radv: re-format using clang-format No manual changes here, this is simply running $ ninja -C build/ clang-format Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37226>	2025-09-09 05:48:56 +00:00
Konstantin Seurer	cc0dc4b566	radv: Store parent node IDs inside nodes on GFX12 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Saves some space. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36691>	2025-08-15 13:00:32 +00:00
Konstantin Seurer	c4b18c689f	radv: Emit compressed primitive nodes on GFX12 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Emits two triangles per node whenever possible. The nir code will revisit the triangle node to handle the second triangle only if both triangles are interescted by the ray. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35734>	2025-08-07 20:23:15 +00:00
Konstantin Seurer	d28ff8050a	radv/rt: Use inv_dir for software ray-triangle tests Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Reviewed-by: Autumn Ashton <misyl@froggi.es> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36213>	2025-07-19 16:35:37 +00:00
Konstantin Seurer	5494789e89	radv/rt: Optimize emulated ray-triangle tests The imod instructions are lowered to 4 alu instructions each. We can do better by packing the results with the values for kz. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Reviewed-by: Autumn Ashton <misyl@froggi.es> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36213>	2025-07-19 16:35:37 +00:00
Konstantin Seurer	d140f2a6a2	radv: Implement watertightness for emulated RT Instead of using fp64 (Which is broken in some cases) the new approach only uses fp32 and implements tiebreaking for edge/vertex hits. Using fp32 is also much faster, improving performance of q2rtx by around 40%. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Reviewed-by: Autumn Ashton <misyl@froggi.es> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36213>	2025-07-19 16:35:36 +00:00
Konstantin Seurer	55641f9ca0	radv: Disable pointer flags and the GFX12 WA for emulated RT Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Reviewed-by: Autumn Ashton <misyl@froggi.es> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36213>	2025-07-19 16:35:36 +00:00
Konstantin Seurer	df44b353ad	radv: Optimize ray tracing position fetch Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Gets rid of a lot of indirection when fetching triangle positions. Storing the primitive address increases register pressure by a bit but the traversal shader which should have the highest register demand should not be affected when position fetch is not used. Totals: Instrs: 4021686 -> 4022435 (+0.02%); split: -0.01%, +0.03% CodeSize: 21235812 -> 21235832 (+0.00%); split: -0.02%, +0.02% Latency: 23402275 -> 23412110 (+0.04%); split: -0.04%, +0.09% InvThroughput: 4352818 -> 4352206 (-0.01%); split: -0.04%, +0.02% VClause: 101906 -> 102058 (+0.15%); split: -0.03%, +0.18% Copies: 342210 -> 342368 (+0.05%); split: -0.09%, +0.14% Branches: 114988 -> 114993 (+0.00%) PreVGPRs: 26551 -> 27111 (+2.11%) VALU: 2249366 -> 2249524 (+0.01%); split: -0.01%, +0.02% SALU: 529828 -> 529808 (-0.00%); split: -0.01%, +0.00% Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35533>	2025-07-19 16:07:59 +00:00
Samuel Pitoiset	ea742877f6	radv: re-run clang-format For style consistency. $ clang-format -i $(find src/amd/vulkan/ -name ".h" -o -name ".c" -o -name "*.cpp") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36118>	2025-07-16 09:10:33 +02:00
Natalie Vock	e978f6e247	radv/rt: Use ds_bvh_stack_push8_pop1_rtn_b32 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35269>	2025-07-15 21:34:40 +00:00
Natalie Vock	f0aa383e09	radv/rt: Use ds_bvh_stack_rtn Improves Quake 2 RTX performance by 5% on RDNA3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35269>	2025-07-15 21:34:40 +00:00
Natalie Vock	8815845271	radv/rt/gfx12: Always overwrite origin/dir They're unchanged if we don't test against instance nodes. This makes image_bvh8_intersect_ray kill its direction/origin operands, improving RA. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35269>	2025-07-15 21:34:38 +00:00
Natalie Vock	e236a731e4	radv/rt: Enable pointer flags on GFX11+ Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Allows hardware to do some of the culling work, as well as early-cull box nodes with CullOpaque/CullNonOpaque ray masks when all children are (not) opaque. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32417>	2025-06-28 10:31:38 +00:00
Konstantin Seurer	97f6287827	radv: Use the BVH8 format on GFX12 Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	9d157173b2	radv: Refactor create_bvh_descriptor Make it a bit more extendable since GFX12 introduced more fields. Reviewed-by: Natalie Vock <natalie.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34273>	2025-04-17 20:20:40 +00:00
Konstantin Seurer	676e26aed5	radv: Fix rayTracingPositionFetch with multiple geometies Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The fix adds more indirections to avoid increasing register pressure by tracking the primitive address. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34460>	2025-04-11 22:26:08 +00:00
Timur Kristóf	666f10c174	radv: Add missing includes and remove unnecessary NIR includes. RADV won't compile without the added includes after we stop including the full nir.h from the VK common functions. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33439>	2025-02-12 22:33:07 +01:00
Marek Olšák	82047fa82f	amd: drop support for LLVM 15, 16, 17 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33211>	2025-02-01 04:22:30 +00:00
Eric Engestrom	dd2629b8b8	radv,lvp: fix url to VkAabbPositionsKHR docs The current URL redirect to a page that does not contain any information about this struct, so let's fix that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33159>	2025-01-28 14:28:59 +00:00
Ivan Avdeev	14e3231b56	radv: add a flag to indicate ray tracing support Determine whether the device has hardware raytracing support early, and then use this result where needed, instead of checking for `gfx_level` every time. This is a prerequisite for CYAN_SKILLFISH chip enablement. This chip is still GFX10, not GFX10_3, but has hardware support for accelerated `image_bvh{,64}_intersect_ray` instructions. Just checking for `gfx_level` is insufficient for it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33109>	2025-01-20 08:27:11 +00:00
Arseny Kapoulkine	6a3944a195	radv: On GFX11, use box sorting heuristic based on ray flags Use box with largest ray interval for shadow rays (terminate on first hit) as it maximizes the probability of finding some object in that box; for reflection (closest hit) rays, use midpoint instead, which defers processing of larger boxes the ray origin is in in favor of smaller boxes closer to origin. Since the sorting mode must be uniform, when terminate_on_first_hit flag is divergent, we leave it as closest. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32416>	2024-12-01 05:20:58 +00:00
Friedrich Vock	64c406774f	radv/rt: Skip all AABB code when no_skip_aabbs is not set This avoids having to execute the load_global just to throw the results away and ignore the node. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31443>	2024-10-03 15:22:08 +00:00
Konstantin Seurer	00c94e0cd4	radv: Workaround apps using ray tracing when it is unsupported Emitting bvh64_intersect_ray_amd will crash the compiler on pre-GFX10_3 hardware. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11786 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30886>	2024-09-23 14:02:28 +00:00
Georg Lehmann	97aa3464b9	radv/rt: remove one VALU from traversal loop Not much, but something, I guess? Foz-DB Navi31: Totals from 93 (0.12% of 79395) affected shaders: MaxWaves: 1338 -> 1354 (+1.20%) Instrs: 3689907 -> 3689721 (-0.01%); split: -0.01%, +0.00% CodeSize: 18921812 -> 18922920 (+0.01%); split: -0.00%, +0.01% VGPRs: 9012 -> 8988 (-0.27%) Latency: 23153748 -> 23167640 (+0.06%) InvThroughput: 4490882 -> 4493136 (+0.05%) Copies: 287888 -> 287728 (-0.06%) VALU: 2022082 -> 2021916 (-0.01%) SALU: 458904 -> 459064 (+0.03%) Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30342>	2024-07-25 20:24:23 +00:00
Mike Blumenkrantz	2aaa6ebba1	build/amd: add amd-use-llvm build option this allows amd drivers to disable llvm support while still allowing llvmpipe/lavapipe to be built by disabling llvm support in amd drivers, the load times for these drivers decreases by 5-10ms Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Mike Lothian <mike@fireburn.co.uk> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28969>	2024-05-30 19:05:00 +00:00
Samuel Pitoiset	87fde606cf	radv: use SPDX-License-Identifier Only files under src/amd/vulkan/** are concerned. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28599>	2024-04-08 07:17:31 +00:00
Samuel Pitoiset	896c9cf486	radv: remove radv_device::physical_device Get the logical device object using the base object. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28500>	2024-04-01 16:35:09 +00:00
Samuel Pitoiset	310597cab6	radv: rename radv_physical_device::rad_info to info The extra rad_ prefix isn't necessary and it's longer to type. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28500>	2024-04-01 16:35:09 +00:00
Konstantin Seurer	0e8f2eca9e	radv/rt: Inline constant information about ray flags Totals from 4 (1.06% of 379) affected shaders: Instrs: 2868 -> 2588 (-9.76%) CodeSize: 15184 -> 13544 (-10.80%) Latency: 69280 -> 86922 (+25.46%) InvThroughput: 11546 -> 14486 (+25.46%) Copies: 680 -> 592 (-12.94%) Branches: 148 -> 128 (-13.51%) PreSGPRs: 216 -> 172 (-20.37%) Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27137>	2024-03-07 20:07:07 +00:00
Friedrich Vock	7792ee1c15	radv/rt: Fix frontface culling with emulated RT We need to preserve the divisor's sign for front/backface detection to work correctly. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28034>	2024-03-07 10:34:52 +00:00
Konstantin Seurer	767f628079	radv/rra: Dump basic ray history tokens This only dumps the begin tokens. Tokens are written to a buffer containing a 12 byte header at the beginning. We use an intermediate format for the ray history tokens because the RRA format is very inefficient. Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25548>	2024-02-03 12:59:47 +01:00
Konstantin Seurer	f27a973283	radv/rt: Use doubles inside intersect_ray_amd_software_tri Increasing precision lets us avoid duplicate hits on shared edges. The previous biasing WA made a bunch of tests fail. Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27243>	2024-01-30 10:26:43 +01:00
Konstantin Seurer	c511b8968a	radv: Implement VK_KHR_ray_tracing_position_fetch Reviewed-by: Friedrich Vock <friedrich.vock@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26895>	2024-01-05 18:20:20 +00:00
Samuel Pitoiset	f30fa9dec8	radv: move radv_rt_{common,shader} files to nir/ These files contains NIR lowering code for RT. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26851>	2024-01-03 09:40:29 +00:00

47 commits