fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 00:48:07 +02:00

Author	SHA1	Message	Date
Georg Lehmann	96793fb0c1	aco/isel: implement 16bit vec2 shifts The source bit size mismatch is a bit annoying, but it's still worth it to vectorize these. Foz-DB Navi48: Totals from 85 (0.11% of 80251) affected shaders: Instrs: 119073 -> 118827 (-0.21%); split: -0.21%, +0.00% CodeSize: 669604 -> 667552 (-0.31%); split: -0.31%, +0.00% VGPRs: 4796 -> 4736 (-1.25%) Latency: 1907685 -> 1901983 (-0.30%); split: -0.32%, +0.02% InvThroughput: 642603 -> 640680 (-0.30%); split: -0.33%, +0.03% VClause: 2088 -> 2091 (+0.14%) Copies: 18300 -> 18394 (+0.51%); split: -0.01%, +0.52% Branches: 3452 -> 3440 (-0.35%) VALU: 63378 -> 63144 (-0.37%); split: -0.37%, +0.00% SALU: 23065 -> 23076 (+0.05%); split: -0.00%, +0.05% Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35825>	2025-07-09 07:23:08 +00:00
Konstantin Seurer	4e258f8579	radv/rra/gfx10_3: Fix acceleration structure addresses RRA adds rra_accel_struct_chunk_header::header_offset to the address so we need to subtract it. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35985>	2025-07-09 07:04:37 +00:00
Yiwei Zhang	fb77881262	radv: use AHARDWAREBUFFER_USAGE_CAMERA_MASK Now AHB header has it defined. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35785>	2025-07-09 03:47:06 +00:00
Yiwei Zhang	0d33739216	radv: use common ANB swapchain gralloc usage query The additional usage bits added for rgba8 and rgb565 are not needed as those are conditionally added based on the consumer side of the Android surface (e.g. display, encoder, etc). Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35785>	2025-07-09 03:47:06 +00:00
Daniel Schürmann	2c51a8870d	nir: add nir_vectorize_cb callback parameter to nir_lower_phis_to_scalar() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Similar to nir_lower_alu_width(), the callback can return the desired number of components for a phi, or 0 for no lowering. The previous behavior of nir_lower_phis_to_scalar() with lower_all=true can be elicited via nir_lower_all_phis_to_scalar() while the previous behavior with lower_all=false now corresponds to nir_lower_phis_to_scalar() with NULL callback. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35783>	2025-07-08 15:33:59 +00:00
jesse.zhang	56d758d321	amd: Add user queue HQD count to hw_ip info Add a new field userq_num_hqds to drm_amdgpu_info_hw_ip to expose the number of available hardware queue descriptors (HQDs) for user queues. This allows userspace to query the maximum number of user queues that can be created for a particular IP block. the patch link in driver side: https://lists.freedesktop.org/archives/amd-gfx/2025-June/126686.html v2: we should also put userq_num_hqds into radeon_info and print it where other fields are printed. (Marek Olšák) v3: rename num_userqs to num_queue_slots and add print log in ac_print_gpu_info. (Marek Olšák) v4: rename userq_num_hqds to userq_num_slots in hw_ip_info, and update the hw information (Marek Olšák) Signed-off-by: Jesse Zhang <Jesse.Zhang@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35850>	2025-07-08 10:17:51 +00:00
Autumn Ashton	1ceded0c83	radv: Fix handling of NULL pColorAttachmentLocations in vkCmdSetRenderingAttachmentLocations Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details From the Vulkan spec: `If pColorAttachmentLocations is NULL, it is equivalent to setting each element to its index within the array.` Use similar logic to what we do in CmdSetRenderingInputAttachmentIndices to handle this behaviour properly. Signed-off-by: Autumn Ashton <misyl@froggi.es> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35948>	2025-07-08 02:32:56 +00:00
Rhys Perry	34f1a8f707	aco: handle FPAtomicToDenormModeHazard Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This is quite unlikely to happen, but I guess it might be possible and it's relatively simple to work around. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35884>	2025-07-07 13:02:43 +00:00
Marek Olšák	b31f73a1b1	ac/nir: use u_foreach_bit more Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35345>	2025-07-07 11:41:57 +00:00
Marek Olšák	896dd9bc93	ac/nir: eliminate sample_id/sample_pos if MSAA is disabled Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35345>	2025-07-07 11:41:57 +00:00
Marek Olšák	1c2007005e	ac/nir: rename force_center_interp_no_msaa to msaa_disabled Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35345>	2025-07-07 11:41:57 +00:00
Wolf480pl	62b3fd0a5e	radv/virtio: don't leak drm FD when using vpipe Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The fd in radv_physical_device_try_create is one we opened in that function. We don't need it when vpipe is in use, so we should close it, before setting it to -1. Fixes: `999d5098b4` ("radv/virtio: support vpipe") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35947>	2025-07-07 09:51:15 +00:00
Alyssa Rosenzweig	d31cb824df	treewide: use VARYING_BIT_* Some checks failed macOS-CI / macOS-CI (dri) (push) Has been cancelled Details macOS-CI / macOS-CI (xlib) (push) Has been cancelled Details Via Coccinelle patch generated by the following Python: varys = [ "POS", "COL0", "COL1", "FOGC", "TEX0", "TEX1", "TEX2", "TEX3", "TEX4", "TEX5", "TEX6", "TEX7", "PSIZ", "BFC0", "BFC1", "EDGE", "CLIP_VERTEX", "CLIP_DIST0", "CLIP_DIST1", "CULL_DIST0", "CULL_DIST1", "PRIMITIVE_ID", "PRIMITIVE_COUNT", "LAYER", "VIEWPORT", "FACE", "PRIMITIVE_SHADING_RATE", "PNTC", "TESS_LEVEL_OUTER", "TESS_LEVEL_INNER", "PRIMITIVE_INDICES", "BOUNDING_BOX0", "BOUNDING_BOX1", "VIEWPORT_MASK", "CULL_PRIMITIVE" ] t = """ @@ @@ -(1 << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -(1ull << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD64_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} """ for v in varys: from mako.template import Template print(Template(t).render(V = v)) Closes: #13453 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> [panfrost, common] Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> [broadcom] Reviewed-by: Corentin Noël <corentin.noel@collabora.com> [virgl] Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> [zink] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35917>	2025-07-04 19:01:04 +00:00
Pierre-Eric Pelloux-Prayer	fab2c9a923	ac: fix invalid array size Reported by static analysis. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35877>	2025-07-04 15:26:38 +00:00
Pierre-Eric Pelloux-Prayer	6e371f0a8a	ac: fix potential overflows Reported by static analysis. Multiplication may overflow before being converted to the larger type, so fix this by casting one of the operands to the destination type. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35877>	2025-07-04 15:26:38 +00:00
Samuel Pitoiset	2af3ef9305	ac/surface: select a different swizzle mode for ASTC formats on GFX12 It seems only 4KiB swizzle works fine with ASTC. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34877>	2025-07-03 15:31:04 +00:00
Samuel Pitoiset	cb6f2d9409	ac/surface: use align with NPOT for estimating surface size Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details ac_estimate_size() triggers an assertion because the block size isn't aligned to a power of two for ASTC formats. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35879>	2025-07-03 08:02:17 +00:00
Marek Olšák	028591aead	ac/nir: remove kill_pointsize and kill_layer options from lowering passes The outputs are removed by a separate pass. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:46 +00:00
Marek Olšák	42ad7543b8	ac/nir: switch legacy GS lowering to ac_nir_prerast_out completely This changes legacy GS outputs to use the same logic as NGG GS. It enables the same optimizations that NGG has such as forwarding constant GS output components to the GS copy shader at compile time. ac_nir_gs_output_info is removed. GS output info is no longer passed to ac_nir_lower_legacy_gs and ac_nir_create_gs_copy_shader separately. ac_nir_lower_legacy_gs now gathers ac_nir_prerast_out, generates GSVS ring stores, and also generates the GS copy shader with GSVS ring loads. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:45 +00:00
Marek Olšák	723ce13f90	ac/nir: move gs_output_component_mask_with_stream to prerast utils Legacy GS will use it. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:45 +00:00
Marek Olšák	2c64cdc047	ac/nir: return the GS copy shader from ac_nir_lower_legacy_gs This way we won't have to pass output info between the two functions. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:44 +00:00
Marek Olšák	98f3fc494e	ac/nir: remove no-op loop from ac_nir_create_gs_copy_shader Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:43 +00:00
Marek Olšák	098d33766a	ac: add legacy GS subgroup size computation from radeonsi Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:43 +00:00
Marek Olšák	fa8db1ccd3	ac: add NGG subgroup size computation from radeonsi RADV will use it. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:42 +00:00
Marek Olšák	4263b49778	ac/nir: remove ngg_scratch LDS ABI, allocate it in the lowering pass This is a cleanup. Old gs LDS layout: [es outputs][gs outputs][scratch] Old nogs LDS layout: [xfb/cull][scratch] New gs LDS layout: [es outputs][scratch\|gs outputs] New nogs LDS layout: [scratch\|xfb/cull] The LDS scratch is moved to the beginning of the preceding buffer in LDS, while the addresses in that LDS buffer are offset by the scratch size. It effectively merges the LDS scratch with the preceding buffer in LDS. Thanks to that, we no longer need the ngg_scratch ABI and the offset in a user SGPR. The lowering passes now return the LDS scratch size, which is used by the drivers to determine the final LDS size. The ngg_lds_layout SGPR is now unused without GS in RADV. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:41 +00:00
Marek Olšák	b1b581f855	ac/nir/lower_ngg: add an option not to export cull distances if the shader culls them Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:40 +00:00
Marek Olšák	8c04a91d12	ac/nir: rename clip_cull_mask parameter to clearer export_clipdist_mask Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:40 +00:00
Marek Olšák	ed0f393607	ac/nir/lower_ngg: rename clip_cull_dist_mask and use it correctly We incorrectly used it to determine whether the shader should cull, which luckily had no effect because it wasn't used everywhere. cull_clipdist_mask should be used instead, which also reflects whether clip planes are enabled in GL. clip_cull_dist_mask is renamed to export_clipdist_mask to make it clear. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:40 +00:00
Marek Olšák	f6af3c0e17	ac/nir/lower_ngg: forward constant GS & XFB output components from stores to loads for LDS This removes LDS space and loads/stores for constant GS & XFB output components. Constant output components skip LDS stores, and LDS loads are replaced with the gathered constants. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:40 +00:00
Marek Olšák	0ba4e3ae83	ac/nir/lower_ngg: add & use new scalar helpers for XFB loads/stores This simplifies the code and scalarizes the loads/stores. Scalar loads/stores will allow forwarding constant output components from stores to loads easily. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:40 +00:00
Marek Olšák	4b6ae11207	ac/nir/lower_ngg: add & use new scalar helpers for GS loads/stores This simplifies the code and scalarizes the loads/stores. Scalar loads/stores will allow forwarding constant output components from stores to loads easily. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:40 +00:00
Marek Olšák	f407129b7f	ac/nir/lower_ngg_gs: cull against clip/cull distances & clip planes in GS This is finally implemented. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35352>	2025-07-02 20:27:40 +00:00
Samuel Pitoiset	10ef9c6a80	radv: disable RB+ with E5B9G9R9 to workaround failures on GFX10.3-GFX11.5 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This looks like a hw bug on GFX10.3-GFX11.5 because RB+ seems to only work as expected when all channels (RGBA) are written. With that format, RGB channels must be all set or unset but setting the A channel is legal so far. This will reduce rendering performance with that format but it's the less intrusive solution for now. This might be revisited in the near future, also with more VKCTS coverage. This has been tested and verified on GFX10.3 (NAVI21) and GFX11 (NAVI31) and GFX12 (NAVI48), unfortunately I don't have GFX11.5 but let's assume it's broken there too. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13371 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35631>	2025-07-02 17:21:58 +00:00
Samuel Pitoiset	7017b25d6a	radv: stop disabling the alpha optimization with E5B9G9R9 and RB+ This old workaround was added due to test failures with VKCTS but it turns out the tests were broken. Color writemask for E5B9G9R9 must be all RGB or none and some tests are testing various RGB channels which is illegal. See https://gitlab.khronos.org/Tracker/vk-gl-cts/-/issues/5821. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35631>	2025-07-02 17:21:58 +00:00
Rhys Perry	dce1d4ad4c	aco/ra: fix repeated compact_linear_vgprs() in get_reg() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `b7738de4f9` ("aco/ra: rework linear VGPR allocation") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13431 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35838>	2025-07-02 09:26:04 +00:00
Rob Clark	6bc47e65d7	rusticl: Fix work group size validation For each dimension, we `threads *= lws`.. which is still zero if threads is initialized to zero. Fixes: `eca4f0f632` ("rusticl/kernel: check that local size on dispatch doesn't exceed limits") Signed-off-by: Rob Clark <rob.clark@oss.qualcomm.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35864>	2025-07-01 18:55:01 +00:00
Rhys Perry	21c4400278	aco: update ctx.block when inserting discard block Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13432 Backport-to: 25.1 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35833>	2025-07-01 14:31:11 +00:00
Samuel Pitoiset	71397a8162	radv/meta: stop allocating sampler for blit operations Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35790>	2025-07-01 08:58:03 +02:00
Samuel Pitoiset	ba8bd13a14	radv: rework initializing/finishing samplers Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35790>	2025-07-01 08:58:02 +02:00
Alyssa Rosenzweig	3c2f46fcac	treewide: use nir_break_if with named if Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Via Coccinelle patch: @@ expression builder, condition; identifier nif; @@ -nir_if *nif = nir_push_if(builder, condition); -{ -nir_jump(builder, nir_jump_break); -} -nir_pop_if(builder, nif); +nir_break_if(builder, condition); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35794>	2025-06-30 14:51:54 -04:00
Alyssa Rosenzweig	67237b6f1b	treewide: use nir_break_if Via Coccinelle patch: @@ expression builder, condition; @@ -nir_push_if(builder, condition); -{ -nir_jump(builder, nir_jump_break); -} -nir_pop_if(builder, NULL); +nir_break_if(builder, condition); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35794>	2025-06-30 14:51:24 -04:00
Samuel Pitoiset	68708cd4da	radv/ci: uprev kernel to 6.15.3 NAVI21/NAVI31 still uses 6.6 for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35765>	2025-06-30 08:10:47 +02:00
Mel Henning	10acb44c64	nir: Split lower_vote_eq into int/float versions Recent nvidia hardware has a native instruction for nir_intrinsic_vote_ieq but not for nir_intrinsic_vote_feq. So, split this boolean into two so we can contol the lowering separately for each instruction. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35778>	2025-06-28 16:10:50 +00:00
Natalie Vock	e236a731e4	radv/rt: Enable pointer flags on GFX11+ Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Allows hardware to do some of the culling work, as well as early-cull box nodes with CullOpaque/CullNonOpaque ray masks when all children are (not) opaque. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32417>	2025-06-28 10:31:38 +00:00
Natalie Vock	e82717a5cf	radv: Use common helper to set BLAS node pointer flags on gfx11+ Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32417>	2025-06-28 10:31:38 +00:00
Natalie Vock	06a06bbe09	radv: Encode child opaqueness information in box nodes Also, use one reserved field from the header to store the root node's opaqueness flags. This is used to propagate opaqueness info across the BLAS/TLAS boundary. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32417>	2025-06-28 10:31:37 +00:00
Natalie Vock	3b1f94d00d	radv: Encode child opaqueness information in triangle nodes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32417>	2025-06-28 10:31:37 +00:00
Marek Olšák	6afa638b18	ac/nir/lower_ngg: rename user_clip_plane_enable_mask -> cull_clipdist_mask Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35351>	2025-06-28 08:20:26 +00:00
Marek Olšák	814990684d	ac/nir/lower_ngg: pack GS outputs and XFB outputs in LDS optimally This switches the code to the new slot offsets from ac_nir_prerast_out instead of using a prefix bitmask over outputs_written. The LDS layout no longer includes these: - GS: output components that are not written by GS - VS/TES+XFB: output components that are not written by XFB - VS/TES+XFB: slots that are not written by XFB (this could be significant) This is also a cleanup because it unduplicates the bitcounts. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35351>	2025-06-28 08:20:26 +00:00
Marek Olšák	75b1602c14	ac/nir/lower_ngg_gs: return LDS size from the pass instead of computing it separately. This is better because ac_nir_lower_ngg_gs knows the final LDS size anyway, and it will be easier to modify the size calculation this way. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35351>	2025-06-28 08:20:26 +00:00

1 2 3 4 5 ...

17969 commits