fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-11 10:10:14 +01:00

Author	SHA1	Message	Date
Mary Guillemard	066850bb3a	panfrost: Take tiler memory budget into account in pan_select_tiler_hierarchy_mask Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details On v12+, the hardware report support for 8 levels but effectively only support up to 4 levels. In case more than 4 levels are used, it will default to 0xAA when tile_size is 32x32 or lower, otherwise 0xAC when the tile_size is greater than 32x32. This patch makes it that we now ensure that the bins can fit inside out tiler budget and otherwise drop levels until it fit. This also allows the hardware to decide the hierarchy on v12+ if we know it will fit. This fixes "dEQP-GLES31.functional.fbo.no_attachments.maximums.all" and dEQP-GLES31.functional.fbo.no_attachments.maximums.size" on v12+ but also likely more if we were exhausting the memory budget. Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Backport-to: 25.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34559> (cherry picked from commit `92afeb37bf`)	2025-04-22 01:25:05 +02:00
David Rosca	5bae75e3a0	radeonsi/vcn: Fix decode target index for H264 interlaced streams With H264 the target surface can also be in the reference list for current frame, so it can only be inserted into the DPB list after iterating over all references. Fixes: `0e68a2655f` ("radeonsi/vcn: Rework decode ref handling") Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34527> (cherry picked from commit `b0b52d4922`)	2025-04-22 01:25:04 +02:00
Marek Olšák	39e4fe7ab4	radv: fix incorrect patch_outputs_read for TCS with dynamic state Fixes: `8c2f9f0665` - radv: switch to the new TCS LDS/offchip size computation Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34544> (cherry picked from commit `4a51089f30`)	2025-04-22 01:25:00 +02:00
Janne Grunau	b47ada6635	venus: virtgpu: Require stable wire format When VMMs do not support VIRTGPU_DRM_CAPSET_VENUS the capset data remains zeroed. By requiring the stable wire_format_version 1 this can be detected early without initialising the renderer. Avoids triggering `assert(capset->supports_blob_id_0);` in debug builds under such circumstances. Cc: mesa-stable Signed-off-by: Janne Grunau <j@jannau.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34613> (cherry picked from commit `3d3ca9b65e`)	2025-04-22 01:24:59 +02:00
Yiwei Zhang	da4de27515	venus: fix missing renderer destructions With failed compatibility check, the created renderer must be destroyed within vn_instance_init_renderer. Cc: mesa-stable Fixes: `25b8f4f714` ("venus: handle device probing properly.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34613> (cherry picked from commit `2a4675ee9f`)	2025-04-22 01:24:58 +02:00
Janne Grunau	6f12ae221c	venus: Do not use instance pointer before NULL check Fixes: `a753f50668` ("venus: break up vn_device.c") Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Signed-off-by: Janne Grunau <j@jannau.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34613> (cherry picked from commit `39e4fd98ce`)	2025-04-22 01:24:57 +02:00
Alyssa Rosenzweig	cba13b7c52	asahi: fix possible null deref with indirect non-indexed draws. Backport-to: 25.1 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34612> (cherry picked from commit `84505c5d99`)	2025-04-22 01:24:54 +02:00
Alyssa Rosenzweig	2be4fb62cf	hk: fix patch count = 0 handling fixes fault in dEQP-VK.tessellation.misc_draw.triangles_no_patches Backport-to: 25.1 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34612> (cherry picked from commit `e541ffcbe8`)	2025-04-22 01:24:53 +02:00
Alyssa Rosenzweig	2099c23dab	agx: early-kill sources only if it won't shuffle rather than always early killing and then hitting pathological shuffle situations, only early-kill when we can prove that we won't need to shuffle. it turns out that's most of the time. even with this heuristic, we still get hurt bad in shader-db due to extra moves. but hopefully, the #s here are small enough that we can move on with our lives and fix this source of known unsoundness. this is tagged for backport as it's needed to avoid a perf regression with the previous patch. combined stats from this commit and the previous commit: total instrs in shared programs: 2846065 -> 2852257 (0.22%) instrs in affected programs: 618734 -> 624926 (1.00%) total alu in shared programs: 2329477 -> 2335534 (0.26%) alu in affected programs: 508119 -> 514176 (1.19%) total gprs in shared programs: 894762 -> 901327 (0.73%) gprs in affected programs: 36946 -> 43511 (17.77%) Backport-to: 25.1 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34595> (cherry picked from commit `b1e86b3eae`)	2025-04-22 01:24:52 +02:00
Alyssa Rosenzweig	27b46ecfb8	agx: late-kill sources shader-db stats combined with next commit. this is the rip off the bandaid, next is the optimize. split to enable bisecting. the code we have to shuffle clobbered killed sources is broken and, after thinking about that for a Long time, I don't see a reasonable way to fix it. But if we late-kill sources - or model our calculations as-if we were late-killing souces - we never have to shuffle onto a killed source and the problem goes away entirely. this is similar in spirit to what NAK does. it's not "optimal", but it's sane. Backport-to: 25.1 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34595> (cherry picked from commit `b88fe9b0c5`)	2025-04-22 01:24:51 +02:00
Alyssa Rosenzweig	1c5cd9ff7a	agx: model sources as late-kill in demand calcs This hurts us in two ways: * slightly more spilling (not actually a big problem) * slightly worse occupancy (the shaders that are "helped" here are from trying less hard to fit at higher occupancy levels) However, in exchange we get a LOT more flexibility in the RA. total instrs in shared programs: 2847015 -> 2846065 (-0.03%) instrs in affected programs: 84134 -> 83184 (-1.13%) total alu in shared programs: 2330406 -> 2329477 (-0.04%) alu in affected programs: 62305 -> 61376 (-1.49%) total code size in shared programs: 20497326 -> 20491690 (-0.03%) code size in affected programs: 586664 -> 581028 (-0.96%) total gprs in shared programs: 894202 -> 894762 (0.06%) gprs in affected programs: 8900 -> 9460 (6.29%) total scratch in shared programs: 13292 -> 13304 (0.09%) scratch in affected programs: 2924 -> 2936 (0.41%) total threads in shared programs: 27819712 -> 27814272 (-0.02%) threads in affected programs: 55296 -> 49856 (-9.84%) total spills in shared programs: 907 -> 914 (0.77%) spills in affected programs: 419 -> 426 (1.67%) total fills in shared programs: 857 -> 862 (0.58%) fills in affected programs: 389 -> 394 (1.29%) Backport-to: 25.1 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34595> (cherry picked from commit `7fad96d194`)	2025-04-22 01:24:50 +02:00
Alyssa Rosenzweig	3cc215b1cc	hk: fix null FS corner cases this fixes null FS + cull distance/API sample mask, which require a prolog. fixes upcoming CTS. Backport-to: 25.1 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34585> (cherry picked from commit `3ab8ce8579`)	2025-04-22 01:24:49 +02:00
Alyssa Rosenzweig	5c048f7860	hk: fix tessellation + clipper queries fixes upcoming cts Backport-to: 25.1 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34585> (cherry picked from commit `d959557669`)	2025-04-22 01:24:48 +02:00
GKraats	c196a64471	EGL: legacy-x11=dri2 should support hardware driver Since MR !33891 EGL only supports a software driver (LLVM). Routine dri3_x11_connect at src/egl/drivers/dri2/platform_x11.c fails if DRI3 is not available. So at that location variable *allow_dri2 should be set. Looking at the major codition, we see it is not executed if LIBGL_DRI3_DISABLE is set. In that case the hardware driver is activated as desired. Previously this was not needed. Also it is not practical, and not necessary. I do not understand the major condition, so I did not change it. This causes some duplicate coding. Fixes: `323bad6b18` ("egl/x11: split out dri2 init entirely") Signed-off-by: GKraats <vd.kraats@hccnet.nl> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34530> (cherry picked from commit `995dc61bf5`)	2025-04-22 01:24:45 +02:00
Rhys Perry	76db8496a9	aco: combine VALU lanemask hazard into VALUMaskWriteHazard This is now basically the same as the original VALUMaskWriteHazard, except it now considers both VALU and SALU writes. Now that it's a part of VALUMaskWriteHazard, differences from the original VALU lanemask workaround are: - it includes SALU reads after the write - it includes VALU writes and SALU/VALU reads after the write which are not lanemasks - it combines s_waitcnt_depctr instructions when it's a read after both a SALU write and a VALU write - non-exec VALU SGPR reads reset the SGPRs read by VALU as a lanemask - exec SGPRs are ignored resolve_all_gfx11() is also finished. fossil-db (navi31): Totals from 21538 (27.13% of 79377) affected shaders: Instrs: 27628855 -> 27552972 (-0.27%); split: -0.30%, +0.03% CodeSize: 145968448 -> 145667616 (-0.21%); split: -0.23%, +0.02% Latency: 209537805 -> 209509519 (-0.01%); split: -0.02%, +0.00% InvThroughput: 36304270 -> 36301624 (-0.01%); split: -0.01%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12623 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11480 Backport-to: 25.0 Backport-to: 25.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34529> (cherry picked from commit `ce2be5ab8e`)	2025-04-22 01:24:39 +02:00
Mel Henning	614c26c634	nak: Handle idp4 ureg latencies Fixes: `6b8a4e6bb7` ("nak: Add Turing latency information") Fixes: `7a01953a39` ("nak: Add Ampere and Ada latency information") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12993 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34563> (cherry picked from commit `eee3c8eab8`)	2025-04-22 01:24:38 +02:00
Mel Henning	53c8864428	nak/spill_values: Spill constants across edges if needed In a previous iteration of the spilling code, we added an extra check to only spill across edges if the value being spilled is in the W set. This was due to a misunderstanding of the modeling of S and W in Braun and Hack. In the current implementation, we maintain the invariant that every live value is in at least one of S or W so we don't need that check but it was left in by mistake. One exception to this rule was added when we special-cased constant values. Now the invariant is that every live value is in S, in W, or is a constant. When we made this change, the check we accidentally left in bit us because now if a value is constant but not in W, it wasn't getting spilled across the edge. This can result in a value getting filled later which was never spilled, leading to undefined values. Fixes: `7b82e26e3c` ("nak: Don't spill/fill const values") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12993 Co-authored-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34563> (cherry picked from commit `de1ed48325`)	2025-04-22 01:24:36 +02:00
Rohan Garg	e106478551	anv: re enable compression for CPS surfaces on platforms other than Xe I accidentally disabled compression on CPS surfaces marked as storage or color attachment for all platforms, when this should only be limited to Xe. Fixes: 80f9b6 ('anv: CPB surfaces that are used as color attachments or for stores cannot be compressed') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34297> (cherry picked from commit `cbc1ec4f73`)	2025-04-22 01:24:32 +02:00
Rhys Perry	dd304bfd80	aco/gfx12: don't use second VALU for VOPD's OPX if there is a WaR fossil-db (gfx1201): Totals from 38908 (49.02% of 79377) affected shaders: Instrs: 30268107 -> 30268131 (+0.00%); split: -0.00%, +0.00% CodeSize: 180843648 -> 180843640 (-0.00%); split: -0.00%, +0.00% Latency: 224905962 -> 224906072 (+0.00%); split: -0.00%, +0.00% InvThroughput: 44322988 -> 44323004 (+0.00%) VALU: 15124145 -> 15124167 (+0.00%) VOPD: 4018504 -> 4018482 (-0.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Backport-to: 25.0 Backport-to: 25.1 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34246> (cherry picked from commit `408fa33c09`)	2025-04-22 01:24:31 +02:00
Tapani Pälli	2f1fd84e4d	iris: make sure to not mix compressed vs non-compressed This commit implements the following requirement: "Keep any UMD-recycling of compression-enabled/disabled memory separate." As additional info there are 2 related wa's for the issue: Wa_14018443005 Wa_18038669374 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34499> (cherry picked from commit `6d70ec449f`)	2025-04-22 00:04:17 +02:00
Tapani Pälli	d0b2f4830d	iris: force reallocate on eglCreateImage with GFX >= 20 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34499> (cherry picked from commit `c2a4657862`)	2025-04-22 00:04:16 +02:00
Faith Ekstrand	e944636ff7	nak/sm70: Fix the bit74_75_ar_mod assert It's used for src2, not src0. Fixes: `40422927dc` ("nak: Pass has_mod to all form of src2 requiring it") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33107> (cherry picked from commit `47fc468944`)	2025-04-22 00:04:15 +02:00
Faith Ekstrand	0702e54b55	nak/legalize: Take a RegFile in copy_alu_src_and_lower_ineg() Fixes: `af6093a712` ("nak/legalize: Add a helper for lowering ineg") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33107> (cherry picked from commit `328112c6bc`)	2025-04-22 00:04:14 +02:00
Faith Ekstrand	9fa9cd870f	nak/legalize: Take a RegFile in copy_alu_src_and_lower_fmod Otherwise, we'll screw up uniform GPRs. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33107> (cherry picked from commit `22a30bfa4f`)	2025-04-22 00:04:12 +02:00
Patrick Lerda	da655b10ad	mesa_interface: fix legacy dri2 compatibility Some checks failed macOS-CI / macOS-CI (dri) (push) Has been cancelled Details macOS-CI / macOS-CI (xlib) (push) Has been cancelled Details These values are shared with xcb/dri2.h, and can't be changed without breaking the legacy dri2 compatibility. This change reverses partially the update done by `3b603d1646`. For instance this issue is triggered on dri2 i915 with "piglit/bin/glx-copy-sub-buffer -auto" or "piglit/bin/hiz-depth-read-window-stencil0 -auto". Fixes: `3b603d1646` ("mesa_interface: remove unused stuff") Signed-off-by: Patrick Lerda <patrick9876@free.fr> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34561> (cherry picked from commit `60a31156b0`)	2025-04-17 02:28:20 +02:00
Mike Blumenkrantz	2bfe468661	zink: verify that surface exists when adding implicit feedback loop this can be null if multiple contexts are in use cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34557> (cherry picked from commit `de6efc01c1`)	2025-04-17 02:28:19 +02:00
Tomeu Vizoso	d588bebd75	etnaviv/ml: Use etna_buffer_resource instead of etna_resource Otherwise we hit an assert in newly added code. Fixes: `d738b3ea2b` ("etnaviv: split PIPE_BUFFER resources from other types of resources") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34481> (cherry picked from commit `251d1e2551`)	2025-04-17 02:28:18 +02:00
Alyssa Rosenzweig	5c0cd81232	hk: fix underbinding scratch need to round up to page size (minimally) or we assert out. hit in vulkaninfo of all things. Fixes: `678134add5` ("hk: implement sparse") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34526> (cherry picked from commit `9b55451ea7`)	2025-04-17 02:28:16 +02:00
Pierre-Eric Pelloux-Prayer	4350f7e7db	winsys/amdgpu: disable VM_ALWAYS_VALID The referenced commit has been identified as the root cause of graphic artifacts / hangs on some APUs. For now disable AMDGPU_GEM_CREATE_VM_ALWAYS_VALID on all chips except when user queues are used. See https://gitlab.freedesktop.org/mesa/mesa/-/issues/12809. Fixes: `8c91624614` ("winsys/amdgpu: use VM_ALWAYS_VALID for all VRAM and GTT allocations") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34547> (cherry picked from commit `555821ff93`)	2025-04-17 02:28:14 +02:00
Mark Collins	93547d45ce	ir3/a7xx: Add post-RA pass to track liveness and insert (last) Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Introduces a backwards dataflow analysis pass to determine when a certain register is always written to prior to being read in a similar manner to SSA liveness but performed after RA which we can use to determine when we can insert (last) on src regs on A7XX. Passing VK-CTS: dEQP-VK.pipeline.* Signed-off-by: Mark Collins <mark@igalia.com> Co-Authored-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25077>	2025-04-16 07:40:50 +00:00
David Rosca	6586689661	radeonsi/vpe: Use studio range for YUV and full for RGB by default If application doesn't specify color range, use studio for YUV and full for RGB. Also stop always forcing full for RGB as that's wrong. Reviewed-by: Peyton Lee <peytolee@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34519>	2025-04-16 07:17:57 +00:00
David Rosca	1a502fcd89	radeonsi/vpe: Fix process_frame return value VPE_STATUS_OK is 1, but the function should return 0 on success. Fixes: `4fe586f71e` ("radeonsi/vpe: support geometric scaling") Reviewed-by: Peyton Lee <peytolee@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34519>	2025-04-16 07:17:56 +00:00
David Rosca	bd6f9e8aee	radeonsi/vpe: Use float division to get scaling ratio Fixes: `e85a6b6a63` ("radeonsi/vpe: check reduction ratio") Reviewed-by: Peyton Lee <peytolee@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34519>	2025-04-16 07:17:56 +00:00
Samuel Pitoiset	b4940255ed	radv/sdma: add support for compression on GFX12 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Similar to previous generations that support compression, except that the driver don't need to configure a meta VA because DCC is completely transparent to the userspace. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	efa0b16bb2	radv/sdma: add a new flag to know if the surface is compressed On GFX12, DCC is transparent to the driver and there is no meta VA. Adding a new flag to know if the SDMA surface is compressed is needed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	03671ccf9e	radv/sdma: use the correct helper to get the number type field This wasn't technically incorrect because V_028C70_BU_NUM_xxx values are similar to V_028C70_NUMBER_xxx but it's better to use the corect helper. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	b44dc98cde	radv/sdma: remove redundant check for compression when getting metadata It's already checked by the caller. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	d3d5d2fe86	radv/sdma: use SDMA5_DCC_xxx bitfields It's cleaner. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	f44342199a	radv/sdma: simplify configuring the number of uncompressed DCC blocks SDMA doesn't support MSAA, so the value can be V_028C78_MAX_BLOCK_SIZE_256B. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34517>	2025-04-16 06:57:00 +00:00
Samuel Pitoiset	13db408e59	ac/perfcounter: add support for GFX12 Sourced from PAL to add SPM support. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34524>	2025-04-16 06:35:33 +00:00
Samuel Pitoiset	c42d43e8eb	radv: print more error messages during SPM initialization Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34524>	2025-04-16 06:35:33 +00:00
Marek Olšák	177427877b	radeonsi: use nir_opt_shrink_vectors It reduces VGPR usage, but the impact is almost none. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	b7eff9cd87	radeonsi: always scalarize shared memory instructions to get ds_load_2addr/ds_store_2addr more often and to prevent code size regressions from nir_opt_shrink_vectors. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	78cacfd9ce	ac/surface: select 3D tile mode without overallocating too much for gfx6-8 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12466 Fixes: `c87ce78d` - ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	195e7b4f75	ac/surface: make gfx12_estimate_size reusable by gfx6 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12466 Fixes: `c87ce78d` - ac/surface: enable thick tiling for 3D textures for better perf on gfx6-8 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	2c122d478b	ac/nir: set X=0 for task->mesh shader dispatch when Y or Z is 0 The code set X=0 when Y and Z is 0, not "or". Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	963147d7fd	ac/gpu_info: add 256 to payload_entry_size to increase future task shader perf It has no effect because num_entries is 1K, but the table shows a lot of potential. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	d7c903f258	ac/gpu_info: add payload_entry_size into ac_task_info to stop causing full RADV recompiles when it's changed. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	0dafd04695	ac/gpu_info: remove has_tmz_support function It's not needed since: `8b3056343f` - ac/gpu_info: bump required DRM minor version to 3.42.0 (kernel 5.15+) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00
Marek Olšák	0be5a3559a	ac/gpu_info: increase the attribute ring size for gfx12 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34432>	2025-04-16 06:08:48 +00:00

1 2 3 4 5 ...

189198 commits