fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 19:58:19 +02:00

Author	SHA1	Message	Date
Rhys Perry	dfa8ac6b91	aco: remove buffer_load_lds instructions They don't exist See https://github.com/llvm/llvm-project/pull/132916 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14041 Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37716>	2025-10-07 09:50:26 +00:00
Samuel Pitoiset	08ddf2f878	radv: lower embedded/immutable samplers earlier Lowering them earlier right after VTN would allow us to implement embedded samplers for descriptor heap properly for merged shaders. Non-immediate samplers are still lowered in radv_nir_apply_pipeline_layout because they require shader arguments. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37688>	2025-10-07 09:25:28 +00:00
Samuel Pitoiset	cb746e2d84	radv: lower ycbcr tex instructions earlier There is no real advantage to delay this lowering. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37688>	2025-10-07 09:25:27 +00:00
Samuel Pitoiset	b8bdc68933	radv/ci: update expected list of failures for VEGA10/NAVI10 Since `a8f4a2a9ba` ("radv/video: Check FW version before using WRITE_MEMORY") presumably. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37733>	2025-10-07 08:06:54 +00:00
Benjamin Cheng	364a2488ad	radv/video: Report extra image usages ENCODE_SRC and DECODE_DST are transparent and can have additional usages. Reviewed-by: David Rosca <david.rosca@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37656>	2025-10-06 21:27:48 +00:00
Benjamin Cheng	d1872c45ae	radv/video: Fix video profile reporting Use vk_video_is_profile_supported first, and add AMD specific restrictions later. vulkaninfo reports on Navi31: H.264 Decode (4:2:0 8-bit) Baseline progressive H.264 Decode (4:2:0 8-bit) Main progressive H.264 Decode (4:2:0 8-bit) High progressive H.264 Decode (4:2:0 8-bit) Baseline interlaced (interleaved lines) H.264 Decode (4:2:0 8-bit) Main interlaced (interleaved lines) H.264 Decode (4:2:0 8-bit) High interlaced (interleaved lines) H.264 Decode (monochrome 8-bit) High progressive H.264 Decode (monochrome 8-bit) High interlaced (interleaved lines) H.265 Decode (4:2:0 8-bit) Main H.265 Decode (4:2:0 8-bit) Main 10 H.265 Decode (4:2:0 8-bit) Main Still Picture H.265 Decode (4:2:0 10-bit) Main 10 VP9 Decode (4:2:0 8-bit) Profile 0 VP9 Decode (4:2:0 10-bit) Profile 2 AV1 Decode (4:2:0 8-bit) Main with film grain support AV1 Decode (4:2:0 8-bit) Main without film grain support AV1 Decode (4:2:0 10-bit) Main with film grain support AV1 Decode (4:2:0 10-bit) Main without film grain support AV1 Decode (4:2:0 12-bit) Professional with film grain support AV1 Decode (4:2:0 12-bit) Professional without film grain support AV1 Decode (monochrome 8-bit) Main with film grain support AV1 Decode (monochrome 8-bit) Main without film grain support AV1 Decode (monochrome 10-bit) Main with film grain support AV1 Decode (monochrome 10-bit) Main without film grain support AV1 Decode (monochrome 12-bit) Professional with film grain support AV1 Decode (monochrome 12-bit) Professional without film grain support H.264 Encode (4:2:0 8-bit) Baseline H.264 Encode (4:2:0 8-bit) Main H.264 Encode (4:2:0 8-bit) High H.265 Encode (4:2:0 8-bit) Main H.265 Encode (4:2:0 8-bit) Main 10 H.265 Encode (4:2:0 8-bit) Main Still Picture H.265 Encode (4:2:0 10-bit) Main 10 AV1 Encode (4:2:0 8-bit) Main AV1 Encode (4:2:0 10-bit) Main Reviewed-by: David Rosca <david.rosca@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37656>	2025-10-06 21:27:48 +00:00
David Rosca	59a3ca2333	radv/video: Fix waiting on encode feedback query Currently we wait until the second dword in feedback buffer changes from 0 to 1, and then the rest of the feedback is read. There is no guarantee that the rest of the feedback will be available, which can cause bitstream size to be incorrectly returned as 0. Add write memory command after encode, marking the query as available to ensure the entire feedback buffer is ready. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13601 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36772>	2025-10-06 10:32:54 +00:00
David Rosca	a8f4a2a9ba	radv/video: Check FW version before using WRITE_MEMORY Move the version check to separate function so that it can also be used elsewhere. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36772>	2025-10-06 10:32:54 +00:00
David Rosca	40c124e67a	radv: Change radv_vcn_write_event to a write memory func Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36772>	2025-10-06 10:32:53 +00:00
Samuel Pitoiset	874bc09537	radv: reserve more CS space when executing DGC calls This can trigger an assert otherwise. The space reserved before executing DGC IBs is an arbitrary number which should be large enough in all cases. Found this while implementing descriptor heap. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37681>	2025-10-06 06:28:18 +00:00
Bas Nieuwenhuizen	82d06b58ad	radv: use vk_drm_syncobj_copy_payloads Based on a patch by llyyr <llyyr.public@gmail.com>: !36827 added the copy_sync_payloads function, but didn't enable use of it in radv. This commit mirrors similar MRs for anv/panvk/nvk and uses the common vk_drm_syncobj_copy_payloads function for copy_sync_payloads. I'm not too familiar with radv internals, so there's potentially a good reason why this isn't a good change. However, I've personally been using this patch locally for around a month and have experienced no regressions and around 8% uplift on vkmark test scores with a 6600 XT. [vertex] device-local=true: 45110 -> 48489 (+7.5%) [vertex] device-local=false: 17529 -> 17488 (-0.2%) [texture] anisotropy=0: 44768 -> 48679 (+8.7%) [texture] anisotropy=16: 44920 -> 48572 (+8.1%) [shading] shading=gouraud: 44931 -> 48467 (+7.9%) [shading] shading=blinn-phong-inf: 44849 -> 48740 (+8.7%) [shading] shading=phong: 44695 -> 48645 (+8.8%) [shading] shading=cel: 44809 -> 47938 (+7.0%) [effect2d] kernel=edge: 45185 -> 47837 (+5.9%) [effect2d] kernel=blur: 26919 -> 26762 (-0.6%) [desktop] <default>: 40974 -> 44034 (+7.5%) [cube] <default>: 45090 -> 49270 (+9.3%) [clear] <default>: 41102 -> 44375 (+8.0%) (https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37606) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37640>	2025-10-06 00:45:09 +00:00
Yinjie Yao	f0f95a9ae3	ac/parse_ib: Update vcn ib parser to include missing commands Signed-off-by: Yinjie Yao <yinjie.yao@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: David Rosca <david.rosca@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37672>	2025-10-03 14:44:07 +00:00
Samuel Pitoiset	38892cb558	radv: only expose AMD_device_coherent_memory if actually supported This fixes an issue after a recent update to dEQP-VK.info.device_mandatory_features. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37663>	2025-10-03 14:26:32 +00:00
Samuel Pitoiset	e2db50c97b	Revert "radv/ci: document recent unexpected failures on TAHITI" This reverts commit `abd2a79264`. Fixed by `93ce29c42e` ("amd: don't allow unsigned wraps for shared memory offsets on GFX6"). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37685>	2025-10-03 13:37:16 +02:00
Daniel Schürmann	0e3bc3d8c0	nir/opt_offsets: call allow_offset_wrap() for try_fold_shared2() This prevents applying wrapping offsets on GFX6. Fixes: `e1a692f74b` ('nir/opt_offsets: allow for unsigned wraps when folding load/store_shared2_amd offsets') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37667>	2025-10-03 07:54:12 +00:00
Daniel Schürmann	93ce29c42e	amd: don't allow unsigned wraps for shared memory offsets on GFX6 Fixes: `10266e7b21` ('radv: allow for unsigned wraps for shared memory intrinsics in nir_opt_offsets') Fixes: `dd68825feb` ('radeonsi: allow for unsigned wraps for shared memory intrinsics in nir_opt_offsets') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37667>	2025-10-03 07:54:12 +00:00
abdelhadi	5c82a3e114	aco: fix debug info offset Signed-off-by: abdelhadi <abdelhadims@icloud.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37244>	2025-10-02 13:38:56 +00:00
Samuel Pitoiset	abd2a79264	radv/ci: document recent unexpected failures on TAHITI Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37664>	2025-10-02 13:10:32 +00:00
Vitaliy Triang3l Kuzmin	dea20be1b3	ac: Enable HTILE TC Z clear value bug workaround on GFX1013 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Vitaliy Triang3l Kuzmin <triang3l@yandex.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33962>	2025-10-02 08:29:50 +00:00
Vitaliy Triang3l Kuzmin	4e3a5f60e1	radv,ac: Split has_tc_compat_zrange_bug into Z and ZS, document it Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Vitaliy Triang3l Kuzmin <triang3l@yandex.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33962>	2025-10-02 08:29:49 +00:00
Vitaliy Triang3l Kuzmin	5243f292ef	radv,ac: GFX10 depth/stencil HTILE mipmap bug info variable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Vitaliy Triang3l Kuzmin <triang3l@yandex.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33962>	2025-10-02 08:29:48 +00:00
Georg Lehmann	9533e7cdae	aco/optimizer: fix incorrect operand order assumption for neg(mul) opt The code that labels instructions doesn't care about the order either. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14013 Cc: mesa-stable Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37643>	2025-10-01 20:52:12 +00:00
Natalie Vock	52c7b0d20c	radv/bvh: Encode empty AS bounds as NaN If there are no leaves, the root node bounds still span -inf/inf. Making empty BLASs infinite-sized guarantees ray traversal needs to enter the BLAS (and immediately exit because it's empty). Remove the BLAS from the BVH entirely by marking its bounds as NaN. As a bonus, this works around RADV encountering issues in Silent Hill 2 on RDNA4 due to infinite-sized BVHs. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37492>	2025-10-01 14:27:15 +00:00
Samuel Pitoiset	29ccbb21f3	radv: add a helper whether shader fp16 is enabled To remove code duplication. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37619>	2025-09-29 16:17:11 +00:00
Timur Kristóf	d3579190d6	ac/nir/ngg: Fix scalarized mesh primitive indices Take the write_mask into account when storing primitive indices, otherwise they will end up being stored in the wrong place. Fixes: `8e24d3426d` ("ac/nir/ngg: Refactor MS primitive indices for scalarized IO.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37610>	2025-09-29 08:07:54 +00:00
Timur Kristóf	3dc9c1a91e	ac/nir/ngg: Remove dead code for 64-bit mesh shader variables We already lower all 64-bit I/O to 32-bit before this pass, and the rest of the code here already asserts that I/O variables must be 32-bit or smaller. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37610>	2025-09-29 08:07:54 +00:00
Georg Lehmann	a7f8c6ed60	radv: call nir_opt_undef late too Foz-DB GFX1201: Totals from 2263 (2.82% of 80287) affected shaders: MaxWaves: 57164 -> 57016 (-0.26%); split: +0.04%, -0.30% Instrs: 2711595 -> 2678247 (-1.23%); split: -1.23%, +0.00% CodeSize: 14066656 -> 13929720 (-0.97%); split: -1.01%, +0.03% VGPRs: 139452 -> 140004 (+0.40%); split: -0.03%, +0.42% Latency: 15902794 -> 15875935 (-0.17%); split: -0.17%, +0.00% InvThroughput: 2179122 -> 2165716 (-0.62%); split: -0.62%, +0.00% SClause: 61416 -> 61477 (+0.10%); split: -0.01%, +0.11% Copies: 169781 -> 175175 (+3.18%); split: -0.05%, +3.22% Branches: 53491 -> 53469 (-0.04%) PreSGPRs: 114087 -> 114086 (-0.00%) PreVGPRs: 115702 -> 115697 (-0.00%) VALU: 1555907 -> 1535514 (-1.31%); split: -1.31%, +0.00% SALU: 362560 -> 353803 (-2.42%) SMEM: 106263 -> 106259 (-0.00%) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37552>	2025-09-26 15:11:26 +00:00
Georg Lehmann	8343e45467	aco/lower_branches: update branch hints after changing jump targets Fixes: `13ad3db43f` ("aco/lower_branches: implement try_remove_simple_block() in lower_branches()") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37552>	2025-09-26 15:11:26 +00:00
Simon McVittie	9d36bf891b	vulkan: Compute path to write into JSON manifests once, use it everywhere This reduces duplication: we only need to distinguish between Windows and Unix in one place. The previous code was inconsistent about using either the `platforms` option, or the `host_machine`. Following the logic described in commit `94379377` "lavapipe: build "Windows" check should use the host machine, not the `platforms` option.", I've assumed that checking the host machine is the more-correct version and used that. Signed-off-by: Simon McVittie <smcv@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37576>	2025-09-26 10:47:31 +00:00
Simon McVittie	be8cac52d3	vulkan: Consistently form driver library names as prefix + name + suffix This consistently uses `NAME.dll` on Windows, `libNAME.dylib` on Darwin derivatives such as macOS, and `libNAME.so` on Linux, *BSD and so on. It's also consistent about using the local variable name `icd_file_name` for this name in every Vulkan driver, which was already the case in many but not all drivers. Some of these drivers probably don't make sense (or don't work) on Windows and/or macOS, but if this is kept consistent for all drivers, it should avoid the need for driver-specific commits like commit `611e9f29e` "lavapipe: fix icd generation for windows", commit `951f3287` "lavapipe: set empty dll prefix", commit `13e7a39f` "lavapipe: fixes for macOS support", commit `7008e655` "radv: Update JSON generator if Windows" and so on, each time a driver is found to be relevant on more platforms than previously believed. Signed-off-by: Simon McVittie <smcv@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37576>	2025-09-26 10:47:31 +00:00
Georg Lehmann	cc08786689	aco: use maximum RT vgpr_limit that doesn't reduce wave count 144 instead of 132 with 5 waves, in practice. Foz-DB Navi31: Totals from 33 (0.04% of 80273) affected shaders: Instrs: 3266241 -> 3261329 (-0.15%) CodeSize: 16885356 -> 16860088 (-0.15%) VGPRs: 4356 -> 4752 (+9.09%) SpillVGPRs: 2504 -> 1535 (-38.70%) Scratch: 264704 -> 216320 (-18.28%) Latency: 18445909 -> 18395904 (-0.27%) InvThroughput: 3689182 -> 3679182 (-0.27%) VClause: 85171 -> 84595 (-0.68%) SClause: 59365 -> 59320 (-0.08%); split: -0.08%, +0.01% Copies: 260528 -> 259113 (-0.54%); split: -0.59%, +0.05% Branches: 92537 -> 92519 (-0.02%) VALU: 1937426 -> 1935925 (-0.08%); split: -0.08%, +0.01% SALU: 393075 -> 393047 (-0.01%); split: -0.01%, +0.01% VMEM: 147914 -> 146003 (-1.29%) Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37548>	2025-09-26 08:45:05 +00:00
Georg Lehmann	8e03505782	aco: don't insert s_sendmsg dealloc_vgprs with little vgprs allocated Reduces message bus traffic when the benefit is small. Foz-DB Navi31: Totals from 3752 (4.67% of 80273) affected shaders: Instrs: 1999755 -> 1992249 (-0.38%) CodeSize: 10531824 -> 10501800 (-0.29%) Latency: 14935247 -> 14935147 (-0.00%) InvThroughput: 5976053 -> 5975262 (-0.01%) Foz-DB Navi33: Totals from 2614 (3.26% of 80273) affected shaders: Instrs: 969475 -> 964247 (-0.54%) CodeSize: 5171240 -> 5150328 (-0.40%) Latency: 7891519 -> 7891434 (-0.00%) InvThroughput: 4815008 -> 4814287 (-0.01%); split: -0.01%, +0.00% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37508>	2025-09-26 07:51:02 +00:00
Georg Lehmann	27cc6317f9	aco: dealloc vgprs if there is a pending non scratch store and no pending export Because s_sendmsg dealloc_vgprs waits for every counter except vs_count, and the message bus has limited throughput, we should only insert the dealloc when we know that it's beneficial. Foz-DB Navi31: Totals from 5280 (6.58% of 80273) affected shaders: Instrs: 4186851 -> 4197416 (+0.25%) CodeSize: 21910004 -> 21952264 (+0.19%) Latency: 31679067 -> 31679173 (+0.00%) InvThroughput: 9182625 -> 9183417 (+0.01%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37508>	2025-09-26 07:51:02 +00:00
Georg Lehmann	26e041e821	aco: remove existing dealloc_vgprs use We didn't consider that s_sendmsg dealloc_vgpr waits for all counters except vscnt. Foz-DB Navi31: Totals from 74090 (92.52% of 80084) affected shaders: Instrs: 36031071 -> 35853573 (-0.49%) CodeSize: 189233756 -> 188523764 (-0.38%) Latency: 222378318 -> 222374890 (-0.00%) InvThroughput: 33366893 -> 33362457 (-0.01%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37508>	2025-09-26 07:51:02 +00:00
Georg Lehmann	cf30742a66	radv,aco: don't end monolithic ray tracing with unconditional terminate The terminate requires more code and blocks us from deallocating VGPRs early. Foz-DB Navi31: Totals from 63 (0.08% of 80273) affected shaders: Instrs: 3372702 -> 3372467 (-0.01%) CodeSize: 17441676 -> 17440736 (-0.01%) Latency: 19763447 -> 19763288 (-0.00%) InvThroughput: 3860502 -> 3860478 (-0.00%) Branches: 96204 -> 96141 (-0.07%) SALU: 406648 -> 406549 (-0.02%) Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37542>	2025-09-25 15:35:55 +00:00
Daniel Schürmann	d041640b88	aco: remove excess offset handling for load/store_shared Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37453>	2025-09-24 14:28:25 +00:00
Daniel Schürmann	dbb20a4e23	aco/optimizer: remove DS offset optimization No fossil changes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37453>	2025-09-24 14:28:24 +00:00
Daniel Schürmann	10266e7b21	radv: allow for unsigned wraps for shared memory intrinsics in nir_opt_offsets Totals from 76 (0.10% of 79839) affected shaders: (Navi48) Instrs: 237450 -> 237323 (-0.05%); split: -0.05%, +0.00% CodeSize: 1276732 -> 1275824 (-0.07%); split: -0.07%, +0.00% Latency: 1123467 -> 1123387 (-0.01%); split: -0.01%, +0.01% InvThroughput: 364942 -> 364738 (-0.06%); split: -0.06%, +0.00% Copies: 20654 -> 20636 (-0.09%); split: -0.09%, +0.00% Branches: 7326 -> 7327 (+0.01%) PreSGPRs: 5197 -> 5195 (-0.04%) PreVGPRs: 3395 -> 3396 (+0.03%) VALU: 96134 -> 96034 (-0.10%) SALU: 48059 -> 48041 (-0.04%); split: -0.04%, +0.00% VOPD: 10 -> 8 (-20.00%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37453>	2025-09-24 14:28:24 +00:00
Rhys Perry	591b498e1f	radv: fix progress reporting in lower_rt_derefs Only create nir_load_rt_arg_scratch_offset_amd if needed. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35069>	2025-09-24 08:20:27 +00:00
Rhys Perry	92a2ab8b64	ac/nir: fix progress reporting in ac_nir_lower_tex Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35069>	2025-09-24 08:20:27 +00:00
Natalie Vock	f0d3d0ad21	aco/scheduler: Bail early on unreorderable instructions Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37212>	2025-09-22 11:13:50 +00:00
Ali, Nawwar	c75cb1233c	amd/vpelib: add FL capabilitie and lut container size [WHY] get a clear definition of fastload support and actual 3d lut container size [HOW] Added related code Acked-by: Chuanyu Tseng <Chuanyu.Tseng@amd.com> Signed-off-by: Nawwar Ali <Nawwar.Ali@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37504>	2025-09-22 10:37:22 +00:00
Nagulendran, Iswara	1cd047c958	amd/vpelib: Handle Destination Rect with zero dimensions [Why] Route case where dest rect has zero dimensions to perform background color fill. Acked-by: Chuanyu Tseng <Chuanyu.Tseng@amd.com> Signed-off-by: Iswara Nagulendran <Iswara.Nagulendran@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37504>	2025-09-22 10:37:22 +00:00
Assadian, Navid	4c96e8c352	amd/vpelib: Add new colors to visual confirm [WHY] Newly added formats require distinct colors for proper differentiation. [HOW] Add new colors, pairwise distinguishable for newly added formats. Acked-by: Chuanyu Tseng <Chuanyu.Tseng@amd.com> Signed-off-by: Navid Assadian <Navid.Assadian@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37504>	2025-09-22 10:37:21 +00:00
swscm, z1	d79665066d	amd/vpelib: Ensures type-safe comparison for callback assignment [WHY & How] Ensures type-safe comparison for the sys_event callback assignment by casting the NULL constant to the appropriate function pointer type. Acked-by: Chuanyu Tseng <Chuanyu.Tseng@amd.com> Signed-off-by: Muhammad Ansari <Muhammad.Ansari@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37504>	2025-09-22 10:37:21 +00:00
Zhao, Jiali	237ab0778e	amd/vpelib: Create Function to Check for Blending Feature [HOW] Created check_blending_support function and condition to check for readable purpose Acked-by: Chuanyu Tseng <Chuanyu.Tseng@amd.com> Signed-off-by: Zhao, Jiali <Jiali.Zhao@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37504>	2025-09-22 10:37:21 +00:00
Marek Olšák	bbab69d343	radv: fix load_smem alignment radv_cmd_buffer_upload_alloc_aligned is used with alignment=0, which guarantees that the alignment is at least 4. Fixes: `9e16ed7a13` - ac/nir: switch nir_load_smem_amd uses to ac_nir_load_smem wrapper Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37345>	2025-09-19 21:08:25 -04:00
Georg Lehmann	14dfc05f83	radv: use rt wave size in fragment shaders with ray queries Usually wave64 performs better for fragment shaders, because LDS sharing for interpolation is better. But the rt traversal loop divergence is likely high enough to make wave32 better on GFX10. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37360>	2025-09-19 11:06:06 +00:00
Georg Lehmann	4a080a8904	radv: allow application required fragment shader subgroup size If the application really thinks it needs pswave32, let it use it. Fragment shaders also have no concept of full subgroups, so the existing code that chooses the subgroup size will work already. For pre raster stages, we cannot allow this because of potential mismatches in merged stages. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37360>	2025-09-19 11:06:06 +00:00
Hans-Kristian Arntzen	3bc81ee6f1	radv/sqtt: Ensure that present fence gets signalled. Signed-off-by: Hans-Kristian Arntzen <post@arntzen-software.no> Fixes: `88cbe32048` ("radv: add support for RGP queue events") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37438>	2025-09-18 14:58:39 +00:00

18811 commits