fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-28 19:00:13 +01:00

Author	SHA1	Message	Date
Rhys Perry	bf41cf2eef	radv/rt: don't split array/struct payload variables If the shader has multiple payload variables, split passes might not preserve the order and this can cause the offsets used for the stores to not match the payload offsets for nir_intrinsic_trace_ray. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31204>	2024-09-24 15:41:04 +00:00
Rhys Perry	204e446bcd	radv/rt: align constant data by 64 when inlining shaders There's never any need for anything higher. If this were too high (such as NIR_ALIGN_MUL_MAX), it would have caused issues. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31204>	2024-09-24 15:41:04 +00:00
Sagar Ghuge	7e48cbb029	intel: uncached L1 to fix memory barrier issue in RT shader In the RT shader, if there's a executeCallableEXT() in between, even though the called shader does nothing, the instructions before and after the executeCallableEXT() is not properly synced. Patch fixes: - dEQP-VK.ray_tracing_pipeline.memguarantee.inside.rgen - dEQP-VK.ray_tracing_pipeline.memguarantee.inside.chit - dEQP-VK.ray_tracing_pipeline.memguarantee.inside.miss - dEQP-VK.ray_tracing_pipeline.memguarantee.inside.call Thank to Kevin for finding out there is a load/store issue. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31201>	2024-09-24 14:33:11 +00:00
Nanley Chery	730e83b525	anv: Require compression for fast-clears on gfx20+ In commit `44351d67f8`, I needed to change some variables in a check for compression in anv_can_fast_clear_color_view(). Instead of doing that, I dropped the check altogether because I thought the call to anv_layout_to_fast_clear_type() which followed right afterwards would return ANV_FAST_CLEAR_NONE if the aux usage was ISL_AUX_USAGE_NONE. That turned out not to be the case, due to special-casing of Xe2+. For now, make Xe2+ more like other platforms when it comes to enabling fast-clears. If there comes a reason to actually fast-clear with ISL_AUX_USAGE_NONE, we can revisit this. Fixes: `44351d67f8` ("anv: Change params of anv_can_fast_clear_color_view") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11920 Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31297>	2024-09-24 13:56:02 +00:00
Benjamin Otte	d5f207311d	pvr: Don't emit critical warning all the time Instead of unconditionally emitting a warning to applications about the missing environment variable to enable the driver, only check it after determining that the device actually matches. Signed-off-by: Benjamin Otte <otte@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31293>	2024-09-24 13:26:46 +00:00
Sviatoslav Peleshko	78a664b584	anv: Update XeSS workaround executable names for Satisfactory 1.0 Fixes: `8b36d230` ("anv: workaround XeSS for Satisfactory") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11915 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31343>	2024-09-24 12:06:54 +00:00
Mike Blumenkrantz	04709e4f7d	anv: fix video profile lists these didn't include dmabuf layout or mutable formats despite both being supported Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31317>	2024-09-24 11:38:48 +00:00
Hyunjun Ko	a36b17d7a8	zink: walk the chain of resources for multi-planar formats. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31333>	2024-09-24 11:11:10 +00:00
Lionel Landwerlin	f81dc17e7d	anv: add missing pipeline instance multiplier Fix zink/anv tests : dEQP-GLES3.functional.fbo.multiview.samples_* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11911 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31341>	2024-09-24 10:36:17 +00:00
Rohan Garg	56adf42110	intel/brw: lower math op regions for Xe2+ This helps fix: - dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_3.tan_frag - dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_2.tan_frag Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31218>	2024-09-24 09:58:28 +00:00
Michel Dänzer	dc7723f788	radeonsi: Revert to GLSL_SAMPLER_DIM_2D in si_create_fmask_expand_cs Fixes piglit arb_shader_texture_image_samples-builtin-image hanging Navi 14. Fixes: `997c39c268` ("radeonsi: clean up and make corrections to si_create_fmask_expand_cs") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31323>	2024-09-24 07:52:55 +00:00
Michel Dänzer	d69c1ca1a0	Revert "radeonsi: remove CB sync after FMASK and DCC decompression" This reverts commit `3527d9f81d`. It broke piglit glx@glx-visuals-depth/glx@glx-visuals-stencil on Navi 14. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31323>	2024-09-24 07:52:55 +00:00
Eric Engestrom	76c5c49fca	radeonsi/ci: mark KHR-GL46.shader_image_load_store.basic-allTargets-atomic as fixed Fixed by a commit in the range e1a53d41...1b4e1007 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31337>	2024-09-24 07:34:11 +00:00
Eric Engestrom	bc6eae7d0a	radeonsi/ci: document spec@egl_ext_surface_compression@create as crashing Fixes: `213f5e9152` ("Uprev Piglit to e9ab30aeaed97b69868cf4d6d6a3f70f3b53c362") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31337>	2024-09-24 07:34:11 +00:00
David Rosca	712e49f137	radeonsi/vcn: Don't reuse context with multiple VCN instances Kernel does VCN instance scheduling per context, so when we have multiple instances we should use new context to be able to utilize all of them. Another issue is with AV1, VCN 3 and VCN 4 only support AV1 on first instance. Kernel parses IBs and switches to first instance when it detects AV1, but this only works for first submitted IB in context. The CS would be rejected if we first decode/encode other codecs, kernel schedules on second instance (default) and then we try to decode/encode AV1. Cc: mesa-stable Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31249>	2024-09-24 07:06:32 +00:00
Samuel Pitoiset	cf536f63d1	radv: introduce dirty flags for shaders state Instead of re-emitting some dynamic states when a new shader is bound, only re-emit the user SGPR states. This is slightly more optimal. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31160>	2024-09-24 06:00:00 +00:00
Samuel Pitoiset	8beea85232	radv: rename shader_query_state to task_state Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31160>	2024-09-24 06:00:00 +00:00
Samuel Pitoiset	a0951bae70	radv: use only one user SGPR for all NGG state Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31160>	2024-09-24 06:00:00 +00:00
Samuel Pitoiset	3022282ba3	radv: make sure to re-emit shader query state when a task shader is bound This doesn't change anything in practice because if we have a task shader, we also have a mesh shader and the state was already re-emitted. Though, this will prevent a regression from the upcoming patches because the user SGPR layout will change. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31160>	2024-09-24 06:00:00 +00:00
Samuel Pitoiset	16341f41e1	radv: emit all shader related user SGPR states in one place This will allow us to use only one user SGPR for NGG shaders, and also further optimizations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31160>	2024-09-24 05:59:59 +00:00
Caio Oliveira	e1b74407bb	intel/brw: Only validate GRF boundary crossing restriction for GRFs Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31294>	2024-09-24 03:39:05 +00:00
Kenneth Graunke	878ae9708a	intel/brw: Don't include sync.nop in INTEL_DEBUG instruction counts In an earlier commit, I made us stop counting sync.nops in the shader statistics we use for shader-db (brw_debug_log_message) and fossil-db (stats->instructions = ...). However, I missed adjusting the printout for INTEL_DEBUG. Fixes: `1497f4e0c2` ("intel/fs: Don't include sync.nop in instruction count statistics") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31311>	2024-09-24 03:12:32 +00:00
Jason Macnak	c243970d50	gfxstream: use gralloc metadata in vkGetAHBPropertiesANDROID ... now that gralloc buffer metadata is more widely available and actually populated. Test: cvd start --gpu_mode=gfxstream_guest_angle_host_swiftshader Test: cts -m CtsGraphicsTestCases Test: cts -m CtsMediaCodecTestCases Test: cts -m CtsMediaDecoderTestCases Test: cts -m CtsViewTestCasesTest Test: Open Youtube in Webview Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31330>	2024-09-23 23:03:47 +00:00
Eric R. Smith	466df904b7	panfrost: Add back A8_UNORM format for valhall If A8_UNORM isn't specified in the format table, then it is emulated in the state tracker by RGBA8. This is suboptimal, both because it requires more memory, and because the blit gets more complicated (and in fact there's a bug currently in the blit code where we don't mask properly for GL_ALPHA). Fix this by adding an explicit A8_UNORM format entry. Fixes piglit test ext_framebuffer_multisample-blit-mismatched-formats. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31322>	2024-09-23 21:35:15 +00:00
Trigger Huang	7c01f70bdc	mesa: Fix AMD performance monitor implementation For cmd GL_PERFMON_RESULT_SIZE_AMD and GL_PERFMON_RESULT_AMD of glGetPerfMonitorCounterDataAMD(), the current implementation will return 0 if the result is not available on HW, but according to the sepc, if cmd is PERFMON_RESULT_SIZE_AMD, <data> will contain actual size of all counter results being sampled, instead of 0. And if cmd is PERFMON_RESULT_AMD, <data> will contain results. The spec doesn't require the application to wait for the result available on HW before executing cmd PERFMON_RESULT_SIZE_AMD and PERFMON_RESULT_AMD. So for cmd PERFMON_RESULT_SIZE_AMD, it should immediately return the correct result size for the counters enabled by glSelectPerfMonitorCountersAMD(), and for cmd PERFMON_RESULT_AMD, it should wait until the result is available on HW and then return the result. Without this fix, the Sample Usage in the spec will not work properly as it always gets size 0 when calling the cmd PERFMON_RESULT_SIZE_AMD V2: If SelectPerfMonitorCountersAMD is called on a monitor, then the result of querying for PERFMON_RESULT_SIZE will be 0. Signed-off-by: Trigger Huang <Trigger.Huang@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31179>	2024-09-23 21:01:18 +00:00
Marek Olšák	f8788b2a38	radeonsi: remove the make_texture_descriptor indirect function call Call gfx10_make_texture_descriptor from si_make_texture_descriptor and use si_make_texture_descriptor everywhere. This is more readable. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30904>	2024-09-23 20:34:13 +00:00
Marek Olšák	a578ca8388	radeonsi: rename hw_level -> view_level for readability Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30904>	2024-09-23 20:34:13 +00:00
Marek Olšák	f5b0f80de4	radeonsi: don't insert any barrier after the copy for PIPE_MAP_READ Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30904>	2024-09-23 20:34:13 +00:00
Marek Olšák	a7cb1433e3	radeonsi: use ACO on GFX11.5 with LLVM 18 or older to work around GPU hangs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30904>	2024-09-23 20:34:13 +00:00
Marek Olšák	1c156f7fa9	radeonsi: clean up set_log_context code for all aux contexts Instead of allocating it and then leaking it, store the log context in si_screen. Also, the log context was only set for "general" instead of all aux contexts. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30904>	2024-09-23 20:34:13 +00:00
Iván Briano	2e1c278e3d	anv: skip rt pipeline compile if we found all shaders When no pipeline cache is provided by the application and we rely on the internal one, cache hits are not counted as such. This was causing us to return COMPILE_REQUIRED on some cases where all shaders had been found in the cache, as well as some unnecessary extra processing in the case that we did have to compile the pipeline. Fixes: `1dacea10f3` ("anv: implement caching for ray tracing pipelines") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31298>	2024-09-23 19:57:53 +00:00
Iván Briano	1a45c8827b	anv: free shaders on rt pipeline compile error We have not yet added the shaders to the pipeline->shaders array at this point. If we couldn't compile (or were asked not to) the pipeline, we were leaking references to any shaders found in the cache. This would manifest as an assert on device destruction: vk_pipeline_cache_destroy: Assertion `cache->object_cache->entries == 0' failed. Fixes: `58c9f817cb` ("anv: fix pipeline executable properties with graphics libraries") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31298>	2024-09-23 19:57:53 +00:00
David Rosca	f263e6d242	radeonsi/vcn: Enable IB parsing with AMD_DEBUG=ib Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31275>	2024-09-23 19:25:09 +00:00
David Rosca	1459193b99	ac: Add VCN IB parser Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31275>	2024-09-23 19:25:08 +00:00
Eric Engestrom	ada6702af0	llvmpipe/ci: document regression Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31326>	2024-09-23 18:50:33 +00:00
Eric Engestrom	bc086fcbdb	lavapipe/ci: document regression Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31326>	2024-09-23 18:50:33 +00:00
Eric Engestrom	bb51cb3f0e	zink+nvk/ci: add flakes seen recently Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31325>	2024-09-23 18:26:20 +00:00
Eric Engestrom	10b83041b8	zink+nvk/ci: document spec@egl_ext_surface_compression@create as crashing Fixes: `213f5e9152` ("Uprev Piglit to e9ab30aeaed97b69868cf4d6d6a3f70f3b53c362") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31325>	2024-09-23 18:26:20 +00:00
Thong Thai	8da847560b	ci: partially emulate cdna devices using lower image opcodes Use the AMD_IMAGE_OPCODES=0 environment variable to test the lower image opcode code path used by AMD CDNA/compute devices without graphics. Runs a very limited subset of GL tests. Signed-off-by: Thong Thai <thong.thai@amd.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31180>	2024-09-23 17:46:33 +00:00
Benjamin Otte	9f612155fc	nvk: Don't emit critical messages during init vk_error() is used for application errors with VK_DEBUG_REPORT_ERROR_BIT_EXT. Don't emit those for old hardware or old kernels. Related: https://gitlab.gnome.org/GNOME/gtk/-/issues/7020 Fixes: `4db1bd5846` ("nvk/nvkmd: Implement dev and pdev for nouveau") Signed-off-by: Benjamin Otte <otte@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31289>	2024-09-23 16:52:30 +00:00
Boris Brezillon	dc1a7b94a8	pan/va: Fix nir_op_pack_uvec4_to_uint We don't have a generic v4i8 on Valhall, we have to lower it to two v2i8. Fortunately, bi_make_vec_to() hides the Bifrost/Valhall differences, so use that for nir_op_pack_uvec4_to_uint. Fixes: `934b0f1add` ("pan/bi: Respect swizzles for more vector ops") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31280>	2024-09-23 16:22:49 +00:00
Lionel Landwerlin	35ea8b6cd2	brw: disable null_rt only if color output does not affect other outputs We found out that some HW changes on Xe2 make the HW avoid reading the blend state if we're using the null_rt bit in the extended descriptor. Since the alpha_to_coverage bit resides in the blend state, that state is ignored and writes are going through to the depth/stencil buffers. Disable null_rt in the color outputs if the color outputs can affect other outputs (through alpha_to_coverage & omask). Fixes tests in this pattern on Xe2 : dEQP-VK.pipeline..multisample.alpha_to_coverage_no_color_attachment. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Backport-to: 24.2 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	b45ce7d43e	brw: move null_rt control up a layer We'll want to tune this setting based on other parameters. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Backport-to: 24.2 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	9b42215e0d	iris: ensure null render target for specific cases Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	badb3f6301	anv: Only flush render target cache when detecting RT changes We setup an empty render target when there are no color attachments, which effectively makes it a different surface state. In most cases the compiler will insert a null-rt bit in the extended descriptor which means the RT isn't even accessed. But in some cases like alpha-to-coverage output + depth/stencil write, we will access the render target because using the null-rt will prevent alpha-to-coverage from happening. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2bd304bc8f` ("anv: Skip the RT flush when doing depth-only rendering.") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:02 +00:00
Lionel Landwerlin	fb3ae17d96	anv: fix missing tracking for alpha-to-coverage runtime changes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9926aedc96` ("anv: enable EDS3 AlphaToCoverageEnable & RasterizationSamples") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31196>	2024-09-23 15:56:01 +00:00
Boris Brezillon	22841babee	panvk: Protect access to the virtual address heap Device memory can be allocated from different threads and thus requires locking when allocating/freeing virtual addresses. Fixes: `53fb1d99ca` ("panvk: Transition to explicit VA assignment on v10+") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31282>	2024-09-23 17:29:12 +02:00
Charmaine Lee	5bdcc290e3	svga: sync up with the latest svga include files Also imported vm_basic_types.h to make upstream sync up easier. Reviewed-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31290>	2024-09-23 15:05:15 +00:00
Konstantin Seurer	00c94e0cd4	radv: Workaround apps using ray tracing when it is unsupported Emitting bvh64_intersect_ray_amd will crash the compiler on pre-GFX10_3 hardware. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11786 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30886>	2024-09-23 14:02:28 +00:00
Georg Lehmann	0e21cd9e15	aco/gfx10+: work around non uniform ds_append wave64 result In wave64 for hw with native wave32, ds_append seems to be split in a load for the low half and an atomic for the high half, and other LDS instructions can be scheduled between the two. Which means the result of the low half is unusable because it might be out of date. I was only able to reproduce this issue in WGP mode, but be conservative and apply the workaround in CU mode too. Foz-DB Navi31: Totals from 13 (0.02% of 79395) affected shaders: Instrs: 7599 -> 7656 (+0.75%) CodeSize: 39708 -> 39972 (+0.66%) Latency: 83174 -> 83572 (+0.48%) InvThroughput: 8271 -> 8357 (+1.04%) Copies: 718 -> 717 (-0.14%) VALU: 3689 -> 3703 (+0.38%) SALU: 935 -> 965 (+3.21%) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11921 Fixes: `45e935800a` ("aco: implement nir_shared_append/consume_amd") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31301>	2024-09-23 13:17:58 +00:00

1 2 3 4 5 ...

181096 commits