fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-20 04:30:22 +01:00

Author	SHA1	Message	Date
Georg Lehmann	2f4e53b22a	aco: fix detecting sgprs read by SMEM hazard s_waitcnt_lgkmcnt is SOPK, not SOPP and there are other SOPK instructions that don't mitigate the hazard. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26163>	2023-11-15 12:35:32 +00:00
Georg Lehmann	e49c413a86	aco: use null operand for SOPK s_waitcnt Both null def and op result in the same correct encoding, but these instructions optionally read a sgpr, so it makes more sense to use an operand. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26163>	2023-11-15 12:35:32 +00:00
Job Noorman	bcf0425f7f	ir3: correctly set bit size for 64b constant @load_ubo When lowering @load_constant to @load_ubo, the bit size is currently hard-coded to 32. This causes validation errors when lowering a constant with a 64b bit size. This patch fixes this by setting the @load_ubo bit size correctly for 64b constants. This 64b load is later lowered to a 32b load by ir3_nir_lower_64b_intrinsics. Fixes Piglit test: - spec@arb_gpu_shader_fp64@execution@fs-indirect-temp-double-src This patch has no impact on shader-db. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26191>	2023-11-15 11:58:22 +00:00
Samuel Pitoiset	bb92c34c28	radv: set radv_zero_vram=true for Unreal Engine 4/5 Unreal Engine seems to rely on uninitialized memory and RADV_DEBUG=zerovram fixes a bunch of issues. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9025 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9380 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9026 Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26188>	2023-11-15 11:24:31 +00:00
Samuel Pitoiset	627d593443	radv: fix registering queues for RGP with compute only This crashes if the graphics queue isn't created. Fixes: `930e77e903` ("radv/sqtt: add support for queue info") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10136 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26183>	2023-11-15 10:41:50 +00:00
Matt Turner	b66b299eda	r600: Add missing dep on git_sha1.h Bug: https://bugs.gentoo.org/917116 Fixes: `3ab51c7ebd` ("r600: Add callbacks for get_driver_uuid and get_device_uuid") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26195>	2023-11-15 10:17:05 +00:00
Karol Herbst	3916ee05b0	rusticl/api: workaround DPCPP fetching clSetProgramSpecializationConstant Nobody has to advertize it as an extension, but here we are. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25701>	2023-11-15 08:34:57 +00:00
Karol Herbst	924c8e7bcd	vtn: add hack for system values placed in CrossWorkgroup memory Upstream bug: https://github.com/intel/llvm/issues/6703 Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25701>	2023-11-15 08:34:57 +00:00
Karol Herbst	41f814df6f	nir: allow vec derefs on system values There is no real reason to prevent this as far as I know. And some of the SPIR-V generated by DPCPP is running into this. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25701>	2023-11-15 08:34:57 +00:00
Faith Ekstrand	23e1f3c373	nvk: Use nak_shader_info natively Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:13 +00:00
Faith Ekstrand	c074ea6215	nak: Handle the num_gpr offsetting inside nak This makes the thing in the nak_shader_info exactly the thing that gets plugged into the hardware. Makes the driver a bit simpler. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:13 +00:00
Faith Ekstrand	d8551cd328	nak: Add a writes_layer bit to nak_shader_info::vtg Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	a232050204	nak: Move clip, cull, and XFB into a nak_shader_info.vtg Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	440adf7970	nak: Properly prefix nak_xfb_info Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	4e6e814f5e	nak: Rename TLS to SLM Shader Local Memory is what NVIDIA calls it in the shader header docs as well as the command stream headers. Better to be consistent even if it gets my Intel brain confused. (Intel uses SLM for shared memory.) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	a946071546	nvk: Use nak_fs_key instead of rolling our own Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	0f086401e3	nvk: Move even more lowering into nvk_codegen.c At this point, we're fully trusting NAK to do its own lowering and we only lower stuff in nvk_shader.c if it's relevant for Vulkan. This also assumes that NAK is already doing the right thing everywhere. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	67bb8e8165	nvk: Move the guts of nvk_compile_nir() to nvk_codegen.c Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	0405f494e8	nvk: Move the optimization loop to the nvk_codegen.c We also call it from nak_preprocess_nir and lower var copies there. NAK should already be doing this for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	7f8fbacb8a	nvk: Move a bunch of codegen-specific lowering to helpers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	c3a44f6264	nvk: Add a codegen helper for nir_shader_compiler_options Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	845e7d2911	nvk: Only lower outputs to temporaries Also, move it up to right after we parse the SPIR-V and remove some now unnecessary clean-up passes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	26bb5f4972	nak/nir: Lower indirect FS inputs Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	e507d70333	nvk: Handle load_first_vertex in nvk_nir_lower_descriptors() Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
Faith Ekstrand	82061b1b9d	nvk: Only advertise VK_KHR_shader_terminate_invocation if using NAK Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26197>	2023-11-15 02:24:12 +00:00
David Rosca	fcfa68a632	Revert "frontends/va: Alloc interlaced surface for interlaced pics" This reverts commit `578e10e157`. The only reason for reallocating surfaces as interlaced (on drivers that supports both progressive and interlaced) was deinterlacing with postproc filter, but that now also supports interleaved surfaces. With this change interlaced surfaces are no longer used on radeonsi. Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26174>	2023-11-15 01:44:29 +00:00
David Rosca	eafeff6302	gallium/auxiliary/vl: Support interleaved input in deinterlace filter This adds support for deinterlacing interleaved surfaces (both fields interleaved together instead of as separate layers). Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26174>	2023-11-15 01:44:29 +00:00
David Rosca	35b0ccd855	gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane This fixes incorrect chroma plane position when x0/y0 is not zero. Fixes: `001358a97c` ("vl/compositor: add a new function for YUV deint") Acked-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26123>	2023-11-15 01:12:01 +00:00
David Rosca	e9091b1f5c	gallium/auxiliary: Fix coordinates clamp in util_compute_blit Fixes: `7c8e1596d6` ("gallium/auxiliary: Fix util_compute_blit half texel offset with scaling") Acked-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26123>	2023-11-15 01:12:01 +00:00
David Rosca	ef0546152f	gallium/auxiliary/vl: Fix coordinates clamp in compute shaders Fixes: `a6a43963ed` ("gallium/auxiliary/vl: Clamp coordinates in compute shaders") Acked-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26123>	2023-11-15 01:12:01 +00:00
Jesse Natalie	cd0cff951a	nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics The code here handled stores of actual 3-byte values (8-bit, 3-component), but didn't correctly handle stores of larger 8-bit vectors that were constrained by write mask to just 3 bytes. In that case, the pad-to-vec4 step was unnecessary and problematic. Seen in CL CTS test_basic vector_swizzle test group for char3 with CLOn12. Fixes: `c70d94a8` ("nir_lower_mem_access_bit_sizes: Support unaligned stores via a pair of atomics") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26034>	2023-11-14 21:10:30 +00:00
Helen Koike	bff7e4b69d	ci/zink: add spec@ext_timer_query@time-elapsed to flakes Add the following flake to zink-anv-tgl-flakes.txt spec@ext_timer_query@time-elapsed See https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25861#note_2140498 Signed-off-by: Helen Koike <helen.koike@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25885>	2023-11-14 17:29:30 +00:00
Faith Ekstrand	618bdb8571	nak: Rework FS input interpolation This gives FS I/O the same treatment as we did for vertex attributes in that we now have a NIR intrinsic which pretty closely matches the hardware and we lower to that before going into NAK. This gives us a bit more control in the NIR. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26181>	2023-11-14 16:38:03 +00:00
Faith Ekstrand	d3c5688cf5	nak: Plumb the nak_compiler through to lower_fs_input_intrin Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26181>	2023-11-14 16:38:02 +00:00
Faith Ekstrand	f5ba0751e2	nak: Make encode_sm75 a method of Shader Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26181>	2023-11-14 16:38:02 +00:00
Faith Ekstrand	a6376705e4	nak: Make ALD/AST.PHYS a boolean The generic flags field was originally copied from codegen but a boolean makes way more sense. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26181>	2023-11-14 16:38:02 +00:00
Faith Ekstrand	8e00ee6fe8	nak: Drop OpAtomCas in favor of OpAtom with atom_op == CmpExch Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26181>	2023-11-14 16:38:02 +00:00
Faith Ekstrand	ea453b373d	nak: Fix copy-prop for OpPLop3 sources Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26181>	2023-11-14 16:38:02 +00:00
Faith Ekstrand	a65518b625	nvk: Free NAK shaders Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26181>	2023-11-14 16:38:02 +00:00
Jesse Natalie	2f1cb79968	d3d12: GL4.5 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26180>	2023-11-14 16:04:58 +00:00
Jesse Natalie	5a5178d5a4	d3d12: Fix MSAA-disabling pass; sample mask should be 0 for helper lanes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26180>	2023-11-14 16:04:58 +00:00
Jesse Natalie	ba06542c7b	d3d12: Handle cull distance as an XFB target Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26180>	2023-11-14 16:04:58 +00:00
Jesse Natalie	263b56051d	d3d12: PRIMITIVES_GENERATED for stream > 0 should only be an SO query Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26180>	2023-11-14 16:04:58 +00:00
Tatsuyuki Ishi	538ca7801a	radv: Use shader part caching helpers for VS prolog and PS/TCS epilog. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26028>	2023-11-14 13:45:22 +00:00
Tatsuyuki Ishi	611545fbfe	radv: Implement helpers for shader part caching. Currently, shader part caching logic is duplicated between VS prolog and PS/TCS epilogs. This commit introduces a common abstraction to deduplicate the code. Additionally, there are a few design decisions that diverts from the current implementation: 1. A simple mutex is used instead of reader-writer lock. Prolog/epilog constructions are serialized, removing the need to free duplicate objects in case of a race. 2. A CS-local cache is used to quickly lookup an entry without holding a lock. This eliminates locking in over 99% of cases. 3. A set is used to reduce number of allocations. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26028>	2023-11-14 13:45:22 +00:00
Danylo Piliaiev	3cd6bb3e5d	tu: Add a725 workaround dispatch at the start of each cmdbuf Blob executes a special compute dispatch at the start of each command buffers. We copy this dispatch as is. At this point we don't know what this workaround is for. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25888>	2023-11-14 13:14:11 +00:00
Danylo Piliaiev	37f11ff1d4	freedreno/devices: Support Adreno 725 For 0x07030002 chip id different names are returned on different phones: Adreno730v3 or Adreno725v1. Settle on 725 to disambiguate them. The only difference from base 730 is that it has conditional execution of compute shader at the start of every command buffer. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25888>	2023-11-14 13:14:11 +00:00
Danylo Piliaiev	28f187b9a7	tu: Return error when GPU is unsupported Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25888>	2023-11-14 13:14:11 +00:00
Danylo Piliaiev	a669147689	tu: Always print startup failure messages If we encounter an error during the startup we always want to have it in the logs to quickly diagnose an issue from user attached logs. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25888>	2023-11-14 13:14:11 +00:00
LingMan	76996e2a94	rusticl: Use the `from_raw_parts` wrappers Deduplicates some safety checks and ensures we didn't forget one. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26157>	2023-11-14 12:31:31 +00:00

1 2 3 4 5 ...

167372 commits