fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 07:38:14 +02:00

Author	SHA1	Message	Date
Bas Nieuwenhuizen	c98e52f88a	amd/common,radeonsi: Move gfx10_format_table to common. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5291>	2020-06-03 00:17:00 +00:00
Oschowa	c310677a75	radv: Explicitly cast TIMESTAMP_NOT_READY value to uint32_t where needed. Fixes a clang warning. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Oschowa	663e8cb4e6	aco: Use correct reference type in for-range-loop. Fixes a clang warning. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Oschowa	7b1bc460fd	aco: Don't std::move temporary object. Fixes the following clang warning: mesa/src/amd/compiler/aco_optimizer.cpp:2928:15: warning: moving a temporary object prevents copy elision [-Wpessimizing-move] ctx.uses = std::move(dead_code_analysis(program)); Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Oschowa	536339b0dd	aco: Don't declare 'Block' as class, but define as struct. Fixes clang warnings. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Oschowa	c2a778ef0f	radv: Don't take absolute value of unsigned type. Fixes clang warnings. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5228>	2020-06-02 21:31:17 +00:00
Timur Kristóf	7d2fe60f1c	radv/aco: Always enable subgroup shuffle. It is now supported by both backends on all hw. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5223>	2020-06-02 21:12:13 +00:00
Timur Kristóf	045c9ffa7d	aco: Implement subgroup shuffle on GFX6-7. GFX6 and GFX7 don't have the ds_bpermute (or permute) instruction, but we would like to support subgroup shuffle on these old GPUs. So we introduce a new pseudo instruction which will be lowered to an "unrolled loop" that emulates bpermute on GFX6 and GFX7 using readlane instructions, while also respecting the exec mask thanks to v_cmpx. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5223>	2020-06-02 21:12:12 +00:00
Timur Kristóf	14a5021aff	aco/gfx10: Refactor of GFX10 wave64 bpermute. The emulated GFX10 wave64 bpermute no longer needs a linear_vgpr, so we don't consider it a reduction anymore. Additionally, the code is slightly reorganized in preparation for the GFX6 emulated bpermute. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5223>	2020-06-02 21:12:12 +00:00
Marek Olšák	c6c8a9bd55	ac/nir: support v2f16 derivatives Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	7c423dd721	ac/nir: set the second v_cvt_pkrtz argument to undef if it's unused Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	bfb95725aa	ac/nir: select v_cvt_pkrtz for all conversions from f32 to f16 for radeonsi Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	1d80015eaf	ac/nir: handle nir_op_[fiu]2[fiu]mp opcodes Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	70b6d54011	ac/nir: support 16-bit data in image opcodes Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	c3e0ba52a0	ac/nir: support 16-bit data in buffer_load_format opcodes Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	b819ba949b	ac/nir: remove type and num_channels args from ac_build_buffer_store_common They were only used for type overloading where we can just use the type of data. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	b98df7bf50	ac/nir: support vector types in the type suffix of overloaded intrinsics Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	e5ea87cde8	ac/nir: use more types from ac_llvm_context Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Marek Olšák	116ec85012	ac: rename has_double_rate_fp16 -> has_packed_math_16bit Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5003>	2020-06-02 16:29:25 -04:00
Dylan Baker	a8e2d79e02	meson: use gnu_symbol_visibility argument This uses a meson builtin to handle -fvisibility=hidden. This is nice because we don't need to track which languages are used, if C++ is suddenly added meson just does the right thing. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4740>	2020-06-01 18:59:18 +00:00
Timothy Arceri	e843303d6f	radv: fix regression with builtin cache If the ~/.cache dir already exists continue on without failing. Fixes: `cd61f5234d` ("radv: Handle failing to create .cache dir.") Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5249>	2020-05-30 04:01:28 +00:00
Samuel Pitoiset	9d645a19eb	radv/aco: enable VK_KHR_subgroup_extended_types on GFX8+ Should be working now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	e22567089c	aco: sign-extend input/identity for 32-bit reduce ops on GFX10 Because some 16-bit instructions are already VOP3 on GFX10, we use the 32-bit variants to remove the temporary VGPR and to use DPP with the arithmetic instructions. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	83dcd1690b	aco: allow gfx10_wave64_bpermute with 8-bit/16-bit input Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	8ece71507d	aco: allocate a temp VGPR for some 8-bit/16-bit reduction ops on GFX10 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	2e0ea9bcca	aco: implement 8-bit/16-bit reductions on GFX10 Some 16-bit instructions are VOP3 on GFX10 and we have to emit a 32-bit DPP mov followed by the ALU instruction. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	75a730ced5	aco: fix register allocation for subdword instructions on GFX10 Cc: 20.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5148>	2020-05-29 11:20:58 +00:00
Samuel Pitoiset	7503863fe2	radv/aco: enable VK_EXT_subgroup_size_control ACO should already support Wave32 on GFX10 with all shader stages and CTS pass. RADV currently only allows Wave32 with the compute shader stage. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5056>	2020-05-29 10:12:26 +02:00
Samuel Pitoiset	10c4a7cf59	spirv,radv,anv: implement no-op VK_GOOGLE_user_type This extension only allows HLSL shader compilers to optionally embed unambiguous type information which can be safely ignored by the driver. This fixes a crash with the recent Vulkan backend of Path Of Exile (it uses the extension without checking if it's supported). Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5237>	2020-05-28 17:30:24 +02:00
Rhys Perry	01ce7887bf	aco: fix 64-bit shared_atomic_exchange Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4880>	2020-05-28 10:34:03 +00:00
Rhys Perry	1f2fd9c62e	aco: don't reorder barriers in the scheduler Unless we're reordering it around a barrier of the same type. No shader-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4880>	2020-05-28 10:34:03 +00:00
Rhys Perry	e1900ee2c7	aco: preserve more fields when combining additions into SMEM Totals from 11 (0.01% of 127638) affected shaders: Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `93c8ebfa78` ('aco: Initial commit of independent AMD compiler') Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4880>	2020-05-28 10:34:03 +00:00
Rhys Perry	95d5c1b8a1	aco: check instruction format before waiting for a previous SMEM store Totals from 7 (0.01% of 127638) affected shaders: CodeSize: 40336 -> 40320 (-0.04%) Instrs: 7807 -> 7803 (-0.05%) Cycles: 118588 -> 118344 (-0.21%); split: -0.23%, +0.02% SMEM: 331 -> 339 (+2.42%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `1749953ea3` ('aco/gfx10: Wait for pending SMEM stores before loads') Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4880>	2020-05-28 10:34:03 +00:00
Rhys Perry	5ccc7c277c	aco: consider SDWA during value numbering Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `23ac24f5b1` ('aco: add missing conversion operations for small bitsizes') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5164>	2020-05-28 09:55:58 +00:00
Rhys Perry	8aa98cebc1	aco: fix interaction with 3f branch workaround and p_constaddr The offset was incorrect if we inserted a nop before the p_constaddr. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `93c8ebfa` ('aco: Initial commit of independent AMD compiler') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5164>	2020-05-28 09:55:58 +00:00
James Zhu	a91306677c	ac/gpu_info: Correct Arcturus cu bitmap The cu bitmap in amd gpu info structure is 4x4 size array, and it's usually suitable for Vega ASICs which have a 4x2 SE/SH layout. But for Arcturus, SE/SH layout is changed to 8x1. To mostly reduce the impact, we make it compatible with current bitmap array as below: SE4,SH0 --> cu_bitmap[0][1] SE5,SH0 --> cu_bitmap[1][1] SE6,SH0 --> cu_bitmap[2][1] SE7,SH0 --> cu_bitmap[3][1] Signed-off-by: James Zhu <James.Zhu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5212>	2020-05-27 10:49:02 -04:00
Marek Olšák	2a3806ffa3	amd: replace SH -> SA (shader array) in comments Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5184>	2020-05-26 06:00:54 -04:00
Marek Olšák	2cf46f2e3d	ac/gpu_info: replace num_good_cu_per_sh with min/max_good_cu_per_sa Perf counters use the new max number. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5184>	2020-05-26 06:00:54 -04:00
Bas Nieuwenhuizen	be784cc77b	radv: Implement vkGetSwapchainGrallocUsage2ANDROID. This was implemented in version 6 of the VK_ANDROID_native_buffer extension and we only implement version 5. However, the Android Vulkan loader only checks whether vkGetInstanceProcAddr for the function is not NULL. This all went wrong when we switched to the layer code from ANV. Because the function may now be different per device, it adds fallback functions that dispatch to the dispatch table. So if we didn't implement the function we still returned a pointer to the dispatch function, which made the Android Vulkan loader believe it was supported. Dispatch functions: `d555794f30/src/amd/vulkan/radv_entrypoints_gen.py (L328)` Fixes: `d555794f30` "radv: update entrypoints generation from ANV" Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2936 Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5198>	2020-05-25 15:34:44 +00:00
Bas Nieuwenhuizen	a51ab5f956	radv: Do not close fd -1 when NULL-winsys creation fails. Fixes: `cd6ec2b1ab` "radv: implement a dummy winsys for creating devices without AMDGPU" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5181>	2020-05-25 11:12:07 +00:00
Bas Nieuwenhuizen	cd0c5b64cc	radv: Remove dead code. pool is always non-NULL, and is also accessed before this check in the function, so remove the pool = NULL case. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5181>	2020-05-25 11:12:07 +00:00
Bas Nieuwenhuizen	cd61f5234d	radv: Handle failing to create .cache dir. Fixes: `f4e499ec79` "radv: add initial non-conformant radv vulkan driver" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5181>	2020-05-25 11:12:07 +00:00
Bas Nieuwenhuizen	906435fb0e	radv/winsys: Remove extra sizeof multiply. The pointer is already uint64_t*, so the sizeof was too much ... Fixes: `eeff7e1154` "radv: Add userspace fence buffer per context." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5181>	2020-05-25 11:12:07 +00:00
Samuel Pitoiset	b3c0f82841	radv: advertise VK_AMD_texture_gather_bias_lod Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5147>	2020-05-25 08:51:10 +02:00
Samuel Pitoiset	2e265b94a2	radv: add support for querying which formats support texture gather LOD Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5147>	2020-05-25 08:51:10 +02:00
Samuel Pitoiset	94570e87bd	aco: add support for bias/lod with texture gather Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5147>	2020-05-25 08:51:10 +02:00
Samuel Pitoiset	e99c818cf0	ac/nir: add support for bias/lod with texture gather Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5147>	2020-05-25 08:51:10 +02:00
Samuel Pitoiset	5bc18b79a4	radv: advertise shaderDeviceClock on GFX8+ Unsupported on GFX6-GFX7. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5117>	2020-05-24 20:37:59 +02:00
Samuel Pitoiset	14292310d9	ac/nir: implement nir_intrinsic_shader_clock with device scope Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5117>	2020-05-24 20:37:58 +02:00
Samuel Pitoiset	b034f6cf2a	ac/nir: fix shader clock with subgroup scope The compiler should emit s_memtime instead of s_memrealtime for the subgroup scope. I don't know what this LLVM 9 check was for but LLVM 8 also has this amdgcn intrinsic. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5117>	2020-05-24 20:37:54 +02:00

5269 commits