fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 18:08:15 +02:00

Author	SHA1	Message	Date
Hyunjun Ko	9190cc9b15	tu,radv: fix potentially wrong offset of flexible array. v2. Remove redundant memset and make the expression simpler. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5703>	2020-07-03 00:45:16 +00:00
Samuel Pitoiset	ab9ecb607b	radv,vulkan: add a new x11 wsi drirc workaround for DOOM Eternal DOOM Eternal happily creates a swapchain with 2 images for IMMEDIATE. This fixes a 10% performance issue with RADV. Cc: 20.1 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5704>	2020-07-02 08:31:57 +00:00
Samuel Pitoiset	11a6a96f8a	radv: fix wide lines with multisample enabled When set, EXPAND_LINE_WIDTH expands the line width by 1/cos(a), where a is the minimum angle from horizontal or vertical. This seems required by OpenGL line rasterization but not by Vulkan. Similar to what AMDVLK and AMDGPU-PRO do for AA wide lines. This fixes dEQP-VK.rasterization.interpolation_multisample__bit.lines_wide. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5698>	2020-07-02 07:51:48 +00:00
Samuel Pitoiset	53372175c9	radv: fix wide points and lines The maximum value for both points and lines is 65536. This doesn't fix anything known (just found this while looking in that area). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5696>	2020-07-02 08:26:03 +02:00
Marek Olšák	2866a6f78d	ac/gpu_info: fix num_physical_sgprs_per_simd for gfx10 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5524>	2020-06-30 10:56:41 +00:00
Samuel Pitoiset	7a5e6fd25f	radv: add support for MRTs compaction to avoid holes SPI_SHADER_COL_FORMAT allocates export memory and CB_SHADER_MASK map them to higher MRTs if necessary. The hardware allows to remap MRTs to avoid holes somehow. For example, if we have a scenario where MRT0 is unused and only MRT1 and MRT2 are used, SPI_SHADER_COL_FORMAT is 0x77 and CB_SHADER_MASK/CB_TARGET_MASK are 0x770 (this assumes SPI_SHADER_UINT16_ABGR is set). This allows us to remove one workaround that was added for fixing GPU hangs with DXVK. I think this is because SPI_SHADER_COL_FORMAT expects contiguous MRTs to be allocated. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5434>	2020-06-29 08:43:14 +00:00
Samuel Pitoiset	4e0dcbb880	radv: use SPI_SHADER_ZERO for non-written color attachments When colorWriteMask is 0 we can assume that this color attachment is unused. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5434>	2020-06-29 08:43:14 +00:00
Samuel Pitoiset	18b42eebd5	radv: rework 8/16-bit color attachment formats detection To prepare for MRTs compaction. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5434>	2020-06-29 08:43:14 +00:00
Samuel Pitoiset	76ee45d3a8	radv: adjust CB_SHADER_MASK for dual-source blending in the shader info pass Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5434>	2020-06-29 08:43:14 +00:00
Samuel Pitoiset	26a48d8d35	radv: enable VK_AMD_shader_ballot on GFX6-7 with both compiler backends It gives +1-2 FPS with Doom Eternal on Pitcairn. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5659>	2020-06-29 07:40:05 +00:00
Daniel Schürmann	5c0f82b0d7	aco: fix partial copies on GFX6/7 While we don't allow partial subdword copies, we still need to be able to split 64bit registers Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5663>	2020-06-26 19:21:57 +00:00
Samuel Pitoiset	f13d79f519	radv: remove the load/store workaround for Monster Hunter World with LLVM Now that ACO is default, this is pointless. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5658>	2020-06-26 14:42:44 +02:00
Samuel Pitoiset	a30ad8cb23	radv: remove the shader ballot workaround for Youngblood with LLVM Now that ACO is default, this is now pointless. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5658>	2020-06-26 14:42:42 +02:00
Marek Olšák	c7680625c3	ac,winsys/amdgpu: align IBs the same as the kernel Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5603>	2020-06-26 07:02:57 +00:00
Marek Olšák	556f4458fe	amd: add proper definitions for NOP packets Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5603>	2020-06-26 07:02:57 +00:00
Daniel Schürmann	63e1e7209c	radv: enable ACO by default No more dragons have been seen, caution is still required... Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5445>	2020-06-25 15:16:30 +02:00
Daniel Schürmann	db0afb3800	radv: change use_aco -> use_llvm We are about to make ACO the default backend. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5445>	2020-06-25 15:16:28 +02:00
Daniel Schürmann	b78f64507e	radv: introduce RADV_DEBUG=llvm option This option enables the LLVM compiler backend to be used for shader compilation Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5445>	2020-06-25 15:16:23 +02:00
Samuel Pitoiset	a102896cff	radv: lower 64-bit dfloor on GFX6 for fixing precision issues GFX6 doesn't support v_floor_f64 and the precision of v_fract_f64 which is used to implement 64-bit floor is less than what Vulkan requires. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5609>	2020-06-25 12:09:08 +00:00
Samuel Pitoiset	c84f11e7b6	radv: lower 64-bit drcp/dsqrt/drsq for fixing precision issues The hardware precision of v_rcp_f64, v_sqrt_f64 and v_rsq_f64 is less than what Vulkan requires. This lowers using the Goldschmidt's algorithm to improve precision. Fixes dEQP-VK.glsl.builtin.precision_double.* on both compiler backends. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5609>	2020-06-25 12:09:08 +00:00
Rhys Perry	4fc0499049	aco: remove outdated assert in handle_operands() "target" is no longer expected to be completely inside "swap". Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5626>	2020-06-24 20:38:35 +00:00
Rhys Perry	7cad27831d	aco: ignore blocked registers when checking edges in get_reg_impl() If the only two registers available are consecutive and used by killed operands, both of them will be blocked and fail the edge check. Totals from 903 (0.66% of 135946) affected shaders: VGPRs: 30892 -> 30884 (-0.03%) CodeSize: 1584468 -> 1584044 (-0.03%); split: -0.05%, +0.02% MaxWaves: 14374 -> 14378 (+0.03%) Instrs: 306482 -> 306399 (-0.03%); split: -0.06%, +0.03% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5626>	2020-06-24 20:38:35 +00:00
Samuel Pitoiset	91a82d0069	radv: fix checking the return value of cs_finalize() cs_finalize() now returns a Vulkan error code and VK_SUCCESS is 0. Fixes: `64a92ef7a2` ("radv/winsys: Distinguish device/host memory errors.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5629>	2020-06-24 15:50:06 +02:00
Bas Nieuwenhuizen	aa35670fd0	radv: Make radv_alloc_shader_memory static. Just a cleanup. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5578>	2020-06-24 13:00:02 +00:00
Bas Nieuwenhuizen	64a92ef7a2	radv/winsys: Distinguish device/host memory errors. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5578>	2020-06-24 13:00:02 +00:00
Bas Nieuwenhuizen	a5cb88eea4	radv: Handle mmap failures. Which can happen if we have to many mmaps active in the process. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5578>	2020-06-24 13:00:02 +00:00
Bas Nieuwenhuizen	04765e6a9a	radv/winsys: Deal with realloc failures in BO lists. Otherwise if realloc fails we silently try to use it. Make recording fail instead. CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5578>	2020-06-24 13:00:02 +00:00
Rhys Perry	519ddfd312	aco: improve vectorization of 8/16-bit loads/stores Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Rhys Perry	ddffcf3627	aco: fix when sub-dword create_vector operand cannot be placed perfectly Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Daniel Schürmann	91fd53884d	aco: don't allow partial copies on GFX6/7 These are not supported due to missing SDWA instructions Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Daniel Schürmann	76b5d72921	aco: align swap operations to 4 bytes on GFX6/7 GFX6/7 can only swap full registers Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Rhys Perry	91d7e40176	aco: don't create byte-aligned short loads The ISA docs don't seem to say if this is allowed, so just assume short loads require short alignment. In practice, the only situation this should affect are byte-aligned u8vec2 loads. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Rhys Perry	c3259b6e6a	aco: add missing bld.scc() in byte_align_scalar() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Rhys Perry	a0f6ca4393	aco: don't store byte-aligned short stores The ISA docs don't seem to say if this is allowed, so just assume short stores require short alignment. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Rhys Perry	a18da83d18	aco: fix copy+paste error in split_buffer_store Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Rhys Perry	841fdfcd45	radv/aco,aco: allow SMEM SSBO loads on GFX6/7 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:28 +00:00
Rhys Perry	35b5e1fc7c	aco: allow SMEM for some sub-dword accesses Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:27 +00:00
Rhys Perry	c702f8ed15	aco: only use SMEM if we can prove it's safe Totals from 26 (0.02% of 127638) affected shaders: SGPRs: 1680 -> 1664 (-0.95%) VGPRs: 1492 -> 1504 (+0.80%) CodeSize: 233140 -> 233016 (-0.05%); split: -0.09%, +0.04% Instrs: 47121 -> 47114 (-0.01%); split: -0.08%, +0.06% VMEM: 4930 -> 4655 (-5.58%); split: +0.12%, -5.70% SMEM: 2030 -> 2001 (-1.43%); split: +3.79%, -5.22% VClause: 891 -> 947 (+6.29%) SClause: 876 -> 816 (-6.85%) Copies: 4734 -> 4716 (-0.38%); split: -0.40%, +0.02% Branches: 2048 -> 2047 (-0.05%) PreSGPRs: 1400 -> 1396 (-0.29%) PreVGPRs: 1440 -> 1443 (+0.21%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:27 +00:00
Rhys Perry	0cfee26bee	radv: fix image variable types in meta shaders We write to these variables using image intrinsics. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5207>	2020-06-24 10:52:27 +00:00
Samuel Pitoiset	0aca04afa5	aco: fix printing ASM on GFX6-7 again Checking errno is actually wrong because it's only updated if popen() fails (ie. NULL). One solution is to check if the first line is empty. Fixes: `c95d258d1b` ("aco: fix printing ASM on GFX6-7 if clrxdisasm is not found") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5591>	2020-06-23 07:45:03 +00:00
Marek Olšák	2b8b62c55b	ac/nir: fix 64-bit division for GL CTS This fixes: KHR-GL45.gpu_shader_fp64.builtin.mod_* Fixes: `ba2ec1f3` "ac/nir: use llvm.amdgcn.rcp in ac_build_fdiv()" Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5531>	2020-06-23 04:46:55 +00:00
Eric Engestrom	04e8eaf4e8	util: rename xmlpool.h to driconf.h To make it clearer what it is and does. Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5440>	2020-06-22 21:50:12 +00:00
Samuel Pitoiset	83d2a73b73	aco: improve validation checks for readlane/writelane This allows literals for the lane select on GFX10+. The doc says that is should be a SGPR or a constant but VOP3 on GFX10+ allows literals. Some later validation code checks if literals are allowed anyways. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5010>	2020-06-22 11:24:27 +00:00
Daniel Schürmann	f03a5f6cac	radv/aco: implement logic64 instead of lowering to make use of the scalar ALU Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5527>	2020-06-22 10:59:45 +00:00
Samuel Pitoiset	c95d258d1b	aco: fix printing ASM on GFX6-7 if clrxdisasm is not found Fixes some dEQP-VK.pipeline.executable_properties.* which expect a valid string to be returned. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5560>	2020-06-22 10:38:33 +02:00
Rhys Perry	f4a643f65e	radv: add new drirc option radv_no_dynamic_bounds Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5569>	2020-06-19 23:53:47 +00:00
Pierre-Eric Pelloux-Prayer	993c64e6fe	ac/llvm: load 1 byte at a time if unaligned on gfx10 If buffer or stride is unaligned we use the same trick as on gfx6: load 1 byte at a time and recompose the output if needed. This change fixes lots of deqp/glcts tests: - dEQP-GLES2.functional.draw.random.1, 10, ... - dEQP-GLES2.functional.vertex_arrays.multiple_attributes.stride.3_float2_0_float2_0_float2_17, ... - dEQP-GLES2.functional.vertex_arrays.single_attribute.first.byte_first24_offset1_stride2_quads256, ... - dEQP-GLES2.functional.vertex_arrays.single_attribute.strides.buffer_0_17_byte2_vec4_dynamic_draw_quads_1, ... - dEQP-GLES31.functional.draw_indirect.random.14, ... Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5502>	2020-06-19 09:20:16 +02:00
Samuel Pitoiset	2ac5cce1a1	radv: require LLVM 11+ for GFX 10.3 if not using ACO Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5389>	2020-06-19 08:18:43 +02:00
Samuel Pitoiset	dc698fb5dc	radv: add support for Sienna Cichlid Bits copied from RadeonSI. Totally untested. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5389>	2020-06-19 08:18:41 +02:00
Samuel Pitoiset	8c144482ea	aco: replace == GFX10 with >= GFX10 where it's needed Assume the GFX10.3 ISA is similar to GFX10 which is likely (except possible minor changes and new instructions for raytracing). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5389>	2020-06-19 08:18:39 +02:00

1 2 3 4 5 ...

5443 commits