fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 13:58:05 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	4cacba0c86	radv/gfx10: fix the vertex order for triangle strips emitted by a GS My fix wasn't totally correct as pointed out by Marek. Ported from RadeonSI. Fixes: `deafe4cc58` ("radv/gfx10: fix primitive indices orientation for NGG GS") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-04 08:28:57 +01:00
Samuel Pitoiset	dac6bd29ae	radv: simplify a check in radv_fixup_vertex_input_fetches() The number of loaded channels should always be > 0 now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-04 08:04:05 +01:00
Samuel Pitoiset	3b51259f06	radv: remove dead shader input/output variables No pipeline-db changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-04 08:04:05 +01:00
Rhys Perry	35fab1ba33	radv: set writes_memory for global memory stores/atomics Fixes: `13ab63bb62` ('radv: Implement VK_EXT_buffer_device_address.') Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-12-02 11:47:12 +00:00
Samuel Pitoiset	c105e6169c	radv,ac/nir: lower deref operations for shared memory This shouldn't introduce any functional changes for RadeonSI when NIR is enabled because these operations are already lowered. pipeline-db (NAVI10/LLVM): SGPRS: 9043 -> 9051 (0.09 %) VGPRS: 7272 -> 7292 (0.28 %) Code Size: 638892 -> 621628 (-2.70 %) bytes LDS: 1333 -> 1331 (-0.15 %) blocks Max Waves: 1614 -> 1608 (-0.37 %) Found this while glancing at some F12019 shaders. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-29 21:58:18 +01:00
Bas Nieuwenhuizen	48fc65413c	radv: Fix timeline semaphore refcounting. Was totally broken ... Removed two if(point) {} because point is always non-NULL and we were counting on that already for counting, since we NULL our references to semaphores without active point earlier. Fixes: `4aa75bb3bd` "radv: Add wait-before-submit support for timelines." Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2137 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-28 23:46:09 +01:00
Bas Nieuwenhuizen	4cde0e04e3	radv: Unify max_descriptor_set_size. They were out of sync. Besides syncing, lets ensure they never diverge again. Fixes: `8d2654a419` "radv: Support VK_EXT_inline_uniform_block." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-28 12:06:44 +01:00
Samuel Pitoiset	d347f2805d	radv: add more constants to avoid using magic numbers Trivial. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-28 10:59:14 +01:00
Samuel Pitoiset	a24f1c8f7f	radv: enable VK_KHR_shader_subgroup_extended_types on GFX10 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-27 07:42:44 +01:00
Samuel Pitoiset	86a5fbfd4a	radv: fix enabling sample shading with SampleID/SamplePosition When a fragment shader includes an input variable decorated with SampleId or SamplePosition, sample shading should be enabled because minSampleShadingFactor is expected to be 1.0. Cc: 19.2, 19.3 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-27 07:22:54 +01:00
Bas Nieuwenhuizen	02375b8436	radv: Enable VK_KHR_buffer_device_address. Still no capture/replay or multi device support. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-26 11:59:52 +00:00
Samuel Pitoiset	34dd4251e2	radv: fix reporting subgroup size with VK_KHR_pipeline_executable_properties Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-26 10:48:48 +01:00
Bas Nieuwenhuizen	25bc9102d8	radv: Allocate cmdbuffer space for buffer marker write. Fixes: `946193ae00` "radv: add support for VK_AMD_buffer_marker" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-26 09:35:02 +00:00
Marek Olšák	f671cc4d95	ac: set swizzled bit in cache policy as a hint not to merge loads/stores LLVM now merges loads and stores for all opcodes, so this must be set. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-25 16:48:27 -05:00
Samuel Pitoiset	2af39c719e	radv: select the depth decompress path based on the aspect mask Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-25 16:29:23 +01:00
Samuel Pitoiset	905c005561	radv: create decompress pipelines for separate depth/stencil layouts No functional changes as the driver still uses the depth+stencil pipeline. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-25 16:29:21 +01:00
Samuel Pitoiset	faa58201f3	radv: rework creation of decompress/resummarize meta pipelines This refactoring will help for creating more decompress pipelines. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-25 16:29:18 +01:00
Samuel Pitoiset	8f0fb38825	radv: set the image view aspect mask before resolves No functional changes, but it will be used to decompress separate depth/stencil aspects. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-25 16:29:16 +01:00
Samuel Pitoiset	9dec90b7bc	radv: set the image view aspect mask during subpass transitions No functional changes because the aspect mask is still not used during image transitions but it will be needed for the separate depth/stencil aspects logic. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-25 16:29:13 +01:00
Rhys Perry	b3a3e4d1d2	radv: set alignment for load_ssbo/store_ssbo in meta shaders Otherwise, nir_intrinsic_align() will assert when called on the intrinsics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-25 13:59:11 +00:00
Connor Abbott	bb78f9b4e4	aco: Use common argument handling Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-11-25 14:17:51 +01:00
Connor Abbott	e7f4cadd02	radv: Replace supports_spill with explict_scratch_args The former was always true and hence dead code. We will want to explicitly declare the ring offset register with ACO, but we also want to declare the scratch offset too, and we can't try to disable it since ACO also supports spilling and the determination of whether spilling has to happen occurs well after setting up registers. So replace supports_spill with something that will actually be used for ACO. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-25 14:17:51 +01:00
Connor Abbott	b45c54ff8d	aco: Use radv_shader_args in aco_compile_shader() Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-11-25 14:17:51 +01:00
Connor Abbott	66c703b3e8	radv: Move argument declaration out of nir_to_llvm Now it's executed for ACO too. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-25 14:17:51 +01:00
Connor Abbott	3b143369a5	ac/nir, radv, radeonsi: Switch to using ac_shader_args Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com>	2019-11-25 14:17:10 +01:00
Connor Abbott	43da33c169	radv: Rename ac_arg_regfile We'll duplicate this in a header file in the next commit, and then remove the original enum. Just rename it temporarily so that things keep building. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-25 14:12:46 +01:00
Samuel Pitoiset	b99295fb33	radv: disable subgroup shuffle operations on GFX10 They are broken like on GFX6-GFX7. It seems better to disable them instead of enabling a broken feature. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-25 08:03:24 +01:00
Timothy Arceri	f54c4e85ce	radv: create a fresh fork for each pipeline compile In order to prevent a potential malicious pipeline tainting our secure compile process and interfering with successive pipelines we want to create a fresh fork for each pipeline compile. Benchmarking has shown that simply forking on each pipeline creation doubles the total time it takes to compile a fossilize db collection. So instead here we fork the process at device creation so that we have a slim copy of the device and then fork this otherwise idle and untainted process each time we compile a pipeline. Forking this slim copy of the device results in only a 20% increase in compile time vs a 100% increase. Fixes: `cff53da3` ("radv: enable secure compile support")	2019-11-25 10:10:14 +11:00
Timothy Arceri	1663bb1f77	radv: add a secure_compile_open_fifo_fds() helper This will be used to create a communication pipe between the user facing device and a freshly forked (per pipeline compile) slim copy of that device. We can't use pipe() here because the fork will not be a direct fork of the user facing process. Instead we use a previously forked copy of the process that was forked at device creation in order to reduce the resources required for the fork and avoid performance issues. Fixes: `cff53da374` ("radv: enable secure compile support")	2019-11-25 10:10:14 +11:00
Timothy Arceri	ef54f15da9	radv: add some infrastructure for fresh forks for each secure compile In the following commits we want to be able to fork an existing lightweight fork created at device creation time. In order for the user facing process to communicate with this new fresh fork we create some members here to hold FIFO file descriptors and a unique id. Here we also add a new fork enum that we use to tell the lightweight process to create a fresh fork. For more information on why we create a fresh fork see the following commits.	2019-11-25 10:10:14 +11:00
Daniel Schürmann	8d7621a53f	radv: Enable Subgroup Arithmetic and Clustered for SI This patch also allows to enable VK_AMD_shader_ballot on SI. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-20 20:31:45 +00:00
Samuel Pitoiset	7ecd8a3471	radv: enable VK_KHR_shader_subgroup_extended_types on GFX6-GFX7 Most of DEQP-VK.subgroups are skipped because 16-bit float aren't supported but others pass. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-20 11:09:58 +00:00
Bas Nieuwenhuizen	4eb2a1dc6f	radv: Do not change scratch settings while shaders are active. When the scratch ringbuffer settings are changed, the shader unit has to be idle or we will have shaders using old and new settings. That combination is not supported on the HW (likely the offset is ringbuffer idx * WAVESIZE * 1024). CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-20 01:18:36 +00:00
Marek Olšák	e9cc4f670f	ac: add radeon_info::num_rings and move ring_type to amd_family.h Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-11-19 18:31:53 -05:00
Marek Olšák	ebe7579655	nir: move data.image.access to data.access The size of the data structure doesn't change. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-11-19 18:20:05 -05:00
Rhys Perry	7eb7969213	radv/aco: enable VK_KHR_shader_subgroup_extended_types We could enable it on GFX10 if LLVM wasn't used as a fallback for unsupported stages. Note that the CTS only tests it if VK_KHR_shader_float16_int8 is enabled, even though it's not a requirement. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-11-19 18:58:04 +00:00
Samuel Pitoiset	c93f2cefd5	radv: advertise VK_KHR_shader_subgroup_extended_types on GFX8-GFX9 This extension allows to use subgroup operations with 8 and 16-bits Untested on GFX6-GFX7, and most of subgroup operations are broken on GFX10, so don't enable it for now. Not enabled on ACO because it's still doesn't support 8-bits/16-bits. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-19 18:01:13 +00:00
Samuel Pitoiset	1ebd9459e7	radv: implement VK_AMD_device_coherent_memory This extension adds the device coherent and device uncached memory types. It's known to be slower than non-device coherent memory but it might be useful for debugging. This is only exposed for chips that support L2 uncached. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-18 08:20:19 +00:00
Rhys Perry	df645fa369	aco: implement VK_KHR_shader_float_controls This actually supports more of the extension than the LLVM backend but we can't enable it because ACO doesn't work with all stages yet. With more of it enabled, some CTS tests fail because our 64-bit sqrt is very imprecise. I can't find any precision requirements for it anywhere, so I'm thinking it might be a CTS issue. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev>	2019-11-15 17:36:21 +00:00
Rhys Perry	d7b0d9a8d8	radv: enable FP16/FP64 denormals earlier and only for LLVM ACO sets this itself and will have to set it differently in the future to support shaderDenormFlushToZeroFloat64. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-15 17:36:21 +00:00
Samuel Pitoiset	41a1152cdc	radv: make sure to not clear the ds attachment after resolves To not overwrite the resolve if there is pending clear aspects, same as color resolves. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-15 09:36:43 +01:00
Samuel Pitoiset	519d9b30de	radv: remove useless RADV_DEBUG=unsafemath debug option This option is useless and shouldn't be used at all. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-15 09:07:34 +01:00
Luis Mendes	0cb5c96a83	radv: fix radv secure compile feature breaks compilation on armhf EABI and aarch64 __NR_select is not defined the same way across architectures, sometimes is not even defined, like in armhf EABI and aarch64. Signed-off-by: Luis Mendes <luis.p.mendes@gmail.com> Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/issues/2042	2019-11-12 11:47:20 +11:00
Rhys Perry	de998d3eb5	radv: fix radv_nir_get_max_workgroup_size when nir=NULL Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Fixes: `84a1a2578` ('compiler: pack shader_info from 160 bytes to 96 bytes') Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-11-11 20:44:12 +00:00
Rhys Perry	76544f632d	radv: adjust loop unrolling heuristics for int64 In particular, increase the cost of 64-bit integer division. Fixes huge shaders with dEQP-VK.spirv_assembly.type.scalar.i64.mod_geom , with ACO used for GS this creates shaders requiring a branch with >32767 dword offset. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-07 23:29:12 +00:00
Samuel Pitoiset	deafe4cc58	radv/gfx10: fix primitive indices orientation for NGG GS The primitive indices have to be swapped to follow the drawing order. This fixes corruption with Overwatch when NGG GS is force enabled. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-07 19:21:15 +00:00
Samuel Pitoiset	fb07fd4e6c	radv: implement VK_EXT_subgroup_size_control This extension allows to control the subgroup size by allowing a varying subgroup size and also specifying a required subgroup size. This implementation only allows to specify a required subgroup size for compute shaders because there is some caveats with other shader stages (eg. NGG with geometry shader). This basically allows apps to use Wave32 for compute shaders. This extension is enabled for all chips but only GFX10 supports Wave32. ACO doesn't support it. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-06 09:20:39 +01:00
Samuel Pitoiset	da6c30f9f6	radv: rely on shader's wavesize when computing NGG info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-06 09:20:36 +01:00
Samuel Pitoiset	d3f9957de4	radv: determine shaders wavesize at pipeline level Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-06 09:20:34 +01:00
Samuel Pitoiset	d1e1f7c4d5	radv: hardcode the number of waves for the GFX6 LS-HS bug It's always 64. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-06 09:20:32 +01:00


2834 commits