fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-15 12:08:14 +02:00

Author	SHA1	Message	Date
Emil Velikov	4168c162c5	radv: advertise v6 of the wayland surface extension Jason updated the Khronos spec to explicitly state that Wayland surfaces must support VK_PRESENT_MODE_MAILBOX_KHR. ANV did so since day one (back in 2015) Cc: mesa-stable@lists.freedesktop.org Cc: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: Dave Airlie <airlied@redhat.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-07-17 15:24:48 +01:00
Dave Airlie	9ee67467c9	radv: predicate cmask eliminate when using DCC. When using DCC some clear values don't require a cmask eliminate step. This patch adds support for black and black with alpha 1, there are other values, but I don't have access to a comprehensive list. This works by setting the cmask eliminate predicate when doing the fast clear, and later when doing the cmask elimination making sure the draws are predicated. This increases the fps on Sascha Willems deferred. Tonga: 580fps->670fps on a Tonga PRO card. Polaris 730->850fps Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-17 01:44:43 +01:00
Dave Airlie	8eed291c2c	radv/clear: add r32g32b32a32 fast clear support (v2) We can only fast clear 128-bit images if the r/g/b channels are the same, and we are using DCC. For DCC we'll bail out on translate if this isn't true, and we catch cmask clears explicitly. v2: remove 64-bit block (Bas), add uint32 as well. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-17 01:44:25 +01:00
Dave Airlie	acf1e132af	amd/addrlib: fix typo in api name. This fixes the misspelling of ALIGNMENTS in addrlib. Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-17 01:44:14 +01:00
Dave Airlie	f8d5b377c8	radv: set cb base tile swizzles for MRT speedups (v4) This patch uses addrlib to work out the tile swizzles according to the surface index. It seems to produce the same values as amdgpu-pro for the deferred test. v2: don't apply swizzle to CMASK. the eg docs don't mention it, and we clearly don't align cmask for that. v3: disable surf index for dedicated images, as these will most likely be shared, and I don't think the metadata has space for this info in it yet. v4: update for shareable images, rename combined_swizzle to tile_swizzle This gets the deferred demo from 730->950fps on my rx480. (dcc cmask elim predication patches get it further) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-17 01:43:41 +01:00
Dave Airlie	b86f86f55c	radv: allow clear merging for depth/stencil with no care stencil Some of the Sascha Willems demos pick a D32/S8 format for the depth buffer, then do a LOAD_OP_CLEAR/LOAD_OP_DONT_CARE on it, which means we don't get to merge the undefined->depth and clear htile transitions. This adds the stencil aspect to the pending clears if there is a depth clear pending and the stencil aspect is don't care. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-17 01:16:59 +01:00
Bas Nieuwenhuizen	373f707fbb	radv: Remove NV dedicated alloc extension. To not confuse apps in thinking it might be faster. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Andres Rodriguez <andresx7@gmail.com>	2017-07-15 20:10:43 +02:00
Bas Nieuwenhuizen	515da29360	radv: Use the KHR dedicated alloc for the WSI. NV isn't valid for external images anymore. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Fixes: `6ddc64b93e` "radv: Add support for VK_KHR_dedicated_allocation." Reviewed-by: Andres Rodriguez <andresx7@gmail.com>	2017-07-15 20:10:25 +02:00
Jason Ekstrand	b70829708a	radv: Implement VK_KHR_external_memory This effectively reverts commit 43a171878bb4b5aedb36a. Technically, VK_KHR_get_memory_requirements2 and VK_KHR_dedicated_allocation are required for the KHR version but this at least restores the removed functionality. This patch builds but has received zero testing. Acked-by: Dave Airlie <airlied@redhat.com>	2017-07-15 08:59:38 -07:00
Bas Nieuwenhuizen	6ddc64b93e	radv: Add support for VK_KHR_dedicated_allocation. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Dave Airlie <airlied@redhat.com>	2017-07-15 08:59:38 -07:00
Bas Nieuwenhuizen	97931f0297	radv: Add support for VK_KHR_get_memory_requirements2. Fished the SparseImage call out of the headers as the spec missed the definition. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Dave Airlie <airlied@redhat.com>	2017-07-15 08:59:38 -07:00
Jason Ekstrand	3b95e03b2c	radv: Drop support for VK_KHX_external_semaphore_* These have been formally deprecated by Khronos never to be shipped again. The KHR versions should be implemented/used instead. Acked-by: Dave Airlie <airlied@redhat.com>	2017-07-15 08:58:55 -07:00
Alex Smith	0e1886efb9	radv: Fix descriptors for cube images with VK_IMAGE_USAGE_STORAGE_BIT If a cube image has VK_IMAGE_USAGE_STORAGE_BIT set, the type in an image view's descriptor was set to a 2D array (and a few other fields adjusted accordingly). This is correct when the image view is actually bound as a storage image, but not when bound as a sampled image. In that case the type should be set as a cube. Fix by generating 2 sets of descriptors at view creation time for both storage and non-storage usage, and then choose between them based on descriptor type when writing descriptor sets. v2: Generate storage descriptors for images with TRANSFER_DST, since those may be used as storage images internally. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-07-13 00:21:20 +02:00
Alex Smith	4d5c0c189d	radv: Fix possible invalid free of dynamic descriptors This free was left in after dynamic descriptors were changed to not be allocated separately from the descriptor set, and can cause a crash. Fixes: `39644fa40a` ("radv: Don't allocate dynamic descriptors separately") Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-07-13 00:21:20 +02:00
Dave Airlie	7b5f2e0070	radv/ac: drop setting xnack Since radv uses compute rings and we can't know when we are setting up the shaders what ring they are to be used on, we should just use the default xnack setting. This may be suboptimal in some places, but if we hit a problem, we likely should try and address this between llvm and mesa. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-09 22:21:43 +01:00
Dave Airlie	edf2acbeb1	radv: add support for using addrlib max alignment. Rather than using 64k, use what addrlib returns as the base alignment for vulkan allocations. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-09 22:17:59 +01:00
Bas Nieuwenhuizen	1aba0e7f58	radv: Add compute htile clear for combined depth+stencil surfaces. Figured out the clear value when we have a combined depth stencil surface. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-07-08 16:11:29 +02:00
Alex Smith	c2a5cb6427	ac/nir: Fix ordering of parameters for image atomic cmpswap intrinsics The NIR parameters are ordered "compare, data", matching GLSL, but both the image and buffer LLVM intrinsics take them the other way around. This is already handled correctly for SSBO atomics. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Fixes: `f4e499ec79` "radv: add initial non-conformant radv vulkan driver"	2017-07-07 00:57:25 +02:00
Dave Airlie	8950fac6ab	radv: don't overallocate depth/stencil formats For depth/stencil formats the surface layer allocates the stencil separately, so we don't need to include it in the bpe. This reduces the size of d32s8 allocations to something closer to pro. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-06 23:23:22 +01:00
Dave Airlie	09d7c7be4f	radv: enable sisched toggle in perftest flags. RADV_PERFTEST=sisched to enable it. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-06 23:07:49 +01:00
Dave Airlie	d97275e42c	ac/llvm: set xnack like radeonsi does. Use family, but only set xnack+ for gfx9. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-06 23:07:45 +01:00
Dave Airlie	01e958d631	ac/llvm: create features list using snprintf. Just more moving code around before adding things to it. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-06 23:06:04 +01:00
Dave Airlie	9d9f051390	ac/radv: change api to create target machine This just modifies the API to make it easier to add other flags to target machine creation. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-06 23:05:59 +01:00
Dave Airlie	a6c2001ace	radv: add support for cmd predication. This doesn't get used yet, it just adds support to various PKT3 emissions to enable it later. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-07-06 02:06:49 +01:00
Bas Nieuwenhuizen	860a8e6b99	ac/nir: Move VS position exports before param exports. According to Nicolai the SX can already start work when all the position exports are done, so do those first. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-07-05 20:23:00 +02:00
Bas Nieuwenhuizen	3d527ba19b	radv: Always set depthbuffer using image format instead of iview format. We have some cases where changing between depth and stencil only aspect was causing hangs. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Acked-by: Dave Airlie <airlied@redhat.com>	2017-07-05 20:23:00 +02:00
Bas Nieuwenhuizen	7c7196e35c	radv: Disable depth & stencil tests when the depthbuffer doesn't support it. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Acked-by: Dave Airlie <airlied@redhat.com>	2017-07-05 20:23:00 +02:00
Dave Airlie	1bc40ae952	radv: enable Int64 capability (v2) I'm not 100% sure this is all wired up but it looks like it is. v2: actually enable extension. Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-07-03 11:58:59 -07:00
Connor Abbott	2ec77f7a3c	ac/nir: fix 64-bit shifts NIR always makes the shift amount 32 bits, but LLVM asserts if the two sources aren't the same type. Zero-extend the shift amount to make LLVM happy. Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-07-03 11:58:59 -07:00
Connor Abbott	7168425dd7	ac/nir: implement 64-bit packing and unpacking We implement the split opcodes, and tell NIR to lower the original ones. The lowering to LLVM is a little more complicated, but NIR can optimize the split ones a little better, and some NIR lowering passes that we might want to use (particularly for doubles) emit the split ones. This should fix pack/unpackDouble2x32, which seems to have been a bug since we enabled the Float64 capability. It will also fix pack/unpackInt2x32 when we enable the Int64 capability. Fixes: `798ae37c` ("radv: Enable Float64 support.") Signed-off-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-07-03 11:58:58 -07:00
Bas Nieuwenhuizen	87d3349393	radv: Use v4i32 variant of llvm.SI.load.const. We apparently still used v16i8 .... As radeonsi doesn't use it with LLVM version checks I don't think we need them either. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-30 23:30:55 +02:00
Dave Airlie	ff422500cc	ac/nir: remove last remnants of v16i8 llvm doesn't need this workaround anymore. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-28 20:22:30 +01:00
Alex Smith	909184ac9c	ac/nir: Use correct LLVM intrinsics for atomic ops on imageBuffers The buffer intrinsics should be used instead of the image ones. Signed-off-by: Alex Smith <asmith@feralinteractive.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-28 21:05:04 +02:00
James Legg	69a17da037	ac/nir: assert printfs will fit Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-28 21:05:04 +02:00
James Legg	6fc41bb4d5	ac/nir: Make intrinsic_name buffer long enough When using cmpswap on an image, it was being truncated to llvm.amdgcn.image.atomic.cmpswa, with the coords type missing entirely. v2: Add stable CC CC: <mesa-stable@lists.freedesktop.org> Reviewed-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-06-28 21:05:04 +02:00
Nicolai Hähnle	2ce126df3a	ac/nir: convert emit helpers to ac_llvm_context Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	58d496c8e2	ac/nir: remove unused nir_to_llvm_context::has_ddxy Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	6ecef25545	ac/nir: implement nir_op_f2b Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	dacf73e527	ac/nir: implement nir_op_{b2i,i2b} Booleans in NIR are ~0 for true, b2i returns 0/1. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	77d7764d5e	ac/nir: convert type helpers to ac_llvm_context Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	b7bd49158e	ac/llvm: fix type of second llvm.cttz.* parameter LLVM has required an i1 here for a long time. llvm.ctlz.* was fixed in commit `edd23e0606` ("ac/llvm: fix various findMSB bugs"). Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:30 +10:00
Nicolai Hähnle	e8ba03d32a	ac/shader_info: fix a comment Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	edfd3be77e	ac: add ac_llvm_context::v8i32 Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	331a574732	ac: add ac_llvm_context::{i,f}32_{0,1} Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Nicolai Hähnle	7bf8c944dc	ac: add ac_llvm_context::{i16, i64, f16, f64} Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-27 10:28:29 +10:00
Eric Engestrom	a2ae2d1fb0	radv: use Mesa's u_atomic.h header Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-06-26 18:21:22 +01:00
Dave Airlie	4a34f3244a	radv/meta: don't need vertex info for resolve shader. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 01:24:10 +01:00
Bas Nieuwenhuizen	78bef01da2	radv: Remove unused args of radv_image_view_init. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-06-26 01:24:50 +02:00
Bas Nieuwenhuizen	789f480029	radv: Use correct image layout for blit based copies. v2: Don't pass layout to image view usage mask. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Fixes: `0628580eff` "radv: Specify semantics of HTILE layout helpers."	2017-06-26 01:24:29 +02:00
Dave Airlie	6a68170c83	radv: handle primitive id input into fragment shader with no geom shader Fixes: dEQP-VK.pipeline.framebuffer_attachment.no_attachments dEQP-VK.pipeline.framebuffer_attachment.no_attachments_ms Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-06-26 08:45:30 +10:00


976 commits