fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 00:28:08 +02:00

Author	SHA1	Message	Date
Bas Nieuwenhuizen	8ea34a98c0	radv: Only use PKT3_OCCLUSION_QUERY when it doesn't hang. PKT3_OCCLUSION_QUERY hangs when used in a nested IB. This only calls it when in a primary command buffer and we change GetQueryPoolResults to not need it. CmdCopyQueryPoolResults still needs it so we break that behavior for secondary command buffers. However, that would hang already and using an unitialized value is better than a hang. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Cc: 13.0 17.0 <mesa-stable@lists.freedesktop.org>	2017-02-27 01:33:10 +01:00
Bas Nieuwenhuizen	bb878db7eb	radv: Reset emitted compute pipeline when calling secondary cmd buffer. Otherwise if the new compute pipeline is the same as the last used pipeline before the call, we don't emit it again. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Cc: 13.0 17.0 <mesa-stable@lists.freedesktop.org>	2017-02-27 01:33:10 +01:00
Dave Airlie	15f47027ad	radv: add support for NV_dedicated_allocation This adds initial support for NV_dedicated_allocation, then uses it for the wsi image/memory allocation paths internally in the driver. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-27 00:22:51 +00:00
Andres Rodriguez	35189d3279	radv/winsys: fix freeing imported memory. This bo->fd wasn't setting some stuff correctly that could lead to crashes for anything using this path later. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-27 00:22:39 +00:00
Dave Airlie	f695735ed6	vulkan/wsi/radv: add initial prime support (v1.1) This is a complete rewrite of my previous rfc patches. This adds the ability to present to a different GPU that rendering using a driver side operation that can copy from the tiled to linear shared image. This does prime support completely in the swapchain present code, and each queue has a precreated command buffer for each image and for the each queue family. This means presenting should work on graphics and compute queues and transfer in the future. v1.1: initialise needs_linear_copy in swapchain. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Mike Lothian <mike@fireburn.co.uk> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-27 05:42:16 +10:00
Bas Nieuwenhuizen	336b05c49a	radv/ac: Add integer->integer casts. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2017-02-26 19:59:27 +01:00
Marek Olšák	c7878b0167	ac: silence a warning trivial	2017-02-25 00:16:38 +01:00
Emil Velikov	e3ad2d40db	radv/entrypoints: Only generate entrypoints for supported features This changes the way radv_entrypoints_gen.py works from generating a table containing every single entrypoint in the XML to just the ones that we actually need. There's no reason for us to burn entrypoint table space on a bunch of NV extensions we never plan to implement. RADV implements VK_AMD_draw_indirect_count, so add that to the list. Port of `114c281e70` "and/entrypoints: Only generate entrypoints for supported features" Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Acked-by: Dave Airlie <airlied@redhat.com>	2017-02-24 17:36:25 +00:00
Dave Airlie	ccb70d6f53	radv: add sample mask output support This adds support to write to sample mask from the fragment shader. We can optimise this later like radeonsi. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-24 10:31:53 +10:00
Dave Airlie	8282c5c771	radv/ac: refactor our fmask sample index fixup. This refactors out the sample index fixup between txf and image load. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-24 10:31:49 +10:00
Dave Airlie	5e9ead0fa2	radv: fetch sample index via fmask for image coord as well. This follows the txf_ms code, I can't figure out why amdgpu-pro doesn't do this in their shaders, they must know someone we don't. This fixes: dEQP-VK.pipeline.multisample_shader_builtin.sample_id.* Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-24 10:31:44 +10:00
Dave Airlie	bdcbe7c76b	radv: add sample mask input support Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-24 10:31:35 +10:00
Dave Airlie	58c97a0791	radv: enable location at sample when persample is forced. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-24 10:31:30 +10:00
Dave Airlie	fc430c391b	radv: fix interpolation at wrong place for offset interp The code was interpolating at the offset from the sample, not the offset from the center. Also fix for persample interpolation modes we should force the pixel center to be at the sample. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-24 10:31:19 +10:00
Dave Airlie	b71e6538a8	radv/ac: handle gs->copy shader clip distances. This fixes up the clip distance passing between the geometry shader and the copy shader. It packs the clip and cull distances into one or two consecutive slots, and avoids wasting space and make sure the gs output and copy shader input agree on where things are stored. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-23 15:31:41 +10:00
Dave Airlie	bec584ec0e	radv/ac: pass clips properly from vertex->geometry shader stages. This works out the geometry shader clip/cull inputs separately to the outputs, and uses that information to read from the ES->GS ring buffer. It stores the clip/cull distances packed into one or two slots. It fixes the es output emission and gs input reading to match. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-23 15:31:37 +10:00
Dave Airlie	c2cfb54f13	radv/ac: rename num clips/cull to output clips/culls As geom shaders can have different ones on entry and exit. also move to uint8_t as these are never that big. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-23 15:31:10 +10:00
Dylan Baker	8e03250fcf	vulkan: Combine wsi and util makefiles Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-02-22 13:12:02 -08:00
Dave Airlie	40e0dbf96c	radv: fix typo in the subpass barrier patch. Fixes: dbb0eaccc radv: handle subpass cache flushes Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-22 02:22:30 +00:00
Timothy Arceri	207e3a6e4b	util/radv: move *_get_function_timestamp() to utils Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-22 08:40:00 +11:00
Emil Velikov	8b79f0ed08	radv: make radv_resolve_entrypoint static Used only within the generated source file. Fixes: `12301c5418` ("radv: drop the RADV_CALL macro.") Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2017-02-21 18:31:16 +00:00
Emil Velikov	320561bd83	radv: remove unused radv_dispatch_table dtable Fixes: `12301c5418` ("radv: drop the RADV_CALL macro.") Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2017-02-21 18:31:14 +00:00
Emil Velikov	944620bc0e	radv: remove unneeded extern C notation Header is never #include(d) by a C++ source. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-02-21 18:28:15 +00:00
Bas Nieuwenhuizen	8cff852ae2	radv: Don't flush at the start of a command buffer. The preamble flushes now and the rest is the responsibility of the app. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-21 09:20:03 +01:00
Bas Nieuwenhuizen	5241fb0ffb	radv: Flush in the initial preamble CS. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-21 09:19:58 +01:00
Bas Nieuwenhuizen	c121739c47	radv: Special case the initial preamble. For flushing we don't want to flush every third IB. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-21 09:19:53 +01:00
Bas Nieuwenhuizen	eac790811b	radv: Split emitting the cache flush out. So that we can use it without a cmd_buffer. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-21 09:19:45 +01:00
Bas Nieuwenhuizen	b6e0df2edd	radv: Free empty_cs on device destruction. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-21 09:18:50 +01:00
Dave Airlie	6dbb0eaccc	radv: handle subpass cache flushes This splits out the cache flush bit setting code dependent on the src/dest access flags. It then calls it from the subpass barrier code. It also marks a TODO to remove the aggressive CS/PS flushes at some point. This fixes a bunch of the dEQP-VK.renderpass.attachment_allocation.input_output.* tests. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-21 09:48:37 +10:00
Dave Airlie	0a44a680ff	vulkan/wsi/x11: add support to detect if we can support rendering (v3) This adds support to radv_GetPhysicalDeviceXlibPresentationSupportKHR and radv_GetPhysicalDeviceXcbPresentationSupportKHR to check if the local device file descriptor is compatible with the descriptor retrieved from the X server via DRI3. This will stop radv binding to an X server until we have prime support in place. Hopefully apps use this API before trying to render things. v2: drop unneeded function, don't leak memory. (jekstrand) v3: also check in surface_get_support callback. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-20 12:53:52 +10:00
Dave Airlie	1f6376935b	Revert "radv: detect command buffers that do no work and drop them (v2)" This just keeps popping up minor problems and regressions we should revisit in a more sustainable manner later. This also reverts: Revert "radv: query cmds should mark a cmd buffer as having draws." Revert "radv: also fixup event emission to not get culled." This reverts commit `d1640e7932`. This reverts commit `8b47b97215`. This reverts commit `b4b19afebe`. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-20 09:00:40 +10:00
Bas Nieuwenhuizen	81b2379664	radv: Handle VK_REMAINING_ARRAY_LAYERS in fast clear eliminate. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-19 20:58:06 +01:00
Dave Airlie	9aec76aca3	radv: handle layered fast clears. This iterates the fast clear flush across the layers in the specified range. It also moves the compute resolve flush into the function and builds the range in there. This fixes: dEQP-VK.geometry.layered.* regressions since fast clears. Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-02-19 20:30:01 +10:00
Dave Airlie	efc89edf5a	radv: pass subresourceRange by pointer. This struct is 5 dwords, we should really just pass a pointer to it. Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-19 20:28:22 +10:00
Dave Airlie	2b3c490e23	radv: fix typo in a2b10g10r10 fast clear calculation. This fixes: dEQP-VK.renderpass.formats.a2b10g10r10_unorm_pack32* regressions. Fixes: `f22836dbdd` radv: Add CPU color packing for VK_FORMAT_A2B10G10R10_UNORM_PACK32. Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-02-19 20:27:28 +10:00
Bas Nieuwenhuizen	c7fcaf2314	radv: Invert ring SGPR check. I assume this wants to check if all pipelines use the same SGPR for the rings. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Acked-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2017-02-19 10:13:11 +01:00
Bas Nieuwenhuizen	e12cf3f9bf	radv: Clamp framebuffer dimensions to min. attachment dimensions. Even though the preferred stance is not to fix incorrect applications via the driver, this prevents some nasty GPU hangs. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-19 10:13:01 +01:00
Marek Olšák	675ef9c0c7	ac/llvm: use min+max instead of AMDGPU.clamp on LLVM 5.0 It selects v_med3_f32, which has the same rate & size. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-18 02:58:43 +01:00
Marek Olšák	660b55e6d9	radeonsi: stop using TGSI_OPCODE_CLAMP by moving it amd/common Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-18 02:58:43 +01:00
Marek Olšák	edd23e0606	ac/llvm: fix various findMSB bugs sffbh needs to be suffixed with ".i32" Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-18 06:24:32 +10:00
Bas Nieuwenhuizen	d5bf4c7394	radv: Use different allocator for descriptor set vram. This one only keeps allocated memory in the list, and list nodes in the descriptor sets. Thsi doesn't need messing around with max_sets, and we get automatic merging of free regions. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-17 09:28:23 +01:00
Bas Nieuwenhuizen	f448701622	radv: Never try to create more than max_sets descriptor sets. We only use the freed ones after all free space has been used. If the app only allocates small descriptor sets, we might go over max_sets before the memory is full. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com> CC: <mesa-stable@lists.freedesktop.org> Fixes: `f4e499ec79`	2017-02-17 09:28:14 +01:00
Dave Airlie	ebed22ec67	radv/ac: use shared umsb helper. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-16 22:57:16 +00:00
Dave Airlie	0ec66b9969	radeon/ac: add emit umsb shared code. Since we shared imsb, makes sense to share umsb. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-16 22:57:16 +00:00
Dave Airlie	4617ad07e0	radeon/ac: use llvm.amdgcn.sffbh intrinsic instead of AMDGPU.flbit.i32 Use the newer intrinsic. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-16 22:57:16 +00:00
Dave Airlie	fb15a1e9dd	radv/ac: use shader imsb emission code. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-16 22:57:15 +00:00
Dave Airlie	cae1ff1a4b	radeon/ac: add ac_emit_imsb helper. We want to use a different intrinsic on newer llvm, so move this code to a shared area. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-16 22:57:15 +00:00
Dave Airlie	b292e662fc	radv: add fast color clear for b10g11r11 This is used in DOOM, so provide the fast clear path for it. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-16 14:09:15 +10:00
Bas Nieuwenhuizen	4e6095ff61	radv: Add support for shaderStorageImageReadWithoutFormat. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-15 21:18:21 +01:00
Bas Nieuwenhuizen	53873697e4	radv: Add support for shaderStorageImageWriteWithoutFormat. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-02-15 21:18:13 +01:00

1 2 3 4 5 ...

402 commits