fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 11:38:05 +02:00

Author	SHA1	Message	Date
Jason Ekstrand	a8ac61b0ee	intel/fs: NoMask initialize the address register for shuffles Cc: mesa-stable@lists.freedesktop.org Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2979 Tested-by: Iván Briano <ivan.briano@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6825>	2020-10-02 00:42:56 +00:00
Gurchetan Singh	5c2129d434	virgl: fix stride + layer_stride inconsistency With blob resources, stride doesn't necesarily have to equal width * bpp. The use case for this a minigbm blob resource with blob mem BLOB_MEM_HOST3D_GUEST imported into guest Mesa. In addition, for BLOB_MEM_HOST we can repurpose the transfer ioctls to also flush caches if need be, so this seems a good time to fix this issue. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4821>	2020-10-01 16:56:37 -07:00
Gurchetan Singh	87383e3163	virgl: query blob mem Resource blob also modifies resource info. Let's use this functionality. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4821>	2020-10-01 16:56:37 -07:00
Gurchetan Singh	3b54e5837a	virgl: support PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT We should have GL4.5 with this. Piglit tests should now pass. In terms of performance, we're between 70% to 80% of host performance on Iris, based on a apitrace of a 2013 GL4.5 game: 11.204 FPS (guest) 15.947 FPS (host) This is still better than the status quo, when said game was unplayable with Virgl due to an inefficient GL4.3 fallback. TEST=piglit -t arb_buffer_storage all results/ passes Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4821>	2020-10-01 16:56:37 -07:00
Gurchetan Singh	cd31f46f08	virgl/drm: add resource create blob function A blob resource is a container for: - VIRTGPU_BLOB_MEM_GUEST: a guest memory allocation (referred to as a "guest-only blob resource") - VIRTGPU_BLOB_MEM_HOST3D: a host3d memory allocation (referred to as a "host-only blob resource") - VIRTGPU_BLOB_MEM_HOST3D_GUEST: a guest + host3d memory allocation (referred to as a "default blob resource"). Blob resources can be used to implement new features and fix shortcomings with the current resource create path. The subsequent patches how blob resources may be leveraged to implement GL_ARB_buffer_storage and get GL4.5. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4821>	2020-10-01 16:56:31 -07:00
Gurchetan Singh	e01ec6ed2d	virgl/drm: query for resource blob and host visible memory region Check for these features. v2: refactor querying params in general (@shadeslayer) Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4821>	2020-10-01 16:16:07 -07:00
Gurchetan Singh	7b7f210825	drm-uapi: virtgpu_drm.h: resource create blob + host visible memory region Matches current API at virgl/resource_blob. Of course, don't submit until this lands in drm. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4821>	2020-10-01 16:16:07 -07:00
Gurchetan Singh	c73c0cc317	virgl: add flags to (*resource_create) callback We never seemed to use these. But for ARB_buffer_storage we'll need it. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4821>	2020-10-01 16:15:57 -07:00
Matt Turner	1aac47db69	Revert F16C series (MR 6774) This reverts commit `4fb2eddfdf`. This reverts commit `7a1deb16f8`. This reverts commit `2b6a172343`. This reverts commit `5af81393e4`. This reverts commit `87900afe5b`. A couple of problems were discovered after this series was merged that cause breakage in different configurations: (1) It seems that using -mf16c also enables AVX, leading to SIGILL on platforms that do not support AVX. (2) Since clang only warns about unknown flags, and as I understand it Meson's handling in cc.has_argument() is broken, the F16C code is wrongly enabled when clang is used, even for example on ARM, leading to a compilation error. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3583 Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6969>	2020-10-01 21:08:12 +00:00
Mauro Rossi	4a0164ed85	android: gallium/virgl: cleanup virgl_driinfo.h gen rules Android.mk and Makefile.sources are still defining virgl_driinfo.h target This patch removes the remaining gen rules Fixes the following building error: FAILED: out/target/product/x86_64/obj/STATIC_LIBRARIES/libmesa_pipe_virgl_intermediates/virgl/virgl_driinfo.h ... cp: bad 'out/target/product/x86_64/gen/STATIC_LIBRARIES/libmesa_pipe_virgl_intermediates/virgl/virgl_driinfo.h': No such file or directory Fixes: `974981c4e6` ("gallium/drm: Make the pipe loader handle the driconf merging.") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6880>	2020-10-01 22:37:26 +02:00
Mauro Rossi	d7fbf94ae8	android: gallium/radeonsi: cleanup si_driinfo.h gen rules Android.mk and Makefile.sources are still defining si_driinfo.h target This patch removes the remaining gen rules Fixes the following building error: FAILED: out/target/product/x86_64/obj/STATIC_LIBRARIES/libmesa_pipe_radeonsi_intermediates/radeonsi/si_driinfo.h ... cp: bad 'out/target/product/x86_64/gen/STATIC_LIBRARIES/libmesa_pipe_radeonsi_intermediates/radeonsi/si_driinfo.h': No such file or directory Fixes: `974981c4e6` ("gallium/drm: Make the pipe loader handle the driconf merging.") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6880>	2020-10-01 22:37:20 +02:00
Mauro Rossi	a648aea3fd	android: gallium/iris: cleanup iris_driinfo.h gen rules Android.mk and Makefile.sources are still defining iris_driinfo.h target This patch removes the remaining gen rules Fixes the following building error: FAILED: out/target/product/x86_64/obj/STATIC_LIBRARIES/libmesa_pipe_iris_intermediates/iris/iris_driinfo.h ... cp: bad 'out/target/product/x86_64/gen/STATIC_LIBRARIES/libmesa_pipe_iris_intermediates/iris/iris_driinfo.h': No such file or directory Fixes: `974981c4e6` ("gallium/drm: Make the pipe loader handle the driconf merging.") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6880>	2020-10-01 22:37:15 +02:00
Jason Ekstrand	cb95065dd1	nir: Add lowering from regular ALU conversions to the intrinsic Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	bc7ed03ef8	clover/nir: Call nir_lower_convert_alu_types Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jesse Natalie	7d97f3dfdc	spirv: Implement vload[a]_half[n] and vstore[a]_half[n][_r] Note, the aligned versions aren't handled specially yet. The float16buffer capability is now at least partially supported after this patch, so move it to be supported when kernels are supported. v2 (Jason Ekstrand): - A few cosmetic cleanups around type/base_type - Rebased on top of the big SPIR-V SSA value rework - Use the new version of the conversion helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	a85afb797e	spirv/opencl: Drop dest_type from handle_v_load_store At that point in the function, we don't know if it's a load or a store so calling it dest_type isn't really helpful. Also, we don't really want the glsl_type; we want the base_type. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	8610af12b6	spirv: Handle all OpenCL conversion ops with full rounding This is done for kernels via the new convert_alu_types intrinsic. For Vulkan and OpenGL, we maintain the old path so that drivers don't have to add that lowering pass. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	8e8458218c	spirv: Add some conversion handling helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	383ecfbc70	nir: Add a passes for nir_intrinsic_convert_alu_types This adds primarily two passes: One is a lowering pass which turns these conversion intrinsics into a series of ALU ops. The other is an optimization pass which attempt to simplify the conversion whenever possible in the hopes that we can turn it into a "normal" conversion op which doesn't need special treatment. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	d5cb51e2b9	nir: Add builder helpers for OpenCL type conversions Most of these were originally written by Daniel Stone in the Microsoft ClOn12 branch, reworked by Jesse Natalie, fixed by Boris Brezillon, and possibly touched by others along the way. Unfortunately, none of that is in the commit history thanks to living in the CLOn12 branch. I ported them to mesa master and further reworked things for better cosmetics. In particular, 1. They now live in a builder helper rather than in vtn_alu.c. 2. Instead of looping inside each builder helper, we just trust NIR vector instructions to handle vectors. 3. Lots of re-arranging of the helpers for clarity, better asserting, and better re-use with the upcoming lowering pass. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	588bb6686b	nir: Add a conversion and rounding intrinsic This new intrinsic is capable of handling the full range of conversions from OpenCL including rounding modes and possible saturation. The intention is that we'll emit this intrinsic directly from spirv_to_nir and then lower it to ALU ops later. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	0aa08ae2f6	nir: Split NIR_INTRINSIC_TYPE into separate src/dest indices We're about to introduce conversion ops which are going to want two different types. We may as well just split the one we have rather than end up with three. There are a couple places where this is mildly inconvenient but most of the time I find it to actually be nicer. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Dave Airlie	4c70f1ba2f	gallivm/nir: fix non-32 bit find lsb/msb fixes piglit cl get-global-id Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6954>	2020-10-02 04:17:49 +10:00
Dave Airlie	e8f1cc41db	llvmpipe/cs: add in shader shared size. (can remove lavapipe setting this later). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6954>	2020-10-02 04:17:46 +10:00
Dave Airlie	35b162eb2c	gallivm/nir: make sure to mask global reads. Make the driver only read values for the active lanes, otherwise it can cause unwanted oob accesses that aren't the apps fault. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6954>	2020-10-02 04:17:41 +10:00
Anuj Phogat	545d852a7a	intel/gen9: Enable MSC RAW Hazard Avoidance Workaround # 22011374674 Applied to i965, iris and anv drivers No performance impact is observed with WA. Cc: mesa-stable Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2020-10-01 16:57:50 +00:00
Marek Olšák	237f4d9d18	radeonsi: restructure si_pipe_set_constant_buffer Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Marek Olšák	d5cb7bd527	radeonsi: call nir_lower_bool_to_int32 last because it breaks nir_opt_if The new place is where shader variants are generated. This is a prerequisite for inlinable uniforms. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Marek Olšák	fd6bbdcf59	radeonsi: use staging buffer uploads for most VRAM buffers Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Marek Olšák	701f7ae9d2	radeonsi: move si_set_active_descriptors_for_shader into si_update_common_shader_state Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Marek Olšák	f5912c6d32	radeonsi: kill disabled clip distances and planes at per-channel granularity Apps often enable only 1 plane for gl_ClipVertex, which means 1 scalar clip distance. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Marek Olšák	30c3b2c0b6	radeonsi: simplify NGG culling enablement and add radeonsi_shader_culling option Add a vertex count threshold into si_shader_selector to simplify the draw_vbo code. The new option is supposed to be used in 00-mesa-defaults.conf and should be tweaked for best performance unlike the AMD_DEBUG experimental options. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6948>	2020-10-01 16:29:46 +00:00
Sagar Ghuge	b02bef01c8	intel/blorp: Conditionally clear full surface depth and stencil We should set "Full Surface Depth and Stencil Clear" field of WM_HZ_OP 3DSTATE packet, only when application requires the entire depth surface to be cleared. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6549>	2020-10-01 16:23:10 +00:00
Jason Ekstrand	d5849bc840	anv: Skip HiZ and CCS ambiguates which preceed fast-clears This gets rid of multiple HiZ ambiguate operations per frame in Witcher 3. v2: - Fix typo (Tapani) Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6549>	2020-10-01 16:23:10 +00:00
Jason Ekstrand	e9d5ec342d	anv: Use more temp vars in cmd_buffer_begin_subpass This is a mostly cosmetic change but there is one subtle functional issue: If we ever render to a 3D depth image, we are now handling the base layer and number of layers correctly. I'm not sure rendering to 3D depth is even allowed but we can theoretically handle it now. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6549>	2020-10-01 16:23:10 +00:00
Jason Ekstrand	7c92e413af	anv: Allow HiZ clears for multi-view Now that we're enabling HiZ on multi-layer images, there's no reason why we can't enable HiZ clears for multi-view. The only reason I can think of why we didn't before was because no one thought to and the old code didn't. Enabling this means that an attachment will get HiZ cleared if and only if att_state->fast_clear. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6549>	2020-10-01 16:23:10 +00:00
Eleni Maria Stea	03af98abe2	radeonsi: support for external buffers (ext_external_objects) So far, the callback to create a resource from a memory object had code for importing textures only. Modified it to allow importing buffers too. Fixes the following piglit tests: - ext_external_objects/vk-buf-exchange - ext_external_objects/vk-pix-buf-update-errors - ext_external_objects/vk-vert-buf-update-errors - ext_external_objects/vk-vert-buf-reuse v2: Used si_alloc_buffer_struct instead of CALLOC v3: Fixed indentation issue, removed free in case of unsuccessful allocation, joined two if conditions together Signed-off-by: Eleni Maria Stea <estea@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6364>	2020-10-01 15:35:07 +00:00
Samuel Pitoiset	df63491594	radv/aco: lower IO for all stages outside of ACO Lowering IO for VS, TCS, TES and GS still have to be done for LLVM. No fossils-db change on NAVI10. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6897>	2020-10-01 14:58:25 +00:00
Samuel Pitoiset	2c322514f3	radv: gather output usage mask from store_output for VS, TES and GS IO are now lowered before the shader info pass is called and the output usage masks have to be gathered from store_output instead. This is currently only used by ACO. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6897>	2020-10-01 14:58:25 +00:00
Connor Abbott	79368ab302	ttn: Fix number of components for IF/UIF NIR if statements only take one component, but TGSI registers are vec4. We're supposed to compare the x component, per https://docs.mesa3d.org/gallium/tgsi.html#opcode-IF. Fixes: `f103bded` ("ttn: Use nir control flow insertion helpers") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Leo Liu <leo.liu@amd.com> Closes: #3585 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6956>	2020-10-01 15:47:07 +02:00
Samuel Pitoiset	b00a023f1e	ac/nir: fix nir_intrinsic_shared_atomic_fadd This was completely broken. Fixes dEQP-VK.glsl.atomic_operations.add_float32_compute_shared. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6936>	2020-10-01 06:38:42 +00:00
Samuel Pitoiset	8227b08c08	ac/llvm: fix invalid use of unreachable in ac_build_atomic_rmw() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6936>	2020-10-01 06:38:42 +00:00
Samuel Pitoiset	892e74d2f7	radv: fix gathering writes_memory for global store/atomic operations Because global operations are lowered before the shader info pass now we have to adjust the gathering code. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3578 Fixes: `1588644543` ("radv: lower deref operations for global memory for both backends") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6934>	2020-10-01 08:14:18 +02:00
Dave Airlie	e94fd4cc65	lavapipe: rename vallium to lavapipe Just a cooler name, and a lot easier to search for. thanks Marek Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6921>	2020-10-01 00:23:40 +00:00
Olsak, Marek	5e8791a0bf	radeonsi: Fix dead lock with aux_context_lock in si_screen_clear_buffer. After disable SDMA on Arcturus(gfx9), dead lock with aux_context_lock is detected since si_screen_clear_buffer is called recursively before release lock. The call trace is: si_clear_render_target->si_compute_clear_render_target-> si_launch_grid_internal->si_launch_grid->si_emit_cache_flush-> si_prim_discard_signal_next_compute_ib_start->u_suballocator_alloc-> si_resource_create->si_buffer_create->si_alloc_resource-> si_screen_clear_buffer->simple_mtx_lock-> si_sdma_clear_buffer->si_pipe_clear_buffer-> si_clear_buffer->si_compute_do_clear_or_copy-> si_launch_grid_internal->si_launch_grid->si_emit_cache_flush-> si_prim_discard_signal_next_compute_ib_start->u_suballocator_alloc-> si_resource_create->si_buffer_create->si_alloc_resource-> si_screen_clear_buffer->simple_mtx_lock Fixes: `07a49bf597` "radeonsi: disable SDMA on gfx9" Signed-off-by: James Zhu <James.Zhu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6941>	2020-10-01 00:06:29 +00:00
Eric Engestrom	90e42f87ac	add one last 20.1 release to coincide with expected 20.2.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6943>	2020-09-30 20:20:40 +00:00
Eric Engestrom	fe16e40974	docs: update calendar and link releases notes for 20.1.9 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6943>	2020-09-30 20:20:40 +00:00
Eric Engestrom	00d87db89b	docs: add release notes for 20.1.9 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6943>	2020-09-30 20:20:40 +00:00
Eric Anholt	49ec863e83	freedreno/ir3: Enable the i/o vectorizer on UBOs. This will merge loads of UBO components together into vec4 loads. At the same time, it improves the alignment information on our loads, fixing the regression from the vec3 loads fix. shader-db results: total instructions in shared programs: 12829370 -> 8755851 (-31.75%) total cat6 in shared programs: 145840 -> 97027 (-33.47%) Overall results from before the vec3 fix: total instructions in shared programs: 8019997 -> 8755851 (9.18%) total cat6 in shared programs: 87683 -> 97027 (10.66%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00
Eric Anholt	e3f4655805	nir: Make nir_lower_ubo_vec4() handle non-vec4-aligned loads. It turns out I had missed a case in my enumeration of why everything currently was vec4-aligned. Fixes a simple testcase of loading from a vec3[2] array in freedreno with IR3_SHADER_DEBUG=nouboopt. Initial shader-db results look devastating: total instructions in shared programs: 8019997 -> 12829370 (59.97%) total cat6 in shared programs: 87683 -> 145840 (66.33%) Hopefully this will recover once we introduce the i/o vectorizer, but that was blocked on getting the vec3 case fixed. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00

1 2 3 4 5 ...

128951 commits