fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-26 08:08:09 +02:00

Author	SHA1	Message	Date
Connor Abbott	91dd2ca99f	ac/nir: rewrite shared variable handling (v2) Translate the NIR variables directly to LLVM instead of lowering to a TGSI-style giant array of vec4's and then back to a variable. This should fix indirect dereferences, make shared variables more tightly packed, and make LLVM's alias analysis more precise. This should fix an upcoming Feral title, which has a compute shader that was failing to compile because the extra padding made us run out of LDS space. v2: Combine the previous two patches into one, only use this for shared variables for now until LLVM becomes smarter. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Tested-by: Alex Smith <asmith@feralinteractive.com>	2017-07-17 14:16:03 -07:00
Jason Ekstrand	7947d05f84	i965: Check if the modifier is supported in select_best_modifier Otherwise, if a client gave us a list of modifiers that contained a modifier we understand but which is not supported on the hardware, we might return that one and then fail to create the image. Reviewed-by: Daniel Stone <daniels@collabora.com>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	ec4364d57e	i965: Rework the modifier info map This commit splits the mapping in half. The modifier_infos table now only contains the modifier and the since_gen field. The tiling bits have been moved into a table in tiling_to_modifier as that's the only place it was ever used. The modifier_is_supported function now takes a devinfo and does the since_gen check. Reviewed-by: Daniel Stone <daniels@collabora.com>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	f44171ef62	i965/surface_state: Remove the mcs_buf->offset == 0 restriction This assert was removed in `b0cc55f298` but got added back in `1a43d774b6`, probably by accident. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	828c437078	intel/isl: Add a row_pitch parameter to surf_get_ccs_surf Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	766784ef82	i965/miptree: Use BO_ALLOC_ZEROED for CCS_E buffers Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	cbee2d1102	i965/screen: Allocate ZEROED BOs for images Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	fb0caadc2a	i965/bufmgr: Add a BO_ALLOC_ZEROED flag Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	14570ecf63	i965/miptree: Replace is_lossless_compressed with mt->aux_usage checks Now that we have an actual aux_usage field, we no longer need the complex logic of is_lossless_compressed in order to figure out if a miptree is CCS_E compressed. As a side-effect, there is no longer any need to overload MSAA_LAYOUT_CMS for CCS_E and we can stop doing so. Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	67143a5037	i965/miptree: Allocate HiZ up-front HiZ, like MCS and CCS_E, can compress more than just clear colors so we want it turned on whenever the miptree is being used as a depth attachment. It's theoretically possible for someone to create a depth texture, upload data with glTexSubImage2D, and texture from it without ever binding it as a depth target. If this happens, we would end up wasting a bit of space by allocating a HiZ surface we never use. However, this is rather unlikely outside of test cases, so we're better off just allocating it up-front. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	138316cc99	i965/miptree: Add an intel_tiling_supports_hiz helper We need this split for the same reason that we need the split for CCS: intel_miptree_supports_hiz is called before we choose the actual tiling. Adding a tiling_supports_hiz helper lets choose_aux_usage more accurately decide whether or not to enable hiz. In particular, this prevents us from enabling HiZ on linear depth buffers. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-07-17 13:48:38 -07:00
Jason Ekstrand	e6b8877a54	i965/miptree: Gather initial aux allocation into a single function Reviewed-by: Chad Versace <chadversary@chromium.org> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-07-17 13:48:38 -07:00
Charmaine Lee	d8f51bfcbf	st/mesa: init winsys buffers list only if context creation succeeds Fixes piglit test crash when context creation fails. v2: As suggested by Brian, move the init to st_create_context_priv() Reviewed-by: Brian Paul <brianp@vmware.com>	2017-07-11 22:46:55 -07:00
Sinclair Yeh	ed45e8db3c	winsys/svga/drm: Enable import/export fence FD Enable the capability if the DRM supports it. Hook up mechanism to send and receive fence FD from the DRM. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2017-07-17 10:09:25 -06:00
Sinclair Yeh	d554f72c41	winsys/svga/drm: Connect winsys-side fence_* functions Connect fence_get_fd, fence_create_fd, and fence_server_sync. Implement the required functions in vmw_fence module. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2017-07-17 10:09:25 -06:00
Sinclair Yeh	56a6e890f3	drivers/svga: Connect driver-side fence_* functions Connect fence_get_fd, fence_create_fd, and fence_server_sync. Return PIPE_CAP_NATIVE_FENCE_FD capability based on what the winsys reports Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2017-07-17 10:09:25 -06:00
Sinclair Yeh	4da543e30a	winsys/svga/drm: Create winsys interface for Fence FD The new interfaces will be used to enable EGL_ANDROID_native_fence_sync. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2017-07-17 10:09:25 -06:00
Sinclair Yeh	2431cccad1	winsys/svga/drm: Prepare to support fence fd Make the fields and flags available. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2017-07-17 10:09:25 -06:00
Sinclair Yeh	65175df601	drivers/svga, winsys/svga/drm: Thread through timeout for fence_finish The timeout parameter is required to implement EGL_ANDROID_native_fence_sync. v2 * Replaced default timeout from 0 to PIPE_TIMEOUT_INFINITE * Add more documentation to the new timeout parameter Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2017-07-17 10:09:25 -06:00
Brian Paul	9ee86d6db7	svga: whitespace clean-up in svga_winsys.h Trivial.	2017-07-17 10:09:25 -06:00
Brian Paul	6f4923bd38	svga: add some const qualifiers Trivial.	2017-07-17 10:06:01 -06:00
Brian Paul	589f546256	svga: add comment about 'extra' constant locations Trivial.	2017-07-17 10:06:00 -06:00
Jason Ekstrand	c5700ed72e	anv/image: Add INPUT_ATTACHMENT to the list of required usages From the Vulkan 1.0.53 spec VU for vkCreateImageView: "image must have been created with a usage value containing at least one of VK_IMAGE_USAGE_SAMPLED_BIT, VK_IMAGE_USAGE_STORAGE_BIT, VK_IMAGE_USAGE_COLOR_ATTACHMENT_BIT, VK_IMAGE_USAGE_DEPTH_STENCIL_ATTACHMENT_BIT, or VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT" We were missing VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT from our list. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org	2017-07-17 08:18:46 -07:00
Jason Ekstrand	cbdfd1daa2	anv: Stop leaking the no_aux sampler surface state Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org	2017-07-17 08:18:46 -07:00
Jason Ekstrand	bd41564746	anv/cmd_buffer: Properly handle render passes with 0 attachments We were early returning and never created the NULL surface state. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: James Legg <jlegg@feralinteractive.com> Cc: mesa-stable@lists.freedesktop.org	2017-07-17 08:18:46 -07:00
Marek Olšák	c62809171c	radeonsi/gfx9: add VM fault dmesg parser support Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:57:34 -04:00
Marek Olšák	9f320e0a38	radeonsi: automatically resize shader compiler thread queues when they are full Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:57:29 -04:00
Marek Olšák	4cae274116	radeonsi: prevent a deadlock in util_queue_add_job with too many GL contexts If the queue is full, util_queue_add_job will wait while bo_fence_lock is held. If pb_slab wants to reuse a buffer, it will lock the pb_slab mutex and try to check BO fence busyness, but it has to wait for bo_fence_lock to get released. Both bo_fence_lock and pb_slab mutex are locked now. When the CS thread unreferences and releases a suballocated buffer, it will try to lock the pb_slab mutex and has to wait. The CS thread can't finish its job in order to free a queue slot and unblock util_queue_add_job ==> deadlock. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:57:25 -04:00
Marek Olšák	59ad769770	util/u_queue: add an option to resize the queue when it's full Consider the following situation: mtx_lock(mutex); do_something(); util_queue_add_job(...); mtx_unlock(mutex); If the queue is full, util_queue_add_job will wait for a free slot. If the job which is currently being executed tries to lock the mutex, it will be stuck forever, because util_queue_add_job is stuck. The deadlock can be trivially resolved by increasing the queue size (reallocating the queue) in util_queue_add_job if the queue is full. Then util_queue_add_job becomes wait-free. radeonsi will use it. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:57:20 -04:00
Marek Olšák	465bb47d6f	radeonsi: expose ARB_timer_query unconditionally clock_crystal_freq is always non-zero now. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:57:17 -04:00
Marek Olšák	3d1a576fa6	ac/gpu_info: if clock crystal frequency is 0, print an error and set 1 During bring-up, this is often 0. Prevent automatic disablement of ARB_timer_query and demotion of the OpenGL version to 3.2 by setting a non-zero frequency. Print an error message instead. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:56:59 -04:00
Marek Olšák	d0963ef084	radeonsi/gfx9: don't read back non-existent register SRBM_STATUS2 It looks like there is no way to monitor SDMA busyness on GFX9. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:56:56 -04:00
Marek Olšák	5fb80a1e84	radeonsi: prevent a crash with DBG_CHECK_VM and u_threaded_context by setting PIPE_CONTEXT_DEBUG in the caller Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:56:51 -04:00
Marek Olšák	ddbd2f4c54	ac/surface/gfx9: flags.texture currently refers to TC-compatible HTILE This should lead to better MSAA performance on GFX9. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:56:46 -04:00
Marek Olšák	ffa7ec9e22	radeonsi: simplify computation of tessellation offchip buffers This is overly cautious, but better safe than sorry. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:55:07 -04:00
Marek Olšák	facfab28fe	radeonsi/gfx9: add workarounds to avoid VGPR indexing completely For inputs and outputs, indirect indexing is lowered by the GLSL compiler. For temporaries, use alloca and disable the "promote-alloca" pass. In the future, we could switch all codepaths to alloca permanently and just rely on the "promote-alloca" pass. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	93391ac478	radeonsi: emit param exports after position exports Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	9d9ffc8475	radeonsi: move building parameter exports into a separate function Both loops now look simple. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	4e30fb4ecc	radeonsi: don't use info.num_inputs when it's unused For clarity. It's only used by color interpolation. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	f8d6dd9b3d	radeonsi: add si_build_fs_interp helper This is much simpler. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	4560f2b90a	radeonsi: merge si_llvm_get_amdgpu_target into ac_get_llvm_target Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	c351037d6c	gallivm: inline gallivm_init_llvm_targets there is only one user. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	ece0c0439f	radeonsi: don't call gallivm_init_llvm_targets It's for initializing the native (x86) target. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	d308460586	gallium/radeon: reallocate suballocated buffers when exported This should fix exports of suballocated buffers. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Marek Olšák	5b555854cc	gallium/radeon: flush the context after in-place texture realloc before export Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 10:50:39 -04:00
Mark Thompson	63dcfed81f	st/va: Fix scaling list ordering for H.265 Mesa here requires the scaling lists in diagonal scan order, but VAAPI passes them in raster scan order. Therefore, rearrange the elements when copying. v2: Move scan tables to vl_zscan.c. Fix type in size assertion. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Mark Thompson <sw@jkqxz.net> Reviewed-by: Christian König <christian.koenig@amd.com>	2017-07-17 15:24:56 +01:00
Emil Velikov	4168c162c5	radv: advertise v6 of the wayland surface extension Jason updated the Khronos spec to explicitly state that Wayland surfaces must support VK_PRESENT_MODE_MAILBOX_KHR. ANV did so since day one (back in 2015) Cc: mesa-stable@lists.freedesktop.org Cc: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: Dave Airlie <airlied@redhat.com> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-07-17 15:24:48 +01:00
Emil Velikov	43c188f970	anv: advertise v6 of the wayland surface extension Jason updated the Khronos spec to explicitly state that Wayland surfaces must support VK_PRESENT_MODE_MAILBOX_KHR. ANV did so since day one (back in 2015) Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-07-17 15:24:32 +01:00
Emil Velikov	647b5a18df	i965: use strtol to convert the integer deviceID override One can override the deviceID, by setting the INTEL_DEVID_OVERRIDE variable. A few symbolic names or a numerical value for the actual device ID is accepted. At the same time we're using strtod (string to double) to convert the string to a decimal numeral. A seeming thinko, made by the original commit that introduces the code in libdrm_intel and got here with the import. Fixes: `514db96c11` ("i965: Import libdrm_intel.") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-07-17 15:23:49 +01:00
Marek Olšák	f9d5611617	gallium/u_blitter: don't use TXF for scaled blits There seems to be a rounding difference with F2I vs nearest filtering. The precise problem in the rounding is unknown. This fixes an incorrect output with OpenMAX encoding. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-07-17 15:47:30 +02:00
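
Commit 59ad769770 describes resolving the deadlock by reallocating the queue inside util_queue_add_job when it is full, so the producer never has to wait for a free slot. The following is a minimal sketch of that idea only, not the util/u_queue code: the struct job_ring type and the ring_init, ring_add and ring_pop helpers are hypothetical names, jobs are plain ints, and all locking is omitted to keep the growth step visible.

```c
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical ring buffer of integer "jobs"; not the real util_queue. */
struct job_ring {
    int *jobs;
    unsigned read_idx;   /* next slot to pop */
    unsigned write_idx;  /* next slot to fill */
    unsigned num_queued; /* number of jobs currently stored */
    unsigned max_jobs;   /* current capacity */
};

static bool ring_init(struct job_ring *r, unsigned max_jobs)
{
    if (max_jobs == 0)
        max_jobs = 1; /* keep the capacity non-zero for the modulo math */
    r->jobs = calloc(max_jobs, sizeof(*r->jobs));
    r->read_idx = r->write_idx = r->num_queued = 0;
    r->max_jobs = max_jobs;
    return r->jobs != NULL;
}

/* Add a job. When the ring is full, double it instead of waiting. */
static bool ring_add(struct job_ring *r, int job)
{
    if (r->num_queued == r->max_jobs) {
        unsigned new_max = r->max_jobs * 2;
        int *jobs = calloc(new_max, sizeof(*jobs));
        if (!jobs)
            return false;
        /* Copy the queued jobs in FIFO order, unwrapping the ring. */
        for (unsigned i = 0; i < r->num_queued; i++)
            jobs[i] = r->jobs[(r->read_idx + i) % r->max_jobs];
        free(r->jobs);
        r->jobs = jobs;
        r->read_idx = 0;
        r->write_idx = r->num_queued;
        r->max_jobs = new_max;
    }
    r->jobs[r->write_idx] = job;
    r->write_idx = (r->write_idx + 1) % r->max_jobs;
    r->num_queued++;
    return true;
}

/* Remove the oldest job; returns false when the ring is empty. */
static bool ring_pop(struct job_ring *r, int *job)
{
    if (r->num_queued == 0)
        return false;
    *job = r->jobs[r->read_idx];
    r->read_idx = (r->read_idx + 1) % r->max_jobs;
    r->num_queued--;
    return true;
}
```

In a threaded queue the same growth step would run under the queue's own lock; because the producer then never blocks while holding an outer mutex, the cycle described in the commit message cannot form.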
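
Commit 647b5a18df replaces strtod with strtol when converting a numeric INTEL_DEVID_OVERRIDE value. The following sketch shows one way such an integer conversion can be written; parse_devid_override is a hypothetical helper, and the base of 0 and the error handling are assumptions for illustration rather than what the i965 code does. The symbolic names mentioned in the commit message are not handled here.

```c
#include <errno.h>
#include <stdlib.h>

/*
 * Hypothetical helper: convert a device ID override string to an integer.
 * Base 0 (an assumption) lets strtol accept decimal, "0x"-prefixed hex,
 * and "0"-prefixed octal input. Returns -1 if the string is not a
 * complete, in-range integer.
 */
static long parse_devid_override(const char *s)
{
    char *end = NULL;

    errno = 0;
    long id = strtol(s, &end, 0);

    /* Reject overflow, empty input, and trailing non-numeric characters. */
    if (errno != 0 || end == s || *end != '\0')
        return -1;

    return id;
}
```

A caller would pass the result of getenv("INTEL_DEVID_OVERRIDE") after checking it for NULL, and fall back to the symbolic-name lookup when the helper returns -1.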


87704 commits