fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 09:18:04 +02:00

Author	SHA1	Message	Date
Connor Abbott	7594ef6eb0	lima/gpir: Support branch instructions Because branch conditions have to be in the pass slot, there is no unconditional branch, and realistically the pass slot has to contain a move when branching (there's nothing it does that would be useful for operating on booleans, so we can't use it for anything when computing the branch condition), we put the branch instruction in the pass slot and at codegen time turn it into a move of the branch condition. This means that it doesn't have to be special-cased like store instructions are in the scheduler. Because of this decision we can remove the half-implemented BRANCH codegen slot. Finally, we (ab)use the existing schedule_first mechanism to make sure that branches are always last in the basic block. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-24 08:35:47 +02:00
Connor Abbott	2df2e081fd	lima/gpir: Only try to place actual children When picking a node to be scheduled, we try to schedule its children as well. But we shouldn't try to schedule nodes which only have a fake dependency on the original node, since this isn't the point of scheduling children at the same time and can break some expectations of the rest of the code. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-24 08:35:26 +02:00
Connor Abbott	f989a024b4	lima/gpir: Fix compiler warning Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-24 08:33:56 +02:00
Adam Jackson	0d635ccc91	glx: Implement GLX_EXT_no_config_context This is the GLX counterpart to EGL_KHR_no_config_context. Contexts may now be created without reference to an fbconfig, in which case it is treated as compatible with any fbconfig (and thus any GLX drawable). Khronos: https://github.com/KhronosGroup/OpenGL-Registry/pull/102 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-09-23 20:39:01 -04:00
Adam Jackson	999c2aed88	glx: Lift sending the MakeCurrent request to top-level code Somewhat terrifyingly, we never sent this for direct contexts, which means the server never knew the context/drawable bindings. To handle this sanely, pull the request code up out of the indirect backend, and rewrite the context switch path to call it as appropriate. This attempts to preserve the existing behavior of not calling unbind() on the context if its refcount would not drop to zero. Of course, you can't just do this indiscriminately, because this is GLX and extant X servers have bugs and everything is terrible. To wit: - For 1.20.x prior to 1.20.6, you can bind a direct context once, but the second time you try to modify the context's binding you will get GLXBadContextTag. This includes unbinding the context. And "deleting" the context will leak memory, because it will still appear to be current. - For 1.19 and earlier, glXMakeCurrent(dpy, None, ctx) should be legal for GL 3.0+ contexts, but the server will throw BadMatch. To guard against this, we only send the request for indirect contexts unless the server is known good, and only mention one context at a time in such a request; if switching between contexts, we first unbind the old, and then bind the new. Note that the second VendorRelease() version is to catch XFree86 4.x and Xorg [67].x, which almost certainly have the above bugs. Other servers might report different version numbers here, but we can't do direct rendering against them, so this should be safe. Fixes glx-make-context, glx-multi-window-single-context and glx-query-drawable-glx_fbconfig_id-window. Sufficiently old piglit will regress on glx-make-glxdrawable-current (throwing BadMatch), which is fixed by mesa/piglit!116.	2019-09-23 20:39:01 -04:00
Adam Jackson	01e437988d	glx: Move vertex array protocol state into the indirect backend Only relevant for indirect contexts, so let's get that code out of the common path.	2019-09-23 20:21:01 -04:00
Kenneth Graunke	b9e93db208	intel: Increase Gen11 compute shader scratch IDs to 64. From the MEDIA_VFE_STATE docs: "Starting with this configuration, the Maximum Number of Threads must be set to (#EU * 8) for GPGPU dispatches. Although there are only 7 threads per EU in the configuration, the FFTID is calculated as if there are 8 threads per EU, which in turn requires a larger amount of Scratch Space to be allocated by the driver." It's pretty clear that we need to increase this for scratch address calculations, because the FFTID has a certain bit-pattern. The quote above seems to indicate that we should increase the actual thread count programmed in MEDIA_VFE_STATE as well, but we think the intention is to only bump the scratch space. Fixes GPU hangs in Bioshock Infinite and Synmark's CSDof on Icelake 8x8. Fixes: `5ac804bd9a` ("intel: Add a preliminary device for Ice Lake") Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-09-23 16:59:40 -07:00
Kenneth Graunke	50c0dd8621	Revert "intel/gen11+: Enable Hardware filtering of Semi-Pipelined State in WM" This reverts commit `729de1488f`. It turns out that, although the register is in the logical context, it isn't whitelisted, so we can't actually write it from userspace batch buffers. The write just becomes a noop, which is why we saw no performance changes. I manually whitelisted it, and still observed no performance gains, but it did regress KHR-GL46.texture_cube_map_array.color_depth_attachments on the iris driver. So we might need to fix something before enabling this. To prevent it randomly getting turned on should the kernel ever whitelist this register, we revert the patch for now.	2019-09-23 16:31:23 -07:00
Jason Ekstrand	03911195a3	util/rb_tree: Replace useless ifs with asserts Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-09-23 22:38:30 +00:00
Kenneth Graunke	a733423da5	broadcom/genxml: Stop manually scrubbing 'α' -> "alpha" 'α' has never appeared in any genxml files, so there's no need to replace it with the word "alpha". Reviewed-by: Eric Anholt <eric@anholt.net>	2019-09-23 20:24:54 +00:00
Kenneth Graunke	8489206e9d	intel/genxml: Stop manually scrubbing 'α' -> "alpha" 'α' has never appeared in any genxml files, so there's no need to replace it with the word "alpha". Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2019-09-23 20:24:54 +00:00
Rob Clark	d8cbf1adc1	freedreno/a6xx: do streamout only in binning pass Use VPC_SO_OVERRIDE to control whether we do streamout in binning or draw pass. Normally we want to do streamout in binning pass, except when there is a single tile and binning pass is skipped. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-09-23 20:02:34 +00:00
Rob Clark	b9bf374512	freedreno/a6xx: fix binning pass vs. xfb We could be doing streamout from binning pass. In this case we want to use the full VS which doesn't have (potentially streamed out) varyings stripped out. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-09-23 20:02:34 +00:00
Rob Clark	331f89a971	freedreno/a6xx: un-open-code PC_PRIMITIVE_CNTL_1.PSIZE Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2019-09-23 20:02:34 +00:00
Marek Olšák	05d32850ff	ac/nir: force unnormalized coordinates for RECT This fixes VAAPI. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-09-23 15:34:54 -04:00
Marek Olšák	500181b2ba	ac/nir: port Z compare value clamping from radeonsi This fixes some dEQP tests. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-09-23 15:34:54 -04:00
Marek Olšák	09447ccc78	tgsi_to_nir: fix 2-component system values like tess_level_inner_default Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-09-23 15:34:56 -04:00
Marek Olšák	3906fce88b	tgsi_to_nir: fix masked out image loads This caused a failure in NIR validation. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-09-23 15:34:54 -04:00
Marek Olšák	780eeaf2f1	nir: define 8-byte size and alignment for bindless variables Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-09-23 15:34:22 -04:00
Marek Olšák	f5c103ce1d	nir: don't add bindless variables to num_textures and num_images It confuses radeonsi. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-09-23 15:34:05 -04:00
Marek Olšák	150f6ffb4c	amd: remove all PCI IDs supported by amdgpu Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-23 15:15:35 -04:00
Jiang, Sonny	5a545e355b	loader: always map the "amdgpu" kernel driver name to radeonsi (v2) v2: cleanup Signed-off-by: Sonny Jiang <sonny.jiang@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-23 15:14:11 -04:00
Marek Olšák	9429714233	ac: stop using PCI IDs for chip identification PCI IDs for amdgpu will be removed from Mesa. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-23 15:14:11 -04:00
Marek Olšák	48742de601	ac/addrlib: fix chip identification for Vega10, Arcturus, Raven2, Renoir Cc: 19.2 <mesa-stable@lists.freedesktop.org> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-09-23 15:14:11 -04:00
Marek Olšák	65b698136c	amd: add more PCI IDs for Navi14 trivial and urgent Cc: 19.2 <mesa-stable@lists.freedesktop.org>	2019-09-23 15:12:34 -04:00
Eric Engestrom	c29c410182	meson: split compiler warnings one per line Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-09-23 17:56:22 +01:00
Jason Ekstrand	d63162cff0	nir/repair_ssa: Replace the unreachable check with the phi builder In `a3268599f3`, I attempted to fix nir_repair_ssa for unreachable blocks. However, that commit missed the possibility that the use is in a block which, itself, is unreachable. In this case, we can end up in an infinite loop trying to replace a def with itself. Even though a no-op replacement is a fine operation, it keeps extending the end of the uses list as we're walking it. Instead of explicitly checking for the group of conditions, just check if the phi builder gives us a different def. That's guaranteed to be 100% reliable and, while it lacks symmetry with the is_valid checks, should be more reliable. Fixes: `a3268599` "nir/repair_ssa: Repair dominance for unreachable..." Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2019-09-23 16:19:24 +00:00
Daniel Schürmann	2c050b49b3	aco: only emit waitcnt on loop continues if there was some load or export Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>	2019-09-23 13:39:33 +02:00
Karol Herbst	70e39294d7	nv50/ir/nir: comparison of integer expressions of different signedness warning Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Rhys Kidd <rhyskidd@gmail.com>	2019-09-23 13:27:32 +02:00
Karol Herbst	61ccca12f5	nv50/ir: fix unnecessary parentheses warning Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Rhys Kidd <rhyskidd@gmail.com>	2019-09-23 13:27:32 +02:00
Erico Nunes	ab49a0e746	lima: remove partial clear support from pipe->clear() pipe->clear() is not called for partial clears, which mesa emulates by drawing a quad. Furthermore, drivers should not use rasterizer state information for scissor information (which was being used to handle the partial clears). So, remove the partial clear support since it was not supposed to be handled by pipe->clear() anyway. This fixes issues with clearing after switching to different sized framebuffers. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-09-23 12:19:10 +02:00
Boris Brezillon	0c6ca0a647	dEQP-GLES2.functional.buffer.write.use.index_array.* are passing now. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>	2019-09-23 09:48:38 +02:00
Boris Brezillon	055497fa84	panfrost: Fix indexed draws ->padded_count should be large enough to cover all vertices pointed by the index array. Use the local vertex_count variable that contains the updated vertex_count value for the indexed draw case. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-23 09:47:41 +02:00
Karol Herbst	697eb8f973	clover/nir: fix compilation with g++-5.5 and maybe earlier fixes "sorry, unimplemented: non-trivial designated initializers not supported" Fixes: `deb04adf2a` ("clover: add support for passing kernels as nir to the driver") Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch>	2019-09-23 07:09:41 +00:00
Kenneth Graunke	ec81f19b44	st/mesa: Bail on incomplete attachments in discard_framebuffer Incomplete attachments don't have an associated pipe_surface, so this would crash. Fixes a WebGL conformance test that uses incomplete attachments: https://www.khronos.org/registry/webgl/sdk/tests/conformance2/renderbuffers/invalidate-framebuffer.html?webglVersion=2&quiet=0&quick=1 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111756 Reviewed-By: Tapani Pälli <tapani.palli@intel.com>	2019-09-22 21:03:16 -07:00
Vasily Khoruzhick	d214778753	lima: implement BO cache Allocating BOs is expensive, so we should avoid doing that by caching freed BOs. BO cache is modelled after one in v3d driver and works as follows: - in lima_bo_create() check if we have matching BO in cache and return it if there's one, allocate new BO otherwise. - in lima_bo_unreference() (renamed from lima_bo_free()): put BO in cache instead of freeing it and remove all stale BOs from cache Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-22 19:20:59 -07:00
Vasily Khoruzhick	9f897a2b4c	lima: use 0 to poll if BO is busy in lima_bo_wait() os_time_get_absolute_timeout(0) returns current time, while kernel driver expects 0 as value to poll BO status and return immediately. Fix it by setting abs_timeout to 0 if timeout_ns is 0 Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-22 19:20:59 -07:00
Qiang Yu	7f7ac21088	lima: move damage bound build to resource Reviewed-and-Tested-by: Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com>	2019-09-23 09:48:55 +08:00
Qiang Yu	4ed569eed7	lima: don't use damage system when full damage Sometimes weston sets a full damage region. It is more efficient to use the cached pp stream instead of dynamically creating one. Reviewed-and-Tested-by: Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com>	2019-09-23 09:48:50 +08:00
Qiang Yu	afbaed906d	lima: implement EGL_KHR_partial_update This extension sets a damage region for each buffer swap, which can be used to reduce buffer reload cost by only feeding the damage region's tile buffer address to the PP. Reviewed-and-Tested-by: Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com>	2019-09-23 09:48:15 +08:00
Icenowy Zheng	8278b236b0	lima: fix PLBU viewport configuration The PLBU expects the viewport's 4 borders' coordinates, however currently we're feeding the coordinate of the left-bottom point and the size to it, which leads to misrendering when the left-bottom point is not (0,0). Change the macros for the viewport PLBU command, and the data fed to it. The code to calculate the 4 borders is ported from Panfrost. Signed-off-by: Icenowy Zheng <icenowy@aosc.io> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-09-22 15:22:38 +08:00
Bas Nieuwenhuizen	40087ffc5b	amd: Build aco only if radv is enabled ACO depends on C++14, but radeonsi/radv with LLVM 8,9 do not. Let us only require it for RADV, since that is the only user. Fixes: `a70a998718` "radv/aco: Setup alternate path in RADV to support the experimental ACO compiler" Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-09-21 14:06:41 +02:00
Karol Herbst	7955fabcf8	nvc0: expose spirv support required for OpenCL v2: adjust to changes in previous commits v3: properly convert to NIR in nvc0_cp_state_create Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Pierre Moreau <pierre.morrow@free.fr> (v1)	2019-09-21 08:28:32 +00:00
Karol Herbst	deb04adf2a	clover: add support for passing kernels as nir to the driver v2: minor formatting fixes v3: call glsl_type_singleton_init_or_ref and glsl_type_singleton_decref v4: capitalize and punctuate comments fix text_executable -> text_intermediate in TODO make glsl_type_singleton wrapper static v5: rewrite how we run the nir passes v6: fix unhandled case switch warning in st/mesa Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> (v4)	2019-09-21 08:28:32 +00:00
Karol Herbst	1befaf4417	clover: prepare supporting multiple IRs v2: rework arguments to compiler::compile_program add assert to device::ir_format v3: remove PIPE_SHADER_IR_SPIRV change title Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> (v2) Reviewed-by: Pierre Moreau <pierre.morrow@free.fr>	2019-09-21 08:28:32 +00:00
Karol Herbst	c8cd8e279d	clover: add support for drivers having no proper binary format Most drivers have actually no binary format and just store the IR directly as a single entry point blob. v2: add a cap to switch between single or multi entry point binaries v3: remove the entry_point field v4: remove PIPE_CAP_MULTI_ENTRY_POINT_BINARIES v5: remove supports_multiple_entry_points Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Pierre Moreau <pierre.morrow@free.fr>	2019-09-21 08:28:32 +00:00
Karol Herbst	1982ac6d6b	clover/functional: add id_equals helper v2: pass argument by value Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Pierre Moreau <pierre.morrow@free.fr>	2019-09-21 08:28:32 +00:00
Karol Herbst	f3ba98cb18	rename pipe_llvm_program_header to pipe_binary_program_header We want to use it for other formats as well, so give it a more generic name Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Pierre Moreau <pierre.morrow@free.fr>	2019-09-21 08:28:32 +00:00
Karol Herbst	b6c47abe3e	gallium: add blob field to pipe_llvm_program_header makes it easier to consume an IR_NATIVE binary Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Pierre Moreau <pierre.morrow@free.fr>	2019-09-21 08:28:32 +00:00
Pierre Moreau	2043c5f37c	clover/llvm: Add functions for compiling from source to SPIR-V Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2019-09-21 08:28:32 +00:00
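
Commit 9f897a2b4c above describes a timeout pitfall in lima_bo_wait(): a relative timeout of 0 was converted to the current time, while the kernel expects a literal 0 to mean "poll and return immediately". The following is a minimal sketch of that special case, not the code from the commit. The helper name abs_timeout_for_wait and its now_ns parameter are hypothetical, and the saturation step is an added assumption about how very large timeouts might be handled.

```c
#include <stdint.h>

/* Hypothetical helper, not Mesa's actual code: convert a relative timeout
 * into the absolute deadline passed to the kernel. The caller supplies
 * now_ns (current monotonic time, assumed non-negative) so the function
 * stays pure and easy to check. */
static int64_t
abs_timeout_for_wait(int64_t now_ns, uint64_t timeout_ns)
{
   /* A relative timeout of 0 means "poll": pass 0 through instead of
    * turning it into the current time. */
   if (timeout_ns == 0)
      return 0;

   /* Assumption: saturate rather than overflow for very large timeouts. */
   if (timeout_ns > (uint64_t)(INT64_MAX - now_ns))
      return INT64_MAX;

   return now_ns + (int64_t)timeout_ns;
}
```

With this shape, a call with timeout_ns == 0 yields 0 regardless of the current time, which is the behaviour the commit message says the kernel driver expects.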


115915 commits