fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-19 13:10:28 +01:00

Author	SHA1	Message	Date
Samuel Pitoiset	5e2d25894b	mesa: Let compute shaders work in compatibility profiles The extension is already advertised in compatibility profile, but the _mesa_has_compute_shaders only returns true in core profile. If we advertise it, we should allow it to work. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2016-06-09 21:03:28 +02:00
Tim Rowley	2c85128e01	swr: implement clipPlanes/clipVertex/clipDistance/cullDistance v2: only load the clip vertex once v3: fix clip enable logic, add cullDistance v4: remove duplicate fields in vs jit key, fix test of clip fixup needed v5: fix clipdistance linkage for slot!=0,4 v6: support clip+cull; passes most piglit clip (failures understood) Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-06-09 13:28:35 -05:00
Daniel Czarnowski	cf804b4455	glx: fix crash with bad fbconfig GLX documentation states: glXCreateNewContext can generate the following errors: (...) GLXBadFBConfig if config is not a valid GLXFBConfig Function checks if the given config is a valid config and sets proper error code. Fixes currently crashing glx-fbconfig-bad Piglit test. v2: coding style cleanups (Emil, Topi) use DefaultScreen macro (Emil) Signed-off-by: Matt Roper <matthew.d.roper@intel.com> Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Cc: "11.2" <mesa-stable@lists.freedesktop.org>	2016-06-09 17:55:44 +03:00
Nayan Deshmukh	2d140ae70a	st/vdpau: implement luma keying Signed-off-by: Nayan Deshmukh <nayan26deshmukh@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-06-09 14:23:24 +02:00
Nayan Deshmukh	f24eb5a178	vl: Apply luma key filter before CSC conversion Apply the luma key filter to the YCbCr values during the CSC conversion in video buffer shader. The initial values of max and min luma are set to opposite values to disable the filter initially and will be set when enabling it. Add extra parameters min and max luma for the luma key filter in vl_compositor_set_csc_matrix in va, xvmc. Setting them to opposite value 1.f and 0.f respectively won't affect the CSC conversion v2: -Squash 1,2 and 3 into one patch to avoid breaking build of other components. (Christian) -use ureg_swizzle. (Christian) -change name of the variables. (Christian) v3: -Squash all patches in one to avoid breaking of build. (Emil) -wrap functions properly. (Emil) -use 0.0f and 1.0f instead of 0.f and 1.f respectively. (Emil) v4: -Divide it in two patches one which introduces the functionality and assigns dummy values to the changed functions and second which implements the lumakey filter. (Christian) -use ureg_scalar instead of ureg_swizzle. (Christian) Signed-off-by: Nayan Deshmukh <nayan26deshmukh@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-06-09 14:23:07 +02:00
Jason Ekstrand	037ce5d734	i965: Emit surface states for extra planes prior to gen8 When Kristian implemented GL_TEXTURE_EXTERNAL_OES, he hooked it up for gen8 but not for gen7 or earlier. It all works, we just need to emit the states for the extra planes. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-08 21:57:57 -07:00
Marc-André Lureau	dc81b3ad43	virgl: fix checking fences When calling virgl_fence_wait() with timeout=0, virgl_{drm,vtest}_resource_is_busy() is called. However, it returns TRUE for a busy resource, whereas virgl_fence_wait() should return TRUE for a completed (non-busy) resource. This fixes running supertuxkart in a VM (I could not reproduce locally with vtest though there is a similar fix) Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com> Cc: "11.1 11.2 12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 14:07:53 +10:00
Dave Airlie	15896a470b	glsl/types: rename is_dual_slot_double to is_dual_slot_64bit. In the future int64 support will have the same requirements. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 09:17:24 +10:00
Dave Airlie	45c901f7a3	st/glsl_to_tgsi: move to checking 64-bitness instead of double This uses the new types interfaces to check for 64-bit types, as futureproofing against int64 support. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 07:37:49 +10:00
Dave Airlie	bbbc45b8e1	st/glsl_to_tgsi: use enum glsl_base_type instead of unsigned This is just some better type safety that I noticed while working on 64-bit integer support. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 07:37:49 +10:00
Dave Airlie	152f5eea62	mesa: use new 64-bit checks instead of explicit double checks. This just moves to the new interfaces in advance of int64. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 07:37:47 +10:00
Dave Airlie	2df46519e4	glsl/link_varyings: switch to 64bit check instead of double. This is prep work for int64 support. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 07:37:43 +10:00
Dave Airlie	35616a9e0e	glsl: use new interfaces for 64-bit checks. This is just prep work for int64 support, changing places where 64-bit matters, not doubles. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 07:37:19 +10:00
Dave Airlie	a82b8e8b36	compiler: use 64bit check for sizing instead of double check. This just moves code to the new check in advance of int64 support. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 07:37:15 +10:00
Dave Airlie	246518154e	compiler/types: add 64-bitness queries. This adds an inline and type query for if a type is 64-bit. For now this is equivalent to double, but int64 will change this. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-09 07:37:04 +10:00
Adam Jackson	a1c5cd426c	glapi/glx: Add overflow checks to the client-side indirect code Coverity complains that the computed sizes can lead to negative lengths passed to memcpy. If that happens we've been handed invalid arguments anyway, so just bomb out. The funky "0%s" is because the size string for the variable-length part of the request is of the form "+ safe_pad() ...", and a unary + would coerce the result to always be positive, defeating the overflow check. Signed-off-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-06-08 14:39:46 -04:00
Marek Olšák	26b69ad250	radeonsi: improve the computation and comment of scratch_waves 2% isn't much. If you think the number should be decreased, please speak up. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-08 19:28:25 +02:00
Marek Olšák	1d9c1d9386	radeonsi: print the number of spilled VGPRs Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-08 19:28:25 +02:00
Marek Olšák	2b18d67a1e	gallium/radeon: remove dead code creating LLVMTargetMachine This was for some old unsupported LLVM version. Only si_create_context creates the target machine now. r600g doesn't use this function. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-08 19:23:42 +02:00
Marek Olšák	a343ab55f7	radeonsi: don't enable scratch just for SGPR spills Diff from shader-db: Scratch: 3221504 -> 17408 (-99.46 %) bytes per wave v2: add "break;" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-08 19:23:41 +02:00
Marek Olšák	55b097d004	st/mesa: try not to compile compute shader on the first use Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-06-08 19:23:41 +02:00
Marek Olšák	95288277d5	Revert "radeonsi: allow direct hw MSAA resolve for scanout surfaces" This reverts commit `ffd54d1936`. No, it doesn't work. The test case is "glxgears -samples 2".	2016-06-08 19:21:55 +02:00
Nicolai Hähnle	bd5c41fe5f	st/mesa: directly compute level=0 texture size in st_finalize_texture The width0/height0/depth0 on stObj may not have been set at this point. Observed in a trace that set up levels 2..9 of a 2d texture, and set the base level to 2, with height 1. This made the guess logic always bail. Originally investigated by Ilia Mirkin, this patch gets rid of the somewhat redundant storage of width0/height0/depth0 and makes sure we always compute pipe texture sizes that are compatible with the base level image of the GL texture. Fixes the gl-1.2-texture-base-level piglit test provided by Brian Paul. v2: - try to re-use an existing pipe texture when possible - handle a corner case where the base level is not level 0 and it is of size 1x1x1 v3: - ptHeight = ptWidth in cube map 1x1 case (suggested by Brian) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2016-06-08 19:12:07 +02:00
Timothy Arceri	8c3ecde0e1	glsl: stop allocating memory for SSBOs and builtins This just stops counting and assigning a storage location for these uniforms, the count is only used to create the uniform storage. These uniform types don't use this storage. Reviewed-by: Dave Airlie <airlied@redhat.com>	2016-06-08 13:19:32 +10:00
Ilia Mirkin	6e6fd911da	st/mesa: use buffer usage history to set dirty flags for revalidation We were previously unconditionally doing this for arrays and ubo's, and ignoring texture/storage/atomic buffers. Instead use the usage history to determine which atoms need to be revalidated. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-07 22:27:04 -04:00
Gurchetan Singh	d9546b0c5d	i965: Integrate precise trig into configuration infrastructure With this change, to enable precise SIN and COS instructions on Intel hardware, one can put <option name="precise_trig" value="true"/> in the proper drirc file. V2: Make option name more generic Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Stephane Marchesin <stephane.marchesin@gmail.com>	2016-06-07 15:42:21 -07:00
Marek Olšák	f39439d166	radeonsi: re-enable PBO ReadPixels acceleration disabled by `4f1cccf570` Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-06-08 00:22:45 +02:00
Marek Olšák	7c6e88b643	radeonsi: allow MSAA resolving into a texture that has DCC enabled Since DCC is enabled almost everywhere now, it's important not to disable this fast path. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	9a472a3e0b	gallium/radeon: move DCC clearing into a separate function Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	ffd54d1936	radeonsi: allow direct hw MSAA resolve for scanout surfaces No idea why this was disabled, but it works fine. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	4be46c7d9d	radeonsi: don't allocate DCC for the temporary MSAA resolve surface Allocating it has no effect, but it adds overhead (useless DCC clear). Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	c06246501e	radeonsi: don't enable DCC in the sampler if first_level doesn't have it If first_level > 0 and DCC is disabled for that level, let's skip DCC reads entirely. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	00389100b6	winsys/amdgpu: enable DCC for mipmapped textures Also add dcc_fast_clear_size for clearing only the necessary subset of DCC. For no AA, it's equal to the size of the whole DCC level. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	c65361763c	gallium/radeon: don't disable DCC because of SDMA We want to keep DCC enabled to save bandwidth. It was a bad idea to disable it here. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	2fd74a05bb	radeonsi: don't flag renderbuffer feedback loop if DCC has just been disabled Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	aa7fe70443	radeonsi: add per-level dcc_enabled flags Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	60e93ddd06	radeonsi: compute DCC register parameters in si_emit_framebuffer_state This will get more complicated with mipmapped DCC or when DCC is enabled after allocation. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	a01536a29f	gallium/radeon: add an assertion checking the validity of PIPE_BIND_SCANOUT Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Marek Olšák	d4d733e39d	gallium/radeon: don't allocate DCC for non-renderable texture formats R9G9B9E5 is the only uncompressed one hopefully. This fixes incorrect rendering not discovered (due to a lack of tests) until DCC mipmapping was enabled. Cc: 11.1 11.2 12.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-06-08 00:22:45 +02:00
Nicolai Hähnle	b42bc90b6a	radeonsi: enable WQM in PS prolog when needed WQM is needed when the PS prolog computes a VGPR that is consumed by a shader with (implicit or explicit) derivatives. Depends on http://reviews.llvm.org/D20839 / LLVM r272063 for this to be effective (otherwise it's just a no-op). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=95130 Cc: 12.0 <mesa-dev@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 23:46:02 +02:00
Nicolai Hähnle	d3a584defe	tgsi/scan: add uses_derivatives (v2) v2: - TG4 does not calculate derivatives (Ilia) - also handle SAMPLE* instructions (Roland) Cc: 12.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1) Reviewed-by: Brian Paul <brianp@vmware.com> (v1) Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2016-06-07 23:45:17 +02:00
Nanley Chery	b7a0c0ec7f	docs/devinfo: Expound on helpful extension tips Signed-off-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-06-07 11:16:23 -07:00
Nanley Chery	9e7de50cab	docs/devinfo: Update bullet in stale extension guide Signed-off-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-06-07 11:16:23 -07:00
Nanley Chery	26b0f023d7	docs/devinfo: Add closing paragraph tag Signed-off-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-06-07 11:16:23 -07:00
Tim Rowley	87f0a0448f	swr: fix provoking vertex Use rasterizer provoking vertex API. Fix rasterizer provoking vertex for tristrips and quad list/strips. v2: make provoking vertex tables static const Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-06-07 11:47:52 -05:00
Ilia Mirkin	c81b090c92	st/mesa: revalidate image atoms when a texture is updated A texture may be redefined with _NEW_TEXTURE, which might have been bound to a shader image slot. We have to revalidate the image atoms to pick up on the new resource. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-07 10:18:34 -04:00
Ilia Mirkin	71ad8a173f	gk104/ir: fix conditions for adding a texbar Sometimes a register source can actually be double- or even quad-wide. We must make sure that the inserted texbars take that width into account. Based on an earlier patch by Samuel Pitoiset. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: "12.0 11.2" <mesa-stable@lists.freedesktop.org>	2016-06-07 10:18:13 -04:00
Nicolai Hähnle	8239da28e8	radeonsi: keep track of dirty descriptor sets Reduces CPU load for draw calls that change none or few of the descriptors. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:18:10 +02:00
Nicolai Hähnle	d152c73712	radeonsi: move si_descriptors into a per-context array Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:18:07 +02:00
Nicolai Hähnle	a29c4f9ebd	radeonsi: pass shader stage to si_disable_shader_image Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-07 15:18:05 +02:00

85652 commits