fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-06 17:30:20 +01:00

Author	SHA1	Message	Date
Connor Abbott	f1ba0a5ea0	glsl: fix ir_constant::equals() for doubles Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-11-19 09:16:18 +01:00
Connor Abbott	84ed3819a4	glsl: fix isinf() for doubles Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-11-19 09:16:18 +01:00
Connor Abbott	7820b2c071	nir: fix constant folding of bfi Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com>	2015-11-19 09:16:18 +01:00
Brian Paul	1cfffb95eb	hud: fix Windows build break Protect signal-related code with PIPE_OS_UNIX test. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2015-11-19 07:57:09 +00:00
Ian Romanick	2f55476153	glsl: Fix off-by-one error in array size check assertion Apparently, this has been a bug since 2010 (`c30f6e5d`). Also use ARRAY_SIZE instead of open coding it. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2015-11-18 18:35:56 -08:00
Ian Romanick	0aded03046	mesa: Don't expose GL_EXT_shader_integer_mix in GLES 1.x There are no shaders, so it doesn't even make sense to expose the extension. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Cc: Nanley Chery <nanley.g.chery@intel.com>	2015-11-18 18:35:56 -08:00
Ian Romanick	37c2cfa6bc	glsl: Silence unused parameter warnings builtin_functions.cpp:5289:52: warning: unused parameter 'num_arguments' [-Wunused-parameter] unsigned num_arguments, ^ builtin_functions.cpp:5290:52: warning: unused parameter 'flags' [-Wunused-parameter] unsigned flags) ^ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2015-11-18 18:35:56 -08:00
Ian Romanick	c82498c4da	glsl: Silence ignored qualifier warning I think the intention was to mark the "this" parameter as const, but const goes on the other end to do that. In file included from glsl_symbol_table.cpp:26:0: ast.h:339:35: warning: type qualifiers ignored on function return type [-Wignored-qualifiers] const bool is_single_dimension() ^ Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2015-11-18 18:35:56 -08:00
Kenneth Graunke	fc19a0d2e4	i965: Allow indirect GS input indexing in the scalar backend. This allows arbitrary non-constant indices on GS input arrays, both for the vertex index, and any array offsets beyond that. All indirects are handled via the pull model. We could potentially handle indirect addressing of pushed data as well, but it would add additional code complexity, and we usually have to pull inputs anyway due to the sheer volume of input data. Plus, marking pushed inputs as live due to indirect addressing could exacerbate register pressure problems pretty badly. We'd need to be careful. v2: Use updated MOV_INDIRECT opcode. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Abdiel Janulgue <abdiel.janulgue@linux.intel.com> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-11-18 15:42:36 -08:00
Jimmy Berry	56a1c10bb8	gallium/hud: control visibility at startup and runtime. - env GALLIUM_HUD_VISIBLE: control default visibility - env GALLIUM_HUD_SIGNAL_TOGGLE: toggle visibility via signal Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2015-11-19 00:02:33 +01:00
Jason Ekstrand	0bee3acc2a	i965/nir: Add hooks for testing nir_shader_clone This commit adds code for testing nir_shader_clone by running it after each and every optimization pass and throwing away the old shader. Testing nir_shader_clone is hidden behind a new INTEL_CLONE_NIR environment variable. Reviewed-by: Rob Clark <robclark@freedesktop.org> Acked-by: Kenneth Graunke <kenneth@whitecape.org>	2015-11-18 12:28:55 -08:00
Jason Ekstrand	9fbd390dd4	nir: Add support for cloning shaders This commit is heavily based on one by Rob Clark <robdclark@gmail.com> but reworked to re-use nir_create functions and do less hashing. Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 12:28:32 -08:00
Kenneth Graunke	9ff71b649b	i965/nir: Validate that NIR passes call nir_metadata_preserve(). Failing to call nir_metadata_preserve() can have nasty consequences: some pass breaks dominance information, but leaves it marked as valid, causing some subsequent pass to go haywire and probably crash. This pass adds a simple validation mechanism to ensure passes handle this properly. We add a new bogus metadata flag that isn't used for anything in particular, set it before each pass, and ensure it isn't still set after the pass. nir_metadata_preserve will reset the flag, so correct passes will work, and bad passes will assert fail. (I would have made these functions static inline, but nir.h is included in C++, so we can't bit-or enums without lots of casting...) Thanks to Dylan Baker for the idea. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-11-18 12:28:32 -08:00
Kenneth Graunke	7bc0978999	i965/nir: Add OPT() and OPT_V() macros for invoking NIR passes. OPT() is the normal macro for passes that return booleans, while OPT_V() is a variant that works for passes that don't properly report progress. (Such passes should be fixed to return a boolean, eventually.) These macros take care of calling nir_validate_shader() and setting progress appropriately. In the future, it would be easy to add shader dumping similar to INTEL_DEBUG=optimizer by extending the macro. v2 (Jason Ekstrand): - Fix an unused variable warning Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-11-18 12:28:32 -08:00
Rob Clark	d27ae2cf8c	nir: add array length field This will simplify things somewhat in clone. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-11-18 12:28:32 -08:00
Rob Clark	624ec66653	nir: remove nir_variable::max_ifc_array_access No users. Signed-off-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2015-11-18 12:28:32 -08:00
Rob Clark	4671c13852	freedreno/a4xx: add fake RGTC support (required for GL3) The a4xx bits corresponding to 'freedreno/a3xx: add fake RGTC support (required for GL3)' TODO some more r/e.. maybe we get lucky and hw supports some of this directly? For now this will help us enable gl3. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Rob Clark	2379cc9fe0	freedreno/a4xx: add compressed texture formats Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Rob Clark	fadd39442b	freedreno: update generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	4607b2b9b6	freedreno: expose GLSL 140 and fake MSAA for GL3.0/3.1 support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	9c409c8df3	freedreno/a3xx: fix texture buffers, enable offsets The main issue is that the current logic looked into cso->u.tex, which is the wrong side of the union to look into for texture buffers. While I was at it, it was easy enough to add the logic to handle offsets (first_element). - reduce texture buffer size limit (determined experimentally) - don't look at first/last levels, instead look at first/last element - include the first element offset - set offset alignment to 16 (determined experimentally) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	d69e557f2a	freedreno: add support for conditional rendering, required for GL3.0 A smarter implementation would make it possible to attach this to emit state for the BY_REGION versions to avoid breaking the tiling. But this is a start. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	059da344ec	freedreno/a3xx: add fake RGTC support (required for GL3) Also throw in LATC while we're at it (same exact format). This could be made more efficient by keeping a shadow compressed texture to use for returning at map time. However... it's not worth it for now... presumably compressed textures are not updated often. Lastly fix up Z32S8 transfers to non-0 layers. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	84d087aea2	freedreno/a3xx: add missing formats to enable ARB_vertex_type_2_10_10_10_rev The previously RE'd formats were from an ES driver implementing OES_vertex_type_10_10_10_2 and thus backwards. A future change could add the 2_10_10_10 support. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Rob Clark	8106fec74c	freedreno/a3xx+a4xx: fix for stk binning pass hang We'd end up in a state where shader uses no inputs, yet num_elements is greater than zero. Triggered by a TF vertex shader which did: gl_Position = vec4(0.0, 0.0, 0.0, 0.0); resulting in a binning pass variant with no inputs. Includes equiv fix in a4xx, even though we don't have binning-pass enabled yet on a4xx. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Rob Clark	b24c9a8aee	freedreno/a3xx+a4xx: fix GL_POINTS lockup w/ GLES point_size_per_vertex is always TRUE for GLES, causing us to configure the hw as if gl_PointSize was written, even if it was not. Which makes for grumpy hw. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Ilia Mirkin	b40e144a66	nir: fix typo in idiv lowering, causing large-udiv-udiv failures In nv50, and in the python script that Rob circulated, we do: bld.mkCmp(OP_SET, CC_GE, TYPE_U32, (s = bld.getSSA()), TYPE_U32, m, b); Do the same in the nir div lowering pass. This fixes the large-udiv-udiv piglit tests on freedreno. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Rob Clark <robclark@freedesktop.org>	2015-11-18 14:31:13 -05:00
Oded Gabbay	4581f8428e	llvmpipe: disable VSX in ppc due to LLVM PPC bug This patch disables the use of VSX instructions, as they cause some piglit tests to fail For more details, see: https://llvm.org/bugs/show_bug.cgi?id=25503#c7 With this patch, ppc64le reaches parity with x86-64 as far as piglit test suite is concerned. v2: - Added check that we have at least LLVM 3.4 - Added the LLVM bug URL as a comment in the code v3: - Only disable VSX if Altivec is supported, because if Altivec support is missing, then VSX support doesn't exist anyway. - Change original patch description. Signed-off-by: Oded Gabbay <oded.gabbay@gmail.com> Cc: "11.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-11-18 21:27:29 +02:00
Ilia Mirkin	8e68113c1a	nvc0/ir: actually emit AFETCH on kepler Looks like this was forgotten in the commit which added the AFETCH logic. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2015-11-18 14:26:16 -05:00
Kenneth Graunke	2631bfd62c	nir: Store the size of the TCS output patch in nir_shader_info. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-11-18 10:49:18 -08:00
Kenneth Graunke	b196f1fff3	i965: Add enums for 3DSTATE_TE field values. 3DSTATE_TE has partitioning, output topology, and domain fields, each of which has several enumerated values. We'll also need to switch on the domain, so enums (rather than #defines) seem like a natural fit. I chose to put these in brw_compiler.h because they'll be stored in struct brw_tes_prog_data, which will live there. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kristian Høgsberg <krh@bitplanet.net>	2015-11-18 10:49:18 -08:00
Ian Romanick	72e232374e	meta/generate_mipmap: Don't leak the framebuffer object Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: "10.6 11.0" <mesa-stable@lists.freedesktop.org>	2015-11-18 09:38:21 -08:00
Brian Paul	1a48326a84	svga: use more VGPU10 formats We always want to prefer the VGPU10 formats over the VGPU9 ones when we have VGPU10 support. Original patch by Jose and updated by Brian. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2015-11-18 09:16:12 -07:00
Brian Paul	1a90e3e1e3	svga: add/use new svga_sampler_format() function This is important for the case of sampling from a depth texture. In that case, we need to sample the texture as if it were a single-channel color texture. For other/color formats, we can use the format as-is. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: José Fonseca <jfonseca@vmware.com>	2015-11-18 09:15:54 -07:00
Nicolai Hähnle	27ce75ed12	radeon: count cs dwords separately for query begin and end This will be important for perfcounter queries. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-18 12:27:13 +01:00
Nicolai Hähnle	ffd01b7781	radeon: expose r600_query_hw functions for reuse Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:13 +01:00
Nicolai Hähnle	50f0f938e3	radeon: implement r600_query_hw_get_result via function pointers We will need the clear_result override for the batch query implementation. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-18 12:27:13 +01:00
Nicolai Hähnle	c207c55fc0	radeon: split hw query buffer handling from cs emit The idea here is that driver queries implemented outside of common code will use the same query buffer handling with different logic for starting and stopping the corresponding counters. Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:13 +01:00
Nicolai Hähnle	1d10b3d01e	radeon: convert hardware queries to the new style Move r600_query and r600_query_hw into the header because we will want to reuse the buffer handling and suspend/resume logic outside of the common radeon code. Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:12 +01:00
Nicolai Hähnle	019106760d	radeon: convert software queries to the new style Software queries are all queries that do not require suspend/resume and explicit handling of result buffers. Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:12 +01:00
Nicolai Hähnle	829a9808a9	radeon: add query handler function pointers The goal here is to be able to move the implementation details of hardware- specific queries (in particular, performance counters) out of the common code. Reviewed-by: Marek Olšák <marek.olsak@amd.com> [Fixed a rebase conflict and re-tested before pushing.]	2015-11-18 12:27:12 +01:00
Nicolai Hähnle	50cab4788d	radeon: move R600_QUERY_* constants into a new query header file More query-related structures will have to be moved into their own header file to support hardware-specific performance counters. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-18 12:27:12 +01:00
Nicolai Hähnle	c56e83e518	radeon: cleanup driver query list Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-18 12:27:12 +01:00
Nicolai Hähnle	e117e74baf	radeon: move get_driver_query_info to r600_query.c Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-11-18 12:27:11 +01:00
Neil Roberts	5dfb4dbc05	i965: Prevent fast clears for MSRTs on SKL There are currently a bunch of formats that behave strangely when sampling the cleared color from the MCS buffer on SKL. They seem to mostly be formats that don't have an alpha component, although it's not all of them, and we haven't yet found anything in the specs which would explain this. For now to be on the safe side this patch just prevents fast clears for MSRTs on SKL altogether so that when fast clears are eventually enabled it will only be for single-sampled surfaces. The assumption is that clears are probably more likely to be used in single-sampled applications anyway so we can at least get them working and we can enable MSRTs later once we understand the problem better. This patch should have no functional effect other than perhaps receiving fewer perf_debug messages on SKL+. v2: Improve the commit message to avoid saying the patch disables fast clears because it will be merged before fast clears are enabled for any surfaces so it doesn't actually disable anything. Reviewed-by: Ben Widawsky <benjamin.widawsky@intel.com> Reviewed-by: Chad Versace <chad.versace@intel.com>	2015-11-18 10:29:07 +01:00
Eric Anholt	dd05ffebfc	vc4: Don't bother lowering uniforms when the same value is used twice. DEQP likes to do math on uniforms, and the "fmaxabs dst, uni, uni" to get the absolute value would get lowered. The lowering doesn't bother to try to restrict the lifetime of the lowered uniforms, so we'd end up register allocation failng due to this on 5 of the tests (More tests still fail in RA, which look like we'll need to reduce lowered uniform lifetimes to fix). No changes on shader-db, though fewer extra MOVs are generated on even glxgears (MOVs pair well enough that it ends up being the same instruction count).	2015-11-17 17:45:23 -08:00
Eric Anholt	dffe7260cd	vc4: Fix uniform reordering to support reading the same uniform twice. This does actually happen in the wild (particularly fabs of a uniform), so we'd like to support it.	2015-11-17 17:45:23 -08:00
Eric Anholt	d18d1ba587	vc4: Fix documentation on vc4_qir_lower_uniforms.c.	2015-11-17 17:45:23 -08:00
Eric Anholt	a4bf28178f	vc4: Add support for nir_op_uge, using the carry bit on QPU_A_SUB. It looks like nir_lower_idiv is going to use it soon, so add support. With Ilia's change, this fixes one case in fs-op-div-large-uint-uint (with GL 3.0 forced on). Cc: "11.0" <mesa-stable@lists.freedesktop.org>	2015-11-17 17:45:23 -08:00
Kenneth Graunke	27b1d34438	i965: Fix PIPE_CONTOL typo. PIPE_CONTOL!!!	2015-11-17 16:33:48 -08:00

1 2 3 4 5 ...

67657 commits