fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-30 22:20:27 +01:00

Author	SHA1	Message	Date
Brian Paul	bd60fb49ba	vbo: clean up with 'indent', whitespace fixes, etc in vbo_exec_array.c Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-10-20 09:47:21 -06:00
Brian Paul	8b9965442a	vbo: whitespace fixes and reformatting in vbo_exec_api.c Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-10-20 09:47:21 -06:00
Brian Paul	8320bf1a7e	vbo: minor clean-up in vbo_exec_api.c Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-10-20 09:47:21 -06:00
Brian Paul	1098e6957c	vbo: move attribute type assignment If the attribute type is changing, we would have found that earlier in the ATTR_UNION() macro and would have called vbo_exec_fixup_vertex(). So move the assignment into that function so we don't do it every time. No Piglit regressions. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-10-20 09:47:21 -06:00
Brian Paul	4c3c9f1441	vbo: rename reset_attrfv() to vbo_reset_all_attr() Use a better name. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-10-20 09:47:21 -06:00
Brian Paul	7693bcde28	vbo: make vbo_reset_attr() static Not called from any other file. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-10-20 09:47:21 -06:00
Brian Paul	9d6d9b28f7	vbo: trivial indentation fix in vbo_exec_api.c	2016-10-20 09:47:21 -06:00
Marek Olšák	c2a602d21a	gallivm: try to fix build with LLVM <= 3.4 due to missing CallSite.h Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com>	2016-10-20 17:45:23 +02:00
Marek Olšák	f19f71830b	radeonsi: fix build of si_eliminate_const_vs_outputs on LLVM <= 3.8 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-20 11:07:50 +02:00
Marek Olšák	2db56434d4	gallivm: add wrappers for missing functions in LLVM <= 3.8 radeonsi needs these. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-20 11:07:50 +02:00
Nicolai Hähnle	4a2dbfff05	radeonsi: fix 64-bit loads from LDS Fixes spec/arb_tessellation_shader/execution/dvec[23]-vs-tcs-tes, among others. Cc: "12.0 13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-10-20 10:37:07 +02:00
Nicolai Hähnle	bfa50f88ce	st/mesa: only set primitive_restart when the restart index is in range Even when enabled, primitive restart has no effect when the restart index is larger than the representable values in the index buffer. Fixes GL45-CTS.gtf31.GL3Tests.primitive_restart.primitive_restart_upconvert for radeonsi VI. v2: add an explanatory comment Cc: "12.0 13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1)	2016-10-20 10:37:06 +02:00
Nicolai Hähnle	3d9b57e493	st/glsl_to_tgsi: sort input and output decls by TGSI index Fixes a regression introduced by commit `777dcf81b`. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98307 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: 13.0 <mesa-stable@lists.freedesktop.org>	2016-10-20 10:37:06 +02:00
Nicolai Hähnle	a1895685f8	st/glsl_to_tgsi: fix block copies of arrays of structs Use a full writemask in this case. This is relevant e.g. when a function has an inout argument which is an array of structs. v2: use C-style comment (Timothy Arceri) Reviewed-by: Marek Olšák <marek.olsak@amd.com> (v1) Cc: 13.0 <mesa-stable@lists.freedesktop.org>	2016-10-20 10:37:01 +02:00
Nicolai Hähnle	ca592af880	st/glsl_to_tgsi: fix block copies of arrays of doubles Set the type of the left-hand side to the same as the right-hand side, so that when the base type is double, the writemask of the MOV instruction is properly fixed up. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: 13.0 <mesa-stable@lists.freedesktop.org>	2016-10-20 10:30:00 +02:00
Iago Toral Quiroga	3da08e1664	glsl: Indirect array indexing on non-last SSBO member must fail compilation After the changes in comit `5b2675093e`, we moved this check to the linker, but the spec expects this to be checked at compile-time. There are dEQP tests that expect an error at compile time and the spec seems to confirm that expectation: "Except for the last declared member of a shader storage block (section 4.3.9 “Interface Blocks”), the size of an array must be declared (explicitly sized) before it is indexed with anything other than an integral constant expression. The size of any array must be declared before passing it as an argument to a function. Violation of any of these rules result in compile-time errors. It is legal to declare an array without a size (unsized) and then later redeclare the same name as an array of the same type and specify a size, or index it only with integral constant expressions (implicitly sized)." Commit `5b2675093e` tries to take care of the case where we have implicitly sized arrays in SSBOs and it does so by checking the max_array_access field in ir_variable during linking. In this patch we change the approach: we look for indirect access on SSBO arrays, and when we find one, we emit a compile-time error if the accessed member is not the last in the SSBO definition. There is a corner case that the specs do not address directly though and that dEQP checks for: the case of an unsized array in an SSBO definition that is not defined last but is never used in the shader code either. The following dEQP tests expect a compile-time error in this scenario: dEQP-GLES31.functional.debug.negative_coverage.callbacks.shader.compile_compute_shader dEQP-GLES31.functional.debug.negative_coverage.get_error.shader.compile_compute_shader dEQP-GLES31.functional.debug.negative_coverage.log.shader.compile_compute_shader However, since the unsized array is never used it is never indexed with a non-constant expression, so by the spec quotation above, it should be valid and the tests are probably incorrect. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-20 08:26:51 +02:00
Ilia Mirkin	cd45d758ff	nv50/ir: process texture offset sources as regular sources With ARB_gpu_shader5, texture offsets can be any source, including TEMPs and IN's. Make sure to process them as regular sources so that we pick up masks, etc. This should fix some CTS tests that feed offsets directly to textureGatherOffset, and we were not picking up the input use, thus not advertising it in the shader header. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-by: Dave Airlie <airlied@redhat.com> Cc: 12.0 13.0 <mesa-stable@lists.freedesktop.org>	2016-10-19 21:02:01 -04:00
Ilia Mirkin	313fba5ee1	nv50,nvc0: avoid reading out of bounds when getting bogus so info The state tracker tries to attach the info to the wrong shader. This is easy enough to protect against. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: 12.0 13.0 <mesa-stable@lists.freedesktop.org>	2016-10-19 21:02:01 -04:00
Eric Engestrom	8bf7717e1f	wsi/wayland: fix error path Fixes: `1720bbd353` ("anv/wsi: split image alloc/free out to separate fns.") Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Eric Engestrom <eric@engestrom.ch> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-20 10:53:59 +10:00
Dave Airlie	b0f131b0bf	anv: drop unused zero macro. I can't see this being used anywhere. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-20 10:53:37 +10:00
Dave Airlie	d842546ad1	radv: use emit_icmp for samples_identical On a debug llvm build we'd assert on the next compare when the return from samples_identical was i1 instead of i32. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-10-20 01:43:55 +01:00
Jordan Justen	64c3d73535	i965/cs: Don't use a thread channel ID for small local sizes When the local group size is 8 or less, we will execute the program at most 1 time. Therefore, the local channel ID will always be 0. By using a constant 0 in this case we can prevent using push constant data. This is not expected to be common a occurance in real applications, but it has been seen in tests. We could extend this optimization to 16 and 32 for SIMD16 and SIMD32, but it gets a bit more complicated, because this optimization is currently being done early on, before we have decided the SIMD size. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-10-19 16:51:45 -07:00
Jordan Justen	1fa000a33b	i965/cs: Use udiv/umod for local IDs This allows for more optimizations relating to power-of-two divisions. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-10-19 16:51:45 -07:00
Timothy Arceri	740a8fa1e2	mesa: remove unused LocalSizeVariable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-10-20 10:30:32 +11:00
Samuel Pitoiset	2b6e04e91f	nvc0/ir: simplify predicate logic for GK104 atomic operations The predicate is always CC_NOT_P as defined in processSurfaceCoordsNVE4(), so we only want to emit OR. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-10-19 23:53:57 +02:00
Samuel Pitoiset	974ab614d3	nvc0/ir: remove useless NVC0LoweringPass::gMemBase Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-10-19 23:53:48 +02:00
Samuel Pitoiset	03dc87caab	nv50/ir: print CCTL subops in debug mode Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-10-19 23:53:39 +02:00
Ian Romanick	4d35683d91	nir: Optimize integer division and modulus with 1 The previous power-of-two rules didn't catch idiv (because i965 doesn't set lower_idiv) and imod cases. The udiv and umod cases should have been caught, but I included them for orthogonality. This fixes silly code observed from compute shaders with local_size_[xy] = 1. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98299 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-10-19 14:25:10 -07:00
Marek Olšák	3ec9975555	radeonsi: eliminate trivial constant VS outputs These constant value VS PARAM exports: - 0,0,0,0 - 0,0,0,1 - 1,1,1,0 - 1,1,1,1 can be loaded into PS inputs using the DEFAULT_VAL field, and the VS exports can be removed from the IR to save export & parameter memory. After LLVM optimizations, analyze the IR to see which exports are equal to the ones listed above (or undef) and remove them if they are. Targeted use cases: - All DX9 eON ports always clear 10 VS outputs to 0.0 even if most of them are unused by PS (such as Witcher 2 below). - VS output arrays with unused elements that the GLSL compiler can't eliminate (such as Batman below). The shader-db deltas are quite interesting: (not from upstream si-report.py, it won't be upstreamed) PERCENTAGE DELTAS Shaders PARAM exports (affected only) batman_arkham_origins 589 -67.17 % bioshock-infinite 1769 -0.47 % dirt-showdown 548 -2.68 % dota2 1747 -3.36 % f1-2015 776 -4.94 % left_4_dead_2 1762 -0.07 % metro_2033_redux 2670 -0.43 % portal 474 -0.22 % talos_principle 324 -3.63 % warsow 176 -2.20 % witcher2 1040 -73.78 % ---------------------------------------- All affected 991 -65.37 % ... 9681 -> 3353 ---------------------------------------- Total 26725 -10.82 % ... 58490 -> 52162 v2: treat Undef as both 0 and 1 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v1) Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> (v1)	2016-10-19 22:21:46 +02:00
Samuel Pitoiset	041da0ae81	nv50/ir: silent TGSI_PROPERTY_FS_DEPTH_LAYOUT Found that information message while replaying a trace from Metro 2033 Redux. Mark that property as useless for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-10-19 21:02:50 +02:00
Marek Olšák	a2ea653a49	radeonsi: remove cb0_is_integer handling st/mesa does this for us. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	54f8efeb02	st/mesa: disable alpha-test, alpha-to-coverage, alpha-to-one for integer FBs v2: rebased Reviewed-by: Brian Paul <brianp@vmware.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	c64da9d499	mesa: remove gl_shader_compiler_options::EmitNoNoise it's always true Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	2897cb3dba	glsl_to_tgsi: remove code for fixing up TGSI labels I don't know what this was supposed to do, but all TGSI labels were always 0. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	ec35ff4e2b	glsl_to_tgsi: remove subroutine support Never used. The GLSL compiler doesn't even look at EmitNoFunctions. v2: add back "return" support in "main" Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	eacda2c080	mesa_to_tgsi: remove remnants of flow control and subroutine support Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	82f4c0126d	mesa_to_tgsi: drop support for instructions that can't occur here Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	4e42898d9d	glsl_to_tgsi: allocate glsl_to_tgsi_instruction::tex_offsets on demand sizeof(glsl_to_tgsi_instruction): 384 -> 264 Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	4d3d620f26	glsl_to_tgsi: merge buffer and sampler fields in glsl_to_tgsi_instruction sizeof(glsl_to_tgsi_instruction): 416 -> 384 Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	dbf64ea28b	glsl_to_tgsi: reduce the size of glsl_to_tgsi_instruction using bitfields sizeof(glsl_to_tgsi_instruction): 464 -> 416 Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	9015cbb3a3	glsl_to_tgsi: reduce the size of st_dst_reg and st_src_reg I noticed that glsl_to_tgsi_instruction is too huge. sizeof(glsl_to_tgsi_instruction): 752 -> 464 (-38%) Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	222c599b61	glsl_to_tgsi: remove unused st_translate::tex_offsets Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	0d95eeb79c	glsl_to_tgsi: remove unused parameters from calc_deref_offsets Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Marek Olšák	6980480052	glsl_to_tgsi: use array_id for temp arrays instead of hacking high bits Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-10-19 19:26:30 +02:00
Eric Engestrom	8acb79dfac	egl: bring back the default glapi.so name Earlier commit replaced the default platform specific libglapi.so name with an #error. This may have been overzealous since the name is the correct for the BSD platforms, at least. Reinstate the hunk - bringing back OpenBSD, et al. to a successful build state. Fixes: `7a9c92d071` ("egl/dri2: non-shared glapi cleanups") [Emil Velikov: format the patch from Eric, add commit message and tag.] Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2016-10-19 15:09:26 +01:00
Iago Toral Quiroga	66d8bd3b7e	i965: fix subnr overflow in suboffset() Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-10-19 11:48:21 +02:00
Dave Airlie	86c4575a81	radv: decompress fmask before reading using texture unit Before we can read the fmask using the compute shader, we need to decompress the fmask in place. This fixes a bunch of remaining failure and hopefully multisampling in Talos.	2016-10-19 17:39:47 +10:00
Dave Airlie	67c91ef2a2	radv: fix samples_identical return value. This was returning an inversion, so not doing as it should have. We need to compare the fmask value with 0, and return the result from that.	2016-10-19 17:39:01 +10:00
Dave Airlie	93ba86c307	radv: fix wsi porting regression in swapchain destroy. The code in anv is right, there's a pending patch to fix this up different, but I'll sync the code for now.	2016-10-19 13:54:49 +10:00
Dave Airlie	63406b669e	radv: fix fmask ptr issue We were using the wrong descriptor in the fmask picking code.	2016-10-19 13:16:25 +10:00

1 2 3 4 5 ...

78728 commits