fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 06:08:21 +02:00

Author	SHA1	Message	Date
Tapani Pälli	c2ff3485b3	glsl: clone inputs and outputs during linking This increases memory pressure during linking but makes it easier for backend to free IR after it is not needed anymore. v2: use resource list as ralloc context in case of relink (Kenneth) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable@lists.freedesktop.org	2015-06-24 12:01:21 -07:00
Kenneth Graunke	147cdb53ec	nir: Use a switch statement for detecting move-like operations. Suggested by Jason Ekstrand. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2015-06-24 10:35:04 -07:00
Tapani Pälli	32a220f1f6	glsl: remove cross validation of interpolation qualifier with GLSL 4.40 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au>	2015-06-24 10:06:32 +03:00
Ilia Mirkin	fccf012adc	glsl: binding point is a texture unit, which is a combined space This fixes compilation failures in Dota 2 Reborn where a texture unit binding point was used that was numerically higher than the max per stage. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Tested-by: Nick Sarnie <commendsarnex@gmail.com> Cc: "10.5 10.6" <mesa-stable@lists.freedesktop.org>	2015-06-23 12:08:34 -04:00
Jose Fonseca	634cfb9a45	glsl: Specify the shader stage in linker errors due to too many in/outputs. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-06-23 12:06:39 +01:00
Kenneth Graunke	1762568fd3	nir: Allow vec2/vec3/vec4 instructions in the select peephole pass. These are basically just moves, so they should be safe as well. When disabling i965's GLSL IR level scalarizer (channel expressions) pass, I started seeing NIR code like this: if ssa_21 { block block_1: /* preds: block_0 */ vec4 ssa_120 = vec4 ssa_82, ssa_83, ssa_84, ssa_30 /* succs: block_3 */ } else { block block_2: /* preds: block_0 */ /* succs: block_3 */ } block block_3: /* preds: block_1 block_2 */ vec4 ssa_33 = phi block_1: ssa_120, block_2: ssa_2 Previously, the GLSL IR scalarizer pass would break the vec4 into a series of fmovs, which were allowed by the peephole pass. But with the vec4 operation, they were not. We want to keep getting selects. Normal i965 on Broadwell: instructions in affected programs: 200 -> 176 (-12.00%) helped: 4 With brw_fs_channel_expressions() disabled: instructions in affected programs: 1832 -> 1646 (-10.15%) helped: 30 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-06-22 14:08:36 -07:00
Brian Paul	a1f84453a2	glsl: fix formatting glitch in _mesa_print_ir() Print the closing ) before the newline. Trivial.	2015-06-19 16:46:29 -06:00
Jose Fonseca	f734d25560	glsl: Fail linkage when UBO exceeds GL_MAX_UNIFORM_BLOCK_SIZE. It's not totally clear whether other Mesa drivers can safely cope with over-sized UBOs, but at least for llvmpipe receiving a UBO larger than its limit causes problems, as it won't fit into its internal display lists. This fixes piglit "arb_uniform_buffer_object-maxuniformblocksize fsexceed" without regressions for llvmpipe. NVIDIA driver also fails to link the shader from "arb_uniform_buffer_object-maxuniformblocksize fsexceed". Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=65525 PS: I don't recommend cherry-picking this for Mesa stable, as some app might inadvertently been relying on UBOs larger than GL_MAX_UNIFORM_BLOCK_SIZE to work on other drivers, so even if this commit is universally accepted it's probably best to let it mature in master for a while. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-06-19 07:21:05 +01:00
Ilia Mirkin	5974841fd0	glsl: guard gl_NumSamples enablement on ARB_sample_shading gl_NumSamples should only be enabled when ARB_sample_shading is enabled. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2015-06-18 20:40:22 -04:00
Ilia Mirkin	b6e238023c	glsl: add version checks to conditionals for builtin variable enablement A number of builtin variables have checks based on the extension being enabled, but were missing enablement via a higher GLSL version. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Timothy Arceri <t_arceri@yahoo.com.au> Cc: "10.5 10.6" <mesa-stable@lists.freedesktop.org>	2015-06-18 13:21:44 -04:00
Ilia Mirkin	c40e7ee7c4	glsl: handle conversions to double when comparing param matches This allows mod(int, int) to become selected as float mod when doubles are supported. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Cc: "10.6" <mesa-stable@lists.freedesktop.org>	2015-06-18 13:21:44 -04:00
Chris Forbes	fa49536ab1	glsl: add ir reader support for ir_barrier Picked from the tessellation branch. Reviewed-by: Brian Paul <brianp@vmware.com>	2015-06-16 15:47:02 +02:00
Marek Olšák	2f86c22e75	glsl: print locations of variables Reviewed-by: Brian Paul <brianp@vmware.com>	2015-06-16 15:47:02 +02:00
Timothy Arceri	da6996485f	Revert "glsl: remove restriction on unsized arrays in GLSL ES 3.10" This reverts commit `adee54f826`. Further down in the GLSL ES 3.10 spec it say: "If an array is declared as the last member of a shader storage block and the size is not specified at compile-time, it is sized at run-time. In all other cases, arrays are sized only at compile-time." Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2015-06-16 20:58:59 +10:00
Marek Olšák	2489054f66	glsl: fix "tesselation" typo Trivial.	2015-06-14 20:17:29 +02:00
Timothy Arceri	faf7670ee8	glsl: fix compile error message Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2015-06-13 12:21:26 +10:00
Timothy Arceri	0d2068a92d	glsl: enforce restriction on AoA interface blocks in GLSL ES 3.10 Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2015-06-13 08:31:21 +10:00
Timothy Arceri	94d669b0d2	glsl: enforce fragment shader input restrictions in GLSL ES 3.10 Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2015-06-13 08:31:15 +10:00
Timothy Arceri	3d78bdea31	glsl: enforce output variable rules for GLSL ES 3.10 Some rules are already applied this just adds the missing ones. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2015-06-13 08:31:09 +10:00
Jordan Justen	2867f2e8cd	nir: Add barrier intrinsic function Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Chris Forbes <chrisf@ijw.co.nz> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Ben Widawsky <ben@bwidawsk.net>	2015-06-12 15:12:40 -07:00
Chris Forbes	86855365b4	glsl: Add builtin barrier() function [jordan.l.justen@intel.com: Add CS support] Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-06-12 15:12:39 -07:00
Chris Forbes	e7f628c2fc	glsl: Add ir node for barrier v2: * Changes suggested by mattst88 [jordan.l.justen@intel.com: Add nir support] Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ben Widawsky <ben@bwidawsk.net>	2015-06-12 15:12:39 -07:00
Emil Velikov	1df5a6c71e	mesa; add a dummy _mesa_error_no_memory() symbol to libglsl_util Rather than forcing everyone to provide their own definition of the symbol provide a common (dummy) one. This helps us resolve the build of the standalone pipe-drivers (amongst others), which are missing the symbol. Cc: Rob Clark <robclark@freedesktop.org> Cc: "10.6" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2015-06-12 15:32:18 +01:00
Martin Peres	5b61cb1236	glsl: fix constructing a vector from a matrix Without this patch, the following constructs (not an extensive list) would crash mesa: - mat2 foo = mat2(1); vec4 bar = vec4(foo); - mat3 foo = mat3(1); vec4 bar = vec4(foo); - mat3 foo = mat3(1); ivec4 bar = ivec4(foo); The first case is explicitely allowed by the GLSL spec, as seen on page 101 of the GLSL 4.40 spec: "vec4(mat2) // the vec4 is column 0 followed by column 1" The other cases are implicitely allowed also. The actual changes are quite minimal. We first split each column of the matrix to a list of vectors and then use them to initialize the vector. An additional check to make sure that we are not trying to copy 0 elements of a vector fix the (i)vec4(mat3) case as the last vector (3rd column) is not needed at all. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Martin Peres <martin.peres@linux.intel.com>	2015-06-11 14:04:29 +03:00
Timothy Arceri	adee54f826	glsl: remove restriction on unsized arrays in GLSL ES 3.10 Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2015-06-10 18:54:43 +10:00
Chih-Wei Huang	c5e11e5f7f	android: build with libcxx on android lollipop On Lollipop, apparently stlport is gone and libcxx must be used instead. We still support stlport when building on earlier android releases. Signed-off-by: Chih-Wei Huang <cwhuang@linux.org.tw> Reviewed-by: Eric Anholt <eric@anholt.net>	2015-06-09 12:25:50 -07:00
Brian Paul	c10dc485f3	glsl: fix comment typo: s/accpet/accept/	2015-06-09 10:49:35 -06:00
Alan Coopersmith	cb277cde6f	glsl_compiler: Remove unused extra argument to printf in usage_fail Flagged by Oracle's parfait static analyzer: Error: Format string argument mismatch (CWE 628) In call to printf with format string "usage: %s [options] <file.vert | file.geom | file.frag>\n\nPossible options are:\n" Too many arguments for format string (got more than 1 arguments) at line 285 of src/glsl/main.cpp in function 'usage_fail'. Signed-off-by: Alan Coopersmith <alan.coopersmith@oracle.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-06-04 19:01:16 -07:00
Martin Peres	87a4bc5118	mesa: reference built-in uniforms into gl_uniform_storage This change introduces a new field in gl_uniform_storage to explicitely say that a uniform is built-in. In the case where it is, no storage is defined to make it clear that it is read-only from the mesa side. I fixed all the places in the code that made use of the structure that I changed. Any place making a wrong assumption and using the storage straight away will just crash. This patch seems to implement the path of least resistance towards listing built-in uniforms in GL_ACTIVE_UNIFORM (and other APIs). Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Martin Peres <martin.peres@linux.intel.com>	2015-06-04 09:25:00 +03:00
Timothy Arceri	86a74e9b6b	nir: use src for ssa helper Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-06-03 06:50:39 +10:00
Timothy Arceri	5f7b8fa481	nir: remove extra semicolon Reviewed-by: Thomas Helland <thomashelland90@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-06-03 06:50:33 +10:00
Eduardo Lima Mitev	5b226a1242	nir: prevent use-after-free condition in should_lower_phi() lower_phis_to_scalar() pass recurses the instruction dependence graph to determine if all the sources of a given instruction are scalarizable. To prevent cycles, it temporary marks the phi instruction before recursing in, then updates the entry with the resulting value. However, it does not consider that the entry value may have changed after a recursion pass, hence causing a use-after-free situation and a crash. This patch fixes this by reloading the entry corresponding to the 'phi' after recursing and before updating its value. The crash can be reproduced ~20% of times with the dEQP test: dEQP-GLES3.functional.shaders.loops.while_constant_iterations.nested_sequence_fragment Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-06-02 20:21:49 +02:00
Iago Toral Quiroga	2231cf0ba3	nir: Fix output swizzle in get_mul_for_src When we compute the output swizzle we want to consider the number of components in the add operation. So far we were using the writemask of the multiplication for this instead, which is not correct. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-05-28 18:25:37 +02:00
Ilia Mirkin	5646f0f18a	glsl: avoid leaking linked gl_shader when there's a late linker error This makes piglit mixing-clip-distance-and-clip-vertex-disallowed have 0 definitely lost blocks with valgrind. (Same non-0 number of possibly lost blocks though.) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tobias Klausmann <tobias.johannes.klausmann@mni.thm.de> Cc: "10.5 10.6" <mesa-stable@lists.freedesktop.org>	2015-05-25 16:52:11 -04:00
Matt Turner	5614bcc416	nir: Remove sRGB colorspace conversion round-trip. Some shaders in Civilization V and Beyond Earth do pow(pow(x, 2.2), 0.454545) which is converting to and from sRGB colorspace. A more general rule that replaces pow(pow(a, b), c) with pow(a, b * c) actually regresses two shaders in Sun Temple in which the result of the inner pow is used twice, once by another pow and once by another instruction. Also, since 2.2 * 0.454545 isn't exactly one, the more general pattern would have still left us with a pow, and I'm 2.2 * 0.454545 percent sure that's not what they want. instructions in affected programs: 934 -> 886 (-5.14%) helped: 16	2015-05-22 11:26:36 -07:00
Timothy Arceri	d67515b7be	glsl: remove element_type() helper We now have is_array() and without_array() that make the code much clearer and remove the need for this. For all remaining calls to this we already knew that the type was an array so returning a null wasn't adding any value. v2: use without_array() in _mesa_ast_array_index_to_hir() and don't use without_array() in lower_clip_distance_visitor() as we want to make sure the array is 2D. Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-05-22 08:35:45 +10:00
Matt Turner	51ccdb6346	glsl: Use AM_V_GEN/AM_V_at in NIR rules.	2015-05-21 09:43:43 -07:00
Jason Ekstrand	2126c68e5c	nir: Get rid of the array elements parameter on load/store intrinsics Previously, we used intrinsic->const_index[1] to represent "the number of array elements to load" for load/store intrinsics. However, this set to 1 by every pass that ever creates a load/store intrinsic. Also, while it might make some sense for registers, it makes no sense whatsoever in SSA. On top of that, the i965 backend was the only backend to ever support it; freedreno and vc4 just assert that it's always 1. Let's just delete it. Signed-off-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Rob Clark <robclark@freedesktop.org>	2015-05-20 09:28:06 -07:00
Tapani Pälli	9f4eaba36f	glsl: add stage references for UBO uniforms Patch marks uniforms inside UBO properly referenced by stages. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Samuel Iglesias Gonsalvez <siglesias@igalia.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=90397	2015-05-18 15:23:09 +03:00
Francisco Jerez	d91d6b3f03	nir: Translate memory barrier intrinsics from GLSL IR. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-05-12 15:47:57 +03:00
Francisco Jerez	f8f8b31847	nir: Translate image load, store and atomic intrinsics from GLSL IR. v2: Undefine coordinate components not applicable to the target. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-05-12 15:47:57 +03:00
Francisco Jerez	6de78e6b0c	nir: Fix indexing of atomic counter arrays with a constant value. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-05-12 15:47:57 +03:00
Francisco Jerez	f1269a3e01	nir: Add memory barrier intrinsic. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-05-12 15:47:57 +03:00
Francisco Jerez	d9e930997f	nir: Define image load, store and atomic intrinsics. v2: Undefine coordinate components not applicable to the target. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-05-12 15:47:57 +03:00
Tapani Pälli	95774ca258	nir: fix sampler lowering pass for arrays This fixes bugs with special cases where we have arrays of structures containing samplers or arrays of samplers. I've verified that patch results in calculating same index value as returned by _mesa_get_sampler_uniform_value for IR. Patch makes following ES3 conformance test pass: ES3-CTS.shaders.struct.uniform.sampler_array_fragment v2: remove unnecessary comment (Topi) simplify changes and the overall code (Jason) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=90114	2015-05-12 14:28:16 +03:00
Kenneth Graunke	d6fb155f30	nir: Fix aggressive typos in nir_from_ssa.c. s/agressive/aggressive/g Trivial.	2015-05-08 19:38:14 -07:00
Jason Ekstrand	fb5f411248	nir/search: Save/restore the variables_seen bitmask when matching Shader-db results on Broadwell: total instructions in shared programs: 7152330 -> 7137006 (-0.21%) instructions in affected programs: 1330548 -> 1315224 (-1.15%) helped: 5797 HURT: 76 GAINED: 0 LOST: 8 Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-05-08 17:29:15 -07:00
Jason Ekstrand	e0cfe59c37	nir/search: Assert that variable id's are in range Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-05-08 17:29:15 -07:00
Jason Ekstrand	13facfbd5b	nir/search: handle explicitly sized sources in match_value Previously, this case was being handled in match_expression prior to calling match_value. However, there is really no good reason for this given that match_value has all of the information it needs. Also, they weren't being handled properly in the commutative case and putting it in match_value gives us that for free. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-05-08 17:29:14 -07:00
Jason Ekstrand	f752effa08	nir/nir: Use a linked list instead of a hash set for use/def sets This commit switches us from the current setup of using hash sets for use/def sets to using linked lists. Doing so should save us quite a bit of memory because we aren't carrying around 3 hash sets per register and 2 per SSA value. It should also save us CPU time because adding/removing things from use/def sets is 4 pointer manipulations instead of a hash lookup. Running shader-db 50 times with USE_NIR=0, NIR, and NIR + use/def lists: GLSL IR Only: 586.4 +/- 1.653833 NIR with hash sets: 675.4 +/- 2.502108 NIR + use/def lists: 641.2 +/- 1.557043 I also ran a memory usage experiment with Ken's patch to delete GLSL IR and keep NIR. This patch cuts an aditional 42.9 MiB of ralloc'd memory over and above what we gained by deleting the GLSL IR on the same dota trace. On the code complexity side of things, some things are now much easier and others are a bit harder. One of the operations we perform constantly in optimization passes is to replace one source with another. Due to the fact that an instruction can use the same SSA value multiple times, we had to iterate through the sources of the instruction and determine if the use we were replacing was the only one before removing it from the set of uses. With this patch, uses are per-source not per-instruction so we can just remove it safely. On the other hand, trying to iterate over all of the instructions that use a given value is more difficult. Fortunately, the two places we do that are the ffma peephole where it doesn't matter and GCM where we already gracefully handle duplicates visits to an instruction. Another aspect here is that using linked lists in this way can be tricky to get right. With sets, things were quite forgiving and the worst that happened if you didn't properly remove a use was that it would get caught in the validator. With linked lists, it can lead to linked list corruption which can be harder to track. However, we do just as much validation of the linked lists as we did of the sets so the validator should still catch these problems. While working on this series, the vast majority of the bugs I had to fix were caught by assertions. I don't think the lists are going to be that much worse than the sets. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2015-05-08 17:16:13 -07:00

3601 commits