fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 15:38:19 +02:00

Author	SHA1	Message	Date
Thomas Hindoe Paaboel Andersen	5b4fa21d53	nir: remove duplicated foreach loop The foreach loop was called both in the else case and right after. The indentation seems to indicate that the extra call was from a previous version with an else section with out curly brackets. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 17:04:47 -08:00
Kenneth Graunke	203c128781	spirv: Move cursor before calling vtn_ssa_value() in phi 2nd pass. vtn_ssa_value() can produce variable loads, and the cursor might be after a return statement, causing nir_builder assert failures about not inserting instructions after a jump. This fixes: dEQP-VK.spirv_assembly.instruction.graphics.barrier.in_if dEQP-VK.spirv_assembly.instruction.graphics.barrier.in_switch Cc: "13.0 12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 16:52:02 -08:00
Samuel Iglesias Gonsálvez	88c8121ec9	spirv: enable SpvCapabilityFloat64 only to supported platforms v2 (Jason): - Use nir_spirv_supported_extensions to check if the feature is enabled. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 10:44:07 +01:00
Juan A. Suarez Romero	c2acf97fcc	nir/i965: use two slots from inputs_read for dvec3/dvec4 vertex input attributes So far, input_reads was a bitmap tracking which vertex input locations were being used. In OpenGL, an attribute bigger than a vec4 (like a dvec3 or dvec4) consumes just one location, any other small attribute. So we mark the proper bit in inputs_read, and also the same bit in double_inputs_read if the attribute is a dvec3/dvec4. But in Vulkan, this is slightly different: a dvec3/dvec4 attribute consumes two locations, not just one. And hence two bits would be marked in inputs_read for the same vertex input attribute. To avoid handling two different situations in NIR, we just choose the latest one: in OpenGL, when creating NIR from GLSL/IR, any dvec3/dvec4 vertex input attribute is marked with two bits in the inputs_read bitmap (and also in the double_inputs_read), and following attributes are adjusted accordingly. As example, if in our GLSL/IR shader we have three attributes: layout(location = 0) vec3 attr0; layout(location = 1) dvec4 attr1; layout(location = 2) dvec3 attr2; then in our NIR shader we put attr0 in location 0, attr1 in locations 1 and 2, and attr2 in location 3 and 4. Checking carefully, basically we are using slots rather than locations in NIR. When emitting the vertices, we do a inverse map to know the corresponding location for each slot. v2 (Jason): - use two slots from inputs_read for dvec3/dvec4 NIR from GLSL/IR. v3 (Jason): - Fix commit log error. - Use ladder ifs and fix braces. - elements_double is divisible by 2, don't need DIV_ROUND_UP(). - Use if ladder instead of a switch. - Add comment about hardware restriction in 64bit vertex attributes. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 10:42:22 +01:00
Samuel Iglesias Gonsálvez	cc4ff6c2a0	spirv: add support for doubles to OpSpecConstant v2 (Jason): - Fix indent in radv change - Add vtn_u64_literal() helper to take 64 bits (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	fc1708948b	spirv/nir: add (un)packDouble2x32() translation Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	c332432bae	spirv/nir: implement DF conversions SPIR-V does not have special opcodes for DF conversions. We need to identify them by checking the bit size of the operand and the result. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	27cf6a369f	nir: add nir_type_conversion_op() This function returns the nir_op corresponding to the conversion between the given nir_alu_type arguments. This function lacks support for integer-based types with bit_size != 32 and for float16 conversion ops. v2: - Improve readiness of the code and delete cases that don't happen now (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	3a571fcc43	nir: add nir_get_nir_type_for_glsl_type() v2 (Jason): - Refactor nir_get_nir_type_for_glsl_type() to avoid using unneeded helpers (Jason) v3: - Use return directly (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	59944a77ae	spirv: add support for doubles on OpComposite{Insert,Extract} Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	e6bebb9982	spirv: Enable double floating points when copying variables in _vtn_variable_copy() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	9d71cfeff8	spirv: add double support to _vtn_block_load_store() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	0cd0c32c06	spirv: add double support to _vtn_variable_load_store Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	8076c8b59f	spirv: add double support to SpvOpCompositeExtract v2 (Jason): - Add asserts. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	a966387883	spirv: fix SpvOpSpecConstantOp with SpvOpVectorShuffle working with double-based vecs We need to pick two 32-bit values per component to perform the right shuffle operation. v2 (Jason): - Add assert to check matching bit sizes (Jason) - Simplify the code to pick components (Jason) v3: - Switch on bit_size once (Jason) - Add comment to explain the constant value for unused components (Erik) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	ec686ff62c	spirv: add DF support to SpvOp*ConstantComposite v2 (Jason): - Add assert. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	2bf4d0ba7a	spirv: add DF support to vtn_const_ssa_value() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	d77ffc3d87	spirv: add support for loading DF constants v2 (Jason): - Add assert. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	9602c7c02f	spirv: add definition of double based data types Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	d1bbe2c94e	spirv: fix typo in spec_constant_decoration_cb() Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Timothy Arceri	ae632afe4f	st/mesa/glsl: set num_images directly in shader_info This change also removes the now duplicate NumImages field. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-09 15:27:35 +11:00
Timothy Arceri	1130f82a88	nir: add another comparison simplification On BDW: total instructions in shared programs: 13061877 -> 13060965 (-0.01%) instructions in affected programs: 133569 -> 132657 (-0.68%) helped: 566 HURT: 0 total cycles in shared programs: 256611784 -> 256599536 (-0.00%) cycles in affected programs: 861016 -> 848768 (-1.42%) helped: 379 HURT: 73 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 12:32:16 +11:00
Kenneth Graunke	3371de38f2	nir: Turn bcsel of +/- 1.0 and 0.0 into b2f sequences. On BDW: total instructions in shared programs: 13074882 -> 13068703 (-0.05%) instructions in affected programs: 1823116 -> 1816937 (-0.34%) helped: 4187 HURT: 537 total cycles in shared programs: 256622718 -> 256425382 (-0.08%) cycles in affected programs: 123790120 -> 123592784 (-0.16%) helped: 3823 HURT: 2037 total spills in shared programs: 15276 -> 14929 (-2.27%) spills in affected programs: 9446 -> 9099 (-3.67%) helped: 352 HURT: 1 total fills in shared programs: 20496 -> 20144 (-1.72%) fills in affected programs: 13040 -> 12688 (-2.70%) helped: 352 HURT: 1 LOST: 2 GAINED: 21 v2: Rely on 'a' being a well-formed boolean (Connor, Eric). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 12:32:16 +11:00
Kenneth Graunke	1c50d31c26	nir: Convert ineg(b2i(a)) to a if it's a boolean. On BDW: total instructions in shared programs: 13071119 -> 13070371 (-0.01%) instructions in affected programs: 83424 -> 82676 (-0.90%) helped: 505 HURT: 45 (all TCS, all hurt by a single instruction) total cycles in shared programs: 256601322 -> 256588932 (-0.00%) cycles in affected programs: 819410 -> 807020 (-1.51%) helped: 450 HURT: 57 total loops in shared programs: 2950 -> 2942 (-0.27%) loops in affected programs: 8 -> 0 helped: 7 HURT: 0 v2: Drop unnecessary 'a@bool' annotation (Connor, Eric). Add a comment explaining the rule (Ian). Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> [v1] Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-01-09 12:32:16 +11:00
Kenneth Graunke	a4fd84ef5f	mesa: Introduce a compiler enum for tessellation spacing. It feels weird using GL_* enums in a Vulkan driver. v2: Fix the TESS_SPACING -> PIPE_TESS_SPACING conversion. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-07 22:22:28 -08:00
Kenneth Graunke	9bb89175e6	compiler: Change shader_info->tes.vertex_order into a ccw boolean. The vertex order is either clockwise or counterclockwise. We can just store a "ccw" boolean rather than GLenum values. I don't want to use GLenums in a Vulkan driver, and even in GL a simple boolean works fine. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-07 20:42:32 -08:00
Kenneth Graunke	957ec00243	Revert recent GLSL slot counting fiasco. I apparently broke mark_whole_variable in ir_set_program_inouts. It was passing a type that wasn't var->type, so the wrapper didn't work out. It's all broken, revert it and start over. Fixes all kinds of things on other drivers. Revert "glsl: Make is_fixed_function_array actually check for varyings." This reverts commit `42699e1271`. Revert "glsl: Mark whole variable used for ClipDistance and TessLevel." This reverts commit `5c580e64cc`. Revert "glsl: Override the # of varying slots for ClipDistance and TessLevel." This reverts commit `8b5749f65a`. Revert "glsl: Create and use a new ir_variable::count_attribute_slots() wrapper." This reverts commit `6aa5cb34d0`.	2017-01-07 15:15:08 -08:00
Kenneth Graunke	42699e1271	glsl: Make is_fixed_function_array actually check for varyings. We can't check VARYING_SLOT_* locations until we've determined that the variable is actually a varying. Fixes assert failures in drivers which actually use this path, such as radeonsi and i915. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99314 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2017-01-07 13:05:37 -08:00
Timothy Arceri	1edc53a66b	glsl: fix opt_minmax redundancy checks against baserange Marking operations as redundant if they are equal to the base range is fine when the tree structure is something like this: max / \ max b / \ 3 max / \ 3 a But the opt falls apart with a tree like this: max / \ max max / \ / \ 3 a b 3 The problem is that both branches are treated the same: descending in the left branch will prune the constant, and then descending the right branch will prune the constant there as well, because limits[0] wasn't updated to take the change on the left branch into account, and so we still get [3,\infty) as baserange. In order to fix the bug we just disable the marking of redundant expressions when they match the baserange. NIR algebraic opt will clean up the first tree for anyway, hopefully other backends are smart enough to do this also. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-07 21:46:36 +11:00
Jason Ekstrand	62332d139c	nir: Add a local variable-based copy propagation pass Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	830dca74fe	nir/builder: Add a helper for getting the most recently added instruction Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	75a6707984	nir/builder: Add a load_deref_var helper Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	13a2f20740	nir/dead_variables: Remove shader-local variables that are only written Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	58fe5c57cd	nir/dead_variables: Removed shared variables when requested Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Kenneth Graunke	311b1f0a98	nir: Make glsl_to_nir compact scalar TessLevel arrays. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-06 15:55:46 -08:00
Kenneth Graunke	a46bd79ee1	glsl: Support gl_TessLevelInner/Outer[] as TES input variables. Upcoming reworks in i965 are going to make it easy to handle this like any other input. Having it as a system value will just require additional code for no benefit. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 15:55:41 -08:00
Kenneth Graunke	5c580e64cc	glsl: Mark whole variable used for ClipDistance and TessLevel*. There's no point in trying to mark partial array access for gl_ClipDistance, gl_TessLevelOuter, or gl_TessLevelInner - they're special built-in variables that control fixed function hardware, and will likely be used in an all-or-nothing fashion. Since these arrays only occupy 1-2 varying slots, we have to avoid our normal processing which increments the slot value by the array index. (I wrote this code before i965 switched from ir_set_program_inouts to nir_shader_gather_info. It's not used by anyone today, and I'm not sure how valuable it is...the alternative to GLSL IR lowering is NIR compact arrays, at which point you should use nir_gather_info.) Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 15:55:39 -08:00
Kenneth Graunke	8b5749f65a	glsl: Override the # of varying slots for ClipDistance and TessLevel*. Right now, this shouldn't have any effect, as all drivers use LowerClipDist and LowerTessFactors to turn the float[] arrays into vectors. However, it should help make it possible for drivers to avoid that lowering. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 15:55:37 -08:00
Kenneth Graunke	6aa5cb34d0	glsl: Create and use a new ir_variable::count_attribute_slots() wrapper. This wraps glsl_type::count_attribute_slots(), but will soon contain a couple of overrides for a couple of GLSL built-ins variables. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 15:55:25 -08:00
Timothy Arceri	076ab157ff	st/mesa/glsl: move SamplerTargets to gl_program This will help allow us to simplify the handling of samplers by storing them in a single location rather than duplicating them in both gl_linked_shader and gl_program. Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-06 11:21:42 +11:00
Timothy Arceri	937523971f	st/mesa/glsl: set SamplersUsed directly in gl_program Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-06 11:21:42 +11:00
Timothy Arceri	53a509723f	mesa/glsl: set sampler units directly in gl_program Now that we create gl_program earlier there is no need to mess about copying things to gl_linked_shader then to gl_program. Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-06 11:21:42 +11:00
Timothy Arceri	4807a83da0	mesa/glsl: set num_textures per stage directly in shader_info Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-06 11:21:42 +11:00
Timothy Arceri	5ceedefd6c	mesa/glsl: remove hack to reset sampler units to zero Now that we have the is_arb_asm flag we can just skip the initialisation. V2: remove hack from standalone compiler where it was never needed since it only compiles glsl shaders. Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-06 11:21:42 +11:00
Timothy Arceri	f584f38214	st/mesa/glsl: add new is_arb_asm flag in gl_program Set the flag via the _mesa_init_gl_program() and NewProgram() helpers. In i965 we currently check for the existance of gl_shader_program to decide if this is an ARB assembly style program or not. Adding a flag makes the code clearer and will help removes a dependency on gl_shader_program in the i965 codegen functions. Also this will allow use to skip initialising sampler units for linked shaders, we currently memset it to zero again during linking. Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-06 11:21:42 +11:00
Timothy Arceri	f5bc127b2f	st/mesa/glsl/i965: move ShaderStorageBlocks to gl_program Having it here rather than in gl_linked_shader allows us to simplify the code. Also it is error prone to depend on the gl_linked_shader for programs in current use because a failed linking attempt will free infomation about the current program. In i965 we could be trying to recompile a shader variant but may have lost some required fields. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-01-06 11:21:41 +11:00
Timothy Arceri	f62eb6c7eb	st/mesa/glsl/i965: set num_ssbos directly in shader_info Here we also remove the duplicate field in gl_linked_shader and always get the value from shader_info instead. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-01-06 11:21:41 +11:00
Timothy Arceri	0e7eec1ab5	st/mesa/glsl/i965: move per stage UniformBlocks to gl_program This will help allow us to store pointers to gl_program structs in the CurrentProgram array resulting in a bunch of code simplifications. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-01-06 11:21:41 +11:00
Timothy Arceri	b792c38979	st/mesa/glsl/i965: set num_ubos directly in shader_info This also removes the duplicate field in gl_linked_shader, and gets num_ubos from shader_info instead. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-01-06 11:21:41 +11:00
Timothy Arceri	a1da57c19c	st/mesa/glsl/i965: move ImageUnits and ImageAccess fields to gl_program Having it here rather than in gl_linked_shader allows us to simplify the code. Also it is error prone to depend on the gl_linked_shader for programs in current use because a failed linking attempt will free infomation about the current program. In i965 we could be trying to recompile a shader variant but may have lost some required fields. We drop the memset on ImageUnits because gl_program is already created using rzalloc(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-01-06 11:21:40 +11:00

1 2 3 4 5 ...

1412 commits