fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 04:58:05 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	c17b2f5724	spirv: Shut up unhandled enumeration value warnings. We don't want to do anything for the other cases. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2017-01-11 15:16:27 -08:00
Timothy Arceri	de8b03f5fb	nir: don't turn ieq/ine into inot if used by an if Otherwise we will end up with an extra instruction to compare the result of the inot. On BDW: total instructions in shared programs: 13060620 -> 13060481 (-0.00%) instructions in affected programs: 103379 -> 103240 (-0.13%) helped: 127 HURT: 0 total cycles in shared programs: 256590950 -> 256587408 (-0.00%) cycles in affected programs: 11324730 -> 11321188 (-0.03%) helped: 114 HURT: 21 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-12 09:47:29 +11:00
Timothy Arceri	7acc865226	nir: add late opt to turn inot/b2f combos back to bcsel We turn these from bcsel into inot/b2f combos in order for other optimisation passes to get further. Once we have finished, turn the ones that remain and are used in more than a single expression back into a bcsel. On BDW: total instructions in shared programs: 13060965 -> 13060297 (-0.01%) instructions in affected programs: 835701 -> 835033 (-0.08%) helped: 670 HURT: 2 total cycles in shared programs: 256599536 -> 256598006 (-0.00%) cycles in affected programs: 114655488 -> 114653958 (-0.00%) helped: 419 HURT: 240 LOST: 0 GAINED: 1 The 2 HURT are because inserting bcsel creates the only use of const 1.0 in two shaders from tri-of-friendship-and-madness. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-12 09:47:29 +11:00
Timothy Arceri	8f37fc7066	nir: add imprecise flrp optimisation On BDW: total instructions in shared programs: 13061890 -> 13061877 (-0.00%) instructions in affected programs: 2441 -> 2428 (-0.53%) helped: 13 HURT: 0 total cycles in shared programs: 256612254 -> 256611784 (-0.00%) cycles in affected programs: 16418 -> 15948 (-2.86%) helped: 10 HURT: 2 V2: don't use ffma directly Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-12 09:47:29 +11:00
Kenneth Graunke	b4c44ff08c	i965: Use the nir_move_comparisons pass. While the below stats are encouraging, this pass will also become very useful for avoiding regressions once brw_do_channel_expressions() and brw_do_vector_splitting() are disabled. On Broadwell: total instructions in shared programs: 13078787 -> 13060898 (-0.14%) instructions in affected programs: 1809827 -> 1791938 (-0.99%) helped: 4527 HURT: 157 total cycles in shared programs: 256562762 -> 256590424 (0.01%) cycles in affected programs: 159749392 -> 159777054 (0.02%) helped: 5583 HURT: 2289 total spills in shared programs: 14929 -> 14923 (-0.04%) spills in affected programs: 62 -> 56 (-9.68%) helped: 1 HURT: 0 total fills in shared programs: 20144 -> 20141 (-0.01%) fills in affected programs: 253 -> 250 (-1.19%) helped: 1 HURT: 3 LOST: 0 GAINED: 2 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-12 09:47:29 +11:00
Kenneth Graunke	b5e682a1ef	i965: Move nir_lower_locals_to_regs a bit later. I'm going to add a boolean scheduling pass that I want run late, but after copy propagation and dead code elimination. Yet, I don't want to have to think about registers. So, move the register conversion a little later. No impact on shader-db. Suggested by Jason Ekstrand. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-12 09:47:29 +11:00
Kenneth Graunke	fd957b1751	nir: Introduce a nir_opt_move_comparisons() pass. This tries to move comparisons (a common source of boolean values) closer to their first use. For GPUs which use condition codes, this can eliminate a lot of temporary booleans and comparisons which reload the condition code register based on a boolean. V2: (Timothy Arceri) - fix move comparison for phis so we don't end up with: vec1 32 ssa_227 = phi block_34: ssa_1, block_38: ssa_240 vec1 32 ssa_235 = feq ssa_227, ssa_1 vec1 32 ssa_230 = phi block_34: ssa_221, block_38: ssa_235 - add nir_op_i2b/nir_op_f2b to the list of comparisons. V3: (Timothy Arceri) - tidy up suggested by Jason. - add inot/fnot to move comparison list V4: (Jason Ekstrand) - clean up move_comparison_source - get rid of the tuple - rework phi handling Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> [v1] Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-12 09:47:29 +11:00
Timothy Arceri	e8328e55e7	nir/algebraic: add support for conditional helper functions to expressions Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-12 09:47:29 +11:00
Jason Ekstrand	a7e399de59	anv/TODO: Check off a bunch of stuff	2017-01-11 10:28:18 -08:00
Jason Ekstrand	c472568b4e	nir/search: Only allow matching SSA values This is more correct and should also be a tiny bit faster since we're just comparing pointers instead of calling nir_src_equal. Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Cc: "13.0" <mesa-stable@lists.freedesktop.org>	2017-01-11 10:28:18 -08:00
Derek Foreman	534ea2b5ba	egl/dri2: add image_loader_extension back into loader extensions for wayland before commit `f871946594` image_loader_extension was always present in dri2_dpy->extensions, after that commit it is only present for render nodes. Its removal broke partial render based on buffer age on (at least) raspberry pi. Fixes: `f871946594` "egl/dri2: rework dri2_egl_display::extensions storage" Signed-off-by: Derek Foreman <derekf@osg.samsung.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-01-11 15:58:14 +00:00
Li Qiang	6205c53303	gallium/tgsi: fix overflow in parse property In parse_identifier, copying from 'pcur' doesn't stop until it encounters the NUL terminator. As 'ret' is a fixed-size buffer, a long string in 'pcur' will cause a buffer overflow. This patch avoids this. Signed-off-by: Li Qiang <liq3ea@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Marc-André Lureau <marcandre.lureau@redhat.com>	2017-01-11 12:40:38 +01:00
Mauro Rossi	2c0d849e2d	st/dri: remove trailing whitespace Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-01-11 10:16:19 +02:00
Mauro Rossi	eca79e84b9	android: st/mesa: fix building error in libmesa_st_mesa Fixes building error due to dependency on nir generated headers Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-01-11 10:16:19 +02:00
Dave Airlie	e9d3cbca31	radv: fix multi-viewport emission This set context req seq was in the wrong place. Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2017-01-11 09:08:51 +01:00
Tapani Pälli	f97f938650	nir: change asserts to unreachable in nir_type_conversion_op This avoids the following compilation error on Android: error: control may reach end of non-void function [-Werror,-Wreturn-type] Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2017-01-11 10:08:13 +02:00
Iago Toral Quiroga	a9f497c678	spirv: gl_PrimitiveID in the fragment shader is handled as an input Geometry and Tessellation stages do handle this as a system value instead. Fixes: dEQP-VK.geometry.basic.primitive_id Reviewed-by: Dave Airlie <ailried@redhat.com>	2017-01-11 08:59:28 +01:00
Rob Clark	99e9dca149	freedreno: add "nogrow" debug param Sometimes it is useful to disable the "growable" cmdstream buffers for debugging. (See 419a154d in libdrm) Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	a43f3b895c	freedreno/a5xx: remove hack for glamor Now that the issue glamor was hitting w/ glsl>=130 (aka missing INSTANCED bit in vertex attribute state) is fixed, remove the hack. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	3c71853c9a	freedreno/a5xx: fixed instanced Add missing bit, now that we know where it is. Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	b48fde1576	freedreno/a5xx: use the non-_ZERO_BASE for vertexid Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	730c3047f0	freedreno/a5xx: add texture MIPLVLS Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	1a5d0818df	freedreno/a5xx: fix fragcoord related hangs Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Rob Clark	ff81c3c9fd	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-01-10 19:40:00 -05:00
Kenneth Graunke	23a36c2811	anv: Enable tessellation shaders. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:27:31 -08:00
Kenneth Graunke	ebd88b5aa3	anv: Initialize physical device limits for tessellation Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:27:31 -08:00
Kenneth Graunke	dcca706b4e	anv: Clamp depth buffer dimensions to be at least 1. When there are no framebuffer attachments, fb->width and fb->height will be 0. Subtracting 1 results in 4294967295 which is too large for the field, causing genxml assertions when trying to create the packet. In this case, we can just program it to 1. Caught by dEQP-VK.tessellation.tesscoord.triangles_equal_spacing. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:27:31 -08:00
Kenneth Graunke	e50d4807a3	anv: Compile TCS/TES shaders. v2: Merge more TCS/TES info. v3: Fix caching keys. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:27:31 -08:00
Kenneth Graunke	de05ecba9f	anv: Emit 3DSTATE_HS/TE/DS packets. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:27:31 -08:00
Kenneth Graunke	08b5713068	anv: Handle patch primitives. v2: Use anv_pipeline_has_stage rather than tess_info != NULL. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> [v1] Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:27:10 -08:00
Kenneth Graunke	5297267a1c	nir: Add a pass to lower TES patch_vertices intrinsics to a constant. In Vulkan, we always have both the TCS and TES available in the same pipeline, so we can simply use the TCS OutputVertices execution mode value as the TES PatchVertices built-in. For GLSL, we handle this in the linker. But we could use this pass in the case when both TCS and TES are linked together, if we wanted. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:21:53 -08:00
Kenneth Graunke	944e8b08cd	spirv: Silence unsupported tessellation capability warnings. ...when the capability bit is set. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> [v1] Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:21:38 -08:00
Kenneth Graunke	1e5b09f42f	spirv: Tidy some repeated if checks by using a switch statement. Iago suggested tidying this. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:21:31 -08:00
Kenneth Graunke	bb04b84114	spirv: Add tessellation varying and built-in support. We need to: - handle the extra array level for per-vertex varyings - handle the patch qualifier correctly - assign varying locations Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:21:28 -08:00
Kenneth Graunke	23710e17f8	spirv: Handle tessellation execution modes. v2: Use info->tess. v3: Handle more things in either TCS/TES. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Dave Airlie <airlied@redhat.com> [v1] Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> [v1] Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:21:24 -08:00
Kenneth Graunke	5edc338162	compiler: Merge shader_info's tcs and tes structs. Annoyingly, SPIR-V lets you specify all of these fields in either the TCS or TES, which means that we need to be able to store all of them for either shader stage. Putting them in a union won't work. Combining both is an easy solution, and given that the TCS struct only had a single field, it's pretty inexpensive. This patch renames the combined struct to "tess" to indicate that it's for tessellation in general, not one of the two stages. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:21:21 -08:00
Kenneth Graunke	195bf8f027	genxml: Rename 3DSTATE_HS::Enable to "Function Enable". "Function Enable" is what the other stages use. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:20:33 -08:00
Lionel Landwerlin	860d91ec5b	anv: set input_slots_valid on brw_wm_prog_key With shaders using a lot of inputs/outputs, like this (from Gtk+) : layout(location = 0) in vec2 inPos; layout(location = 1) in float inGradientPos; layout(location = 2) in flat int inRepeating; layout(location = 3) in flat int inStopCount; layout(location = 4) in flat vec4 inClipBounds; layout(location = 5) in flat vec4 inClipWidths; layout(location = 6) in flat ColorStop inStops[8]; layout(location = 0) out vec4 outColor; we're missing the programming of the input_slots_valid field leading to an assert further down the backend code. v2: Use valid slots of the geometry or vertex stage (Jason) v3: Use helper to find correct vue map (Jason) v4: Set the valid slots off the previous stages (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 18:16:45 +00:00
Lionel Landwerlin	4b44ca7225	anv: add helper to get vue map for fragment shader Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 18:14:36 +00:00
Lionel Landwerlin	59fe3796a8	anv: add get_.*_prog_data for tessellation stages Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 18:14:33 +00:00
Lionel Landwerlin	6122b4ee96	anv: make get_.*_prog_data take a const pipeline Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 18:14:09 +00:00
Vinson Lee	01d80bed1f	nir: Fix anonymous union initialization with older GCC. Fix this build error with GCC 4.4.7. CC nir/nir_opt_copy_prop_vars.lo nir/nir_opt_copy_prop_vars.c: In function ‘copy_prop_vars_block’: nir/nir_opt_copy_prop_vars.c:765: error: unknown field ‘deref’ specified in initializer nir/nir_opt_copy_prop_vars.c:765: warning: missing braces around initializer nir/nir_opt_copy_prop_vars.c:765: warning: (near initialization for ‘(anonymous).<anonymous>’) nir/nir_opt_copy_prop_vars.c:765: warning: initialization from incompatible pointer type Fixes: `62332d139c` ("nir: Add a local variable-based copy propagation pass") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 23:25:32 -08:00
Samuel Iglesias Gonsálvez	17eac30e90	docs: add Vulkan Float64 capability support for anv driver Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 06:42:44 +01:00
Dave Airlie	ada66480b2	radv/ac: add support for multi sample image coords This just adds the nir->llvm support, enabling the extension causes some failures on llvm 3.9 at least, but this code seems fine. NIR passes the sampler in src[1].x, and LLVM/SI requires it as the last parameter in the coords (coord[2] for 2D, coord[3] for 2DArray). Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-01-10 12:59:31 +10:00
Boyan Ding	41b1d9a558	glsl: Do not allow scalar types in vector relational functions According to OpenGL Shading Language 4.50 spec, Section 8.7 "Vector Relational Functions", functions of this type do not operate on scalar types, so remove scalar types from signature definitions to make the behavior consistent with glslangValidator and other drivers. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Boyan Ding <boyan.j.ding@gmail.com>	2017-01-09 17:58:33 -08:00
Thomas Hindoe Paaboel Andersen	5b4fa21d53	nir: remove duplicated foreach loop The foreach loop was called both in the else case and right after. The indentation seems to indicate that the extra call was from a previous version with an else section without curly brackets. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 17:04:47 -08:00
Kenneth Graunke	2bae2fa094	i965: Fix number of slots in SSO mode when there are no user varyings. We want vue_map->num_slots to be one more than the final slot. When assigning fixed slots, built-in slots, and non-SSO user varyings, we do slot++. This leaves "slot" as one past the most recently assigned slot. But for SSO user varyings, we computed slot based on the varying location value...and left it at that slot value. To work around this inconsistency, I made num_slots be "slot + 1" if separate and "slot" otherwise. The problem is...if there are no user varyings in SSO mode...then we would have done slot++ when assigning built-ins, so it would be off by one. This resulted in loops from 0 to vue_map->num_slots hitting a bonus BRW_VARYING_SLOT_PAD at the end. This used to break the SIMD8 VS/TES backends, but I fixed that in commit `480d6c1653`. It's probably safe at this point, but we should fix it anyway. To fix this, do slot++ in all cases. For SSO mode, we overwrite slot for every varying, so this increment only matters on the last varying. Because we process varyings in order, this will set slot to 1 more than the highest assigned slot. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2017-01-09 16:52:16 -08:00
Kenneth Graunke	203c128781	spirv: Move cursor before calling vtn_ssa_value() in phi 2nd pass. vtn_ssa_value() can produce variable loads, and the cursor might be after a return statement, causing nir_builder assert failures about not inserting instructions after a jump. This fixes: dEQP-VK.spirv_assembly.instruction.graphics.barrier.in_if dEQP-VK.spirv_assembly.instruction.graphics.barrier.in_switch Cc: "13.0 12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 16:52:02 -08:00
Marek Olšák	230b756f86	mesa: set GLSL 1.20 for the fixed-function fragment shader This fixes broken depth texturing after: commit `22639a6e19` Author: Timothy Arceri <timothy.arceri@collabora.com> Date: Mon Nov 21 00:29:29 2016 +1100 st/mesa: get Version from gl_program rather than gl_shader_program Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2017-01-10 01:03:32 +01:00
Bas Nieuwenhuizen	8bc39e251b	radv: Create single RADV_DEBUG env var. Also changed RADV_SHOW_QUEUES to a no compute queue option. That would make more sense later when the compute queue is established, but the transfer queue is still experimental. v2: Don't include the trace flag. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2017-01-09 21:44:14 +01:00
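
The wraparound described in commit dcca706b4e ("anv: Clamp depth buffer dimensions to be at least 1") can be illustrated with a small C sketch. This is not the anv code: the helper names and the 14-bit field width are assumptions chosen only for illustration. It shows that subtracting 1 from an unsigned 0 yields 4294967295, which cannot fit a narrow packet field, while clamping the dimension to at least 1 first yields 0.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helpers for illustration; none of these names come from Mesa. */

/* Unclamped "dimension minus one": wraps to UINT32_MAX when dim == 0. */
static uint32_t
dim_minus_one_unclamped(uint32_t dim)
{
   return dim - 1;
}

/* Clamp the dimension to at least 1 before subtracting, so 0 maps to 0. */
static uint32_t
dim_minus_one_clamped(uint32_t dim)
{
   uint32_t clamped = dim > 0 ? dim : 1;
   return clamped - 1;
}

/* Returns true if value fits in an unsigned field that is `bits` wide.
 * The field width is an assumption; real packet layouts differ per field. */
static bool
fits_in_field(uint32_t value, unsigned bits)
{
   assert(bits > 0 && bits < 32);
   return value < (UINT32_C(1) << bits);
}
```

With an assumed 14-bit field, `fits_in_field(dim_minus_one_unclamped(0), 14)` is false, which corresponds to the kind of assertion failure the commit message mentions, while `fits_in_field(dim_minus_one_clamped(0), 14)` is true.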


88019 commits