fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 17:58:26 +02:00

Author	SHA1	Message	Date
Eric Anholt	1410403e1e	vc4: Fix tests for format supported with nr_samples == 1. This was a bug from the MSAA enabling. Tests for surfaces with nr_samples==1 instead of 0 (generally GL renderbuffers) would incorrectly fail out. Fixes the ARB_framebuffer_sRGB piglit tests other than srgb_conformance. Cc: "11.1 11.2" <mesa-stable@lists.freedesktop.org>	2016-04-22 11:27:11 -07:00
Eric Anholt	6eabdb8959	vc4: Don't try to blit from MSAA surfaces with mismatched width to dst. I had made the previous blit fix non-MSAA only because I was thinking about how the hardware infers stride from the RENDERING_CONFIG packet. However, I'm also inferring the stride for both MSAA src and dst in vc4_render_cl.c from the width argument in the ioctl. Fixes 15 EXT_framebuffer_multisample piglit tests.	2016-04-22 11:27:11 -07:00
Kenneth Graunke	42dea145d9	i965: Disable channel expressions for scalar GS, TCS, TES. On Broadwell, I get the following shader-db statistics: Tessellation Control Shaders: total instructions in shared programs: 57327 -> 57012 (-0.55%) instructions in affected programs: 27334 -> 27019 (-1.15%) helped: 45 HURT: 0 total cycles in shared programs: 265692 -> 255188 (-3.95%) cycles in affected programs: 263122 -> 252618 (-3.99%) helped: 184 HURT: 26 Tessellation Evaluation Shaders: total instructions in shared programs: 23236 -> 23157 (-0.34%) instructions in affected programs: 2791 -> 2712 (-2.83%) helped: 27 HURT: 0 total cycles in shared programs: 151858 -> 149704 (-1.42%) cycles in affected programs: 151858 -> 149704 (-1.42%) helped: 101 HURT: 114 Geometry Shaders: Orbital Explorer goes from 6442 -> 6356 instructions. Two Shadow of Mordor shaders increase by a single instruction. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-04-22 10:26:30 -07:00
Topi Pohjolainen	1883613a24	i965/blorp: Add support for 2x msaa Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-22 17:02:29 +03:00
Topi Pohjolainen	125a7fdf32	i965/blorp: Add support for encoding/decoding interleaved 2x msaa Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-22 17:01:29 +03:00
Samuel Iglesias Gonsálvez	f70cacc4bd	i965: don't lower mod() in glsl ir NIR will lower it in nir_opt_algebraic. No change in shader-db. Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-22 13:44:28 +02:00
Timothy Arceri	72b5d00c9c	glsl: fix cross validation for explicit locations on structs and arrays Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-22 20:59:57 +10:00
Nicolai Hähnle	39e9cf6cb1	radeonsi: implement TGSI_SEMANTIC_HELPER_INVOCATION Depends on LLVM support introduced in r267102. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-21 23:14:04 -05:00
Ilia Mirkin	2bac561787	swr: ignore generated files in rasterizer Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-04-22 00:07:25 -04:00
Ilia Mirkin	88ca4a43a2	nvc0: fix retrieving query results into buffer for timestamps The timestamps are stored in a funny place, and even though they are a 64-bit result, are not stored with is64bit. Account for that when retrieving the query result into a resource. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "11.2" <mesa-stable@lists.freedesktop.org>	2016-04-22 00:06:49 -04:00
Jason Ekstrand	541e6c0500	i965/surface_state: Use libisl functions for image format lowering This lets us delete some redundant code and keep all of the image_load_store format lowering logic in one place: libisl. Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	e53cabe730	i965/fs_surface_builder: Use isl instead of mesa for format info Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	1831fa104c	i965/fs_surface_builder: Add a helper for converting GL to ISL formats Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	24bb75049b	i965/fs_surface_builder: Explicitly handle FORMAT_NONE in num_image_coordinates Previously, we were relying on has_matching_typed_format returning true for MESA_FORMAT_NONE which, in turn, relied on _mesa_get_format_bytes returning 1 for MESA_FORMAT_NONE. When we switch to ISL, this behaviour will no longer be something we can rely on. Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	f310c02b94	i965/fs_surface_builder: Take a GL format enum instead of mesa_format Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	2980507a19	isl/format: Add a get_num_channels helper Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	3415cf5f2f	isl/format: Add more isl_format_has_type_channel functions Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	a4c04dd410	isl/format: Break the guts of has_[us]int_channel into a helper Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	ca8c5993bf	anv/image: Use the has_matching_typed_storage_image_format helper from isl Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	65bd8317e2	isl: Add a helper for determining when a typed load/store can be used Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	90576ac963	isl: Take a devinfo in lower_storage_image_format instead of an isl_device We want to call this function from the shader compiler and having a full isl_device available at that point isn't practical. Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	37f6f21b1f	isl: Don't use designated initializers in the header C++ doesn't support designated initializers and g++ in particular doesn't handle them when the struct gets complicated, i.e. has a union. Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	2785840586	isl: Include c99_compat.h We need the restrict keyword in isl.h Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Jason Ekstrand	ef5dca2034	i965: Add a dependency on libisl To avoid build issues, ensure that you're running `make' at the top level and/or you've executed `make clean' beforehand. Reviewed-by: Chad Versace <chad.versace@intel.com>	2016-04-21 20:44:27 -07:00
Nicolai Hähnle	fe3b1e1448	radeon: handle query buffer allocation and mapping failures Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=94984 Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-21 22:33:12 -05:00
Nicolai Hähnle	b222580578	radeon: wire end_query return value to sw/hw_end Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-21 22:33:07 -05:00
Nicolai Hähnle	71f33a6f69	st/mesa: check return value of begin/end_query They can only indicate out of memory conditions, since the other error conditions are caught earlier. v2: fix error message in EndQuery Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2016-04-21 22:33:03 -05:00
Nicolai Hähnle	32214e0c68	gallium: add bool return to pipe_context::end_query Even when begin_query succeeds, there can still be failures in query handling. For example for radeon, additional buffers may have to be allocated when queries span multiple command buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-04-21 22:32:50 -05:00
Ben Widawsky	6a0d036483	i965: Always use Y-tiled buffers on SKL+ Starting with Skylake, the display engine is capable of scanning out from Y-tiled buffers. As such, we can and should use Y-tiling for better efficiency. This also has the added benefit of being able to fast clear the winsys buffer. Note that the buffer allocation done for mipmaps will already never allocate an X-tiled buffer for GEN9. This has an almost universal positive impact on benchmarks, some improving by as much as 20%. Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-04-21 20:14:58 -07:00
Marek Olšák	c3b88cc2c1	softpipe: fix a warning due to an incorrect enum comparison no change in behavior, because both are defined the same Acked-by: Jose Fonseca <jfonseca@vmware.com>	2016-04-22 01:30:39 +02:00
Marek Olšák	c9e5a7df61	gallium: remove helpers converting to/from TGSI_PROCESSOR_* Acked-by: Jose Fonseca <jfonseca@vmware.com>	2016-04-22 01:30:39 +02:00
Marek Olšák	af249a7da9	gallium: use PIPE_SHADER_* everywhere, remove TGSI_PROCESSOR_* Acked-by: Jose Fonseca <jfonseca@vmware.com>	2016-04-22 01:30:39 +02:00
Marek Olšák	fb523cb6ad	gallium: merge PIPE_SWIZZLE_* and UTIL_FORMAT_SWIZZLE_* Use PIPE_SWIZZLE_* everywhere. Use X/Y/Z/W/0/1 instead of RED, GREEN, BLUE, ALPHA, ZERO, ONE. The new enum is called pipe_swizzle. Acked-by: Jose Fonseca <jfonseca@vmware.com>	2016-04-22 01:30:39 +02:00
Marek Olšák	ed23335a31	gallium: use enums in p_shader_tokens.h (v2) Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> (v1) Reviewed-by: Roland Scheidegger <sroland@vmware.com> (v1) Acked-by: Jose Fonseca <jfonseca@vmware.com> (v1) v2: name enums	2016-04-22 01:30:36 +02:00
Marek Olšák	0135bd44c2	gallium: use enums in p_defines.h (v2) and remove number assignments which are consecutive Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> (v1) Reviewed-by: Roland Scheidegger <sroland@vmware.com> (v1) Acked-by: Jose Fonseca <jfonseca@vmware.com> (v1) v2: name enums	2016-04-22 01:30:34 +02:00
Marek Olšák	8cfc4cf76d	radeonsi: remove the shader parameter from si_set_ring_buffer not used anymore this is a follow-up to the RW buffer cleanup. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-04-22 01:14:14 +02:00
Marek Olšák	3cbd8cfc7a	radeonsi: decrease GS copy shader user SGPRs to 2 const buffers are no longer used since the clip plane const buffer was moved to RW buffers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:14 +02:00
Marek Olšák	3acaefb1bb	radeonsi: shorten slot masks to 32 bits Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:14 +02:00
Marek Olšák	0954d5e982	radeonsi: clean up shader resource limit definitions Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:14 +02:00
Marek Olšák	3138a28ff2	radeonsi: move default tess level constant buffer to RW buffers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:14 +02:00
Marek Olšák	302bec24bd	radeonsi: move sample positions constant buffer to RW buffers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:13 +02:00
Marek Olšák	860b658b97	radeonsi: move clip plane constant buffer to RW buffers Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:13 +02:00
Marek Olšák	698821bda3	radeonsi: rework polygon stippling to use constant buffer instead of texture add it to the RW_BUFFERS descriptor array now the slot masks don't have to have 64 bits Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:13 +02:00
Marek Olšák	bb1e647ada	radeonsi: generalize si_set_constant_buffer this will be used in the next commit Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:13 +02:00
Marek Olšák	36261c29cd	radeonsi: make RW buffer descriptor array global, not per shader stage v2: also simplify invalidation of RW buffer bindings (squashed) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:13 +02:00
Marek Olšák	1378487fb4	radeonsi: rename and rearrange RW buffer slots - use an enum - use a unique slot number regardless of the shader stage (the per-stage slots will go away for RW buffers) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-04-22 01:14:13 +02:00
Roland Scheidegger	4ff8cbb0d8	gallivm: fix bogus argument order to lp_build_sample_mipmap function Screwed up since `0753b135f6`. (Only an issue with different min/mag filters, and then only in some cases, which is probably why it went unnoticed for quite a while. The effect should have simply been nearest mip filter instead of linear, iff min was nearest, mag was linear, and all pixels hit the mignifying path.) Fixes a bunch of dEQP failures. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Cc: "11.1 11.2" <mesa-stable@lists.freedesktop.org>	2016-04-21 23:57:24 +02:00
Kenneth Graunke	73b01e2711	i965: Fix clear code for ignoring colormask for XRGB formats on Gen9+. In commit `cda886a485`, Neil made us stop advertising RGBX formats on Gen9+, as the hardware apparently no longer has working fast clear support for those formats. Instead, we just fall back to RGBA formats, and use SCS to override alpha to 1.0. This is fine, but had one unintended side effect: it made us fall back to slow clears when the color mask disables alpha. Normally, we ignore the color mask for non-existent channels. This includes alpha for XRGB formats as writing garbage to the X channel is harmless. But, now that we use RGBA, we think there's a real alpha channel, and can't do the optimization. To hack around this, check if _BaseFormat is GL_RGB and ignore alpha. Improves WebGL Aquarium performance on Skylake GT3e by about 50% by letting it use repclears instead of slow clears. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-04-21 12:01:49 -07:00
Iago Toral Quiroga	bdaa0e12a2	i965/blorp: Improve precission of blitting coordinates when clipping We do this in two steps: first we clip the dst rect and adjust the src rect accordingly. Then we do it the other way around. In both passes the adjustment part involves multiplying by a scale factor that can lead to a small precision loss. This is breaking a few dEQP tests. Specifically, the problem happens when we need to clip the same coordinate twice. For example, if srcX0 and dstX0 need both to be clipped we want to avoid the situation where we clip srcX0 first, then adjust dstX0 accordingly but then we realize that the resulting dstX0 still needs to be clipped, so we clip dstX0 and adjust srcX0 again. Each of these two passes can lead to precission loss. What we want to do here is detect the rect that leads to the largest clip (accounting for the scale factor involved), clip that rect and adjust the other one. With this we ensure that the adjusted coordinate does not need to be clipped again and we can skip a second pass, improving precision. Fixes the following 4 dEQP tests: dEQP-GLES3.functional.fbo.blit.rect.out_of_bounds_reverse_src_x_nearest dEQP-GLES3.functional.fbo.blit.rect.out_of_bounds_reverse_src_x_linear dEQP-GLES3.functional.fbo.blit.rect.out_of_bounds_reverse_dst_x_nearest dEQP-GLES3.functional.fbo.blit.rect.out_of_bounds_reverse_dst_x_linear Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Mark Janes <mark.a.janes@intel.com>	2016-04-21 10:43:39 -07:00
Bas Nieuwenhuizen	38f4cee3ff	radeonsi: Add config parameter to si_shader_apply_scratch_relocs. shader->config is not updated for compute kernels. Signed-off-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Tom Stellard <thomas.stellard@amd.com>	2016-04-21 19:36:19 +02:00

... 8 9 10 11 12 ...

80948 commits