fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 15:58:06 +02:00

Author	SHA1	Message	Date
Ilia Mirkin	0a5e1b02cf	swr: don't clear all dirty bits when changing so targets Among other things, blits would clear existing SO targets which would cause a bunch of updates from u_blitter to be missed. Fixes fbo-scissor-blit fbo, probably among many others. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-11-28 19:41:23 -05:00
Ilia Mirkin	8a70a4d984	swr: [rasterizer core] fix typo in scissor tile-alignment logic Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-28 19:41:13 -05:00
Kenneth Graunke	15d3fc167a	anv: Fix cache UUID generation. I asked Emil to switch from 0 (success) vs. -1 (fail) to use a boolean in my review comments. The "not" went missing. Easy mistake, but the result is that nothing runs at all :) Fix whitespace while we're here too. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org>	2016-11-28 13:40:04 -08:00
Gwan-gyeong Mun	65ea559465	vulkan/wsi: Fix resource leak in success path of wsi_queue_init() It fixes leakage of pthread_condattr resource on wsi_queue_init() Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Mun Gwan-gyeong <elongbug@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2016-11-28 21:11:25 +00:00
Gwan-gyeong Mun	b178652b41	anv: Update the teardown in reverse order of the anv_CreateDevice This updates releasing of resource in reverse order of the anv_CreateDevice to anv_DestroyDevice. And it fixes resource leak in pthread_mutex, pthread_cond, anv_gem_context. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Mun Gwan-gyeong <elongbug@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-28 21:11:25 +00:00
Gwan-gyeong Mun	ca4706960c	anv: drop the return type for anv_queue_init() anv_queue_init() always returns VK_SUCCESS, so caller does not need to check return value of anv_queue_init(). Signed-off-by: Mun Gwan-gyeong <elongbug@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-28 21:11:25 +00:00
Gwan-gyeong Mun	ecc618b0d8	anv: Add missing error-checking to anv_block_pool_init (v2) When the memfd_create() and u_vector_init() fail on anv_block_pool_init(), this patch makes to return VK_ERROR_INITIALIZATION_FAILED. All of initialization success on anv_block_pool_init(), it makes to return VK_SUCCESS. CID 1394319 v2: Fixes from Emil's review: a) Add the return type for propagating the return value to caller. b) Changed anv_block_pool_init() to return VK_ERROR_INITIALIZATION_FAILED on failure of initialization. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Mun Gwan-gyeong <elongbug@gmail.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-28 21:11:25 +00:00
Chandu Babu Namburu	02bf1bbe6e	st/omx/dec/h264: consider POC as signed instead of unsigned picture order count can be a negative value Reviewed-by: Christian König <christian.koenig@amd.com>	2016-11-28 15:31:51 -05:00
Emil Velikov	7c277eae98	radv: don't return VK_SUCCESS if radv_device_get_cache_uuid() fails If radv_device_get_cache_uuid() fails result will be VK_SUCCESS as set by the radv_init_wsi() call above. Fixes: `d943839` (radv: Use library mtime for cache UUID.) Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-11-28 19:51:31 +00:00
Emil Velikov	78707a15f2	radv: don't leak the fd if radv_physical_device_init() succeeds radv_amdgpu_winsys_create() does not take ownership of the fd, thus we end up leaking it as we return with VK_SUCCESS. Cc: Dave Airlie <airlied@redhat.com> Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-11-28 19:51:22 +00:00
Emil Velikov	a1cf494f77	anv: don't leak memory if anv_init_wsi() fails brw_compiler_create() rzalloc-ates memory which we forgot to free. Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-28 19:47:34 +00:00
Emil Velikov	3af8171547	anv: don't double-close the same fd Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-11-28 19:47:28 +00:00
Emil Velikov	2d42a34566	anv: automake: don't generate anv_timestamp.h No longer used as of last commit. Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-11-28 19:47:17 +00:00
Emil Velikov	83548e1292	anv: Use library mtime for cache UUID. Inspired by a similar commit for radv. Rather than recomputing the timestamp on each make invocation, just fetch it at runtime. Thus we no longer get the constant rebuild of anv_device.c and the follow-up libvulkan_intel.so link, when nothing has changed. I.e. using make && make install is a little bit faster. v2: Use bool return type (Ken). Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-11-28 19:46:45 +00:00
Emil Velikov	de138e9ced	anv: Store UUID in physical device. Port of an equivalent commit for radv. v2: Move the call just after MMAP_VERSION (Ken). Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-28 19:46:05 +00:00
Emil Velikov	3f9397753b	isl: Make isl_finishme only warn once per call-site Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-11-28 19:12:49 +00:00
Emil Velikov	f3a1c17b96	radv: Make radv_finishme only warn once per call-site Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-11-28 19:12:48 +00:00
Emil Velikov	7feac8bdb9	anv: use do { } while (0) in the anv_finishme macro Use the generic construct instead of the currect GCC specific one. Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-11-28 19:12:38 +00:00
Dave Airlie	09c0c17bc3	radv: fix 3D clears with baseMiplevel This fixes: dEQP-VK.api.image_clearing.clear_color_image.3d* These were hitting an assert as the code wasn't taking the baseMipLevel into account when minify the image depth. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com> Cc: "13.0" <mesa-stable@lists.freedesktop.org>	2016-11-28 07:10:12 +00:00
Dave Airlie	020978af12	radv: brown-paper bag for a forgotten else. This fixes the fix: radv/ac/llvm: fix regression with shadow samplers fix Signed-off-by: Dave Airlie <airlied@redhat.com> Cc: "13.0" <mesa-stable@lists.freedesktop.org>	2016-11-28 16:23:10 +10:00
Dave Airlie	b2e217369e	radv/ac/llvm: fix regression with shadow samplers fix This fixes `b56b54cbf1`: radv/ac/llvm: shadow samplers only return one value It makes sure we only do that for shadow sampling, as opposed to sizing requests. Signed-off-by: Dave Airlie <airlied@redhat.com> Cc: "13.0" <mesa-stable@lists.freedesktop.org>	2016-11-28 15:43:59 +10:00
Dave Airlie	b56b54cbf1	radv/ac/llvm: shadow samplers only return one value. The intrinsic engine asserts in llvm due to this. Reported-by: Christoph Haag <haagch+mesadev@frickel.club> Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-27 23:05:01 +00:00
Dave Airlie	9838db8f64	radv/si: fix optimal micro tile selection The same fix was posted for radeonsi, so port it here. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-27 23:03:20 +00:00
Emil Velikov	a025c5b2c7	radv: honour the number of properties available Cap up-to the number of properties available while copying the data. Otherwise we might crash and/or leak data. Cc: Dave Airlie <airlied@redhat.com> Cc: "13.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-27 23:03:01 +00:00
Mun Gwan-gyeong	0a27dd458b	radv: drop the return type for radv_queue_init() radv_queue_init() always returns VK_SUCCESS, so caller does not need to check return value of radv_queue_init(). Signed-off-by: Mun Gwan-gyeong <elongbug@gmail.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-11-27 23:00:57 +00:00
Rob Clark	8cb965b112	freedreno: fix slice size for imported buffers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-27 17:26:05 -05:00
Rob Clark	f4ffe2786b	freedreno/a3xx: make _emit_const() static Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-27 17:26:05 -05:00
Rob Clark	b8b800d18a	freedreno/a4xx: make _emit_const() static Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-11-27 17:26:05 -05:00
Jason Ekstrand	af98c6c31d	anv/pipeline: Make is_dual_src_blend_factor inline It's not used on gen8+ so it causes unused function warnings. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-11-26 11:58:59 -08:00
Jason Ekstrand	e41f7c3063	anv/pipeline: Make the temp blend attachment state pointer const This fixes a "discards const" warning since blend is const. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-11-26 11:55:09 -08:00
Samuel Pitoiset	8fdb800bda	gm107/ir: optimize 32-bit CONST load to mov This is not allowed for indirect accesses because the source GPR might be erased by a subsequent instruction (WaR hazard) if we don't emit a read dep bar. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-11-26 19:05:11 +01:00
Samuel Pitoiset	948cce0196	gm107/ir: do not combine CONST loads This will allow to use MOV instead of LD. The main advantage is that MOV doesn't require a read dependency barrier while LD does, and so this will both reduce barriers pressure and the number of stall counts needed to read data from constant memory. This is currently only for user uniform accesses. I should do something similar when loading from the driver constant buffer but it seems like a bit tricky to handle for now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-11-26 19:05:08 +01:00
Jason Ekstrand	fa6bbb5c00	anv/device: Remove a bogus finishme comment We've been properly detecting bit6 swizzling for a long time now.	2016-11-25 21:46:11 -08:00
Ben Widawsky	2a7db18890	i965: Enable fast clears for multi-lod On SKL (also fast clear is used for level 0, layer 0): Manhattan 3.0: 3.88434% +/- 0.814659% Manhattan 3.0 off: 3.25542% +/- 0.101149% Trex: 3.43501% +/- 0.31223% Trex off: 4.13781% +/- 0.0993569% ON BDW: Manhattan 3.0: 1.37079% +/- 0.571208% Manhattan 3.0 off: 1.74029% +/- 0.267499% v2 (Ben, Matt): Fix rebase error by removing the perf warning v3 (Topi): Rebased on top of revised eligibility logic Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	3aec6bce5b	i965: Allow single-sampled miptree to be resolved and shared Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	17d7c5a037	i965/gen8: Relax asserts prohibiting arrayed/mipmapped fast clears Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	544ed74315	i965: Use ISL for CCS layouts One can now also delete intel_get_non_msrt_mcs_alignment(). v2 (Jason): Do not leak aux buf but allocate only after getting ISL surfaces. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	96dbe765e1	i965: Resolve non-compressed fast clears prior layered rendering Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	dea8e7fb07	i965: Restrict fast color clear on first slice only Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	d41fc8dc9f	i965: Track fast color clear state in level/layer granularity Note that RESOLVED is not tracked in the map explicitly. Absence of item implicitly means RESOLVED state. v2: Added intel_resolve_map_clear() into intel_miptree_release() v3 (Jason): Properly handle the assumption of resolve map not containing any items with state RESOLVED. Removed unnecessary intel_miptree_set_fast_clear_state() call in brw_blorp_resolve_color() preventing intel_miptree_set_fast_clear_state() from asserting against RESOLVED. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	28dc3f6199	i965: Move fast clear state enumeration into resolve map Status is still tracked per miptree. Next patch will switch to resolve map per slice/level. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	6859d2ba2e	i965: Refactor check if color resolve is needed Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	ea2c419600	i965: Add plumbing for fast clear layer/level details Until now fast clear has been supported only for non-layered and non-mipmapped buffers. However, from gen8 onwards there is hardware support also for layered/mipmapped. Once this is enabled, fast clear operations target specific layer/level and call for the state to be tracked in the same granularity. This is the first step providing the details from callers to the state tracking. Patch introduces new interface for reading and writing the state hiding the upcoming bookkeeping changes in the call sites. There is bunch of sanity checks added that will be relaxed per hardware generation later on when the actual functionality is enabled. v2: Rebased on top current master setting the state in blorp_surf_for_miptree(). v3: Replace open-coded resolved check in surface state emission with intel_miptree_has_color_unresolved(). Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	d07cf68a97	i965: Add interface for checking multiple slices if any is unresolved Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:07 +02:00
Topi Pohjolainen	17e6a214fd	i965: Provide slice details to renderbuffer fast clear state tracker This patch also introduces getter and setter for fast clear state preparing for tracking the state per slice. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:06 +02:00
Topi Pohjolainen	cec30a6669	i965: Split per miptree and per slice/level fast clear bits Currently the status bits for fast clear include the flag telling if non-multisampled mcs buffer should be used at all. Once the state tracking is changed to follow individual levels/layers one still needs to have the mcs enabling information in the miptree. Therefore simply split it out to its own boolean. Possible follow-up work is to combine disable_aux_buffers and no_ccs into single enum. v2 (Jason): Changed no_msrt_mcs to no_ccs and updated comment Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:06 +02:00
Topi Pohjolainen	9c7717c066	i965: Provide slice details to color resolver v2: Make intel_miptree_resolve_color() take start layer and layer count. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:06 +02:00
Topi Pohjolainen	12010b9226	i965: Add new interface for full color resolves Upcoming patches will introduce fast clear in level/layer granularity like the driver does already for depth/hiz. This patch introduces equivalent full resolve option. Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:06 +02:00
Topi Pohjolainen	71d48d6f42	i965: Refactor lossless compression state tracking Essentially this moves fast clear state update away from surface state setup into brw_postdraw_set_buffers_need_resolve() that gets called just after draw submission. Calling intel_miptree_used_for_rendering() can be drop for gen6 and earlier as it is no-op. v2: Rebased on top current master setting the state in blorp_surf_for_miptree(). Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-11-25 16:57:06 +02:00
Andres Gomez	b27be186cb	Revert "glsl: allow layout qualifier overrides with ARB_shading_language_420pack" This reverts commit `aaa69c79cd`. The commit was erroneous because the ast_layout_expression class is meant to hold a list used for an after check that all the declared values for a layout-qualifier-name are consistent. Therefore, the check for the possibility of duplicated values was previously fixed to happen much sooner, in the GLSL parser and the merge of layout qualifiers, and the process_qualifier_constant method only needs to check that the values are consistent. By now, those layout-qualifier-name represented as a ast_layout_expression are "max_vertices", "invocations", "vertices", "local_size_[x\|y\|z]" and "xfb_stride". From page 40 (page 46 of the PDF) of the GLSL 1.50 spec: " All geometry shader output layout declarations in a program must declare the same layout and same value for max_vertices." From page 44 (page 50 of the PDF) of the GLSL 4.00 spec: " If an invocation count is declared, all such declarations must specify the same count." From page 47 (page 53 of the PDF) of the GLSL 4.00 spec: " All tessellation control shader layout declarations in a program must specify the same output patch vertex count." From page 60 (page 66 of the PDF) of the GLSL 4.30 spec: " Also, if such a layout qualifier is declared more than once in the same shader, all those declarations must set the same set of local work-group sizes and set them to the same values; otherwise a compile-time error results. If multiple compute shaders attached to a single program object declare local work-group size, the declarations must be identical; otherwise a link-time error results." From page 73 (page 79 of the PDF) of the GLSL 4.40 spec: " While xfb_stride can be declared multiple times for the same buffer, it is a compile-time or link-time error to have different values specified for the stride for the same buffer." Fixes GL44-CTS.enhanced_layouts.xfb_duplicated_stride Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com> Signed-off-by: Andres Gomez <agomez@igalia.com>	2016-11-25 13:18:31 +02:00

1 2 3 4 5 ...

79721 commits