fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-01 18:20:10 +01:00

Author	SHA1	Message	Date
Nicolai Hähnle	d3931a355f	radeonsi: fix isolines tess factor writes to control ring Fixes piglit arb_tessellation_shader/execution/isoline{_no_tcs}.shader_test. Cc: mesa-stable@lists.freedesktop.org	2016-12-07 11:21:32 +01:00
Kenneth Graunke	9871bde351	i965: Drop redundant key->outputs_written initialization. This was already set to the same value earlier. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-12-06 22:14:58 -08:00
Kenneth Graunke	09ffc5c84f	i965: Initialize "separate" flag in VUE maps. This was uninitialized, which resulted in weird looking printouts where it appeared that the TCS output and TES input patch URB entries differed in SSO/non-SSO layout. There is no "separable" layout for both, as they're tied together. It should have no other actual effect. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-12-06 22:14:58 -08:00
Ian Romanick	b87039499b	nir: In split_var_copies_block, uint, int, and bool types cannot be matrices Noticed while adding support for 64-bit integer types. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-06 17:30:38 -08:00
Tom Stellard	4c8c13b356	radeonsi: Use amdgcn intrinsics for fs interpolation Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2016-12-07 00:42:40 +00:00
Rob Clark	a9383ae6d6	freedreno/a5xx: fix draw packet size with index buffer gpuaddr of idx buffer is now two dwords (64b). Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	ec24f009ca	freedreno/a5xx: gmem bypass mode Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	85a3057f65	freedreno/a5xx: fix emit_string_marker() Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	c1e9cca696	freedreno: pitch alignment should match gmem alignment Deal w/ differing gmem tile size alignment between generations, and make sure texture pitch matches. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	8f4da2ff63	freedreno/a5xx: more formats Bunch of stuff we can at least turn on for vbo formats. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	b337099849	freedreno/a5xx: fix fragface Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	f143eeaffa	freedreno/a5xx: fix fragcoord Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	f5c5f76255	freedreno: update generated headers Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	3ec4d1f809	freedreno/a5xx: fix alpha test GRAS_SU_DEPTH_PLANE_CNTL doesn't in fact seem to be anything to do with alpha test. This fixes xonotic and (other than some iommu faults) gets gnome-shell working. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Rob Clark	2b305725e2	freedreno/a5xx: fix VPC_VAR[n].DISABLE bits We don't need varying interpolators enabled for pos/psize out of the VS (despite the fact that they show up in VS_OUT map), so emit these before we append pos/psize to the linkage. Signed-off-by: Rob Clark <robdclark@gmail.com>	2016-12-06 18:01:31 -05:00
Nanley Chery	72db1570b4	anv/TODO: Document sampling from HiZ Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2016-12-06 14:51:30 -08:00
Kenneth Graunke	05a4e3a009	i965: Don't force SSO layout for VS->TCS. This was a hack which worked around the VS and TCS disagreeing on their shared interface due to the lack of varying packing. In particular, it was needed by Piglit's tcs-input-read-array-interface test. However, that was just one case where things could go awry, so the previous commit forcibly made interfaces match. This hack is no longer necessary. It also seems to be broken, though I'm not sure why. It fixes Piglit regressions in spec/arb_shader_image_load_store/semantics from commit `ec1f159ac8`. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98893 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2016-12-06 12:36:21 -08:00
Kenneth Graunke	44fd85d8eb	i965: Unify shader interfaces explicitly. A while ago, I made i965 start compiling shaders independently. The VUE map layouts were based entirely on each shader's input/output bitfields. Assuming the interfaces match, this works out well - both sides will compute the same layout, and outputs are correctly routed to inputs. At the time, I had assumed that the linker would guarantee that the interfaces match. While it usually succeeds, it unfortunately seems to fail in some cases. For example, Piglit's tcs-input-read-array-interface test has a VS output array with two elements, but the TCS only reads one. The linker isn't able to eliminate the unused element from the VS, which makes the interfaces not match. Another case is where a shader other than the last writes clip/cull distances. These should be demoted to ordinary varyings, but they currently aren't - so we think they still have some special meaning, and prevent them from being eliminated. Fixing the linker to guarantee this in all cases is complicated. It needs to be able to optimize out dead code. It's tied into varying packing and other messiness. While we can certainly improve it---and should---I'd rather not rely on it being correct in all cases. This patch ORs adjacent stages' input/output bitfields together, ensuring that their interface (and hence VUE map layout) will be compatible. This should safeguard us against linker insufficiencies. Fixes line rendering in Dolphin, and the Piglit test based on it: spec/glsl-1.50/execution/geometry/clip-distance-vs-gs-out. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=97232 Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2016-12-06 12:34:23 -08:00
Jason Ekstrand	eb7b51d62a	genxml/gen9: Change the default of MI_SEMAPHORE_WAIT::RegisterPoleMode We would really like it to be false as that's what you get on hardware that doesn't have RegisterPoleMode (Sky Lake for example). While we're at it, we change it to a boolean. This fixes dEQP-VK.synchronization.smoke.events on Broxton. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0" <mesa-stable@lists.freedesktop.org>	2016-12-06 11:35:13 -08:00
Roland Scheidegger	8ac3c1bf1a	gallivm: optimize 16bit->32bit gather path a bit LLVM can't really optimize anything which crosses scalar/vector boundaries, so help a bit with some particular gather operations when the width is expanded (only do it for 16->32bit expansion for now), by doing expansion after fetch. That is probably a better solution anyway even if llvm would recognize it, makes for cleaner IR... Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-12-06 20:06:06 +01:00
Roland Scheidegger	fd5f420fbb	gallivm: handle 16bit float fetches in lp_build_fetch_rgba_soa Note that we really want to _never_ reach the bottom of the function, which resorts to AoS fetch. Half floats can be handled just like other formats which fit into 32bit vectors (so, only 1x16 and 2x16 formats, albeit with more channels things are not THAT bad), with minimal plumbing. I've seen code size go down nearly by a factor of 3 for a complete texture sampling function (including bilinear filtering) using R16F. (What we should do for everything not special cased is to do AoS gather, shuffle/shift things into SoA vectors, and then do the conversion there. Otherwise it's particularly bad with 1 or 2 channel formats - that r16f format with either 4 or 8-wide vectors was still doing one element at a time, essentially doing exactly the same work as for rgba16f. Also replacing the channels with SWIZZLE0/1 (particularly the latter) adds even more work, as it has to be done per aos vector, and not just straightforward at the end with the SoA vector.) Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-12-06 20:06:06 +01:00
Roland Scheidegger	775a244645	util: (trivial) ETC1 meets the criteria for fitting into unorm8 Just like other similar compressed formats. Reviewed-by: Jose Fonseca <jfonseca@vmware.com>	2016-12-06 20:06:06 +01:00
Matt Turner	43cdbb3e6a	i965: Emit proper NOPs. The PRMs for HSW and newer say that other than the opcode and DebugCtrl bits of the instruction word, the rest must be zero. By zeroing the instruction word manually, we avoid using any of the state inherited through brw_codegen. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96959 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-12-06 10:42:46 -08:00
Roland Scheidegger	9c95ad24cc	glsl: (trivial) fix type typo Accidentally changed the type of a constant in `df33f11b39` causing assertion failures.	2016-12-06 17:44:21 +01:00
Kenneth Graunke	a41f5dcb14	i965: Allocate at least some URB space even when max_vertices = 0. Allocating zero URB space is a really bad idea. The hardware has to give threads a handle to their URB space, and threads have to use that to terminate the thread. Having it be an empty region just breaks a lot of assumptions. Hence, why we asserted that it isn't possible. Unfortunately, it /is/ possible prior to Gen8, if max_vertices = 0. In theory a geometry shader could do SSBO/image access and maybe still accomplish something. In reality, this is tripped up by conformance tests. Gen8+ already avoids this problem by placing the vertex count DWord in the URB entry header. This fixes things on earlier generations. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Tested-by: Ian Romanick <ian.d.romanick@intel.com>	2016-12-05 20:47:03 -08:00
Roland Scheidegger	cd9bb4b918	main: allow NEAREST_MIPMAP_NEAREST for stencil texturing As per GL 4.5 rules, which fixed a spec mistake in GL_ARB_stencil_texturing. The extension spec wasn't updated, but just allow it with older GL versions as well, hoping there aren't any crazy tests which want to see an error there... (Compile tested only.) Reported by Józef Kucia <joseph.kucia@gmail.com> Acked-by: Józef Kucia <joseph.kucia@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2016-12-06 04:10:43 +01:00
Roland Scheidegger	df33f11b39	glsl: fix ldexp lowering if bitfield insert lowering is also requested Trivial, this just resurrects the code which was there once upon a time (the code can't lower instructions generated in the lowering pass there, and even if it could it would probably be suboptimal). This fixes piglit mesa_shader_integer_functions fs-ldexp.shader_test and vs-ldexp.shader_test with llvmpipe. Reviewed-by: Matt Turner <mattst88@gmail.com>	2016-12-06 04:10:43 +01:00
Nayan Deshmukh	3015a23fe0	radv: fix resource leak in radv_amdgpu_ctx_create CovID: 1396387 V2. Fixup bad whitespace. Signed-off-by: Nayan Deshmukh <nayan26deshmukh@gmail.com> Reviewed-by: Edward O'Callaghan <funfunctor@folkore1984.net>	2016-12-06 11:49:01 +11:00
Andy Furniss	5338fb34d6	st/omx/enc Raise default encode level Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=91281 Signed-off-by: Andy Furniss <adf.lists@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-12-05 19:39:47 -05:00
Andy Furniss	2a38a5b2b2	radeon/vce Handle H.264 level 5.2 Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=91281 v2: explicitly add case 52 Signed-off-by: Andy Furniss <adf.lists@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-12-05 19:39:47 -05:00
Jason Ekstrand	7db009b59e	nir: Remove some unused fields from nir_variable All of these are happily set from glsl_to_nir or spirv_to_nir but their values are never used for anything. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-12-05 15:40:10 -08:00
Jason Ekstrand	50e0b0bee3	nir: Delete most of the constant_initializer support Constant initializers have been a constant (ha!) pain for quite some time. While they're useful from a language perspective, people writing passes or backends really don't want deal with them most of the time. This commit removes most of the constant initializer support from NIR. It is expected that you call nir_lower_constant_initializers VERY EARLY to ensure that they're gone before you do anything interesting. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-12-05 15:40:09 -08:00
Jason Ekstrand	2f19c19b5d	nir: Simplify nir_lower_gs_intrinsics It's only ever called on single-function shaders. At this point, there are a lot of helpers that can make it all much simpler. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-12-05 15:40:09 -08:00
Jason Ekstrand	257aa5a1c4	nir/lower_returns: Stop using constant initializers Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-12-05 15:40:09 -08:00
Jason Ekstrand	507626304c	glsl/nir: Call nir_lower_constant_initializers Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-12-05 15:40:09 -08:00
Jason Ekstrand	c5d664f9dc	anv/pipeline: Call nir_lower_constant_initializers Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-12-05 15:40:09 -08:00
Jason Ekstrand	f5232db9e5	nir: Add a pass for lowering away constant initializers Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-12-05 15:40:09 -08:00
Jason Ekstrand	0291bf4db2	Revert "i965: use nir_lower_indirect_derefs() for GLSL" This reverts commit `9404439a75`. I didn't intend to push it and it breaks clip and cull distance.	2016-12-05 15:21:20 -08:00
Jason Ekstrand	5f0e4c7c79	i965: Delete the meta-base CopyImageSubData implementation When I originally implemented the ARB_copy_image extension, the fast-path was written in meta using texture views. This path only worked if both images were uncompressed color images. All of the other cases fell back to the blitter or, in the worst case, mapping and memcpy on the CPU. Now that we have the blorp path, it handles all copies ever and the old meta, blitter, and CPU paths are only used on gen5 and below. The primary reason why we needed the meta path (apart from having a slow blitter on later hardware) was to handle multisampling which gen5 and earlier don't support anyway. Since the blitter is reasonably fast on gen5, we can just delete the meta path and get rid of all that terrible code. If we decide that we're ok with just disabling ARB_copy_image on gen5 and earlier (I personally am), then we could get rid of another 300 lines or so of semi-hairy code. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2016-12-05 14:00:35 -08:00
Jason Ekstrand	06d864921e	i965/copy_image: Re-implement the blitter path with emit_miptree_blit By using emit_miptree_blit which does chunking, this fixes the blitter path for the case where the image is too tall to blit normally. We also pull it into intel_blit as intel_miptree_copy. This matches the naming of the blorp blit and copy functions brw_blorp_blit and brw_blorp_copy. Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: "13.0" <mesa-dev@lists.freedesktop.org>	2016-12-05 14:00:35 -08:00
Jason Ekstrand	6c74e7f492	i965/blit: Break the guts of intel_miptree_blit into a helper Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: "13.0" <mesa-dev@lists.freedesktop.org>	2016-12-05 14:00:35 -08:00
Timothy Arceri	9404439a75	i965: use nir_lower_indirect_derefs() for GLSL This moves the nir_lower_indirect_derefs() call into brw_preprocess_nir() so thats is called by both OpenGL and Vulkan and removes that call to the old GLSL IR pass lower_variable_index_to_cond_assign() We want to do this pass in nir to be able to move loop unrolling to nir. There is a increase of 1-3 instructions in a small number of shaders, and 2 Kerbal Space program shaders that increase by 32 instructions. Shader-db results BDW: total instructions in shared programs: 8705873 -> 8706194 (0.00%) instructions in affected programs: 32515 -> 32836 (0.99%) helped: 3 HURT: 79 total cycles in shared programs: 74618120 -> 74583476 (-0.05%) cycles in affected programs: 528104 -> 493460 (-6.56%) helped: 47 HURT: 37 LOST: 2 GAINED: 0	2016-12-05 14:00:35 -08:00
Tim Rowley	0c70b26a2d	swr: mark PIPE_CAP_NATIVE_FENCE_FD unsupported Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-12-05 13:42:39 -06:00
Tim Rowley	efc3ca64ba	swr: include llvm version and vector width in renderer string Uses llvmpipe's string formating. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-12-05 13:42:39 -06:00
Tim Rowley	b035d9cab5	gallivm: use getHostCPUFeatures on x86/llvm-4.0+. Use llvm provided API based on cpuid rather than our own manually mantained list of mattr enabling/disabling. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2016-12-05 13:42:39 -06:00
Juan A. Suarez Romero	48416b6f4d	st/va: declare vlVaBuffer before vlVaContext And declare coded_buf in vlVaContext as "vlVaBuffer " instead of "struct vlVaBuffer ". This fixes several warnings later about assignment from incompatible pointer type. Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2016-12-05 17:03:57 +00:00
Juan A. Suarez Romero	5a585d019e	st/va: remove unused variable pbuff Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Elie Tournier <tournier.elie@gmail.com>	2016-12-05 17:03:56 +00:00
Emil Velikov	510722d146	st/va: automake: cleanup C{PP,}FLAGS Remove some transitional left overs from the gallium pipe-loader rework and kill off unneeded AM_CPPFLAGS. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-12-05 17:03:56 +00:00
Rob Clark	8ca14b04e1	add EGL_TEXTURE_EXTERNAL_WL to WL_bind_wayland_display spec Signed-off-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com>	2016-12-05 16:01:21 +00:00
Emil Velikov	d09da32cfa	docs: add news item and link release notes for 12.0.5 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2016-12-05 15:42:58 +00:00

... 99 100 101 102 103 ...

92185 commits