fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 02:30:12 +01:00

Author	SHA1	Message	Date
Nicolai Hähnle	a0970de839	configure.ac: require libdrm_amdgpu 2.4.77 The sparse buffer implementation requires amdgpu_bo_va_op_raw. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:30:42 +02:00
Matt Turner	d5ee55f028	mesa: Replace program locks with atomic inc/dec. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-04-05 14:54:49 +10:00
Jason Ekstrand	060a6434ec	anv: Advertise larger heap sizes Instead of just advertising the aperture size, we do something more intelligent. On systems with a full 48-bit PPGTT, we can address 100% of the available system RAM from the GPU. In order to keep clients from burning 100% of your available RAM for graphics resources, we have a nice little heuristic (which has received exactly zero tuning) to keep things under a reasonable level of control. Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	651ec926fc	anv: Add support for 48-bit addresses This commit adds support for using the full 48-bit address space on Broadwell and newer hardware. Thanks to certain limitations, not all objects can be placed above the 32-bit boundary. In particular, general and state base address need to live within 32 bits. (See also Wa32bitGeneralStateOffset and Wa32bitInstructionBaseOffset.) In order to handle this, we add a supports_48bit_address field to anv_bo and only set EXEC_OBJECT_SUPPORTS_48B_ADDRESS if that bit is set. We set the bit for all client-allocated memory objects but leave it false for driver-allocated objects. While this is more conservative than needed, all driver allocations should easily fit in the first 32 bits of address space and keeps things simple because we don't have to think about whether or not any given one of our allocation data structures will be used in a 48-bit-unsafe way. Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	439da38d18	anv: Replace anv_bo::is_winsys_bo with a uint32_t flags Reviewed-by: Kristian H. Kristensen <krh@bitplanet.net>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	f938354362	i965/blorp: Align vertex buffers to 64B Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	5d1ba2cb04	anv/blorp: Align vertex buffers to 64B This fixes issues seen when adding support for full 48-bit addresses. The 48-bit addresses themselves have nothing to do with it other than that it caused the kernel to place buffers slightly differently so they interacted differently with the caches. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	c964f0e485	anv: Query the kernel for reset status When a client causes a GPU hang (or experiences issues due to a hang in another client) we want to let it know as soon as possible. In particular, if it submits work with a fence and calls vkWaitForFences or vkQueueQaitIdle and it returns VK_SUCCESS, then the client should be able to trust the results of that rendering. In order to provide this guarantee, we have to ask the kernel for context status in a few key locations. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-04 18:33:52 -07:00
Jason Ekstrand	82573d0f75	anv: Check for device loss at the end of WaitForFences It's possible that the device could have been lost while we were waiting. We should let the user know if this has happened. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-04 18:33:51 -07:00
Jason Ekstrand	c6f69eea6a	anv/pipeline: Properly handle unset gl_Layer and gl_ViewportIndex When the shader does not set one of these values, they are supposed to get a default value of 0. We have hardware bits in 3DSTATE_CLIP for this but haven't been setting them. This fixes the intermittent failure of dEQP-VK.geometry.layered.3d.render_to_default_layer. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:51 -07:00
Jason Ekstrand	3503b2714b	i965/fs: Always provide a default LOD of 0 for TXS and TXL We already provide a default LOD for textureQueryLevels and texture() on non-fragment stages. However, there are more cases where one is needed such as textureSize(gsampler2DMS*) in SPIR-V. Instead of trying to list out all of the cases one at a time, just provide the default for all TXS and TXL operations. This fixes a shader validation error in the new Sascha deferredmultisampling demo which uses textureSize(gsampler2DMS). Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100391 Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Cc: "13.0 17.0" <mesa-stable@lists.freedesktop.org>	2017-04-04 18:33:35 -07:00
Kenneth Graunke	c5bf7cb529	mesa: Require mipmap completeness for glCopyImageSubData(), sometimes. This patch makes glCopyImageSubData require mipmap completeness when the texture object's built-in sampler object has a mipmapping MinFilter. Fixes (on i965): dEQP-GLES31.functional.debug.negative_coverage.*.buffer.copy_image_sub_data Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2017-04-04 17:35:18 -07:00
Vinson Lee	c161a10462	libgl-xlib: Link with libunwind. Fix linking error. CXXLD libGL.la ../../../../src/gallium/auxiliary/.libs/libgallium.a(u_debug_stack.o): In function `debug_backtrace_capture': src/gallium/auxiliary/util/u_debug_stack.c:59: undefined reference to `_Ux86_64_getcontext' src/gallium/auxiliary/util/u_debug_stack.c:60: undefined reference to `_ULx86_64_init_local' src/gallium/auxiliary/util/u_debug_stack.c:62: undefined reference to `_ULx86_64_step' src/gallium/auxiliary/util/u_debug_stack.c:71: undefined reference to `_ULx86_64_get_proc_info' src/gallium/auxiliary/util/u_debug_stack.c:73: undefined reference to `_ULx86_64_get_proc_name' src/gallium/auxiliary/util/u_debug_stack.c:65: undefined reference to `_ULx86_64_step' Fixes: `70c272004f` ("gallium/util: libunwind support") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100562 Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Rob Clark <robdclark@gmail.com>	2017-04-04 16:47:41 -07:00
Jason Ekstrand	1fde054b8f	intel/isl: Refactor and clerify gen8 alignment calculations Adding the actual table from the docs makes it clearer exactly what the restrictions are. In particular, it becomes clear that compressed textures ignore the alignment parameters in RENDER_SURFACE_STATE. Reviewed-by: Chad Versace <chadversary@chromium.org>	2017-04-04 14:51:57 -07:00
Francisco Jerez	0de17f52a5	drirc: Set glsl_zero_init for Kerbal Space Program. This fixes the stripes of garbage rendered on the floor of the vehicle assembly building among other rendering issues. The reason for the misrendering seems to be that some of the GLSL shaders used by the application use variables before initializing them, incorrectly assuming that they will be implicitly set to zero by the implementation. Acked-by: Matt Turner <mattst88@gmail.com>	2017-04-04 14:13:03 -07:00
Lionel Landwerlin	e8d9b76f63	intel: tools: add aubinator_error_decode tool This is pretty much the same tool as what i-g-t has, only with a more fancy decoding of the instructions/registers. It also doesn't support anything before gen4. v2 (from Matt): Drop authors Remove undefined automake variable v3: Fix incorrect offsets for dword > 1 (Jordan) v4: Fix decompression error with large blobs (Jordan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	567d77885e	intel: genxml: add RING_BUFFER_CTL registers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	6f260ff049	intel: genxml: add FAULT_REG register Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	ca2771fa18	intel: genxml: add gen7 ERR_INT register v2: add register to gen7.5 (Matt) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	84613bf6d5	intel: genxml: add ACTHD registers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	0f195f22aa	intel: genxml: add GFX_ARB_ERROR_RPT register Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Lionel Landwerlin	d1a7a54d77	intel: genxml: add INSTDONE registers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-04 21:22:26 +01:00
Marek Olšák	18b12bf533	targets: export radeon winsys_create functions to silence LLVM warning It silences the following radeonsi LLVM warning due to a previous commit adding an LLVM workaround: "mesa: for the -simplifycfg-sink-common option: may only occur zero or one times!" Cc: 17.0 <mesa-stable@lists.freedesktop.org> Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by; Emil Velikov <emil.velikov@collabora.com>	2017-04-04 22:15:47 +02:00
Constantine Kharlamov	6ee486899b	r600g: check rasterizer primitive states like in radeonsi Specifically, non-line primitives skipped, and defaulting to reset on each packet. The skip of non-line primitives saves ≈110 resetting of PA_SC_LINE_STIPPLE register per frame in Kane&Lynch2. Signed-off-by: Constantine Kharlamov <Hi-Angel@yandex.ru> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-04-04 22:15:47 +02:00
Constantine Kharlamov	7ade08e2a8	r600g: extract a code into a r600_emit_rasterizer_prim_state() Also change gs_output_prim type: unsigned → pipe_prim_type. The idea of the code is mostly taken from radeonsi. The new code operating on prev/curr rast_primitives saves ≈15 reloads of PA_SC_LINE_STIPPLE per frame in Kane&Lynch2 Signed-off-by: Constantine Kharlamov <Hi-Angel@yandex.ru> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-04-04 22:15:47 +02:00
Constantine Kharlamov	fa8bc90990	r600g/radeonsi: use the correct types (taken from pipe_draw_info) Note: si_shader.h has also "type" variable that should be changed to "enum pipe_prim_type", however it triggers a bunch of warnings about unhandled switches, so due not knowing the correct way to handle them, I decided to leave it as is. Signed-off-by: Constantine Kharlamov <Hi-Angel@yandex.ru> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-04-04 22:15:47 +02:00
Constantine Kharlamov	ef62a7651c	r600g: remove duplicate memset by using a pointer, and constify args Signed-off-by: Constantine Kharlamov <Hi-Angel@yandex.ru> Signed-off-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de>	2017-04-04 22:15:47 +02:00
Elie TOURNIER	ba5b1ab3e0	glsl: remove unused file udivmod64 appears in src/compiler/glsl/builtin_int64.h and src/compiler/glsl/udivmod.h The second file seems unused. Fix commit `6b03b345eb` This change doesn't affect shader-db. Signed-off-by: Elie Tournier <elie.tournier@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2017-04-04 18:37:42 +01:00
Marek Olšák	6ca46c3d77	radeonsi: access gallivm through ctx in most places Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-04-04 16:55:21 +02:00
Marek Olšák	04e4fe594b	radeonsi: use ctx->types instead of bld->types etc. even vec_type is f32. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-04-04 16:55:19 +02:00
Marek Olšák	7a5e6dcba5	radeonsi: use i32_0/1 instead of *int_bld.zero/one in most places Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-04-04 16:55:16 +02:00
Marek Olšák	7216e1d8af	gallium: decrease the size of pipe_draw_info - 88 -> 80 bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	295f4f56cb	gallium: decrease the size of pipe_vertex_element - 16 -> 8 bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	e6428092f5	gallium: decrease the size of pipe_resource - 64 -> 48 bytes Some other changes needed here. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	3dfe61ed6e	gallium: decrease the size of pipe_box - 24 -> 16 bytes Also: pipe_transfer: 48 -> 40 bytes. pipe_blit_info = 176 -> 160 bytes. v2: add a comment at pipe_box Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	9869a3b3ba	gallium: decrease the size of pipe_sampler_view - 48 -> 32 bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	4648bc2a8f	gallium: decrease the size of pipe_surface - 48 -> 40 bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	eb0fd0e5f8	gallium: decrease the size of pipe_framebuffer_state - 96 -> 80 bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	19bc74f513	gallium: decrease the size of pipe_stream_output_info - 532 -> 268 bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	15ff2f7aa9	gallium: decrease the size of pipe_rasterizer_state - 36 -> 32 bytes Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	18e760346a	amd/addrlib: second update for Vega10 + bug fixes Highlights: - Display needs tiled pitch alignment to be at least 32 pixels - Implement Addr2ComputeDccAddrFromCoord(). - Macro-pixel packed formats don't support Z swizzle modes - Pad pitch and base alignment of PRT + TEX1D to 64KB. - Fix support for multimedia formats - Fix a case "PRT" entries are not selected on SI. - Fix wrong upper bits in equations for 3D resource. - We can't support 2d array slice rotation in gfx8 swizzle pattern - Set base alignment for PRT + non-xor swizzle mode resource to 64KB. - Bug workaround for Z16 4x/8x and Z32 2x/4x/8x MSAA depth texture - Add stereo support - Optimize swizzle mode selection - Report pitch and height in pixels for each mip - Adjust bpp/expandX for format ADDR_FMT_GB_GR/ADDR_FMT_BG_RG - Correct tcCompatible flag output for mipmap surface - Other fixes and cleanups Acked-by: Alex Deucher <alexander.deucher@amd.com> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	3e7d62a774	radeonsi: use i32_0 and i32_1 more Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	29adaa19ac	radeonsi: remove most uses of lp_build_const* Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	7cec96a038	radeonsi: clean up 'radeon_bld' references Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-04-04 11:14:43 +02:00
Marek Olšák	0fb5a505fa	radeonsi: fix broken texture filtering on SI-CIK since GFX9 changes Don't clear state[7] on SI-CIK, and only do the meta stuff on VI+. Fixes: `5abf60076c` ("radeonsi/gfx9: image descriptor changes in mutable fields") Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=100531 Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-04-04 11:14:43 +02:00
Juan A. Suarez Romero	1bcdf74cdd	bin/get-fixes-pick-list.sh: fix typo Replace "nore" by "more". Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2017-04-04 09:05:44 +02:00
Mauro Rossi	72175bd2a5	android: intel: genxml: fix genX_xml.h generation rules Recent changes in Makefile.sources merged the aubinator files in a unique list of generated files and genxml/genX_xml.h is now needed to avoid the following building error: ninja: error: '.../genxml/genX_xml.h', needed by '.../genxml/genX_xml.h', missing and no known rule to make it build/core/ninja.mk:148: recipe for target 'ninja_wrapper' failed Fixes: `0f83c05` "intel: genxml: compress all gen files into one" Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-04-04 09:10:46 +03:00
Jason Ekstrand	405ef7bb33	intel/vec4: Add some fall through comments Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-04-03 16:58:35 -07:00
Bartosz Tomczyk	64b3aa7ad8	mesa/glthread: Avoid unnecessary batch reallocation Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2017-04-04 09:56:52 +10:00
Bas Nieuwenhuizen	6e5e8a2e49	radv: Increase descriptor limits. We supported more generally. Decreased the dynamic buffers though, as we only support 16 for uniform+storage. Signed-off-by: Bas Nieuwenhuizen <basni@google.com> Acked-by: Dave Airlie <airlied@redhat.com>	2017-04-04 01:47:47 +02:00

... 16 17 18 19 20 ...

91544 commits