fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 05:08:08 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	bc13e5f42a	docs: Add GL_KHR_blend_equation_advanced to relnotes.	2016-08-26 13:17:22 -07:00
Mario Kleiner	2cc880cba5	r600: increase performance for DRI PRIME offloading if 2nd GPU is Evergreen+ This is a direct port of Marek Olšáks patch "radeonsi: increase performance for DRI PRIME offloading if 2nd GPU is CIK or VI" to r600. It uses SDMA for the detiling blit from renderoffload VRAM to GTT, as SDMA is much faster for tiled->linear blits from VRAM to GTT. Testing on a dual Radeon HD-5770 setup reduced the time for the render offload gpu to get its rendering into system RAM from approximately 16 msecs for simple rendering at 1920x1080 pixel 32 bpp to 5 msecs, a > 3x speedup! This was measured using ftrace to trace the time the radeon kms driver waited on the dmabuf fence of the renderoffload gpu to complete. All in all this brought the time for a flip down from 20 msecs to 9 msecs, so the prime setup can display at full 60 fps instead of barely 30 fps vsync'ed. The current r600 implementation supports SDMA on Evergreen and later, but not R600/R700 due to some bugs apparently present in their SDMA implementation. Signed-off-by: Mario Kleiner <mario.kleiner.de@gmail.com> Cc: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2016-08-26 19:57:21 +02:00
Jordan Justen	7970238fcf	docs: Update stencil texturing & ES 3.1 status for i965 Haswell Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	93f5eb7ae7	i965: Enable OpenGLES 3.1 for Haswell Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	116b6e12d4	i965: Enable ARB_texture_stencil8 for Haswell Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	f20f616324	i965: Enable ARB_stencil_texturing for Haswell Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	751682434e	i965/gen7: Use R8_UINT stencil copy when sampling the stencil texture v2: * Check gen <= 7, rather than gen == 7. (Ian) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	8d78b096f8	i965/gen7: Copy stencil when sampling the stencil texture Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	7af51b8f03	i965: Add function to copy a stencil miptree to an R8_UINT miptree v2: * Cleanups suggested by Ian, Matt and Topi Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	c8194dc737	i965: Track that the stencil data was updated when using Tex*Image Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	101b56bab2	i965: Track that the stencil data was updated when rendering Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	7bd87c1e6e	i965: Track that the stencil data was updated when clearing Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	2a9c65a01d	i965/gen7: Add R8_UINT stencil miptree copy for sampling For gen < 8, we can't sample from the stencil buffer, which is required for the ARB_stencil_texturing extension. We'll make a copy of the stencil data into a new texture that we can sample using the R8_UINT surface type. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	91627d1956	i965: Fix assert with multisampling and cubemaps Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	b82bb98441	i965/hsw: Adjust uploading default color for stencil surfaces v2: * has_component (Ken); const bits_per_channel (Topi) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	30fee52036	i965/hsw: Don't advertise more than 64 threads for compute shaders thread_width_max in the GPGPU walker command limits us to a maximum of 64 threads. This fixes a crash on Haswell in the OpenGLES 3.1 conformance test suite which tests the advertised limits of the max invocation counts. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	861c9cbee3	main: Add MESA_VERBOSE=api support for glClearStencil Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Jordan Justen	9a1f950bef	main: Add MESA_VERBOSE=api support for glTexImage Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-08-26 10:09:22 -07:00
Charmaine Lee	0035f7f136	svga: add guest statistic gathering interface This file was supposed to be added with the previous "svga: add guest statistic gathering interface" patch but went MIA for some reason. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-26 08:04:02 -06:00
Marek Olšák	49c798e902	radeonsi: disable CE on SI + AMDGPU Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	281f1a5980	winsys/amdgpu: disable IB chaining on SI Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	a6869e7c06	winsys/amdgpu: finish up SI addrlib integration Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2016-08-26 15:50:10 +02:00
Ronie Salgado	97b55243fb	winsys/amdgpu: initial SI support Signed-off-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2016-08-26 15:50:10 +02:00
Marek Olšák	971ef7518f	gallium/radeon: add a driver query for AMDGPU_INFO_NUM_EVICTIONS If the kernel driver doesn't support it, it returns 0. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	7172906c0c	radeonsi: fix printing shaders and states on a VM fault This was missed while rewriting the PIPE_DUMP flags. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	5ee3cac138	radeonsi: increase performance for DRI PRIME offloading if 2nd GPU is CIK or VI SDMA is much faster for tiled->linear blits from VRAM to GTT. I have Bonaire in my second PCIe slot. $ glxinfo \| grep OpenGL.renderer OpenGL renderer string: Gallium 0.4 on AMD TONGA ... $ DRI_PRIME=1 glxinfo \| grep OpenGL.renderer OpenGL renderer string: Gallium 0.4 on AMD BONAIRE ... Without SDMA: $ DRI_PRIME=1 glxgears 8796 frames in 5.0 seconds = 1759.074 FPS 8899 frames in 5.0 seconds = 1779.672 FPS With SDMA: $ DRI_PRIME=1 glxgears 12765 frames in 5.0 seconds = 2552.788 FPS 12888 frames in 5.0 seconds = 2577.495 FPS The 1st GPU is irrelevant. The improvement should be much lower at 60 fps, but definitely measurable. SI will get this once we add SDMA blit support for it. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	0241d8300f	radeonsi: enable SDMA on CIK It passes R600_DEBUG=testdma on Bonaire/radeon. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	bcfd49e511	gallium/radeon: increase priority for shader binaries Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Marek Olšák	c3f716fe67	gallium/radeon: merge USER_SHADER and INTERNAL_SHADER priority flags there's no reason to separate these Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2016-08-26 15:50:10 +02:00
Miklós Máté	b9ac72b511	vbo: set draw_id Fixes conditional jump depending on uninitialized value in si_state_draw.c:593 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Miklós Máté <mtmkls@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-26 07:34:22 -06:00
Neha Bhende	10f6e08549	svga: fix regression related to srgb This regression is caused because of commit `3190c7ee97` Regression caused by following OpenGL 4.4 spec rules relates to GL_FRAMEBUFFER_SRGB in Mesa. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:52 -06:00
Neha Bhende	3b7341d547	svga: use local variable blit instead of pointer Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:52 -06:00
Brian Paul	b09e4ab13c	svga: s/INDEX_0D/INDEX_IMMEDIATE32/ Both are zero, but the later is the right token.	2016-08-26 06:19:52 -06:00
Brian Paul	93779b87a1	svga: add comment about unsupported blend modes	2016-08-26 06:19:52 -06:00
Charmaine Lee	b1772651b7	svga: fix ordering of mksstats counter strings String for SVGA_STATS_COUNT_TEXREADBACK was swapped with the string for SVGA_STATS_COUNT_SURFACEWRITEFLUSH. Trivial fix.	2016-08-26 06:19:52 -06:00
Charmaine Lee	2781d60375	svga: avoid emitting redundant SetShaderResource command Tested with Lightsmark2008, Heaven, MTT piglit, glretrace, viewperf, conform. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-26 06:19:52 -06:00
Charmaine Lee	5313b294e6	svga: add a cleanup function to clean up sampler state This patch adds a cleanup function to clean up sampler state at context destruction time. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-26 06:19:52 -06:00
Brian Paul	e292f38c6c	svga: loosen the condition to flush in get_query_result_vgpu10() Fixes piglit spec/ext_transform_feedback/overflow-edge-cases segfaults because the query's fence pointer was null. Tested with Piglit, Sauerbraten, ETQW. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:52 -06:00
Brian Paul	99d8fe20ab	svga: fix vgpu10 query fencing We don't want to flush the command buffer or sync on the fence when ending a query (that kind of defeats the whole purpose of async queries). Do that instead in get_query_result(). Tested with Piglit, arbocclude, Sauerbraten game, Nobel Clinician Viewer, ETQW. Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:52 -06:00
Charmaine Lee	3f51a3f6ac	svga: avoid emitting redundant DXSetSamplers command This patch avoid emitting redundant DXSetSamplers command. Tested with Lightsmark2008, Heaven, MTT piglit, glretrace, viewperf. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-26 06:19:52 -06:00
Neha Bhende	6a43148e20	svga: enable ARB_clear_texture extension in the driver. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:52 -06:00
Neha Bhende	2111795d51	svga: define svga_clear() in svga_init_clear_functions() Put all the clearing related functions in svga_init_clear_functions() Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:51 -06:00
Neha Bhende	40557ae07c	svga: add svga_init_clear_functions() define svga_init_clear_functions() and svga_clear_texture as svga->pipe.clear_texture. This is part of ARB_clear_texture extension Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:51 -06:00
Neha Bhende	52d88b67be	svga: add new function svga_clear_texture() To clear texture this function can be used. This is part of ARB_clear_texture extension. Basically this extension allows you to clear texture with given color values. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:51 -06:00
Neha Bhende	1da538f85b	svga: add new begin_blit() Saving all blitter states will be done in begin_blit() so that begin_blit() can be used before performing any blit operation. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Charmaine Lee <charmainel@vmware.com>	2016-08-26 06:19:51 -06:00
Charmaine Lee	a5fd54f8bf	svga: add opt to the list of valid build types For opt build, add VMX86_STATS to the list of cpp defines. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-26 06:19:51 -06:00
Charmaine Lee	2e1cfcc431	svga: add guest statistic gathering interface With this patch, guest statistic gathering interface is added to svga winsys interface that can be used to gather svga driver statistic. The winsys module can then share the statistic info with the VMX host via the mksstats interface. The statistic enums used in the svga driver are defined in svga_stats_count and svga_stats_time in svga_winsys.h Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-26 06:19:51 -06:00
Charmaine Lee	4791991808	svga: fix indirect non-indexable temp access If the shader has indirect access to non-indexable temporaries, convert these non-indexable temporaries to indexable temporary array. This works around a bug in the GLSL->TGSI translator. Fixes glsl-1.20/execution/fs-const-array-of-struct-of-array.shader_test on DX11Renderer. Reviewed-by: Brian Paul <brianp@vmware.com>	2016-08-26 06:19:51 -06:00
Brian Paul	d221a6545c	gallium/hud: move signo declaration inside PIPE_OS_UNIX block To silence unused var warning with MSVC, MinGW. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-08-26 06:19:51 -06:00
Chris Wilson	f92a87a140	i965: Embrace "unlimited" GTT mmap support From about kernel 4.9, GTT mmaps are virtually unlimited. A new parameter, I915_PARAM_MMAP_GTT_VERSION, is added to advertise the feature so query it and use it to avoid limiting tiled allocations to only fit within the mappable aperture. A couple of caveats: - fence support is still limited by stride to 262144 and the stride needs to be a multiple of tile_width (as before, and same limitation as the current 3D pipeline in hardware) - the max_gtt_map_object_size forcing untiled may be hiding a few bugs in handling of large objects, though none were spotted in piglits. See kernel commit 4cc6907501ed ("drm/i915: Add I915_PARAM_MMAP_GTT_VERSION to advertise unlimited mmaps"). v2: Include some commentary on mmap virtual space vs CPU addressable space. Signed-off-by: Chris Wilson <chris@chris-wilson.co.uk> Cc: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Daniel Vetter <daniel.vetter@ffwll.ch>	2016-08-26 09:09:34 +01:00

1 2 3 4 5 ...

84329 commits