fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 13:20:14 +01:00

Author	SHA1	Message	Date
Bruce Cherniak	32c1a54bd0	swr: Limit memory held by defer deleted resources. This patch limits the number of items on the fence work queue (the deferred deletion list) by submitting a sync fence when the queue size exceeds a threshold. This initiates deferred deletion of all resources on the list and decreases the total amount of memory held waiting for "deferred deletion". This resolves bug 101467 filed against swr for the piglit streaming-texture-leak test. For those running on smaller memory (16GB?) systems, this will prevent oom-killer. Thus far, we have not seen any real world applications that exhibit behavior like the streaming-texture-leak test; as any form of pipeline flush will trigger the defer queue and properly free any retained allocations. But, this addresses those as well. Cc: "17.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2017-07-02 17:38:57 -05:00
Tim Rowley	ab564c7ab4	swr/rast: move default split size from driver to rasterizer Reviewed-by: Bruce Cherniak <bruce.cherniak at intel.com>	2017-06-30 13:26:19 -05:00
Samuel Pitoiset	973822bcee	gallium: add PIPE_CAP_BINDLESS_TEXTURE Whether bindless texture operations are supported by the underlying driver. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-06-14 10:04:36 +02:00
Lyude	467af445a3	gallium: Add a cap to check if the driver supports ARB_post_depth_coverage Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2017-06-02 23:19:22 -04:00
Marek Olšák	50189379fa	gallium: add PIPE_CAP_ALLOW_MAPPED_BUFFERS_DURING_EXECUTION for skipping mapped-buffer checking in every GL draw call Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-05-17 20:28:44 +02:00
Marek Olšák	70dcb7377d	gallium: add PIPE_CAP_CAN_BIND_CONST_BUFFER_AS_VERTEX The next patch will use it. This is really for svga and GL2-level drivers. Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-05-10 19:29:08 +02:00
Bruce Cherniak	f52e63069a	swr: move msaa resolve to generalized StoreTile v3: list piglit tests fixed by this patch. Fixed typo Tim pointed out. v2: Reword commit message to more closely adhere to community guidelines. This patch moves msaa resolve down into core/StoreTiles where the surface format conversion routines are available. The previous "experimental" resolve was limited to 8-bit unsigned render targets. This fixes a number of piglit msaa tests by adding resolve support for all the render target formats we support. Specifically: layered-rendering/gl-layer-render: fail->pass layered-rendering/gl-layer-render-storage: fail->pass multisample-formats [2,4,8,16] gl_arb_texture_rg: crash->pass multisample-formats [2,4,8,16] gl_ext_texture_snorm: crash->pass multisample-formats [2,4,8,16] gl_arb_texture_float: fail->pass multisample-formats [2,4,8,16] gl_arb_texture_rg-float: fail->pass MSAA is still disabled by default, but can be enabled with "export SWR_MSAA_MAX_COUNT=4" (1,2,4,8,16 are options) The default is 0, which is disabled. This patch improves the number of multisample-formats supported by swr, and fixes several crashes currently in the 17.1 branch. Therefore, it should be considered for inclusion in the 17.1 stable release. Being disabled by default, it poses no risk to most users of swr. Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com> cc: mesa-stable@lists.freedesktop.org	2017-05-08 14:45:40 -05:00
Nicolai Hähnle	17f24a9b75	gallium: add PIPE_CAP_TGSI_TES_LAYER_VIEWPORT Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2017-04-14 22:49:44 +02:00
Bruce Cherniak	1832ef6cd9	swr: Enable MSAA in OpenSWR software renderer This patch enables multisample antialiasing in the OpenSWR software renderer. MSAA is a proof-of-concept/work-in-progress with bug fixes and performance on the way. We wanted to get the changes out now to allow several customers to begin experimenting with MSAA in a software renderer. So as not to impact current customers, MSAA is turned off by default - previous functionality and performance remain intact. It is easily enabled via environment variables, as described below. It has only been tested with the glx-lib winsys. The intention is to enable other state-trackers, both Windows and Linux and more fully support FBOs. There are 2 environment variables that affect behavior: * SWR_MSAA_FORCE_ENABLE - force MSAA on, for apps that are not designed for MSAA... Beware, results will vary. This is mainly for testing. * SWR_MSAA_MAX_SAMPLE_COUNT - sets maximum supported number of samples (1,2,4,8,16), or 0 to disable MSAA altogether. (The default is currently 0.) Reviewed-by: George Kyriazis <george.kyriazis@intel.com>	2017-04-14 15:22:45 -05:00
Bruce Cherniak	91a7f0b3af	swr: Removed unnecessary PIPE_BIND flags from swr_is_format_supported Removed unnecessary and probably wrong PIPE_BIND_SCANOUT and PIPE_BIND_SHARED flags in favor of check on single PIPE_BIND_DISPLAY_TARGET flag. Reference llvmpipe change <bee4c7718a3bd57e3d99f0913d9081cd13fe5fd> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2017-04-14 15:22:44 -05:00
Tim Rowley	9a7b257450	swr: return true for PIPE_CAP_DOUBLES Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2017-04-11 13:16:43 -05:00
Tim Rowley	7bd5057fd1	swr: fix unused variable warnings Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-04-07 16:50:41 -05:00
Nicolai Hähnle	d3e6f6d7f7	gallium: add PIPE_CAP_TGSI_BALLOT Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 15:29:31 +02:00
Nicolai Hähnle	d6e6fa01a5	gallium: add sparse buffer interface and capability v2: - explain the resource_commit interface in more detail Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-04-05 10:37:04 +02:00
Lyude	ffe2bd676f	gallium: Add a cap to check if the driver supports fill_rectangle Changes since v1: - Add pipe caps for etnaviv, freedreno, swr and virgl Signed-off-by: Lyude <lyude@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2017-03-31 21:41:24 -04:00
Nicolai Hähnle	d0c7f924a3	gallium: add PIPE_CAP_TGSI CLOCK Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-03-31 07:56:25 +02:00
Marek Olšák	bf3cdf0fd3	gallium: add PIPE_CAP_TGSI_TEX_TXF_LZ	2017-03-15 18:17:41 +01:00
Brian Paul	637e5719b5	gallium: s/unsigned/enum pipe_shader_type/ for pipe_screen::get_shader_param() Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2017-03-08 08:50:20 -07:00
Tim Rowley	f1d7284117	swr: implement geometry shaders Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-03-05 07:33:49 -06:00
Bruce Cherniak	dd649a541d	swr: enable clear_texture with util_clear_texture Passes corresponding piglit tests. Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2017-03-02 13:39:52 -06:00
Marek Olšák	4a883966c1	gallium: remove PIPE_CAP_USER_INDEX_BUFFERS all drivers support it Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Brian Paul <brianp@vmware.com> Tested-by: Brian Paul <brianp@vmware.com> (VMware driver only)	2017-02-25 00:03:09 +01:00
Ilia Mirkin	b090033087	gallium: add separate PIPE_CAP_INT64_DIVMOD Nouveau does not currently have logic to implement this as a library function. Even though such a library could be written, there's no big advantage to do it that way for now given that int64 is a very uncommon use-case. Allow a driver to expose INT64 without supporting division and modulo operations. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-02-09 12:57:21 -05:00
Dave Airlie	f804506d4d	gallium: Add integer 64 capability v1.1: move to using a normal CAP. (Marek) v2: fill in the cap everywhere Signed-off-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2017-01-27 10:19:25 +01:00
Ilia Mirkin	6e40938fbc	gallium: add PIPE_CAP_TGSI_MUL_ZERO_WINS Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Axel Davy <axel.davy@ens.fr>	2017-01-23 20:36:47 -05:00
Ilia Mirkin	ee3ebe68f9	gallium: add PIPE_CAP_TGSI_FS_FBFETCH Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com>	2017-01-16 21:13:09 -05:00
George Kyriazis	a61528fa33	Always defer memory free in swr_resource_destroy Defer delete on regular resources. This ensures that any work being done on the resource is completed before freeing up the resource's memory. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-01-12 09:10:15 -06:00
Tim Rowley	33fa4c99f7	swr: [rasterizer core/common/jitter] gl_double support Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=99214 Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2017-01-05 14:10:36 -06:00
George Kyriazis	36ad826548	swr: fix windows build break wrap lp_bld_type.h around extern "C". Windows decorates global variables, so when used from .cpp files, need to use an undecorated version. Also, removed related and unneeded code from swr_screen.cpp Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2017-01-05 07:30:18 -06:00
Marek Olšák	e51baeb6c1	gallium: add PIPE_CAP_GLSL_OPTIMIZE_CONSERVATIVELY Drivers with good compilers don't need aggressive optimizations before TGSI. Reviewed-by: Eric Anholt <eric@anholt.net>	2017-01-05 13:07:12 +01:00
Bruce Cherniak	79b66ec05e	swr: Implement fence attached work queues for deferred deletion. Work can now be added to fences and triggered by fence completion. This allows for deferred resource deletion, and other asynchronous tasks. Reviewed-by: George Kyriazis <george.kyriazis@intel.com>	2016-12-16 11:29:02 -06:00
Tim Rowley	2a127b780b	swr: [rasterizer common/core/jitter] fetch support for GL_FIXED v2: use fmul(1/65536) instead of fdiv(65535) Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-12-09 16:20:13 -06:00
Tim Rowley	0c70b26a2d	swr: mark PIPE_CAP_NATIVE_FENCE_FD unsupported Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-12-05 13:42:39 -06:00
Tim Rowley	efc3ca64ba	swr: include llvm version and vector width in renderer string Uses llvmpipe's string formating. Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-12-05 13:42:39 -06:00
Ilia Mirkin	02b2efa5eb	swr: properly report max number of SO components The components count the number of individual values, not the number of slots. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-30 20:35:56 -05:00
Ilia Mirkin	d8ce8acdfa	swr: don't advertise stream pause/resume There is no support for resuming streamout. Furthermore, this also controls glDrawTransformFeedback functionality which requires the same ability to query how many primitives were sent out of TF. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-30 20:35:43 -05:00
Nicolai Hähnle	611166b8ed	gallium: add PIPE_CAP_TGSI_CAN_READ_OUTPUTS Drivers that support this benefit by saving one lowering pass in the GLSL-to-TGSI conversion. radeonsi already supports this because all outputs are stored in temporary variables before the export (except for TCS outputs, which have always been readable in TGSI anyway due to their special semantics). Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-11-30 09:09:50 +01:00
Ilia Mirkin	8ed703cfa6	swr: add missing rgbx8_srgb variant Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-29 20:54:57 -05:00
Ilia Mirkin	d6a06228a6	swr: reorder renderable formats, add grouping comments Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-29 20:54:54 -05:00
Ilia Mirkin	86f7932b1e	swr: enable cubemap arrays Everything is in place for these. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-29 20:54:46 -05:00
Ilia Mirkin	8dd9853516	swr: rearrange caps into limits/supported/unsupported groups I find this a lot more readable and compact - much easier to scan through the list and see what's on and what's off. No functional change intended. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-29 20:54:43 -05:00
Ilia Mirkin	2234a4330e	swr: report a reasonable max lod bias This is the same value that llvmpipe uses. Since swr uses the same sampler logic, makes sense for this value to also be the same. Most applications don't care. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-22 20:27:20 -05:00
Ilia Mirkin	2b7bdff83f	swr: avoid using exceptions for expected condition handling I was getting a weird segfault from GCC 4.9.3: 0x00007ffff54f27aa in strlen () from /lib64/libc.so.6 (gdb) bt #0 0x00007ffff54f27aa in strlen () from /lib64/libc.so.6 #1 0x00007ffff4f128e5 in get_cie_encoding (cie=cie@entry=0x7ffff6e09813) at /gcc-4.9.3/libgcc/unwind-dw2-fde.c:272 #2 0x00007ffff4f1318e in classify_object_over_fdes (ob=ob@entry=0xd7bb90, this_fde=0x7ffff7f11010) at /gcc-4.9.3/libgcc/unwind-dw2-fde.c:628 #3 0x00007ffff4f135ba in init_object (ob=0xd7bb90) at /gcc-4.9.3/libgcc/unwind-dw2-fde.c:749 #4 search_object (ob=ob@entry=0xd7bb90, pc=pc@entry=0x7ffff4f11f4d <_Unwind_RaiseException+61>) at /gcc-4.9.3/libgcc/unwind-dw2-fde.c:961 #5 0x00007ffff4f13e62 in _Unwind_Find_registered_FDE (bases=0x7fffffffd358, pc=0x7ffff4f11f4d <_Unwind_RaiseException+61>) at /gcc-4.9.3/libgcc/unwind-dw2-fde.c:1025 #6 _Unwind_Find_FDE (pc=0x7ffff4f11f4d <_Unwind_RaiseException+61>, bases=bases@entry=0x7fffffffd358) at /gcc-4.9.3/libgcc/unwind-dw2-fde-dip.c:450 #7 0x00007ffff4f11197 in uw_frame_state_for (context=context@entry=0x7fffffffd2b0, fs=fs@entry=0x7fffffffd100) at /gcc-4.9.3/libgcc/unwind-dw2.c:1245 #8 0x00007ffff4f11b15 in uw_init_context_1 (context=context@entry=0x7fffffffd2b0, outer_cfa=outer_cfa@entry=0x7fffffffd660, outer_ra=0x7ffff518d23b <__cxa_throw+91>) at /gcc-4.9.3/libgcc/unwind-dw2.c:1566 #9 0x00007ffff4f11f4e in _Unwind_RaiseException (exc=0xd7c250) at /gcc-4.9.3/libgcc/unwind.inc:88 #10 0x00007ffff518d23b in __cxa_throw () from /usr/lib/gcc/x86_64-pc-linux-gnu/4.9.3/libstdc++.so.6 #11 0x00007ffff51ed556 in std::__throw_out_of_range(char const*) () from /usr/lib/gcc/x86_64-pc-linux-gnu/4.9.3/libstdc++.so.6 #12 0x00007fffea778be0 in std::map<pipe_format, SWR_FORMAT, std::less<pipe_format>, std::allocator<std::pair<pipe_format const, SWR_FORMAT> > >::at ( this=0x7fffebeb4c40 <mesa_to_swr_format(pipe_format)::mesa2swr>, __k=@0x7fffffffd73c: PIPE_FORMAT_RGTC1_UNORM) at /usr/lib/gcc/x86_64-pc-linux-gnu/4.9.3/include/g++-v4/bits/stl_map.h:549 #13 0x00007fffea776aee in mesa_to_swr_format (format=PIPE_FORMAT_RGTC1_UNORM) at swr_screen.cpp:597 We can just void this whole issue by not using exceptions in the first place. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-11-22 20:27:20 -05:00
Ilia Mirkin	946a7abd1c	swr: remove formats from mapping table that don't have StoreTile impls This table exists for the purpose of determining renderable formats. Without a StoreTile implementation, that can't happen. This basically removes rendering support to all L/LA/I formats. They can be re-added when/if StoreTile implementations are added. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-11-22 20:27:20 -05:00
Ilia Mirkin	2e12d2ba72	swr: remove unnecessary -1 entries in format mapping table Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-11-22 20:27:20 -05:00
Ilia Mirkin	7cfb364b1a	swr: rework resource layout and surface setup This is a bit of a mega-commit, but unfortunately there's no great way to break this up since a lot of different pieces have to match up. Here we do the following: - change surface layout to match swr's Load/StoreTile expectations - fix sampler settings to respect all sampler view parameters - fix stencil sampling to read from secondary resource - respect pipe surface format, level, and layer settings - fix resource map/unmap based on the new layout logic - fix resource map/unmap to copy proper parts of stencil values in and out of the matching depth texture These fix a massive quantity of piglits, including all the tex-miplevel-selection ones. Note that the swr native miptree layout isn't extremely space-efficient, and we end up using it for all textures, not just the renderable ones. A back-of-the-envelope calculation suggests about 10%-25% increased memory usage for miptrees, depending on the number of LODs. Single-LOD textures should be unaffected. There are a handful of regressions as a result of this change: - Some textureGrad tests, these failures match llvmpipe. (There are debug settings allowing improved gallivm sampling accurancy.) - Some layered clearing tests as swr doesn't currently support that. It was getting lucky before because enough other things were broken. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-11-22 20:27:20 -05:00
Ilia Mirkin	c3dd5b2e3f	swr: don't claim to allow setting layer/viewport from VS This may ultimately be possible to support, but for now it's not hooked up and the swr core only supports this output from GS. This normally wouldn't matter, but we lie about supporting GL 3.2, and also the blitter and st/mesa will make use of this functionality if claimed. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-21 21:11:26 -05:00
George Kyriazis	87bd28210f	swr: renamed duplicate swr_create_screen() There are 2 swr_create_screen() functions. One in swr_loader.cpp, which is used during driver init, and the other is hiding in swr_screen.cpp, which ends up in the arch-specific .dll/.so. Rename the second one to swr_create_screen_internal(), to avoid confusion in header files. Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2016-11-21 12:44:46 -06:00
George Kyriazis	974d280e81	swr: Handle windows.h and NOMINMAX Reorder header files so that we have a chance to defined NOMINMAX before mesa include files include windows.h v3: split from bigger patch Reviewed-by: Emil Velikov <emil.velikov@collabora.com>	2016-11-21 12:44:46 -06:00
Ilia Mirkin	dafffd2f11	swr: mark color clamping as unsupported There is no functionality in swr to clamp either vertex or frag colors. This could be added in swr_shader, at which point these could be re-enabled. Fixes arb_color_buffer_float-render Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Tim Rowley <timothy.o.rowley@intel.com>	2016-11-15 20:26:32 -05:00
Ilia Mirkin	2f19a974a5	swr: mark rgb9_e5 as unrenderable The support in swr requires shaders to output the components as UINTs. This is not how GL or Gallium work, and since this is not a required-renderable format, just leave it out. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Bruce Cherniak <bruce.cherniak@intel.com>	2016-11-15 20:25:35 -05:00

1 2

77 commits