fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-20 21:50:26 +01:00

Author	SHA1	Message	Date
Boris Brezillon	079aaa9c6d	panfrost: Get rid of the panfrost_driver abstraction leftovers Commit `5f81669d88` ("panfrost: Remove the panfrost_driver abstraction") left a few things behind, remove them now. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>	2019-07-02 14:57:35 +02:00
Boris Brezillon	6608642d21	panfrost: Move scanout res creation out of panfrost_resource_create() Which improves readability and help us avoid a memory leak. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>	2019-07-02 14:57:35 +02:00
Boris Brezillon	873b7b93e8	panfrost: Add the sampled texture BO to the job Otherwise we get random use-after-{free,unmap} errors. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> --- Changes in v2: - Move the panfrost_job_add_bo() call out of the loop	2019-07-02 14:57:35 +02:00
Samuel Pitoiset	6cc213b3c1	radv: enable DCC for layers on GFX8 It's currently only enabled if dcc_slice_size is equal to dcc_slice_fast_clear_size because the driver assumes that portions of multiple layers are contiguous but it's not always true. Still not supported on GFX9. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-02 09:38:02 +02:00
Samuel Pitoiset	233224c7f7	radv: do not enable DCC for mipmapped arrays because performance is worse Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-02 09:38:00 +02:00
Samuel Pitoiset	e41e575e24	radv: implement clearing DCC layers on GFX8 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-02 09:37:56 +02:00
Samuel Pitoiset	e47c68b7b0	radv: merge radv_dcc_clear_level() into radv_clear_dcc() This will help for clearing DCC arrays because we need to know the subresource range. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-02 09:37:51 +02:00
Samuel Pitoiset	f772fe6a11	radv: add support for decompressing DCC layers with compute Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-02 09:37:49 +02:00
Samuel Pitoiset	83297baf2d	ac: compute the DCC fast clear size per slice on GFX8 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-02 09:37:44 +02:00
Samuel Pitoiset	6517d226ac	ac: compute the size of one DCC slice on GFX8 Addrlib doesn't provide this info. Because DCC is linear, at least on GFX8, it's easy to compute the size of one slice. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-02 09:37:41 +02:00
Kenneth Graunke	457a55716e	iris: Defer closing and freeing VMA until buffers are idle. There will unfortunately be circumstances where we cannot re-use a virtual memory address until it's no longer active on the GPU. To facilitate this, we instead move BOs to a "dead" list, and defer closing them and returning their VMA until they are idle. We periodically sweep these away in cleanup_bo_cache, which triggers every time a new object's refcount hits zero. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Tested-by: Jordan Justen <jordan.l.justen@intel.com>	2019-07-02 07:23:55 +00:00
Kenneth Graunke	07f3455664	iris: Add an explicit alignment parameter to iris_bo_alloc_tiled(). In the future, some images will need to be aligned to a larger value than 4096. Most buffers, however, don't have any such requirement, so for now we only add the parameter to iris_bo_alloc_tiled() and leave the others with the simpler interface. v2: Fix missing alignment in vma_alloc, caught by Caio! Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Tested-by: Jordan Justen <jordan.l.justen@intel.com>	2019-07-02 07:23:55 +00:00
Iago Toral Quiroga	042aeffd5b	v3d: do not flush jobs that are synced with 'Wait for transform feedback' Generally, we achieve this by skipping the flush on calls to v3d_flush_jobs_writing_resource() when we detect that the resource is written in the current job from a transform feedback write. The exception to this is the case where the caller is about to map the resource, in which case we need to flush immediately since we can only emit 'Wait for transform feedback' commands on rendering jobs. We add a parameter to the function so the caller can identify that scenario. Reviewed-by: Eric Anholt <eric@anholt.net>	2019-07-02 08:57:20 +02:00
Iago Toral Quiroga	88cbc4f7f6	v3d: emit 'Wait for transform feedback' commands when needed The hardware can flush transform feedback writes before reads in the same job by inserting this command. This patch detects when the rendering state for the current draw call reads resources that had been previously written by transform feedback in the same job and inserts the 'Wait for transform feedback' command before emitting the new draw. v2 (Eric): - this was intended to look at job->tf_write_prscs for TF jobs. - clear job->tf_write_prscs after we emit the TF flush. - can skip flushes for fragment shader reads from TF. v3 (Eric): - all resources in job->tf_write_prscs are resources written by TF so we don't need to check if they are bound to PIPE_BIND_STREAM_OUTPUT. - documented optimization opportunity for geometry stages. Reviewed-by: Eric Anholt <eric@anholt.net>	2019-07-02 08:57:20 +02:00
Iago Toral Quiroga	c7dff0e614	v3d: keep track of resources written by transform feedback The hardware provides a feature to sync reads from previous transform feedback writes in the same job so if we use this mechanism we no longer have to flush the job. In order to identify this scenario we need a mechanism to identify resources that are written by transform feedback. v2: use _mesa_pointer_set_create (Eric) Reviewed-by: Eric Anholt <eric@anholt.net>	2019-07-02 08:57:20 +02:00
Mike Blumenkrantz	c8dcc308cc	st/dri: fix typo in format table for GR1616 format the dri image format here should match the fourcc format Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-01 15:17:10 -07:00
Mike Blumenkrantz	08fc14a979	st/dri: pass dri2_format_mapping directly to dri2_create_image_from_winsys this makes the entire struct available for use here Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-01 15:16:56 -07:00
Mike Blumenkrantz	2cc85670a7	mesa/st: simplify format usage in st_bind_egl_image the formats handled in the switch statement will always return an unknown mesa format, so process them directly and leave the default case for other/unknown formats no functional changes Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-01 15:16:43 -07:00
Kenneth Graunke	9b1b971491	iris: Use MI_COPY_MEM_MEM for tiny resource_copy_region calls. If our resource_copy_region size is a small number of DWords, then instead of firing up BLORP, we can simply use MI_COPY_MEM_MEM (after a CS stall). We also try and select the optimal batch. Improves performance in Shadow of Mordor on Low settings at 1920x1080 on Skylake GT4e by 0.689096% +/- 0.473968% (n=4). It tries to copy 4 bytes of data to a buffer which was most recently used as a writable compute shader SSBO. Previously we were switching from compute to the render pipeline, then firing up all of blorp_buffer_copy...for 4 bytes. I arbitrarily decided to support 4/8/12/16 bytes. Jason thinks this is about the right threshold where it's cheaper to use MI_COPY_MEM_MEM.	2019-07-01 13:59:49 -07:00
Bas Nieuwenhuizen	d7e6541cc7	radv: Only allocate supplied number of descriptors when variable. Fixes: `b5e04e9217` "radv: Support allocating variable size descriptor sets." Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111019 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-07-01 20:53:33 +02:00
Eric Engestrom	177c35bf13	egl: simplify loop Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Sagar Ghuge<sagar.ghuge@intel.com>	2019-07-01 19:35:22 +01:00
Eric Anholt	67ffb853f0	sparc: Reuse m_vector_asm.h. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-07-01 11:14:29 -07:00
Eric Anholt	20294dceeb	mesa: Enable asm unconditionally, now that gen_matypes is gone. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-01 11:14:10 -07:00
Eric Anholt	52a39a332f	mesa: Replace gen_matypes with a simple header for V4F/mat layout. We can greatly simplify our builds by just hardcoding GLvector4f and GLmatrix's layouts. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-07-01 11:12:15 -07:00
Eric Anholt	1738b38ce8	matypes: Drop some unused defines. Most of these haven't been used since the conversion from checked-in matypes to generation. By cutting down the generated contents, this should clarify why the file is generated: we need architecture-specific offsets to the V4F fields in the asm that uses it. v2: Keep matrix offsets to prevent x86 build breakage.. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-07-01 11:09:26 -07:00
Eric Engestrom	1835f30097	meson: drop duplicate source & inc_dir These two are already pulled from `idep_vulkan_util_headers`. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com>	2019-07-01 18:53:57 +01:00
Eric Engestrom	04e0ac59b1	swrast: simplify function pointer calls Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com>	2019-07-01 18:51:49 +01:00
Eric Engestrom	fbf7c38da3	egl/wayland: use bitset.h for `formats` bit set Currently only 7 formats are supported, but we don't want the 16 limit (it's an `unsigned`) to hit us by surprise :] Let's use bitset.h's BITSET magic to allow us to have any number of formats, with a static assert to make sure we don't forget to update it. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-07-01 18:35:54 +01:00
Sagar Ghuge	d5f63990b4	intel/tools: Add assembler unit tests for ROL/ROR instructions Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-01 10:14:22 -07:00
Sagar Ghuge	e9c35dd7cc	intel/tools: Add ROL/ROR support in assembler Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-01 10:14:22 -07:00
Sagar Ghuge	456557a837	nir: Add lower_rotate flag and set to true in all drivers Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-01 10:14:22 -07:00
Sagar Ghuge	1e92e83856	intel/compiler: Emit ROR and ROL instruction v2: Reorder patch (Matt Turner) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-01 10:14:22 -07:00
Sagar Ghuge	80117117bd	nir: Add optimization to use ROR/ROL instructions v2: 1) Add more optimization rules for ROL/ROR (Matt Turner) 2) Add lowering rules for ROL/ROR (Matt Turner) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-01 10:14:22 -07:00
Sagar Ghuge	81d342e2a1	nir: Add urol and uror opcodes Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-01 10:14:22 -07:00
Sagar Ghuge	83fdec0f0d	intel/compiler: Enable the emission of ROR/ROL instructions v2: 1) Drop changes for vec4 backend as on Gen11+ we don't support align16 mode (Matt Turner) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-01 10:14:22 -07:00
Alyssa Rosenzweig	8d74749f81	panfrost: Implement instanced rendering We implement GLES3.0 instanced rendering with full support for instanced arrays (via instance divisors). To do so, we use the new invocation helpers to invoke a triplet of (1, vertex_count, instance_count), rather than simply (1, vertex_count, 1). We rewrite the attribute handling code into a new pan_instancing.c file which handles both the simple LINEAR case for non-instanced as well as each of the new instancing cases: MODULO (for per-vertex attributes), POT and NPOT divisors. As a side effect, we rework how vertex buffers are handled, duplicating them to be 1:1 with vertex descriptors to simplify instancing code paths dramatically. This might be a performance regression, but this remains to be seen; if so, we can always deduplicate later with some added logic in pan_instancing.c Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-01 07:50:57 -07:00
Alyssa Rosenzweig	e9e22546ff	panfrost/decode: Compute padded_num_vertices for MODULO Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-01 07:49:18 -07:00
Alyssa Rosenzweig	9b97ed1250	panfrost/midgard: Emit type appropriate ld_vary Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-01 07:42:56 -07:00
Alyssa Rosenzweig	aa333ac6ad	panfrost/midgard: Add unsigned ld/st ops Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-01 07:42:55 -07:00
Alyssa Rosenzweig	bbc050b82e	panfrost/midgard: Use the appropriate ld_attr type Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-01 07:42:55 -07:00
Alyssa Rosenzweig	c9b164f9b5	panfrost: Implement dispatch helpers Rather than open-coding workgroups_shift_* type fields, we include a general routine for packing the vertex/tiler/compute descriptor based on the provided dispatch parameters. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-01 07:42:55 -07:00
Alyssa Rosenzweig	8fd748de3d	panfrost: Remove ancient comment Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-01 07:42:55 -07:00
Alyssa Rosenzweig	9fe4fd8a9c	panfrost: Extend software tiling to larger bpp Should not affect lima. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-01 07:40:19 -07:00
Alyssa Rosenzweig	f2801f7775	panfrost: Rewrite u-interleaving code Rather than using a magic lookup table with no explanations, let's add liberal comments to the code to explain what this tiling scheme is and how to encode/decode it efficiently. It's not so mysterious after all -- just reordering bits with some XORs thrown in. v2: Correct copyright identifier. Fix spelling error. Switch space_4 to a LUT. Fix comment typo. Use LUT instead of space_x tricks. Fallback on generic rather than split up unaligned writes. v3: Correct stride order (fixes crash loading). Correct coordinate system mishap. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Andreas Baierl <ichgeh@imkreisrum.de>	2019-07-01 07:39:51 -07:00
Rob Clark	02893fe73a	freedreno: update generated registers Corrects the a3xx texconst state for TILE_MODE. Signed-off-by: Rob Clark <robdclark@chromium.org>	2019-07-01 06:15:52 -07:00
Samuel Pitoiset	d8b079e4c7	radv: rework how the number of VGPRs is computed Just a cleanup, it shouldn't change anything. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-01 14:59:27 +02:00
Samuel Pitoiset	e3baa54195	radv: gather if a vertex shaders needs the instance ID Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-01 14:59:24 +02:00
Samuel Pitoiset	17cb7ea6fc	radv: fix decompressing DCC levels with compute Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-01 14:59:22 +02:00
Samuel Pitoiset	f4d2c47cf6	radv: the number of VGPR_COMP_CNT for GS is expected to be 0 on GFX8 Just move around the switch case. GFX9+ is handled below. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-01 14:59:19 +02:00
Samuel Pitoiset	b4477fa4d4	radv: reduce number of VGPRs for TESS_EVAL if primitive ID is not used We only need to 2. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-07-01 14:59:17 +02:00

1 2 3 4 5 ...

103705 commits