fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 17:18:11 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	29c4d44cee	radv: select the depth/stencil resolve method based on some conditions Only fallback to the compute path for layers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-21 14:50:24 +02:00
Samuel Pitoiset	5cf350f565	radv: implement all depth/stencil resolve modes using compute This path supports layers but it requires to decompress HTILE before resolving. The driver also needs to fixup HTILE after the resolve. This path is probably slower than the graphics one. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-21 14:50:19 +02:00
Samuel Pitoiset	cdc6efddf9	radv: implement all depth/stencil resolve modes using graphics When using graphics, the driver doesn't need to decompress HTILE before resolving. This path currently doesn't support layers so we have to fallback to the compute path. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-21 14:50:15 +02:00
Samuel Pitoiset	e52ad9f845	radv: record if a render pass has depth/stencil resolve attachments Only supported with vkCreateRenderPass2(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-21 14:50:12 +02:00
Samuel Pitoiset	ac6369a2d0	radv: rename has_resolve to has_color_resolve Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-21 14:50:10 +02:00
Samuel Pitoiset	203f60ebf2	radv: emit framebuffer state from primary if secondary doesn't inherit it Otherwise fast color/depth clears can't work because they depend on the framebuffer. This fixes the following CTS (when the small hint is disabled): - dEQP-VK.geometry.layered.1d_array.secondary_cmd_buffer - dEQP-VK.geometry.layered.2d_array.secondary_cmd_buffer - dEQP-VK.geometry.layered.cube.secondary_cmd_buffer - dEQP-VK.geometry.layered.cube_array.secondary_cmd_buffer Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110810 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107986 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-06-21 13:49:35 +02:00
Eric Engestrom	6a9dd62882	drisw: move build logic to build systems Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-21 11:35:39 +00:00
Tomeu Vizoso	1cbe2ad394	panfrost: ci: Exclude two more flip-flop from results These three tests pass on RK3399, but fail on RK3288: dEQP-GLES2.functional.shaders.matrix.div.const_lowp_mat2_mat2_vertex dEQP-GLES2.functional.shaders.operator.unary_operator.pre_increment_effect.highp_ivec4_vertex dEQP-GLES2.functional.shaders.texture_functions.vertex.texture2dprojlod_vec3 They reliably pass when run individually, but reliably fail when run in a full CI run. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-06-21 10:45:12 +02:00
Gert Wollny	ef4429d9c5	gallium/st: Add Gallium hud to swrast drivers Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com>	2019-06-21 08:54:57 +02:00
Iago Toral Quiroga	4d8f82946b	v3d: flush jobs writing to vertex buffers used in the current draw call This can happen when any of our vertex buffers was written by a previous transform feedback draw. Fixes the following piglit tests: spec/ext_transform_feedback/position-render-bufferbase spec/ext_transform_feedback/position-render-bufferbase-discard spec/ext_transform_feedback/position-render-bufferoffset spec/ext_transform_feedback/position-render-bufferoffset-discard spec/ext_transform_feedback/position-render-bufferrange spec/ext_transform_feedback/position-render-bufferrange-discard Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-21 08:06:13 +02:00
Iago Toral Quiroga	eb44dcc219	v3d: flush jobs reading from transform feedback output buffers If we are about to write to a transform feedback buffer, we should make sure that we flush any prior work that intended to read from any of these buffers. Fixes piglit test: spec/ext_transform_feedback/immediate-reuse Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-21 08:06:13 +02:00
Iago Toral Quiroga	42572f2f7d	v3d: add a helper to check if transform feedback is enabled v2: We should be safe assuming that bind_vs != NULL (Eric) Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-21 08:06:13 +02:00
Dave Airlie	00a56acc23	llvmpipe: make remove_shader_variant static. this isn't used outside this file. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-06-21 10:27:57 +10:00
Eric Engestrom	955c63d364	util/os_file: resize buffer to what was actually needed Fixes: `316964709e` "util: add os_read_file() helper" Reported-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-06-20 21:49:30 +00:00
Tomeu Vizoso	2743e34f20	panfrost: ci: Update expectations These tests have been fixed recently. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-06-20 20:57:41 +02:00
Alyssa Rosenzweig	195e297a92	panfrost/midgard: Broadcast swizzle Fixes regression in shaders using ball/etc by explicitly passing through the number of channels in the NIR op and broadcasting the last components of the channel appropriately, as the Midgard ops are all vec4 implicitly but NIR can be vec2/3. v2: Don't also regress every other swizzle in Equestria. v3: Don't regress the swizzles at Canterlot High either. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-06-20 20:52:04 +02:00
Kenneth Graunke	31de802e7e	iris: Use stream uploader for shader draw parameters. Most vertex data lives in user VBOs in IRIS_MEMZONE_OTHER, which typically have high bits set to 0xffff. The shader draw parameters were being uploaded in IRIS_MEMZONE_DYNAMIC, which have high bets set to 0x2. This was causing a lot of ping-ponging of high bits, leading to unnecessary VF cache flushing. Cuts 7.2% of the flushes in the Civizilation VI demo on Kabylake GT2.	2019-06-20 13:32:16 -05:00
Kenneth Graunke	db8f57a5cb	iris: Don't check VF address high bits when there is no buffer. If there is no buffer, then it doesn't matter. Leave the old stale high bits in place (for next time) and don't bother invalidating. Cuts 5.6% of the flushes in the Civilization VI demo on Kabylake GT2.	2019-06-20 13:32:16 -05:00
Kenneth Graunke	ecc500398f	iris: Drop RT flushes from depth stencil clearing flushes. These write depth and stencil, not color writes, so there's no need to flush the render target.	2019-06-20 13:32:16 -05:00
Kenneth Graunke	1d63af0f2c	iris: Don't bother with PIPE_CONTROLs for CPU writes and no history If a buffer has no usage history, we don't have any read only cache invalidates to do. If we've written it with the CPU, we don't need to flush the render cache. The only bit remaining is the CS stall from iris_flush_bits_for_history. We can just skip the PIPE_CONTROL in this case. This is pretty common - an app creates a buffer, fills it with data, and then binds it for some purpose. Cuts 36% of the flushes in Manhattan 3.0 on Kabylake GT2.	2019-06-20 13:32:16 -05:00
Kenneth Graunke	dfff6e10b4	iris: Only do an RT flush for transfer maps if using copy_region. If we wrote the data via the CPU, there's no point in doing a render target flush. If using BLORP, we do want a render target flush so the data lands.	2019-06-20 13:32:15 -05:00
Kenneth Graunke	c4c17ab3ec	iris: Use iris_flush_bits_for_history in iris_transfer_flush_region Instead of using the combined iris_flush_and_dirty_for_history, use iris_flush_bits_for_history directly - we were already using the split out iris_dirty_for_history. There's no need to dirty twice, and we can avoid the looping altogether for non-buffers.	2019-06-20 13:32:15 -05:00
Kenneth Graunke	6890340c31	iris: Avoid double flushing in iris_transfer_flush_region when copying. My intention was to have iris_copy_region not do flushing, and leave that up to the callers. iris_resource_copy_region needs to do this, but iris_transfer_flush_region was already doing it. The net result was that we were doing it twice for transfers. So, move the flushing from iris_copy_region to iris_resource_copy_region so that it only happens in the callers as I intended.	2019-06-20 13:32:15 -05:00
Kenneth Graunke	64fb20ed32	iris: Fix iris_flush_and_dirty_history to actually dirty history. When I split iris_flush_and_dirty_history into two helper functions, I accidentally made it stop dirtying. Which was...sort of the point. Fixes: `21688a306b` iris: Split iris_flush_and_dirty_for_history into two helpers.	2019-06-20 13:32:15 -05:00
Kenneth Graunke	5e501ffeb2	iris: Add maybe_flush calls to texture_barrier and memory_barrier Otherwise, tests which loop on glMemoryBarrier may run us out of batch space with piles of flushing. (Ideally, we'd elide those bonus PIPE_CONTROLs, but presumably this isn't that common of a case...) Piglit's arb_pipeline_statistics_query-comp would hit this case after some of the next patches remove other PIPE_CONTROLs with maybe_flushes.	2019-06-20 13:32:15 -05:00
Kenneth Graunke	d4a4384b31	iris: Implement INTEL_DEBUG=pc for pipe control logging. This prints a log of every PIPE_CONTROL flush we emit, noting which bits were set, and also the reason for the flush. That way we can see which are caused by hardware workarounds, render-to-texture, buffer updates, and so on. It should make it easier to determine whether we're doing too many flushes and why.	2019-06-20 13:32:15 -05:00
Alyssa Rosenzweig	c378829a0d	panfrost: Skip shading unaffected tiles Looking at the scissor, we can discard some tiles. We specifially don't care about the scissor on the wallpaper, since that's a no-op if the entire tile is culled. v2: Clarify clear comment (not reviewed but trivial). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>	2019-06-20 09:30:38 -07:00
Eric Engestrom	65b016b146	glx: fix glvnd pointer types Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=110709 Fixes: `22a9e00aab` ("glx: Implement the libglvnd interface.") Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-06-20 17:21:37 +01:00
Eric Engestrom	e0ee790ba7	glx: drop misleading comment about the file being "generated" This `gen_scrn_dispatch.pl` has never existed, in the sense that NVIDIA never published it. There have been a number (6) of commits to fix various things in there over the years, and never anything from NVIDIA. For all intents and purposes this file is hand-written and hand-maintained, and we're on our own. Let's make this clear by removing this misleading comment. Suggested-by: Eric Anholt <eric@anholt.net> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net> Acked-by: Emil Velikov <emil.velikov@collabora.com>	2019-06-20 16:19:58 +00:00
Boris Brezillon	56434450f6	nir/lower_tex: Add an assert() in nir_lower_txs_lod() We don't expect the output of a TXS instruction to be wider than a vec3. Add an assert() to make sure this never happens. Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 09:15:53 -07:00
Tomeu Vizoso	babc3ad291	panfrost: Set job requirements during draw Right now we are doing it at a moment when we don't have all the information we need. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Suggested-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by: Rohan Garg <rohan.garg@collabora.com> Cc: Rohan Garg <rohan.garg@collabora.com> Fixes: `bfca21b622` ("panfrost: Figure out job requirements in pan_job.c")	2019-06-20 18:07:04 +02:00
Alyssa Rosenzweig	dc668203db	panfrost/meson: Link with libpanfrost_shared Fixes: `035a07c0` ("panfrost: Switch to lima tiling") Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 08:56:38 -07:00
Hyunjun Ko	f7f8fb1b55	freedreno/ir3: fix typo Fixes: `a9b556d3a0` ("freedreno/ir3: check the type of regs of absneg opcode in is_same_type_mov") Reviewed-by: Rob Clark <robdclark@gmail.com>	2019-06-20 08:34:09 -07:00
Alyssa Rosenzweig	546236e27f	panfrost: Load from tiled images Now that we have lima tiling code available, use it to load from a tiled source. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 08:22:38 -07:00
Alyssa Rosenzweig	035a07c0ae	panfrost: Switch to lima tiling Lima and Panfrost both have implementations of software tiling (the Lima one was forked off the Panfrost one which was forked off the original Lima one...). Switch to the most recent Lima code, since it's more complete than ours at this point. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 08:22:38 -07:00
Alyssa Rosenzweig	7b46f09f26	panfrost: Fix tiled NPOT textures with bpp<4 Panfrost's tiling routines (incorrectly) ignored the source stride, masking this bug; lima's routines respect this stride, causing issues when tiling NPOT textures whose stride is not a multiple of 64 (for instance, NPOT textures with bpp=1). Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 08:22:38 -07:00
Alyssa Rosenzweig	413242277a	lima,panfrost: Move lima_tiling.c/h to /src/panfrost This will allow both drivers to share this code. Both drivers build-tested with meson. Android build not tested. v2: Change naming from tiling->shared, in case Lima and Panfrost can share more in the future. Fix Android build system. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-and-tested-by: Qiang Yu <yuq825@gmail.com>	2019-06-20 08:06:35 -07:00
Kenneth Graunke	c57b4c86c0	iris: Use render_batch/compute_batch locals in memory_barrier We have them, may as well use them.	2019-06-20 10:04:38 -05:00
Lionel Landwerlin	4a61be24fe	anv: only resort to sync fds internally with no syncobj support We can rely on only one kind of synchronization object (drm-syncobj) when it is available. This reduces the number of file descriptors we use in our implementation. This will be required later for timeline semaphores implementation, at this point we won't ever want to use anything else but syncobjs. v2: Only use has_syncobj for semaphores (Jason) v3: Only has_syncobj in assert on semaphores in QueueSubmit (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-06-20 14:59:51 +00:00
Alyssa Rosenzweig	1d7e53a854	panfrost: Remove other commented pointers Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:05 -07:00
Alyssa Rosenzweig	2608da14b9	panfrost/decode: Elide more zero fields Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:05 -07:00
Alyssa Rosenzweig	cfc2218a8c	panfrost/decode: Remove memory comments These do more harm than good at this point. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:04 -07:00
Alyssa Rosenzweig	8643b89c48	panfrost: Add missing 0x in invocation_count Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:04 -07:00
Alyssa Rosenzweig	b6d46d09c2	panfrost/decode: Skip decode of fragment backend in non-fragment This is all zero for anything but fragment shaders. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:04 -07:00
Alyssa Rosenzweig	ae2bfab7b7	panfrost/decode: Clip mali_compute_fbd at 64-bytes Looking at internal evidence (later fields including a literal other compute job inception-style, seeming memory corruption, no clear function, and the field after this being a pointer to itself), it looks like this is really a much smaller descriptor. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:04 -07:00
Alyssa Rosenzweig	3faf33488a	panfrost/decode: Print COMPUTE uniforms as pointers In OpenGL, uniforms generally represent fp32 vec4s (at least in highp mode). In OpenCL, they represent vec2s of 64-bit pointers. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:04 -07:00
Alyssa Rosenzweig	0021fae7f8	panfrost/decode: Show int uniforms Float is ambiguous. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:04 -07:00
Alyssa Rosenzweig	1f7dfee1b4	panfrost/decode: Expand pointers in compute descriptor Just as an aid. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:04 -07:00
Alyssa Rosenzweig	0aa5d89acb	panfrost/decode: Identify "compute FBD" There is fundamentally not a framebuffer associated with a compute job. Allocate a new structure for it so we don't mess up graphics when decoding. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 07:48:04 -07:00
Tomeu Vizoso	4f881237c3	panfrost: Allocate panfrost_job in panfrost_context Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-06-20 15:48:35 +02:00

... 99 100 101 102 103 ...

117072 commits