fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 08:38:08 +02:00

Author	SHA1	Message	Date
Emma Anholt	ae01d856de	nir_to_tgsi: Remove the abs on fcsel's bool src. While the nir fcsel opcode specifies src0 != 0.0, as the comment says, it's only ever used on bools-as-floats, so we know that src0 is non-negative. This saves an instruction per CMP on i915. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12921>	2021-09-20 20:20:54 +00:00
Emma Anholt	9017bb0837	ci/freedreno: Add some cubearray piglit flakes on a630 I noticed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12939>	2021-09-20 19:55:55 +00:00
Emma Anholt	feb971fdee	freedreno: Assert to check for the previous regression. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12942>	2021-09-20 19:34:57 +00:00
Emma Anholt	b8c4ad378d	freedreno: Fix autotune regression since batch-cache rework. I freed the key that autotune needed a little early. Fixes: `b2349a4671` ("freedreno: Move the batch cache to the context.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12942>	2021-09-20 19:34:57 +00:00
Rob Clark	68d4d09b56	freedreno: Add info->a6xx.has_shading_rate @flto noticed these registers seem to be related to GL_QCOM_shading_rate Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12856>	2021-09-20 19:13:25 +00:00
Emma Anholt	4474c46f5f	mesa/st: Allow loops in GLSL when NIR is enabled, even if the HW can't. The jump lowering enabled by EmitNoLoops breaks GLSL's loop unrolling on various obviously unrollable loops, resulting in a lot of deqp-gles2 and piglit failures. NIR will help unroll whatever GLSL doesn't, so we can trust the driver to apply that after GLSL's unrolling, so no need to ask GLSL to lower all loops. Fixes: #4979 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12917>	2021-09-20 17:41:24 +00:00
Emma Anholt	346a2950bb	ci/i915g: Clarify failure happening in fbo-fragcoord2. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12917>	2021-09-20 17:41:24 +00:00
Philipp Zabel	4968313fe5	etnaviv: fix dirty bit check for baselod emission Since baselod is stored in sampler state, not sampler view, we should check the ETNA_DIRTY_SAMPLERS bit instead of ETNA_DIRTY_SAMPLER_VIEWS. Signed-off-by: Philipp Zabel <p.zabel@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12916>	2021-09-20 17:03:14 +00:00
Boris Brezillon	4bc10578a5	panfrost/ci: Skip the indirect_draw+XFB tests We lack a dependency between the vertex job filling the indirect draw buffers and the indirect draw compute job reading from these buffers, leading to unreliable results (the tests pass if the vertex job is done before the compute job starts, and fail otherwise). Let's disable those tests until we sort it out. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12589>	2021-09-20 14:58:02 +00:00
Boris Brezillon	3c2d7cb0d6	panfrost: Fix collision in the indirect draw shader table Min/max index search shaders are different for the !primitive_restart and primitive_restart. We need to add entries for the primitive restart cases otherwise we might retrieve a wrong shader from the cache. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12589>	2021-09-20 14:58:02 +00:00
Boris Brezillon	43760a7b2f	panfrost: Fix indirect draws when vertex or instance count is 0 In that case we should just skip the vertex/tiler jobs as done in the direct draw path. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12589>	2021-09-20 14:58:02 +00:00
Boris Brezillon	bdb37c862f	panfrost: Prepare shader helpers to per-gen XML Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12935>	2021-09-20 15:18:56 +02:00
Boris Brezillon	0d57a76458	panfrost: Prepare texture helpers to per-gen XML Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12935>	2021-09-20 15:18:56 +02:00
Boris Brezillon	de13fdc251	panfrost: Prepare pandecode to per-gen XML Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12935>	2021-09-20 15:11:30 +02:00
Boris Brezillon	5af3516f9e	panfrost: Prepare pan_cs helpers to per-gen XML Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12935>	2021-09-20 15:10:00 +02:00
Boris Brezillon	e145a5fdd4	panfrost: Prepare blend helpers to per-gen XML Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12935>	2021-09-20 15:09:19 +02:00
Boris Brezillon	f68c9a575b	panfrost: Prepare blitter helpers to per-gen XML Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12935>	2021-09-20 15:08:31 +02:00
Boris Brezillon	eed57ada6e	panfrost: Patch Z32_S8X24 format when creating a sampler view The gallium driver always stores Z32_S8X24 textures on 2 different planes. Let's fix the create_sampler_view() logic so we can support single-planar Z32_S8X24 in the vulkan driver. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12508>	2021-09-20 11:21:29 +00:00
Qiang Yu	6f9f350622	radeonsi: fix ps SI_PARAM_LINE_STIPPLE_TEX arg This arg size should be 1 instead of 3. It does not affect functionality because we does not enable it in SPI_PS_INPUT_ADDR. But it does affect the VGPR number that LLVM produce when LLVM still count with all PS function arguments. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12922>	2021-09-19 01:24:21 +00:00
Daniel Stone	0e15d5af81	fdno/resource: Rewrite layout selection for allocation The previous code had a number of errors, the most glaring of which was forcing linear when it was one of the possible layouts requested. When modifiers are being used, a list of _acceptable_ modifiers is supplied; it's up to the driver to then make a decision as to which it thinks is most optimal. Normally we would select between linear/tiled/UBWC in ascending order of preference according to what's possible, however we can't use a tiled layout with explicit modifiers as there is no modifier token defined for it. Rewrite the layout-selection mechanism to always try to do the most optimal thing. If the use flags force us to, or we have a shared resource without explicit modifiers, we use linear. Failing that, we use UBWC wherever possible; if this is not possible, we use tiled for internal resources only or linear for shared resources. v2 (Rob): respect FD_FORMAT_MOD_QCOM_TILED; do not print perf warning on user choice of disabling UBWC; v3: fix several issues breaking CI tests: revert removal of using MOD_INVALID in various places, and assume implicit modifiers if present; do not attempt to set UBWC flags when screen->tile_mode(prsc) falls back to LINEAR (e.g. for small mip-maps levels); use TILED for implicit modifier case with non-shared resources v4: fix unintended demotion of UBWC, i.e. only check QCOM_COMPRESSED modifier and demote UBWC to less optimal format when using explicit modifiers Signed-off-by: Daniel Stone <daniels@collabora.com> Tested-by: Heinrich Fink <hfink@snap.com> Signed-off-by: Heinrich Fink <hfink@snap.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12595>	2021-09-18 17:37:05 +00:00
Christian Gmeiner	f8a37832df	lima: fix leak of the screen hash table Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12902>	2021-09-18 16:47:49 +00:00
Ian Romanick	897bb9a222	iris: Add finalize_nir Improves performance of SynMark OglDrvShComp by +241.879%±1.01366% (n=5) on a random KBL desktop that I have. That seems to put it at about the same performance as i965, but I did not test that in a statistically sound way. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12858>	2021-09-17 16:36:08 -07:00
Ian Romanick	b042c71ac1	iris: Move iris_set_max_shader_compiler_threads and iris_is_parallel_shader_compilation_finished There's going to be at least one more shader function set in pipe_screen, so it makes more sense to do it in iris_program.c. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12858>	2021-09-17 16:36:08 -07:00
Ian Romanick	aaf2e6a9f5	iris: Eliminate iris_uncompiled_shader::needs_edge_flag Use the flag that was set by nir_lower_passthrough_edgeflags. The lowering passes will soon be moved to a finalize_nir hook, so there won't be any choice. Ideally we'd like to eliminate iris_fix_edge_flags completely, and this is a first step. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12858>	2021-09-17 16:36:08 -07:00
Ian Romanick	d7ba52cce9	nir/edgeflags: Add a flag to indicate the edge flag input is needed Most modern hardware needs the edge flag added as a hidden vertex input and needs code added to the vertex shader to copy the input to an output. Intel hardware is a little different. Gfx4 and Gfx5 hardware works in the previously described mannter. Gfx6+ hardware needs the edge flag as a specific vertex shader input, and that input is magically processed by fixed-function hardware without need for extra shader code. This flag signals only that the vertex shader input is needed. It would be nice if we could decouple adding the vertex shader input from generating the copy-to-output code, but that has proven to be challenging. Not having that code causes other passes to want to eliminate that shader input. v2: Convert conditional to assertion. This pass is only called for vertex shaders. Suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12858>	2021-09-17 16:36:08 -07:00
Ian Romanick	3281ccf4b1	iris: Calculate uses_atomic_load_store after all lowering The lowering passes will soon be moved to another function, so there won't be any choice. As a side benefit, this allows eliminating the uses_atomic_load_store pointer parameter from brw_nir_lower_storage_image. For some reason crocus was passing false instead of NULL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12858>	2021-09-17 16:36:08 -07:00
Ian Romanick	e33055874b	iris: crocus: Use shader_info::is_arb_asm flag ...instead of looking for "ARB" in the name of the shader. This matches the behavior of i965. Using "ARB" was added in `a1ebac3750` ("iris: Implement ALT mode for ARB_{vertex,fragment}_shader"), but there's no explanation of why that method was used. v2: Just use shader_info::is_arb_asm everywhere instead of iris_uncompiled_shader::use_alt_mode. Suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12858>	2021-09-17 16:36:08 -07:00
Rob Clark	bf74d58f21	freedreno/a6xx: Pre-bake SO-disable stateobj No need to re-create this every time we transition from stream-out enabled to disabled. Creation of streamout_disable_stateobj is deferred until we create a program state using streamout to avoid creating it unnecessarily and because fd6_prog_init() is called before ctx->pipe is created. (Changing that ordering is complicated by the fact that u_blitter copies pctx->bind_fs_state(), and friends.) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12918>	2021-09-17 20:03:40 +00:00
Mike Blumenkrantz	60f12ac127	zink: don't leak drm fd on drmPrimeFDToHandle failure Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12911>	2021-09-17 19:33:43 +00:00
Rob Clark	8c6e789c24	freedreno: Add perf warning for WC readback Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11176>	2021-09-17 18:24:34 +00:00
Rob Clark	7e088d3621	freedreno: Used cached coherent for staging resources These are really only accessed by the GPU once, so CPU access speed is more important. Especially for PIPE_MAP_READ. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11176>	2021-09-17 18:24:34 +00:00
Rhys Perry	a1af902531	nir/algebraic: distribute fmul(fadd(a, b), c) when b and c are constants This allows for more MAD/FMA instructions to be created. fossil-db (Sienna Cichlid): Totals from 50134 (33.46% of 149839) affected shaders: VGPRs: 2436536 -> 2436000 (-0.02%); split: -0.05%, +0.03% SpillSGPRs: 13136 -> 13135 (-0.01%); split: -0.02%, +0.02% CodeSize: 206621424 -> 206278292 (-0.17%); split: -0.23%, +0.07% MaxWaves: 1116804 -> 1117448 (+0.06%); split: +0.07%, -0.01% Instrs: 38977460 -> 38862886 (-0.29%); split: -0.33%, +0.04% Latency: 832425389 -> 827432260 (-0.60%); split: -0.63%, +0.03% InvThroughput: 184193457 -> 183563350 (-0.34%); split: -0.37%, +0.03% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7458>	2021-09-17 17:28:26 +00:00
Emma Anholt	63cc1fe71f	freedreno: Remove dead fd_batch_reset(). Unused since `58f5605124` ("freedreno: Handle full blit discards by invalidating the resource.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11511>	2021-09-17 17:07:43 +00:00
Emma Anholt	3ade94df86	freedreno: Use a BO bitset for faster checks for resource referenced. When moving the batch cache to the context, I added hash table lookups from batch to rsc for "is this resource in use" because we could no longer store data in the rsc bo under the batch cache's lock. We can save that cost by tracking a bitfield of resources referenced by the batch, which gives us very cheap checks in the draw path at a minor cost in memory. We can just use the GEM BO handle, since it's a nice small integer already (we can't use the TC buffer ID, because the frontend changes that, and we're in the driver thread). This required moving the !pending() assert up in resource shadowing, since the BO swap meant we were checking pending on the wrong resource. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11511>	2021-09-17 17:07:43 +00:00
Emma Anholt	22486ffa51	freedreno: Remove the submit lock locking. I think the whole submit lock thing should be possible to remove now, but just getting rid of the lock since we're no longer sharing batches between ctxes means another several percent draw overhead improvement. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11511>	2021-09-17 17:07:43 +00:00
Emma Anholt	b2349a4671	freedreno: Move the batch cache to the context. Our draw call rate was significantly limited by the atomics we had to do to manage access to the batch cache. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11511>	2021-09-17 17:07:43 +00:00
Emma Anholt	c3d6493122	freedreno: Use TC's flag for whether get_query is in the driver thread. In moving batch cache to the context, the check for whether there's pending work being done to this resources ends up accessing the context, so we can't do it outside of the fd_context_access_begin(). This flag lets us do the driver-thread asserts before we've decided whether we need to flush. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11511>	2021-09-17 17:07:43 +00:00
Emma Anholt	cbbe3e7641	freedreno: Precompute resource pointer hash values. It was around half a percent of the CPU in the fd-bc-ctx branch, and adds 4b to our 472b struct. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11511>	2021-09-17 17:07:43 +00:00
Mike Blumenkrantz	5eb59a03ec	util/tc: rename tc_replace_buffer_storage_func::num_rebinds and document this parameter is only a hint, as tc provides no method for tracking cases when a buffer is bound multiple times to the same site (e.g., multiple vertex buffer slots will be counted as 1 bind), so rename to "minimum" to be more clear Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12898>	2021-09-17 12:32:29 +00:00
Filip Gawin	beb7ed2b89	r300: assert that array in translate_vertex_program is initialized Problematic usage is in case RC_OPCODE_ENDLOOP, at line ret_addr = loops[--loop_depth]; Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12880>	2021-09-17 07:19:24 +00:00
Mike Blumenkrantz	eb3fa25150	zink: zero VkImageCreateInfo::queueFamilyIndexCount on creation this explodes gfxreconstruct Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12905>	2021-09-17 00:24:49 +00:00
Neha Bhende	f2b08decf7	auxiliary/indices: convert primitive type PIPE_PRIM_PATCHES Few tessellation related piglit test are crashing. This patch fixes unhandled case Tested with piglit Reviewed-by: Charmaine Lee <charmainel@vmware.com> (cherry picked from commit 0e1c962cc105a9330caf22266e1962b049c13454) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12882>	2021-09-16 21:45:43 +00:00
Dave Airlie	86399c1b1f	lavapipe: enable KHR_spirv_1_4 One test fails but I think it's a missing barrier in the test Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12888>	2021-09-17 05:38:29 +10:00
Nanley Chery	69242f188c	iris: Finish aux import in iris_resource_from_handle This allows us to delete iris_resource_unfinished_aux_import, which incorrectly assumed that a CCS-enabled resource needs an aux BO. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	8510d17ff4	iris: Add and use get_num_planes This function counts the number of objects in a pipe_resource linked list. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	a16d6bc179	iris: Don't add a clear color BO for MC_CCS It's unusable because ISL_AUX_USAGE_MC doesn't support fast clears. Instead of performing this change in the if-ladder, replace the if-ladder with a switch statement to make it clear what's going on. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	c7a61bbf13	iris: Allow NULL aux BOs in aux-state functions XeHP can use CCS_E without an aux BO. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	d32a4cdab9	iris: Simplify an iris_use_pinned_bo call Avoid using a helper function to get the resource BO. This fits in better with the previous iris_use_pinned_bo calls. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	89319a0dfd	iris: Split clear color and aux BO checks CCS_E-enabled resources on XeHP have a clear color without an aux BO. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00
Nanley Chery	d25515fbf1	iris: Support NULL aux BOs in fill_surface_state XeHP can use CCS_E without an aux BO. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12795>	2021-09-16 18:07:23 +00:00

1 2 3 4 5 ...

50897 commits