fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-26 15:30:40 +02:00

Author	SHA1	Message	Date
Mike Blumenkrantz	b31c414e28	zink: set predicate_dirty on query creation ensure this is set when it needs to be Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21581>	2023-03-06 02:22:39 +00:00
Mike Blumenkrantz	5374605ea9	zink: merge qbo update copies when possible if a single query is being started and stopped frequently, update the internal qbo with a single copy call instead of one copy per result not actually that useful in practice because of how query pools are shared, but could help somewhere in theory Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21581>	2023-03-06 02:22:39 +00:00
Mike Blumenkrantz	7f956435a0	zink: rework xfb queries for drivers with poor primgen support for drivers lacking one of: * EXT_color_write_enable * primitivesGeneratedQueryWithRasterizerDiscard terrible things must happen. specifically, dummy surfaces have to be used in a framebuffer with rast-discard enabled for the duration of the query now that queries are only started/stopped in renderpasses, however, there are new hurdles. with tc renderpass optimizing, queries can be started outside the renderpass, which would trigger recursion when trying to start a primgen query outside the renderpass if any clears are enabled, as those must be flushed onto the real surfaces to solve all of this: * block tc renderpass optimizing if at least one of the above features is missing * detect a pending primgen query start during renderpass start * activate rast-discard and set dummy surfaces before beginning renderpass * this recurses and automatically flushes clears * finally, start the real renderpass BUT WAIT THERE'S MORE! because there's also drivers that support EXT_color_write_enable and don't support primitivesGeneratedQueryWithRasterizerDiscard, which means they do need rast-discard, but they don't need dummy surfaces, and so the clears still have to be flushed, so they need an explicit (recursive) renderpass start/stop in advance to ensure the clears are applied as expected Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21628>	2023-03-06 02:00:06 +00:00
Mike Blumenkrantz	5144c8a858	zink: track whether a primgen query is suspended and needing color write hacks Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21628>	2023-03-06 02:00:06 +00:00
Mike Blumenkrantz	9bc871199c	zink: only resume queries inside renderpasses from set_active_query_state match new default query behavior Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21628>	2023-03-06 02:00:06 +00:00
Mike Blumenkrantz	81de7a1c25	zink: resume queries after conditional render and clears are processed this should have no functional effect other than ensuring primgen queries don't recurse when detecting clears Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21628>	2023-03-06 02:00:06 +00:00
Mike Blumenkrantz	f7d1fff23f	zink: disable queries for clear_texture() this otherwise can do weird things cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21628>	2023-03-06 02:00:06 +00:00
David Heidelberg	26dc5b3737	ci/ci_run_n_monitor: while we usually disable many jobs, print them inline Saving scrolling time... Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21713>	2023-03-06 01:51:59 +01:00
Friedrich Vock	f5061758be	radv: Use LDS for closest-hit hit attributes Q2RTX: 23.1ms -> 22.9ms shader-db: Totals from 19 (0.69% of 2764) affected shaders: MaxWaves: 197 -> 208 (+5.58%) Instrs: 87702 -> 87817 (+0.13%); split: -0.03%, +0.16% CodeSize: 474320 -> 475128 (+0.17%) VGPRs: 1840 -> 1728 (-6.09%) Latency: 2771599 -> 2773173 (+0.06%); split: -0.13%, +0.18% InvThroughput: 561281 -> 533010 (-5.04%); split: -5.16%, +0.12% VClause: 2782 -> 2788 (+0.22%); split: -0.18%, +0.40% Copies: 12115 -> 12136 (+0.17%); split: -0.45%, +0.63% Branches: 4116 -> 4122 (+0.15%) PreVGPRs: 1665 -> 1638 (-1.62%); split: -1.92%, +0.30% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21635>	2023-03-05 21:53:34 +00:00
Friedrich Vock	c1651a1032	radv: Extend hit attribute lowering for LDS Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21635>	2023-03-05 21:53:34 +00:00
Alyssa Rosenzweig	61663859bc	asahi: Wire up compute kernels Now that we have multiple sysval tables, implementing compute kernels -- including with indirect dispatch and load_num_workgroups -- is straightforward. This patch adds the straightforward launch_grid implementation. As usual needs UAPI support patches to actually do anything, but the relevant compute tests are passing downstream. It's not possible to properly test compute shaders support right now (pending support for images), so we don't update the CAPs or features.txt here. This is more about flushing out the piles of downstream patches we have (and getting reviewed!) in preparation for cutting a downstream release soon. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21703>	2023-03-05 19:40:43 +00:00
Alyssa Rosenzweig	c086f2770b	asahi: Rework system value lowering The previous lowering was insufficient in two areas: * No support for indirection. This is required for dynamically indexing into UBOs, SSBOs, etc in OpenGL ES 3.2 * Only a single table supported. Multiple tables are required to implement indirect dispatch/draws efficiently, in order to bind the indirect buffer as uniforms. The first problem is addressed here by reworking the lowering of system values to happen in NIR, decoupled from the uniform register assignment details, such that we can handle 1:n lowerings in a straightforward way. Namely, indirect sysvals are lowered to indirect memory loads relative to the base address of the sysval table, where the table address is itself pushed as a (direct) sysval. The second problem is addressed in this patch by generalizing to multiple uniform tables. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21703>	2023-03-05 19:40:43 +00:00
Alyssa Rosenzweig	f92738eaaa	agx: Handle fragment shader side effects Fragment shaders with side effects need to be lowered to ensure they execute for all shaded pixels but no helper threads. Add a lowering pass to handle this. Fixes dEQP-GLES31.functional.shaders.opaque_type_indexing.atomic_counter.const_literal_fragment Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21712>	2023-03-05 19:12:35 +00:00
Alyssa Rosenzweig	290f3b76f3	agx: Disable tri merging with side effects As Metal does. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21712>	2023-03-05 19:12:35 +00:00
David Heidelberg	b20c9adb4e	crocus/meson: add dependency on libintel_dev also for versioned static libraries Fixes: `a0fa31bcdd` ("intel/dev: create a helper dependency for libintel_dev") Reviewed-by: Mark Janes <markjanes@swizzler.org> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21709>	2023-03-05 18:44:54 +00:00
Rob Clark	8e7511ea7f	vk/runtime: Use libdrm shim Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21636>	2023-03-05 16:31:51 +00:00
Rob Clark	44f7ec40ef	loader: Use libdrm shim Signed-off-by: Rob Clark <robdclark@chromium.org> Acked-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21636>	2023-03-05 16:31:51 +00:00
Rob Clark	5f5ccf4bec	turnip: Use libdrm shim Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21636>	2023-03-05 16:31:51 +00:00
Rob Clark	e05abb1345	util: Add a simple no-op libdrm shim Make it easier to deal with build configs that do not have libdrm. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21636>	2023-03-05 16:31:51 +00:00
David Heidelberg	b73b701579	ci/freedreno: rare flake KHR-GL45.sample_variables.mask.rgba8i.samples_4.mask_3 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21718>	2023-03-05 14:34:33 +00:00
David Heidelberg	5ee724e180	ci/lavapipe: add recent occasional flake Issue: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8441 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21717>	2023-03-05 14:35:45 +01:00
Gert Wollny	9b09f244f0	r600/sfn: Fix atomic lowering Fixes: `56dedf052f` r600/sfn: add r600 specific lowering pass for atomics and use it Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21684>	2023-03-05 09:54:08 +00:00
Gert Wollny	3c3ecdab36	r600/sfn/tests: Add a test for the copy prop into a group Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21684>	2023-03-05 09:54:08 +00:00
Gert Wollny	244cc152d1	r600/sfn: redirect copy propagation to alu parent group If an ALU instruction was emitted from the get-go as group, then we have to make sure that replacing a source doesn't violate the readport configuration in the group. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8374 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21684>	2023-03-05 09:54:08 +00:00
Gert Wollny	2028465bd8	r600/sfn: Add print method to AluReadportValidation Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21684>	2023-03-05 09:54:08 +00:00
Gert Wollny	ee0010213f	r600/sfn: Add method to AluGroup to replace sources Related: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8374 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21684>	2023-03-05 09:54:08 +00:00
Gert Wollny	6180721005	r600/sfn: Split AluInstr replace_source into test and actual replace Related: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8374 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21684>	2023-03-05 09:54:08 +00:00
Gert Wollny	afa545b926	r600/sfn: Add AluGroup method to update readport validation from scratch Related: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8374 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21684>	2023-03-05 09:54:08 +00:00
Alyssa Rosenzweig	ed587ae6ac	asahi/meta: Use lowered I/O No point in creating a variable when we can just synthesize the store_output directly. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	485eddcc85	asahi: Bump shader buffers No reason to limit it, it's direct access anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	c7b5f01461	agx: Only lower int64 late This is required for address arithmetic to be lowered properly for compute kernels, which may have u2u64 in the source NIR. No shader-db changes (for GLES3.0). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	811f8b899d	agx: Don't print pre-optimization shader It's usually too noisy to be useful, especially before DCE. The optimized (but pre-RA) shader is usually the useful bit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	ea37d7f81f	agx: Use agx_emit_collect for st_tile Instead of open coding. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	7bb8112fd1	agx: Refactor vector creation agx_vec4 is unused, drop it, and split out the common logic since we'll use it in a new helper. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	037609f1dc	agx: Constify agx_print Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	a9c5956f2f	agx: Inline 16-bit load/store offsets Most integer immediates are only 8-bit, but load/store instructions allow their immediate offsets to be 16-bit instead. Take advantage of this in the optimizer. This eliminates 36% of the instructions in dEQP-GLES31.functional.ssbo.layout.random.all_shared_buffer.36, a fitting percentage. Insignificant effect on dEQP-GLES31.functional.ssbo.* performance... Only a small % of our compile-time pie is actually spent in the backend anyway (as opposed to NIR passes or GLSL IR). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	c9728b41d5	agx: Factor out allows_16bit_immediate check The optimizer needs this information to inline immediates effectively. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	445ca949cd	agx: Clean up after lowering address arithmetic This avoids creating silly preambles that don't actually do anything except push a constant that we could've inlined for cheaper anyway, since nir_opt_preamble's cost model is sensitive to e.g. constant folding. This avoids a pointless preamble in split-hell. As a nice bonus, this also improves compile-time on address-heavy shaders. With a release build, CPU time in dEQP-GLES31.functional.ssbo.* reduces from 12.87s to 10.77... a 16% improvement is nothing to sneeze at. shader-db results are mostly noise. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	4b1f4b86ea	agx: Add AGX_MESA_DEBUG=nopreamble option Useful both for ruling out issues with shader preambles as well as (in some cases) making for a nicer reading experience of the compiled assembly. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21430>	2023-03-05 09:27:02 +00:00
Alyssa Rosenzweig	c22a18c9af	agx: Don't write sample mask from preambles It doesn't make sense, they're basically little compute kernel environments. Noticed while debugging dEQP-GLES31.functional.fbo.no_attachments.multisample.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21710>	2023-03-05 08:20:09 +00:00
Alyssa Rosenzweig	e9f7d14de6	asahi: Mark PIPE_FORMAT_NONE "supported" Kinda silly but fixes dEQP-GLES31.functional.state_query.integer.max_framebuffer_samples_* which queries the number of samples of a NONE format, required for ARB_framebuffer_no_attachments to make sense. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21710>	2023-03-05 08:20:09 +00:00
Alyssa Rosenzweig	8bb40ce4ad	agx: Fix 2D MSAA array texture register allocation Sample index and layer index are both 16-bits, even though they are zero extended for compiler simplicity in some cases. In particular this means that 2D MSAA arrays consume 6 half-regs for their coordinates, not 8. This is what the IR translation (actually agx_nir_lower_texture) produces, we just need to fix the calculation in agx_read_registers to agree. Fixes validation failure in tests like dEQP-GLES31.functional.texture.multisample.samples_4.use_texture_color_2d_array Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21708>	2023-03-05 08:06:43 +00:00
Alyssa Rosenzweig	3032e3ad23	agx: Mask shifts in the backend This gives our shifts SM5 behaviour at the cost of a little extra ALU. That way, we match NIR's shifts. This fixes unsoundness of GLSL expressions like "a << (b & 31)", where the & would mistakenly get optimized away. Closes: #8181 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reported-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21673>	2023-03-05 07:52:22 +00:00
Alyssa Rosenzweig	f4e2b22646	asahi: Advertise dual-source blending This is handled entirely in common code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21545>	2023-03-05 07:38:36 +00:00
Yogesh Mohan Marimuthu	af953616a1	wsi/display: check alloc failure in wsi_display_alloc_connector() vulkancts test dEQP-VK.wsi.direct_drm.surface.create_simulate_oom is failing because the wsi_display_alloc_connector() function does not check whether the memory allocation for connector returned NULL. The create_simulate_oom test simulates out of memory, hence the memory allocation for connector fails, and the program segfaults later when it tries to dereference connector. This patch fixes the dEQP-VK.wsi.direct_drm.surface.create_simulate_oom test segfault by checking if connector is NULL after memory allocation. Cc: mesa-stable Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21701>	2023-03-04 21:20:54 +00:00
Rob Clark	82cc236458	freedreno/a6xx: Fix mirror x/y blits Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21706>	2023-03-04 19:13:40 +00:00
Rob Clark	ec9e03fb39	freedreno/a6xx: Convert blitter to OUT_REG() We'll need this to add a7xx support, since some of the regs are different btwn a6xx and a7xx and reg variants are not supported with the legacy reg builders. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21706>	2023-03-04 19:13:40 +00:00
Rob Clark	149f2a2e81	freedreno/a6xx: Namespace reg/pkt packer vars Otherwise they could conflict with parameters to the reg/pkt. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21706>	2023-03-04 19:13:40 +00:00
Alyssa Rosenzweig	1d2c1b8bd6	pan/mdg: Use nir_lower_helper_writes It's now in common code, drop our (buggier) copy. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21413>	2023-03-04 13:31:05 -05:00
Alyssa Rosenzweig	586da7b329	nir: Add nir_lower_helper_writes pass This NIR pass lowers stores in fragment shaders to: if (!gl_HelperInvocation) { store(); } This implements the API requirement that helper invocations do not have visible side effects, and the lowering is required on any hardware that cannot directly mask helper invocations' side effects. The pass was originally written for Midgard (which has this issue) but is also needed for Asahi. Let's share the code, and fix it while we're at it. Changes from the Midgard pass: 1. Add an option to only lower atomics. AGX hardware can mask helper invocations for "plain" stores but not for atomics. Accordingly, the AGX compiler wants this lowering for atomics but not store_global. By contrast, Midgard cannot mask any stores and needs the lowering for all store intrinsics. Add an option to the common pass to accommodate both cases. This is an optimization for AGX. It is not required for correctness, this lowering is always legal. 2. Fix dominance issues. It's invalid to have NIR like if ... { ssa_1 = ... } foo ssa_1 Instead we need to rewrite as if ... { ssa_1 = ... } else { ssa_2 = undef } ssa_3 = phi ssa_1, ssa_2 foo ssa_3 By default, neither nir_validate nor the backends check this, so this doesn't currently fix a (known) real bug. But it's still invalid and fails validation with NIR_DEBUG=validate_ssa_dominance. Fix this in lower_helper_writes for intrinsics that return data (atomics). 3. Assert that the pass is run only for fragment shaders. This encourages backends to be judicious about which passes they call instead of just throwing everything in a giant lower everything spaghetti. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21413>	2023-03-04 13:31:05 -05:00
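
The lowering described in commit 586da7b329 (nir: Add nir_lower_helper_writes pass) can be illustrated with a small standalone C model. This is only a sketch of the idea, not Mesa code: struct invocation, atomic_add, run_lowered_atomic and run_quad are hypothetical names invented for this illustration, and NIR itself is not involved. It models an atomic guarded by "not a helper invocation", with the else branch supplying an undefined value so that the merged result is defined on both paths, as in the phi form quoted in the commit message.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical model of one fragment-shader invocation; not a Mesa type. */
struct invocation {
   bool is_helper;    /* stands in for gl_HelperInvocation */
   uint32_t result;   /* value returned by the atomic, if any */
   bool result_valid; /* false models the "undef" phi source */
};

/* Stand-in for an atomic intrinsic that returns the previous value. */
static uint32_t
atomic_add(uint32_t *counter, uint32_t value)
{
   uint32_t old = *counter;
   *counter += value;
   return old;
}

/*
 * Lowered form of "result = atomic_add(counter, 1)":
 *
 *    if (!helper) { ssa_1 = atomic_add(...) } else { ssa_2 = undef }
 *    ssa_3 = phi ssa_1, ssa_2
 */
static void
run_lowered_atomic(struct invocation *inv, uint32_t *counter)
{
   assert(inv != NULL && counter != NULL);

   if (!inv->is_helper) {
      inv->result = atomic_add(counter, 1);
      inv->result_valid = true;
   } else {
      inv->result = 0; /* arbitrary placeholder for undef */
      inv->result_valid = false;
   }
}

/* Run a 2x2 quad in order and return the final counter value. */
static uint32_t
run_quad(struct invocation quad[4])
{
   uint32_t counter = 0;

   for (int i = 0; i < 4; i++)
      run_lowered_atomic(&quad[i], &counter);

   return counter;
}
```

With a quad in which two of the four invocations are helpers, run_quad returns 2: only the non-helper invocations touch the counter, and the helpers end up with an invalid (undefined) result instead of a side effect.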

167708 commits