fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-21 12:00:41 +02:00

Author	SHA1	Message	Date
Pavel Ondračka	7c291fca15	r300: remove most of backend contant folding This is now done in NIR. The remaining one for ADD + 0 to MOV is kept until we move some remaining part of FS lowering to NIR. There single regressions is in one d3d->glsl shader from Wine. Wine sets invariant for glPosition which translates to exact bit for all calculations leading to it (or the TGSI PRECISE flag). r300 backend ignores is completelly, so removing the backend optimizations should even make us more correct in this regards. RV530: total instructions in shared programs: 130705 -> 130706 (<.01%) instructions in affected programs: 16 -> 17 (6.25%) helped: 0 HURT: 1 RV370: no change Reviewed-by: Filip Gawin <filip.gawin@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23927>	2023-07-05 18:34:37 +00:00
Pavel Ondračka	41f1dd89a3	r300: add some early safe bool lowering This lowers some of the bool-producing comparisons and following bcsels if the bool comparison results is only used in the bcsel. This is a temporary solution before we can fork ntt and optimize the pass sequence there. Right now if we have something like bcsel(a,b,0.0) we lower it to flrp in nir_lower_bool_to_float. The flrp goes to backend where it will be lowered to 2 MADs. However in this case with one of the arguments being a constant one MAD is enough. The backend can figure this out in the constant folding pass, however this is actually one of the last things we need it for. So if we do early translation of the bcsels, than the algebraic pass can clean it up and we can remove more backend code in the next patch. no significant change with RV370 shader-db: total instructions in shared programs: 82497 -> 82496 (<.01%) instructions in affected programs: 1029 -> 1028 (-0.10%) helped: 4 HURT: 3 total temps in shared programs: 12351 -> 12355 (0.03%) temps in affected programs: 10 -> 14 (40.00%) helped: 0 HURT: 4 Reviewed-by: Filip Gawin <filip.gawin@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23927>	2023-07-05 18:34:37 +00:00
Pavel Ondračka	0bf6dcb785	r300: lower undefs to zero They will get translated to read from random register otherwise, which is not problematic per se, but they will not be regalloced and if the initial register index was too high, we can fail the shader compilation because we think we run out of registers. Almost no effect with shader-db on RV530: total instructions in shared programs: 130707 -> 130705 (<.01%) instructions in affected programs: 1012 -> 1010 (-0.20%) helped: 2 HURT: 1 Reviewed-by: Filip Gawin <filip.gawin@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23927>	2023-07-05 18:34:37 +00:00
Yonggang Luo	ba83c1e254	radeonsi: Use ALIGN_POT instead ALIGN_TO ALIGN_POT would be a bit faster as it's have no divide arithmetic Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23987>	2023-07-05 18:04:27 +00:00
Friedrich Vock	4880c827d6	radv: Re-enable RT pipeline capture/replay handles Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	169583a4ad	radv/rt: Rework radv_GetRayTracingCaptureReplayShaderGroupHandlesKHR Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	fccf6fbeec	radv/rt: Replay shader allocations according to capture/replay handle Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	3e9bd821f1	radv/rt: Associate capture/replay handles with stages For stages where the capture/replay handle is only known after compiling and uploading the shader, the shader needs to be relocated to the VA corresponding to the capture/replay address. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	eee0068943	radv/rt: Only compare the non-recursive capture/replay handle Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	59d269c48e	radv: Add radv_rt_capture_replay_handle Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	e3bd54d2a8	radv: Add support for creating capture/replay shaders Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	4f192b9af4	radv: Split up implementation of radv_shader_create This will make it easy to re-use the split-up parts for creating replayed shaders. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	51f2fa1a5e	radv: Break up radv_shader_nir_to_asm radv_shader_nir_to_asm actually had 3 functions: compiling the NIR to asm, uploading the shaders and generating debug info for them. This reduces the functionality of radv_shader_nir_to_asm to only compile NIR to asm. Uploading the shader and generating debug info is split into separate functions. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:25 +00:00
Friedrich Vock	878a731c77	radv: Add radv_shader_reupload Used for relocating RT shaders with capture/replay. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:24 +00:00
Friedrich Vock	744357477e	radv: Add utilities to serialize and deserialize shader allocation info Can be used to capture/replay an arbitrary sequence of shader allocations while preserving VAs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:24 +00:00
Friedrich Vock	d23e41de6c	radv: Add option to allocate shaders in replayable VA range Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:24 +00:00
Friedrich Vock	ec9f5b7777	radv: Move shader arena allocation to a separate function The arena size is also needed for capture/replay. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:24 +00:00
Friedrich Vock	91241014e8	radv: Add radv_shader_free_list Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23516>	2023-07-05 15:58:24 +00:00
José Roberto de Souza	5cc9569b5b	iris: Convert slab address to canonical This was the only missing case of bo->address that could possibly not formated as canonical. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23994>	2023-07-05 13:18:50 +00:00
Alyssa Rosenzweig	0e7e6f2a0d	nir: Fix breaking in nir_foreach_phi(_safe) When I reading through some of my older commits I noticed that `break` in `nir_foreach_phi` is broken because I used the two-loop trick wrong. Rewrite the macros to fix this, and also to generally be a lot cleaner. Fixes: `7dc297cc14` ("nir: Add nir_foreach_phi(_safe) macro") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23957>	2023-07-05 08:42:23 -04:00
Michael Tretter	ee62f4629a	kmsro: assert that scanout refcount is larger than 0 The dumb buffer backing the renderonly_scanout is only destroyed if the refcount reaches zero. If a driver does not correctly initialize the refcount, the refcount may be negative and the buffer will never be freed. Add an assert to ensure that drivers correctly initialize the refcount. Signed-off-by: Michael Tretter <m.tretter@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23743>	2023-07-05 12:10:18 +00:00
Michael Tretter	279d08a18a	panfrost: remove BO from cache before closing GEM If the GEM is closed before setting the BO in the sparse array to zero, a newly allocated GEM may be associated with a stale BO that is left in the cache reusing an old BO. Zero the BO before closing the GEM to make sure that the BO is removed from the cache and won't be associated with a different GEM. Signed-off-by: Michael Tretter <m.tretter@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23744>	2023-07-05 11:50:59 +00:00
Boris Brezillon	7a0033a1c9	winsys/panfrost: Make sure we reset scanout on error in create_kms_dumb_buffer_for_resource() If an error occured, make sure we reset the scanout object before leaving, otherwise the next user of this handle will hit the refcnt == 0 assert. Fixes: `ad4d7ca833` ("kmsro: Fix renderonly_scanout BO aliasing") Cc: mesa-stable Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23746>	2023-07-05 06:22:22 +00:00
Boris Brezillon	45a27adc3b	renderonly: Make sure we reset scanout on error in create_kms_dumb_buffer_for_resource() If an error occured, make sure we reset the scanout object before leaving, otherwise the next user of this handle will hit the refcnt == 0 assert. Fixes: `ad4d7ca833` ("kmsro: Fix renderonly_scanout BO aliasing") Cc: mesa-stable Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23746>	2023-07-05 06:22:21 +00:00
Boris Brezillon	8568a46c1c	renderonly: Fix potential NULL deref in the error path scanout can be NULL. Fixes: `ad4d7ca833` ("kmsro: Fix renderonly_scanout BO aliasing") Cc: mesa-stable Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23746>	2023-07-05 06:22:21 +00:00
Robert Beckett	8087f784e4	winsys/panfrost: Fix a scanout resource leak Use ro->bo_map to alloc scanout and make sure we initialize the refcnt to one. This fixes leaking the scanout object and the underlying dumb-buffer. Fixes: `ad4d7ca833` ("kmsro: Fix renderonly_scanout BO aliasing") Cc: mesa-stable Signed-off-by: Robert Beckett <bob.beckett@collabora.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23746>	2023-07-05 06:22:21 +00:00
Mike Blumenkrantz	46b488151f	aux/trace: fix bindless texture dumping cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23971>	2023-07-05 05:32:21 +00:00
Alyssa Rosenzweig	a28f9738e1	asahi: Use txf_ms for MSAA background programs Fixes regression in assorted dEQP tests including: dEQP-EGL.functional.color_clears.multi_context.gles3.rgba8888_window Fixes: `d4424950ac` ("asahi: Use txf for background program") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	02ac7305a0	agx: Don't leak ssa_to_reg_out calloc'd in the RA, should be freed in the RA. Identified with valgrind. Fixes: 6b13616cba2 ("agx: Implement vector live range splitting") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	2a334a9f4d	asahi: Take ownership of compute shader NIR Fixes massive leak of compute shader NIR. Identified with valgrind. Yes, this requires casting away const *. Yes, Gallium is dumb. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	a004d96874	asahi: Use ralloc harder Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	56461bc0a2	asahi: Fix scissor_culls_everything check Account for the possibility that the scissor is outside the render area. Fixes the usual assertion fail: glcts: ../src/gallium/drivers/asahi/agx_state.c:1015: agx_upload_viewport_scissor: Assertion `maxx > minx && maxy > miny' failed. on the following dEQP tests with my conformance build: dEQP-GLES3.functional.fragment_ops.scissor.outside_render_line dEQP-GLES3.functional.fragment_ops.scissor.outside_render_point dEQP-GLES3.functional.fragment_ops.scissor.outside_render_tri Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	98de1b1b95	asahi: Assert we don't transition shared resources This is an invariant maintained by all current callers and subtly required for the BO swapping to work. Assert it to make it obvious. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Asahi Lina	1140bdb783	asahi: Arrange VS varyings in the correct order The GPU ABI requires varyings to be grouped as follows: - Position - Smooth shaded fp32 - Flat shaded fp32 - Linear shaded fp32 - Smooth shaded fp16 - Flat shaded fp16 - Linear shaded fp16 - Point size Use the flat shaded mask info we now have in the vertex shader key to sort things properly, and pass the counts to the hardware. FP16 is still TODO. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Asahi Lina	2055e03243	asahi: Add flat/linear shaded varyings mask to the VS shader key We need this information in order to arrange varyings properly, which means we need shader variants. Add this to the shader key, taking the value from the FS input info. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Asahi Lina	4a65b4bb14	asahi: Fix type confusion for fragment shader keys We can't attempt to access the fs union member if this is not a FS. That worked so far since there wasn't a VS shader key at all, but we're about to introduce one. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Asahi Lina	90834353a1	asahi: Gather flat/linear shaded input info from uncompiled FS We need to propagate shading model metadata from the FS to the VS in order to correctly lay out the uniforms in the right order. This means we need VS variants depending on this data. We could use the existing shader info structure, but that applies to compiled shaders which would introduce a dependency from the VS compile to the FS compile. This information does not change with FS variants, so we can introduce an agx_uncompiled_shader_info structure and gather it early at precompilation time. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Asahi Lina	49994dc8cb	asahi: Identify the separate varying count fields Flat/goraud/linear and 32/16 need to be specified separately. This change identifies the new fields but should be a functional no-op. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	d9bf52e00f	agx: Assert that barriers are not used in the preamble It is nonsensical and confuses the hardware. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	9bf7d14b2c	agx: Use nir_opt_shrink_vectors Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	c81a14c754	agx: Use nir_opt_shrink_stores This especially helps with image stores, where we otherwise insert a bunch of pointless moves to collect a vector even when we know the format only has a single channel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	45cbe12282	asahi: Remove ; in perf_debug_ctx Otherwise `if(x) perf_debug_ctx(); else if (y) ...` doesn't work. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	b57faede71	asahi: Identify PBE::sRGB flag Needed to write out sRGB images correctly. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	6dc6991930	asahi: Rename 'Render Target' to 'PBE' It's used for all PBE operations, including regular image writes, so use the more general name. Compare the powervr driver. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	75b5bf8dbc	asahi: Strip ? in GenXML Sometimes it's nice to have boolean flags with ? in the name, allow this by stripping ? when generating the sanitized C name. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Asahi Lina	850380cbf5	asahi: match_soa: Treat offsets as signed An offset may be negative, indexing backwards from the array base. When we right shift an offset by the format shift, we need to use a signed shift to ensure that the resulting offset is still negative. Fixes Nautilus faults/pink crashes. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	a90b0743f3	agx: Smarten discard_agx -> sample_mask lowering In 97a1bbeaf26 ("agx: Fix discards"), we made our discard lowering very simple, since we had just discovered the underlying instruction behaviour and needed a hotfix for misrendering in the wild. Now that we understand the behaviour, we can do better. There are two potential performance issues with the lowering in that commit: 1. It generates extra sample_mask instructions. For a shader that has a single discard_if at root level, it would generate two instructions sample_mask foo, 0 sample_mask ~0, ~0 rather than a single sample_mask ~0, ~foo 2. It runs depth/stencil testing/updates at the end of the shader, even when it could be run immediately after the discard. This might cause pipeline stalls. The solution is to insert the "trigger testing" sample_mask instruction as soon after the "discard" instruction as possible, fusing them if they would be next to each other. There are two cases: 1. The last discard is executed unconditionally. In this case, we can test immediately after, unconditionally, and fuse together. 2. The last discard is executed conditionally. In this case, we test in the first unconditional block after the discard. Example shader: ... loop { if .. { loop { discard_if <-- discard here ... } .. } ... } <---- we test here ... store_output Together this covers all the usual patterns for single-sampled discard. We could still do better with multisampling, but whatever. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Alyssa Rosenzweig	5a4c9136cd	agx: Add algebraic opt to help with discard lowering When lowering discards, it will be convenient to generate the pattern: (cond ? 255 : 0) ^ 255 Add rules to optimize that to (cond ? 0 : 255) This is not part of the main algebraic optimizer since this lowering happens late. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23998>	2023-07-05 05:11:49 +00:00
Mike Blumenkrantz	54bd804ad3	zink: don't destroy swapchain on initial CreateSwapchainKHR fail this used to be correct at some point but now it no longer is cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23970>	2023-07-05 04:22:23 +00:00
Dave Airlie	2fc2597fe5	gallivm: make block_size use discrete values. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23997>	2023-07-05 11:40:44 +10:00

1 2 3 4 5 ...

173811 commits