fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-04 20:38:06 +02:00

Author	SHA1	Message	Date
Mike Blumenkrantz	55b57db84d	zink: add vk/spirv caps/extension for shader LAYER variable this is required if gl_Layer is used outside of GEOMETRY stage Fixes: `c77df59c9e` ("zink: export PIPE_CAP_TGSI_VS_LAYER_VIEWPORT") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9410>	2021-03-05 03:45:51 +00:00
Dave Airlie	1186fbcdf1	lavapipe: fix dynamic viewport/scissor pipeline emission Just fixup the tests for when the pipeline vp/scissors are emitted. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9422>	2021-03-05 03:34:47 +00:00
Dave Airlie	6bcd304278	lavapipe: fix pipeline vp/scissor mixup. Not copying all the scissors caused dEQP-VK.pipeline.extended_dynamic_state.two_draws_dynamic.2_viewports to fail but thah test pointlessly relies on KHR_multiview (cts issue filed). Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Fixes: `b38879f8c5` ("vallium: initial import of the vulkan frontend") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9422>	2021-03-05 03:34:47 +00:00
Iván Briano	194e477615	anv: don't advertise mipmaps for linear 3D surfaces on BDW Prior to SKL, the mipmaps for 3D surfaces are laid out in a way that make it impossible to represent in the way that VkSubresourceLayout expects. Since we can't tell users how to make sense of them, don't report them as available. "Fixes" dEQP-VK.image.subresource_layout.3d.* Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9419>	2021-03-04 16:23:23 -08:00
Ian Romanick	2c4fd24c01	nir/algebraic: Apply addition property of equality to the other ordering too Inequality comparison operations are not commutative, so `foo < bar` and `bar < foo` both have to be explicitly listed. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> All Intel GPUs had similar results. (Ice Lake shown) total instructions in shared programs: 20027051 -> 20026899 (<.01%) instructions in affected programs: 37181 -> 37029 (-0.41%) helped: 85 HURT: 0 helped stats (abs) min: 1 max: 20 x̄: 1.79 x̃: 1 helped stats (rel) min: 0.05% max: 6.78% x̄: 0.92% x̃: 0.68% 95% mean confidence interval for instructions value: -2.42 -1.15 95% mean confidence interval for instructions %-change: -1.23% -0.61% Instructions are helped. total cycles in shared programs: 979762793 -> 979753527 (<.01%) cycles in affected programs: 2653905 -> 2644639 (-0.35%) helped: 104 HURT: 50 helped stats (abs) min: 1 max: 1048 x̄: 119.99 x̃: 11 helped stats (rel) min: <.01% max: 9.88% x̄: 0.77% x̃: 0.20% HURT stats (abs) min: 1 max: 734 x̄: 64.26 x̃: 8 HURT stats (rel) min: <.01% max: 3.06% x̄: 0.36% x̃: 0.10% 95% mean confidence interval for cycles value: -98.65 -21.68 95% mean confidence interval for cycles %-change: -0.66% -0.15% Cycles are helped. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9374>	2021-03-04 22:50:53 +00:00
Ian Romanick	33031bdab6	nir/algebraic: Apply addition property of equality more conservatively This allows a lot more CSE. Depending on where the addition and the comparison are scheduled, it may also reduce register pressure by reducing the live range of the addends. Across all the platforms, the shaders affected for spills or fills were all fragment shaders from Dirt Rally. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Tiger Lake and Ice Lake had similar results. (Tiger Lake shown) total instructions in shared programs: 21043103 -> 21038804 (-0.02%) instructions in affected programs: 892878 -> 888579 (-0.48%) helped: 1549 HURT: 724 helped stats (abs) min: 1 max: 225 x̄: 4.14 x̃: 2 helped stats (rel) min: 0.05% max: 11.18% x̄: 1.04% x̃: 0.78% HURT stats (abs) min: 1 max: 71 x̄: 2.93 x̃: 1 HURT stats (rel) min: 0.07% max: 6.90% x̄: 0.80% x̃: 0.56% 95% mean confidence interval for instructions value: -2.33 -1.45 95% mean confidence interval for instructions %-change: -0.50% -0.40% Instructions are helped. total cycles in shared programs: 855054155 -> 855757566 (0.08%) cycles in affected programs: 58275918 -> 58979329 (1.21%) helped: 1213 HURT: 1680 helped stats (abs) min: 1 max: 107405 x̄: 1684.00 x̃: 10 helped stats (rel) min: <.01% max: 38.09% x̄: 1.51% x̃: 0.25% HURT stats (abs) min: 1 max: 126632 x̄: 1634.59 x̃: 12 HURT stats (rel) min: <.01% max: 85.91% x̄: 2.75% x̃: 0.49% 95% mean confidence interval for cycles value: -98.06 584.35 95% mean confidence interval for cycles %-change: 0.71% 1.22% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 9843 -> 9771 (-0.73%) spills in affected programs: 72 -> 0 helped: 5 HURT: 0 total fills in shared programs: 9600 -> 9451 (-1.55%) fills in affected programs: 149 -> 0 helped: 5 HURT: 0 LOST: 14 GAINED: 9 Skylake total instructions in shared programs: 18185074 -> 18183866 (<.01%) instructions in affected programs: 575180 -> 573972 (-0.21%) helped: 1286 HURT: 468 helped stats (abs) min: 1 max: 15 x̄: 1.55 x̃: 1 helped stats (rel) min: 0.03% max: 4.08% x̄: 0.67% x̃: 0.65% HURT stats (abs) min: 1 max: 8 x̄: 1.69 x̃: 1 HURT stats (rel) min: 0.13% max: 7.69% x̄: 0.87% x̃: 0.45% 95% mean confidence interval for instructions value: -0.77 -0.60 95% mean confidence interval for instructions %-change: -0.30% -0.22% Instructions are helped. total cycles in shared programs: 960518105 -> 960608234 (<.01%) cycles in affected programs: 42536073 -> 42626202 (0.21%) helped: 1210 HURT: 1714 helped stats (abs) min: 1 max: 7015 x̄: 123.41 x̃: 10 helped stats (rel) min: <.01% max: 33.76% x̄: 1.32% x̃: 0.26% HURT stats (abs) min: 1 max: 14474 x̄: 139.71 x̃: 14 HURT stats (rel) min: <.01% max: 58.94% x̄: 2.00% x̃: 0.44% 95% mean confidence interval for cycles value: 4.02 57.63 95% mean confidence interval for cycles %-change: 0.43% 0.82% Cycles are HURT. LOST: 16 GAINED: 42 Broadwell total instructions in shared programs: 17856880 -> 17852158 (-0.03%) instructions in affected programs: 564836 -> 560114 (-0.84%) helped: 1243 HURT: 418 helped stats (abs) min: 1 max: 115 x̄: 4.36 x̃: 1 helped stats (rel) min: 0.03% max: 9.67% x̄: 0.90% x̃: 0.67% HURT stats (abs) min: 1 max: 8 x̄: 1.67 x̃: 1 HURT stats (rel) min: 0.14% max: 7.69% x̄: 0.89% x̃: 0.46% 95% mean confidence interval for instructions value: -3.45 -2.23 95% mean confidence interval for instructions %-change: -0.51% -0.38% Instructions are helped. total cycles in shared programs: 1031140321 -> 1029856892 (-0.12%) cycles in affected programs: 66986946 -> 65703517 (-1.92%) helped: 1084 HURT: 1653 helped stats (abs) min: 1 max: 415168 x̄: 1835.32 x̃: 10 helped stats (rel) min: <.01% max: 57.16% x̄: 1.19% x̃: 0.28% HURT stats (abs) min: 1 max: 43930 x̄: 427.14 x̃: 12 HURT stats (rel) min: <.01% max: 57.53% x̄: 1.32% x̃: 0.39% 95% mean confidence interval for cycles value: -915.76 -22.07 95% mean confidence interval for cycles %-change: 0.17% 0.47% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total spills in shared programs: 20891 -> 20335 (-2.66%) spills in affected programs: 1567 -> 1011 (-35.48%) helped: 70 HURT: 0 total fills in shared programs: 27307 -> 25905 (-5.13%) fills in affected programs: 5381 -> 3979 (-26.05%) helped: 71 HURT: 0 LOST: 17 GAINED: 20 Haswell total instructions in shared programs: 16411850 -> 16409414 (-0.01%) instructions in affected programs: 602666 -> 600230 (-0.40%) helped: 1152 HURT: 781 helped stats (abs) min: 1 max: 103 x̄: 3.59 x̃: 1 helped stats (rel) min: 0.03% max: 8.61% x̄: 0.85% x̃: 0.65% HURT stats (abs) min: 1 max: 41 x̄: 2.18 x̃: 1 HURT stats (rel) min: 0.12% max: 7.69% x̄: 0.88% x̃: 0.69% 95% mean confidence interval for instructions value: -1.74 -0.78 95% mean confidence interval for instructions %-change: -0.21% -0.10% Instructions are helped. total cycles in shared programs: 1035338781 -> 1036977801 (0.16%) cycles in affected programs: 68961096 -> 70600116 (2.38%) helped: 1246 HURT: 2206 helped stats (abs) min: 1 max: 392022 x̄: 1040.28 x̃: 14 helped stats (rel) min: <.01% max: 56.44% x̄: 2.32% x̃: 0.38% HURT stats (abs) min: 1 max: 68630 x̄: 1330.56 x̃: 18 HURT stats (rel) min: <.01% max: 69.97% x̄: 3.31% x̃: 0.61% 95% mean confidence interval for cycles value: 90.43 859.17 95% mean confidence interval for cycles %-change: 1.02% 1.54% Cycles are HURT. total spills in shared programs: 17805 -> 17457 (-1.95%) spills in affected programs: 1202 -> 854 (-28.95%) helped: 34 HURT: 31 total fills in shared programs: 20939 -> 20387 (-2.64%) fills in affected programs: 2702 -> 2150 (-20.43%) helped: 34 HURT: 31 LOST: 24 GAINED: 45 Ivy Bridge and earlier Intel GPUs had similar results. (Ivy Bridge shown) total instructions in shared programs: 15515912 -> 15516757 (<.01%) instructions in affected programs: 396569 -> 397414 (0.21%) helped: 578 HURT: 858 helped stats (abs) min: 1 max: 9 x̄: 1.32 x̃: 1 helped stats (rel) min: 0.04% max: 3.70% x̄: 0.65% x̃: 0.65% HURT stats (abs) min: 1 max: 11 x̄: 1.87 x̃: 1 HURT stats (rel) min: 0.08% max: 12.90% x̄: 0.95% x̃: 0.53% 95% mean confidence interval for instructions value: 0.47 0.70 95% mean confidence interval for instructions %-change: 0.24% 0.37% Instructions are HURT. total cycles in shared programs: 584395455 -> 584466352 (0.01%) cycles in affected programs: 20346570 -> 20417467 (0.35%) helped: 1192 HURT: 1896 helped stats (abs) min: 1 max: 4108 x̄: 123.27 x̃: 14 helped stats (rel) min: <.01% max: 37.20% x̄: 2.27% x̃: 0.46% HURT stats (abs) min: 1 max: 3698 x̄: 114.89 x̃: 19 HURT stats (rel) min: <.01% max: 70.28% x̄: 3.02% x̃: 0.71% 95% mean confidence interval for cycles value: 10.75 35.16 95% mean confidence interval for cycles %-change: 0.73% 1.23% Cycles are HURT. LOST: 20 GAINED: 12 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9374>	2021-03-04 22:50:53 +00:00
Kenneth Graunke	206495cac4	iris: Enable u_threaded_context This implements most of the remaining u_threaded_context support. Most of the heavy lifting was done in the previous patches which fixed things up for the new thread safety requirements. Only a few things remain. u_threaded_context support can be disabled via an environment variable: GALLIUM_THREAD=0 On Felix's Tigerlake with the GPU at fixed frequency, enabling u_threaded_context improves performance of several games: - Civilization VI: +17% - Shadow of Mordor: +6% - Bioshock Infinite +6% - Xonotic: +6% Various microbenchmarks improve substantially as well: - GfxBench5 gl_driver2: +58% - SynMark2 OglBatch6: +54% - Piglit drawoverhead: +25% Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8964>	2021-03-04 13:59:21 -08:00
Kenneth Graunke	c133d0930f	iris: Use thread safe slab allocators in transfer_map handling pipe->transfer_map can be called from u_threaded_context's thread rather than the driver thread. We need to use two different slab allocators, one for each thread. transfer_unmap, on the other hand, is only ever called from the driver thread. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8964>	2021-03-04 13:59:21 -08:00
Kenneth Graunke	1b1c857248	iris: Make various classes inherit from u_threaded_context base classes u_threaded_context requires various objects to inherit from a new threaded_foo base class rather than directly from pipe_foo. This patch does most of the mechanical changes required for that. It also initializes the new threaded_resource fields. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8964>	2021-03-04 13:59:21 -08:00
Kenneth Graunke	3358c7125a	iris: Use different shader uploaders for precompile vs. draw time When we enable u_threaded_context, the pipe->create_*_state hooks (precompile variants) are going to be called from one thread, while iris_update_compiled_shaders (on-the-fly variants) are going to be called from a driver thread. BLORP shaders also happen from clear, blit, and so on in the driver thread. u_upload_mgr isn't thread-safe, so use an uploader for each purpose. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8964>	2021-03-04 13:59:21 -08:00
Kenneth Graunke	ec0d61c14c	iris: Support rebinding of stream output targets This enables us to replace the backing storage of resources that have been used as stream output targets, in case we're invalidating their entire contents. This can avoid stalls. We simply hadn't supported it because it was going to be tricky to re-emit 3DSTATE_SO_BUFFER without screwing up "reset offset to zero" vs. "keep appending". But that should be working fine with the previous patch's refactor. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8964>	2021-03-04 13:59:21 -08:00
Kenneth Graunke	08e04ddd2c	iris: Rework zeroing of stream output buffer offsets The previous mechanism was a bit fragile. We stored the zero offset in the pre-baked packet, and used an flag to override 0xFFFFFFFF (append) offsets until our first emit - then prohibited anyone from trying to re-emit the packet by flagging IRIS_DIRTY_SO_BUFFERS, because that would re-emit the version with the zeroing of the offset. Now, we always store 0xFFFFFFFF in the pre-baked packet, and use a flag to override it to zero on the first emit. That way, we can re-emit that packet at any time, and it'll just keep appending. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8964>	2021-03-04 13:59:21 -08:00
Kenneth Graunke	e40fafa991	iris: Defer stream output target space allocation until set time In the future, Marek is planning to make u_threaded_context call create_stream_output_target() from a different thread than the main driver thread, which means that we can't safely use uploaders there. To prepare for this eventual future, just defer the allocation of the offset BO 'til later. It's a very small amount of overhead. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8964>	2021-03-04 13:59:20 -08:00
Kenneth Graunke	5659460af4	iris: Defer uploading of surface states With u_threaded_context, create_surface and create_sampler_view will be called from a different thread than the driver thread. They aren't allowed to access the context, which means that they can't use the uploaders there to upload our SURFACE_STATE entries. Thanks to backing-storage replacement and iris_rebind_buffer, we already reworked things to maintain CPU-side copies of the SURFACE_STATE entries and added the ability to upload or re-upload them later. So we can skip the upload at object creation time, and add a simple resource-is-NULL check at binding table upload time to ensure that they get uploaded by the time we need them. (They might get uploaded earlier due to rebinds or clear color updates, but this is the last moment to do so.) Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8964>	2021-03-04 13:59:20 -08:00
Eric Anholt	3bdd39f03c	lima: avoid stomping over bound shader state when creating new shaders It shouldn't affect bound program state, and the current context state shouldn't be relevant for shader creation precompiles anyway (level load isn't going to have the eventual set of sampler views bound when you go to draw with that shader). Closes: #4306 Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9089>	2021-03-04 18:34:35 +00:00
Eric Anholt	4ac3f85054	lima: upload the shader to a BO at shader creation No need to conditionally upload later. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9089>	2021-03-04 18:34:35 +00:00
Eric Anholt	5a550c8dc7	lima: don't look at dirty bits for setup of FS key You always have to populate the key with the right texture swizzles, even if textures haven't changed since binding a new shader. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9089>	2021-03-04 18:34:35 +00:00
Eric Anholt	d4f706389c	lima: stop encoding the texture format in the shader key We can compose the swizzles at sampler view creation time, saving recompiles on texture format changes. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9089>	2021-03-04 18:34:34 +00:00
Lionel Landwerlin	8023d6de20	anv: implement INTEL_DEBUG=submit Name all the BOs! v2: Fix 32bit build issue (Thanks Marge!) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5736>	2021-03-04 19:46:24 +02:00
Rohan Garg	c6eb84ff30	virgl: Add support for querying detailed memory info This allows for virgl guests to expose GL_NVX_gpu_memory_info and GL_ATI_meminfo when the extensions are supported on the host. Signed-off-by: Rohan Garg <rohan.garg@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9337>	2021-03-04 17:14:14 +01:00
Jason Ekstrand	1e53e0d2c7	intel/mi_builder: Drop the gen_ prefix mi_ is already a unique prefix in Mesa so the gen_ isn't really gaining us anything except extra characters. It's possible that MI_ may conflict a tiny bit with GenXML but it doesn't seem to be a problem today and we can deal with that in the future if it's ever an issue. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9393>	2021-03-04 15:14:27 +00:00
Jason Ekstrand	6d522538b6	intel: Rename gen_mi_builder.h to mi_builder.h Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9393>	2021-03-04 15:14:27 +00:00
Danylo Piliaiev	7e25e5b56f	ir3: disallow moving memory writes over discard Writes to global memory should not be moved over discard, otherwise we could have unintended side-effects or lack of side-effects where they should be observed. Fixes tests: dEQP-VK.rasterization.frag_side_effects.color_at_beginning.kill dEQP-VK.rasterization.frag_side_effects.color_at_end.kill Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9365>	2021-03-04 11:40:58 +00:00
Juan A. Suarez Romero	7b3b8524ef	ci: Bump deqp to vk-gl-cts 1.2.5.2 Reviewed-by: Eric Anholt <eric@anholt.net> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9369>	2021-03-04 11:09:35 +00:00
Danylo Piliaiev	72a9f315db	ir3: make mark_kill_path exit early if instr is already seen Would bring down its complexity in pathological cases. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9386>	2021-03-04 10:52:06 +00:00
Danylo Piliaiev	9dbb678f5a	ir3: prevent duplication of instruction's dependencies Otherwise mark_kill_path() is happy to take exponential time to finish. It was possible to have such chains: ... stib.base0 imm[0.000000,0,0x0], ssa_233, ssa_234, false-deps:ssa_231, ssa_231 stib.base0 imm[0.000000,0,0x0], ssa_237, ssa_238, false-deps:ssa_235, ssa_235 stib.base0 imm[0.000000,0,0x0], ssa_241, ssa_242, false-deps:ssa_239, ssa_239 stib.base0 imm[0.000000,0,0x0], ssa_245, ssa_246, false-deps:ssa_243, ssa_243 stib.base0 imm[0.000000,0,0x0], ssa_249, ssa_250, false-deps:ssa_247, ssa_247 stib.base0 imm[0.000000,0,0x0], ssa_105, ssa_253, false-deps:ssa_251, ssa_251 stib.base0 imm[0.000000,0,0x0], ssa_109, ssa_256, false-deps:ssa_254, ssa_254 stib.base0 imm[0.000000,0,0x0], ssa_113, ssa_259, false-deps:ssa_257, ssa_257 stib.base0 imm[0.000000,0,0x0], ssa_117, ssa_262, false-deps:ssa_260, ssa_260 stib.base0 imm[0.000000,0,0x0], ssa_265, ssa_266, false-deps:ssa_263, ssa_263 stib.base0 imm[0.000000,0,0x0], ssa_269, ssa_270, false-deps:ssa_267, ssa_267 stib.base0 imm[0.000000,0,0x0], ssa_273, ssa_274, false-deps:ssa_271, ssa_271 ... Fixes tests: dEQP-VK.geometry.layered.cube_array.36_36_12.secondary_cmd_buffer_inherit_framebuffer dEQP-VK.geometry.layered.3d.64_64_8.secondary_cmd_buffer_inherit_framebuffer dEQP-VK.geometry.layered.cube_array.64_64_12.secondary_cmd_buffer_inherit_framebuffer Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9386>	2021-03-04 10:52:06 +00:00
Samuel Pitoiset	517600b4d5	Revert "radv: stop using VM_ALWAYS_VALID on APUs" Disabling VM_ALWAYS_VALID actually hurts more than it helps after doing more testing. Managing the global BO list in userspace is really costly and make a bunch of games CPU bound. I think re-enabling VM_ALWAYS_VALID is a step in the right direction. This reverts commit `6ac6e2fbfb`. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9341>	2021-03-04 09:37:59 +00:00
Gert Wollny	e148d5ec99	r600/sfn: lower intrinsic_load_tess_coord to driver version Fixes KHR-GL45.tessellation_shader.tessellation_shader_tessellation.TCS_TES KHR-GL45.tessellation_shader.tessellation_shader_tessellation.TES Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9373>	2021-03-04 09:14:03 +00:00
Gert Wollny	81b41e0c76	nir: Add r600 specific intrinsic for loading the tesselation coords Only the XY pair is provided directly, the Z value has to be deducted from the primitive type. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9373>	2021-03-04 09:14:03 +00:00
cheyang	6f4c4df6c2	virgl: add astc 2d compressed formats Signed-off-by: cheyang <cheyang@bytedance.com> Signed-off-by: hexin <hexin.op@bytedance.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9306>	2021-03-04 09:03:47 +00:00
Iago Toral Quiroga	c3732ac0d0	broadcom/compiler: be more aggressive skipping unifa writes We had an optimization in place to skip a unifa write if the address happens to be right after the last ldunifa read address, but we can take this further and update the unifa address by emitting ldunifa instructions if needed to skip a unifa write that is close enough. This is because a unifa write involves 4 cycles: 1 for the write and 3 delay slots before we can emit the first ldunifa. So if we have code like this: unifa addr + 0 ldunifa.r0 unifa addr + 12 ldunifa.r1 In practice we end up with QPU like this: unifa addr + 0 nop nop nop ldunifa.r0 unifa addr + 12 nop nop nop ldunifa.r1 And with this patch we get: unifa addr + 0 nop nop nop ldunifa.r0 <--- reads offset 0 ldunifa.- <--- reads offset 4 ldunifa.- <--- reads offset 8 ldunifa.r1 <--- reads offset 12 Of course, QPU scheduling might find ways to fill the NOPs to some extent and remove some of the gains, but generally speaking, this is still usually a win. Going by shader-db results, allowing the next unifa address to be up to 12 bytes after the address resulting from the last ldunifa read shows the best results: total instructions in shared programs: 13817048 -> 13812202 (-0.04%) instructions in affected programs: 602701 -> 597855 (-0.80%) helped: 1750 HURT: 760 Instructions are helped. total uniforms in shared programs: 3795485 -> 3793200 (-0.06%) uniforms in affected programs: 43930 -> 41645 (-5.20%) helped: 898 HURT: 0 Uniforms are helped. total max-temps in shared programs: 2326612 -> 2326621 (<.01%) max-temps in affected programs: 651 -> 660 (1.38%) helped: 10 HURT: 21 Inconclusive result (value mean confidence interval includes 0). total sfu-stalls in shared programs: 30942 -> 30906 (-0.12%) sfu-stalls in affected programs: 627 -> 591 (-5.74%) helped: 186 HURT: 158 Inconclusive result (value mean confidence interval includes 0). total inst-and-stalls in shared programs: 13847990 -> 13843108 (-0.04%) inst-and-stalls in affected programs: 601404 -> 596522 (-0.81%) helped: 1747 HURT: 757 Inst-and-stalls are helped. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9384>	2021-03-04 09:00:15 +01:00
Iago Toral Quiroga	2897a83ff8	broadcom/compiler: drop the destination for unused ldunifa We can't remove unused ldunifa that are not the first or last in a sequence, but we can still ignore their destination to reduce register pressure. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9384>	2021-03-04 09:00:15 +01:00
Timothy Arceri	9d1ef1595c	util/disk_cache: make MESA_DISK_CACHE_READ_ONLY_FOZ_DBS a relative path Rather than passing in full paths this changes things so that we can just pass in filenames relative to the current cache directory. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9279>	2021-03-04 04:07:46 +00:00
Eric Anholt	a8423eb732	ci/turnip: Mark a flaky WSI test. This one has flaked many times at this point, and I've even seen it flake locally. No luck debugging it yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9397>	2021-03-03 23:03:48 +00:00
Rob Clark	f8714b2852	freedreno: Remove dead-cells MBR workaround With threaded-context we won't have a chance to apply the workaround in the backend driver. But the previous commit moves it to a driconf configured workaround in mesa/st, so we can drop this now. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9316>	2021-03-03 22:47:59 +00:00
Rob Clark	e6f2e8b3fc	driconf: Add ignore_map_unsynchronized option Add an option to workaround incorrect unsynchronized VBO updates in Dead-Cells. See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4337 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9316>	2021-03-03 22:47:59 +00:00
Mike Blumenkrantz	3c20b698e2	zink: rewrite macro for getting KHR device functions we have the technology. we can improve our our lives with better macros. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9398>	2021-03-03 17:27:22 -05:00
Rob Clark	910a2464cf	freedreno/a6xx: Fix compile warning Fixes: `79921b81bc` ("freedreno/a6xx: Document threadsize-related fields") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9394>	2021-03-03 22:09:22 +00:00
Rob Clark	8642456472	freedreno: Deduplicate fixup_shader_state() All the ir3 gens had the same thing, time to move it out into a shared helper. The keeping the storage in fdN_context is to avoid namespace clashes between ir3 and ir2. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9394>	2021-03-03 22:09:22 +00:00
Rob Clark	1611693977	freedreno/ir3: Add comments about shader key/gen I had forgotton on which gens these where used on (which is important if you need to know which shader stages use these).. expand the comments a bit. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9394>	2021-03-03 22:09:22 +00:00
Dave Airlie	bc02fc4823	clover: fix array images view creation Found this on top of Karol's patches but it seems like it can just be applied to master. Helps with some cases of kernel_image_methods/test_kernel_image_methods 2Darray Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9381>	2021-03-03 21:59:22 +00:00
Eric Anholt	18be15ad16	ci/zink: Add another primitive restart flake. This one flaked all the way to a run failure in a recent MR of mine. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9396>	2021-03-03 21:49:41 +00:00
Eric Anholt	283a05ddc9	ci/a5xx: Update piglit expectations. The mesa/st shader variants change fixed some fails for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	957132294f	ci/a5xx: Increase the gles3/31 coverage. Now that there's more time available in our budget per board, we can run all of gles31, and half of gles3, instead of 10%. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	1087bf16af	ci/a3xx: Run all of GLES3 dEQP. We're not spending half our time booting any more, so run the other half. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	bb82efa792	ci/a5xx: Run all of gles2 in one job. Now that we're not spending so much time on boot overhead, no need to parallelize. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	bcdfee3bcd	ci/freedreno: Switch the fastboot boards to using nfsroot. This saves time in packing the rootfs, allows for larger rootfses, and avoids the need for webdav. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	5c89f6ed17	ci/freedreno: Also retest when only CI configuration changes. Fixes: `dab845d457` ("ci: Move specific driver testing to separate files in separate dirs.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9314>	2021-03-03 21:05:39 +00:00
Eric Anholt	e2aff7425d	tgsi_exec: Jump over entirely non-taken THEN or ELSE branches. TGSI has these nice labels for us for where to jump in this case, let's use them. Improves piglit arb_shader_image_load_store-shader-mem-barrier runtime massively, though not enough to make the test really reasonable to run. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9347>	2021-03-03 20:47:08 +00:00
Eric Anholt	3429c83f87	tgsi_exec: Roll the loops for condmask handling. No need to hand-unroll this, the compiler will do it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9347>	2021-03-03 20:47:08 +00:00

1 2 3 4 5 ...

136034 commits