fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 22:00:13 +01:00

Author	SHA1	Message	Date
Mike Blumenkrantz	943f6a038d	zink: always set matching resource export type for dmabuf creation both of these need to be set if one is cc: mesa-stable Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13799>	2021-11-17 19:22:02 +00:00
Mike Blumenkrantz	11c79a8bd7	zink: stop using VK_IMAGE_LAYOUT_PREINITIALIZED for dmabuf this is illegal cc: mesa-stable Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13799>	2021-11-17 19:22:02 +00:00
Jason Ekstrand	63baeffc2d	vulkan/sync: Rework asserts a bit ANV currently smashes off the TIMELINE bit depending on whether or not the i915 interface supports them, triggering assert(!type->get_value). Instead of requiring ANV to smash off function pointers, let the extra function pointers through and then assert on the feature bits before the function pointers get used. This should give us roughly the same amount of assert protection while side-stepping the feature disabling problem. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13839>	2021-11-17 16:52:29 +00:00
Filip Gawin	0c74f80645	glsl: fix trivial strict aliasing warning Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13827>	2021-11-17 16:22:10 +00:00
Omar Akkila	58a0d8d0de	llvmpipe: page-align memory allocations Allows memory allocated by llvmpipe_allocate_memory_fd to be mappable to guests in virtualized environments like KVM which requires page-aligned memory. llvmpipe_allocate_memory is updated similarly for consistency. Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13793>	2021-11-17 09:25:37 -05:00
Connor Abbott	23a5f1a5ac	ir3: Stop inserting nops during scheduling Not necessary since nothing uses it anymore. This might have a slight effect on spilling with multiple blocks, but no shader-db difference because nothing spills. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	e0eeba6cbb	ir3/postsched: Only prefer tex/sfu if they are soft-ready Otherwise we schedule an SFU depending on a tex as soon as the tex is scheduled, which is very much not what we want. Note that sstall is helped more than nops are hurt, and the shaders with the largest nop regressions also have sstall helped. However (sy) is also very much helped. total nops in shared programs: 345482 -> 345986 (0.15%) nops in affected programs: 5731 -> 6235 (8.79%) helped: 15 HURT: 81 helped stats (abs) min: 1 max: 9 x̄: 3.27 x̃: 3 helped stats (rel) min: 0.50% max: 16.00% x̄: 8.55% x̃: 10.26% HURT stats (abs) min: 1 max: 72 x̄: 6.83 x̃: 4 HURT stats (rel) min: 0.57% max: 400.00% x̄: 32.50% x̃: 13.16% 95% mean confidence interval for nops value: 3.34 7.16 95% mean confidence interval for nops %-change: 13.07% 39.10% Nops are HURT. total sstall in shared programs: 133804 -> 132381 (-1.06%) sstall in affected programs: 4743 -> 3320 (-30.00%) helped: 68 HURT: 24 helped stats (abs) min: 1 max: 153 x̄: 21.88 x̃: 8 helped stats (rel) min: 1.79% max: 100.00% x̄: 33.20% x̃: 28.00% HURT stats (abs) min: 1 max: 11 x̄: 2.71 x̃: 2 HURT stats (rel) min: 1.02% max: 200.00% x̄: 17.73% x̃: 5.59% 95% mean confidence interval for sstall value: -22.05 -8.89 95% mean confidence interval for sstall %-change: -27.60% -12.22% Sstall are helped. total (ss) in shared programs: 35471 -> 35481 (0.03%) (ss) in affected programs: 462 -> 472 (2.16%) helped: 9 HURT: 15 helped stats (abs) min: 1 max: 2 x̄: 1.11 x̃: 1 helped stats (rel) min: 4.17% max: 33.33% x̄: 14.00% x̃: 7.69% HURT stats (abs) min: 1 max: 3 x̄: 1.33 x̃: 1 HURT stats (rel) min: 1.19% max: 50.00% x̄: 12.27% x̃: 8.33% 95% mean confidence interval for (ss) value: -0.14 0.97 95% mean confidence interval for (ss) %-change: -5.11% 9.94% Inconclusive result (value mean confidence interval includes 0). total (sy) in shared programs: 13522 -> 13288 (-1.73%) (sy) in affected programs: 422 -> 188 (-55.45%) helped: 22 HURT: 1 helped stats (abs) min: 1 max: 21 x̄: 10.68 x̃: 10 helped stats (rel) min: 8.00% max: 94.44% x̄: 56.58% x̃: 56.94% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 25.00% max: 25.00% x̄: 25.00% x̃: 25.00% 95% mean confidence interval for (sy) value: -13.18 -7.17 95% mean confidence interval for (sy) %-change: -65.48% -40.59% (sy) are helped. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	6f5c0d209c	ir3/postsched: Rewrite delay handling Analogous to the pre-RA scheduler. Unfortunately this time it's a bit more involved because we have to correctly handle (rptN), which is already relevant for swz. This means we need the index of the destination register that conflicts with the source register, to handle swz, and we need to expose that part of ir3_delay. But once that's done, we can delete ir3_delay_calc_postra. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	140e117f2b	ir3/delay: Ignore earlier definitions to the same register We have a situation in some skia shaders like: add.f r0.x, ... (rpt2)nop mul.f ..., r0.x sam (xyzw) r0.x, ... rcp ..., r0.x Notice that rcp uses the result of the sam instruction, not the add.f, but we didn't keep track of which instructions kill the sources in ir3_delay, so we'd add an extra nop, resulting in a disagreement betwen ir3_delay and the scheduling graph. Since postsched is correct, fix ir3_delay. This only results in some very slight shader-db changes but keeps the next commit from changing things. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	a54e7baa65	ir3/postsched: Handle sync dependencies better We want to model soft dependencies, but because of how there's only a single bit to wait on all of them, there may be unnecessary delays inserted when a (sy)-consumer follows an unrelated (sy)-producer. Previously there was some code to try to work around this, but we can just model it directly using the sfu_delay and tex_delay cycle counts that we have to maintain anyway and delete it. This also gets rid of the calls to ir3_delay_postra with soft=true which would be more complicated to handle in the next commit. There is a functional change here: the idea of preferring less nop's over critical path length (max_delay) up to 3 nops is kept (and we delete the TODO which is already sort-of resolved by it), but delays due to (ss)/(sy) and nops are now treated equally, rather than always preferring nops over syncs. So if our estimate indicates that scheduling an (ss) consumer will result in a wait of one cycle and there's another instruction that will require one nop, we will treat them otherwise equal and choose based on max_delay instead. This results in more sstall, but the decrease in nops is much greater. total nops in shared programs: 376613 -> 345482 (-8.27%) nops in affected programs: 275483 -> 244352 (-11.30%) helped: 3226 HURT: 110 helped stats (abs) min: 1 max: 78 x̄: 9.73 x̃: 7 helped stats (rel) min: 0.19% max: 100.00% x̄: 19.48% x̃: 13.68% HURT stats (abs) min: 1 max: 16 x̄: 2.43 x̃: 2 HURT stats (rel) min: 0.00% max: 150.00% x̄: 13.34% x̃: 4.36% 95% mean confidence interval for nops value: -9.61 -9.06 95% mean confidence interval for nops %-change: -19.01% -17.78% Nops are helped. total sstall in shared programs: 126195 -> 133806 (6.03%) sstall in affected programs: 79440 -> 87051 (9.58%) helped: 300 HURT: 1922 helped stats (abs) min: 1 max: 15 x̄: 4.72 x̃: 4 helped stats (rel) min: 1.05% max: 100.00% x̄: 17.15% x̃: 14.55% HURT stats (abs) min: 1 max: 29 x̄: 4.70 x̃: 4 HURT stats (rel) min: 0.00% max: 900.00% x̄: 25.38% x̃: 10.53% 95% mean confidence interval for sstall value: 3.22 3.63 95% mean confidence interval for sstall %-change: 17.50% 21.78% Sstall are HURT. total (ss) in shared programs: 35190 -> 35472 (0.80%) (ss) in affected programs: 6433 -> 6715 (4.38%) helped: 163 HURT: 401 helped stats (abs) min: 1 max: 2 x̄: 1.06 x̃: 1 helped stats (rel) min: 1.92% max: 33.33% x̄: 11.53% x̃: 10.00% HURT stats (abs) min: 1 max: 3 x̄: 1.13 x̃: 1 HURT stats (rel) min: 1.56% max: 100.00% x̄: 15.33% x̃: 12.50% 95% mean confidence interval for (ss) value: 0.41 0.59 95% mean confidence interval for (ss) %-change: 6.22% 8.93% (ss) are HURT. total (sy) in shared programs: 13476 -> 13521 (0.33%) (sy) in affected programs: 669 -> 714 (6.73%) helped: 30 HURT: 78 helped stats (abs) min: 1 max: 2 x̄: 1.13 x̃: 1 helped stats (rel) min: 4.00% max: 50.00% x̄: 21.22% x̃: 21.11% HURT stats (abs) min: 1 max: 2 x̄: 1.01 x̃: 1 HURT stats (rel) min: 3.45% max: 100.00% x̄: 31.93% x̃: 25.00% 95% mean confidence interval for (sy) value: 0.23 0.60 95% mean confidence interval for (sy) %-change: 11.19% 23.15% (sy) are HURT. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	b9f61d7287	ir3/postsched: Fix copy-paste mistake Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	d9a91318b1	ir3/sched: Rewrite delay handling The old code walked the instructions between each ready instruction and each of its parents for every instruction, which can quickly become accidentally quadratic. Instead we keep track of the current "instruction pointer" of the to-be-scheduled instruction, and for each ready instruction calculate an "earliest possible IP" which is the IP that needs to be reached before we can schedule it. Because this stays constant as soon as an instruction becomes ready, we never have to recompute it and each call to ir3_delay_calc_prera() becomes a simple comparison and subtract. We only need to iterate over the children and update their earliest_ip when scheduling an instruction, and we already do that in util_day_prune_head() so it should be cheap. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	b8fc7a08f9	util/dag: Add dag_add_edge_max_data This will be useful for when the edge data represents a delay of some sort, like it will with ir3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	508f917d8c	util/dag: Make edge data a uintptr_t Nobody was actually using it as a pointer, and I'm going to introduce a shared function which relies on it not being a pointer so let's fix this once and for all. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Erico Nunes	ee2e14b352	ci: temporarily disable lima CI The lima board farm will be unavailable for a few days, so disable it to avoid CI failures. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13595>	2021-11-17 11:40:19 +00:00
Kenneth Graunke	3b78f17532	iris: Tidy code in iris_use_pinned_bo a bit Now that we aren't short-circuiting most of the code, we should probably reorganize it a little bit. Tagged with fixes just so we pull all the refactors together as one group. Fixes: `b21e916a62` ("iris: Combine iris_use_pinned_bo and add_exec_bo") Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13808>	2021-11-17 02:43:30 -08:00
Kenneth Graunke	6e90984934	iris: Check for cross-batch flushing whenever a buffer is newly written. We need to perform cross-batch flushing if any batch writes to a BO while others refer to it. We checked this case when recording a new BO in the list which we'd never seen before. However, we neglected to handle the case when we already read from a BO, but then began writing to it. That new write may provoke a conflict between existing reads in other batches, so we need to re-check the cross-batch flushing. Caught by Piglit's copyteximage when forcing blits and copies to use a new IRIS_BATCH_BLITTER that isn't upstream yet. But this bug could be provoked by render/compute work today...we just hadn't noticed it. Fixes: `b21e916a62` ("iris: Combine iris_use_pinned_bo and add_exec_bo") Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13808>	2021-11-17 02:43:30 -08:00
Kenneth Graunke	76030964a6	iris: Make a helper function for cross-batch dependency flushing This should have no functional change, but it's tagged with Fixes anyway because it's needed for the bug fix in the next patch. Fixes: `b21e916a62` ("iris: Combine iris_use_pinned_bo and add_exec_bo") Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13808>	2021-11-17 02:43:30 -08:00
Alejandro Piñeiro	cbf0d83eac	v3d,v3dv: move TFU register definition to a common header We are using the same definitions for both OpenGL and Vulkan, so let's move it to common. As we are here we are also adding versioning on the TFU register definition. Those are basically register bit places, so really likely to change between versions. Adding 33 as it is the first version they got defined. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13832>	2021-11-17 11:04:31 +01:00
Samuel Pitoiset	ffbad81305	radv: simplify re-using cache entries in radv_pipeline_cache_insert_shaders() If entry->shaders[i] is NULL, shaders[i] should be also NULL, so the else condition is a no-op. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13823>	2021-11-17 08:15:53 +01:00
Iago Toral Quiroga	836a4b5836	v3dv: fix internal bpp of D/S formats Depth/stencil formats can, at worse (d32/d24s8), be exactly 32bpp, which is the minimum we can program for the internal format. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13816>	2021-11-17 06:57:48 +00:00
Pavel Asyutchenko	8ee7309e57	llvmpipe: enable PIPE_CAP_FBFETCH_COHERENT llvmpipe's fragment shaders are always run sequentially and in API order for a single tile, so it's impossible to have out of order render target writes requiring fetch barriers. Issues fixed in previous commits were actually breaking most piglit/deqp tests for coherent extension variant. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	e403c1c23e	llvmpipe: remove dead args from load_unswizzled_block They were only used in fs_fb_fetch. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	ea6eeb70e6	llvmpipe: fix FB fetch with non 32-bit render target formats Use lp_build_fetch_rgba_soa instead of lp_build_unpack_rgba_soa. This one was failing most of deqp framebuffer_fetch tests. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	2b3a020928	llvmpipe: protect from doing FB fetch of missing buffers Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	3ebd6498c4	llvmpipe: fix gl_FragColor and gl_LastFragData[0] combination Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	b1de61dd38	llvmpipe: fix wrong assumption on FB fetch shader opacity In certain cases variant->opaque could be set to true, which reset command list for tiles fully covered by a triangle with this shader. This is obviously wrong in presence of framebuffer fetch. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Mike Blumenkrantz	86eb1549ef	zink: implement pipe_context::draw_vertex_state rough implementation, but it should be a decent start Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13692>	2021-11-17 03:16:13 +00:00
Vasily Khoruzhick	02e5f4fb10	lima: add more wrap modes Using 1 bit per wrap mode looked very suspicious and after some experiments it turns out it's 3-bit enum. Border color is also here, it sits right after depth field. For some reason it uses 16 bit per channel just like for clear color in RSW GL_CLAMP mode is broken for nearest filter just as on Midgard, so add the same workaround - use GL_CLAMP_TO_EDGE for nearest filter. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Vasily Khoruzhick	cbed4d784e	lima: handle 1D samplers It's just a matter of changing number of dimensions in texture descriptor. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Vasily Khoruzhick	fa86a2a94d	lima: add support for 3D textures It looks like MBS format used by blob doesn't distinguish sampler2D from sampler3D, so load texture instruction is the same for 2D and 3D textures. So all we need to RE is texture descriptor for 3D textures, but blob doesn't implement it, so we need to do some guesswork: - unknown_3_1 looks like a depth since it sits after height/width and always set to 1 - unknown_2_2 is exactly 3 bits and it follows wrap_t, so it must be wrap_r - missing part is texture type for 3D textures. By trial and error it seems to be 4. First bit is only set for cubemap, so it's likely a separate flag, and rest 2 bits look like number of tex dimensions akin to midgard and later (thanks, panfrost!) with 0 for 1D, 1 for 2D and 2 for 3D. Put it all together and we have working 3D textures on lima! Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Mike Blumenkrantz	97b92c9c32	zink: set suballocator bo size to aligned allocation size this is the actual memory size cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13824>	2021-11-16 22:29:20 +00:00
Mike Blumenkrantz	eb6f1d5348	zink: block suballocator caching for swapchain/dmabuf images these have pNext pointers which makes their memory uncacheable cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13824>	2021-11-16 22:29:20 +00:00
Marek Olšák	ba6d389fa7	radeonsi: don't use GS SGPR6 for the small prim cull info use a user SGPR instead. This will be needed in the future. Also don't upload small_prim_precision because it's passed via VS_STATE_BITS. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	0690a44e69	radeonsi: inline declare_vs_specific_input_sgprs I think it was getting a little hard to follow. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	513bd6acca	radeonsi: cull against clip planes, clipvertex, clip/cull distances in shader The downside is that this duplicates shader code for clip/cull distances in both the position and parameter portions of the shader. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	881c459191	radeonsi: unify how ngg_cull_flags are set Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Jesse Natalie	a818f7b686	d3d12: Fix incorrect hash table usage I'd assumed that since insert didn't take a deleter, it was find-or-insert, not insert-or-replace. This caused a bo reference leak if the same bo was used more than once in a batch. Fixes: `fde36d7992` ("d3d12: Don't wait for GPU reads to do CPU reads") Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13819>	2021-11-16 19:27:16 +00:00
Vasily Khoruzhick	764760314d	lima: add native txp support Currently lima uses generic TXP lowering that results in downgrading coords precision to FP16 since we have to do some calculations with coords instead of loading them directly from varying. Mali4x0 has native TXP support, however coords and projector have to come from a single source. Add NIR lowering pass that combines coords and projector into a single backend-specific source and use it instead of generic lowering. Unfortunately this change regresses one test, but it also fails in blob and disassembly is now identical. shader-db diff: total instructions in shared programs: 15623 -> 15603 (-0.13%) instructions in affected programs: 877 -> 857 (-2.28%) helped: 7 HURT: 0 helped stats (abs) min: 2 max: 8 x̄: 2.86 x̃: 2 helped stats (rel) min: 0.87% max: 10.53% x̄: 4.93% x̃: 1.85% 95% mean confidence interval for instructions value: -4.95 -0.76 95% mean confidence interval for instructions %-change: -9.31% -0.55% Instructions are helped. total loops in shared programs: 3 -> 3 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 136 -> 137 (0.74%) spills in affected programs: 0 -> 1 helped: 0 HURT: 1 total fills in shared programs: 598 -> 602 (0.67%) fills in affected programs: 0 -> 4 helped: 0 HURT: 1 Tested-by: Denis Pauk <pauk.denis@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13111>	2021-11-16 19:13:42 +00:00
Rob Clark	fac9d22773	isaspec: Add prototypes for expr evaluators Add function prototypes for generated expr evaluators, to avoid a use- before-declaration issue if an expression references a derived field. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13787>	2021-11-16 18:44:22 +00:00
Ilia Mirkin	aa93896156	freedreno/ir3: adjust condition for when to use ldib We have to use it any time that the image is writable. Otherwise writes from the same invocation won't have posted into the texture cache. See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5629 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13807>	2021-11-16 18:22:29 +00:00
Samuel Pitoiset	011ea32585	nir: fix constant expression of ibitfield_extract This fixes dEQP-VK.graphicsfuzz.cov-condition-bitfield-extract-integer. For example, nir_ibitfield_extract(3, 1, 2) should return 1. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13791>	2021-11-16 17:32:21 +00:00
Jason Ekstrand	8a11d2a31b	vulkan: Add a dummy sync type This is useful in WSI scenarios where you want to trivially signal a fence or semaphore. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	2a910071bc	vulkan,anv: Auto-detect syncobj features Instead of having a bunch of const vk_sync_type for each permutation of vk_drm_syncobj capabilities, have a vk_drm_syncobj_get_type helper which auto-detects features. If a driver can't support a feature for some reason (i915 got timeline support very late, for instance), they can always mask off feature bits they don't want. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	36b4d12f02	anv: Simplify submit_simple_batch() BO waits aren't going away any time soon so using a syncobj here doesn't really gain us anything. It just makes it more complicated. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	623e9ecd6d	anv: Remove unnecessary syncobj wrappers These are entirely unused except for the syncobj in submit_simple_batch. We can use the libdrm wrappers for that as they're basically equivalent and the core Vulkan sync code already depends on them. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	c5ac1d1669	vulkan: Add an emulated binary vk_sync type This wraps a timeline vk_sync type and turns it into a binary one. This is useful for, for instance, driver layered on D3D12. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	36ea90a361	anv: Convert to the common sync and submit framework This is, unfortunately, a large flag-day mega-commit. However, any other approach would likely be fragile and involve a lot more churn as we try to plumb the new vk_fence and vk_semaphore primitives into ANV's submit code before we delete it all. Instead, we do it all in one go and accept the consequences. While this should be mostly functionally equivalent to the previous code, there is one potential perf-affecting change. The command buffer chaining optimization no longer works across VkSubmitInfo structs. Within a single VkSubmitInfo, we will attempt to chain all the command buffers together but we no longer try to chain across a VkSubmitInfo boundary. Hopefully, this isn't a significant perf problem. If it ever is, we'll have to teach the core runtime code how to combine two or more VkSubmitInfos into a single vk_queue_submit. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	87cf858946	anv: Use helpers in util/os_time.h in the query code These are about to be the only use of anv_gettime_ns() so lets switch them over so the next patch can delete the helper outright. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	213edd1841	anv: Remove the last remnants of in/out fences This should have been dropped as part of `d44ea09e61` ("anv: Drop unused sync_file and BO semaphore code") but we missed a few bits. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00

... 25 26 27 28 29 ...

148555 commits