fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 15:40:11 +01:00

Author	SHA1	Message	Date
Iago Toral Quiroga	0ce346368f	v3dv: limit sync for barriers to hw queues selected by source mask Until know when we consumed a barrier we would implement it by setting the serialize flag on a job, which would cause it to be serialized across all hardware queues (CL, CSD, TFU). However, now that we track the source(s) of the barrier, we can restrict this to only the relevant queue(s) instead (multisync path only). It should be noted that we can implement transfers via TFU or CL jobs, so if the source of a barrier is a transfer, we currently synchronize against both the TFU and the CL queues, however, we may be able to more effectively track this in the future to restrict this to just one of the queues. Also, for secondary command buffers we are taking the easy way out and always synchronize against all queues, but we should be able to do the same for secondaries without too much effort. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16743>	2022-05-31 06:06:10 +00:00
Iago Toral Quiroga	ad249e9020	v3dv: track sources of barriers Until now we have been tracking the dstStageMask of barriers (where they are consumed) but not where they are produced (the srcStageMask). With this change we extend our barrier state to keep track of this as well. This allows the driver to have better knowledge of the intended barrier semantics so it can limit the amount of synchronization it does only to the source stages involved with a barrier. We will do this in a later patch. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16743>	2022-05-31 06:06:10 +00:00
Iago Toral Quiroga	f7ce42636c	v3dv: use an explicit struct type to track barrier state Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16743>	2022-05-31 06:06:10 +00:00
Iago Toral Quiroga	eccc0e6a0b	v3dv: only clear BCL barrier state if we don't have pending graphics barriers Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16743>	2022-05-31 06:06:10 +00:00
Iago Toral Quiroga	24ebcbbaa7	v3dv: consume barriers at the right stages Until now, we have always consumed barriers with the next GPU job recorded into the command buffer after the barrier even if the job was not the target of the barrier itself. This works based on the idea that when we consume a barrier in a job we serialize it against all queues, so effectively we are ensuring that whatever came before it has completed, so if the barrier was intended for an even later job, it would have served its purpose anyway. It should be noted that CL jobs are special because they are actually split in two different queues: the binning queue and the render queue, with a dependency between them to ensure render runs after binning. With our current implementation, if we have 3 jobs (A, B, C) and we have a barrier after job A which is intended to block job C on A's completion, with our implementation we would instead block B on A's completion. If C is a CL job, and the barrier was targetting the binning stage then we can have the following scenarios: 1. If B) is a CL job, it will consume the barrier at its binning stage, so we know that B's binning will not start until A has completed. Then C's binning will not start until B's binning has completed, and thus, will not start until A has completed, as intended. 2. If B) is not a CL job, it will consume the barrier and will not start until A has completed, however, C's binning job will be submitted to the binning queue without any sync requirements and since B did not put any jobs in the binning queue it will start as soon as A's binning has completed, but not A's render, which would be incorrect. Further, since `a981ac0539` we now skip consumming BCL barriers if a job does not have draw calls that can be affected by them. In the same scenarios as before, now case 1) would also be problematic, since B may skip the binning sync in that case and start immediately, and since C's binning would be allowe to start immediately after B's binning, there is no guarantee that this doesn't happen in parallel with A's render. With this patch we fix this situation by tracking the intended consumer of each barrier: graphics, compute or transfer, and we make sure to consume them only with jobs that match those semantics. This fixes flakyness in dEQP-VK.device_group.* Fixes: `a981ac0539` ('v3dv: skip binning sync if binning shaders don't access external resources') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16743>	2022-05-31 06:06:10 +00:00
Marek Olšák	05eb9530ca	ac/gpu_info: always retile DCC on gfx10 and newer chips Unaligned DCC doesn't work there. Fixes: `f8cf5ea982` - amd: add support for gfx1036 and gfx1037 chips Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16726>	2022-05-31 05:34:25 +00:00
Vinson Lee	b35991dbcb	amd: Initialize Gfx11Lib members in constructor. Fix defects reported by Coverity Scan. Uninitialized scalar field (UNINIT_CTOR) uninit_member: Non-static class member m_numPkrLog2 is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member m_numSaLog2 is not initialized in this constructor nor in any functions that it calls. Fixes: `4fdf42b3c2` ("amd: import gfx11 addrlib") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16679>	2022-05-31 03:36:53 +00:00
Ben Skeggs	90ff3c350b	nv50/ir: fix ALD/AST where address comes from AL2P Signed-off-by: Ben Skeggs <bskeggs@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16784>	2022-05-31 01:57:16 +00:00
Ben Skeggs	9fd1602fb5	nv50/ir: fix scope of memory ops on ampere Signed-off-by: Ben Skeggs <bskeggs@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16784>	2022-05-31 01:57:16 +00:00
Ben Skeggs	ca04eff600	nv50/ir: fix OUT.FINAL on ampere Signed-off-by: Ben Skeggs <bskeggs@redhat.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16784>	2022-05-31 01:57:16 +00:00
Karol Herbst	e06c373c6b	nv50/ir: recoginse Ampere Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16784>	2022-05-31 01:57:16 +00:00
Hyunjun Ko	16ea41c901	ir3: handle intrinsic_load_draw_id when scanning driver constants Fixes: #6567 Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16769>	2022-05-31 01:33:55 +00:00
Timothy Arceri	00313effdb	nir/gcm: fix pushing instructions into if blocks The previous logic would just set the block to the instructions original location if we couldn't evict it from a loop. For now we only push const loads to a later block inside ifs but we can add more heuristics later. This change helps a hand full of shaders but also stops a CTS regression caused by excess spilling after a series I'm working on to disable more of the GLSL IR optimisation passes. Shader-db results iris (BDW): total instructions in shared programs: 17529759 -> 17529749 (<.01%) instructions in affected programs: 15929 -> 15919 (-0.06%) helped: 5 HURT: 2 helped stats (abs) min: 1 max: 5 x̄: 2.40 x̃: 2 helped stats (rel) min: 0.06% max: 0.15% x̄: 0.11% x̃: 0.12% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.06% max: 0.06% x̄: 0.06% x̃: 0.06% 95% mean confidence interval for instructions value: -3.34 0.49 95% mean confidence interval for instructions %-change: -0.14% 0.02% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 861109994 -> 861099681 (<.01%) cycles in affected programs: 7027698 -> 7017385 (-0.15%) helped: 95 HURT: 72 helped stats (abs) min: 1 max: 7995 x̄: 138.54 x̃: 9 helped stats (rel) min: <.01% max: 15.96% x̄: 0.54% x̃: 0.11% HURT stats (abs) min: 1 max: 474 x̄: 39.56 x̃: 12 HURT stats (rel) min: <.01% max: 1.17% x̄: 0.20% x̃: 0.11% 95% mean confidence interval for cycles value: -159.05 35.54 95% mean confidence interval for cycles %-change: -0.45% 0.01% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 17606 -> 17605 (<.01%) spills in affected programs: 323 -> 322 (-0.31%) helped: 1 HURT: 0 total fills in shared programs: 22599 -> 22598 (<.01%) fills in affected programs: 1348 -> 1347 (-0.07%) helped: 1 HURT: 0 Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14940>	2022-05-31 01:03:43 +00:00
David Heidelberg	2cf7f08b04	ci: traces: temporarily disable nheko trace Disable nheko trace until apitrace gets fixed. apitrace currently fails with this trace, when more than 1 run is requested. Upstream issue: https://github.com/apitrace/apitrace/issues/800 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16774>	2022-05-31 00:00:25 +00:00
Alyssa Rosenzweig	0170500627	pan/bi: Interpolate varyings at 16-bit On Bifrost, we have a single "load float varying" instruction that controls the bit size of the result, allowing us to fold a f2f16 into the load. However, the larger benefit is that 16-bit varying loads are interpolated at 16-bit. Arm claims that the varying unit has a 32-bit data path, allowing 16-bit varyings to be interpolated in half the cycles from 32-bit. This change should therefore improve performance for workloads that are varying units. This means we want to be aggressive about 16-bit varying loads, even if it costs some extra f2f32 instructions. glmark2 total score on Mali-G52 up from 1173fps to 1218fps with particular wins in -brefract, -bshadow, -bjellyfish, and -bshading. total instructions in shared programs: 2432246 -> 2423668 (-0.35%) instructions in affected programs: 516056 -> 507478 (-1.66%) helped: 3641 HURT: 432 helped stats (abs) min: 1.0 max: 12.0 x̄: 2.91 x̃: 2 helped stats (rel) min: 0.08% max: 54.55% x̄: 9.88% x̃: 5.71% HURT stats (abs) min: 1.0 max: 42.0 x̄: 4.71 x̃: 4 HURT stats (rel) min: 0.23% max: 200.00% x̄: 12.58% x̃: 6.37% 95% mean confidence interval for instructions value: -2.21 -2.00 95% mean confidence interval for instructions %-change: -7.92% -7.07% Instructions are helped. total tuples in shared programs: 1941309 -> 1934647 (-0.34%) tuples in affected programs: 353169 -> 346507 (-1.89%) helped: 3233 HURT: 453 helped stats (abs) min: 1.0 max: 14.0 x̄: 2.46 x̃: 2 helped stats (rel) min: 0.12% max: 50.00% x̄: 9.90% x̃: 5.56% HURT stats (abs) min: 1.0 max: 25.0 x̄: 2.85 x̃: 2 HURT stats (rel) min: 0.22% max: 150.00% x̄: 8.96% x̃: 5.26% 95% mean confidence interval for tuples value: -1.89 -1.72 95% mean confidence interval for tuples %-change: -8.01% -7.15% Tuples are helped. total clauses in shared programs: 357354 -> 356610 (-0.21%) clauses in affected programs: 25794 -> 25050 (-2.88%) helped: 994 HURT: 317 helped stats (abs) min: 1.0 max: 3.0 x̄: 1.16 x̃: 1 helped stats (rel) min: 1.49% max: 33.33% x̄: 10.78% x̃: 10.00% HURT stats (abs) min: 1.0 max: 4.0 x̄: 1.31 x̃: 1 HURT stats (rel) min: 1.19% max: 50.00% x̄: 13.56% x̃: 8.33% 95% mean confidence interval for clauses value: -0.63 -0.50 95% mean confidence interval for clauses %-change: -5.63% -4.16% Clauses are helped. total cycles in shared programs: 167697.96 -> 167431.15 (-0.16%) cycles in affected programs: 12638.29 -> 12371.48 (-2.11%) helped: 2652 HURT: 350 helped stats (abs) min: 0.04166399999999726 max: 0.75 x̄: 0.11 x̃: 0 helped stats (rel) min: 0.12% max: 100.00% x̄: 14.39% x̃: 5.04% HURT stats (abs) min: 0.041665999999999315 max: 0.5833329999999997 x̄: 0.11 x̃: 0 HURT stats (rel) min: 0.00% max: 75.00% x̄: 7.90% x̃: 4.71% 95% mean confidence interval for cycles value: -0.09 -0.08 95% mean confidence interval for cycles %-change: -12.56% -11.02% Cycles are helped. total arith in shared programs: 74169.46 -> 73891.71 (-0.37%) arith in affected programs: 13885.87 -> 13608.12 (-2.00%) helped: 3215 HURT: 445 helped stats (abs) min: 0.04166399999999726 max: 0.5416680000000014 x̄: 0.10 x̃: 0 helped stats (rel) min: 0.12% max: 100.00% x̄: 14.16% x̃: 6.67% HURT stats (abs) min: 0.041665999999999315 max: 1.125 x̄: 0.12 x̃: 0 HURT stats (rel) min: 0.00% max: 100.00% x̄: 9.76% x̃: 5.49% 95% mean confidence interval for arith value: -0.08 -0.07 95% mean confidence interval for arith %-change: -11.91% -10.59% Arith are helped. total texture in shared programs: 11936 -> 11931 (-0.04%) texture in affected programs: 20 -> 15 (-25.00%) helped: 10 HURT: 0 helped stats (abs) min: 0.5 max: 0.5 x̄: 0.50 x̃: 0 helped stats (rel) min: 14.29% max: 100.00% x̄: 45.71% x̃: 33.33% 95% mean confidence interval for texture value: -0.50 -0.50 95% mean confidence interval for texture %-change: -73.16% -18.26% Texture are helped. total vary in shared programs: 4180.88 -> 3447.19 (-17.55%) vary in affected programs: 2109.88 -> 1376.19 (-34.77%) helped: 2202 HURT: 39 helped stats (abs) min: 0.0625 max: 1.4375 x̄: 0.34 x̃: 0 helped stats (rel) min: 2.38% max: 66.67% x̄: 40.43% x̃: 50.00% HURT stats (abs) min: 0.125 max: 0.375 x̄: 0.26 x̃: 0 HURT stats (rel) min: 0.00% max: 300.00% x̄: 92.54% x̃: 23.08% 95% mean confidence interval for vary value: -0.34 -0.32 95% mean confidence interval for vary %-change: -39.22% -37.01% Vary are helped. total quadwords in shared programs: 1689664 -> 1684852 (-0.28%) quadwords in affected programs: 265522 -> 260710 (-1.81%) helped: 2864 HURT: 447 helped stats (abs) min: 1.0 max: 14.0 x̄: 2.10 x̃: 2 helped stats (rel) min: 0.15% max: 31.58% x̄: 6.05% x̃: 4.65% HURT stats (abs) min: 1.0 max: 22.0 x̄: 2.67 x̃: 2 HURT stats (rel) min: 0.27% max: 38.46% x̄: 6.79% x̃: 4.55% 95% mean confidence interval for quadwords value: -1.54 -1.37 95% mean confidence interval for quadwords %-change: -4.55% -4.08% Quadwords are helped. total threads in shared programs: 53656 -> 53688 (0.06%) threads in affected programs: 32 -> 64 (100.00%) helped: 32 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% 95% mean confidence interval for threads value: 1.00 1.00 95% mean confidence interval for threads %-change: 100.00% 100.00% Threads are helped. total preloads in shared programs: 116212 -> 103476 (-10.96%) preloads in affected programs: 45222 -> 32486 (-28.16%) helped: 3022 HURT: 11 helped stats (abs) min: 1.0 max: 11.0 x̄: 4.23 x̃: 4 helped stats (rel) min: 7.14% max: 68.75% x̄: 30.39% x̃: 25.00% HURT stats (abs) min: 2.0 max: 4.0 x̄: 3.45 x̃: 4 HURT stats (rel) min: 14.29% max: 50.00% x̄: 25.93% x̃: 25.00% 95% mean confidence interval for preloads value: -4.26 -4.14 95% mean confidence interval for preloads %-change: -30.68% -29.69% Preloads are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Tested-by: Chris Healy cphealy@gmail.com Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16752>	2022-05-30 17:49:44 -04:00
Alyssa Rosenzweig	93f69e4b1c	pan/bi: Model Valhall source formats LD_VAR_BUF instructions on Valhall take a source format, indicating the in-memory format of the varying independent from the register format, which we still model within the compiler for compatibility with Bifrost. (Prior to Valhall, source format is specified in the attribute descriptor as a physical pixel format.) Model this information, allowing us to generate fp16 LD_VAR_BUF instructions correctly on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16752>	2022-05-30 17:49:44 -04:00
Alyssa Rosenzweig	06886c3861	pan/bi: Make LD_VAR w=format instead of w=vecsize Fixes a vector dimension validation failure in dEQP-GLES3.functional.shaders.indexing.varying_array.vec4_static_write_dynamic_read after we enable fp16 varyings. No shader-db changes, as we don't yet support fp16 varyings. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16752>	2022-05-30 17:49:44 -04:00
Alyssa Rosenzweig	a9b13a1867	pan/va: Fill in missing src_flat16 enum Valhall gains(?) the ability to flatshade 16-bit varyings, this is indicated by a particular source format. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16752>	2022-05-30 17:49:44 -04:00
Alyssa Rosenzweig	e898e2466b	pan/bi: Add VAR_TEX fusing unit test As fusing VAR_TEX is an optimization, it's helpful to have unit tests since functional tests won't check that the optimization triggers when expected. Originally written when I was touching the VAR_TEX code. Those changes have since been dropped by the unit test remains useful. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16752>	2022-05-30 17:48:59 -04:00
Konstantin Seurer	1727310ea1	radv: Use vk_buffer Use vk_buffer as a base for radv_buffer and replace manual handling of VK_WHOLE_SIZE with vk_buffer_range. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16764>	2022-05-30 17:43:58 +00:00
Konstantin Seurer	7f1e967c02	vulkan: Use the correct enum in vk_buffer VkBufferCreateFlags is correct. Fixes: `f6ae21b` ("vulkan: Add a base struct for buffers") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16764>	2022-05-30 17:43:58 +00:00
Matt Turner	27f3fb5699	mesa: Drop dead #include "sparc/sparc.h" sparc.h was removed in when the classic drivers were removed. It only contained a prototype for _mesa_init_all_sparc_transform_asm() which wasn't even called by context.c. Remove the #include to fix the build on sparc. Fixes: `e030d5ba8a` ("mesa: Delete libmesa_classic") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6425 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16765>	2022-05-30 17:03:21 +00:00
Rhys Perry	cd2b4c319a	aco/ra: fix usage of invalidated iterator Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `58bd9a379e` ("aco/ra: fix live-range splits of phi definitions") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16454>	2022-05-30 16:41:32 +00:00
Alyssa Rosenzweig	42a4a123a6	pan/bi: Don't allow spilling coverage mask writes The register precolouring logic assumes that coverage masks are always in R60, so spilling them causes incorrect results. We could do better. Fixes on Valhall: dEQP-GLES3.functional.ubo.random.all_per_block_buffers.28 Fixes: `3df5446cbd` ("pan/bi: Simplify register precolouring in the IR") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16748>	2022-05-30 14:00:55 +00:00
Alyssa Rosenzweig	67f5721349	panfrost: Set allow_rotating_primitives On Valhall, the driver should set this flag if the hardware may rotate primitives. This happens if: 1. The rasterization of lines does not matter, AND 2. The provoking vertex does not matter. The first condition we may satisfy by checking for LINES and the second by checking for flat shading. Otherwise, we should set this flag to allow optimizations. This may be more efficient for tiling. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16748>	2022-05-30 14:00:55 +00:00
Alyssa Rosenzweig	e7a7679b9a	panfrost: Set overdraw_alpha[01] flags These basically correspond to the alpha_zero_nop and alpha_one_store flags we already compute and set. Except those flags don't exist on Valhall, so these need to be used instead (on Bifrost, in addition .. unclear why the duplication on Bifrost). Set these flags when we can. Ostensibly this is for performance (neglible improvement on glmark2 score), but mostly I want to get us using the hardware optimally. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16748>	2022-05-30 14:00:55 +00:00
Alyssa Rosenzweig	776fa19db2	panfrost: Allow FPK on Valhall Reuse the logic from Bifrost for a performance win. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16748>	2022-05-30 14:00:55 +00:00
Samuel Pitoiset	2981ee5c7b	radv: init states from VkPipelineFragmentShadingRateState at only one place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Samuel Pitoiset	8fdc4bf3ba	radv: init states from VkPipelineDiscardRectangleState at only one place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Samuel Pitoiset	b36bee672f	radv: init states from VkAttachmentSampleCountInfo at only one place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Samuel Pitoiset	23ba9a6249	radv: init states from pColorBlendState at only one place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Samuel Pitoiset	1dff2a9204	radv: use AMD values for computing blend related state This will allow to translate the function and factors earlier. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Samuel Pitoiset	d423d1c683	radv: init states from VkPipelineRenderingCreateInfo at only one place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Samuel Pitoiset	6e8e926e1b	radv: init states from pDepthStencilState at only one place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Samuel Pitoiset	8e9b3fabc5	radv: init states from pMultisampleState at only one place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Samuel Pitoiset	1f8db57023	radv: copy viewport/scissor when initializing radv_viewport_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16672>	2022-05-30 12:21:14 +00:00
Marcin Ślusarz	0f46a8fbfe	anv: remove invalid copy/pasted comment Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16773>	2022-05-30 11:46:13 +00:00
Marcin Ślusarz	34b5a717c0	anv: remove redundant code calculating dynamic states mask pipeline->dynamic_states is already set by anv_graphics_pipeline_init since `231651fd89`. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16773>	2022-05-30 11:46:13 +00:00
David Heidelberg	092d03a90e	ci/iris: skqp: remove flaking atlastext for TGL (gl version) gles version of atlastext was already removed due to same behavior Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16772>	2022-05-30 10:50:12 +00:00
Tatsuyuki Ishi	250b6eb908	radv/ci: skip image.sample_texture._compressed_format These tests are flaky due to missing barriers, exposed by `211db6d333` ("radv: Fix redundant subpass barriers due to erroneous comparison"). Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16762>	2022-05-30 09:18:38 +00:00
Tatsuyuki Ishi	6eb5a110dd	radv/ci: Move transient_attachment_bit from fail to skip list These tests are flaky and should not be treated as expected-fail. This also removes the duplicates from the fail list which was breaking CTS runner. Fixes: `cd14431b8c` ("radv/ci: skip dEQP-VK.fragment_operations.transient_attachment_bit") Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16762>	2022-05-30 09:18:38 +00:00
Jesse Natalie	a56d47b0ba	microsoft/compiler: Fixup sampler derefs in tex instrs that don't need samplers Sometimes you can end up with tex instructions that have sampler deref srcs, even though they don't need them, e.g. a txs. In this case, still fix up those derefs in the sampler splitting pass rather than leaving them pointing to a typed sampler. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16639>	2022-05-30 07:28:50 +00:00
Konstantin Seurer	8aecb3ed58	radv: Ignore transformOffset if transformData is 0 There is also a hypothetical scenario where transformData is 0 and transformOffset is not 0 and we end up reading from transformOffset because transform_addr is not 0. VkAccelerationStructureBuildRangeInfoKHR spec: If VkAccelerationStructureGeometryTrianglesDataKHR::transformData is not NULL, a single VkTransformMatrixKHR structure is consumed from VkAccelerationStructureGeometryTrianglesDataKHR::transformData, at an offset of transformOffset. This matrix describes a transformation from the space in which the vertices for all triangles in this geometry are described to the space in which the acceleration structure is defined. Which I think means, that we should ignore transformOffset if transformData is NULL. Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16719>	2022-05-30 00:00:53 +00:00
Konstantin Seurer	9be00573c4	radv: Fix handling of primitiveOffset VkAccelerationStructureBuildRangeInfoKHR spec: If the geometry uses indices, primitiveCount × 3 indices are consumed from VkAccelerationStructureGeometryTrianglesDataKHR::indexData, starting at an offset of primitiveOffset. The value of firstVertex is added to the index values before fetching vertices. If the geometry does not use indices, primitiveCount × 3 vertices are consumed from VkAccelerationStructureGeometryTrianglesDataKHR::vertexData, starting at an offset of primitiveOffset + VkAccelerationStructureGeometryTrianglesDataKHR::vertexStride × firstVertex. Meaning: We always add firstVertex * vertexStride to the vertex address and add primitiveOffset either to the vertex address or the index address, depending on wether indices are used. Also add missing handling with instances. Fixes: `0dad88b` ("radv: Implement device-side BVH building.") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16719>	2022-05-30 00:00:53 +00:00
SureshGuttula	f2e3646321	Revert "radeonsi: Set display_remote for non-refernced frames" This reverts commit `ef76b83633`. Reason for revert: This only helps in using I MBs.To further fix in dpb , reverting this. Fix added : https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16745 Signed-off-by: SureshGuttula <suresh.guttula@amd.com> Reviewed-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16744>	2022-05-29 00:07:47 +00:00
Jesse Natalie	55f9ff03c2	d3d12: Fix forward decl for sw_winsys Closes part of: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4099 Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16760>	2022-05-29 00:00:19 +00:00
SureshGuttula	77a6feff89	radeonsi/vcn : update enc->dpb ref_use for index 0 Currently dpb_enc referneces not updated properly when index 0, as we are skipping clearing that ref. This patch will fix this for index 0. So that when ever we set non_referenced flag, that is not used as ref and not pushed to DPB. This is helping in SVC encoding. Signed-off-by: SureshGuttula <suresh.guttula@amd.com> Reviewed-by: Thong Thai <thong.thai@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16745>	2022-05-28 15:52:53 +00:00
Lionel Landwerlin	09caa8902c	anv: move internal RT shaders to the internal cache Those shaders are just like the blorp ones. v2: Use a single internal cache for blorp/RT (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7f1e82306c` ("anv: Switch to the new common pipeline cache") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16741>	2022-05-28 10:14:03 +00:00
Jason Ekstrand	0eee071038	panvk: Use the vk_buffer base struct Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16607>	2022-05-27 18:39:00 -05:00
Jason Ekstrand	5d0b09be5b	anv: Use the base vk_buffer struct This mostly gets us the vk_buffer_range() helper but may be useful in the future. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16607>	2022-05-27 18:38:57 -05:00

1 2 3 4 5 ...

154630 commits