fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 15:58:06 +02:00

Author	SHA1	Message	Date
Marek Olšák	46cb3bb4d1	ac/debug: add an option to disable colors for printed IBs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12656>	2021-09-01 00:42:58 +00:00
Marek Olšák	0aed2d0cd3	radeonsi: stop using AC_EXP_PARAM_UNDEFINED because it's not useful Just use AC_EXP_PARAM_DEFAULT_VAL_0000 to keep things simple. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12656>	2021-09-01 00:42:57 +00:00
Timur Kristóf	395c0c52c7	ac: Calculate workgroup sizes of HW stages that operate in workgroups. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12321>	2021-08-26 09:46:18 +00:00
Timur Kristóf	5b7446d74c	radv, ac, aco: Use indices 0-2 of gs_vtx_offset argument array on GFX9+. Previously, indices 0, 2, 4 were used. This worked, but it was somewhat unintuitive. This commit changes it to use indices 0, 1, 2 instead, which makes the code easier to understand. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12511>	2021-08-26 05:20:15 +00:00
Marek Olšák	556c10c02c	ac/surface: allow arbitrary swizzle modes for displayable DCC Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12430>	2021-08-20 14:28:36 +00:00
Eric Engestrom	f1eae2f8bb	python: drop python2 support Signed-off-by: Eric Engestrom <eric@engestrom.ch> Acked-by: Jose Fonseca <jfonseca@vmware.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3674>	2021-08-14 21:44:32 +00:00
Michel Zou	e4c0a34bfe	radv: fix build with mingw Cc: 21.2 mesa-stable Reviewed-by: Joshua Ashton <joshua@froggi.es> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Closes #5092 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12178>	2021-08-13 12:13:21 +02:00
Samuel Pitoiset	16793c8efa	ac/surface: implement CmaskAddrFromCoord in NIR on GFX10+ Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12182>	2021-08-05 06:37:09 +00:00
Samuel Pitoiset	1d67fa4d73	ac/surface: add tests for CmaskAddrFromCoord on GFX10+ Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12182>	2021-08-05 06:37:09 +00:00
Timur Kristóf	8918a809ce	ac: Remove deprecated use_late_alloc field as nobody uses it anymore. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11905>	2021-08-04 15:37:05 +00:00
Samuel Pitoiset	a49b397041	ac/surface: implement CmaskAddrFromCoord in NIR It's similar to DCC, only GFX9 is currently supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12140>	2021-08-03 07:02:48 +00:00
Samuel Pitoiset	eedc0b59b7	ac/surface: copy the CMASK equation to radeon_surf Only GFX9 is currently supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12140>	2021-08-03 07:02:48 +00:00
Samuel Pitoiset	1f12c3ccc1	ac/surface: store CMASK pitch and height to radeon_surf Only GFX9+ is currently supported. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12140>	2021-08-03 07:02:48 +00:00
Samuel Pitoiset	132b205566	ac/surface: add tests for CmaskAddrFromCoord prototype outside of addrlib Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12140>	2021-08-03 07:02:48 +00:00
Samuel Pitoiset	501db87779	ac: introduce a structure to store DCC address equations for GFX9 CMASK addr equations will use the same struct. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12140>	2021-08-03 07:02:48 +00:00
Timur Kristóf	a6110f3c3a	ac/nir: Remove unhelpful nir_opt_cse from ac_nir_lower_ngg_nogs. This CSE call adds to our compile time without adding any real benefit to the compiled code. Fossil DB results on Sienna Cichlid (with NGGC on): Totals from 1580 (1.23% of 128647) affected shaders: CodeSize: 4563912 -> 4562312 (-0.04%); split: -0.07%, +0.03% Instrs: 870722 -> 870338 (-0.04%); split: -0.09%, +0.04% Latency: 3349863 -> 3351458 (+0.05%); split: -0.10%, +0.14% InvThroughput: 617796 -> 617971 (+0.03%); split: -0.01%, +0.03% VClause: 22604 -> 22568 (-0.16%); split: -0.75%, +0.59% SClause: 16285 -> 16327 (+0.26%); split: -0.07%, +0.33% Copies: 83472 -> 83599 (+0.15%); split: -0.07%, +0.22% PreSGPRs: 62340 -> 62334 (-0.01%) No Fossil DB changes with NGGC off. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11908>	2021-08-02 11:38:25 +00:00
Timur Kristóf	4b540aef2c	ac/nir: Don't count vertices and primitives in wave after culling. These are not needed anymore, because the EXEC mask doesn't depend on them. Fossil DB results on Sienna Cichlid (with NGGC on): Totals from 58239 (45.27% of 128647) affected shaders: Latency: 138113669 -> 138285372 (+0.12%) InvThroughput: 22404840 -> 22405245 (+0.00%) No Fossil DB changes with NGGC off. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11908>	2021-08-02 11:38:25 +00:00
Timur Kristóf	a2d02c0c11	ac/nir: Use gs_accepted variable after culling. This prevents us from recalculating the EXEC mask later in the shader, and removes the requirement for counting the number of primitives. The stats are better than expected because they also show that some code that is still there is now DCE'd by ACO. Fossil DB results on Sienna Cichlid (with NGGC on): Totals from 58239 (45.27% of 128647) affected shaders: SpillSGPRs: 330 -> 340 (+3.03%) CodeSize: 166356072 -> 162805724 (-2.13%) Instrs: 31920041 -> 31089256 (-2.60%) Latency: 138815742 -> 138113669 (-0.51%); split: -0.54%, +0.03% InvThroughput: 22459553 -> 22404840 (-0.24%); split: -0.26%, +0.02% SClause: 753746 -> 753765 (+0.00%); split: -0.00%, +0.01% Copies: 3226647 -> 3268973 (+1.31%); split: -0.45%, +1.76% Branches: `1223441` -> 1223440 (-0.00%); split: -0.00%, +0.00% PreSGPRs: 2025339 -> 2091013 (+3.24%) No Fossil DB changes with NGGC off. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11908>	2021-08-02 11:38:25 +00:00
Timur Kristóf	8159868699	ac/nir: Use es_accepted variable after culling. This avoids re-calculating the exec mask for ES vertices, and makes it unnecessary to count the number of vertices left. Fossil DB results on Sienna Cichlid (with NGGC on): Totals from 58239 (45.27% of 128647) affected shaders: CodeSize: 166521108 -> 166356072 (-0.10%); split: -0.10%, +0.00% Instrs: 31961308 -> 31920041 (-0.13%); split: -0.13%, +0.00% Latency: 138820463 -> 138815742 (-0.00%); split: -0.04%, +0.04% InvThroughput: 22460177 -> 22459553 (-0.00%); split: -0.00%, +0.00% SClause: 753744 -> 753746 (+0.00%) Copies: 3093140 -> 3226647 (+4.32%); split: -0.03%, +4.34% No Fossil DB changes with NGGC off. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11908>	2021-08-02 11:38:25 +00:00
Timur Kristóf	1bbea90f50	aco, nir, ac: Simplify sequence of getting initial NGG VS edge flags. Instead of v_bfe + v_lshl_or for each vertex, get all 3 edge flags at once of every vertex. This takes fewer VALU instructions than previously. Fossil DB results on Sienna Cichlid (with NGGC on): Totals from 56917 (44.24% of 128647) affected shaders: CodeSize: 161028288 -> 158751628 (-1.41%) Instrs: 30917985 -> 30519571 (-1.29%) Latency: 130617204 -> 129975532 (-0.49%); split: -0.50%, +0.01% InvThroughput: 21280238 -> 20927401 (-1.66%) Copies: 3011120 -> 3011125 (+0.00%); split: -0.00%, +0.00% No Fossil DB changed with NGGC off. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11908>	2021-08-02 11:38:25 +00:00
Timur Kristóf	8341af5109	radv, aco, ac/nir: Tweak position export scheduling for NGG culling. The result is about +5-ish fps in Doom Eternal. It turns out that the location of position exports matters more than we thought, and it's actually better to keep them at the bottom for culling shaders rather than schedule it up to the top. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	0bb543bb60	ac/nir: Reuse uniforms from top part of culling shaders. Uniforms have the same value in all invocations, therefore they can safely be reused by invocations even after repacking. This saves several instructions from culling shaders, mainly UBO loads and such. We exclude uniform floats, because those would harm the VGPR usage of the shaders too much. Fossil DB results on Sienna Cichlid (with NGG culling on): Totals from 55379 (43.05% of 128647) affected shaders: VGPRs: 1926472 -> 1925360 (-0.06%); split: -0.07%, +0.01% SpillSGPRs: 139 -> 330 (+137.41%) CodeSize: 159472988 -> 157462856 (-1.26%); split: -1.27%, +0.00% MaxWaves: 1571492 -> 1571412 (-0.01%) Instrs: 30665685 -> 30302076 (-1.19%); split: -1.21%, +0.02% Latency: 127385148 -> 126723891 (-0.52%); split: -0.55%, +0.03% InvThroughput: 21096298 -> 20773069 (-1.53%); split: -1.53%, +0.00% VClause: 514792 -> 511231 (-0.69%); split: -0.83%, +0.13% SClause: 713959 -> 679556 (-4.82%); split: -4.84%, +0.02% Copies: 2975106 -> 2828185 (-4.94%); split: -5.39%, +0.45% Branches: 1201921 -> 1152766 (-4.09%) PreSGPRs: 1753786 -> 1892848 (+7.93%); split: -0.00%, +7.93% PreVGPRs: 1590522 -> 1583574 (-0.44%); split: -0.44%, +0.00% Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	fc1fabbabf	ac/nir: Analyze culling shaders to remember which inputs are used when. These will be useful for some optimizations. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	faf766b864	ac/nir: Reuse the repacked output positions of culling shaders. The position outputs are stored into LDS and reloaded after repacking, therefore the repacked position values can be reused in the bottom part of the shader. Fossil DB results on Sienna Cichlid (with NGG culling on): Totals from 9016 (7.01% of 128647) affected shaders: VGPRs: 372472 -> 347560 (-6.69%); split: -6.82%, +0.13% SpillSGPRs: 437 -> 87 (-80.09%) CodeSize: 32359340 -> 30441692 (-5.93%); split: -5.93%, +0.00% MaxWaves: 222030 -> 238970 (+7.63%); split: +7.83%, -0.20% Instrs: 6207833 -> 5834149 (-6.02%); split: -6.02%, +0.00% Latency: 27626263 -> 27890632 (+0.96%); split: -5.34%, +6.29% InvThroughput: 4792958 -> 4361336 (-9.01%); split: -9.01%, +0.00% VClause: 144385 -> 139586 (-3.32%); split: -9.29%, +5.97% SClause: 141350 -> 129875 (-8.12%); split: -8.57%, +0.45% Copies: 580017 -> 568916 (-1.91%); split: -3.60%, +1.68% Branches: 209067 -> 209154 (+0.04%); split: -0.24%, +0.28% PreSGPRs: 281320 -> 277814 (-1.25%) PreVGPRs: 290040 -> 273861 (-5.58%) Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	e97f0463a8	ac/nir: Implement NGG deferred attribute culling in NIR. Culling is traditionally done by the rasterizer, but that can be a bottleneck when an app creates a large number of primitives. Eg. a lot of tiny triangles reduce the rasterziation efficiency. NGG makes it possible for the shader to check primitives and delete those that it can prove are not needed. After this is done, we have to repack the surviving invocations so they remain compact. This also saves bandwidth, because some memory loads are only executed by those vertices that survived the culling. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	556a690bac	ac/nir: Use a ballot that matches the wave size during NGG lowering. This generates slightly more efficient code in Wave32 mode. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	651a3da1b5	ac/nir: Add a NIR port of ac_llvm_cull. The algorithms were originally implemented by Marek Olšák, hence the copyright to AMD. This commit just ports the LLVM based implementation to NIR, using the new intrinsics added earlier. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Samuel Pitoiset	29f264f258	ac,radv: implement the cs_regalloc_hang HW bug workaround Might fix spurious failures on GFX6 and some GFX7 chips. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11675>	2021-07-09 13:37:37 +00:00
Marek Olšák	b2397c394d	ac,radeonsi: move late alloc computation into common code and shader states This also fixes a rare deadlock when a scratch buffer is used. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11754>	2021-07-08 18:37:41 +00:00
Marek Olšák	c4644bf3e6	ac/surface/tests: fix the ARM build Fixes: `8771d45a` "ac/surface/tests: fix a random segfault in the modifier test" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4655 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11754>	2021-07-08 18:37:41 +00:00
Dave Airlie	e5d158881b	ac: fix win32 build Fixes: `e2e9dd44f4` ("ac/surface: Handle non-retiled displayable DCC correctly for modifiers.") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11725>	2021-07-06 14:49:24 +00:00
Bas Nieuwenhuizen	e2e9dd44f4	ac/surface: Handle non-retiled displayable DCC correctly for modifiers. There is some hardware with num_render_backends == 1, but the number of render backends in GB_ADDR_CFG > 1. Turns out this can be turned off by making them rb unaligned which is valid with only 1 render backend. Fixes: `0833dd7d12` ("amd/common: Add support for modifiers.") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10575>	2021-07-05 22:34:13 +00:00
Pierre-Eric Pelloux-Prayer	c564841fae	ac/surface: don't print stencil info if tex has no stencil color/zs are stored in a union so testing for zs.stencil_offset isn't the correct way to test for stencil. Fixes: `988f148db3` ("ac/surface: overlap color and Z/S fields using a union in gfx9_surf_layout") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11625>	2021-06-29 23:38:21 +02:00
Marek Olšák	86355b5984	ac/gpu_info: adjust the condition for use_late_alloc Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11509>	2021-06-23 22:37:32 -04:00
Samuel Pitoiset	8f9368ddb7	ac/perfcounters: add a GPU block ID to every block definitions The enumeration comes from AMDVLK. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11186>	2021-06-22 06:38:55 +00:00
Samuel Pitoiset	5a8776fd8c	ac/perfcounters: add more SPM configuration fields Add the number of SPM wires because sometimes a block has eg. 2 counters but only holds 3 16-bit counters instead of 4. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11186>	2021-06-22 06:38:55 +00:00
Samuel Pitoiset	3d8d6ebcb0	ac/perfcounters: rename num_multi to num_spm_counters Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11186>	2021-06-22 06:38:54 +00:00
Samuel Pitoiset	da94772510	ac/perfcounters,radeonsi: rework performance counters layout Instead of having different layouts which might complicate things when some registers are missing, hardcode the SELECT and SELECT1 registers into separate arrays. The SELECT registers are "legacy" counters, while the SELECT1 registers are SPM counters. This is more verbose and emit more UCONFIG registers, but emitting the SELECT registers is now much simpler and it seems less error prone. This will also help emitting the SPM configuration. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11186>	2021-06-22 06:38:54 +00:00
Samuel Pitoiset	66a34be6ac	ac/perfcounters: remove ac_pc_block_base::num_prelude This seems unnecessary if the first select register is correctly set. This CB filter was always disabled anyways. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11186>	2021-06-22 06:38:54 +00:00
Timur Kristóf	72174a3eef	ac/nir: Update TCS output barriers with nir_var_mem_shared. Output loads and stores are lowered to shared memory access, so we have to update the barriers to also reflect this. Closes: #4955 Fixes: `bf966d1c1d` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11484>	2021-06-21 08:27:14 +00:00
Marek Olšák	61a845ca19	ac/surface: don't set DCC_PIPE_ALIGN modifier bit for gfx10 with 1 RB Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11486>	2021-06-20 01:22:01 -04:00
Marek Olšák	2acd34f266	ac/surface/tests: fix RB counts The real number of RBs can be less than what GB_ADDR_CONFIG contains. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11486>	2021-06-20 01:22:01 -04:00
Georg Lehmann	d3f735a249	ac: Enable 32bit predication on gfx9 with fw feature version 52. Amdvlk does this as well and it passes the vulkan CTS on renoir. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11297>	2021-06-11 06:07:10 +00:00
Georg Lehmann	fc437ef944	ac: Enable 32bit predication on gfx10. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11297>	2021-06-11 06:07:10 +00:00
Georg Lehmann	a41ba20cbd	ac: Check me_fw_feature for 32bit predication on gfx10.3 Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11297>	2021-06-11 06:07:10 +00:00
Timur Kristóf	1e49018ced	amd: Add extra source to the mbcnt_amd NIR intrinsic. The v_mbcnt instructions can take an extra source that they add to the result. This is not exposed in SPIR-V but we now expose it in NIR. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11072>	2021-06-09 16:48:51 +00:00
Timur Kristóf	f6b2db298f	ac/nir: Refactor and optimize the repacking sequence. According to feedback, the terminology with "exclusive scan" and "reduction" is difficult. Change it to use "repack" instead, which better fits what this sequence is actually used for. The new sequence stores only 1 byte / wave to LDS, and uses packed instructions to produce the results. This has lower latency and fewer instructions than what we previously had. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11072>	2021-06-09 16:48:51 +00:00
Leo Liu	43c04ab2b4	radeonsi: separate video hw info based on HW engine individually This removes previous "has_hw_decode" and "uvd_enc_supported" and makes information more accuate for cases where HW decode, HW encode, and HW JPEG decode might partially available. Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: James Zhu <James.Zhu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11201>	2021-06-08 09:32:48 -04:00
Samuel Pitoiset	9f7e63e12a	ac/debug: fix color printing PKT3 when count in header is too low Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11211>	2021-06-08 11:19:00 +02:00
Samuel Pitoiset	aff92f50c6	ac: add ac_thread_trace::data Instead of passing two different structs to ac_dump_rgp_capture(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11156>	2021-06-03 15:39:34 +00:00

1 2 3 4 5 ...

1710 commits