fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 09:08:07 +02:00

Author	SHA1	Message	Date
Pierre-Eric Pelloux-Prayer	3914bd457b	amd/registers: fix fields conflict detection The existing code handled the case where the new definition of the same field was larger than the old one. This commit adds a check to handle the reverse case: the new def is smaller than the old one (= so writing using the merged macro would affect the next fields). The affected fields are: * LGKM_CNT (in SQ_WAVE_IB_STS) * DONUT_SPLIT (in VGT_TESS_DISTRIBUTION) * HEAD_QUEUE (in GDS_GWS_RESOURCE) DONUT_SPLIT is the only one used by radeonsi/radv. Fixes: `e6184b0892` ("amd/registers: scripts for processing register descriptions in JSON") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12063>	2021-07-30 08:50:38 +00:00
Alejandro Piñeiro	476dc3c050	vulkan: add vk_spec_info_to_nir_spirv util method All Vulkan drivers have been copying anv's code to convert VkSpecializationInfo into nir_spirv_specialization. Recently there was a Vulkan spec change on allowed values for VkSpecializationInfo, and all drivers were affected. This commit creates a new helper and uses it in all Vulkan Mesa drivers. v2: use (uint8_t) casts, instead of void, to avoid C2036 with MSVC (detected by the CI, inspired by what radv was doing) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12047>	2021-07-29 03:28:52 +00:00
Samuel Pitoiset	72f55cf7c4	radv: implement VK_EXT_shader_atomic_float2 Some floating atomic instructions are not available on GFX8-9. No LLVM support. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12060>	2021-07-27 08:44:36 +02:00
Samuel Pitoiset	6694c37ea0	aco: implement VK_EXT_shader_atomic_float2 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12060>	2021-07-27 08:44:31 +02:00
Samuel Pitoiset	0497588eac	radv: allow unused VkSpecializationMapEntries Fixes future CTS: dEQP-VK.pipeline.spec_constant..basic.unused_* Cc: 21.2 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12062>	2021-07-26 14:09:09 +02:00
Georg Lehmann	890b1c0f2a	aco: Use cpp_msvc_compat_args. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11610>	2021-07-23 20:28:58 +00:00
Georg Lehmann	c6bcafcc07	radv: Use c_msvc_compat_args. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11610>	2021-07-23 20:28:58 +00:00
Jason Ekstrand	a7b429e8ec	amd: Don't handle nir_tex_src_ms_mcs It's an intel-specific texture source and will never be seen on AMD. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11775>	2021-07-23 15:53:57 +00:00
Jason Ekstrand	e83fe65cd8	radv,radeonsi: Do cube size divide-by-6 lowering in NIR No point in carrying all this code around twice each in two back-ends. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12005>	2021-07-22 14:22:35 -05:00
Rhys Perry	211d1dfd34	aco: don't create v_madmk_f32/v_madak_f32 from v_fma_legacy_f16 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5105 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12004>	2021-07-22 15:43:31 +00:00
Daniel Stone	d8bfad70dc	Revert "CI: Disable LAVA devices" This reverts commit `1f4ff4ed2e`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12016>	2021-07-22 08:42:40 +01:00
Daniel Stone	7b8bb81e82	CI: Disable LAVA devices We've had a physical machine death, and the restore/transfer is achingly slow at the moment. Some of the devices are still fine, but conservatively just kill the lot until it's all recovered. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11997>	2021-07-21 17:26:43 +01:00
Daniel Schürmann	1d8e9430d2	aco: include <cstddef> in aco_util.h It's needed for ptrdiff. Fixes: `59fdaa1985` ('aco: reorder and cleanup #includes') Closes: #5076 Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11947>	2021-07-21 13:37:00 +00:00
Daniel Schürmann	3870c52159	aco/ra: don't allocate vector space for MIMG NSA operands In this case, the MIMG vaddr components are not vector-aligned anymore, anyway. Totals from 11866 (7.90% of 150170) affected shaders: (GFX10.3) VGPRs: 733064 -> 728408 (-0.64%); split: -0.66%, +0.02% CodeSize: 67968356 -> 67968440 (+0.00%); split: -0.02%, +0.02% MaxWaves: 214022 -> 214014 (-0.00%) Instrs: 12798200 -> 12797232 (-0.01%); split: -0.02%, +0.01% Latency: 196427665 -> 196418706 (-0.00%); split: -0.02%, +0.01% InvThroughput: 37082037 -> 37080799 (-0.00%); split: -0.02%, +0.02% VClause: 246097 -> 246031 (-0.03%); split: -0.16%, +0.13% Copies: 494852 -> 493923 (-0.19%); split: -0.52%, +0.34% Branches: 220323 -> 220294 (-0.01%); split: -0.03%, +0.02% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11980>	2021-07-21 09:38:15 +00:00
Daniel Schürmann	9b1a296172	aco/optimizer: ensure to not erase high bits when propagating packed constants Packed constants with non-zero values in the high half might have been propagated as 16 bit, dropping the high half. Cc: mesa-stable Closes: #5070 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11954>	2021-07-20 07:48:39 +00:00
Timur Kristóf	55d57b828f	aco: Fix how p_elect interacts with optimizations. Since p_elect doesn't have any operands, ACO's value numbering and/or the pre-RA optimizer could currently recognize two p_elect instructions in two different blocks as the same. This patch adds exec as an operand to p_elect in order to achieve correct behavior. Fixes: `e66f54e5c8` Closes: #5080 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11943>	2021-07-18 00:48:06 +02:00
Timur Kristóf	6e17931d21	radv: Use pre-computed viewport transform for NGG culling state. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11839>	2021-07-16 15:22:46 +00:00
Mike Blumenkrantz	c9a478f1cd	radv: remove unused variable from radv_emit_viewport Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11839>	2021-07-16 15:22:46 +00:00
Mike Blumenkrantz	a2ef92d7a5	radv: pre-calculate viewport transforms this requires more storage in the viewport struct, but it avoids the need to repeatedly calculate the same transform if e.g., a meta operation occurs, which can save about 5% cpu in some cases Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11839>	2021-07-16 15:22:46 +00:00
Mike Blumenkrantz	1e13cb1965	radv: merge si_write_viewport into radv_emit_viewport Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11839>	2021-07-16 15:22:46 +00:00
Timur Kristóf	60c5abf685	aco: Remove s_and with exec when all lanes are active. This helps NGG GS and culling shaders. No Fossil DB changes without NGG culling. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11458>	2021-07-16 14:31:54 +00:00
Timur Kristóf	e66f54e5c8	aco: Allow elect to take advantage of knowing when all lanes are active. Implement elect using a pseudo-op which is lowered during the insert_exec_mask pass. This makes it possible to emit a more optimal sequence when the exec mask is constant. Fossil DB results on Sienna Cichlid: Totals from 211 (0.16% of 128647) affected shaders: CodeSize: 2254356 -> 2240468 (-0.62%); split: -0.62%, +0.00% Instrs: 438471 -> 434996 (-0.79%); split: -0.80%, +0.01% Latency: 2717082 -> 2709400 (-0.28%); split: -0.28%, +0.00% InvThroughput: 566987 -> 566342 (-0.11%); split: -0.11%, +0.00% Copies: 40058 -> 40162 (+0.26%) Branches: 31209 -> 31211 (+0.01%) PreSGPRs: 9927 -> 10125 (+1.99%) Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11458>	2021-07-16 14:31:54 +00:00
Timur Kristóf	b12318f26c	aco: Swap s_and operand order for ballot. This allows our optimizer to recognize this and eliminate it when it can prove that the s_and with exec is unneeded. Fossil DB changes on Sienna Cichlid: Totals from 1969 (1.53% of 128647) affected shaders: CodeSize: 9468228 -> 9469348 (+0.01%); split: -0.00%, +0.01% Instrs: 1773566 -> 1773581 (+0.00%); split: -0.01%, +0.01% Latency: 19504042 -> 19503385 (-0.00%); split: -0.00%, +0.00% InvThroughput: 3617406 -> 3617333 (-0.00%) Copies: 108998 -> 110592 (+1.46%) Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11458>	2021-07-16 14:31:54 +00:00
Timur Kristóf	d07e5bde75	radv: Remove num_viewports from radv_skip_ngg_culling. NGG culling is not compiled into shaders that can use multiple viewports, so it's not necessary to check it here. Fixes: `9a95f5487f` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11910>	2021-07-16 13:00:36 +00:00
Timur Kristóf	aa24740370	radv: Don't compile NGG culling into shaders that write viewport index. We don't support NGG culling with multiple viewports yet. Fixes: `f30e4351de` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11910>	2021-07-16 13:00:36 +00:00
Samuel Pitoiset	0b637919a8	radv: fix specifying the stencil layout for separate depth/stencil layouts The Vulkan spec was updated again a few months ago, and pNext is always honored if present. Found this with vkd3d-proton which implemented separate depth/stencil layouts recently. Cc: 21.2 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11903>	2021-07-16 09:20:58 +02:00
Samuel Pitoiset	cadf2d63b7	radv: report APUs as discrete GPUs for Red Dead Redemption 2 On APUs, we fake heaps to simulate a dGPU setup because it seems to have the maximum compatibility. However, some applications such as RDR2 still only look at GTT if the driver reports an iGPU, which means they will only use 1/3rd of the total memory available. This is currently behind a drirc option because it might have implications for other apps, but we might want to extend this later if everything is fine. Cc: 21.2 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11891>	2021-07-16 07:01:45 +00:00
Samuel Pitoiset	7a1cc56e40	radv: fix bounds checking for zero vertex stride on GFX6-7 GFX6 and GFX10+ have similar logic. This fixes test_zero_vertex_stride from vkd3d-proton on Pitcairn (GFX6) and on Bonaire (GFX7). Cc: 21.2 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11904>	2021-07-16 08:30:07 +02:00
Timur Kristóf	19c8283729	radv: Use 128-sized vertex grouping for NGG shaders. This matches what RadeonSI also does. It seems to improve performance especially with NGG culling shaders. Eg. in Doom Eternal this gives me +5ish fps. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11810>	2021-07-15 16:13:04 +00:00
Samuel Pitoiset	1ea156f44c	radv: only init the TC-compat ZRANGE metadata for the depth aspect With separate depth/stencil layouts, if the depth aspect is first initialized and then cleared, the ZRANGE_PRECISION metadata might be different than 0. Initializing it again for the stencil aspect will overwrite the value. Fixes rendering glitches with Scarlet Nexus on GFX8-9. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5052 Cc: 21.1 21.2 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11883>	2021-07-15 07:21:50 +00:00
Daniel Schürmann	71aab9607d	aco/live_var_analysis: change worklist to a single integer Reduces overall compile times by ~0.45%. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11879>	2021-07-14 18:10:56 +02:00
Daniel Schürmann	20eaa074ec	aco/insert_waitcnt: Remove many unnecessary wait_imm.combine() Reduces overall compile times by ~0.2%. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11879>	2021-07-14 18:10:50 +02:00
Daniel Schürmann	114d38e57d	aco/isel: avoid unnecessary calls to nir_unsigned_upper_bound() These were responsible for ~20% of the time spent in instruction selection. Reduces overall compile times by ~0.5%. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11879>	2021-07-14 18:10:40 +02:00
Timur Kristóf	8341af5109	radv, aco, ac/nir: Tweak position export scheduling for NGG culling. The result is about +5-ish fps in Doom Eternal. It turns out that the location of position exports matters more than we thought, and it's actually better to keep them at the bottom for culling shaders rather than schedule it up to the top. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	0bb543bb60	ac/nir: Reuse uniforms from top part of culling shaders. Uniforms have the same value in all invocations, therefore they can safely be reused by invocations even after repacking. This saves several instructions from culling shaders, mainly UBO loads and such. We exclude uniform floats, because those would harm the VGPR usage of the shaders too much. Fossil DB results on Sienna Cichlid (with NGG culling on): Totals from 55379 (43.05% of 128647) affected shaders: VGPRs: 1926472 -> 1925360 (-0.06%); split: -0.07%, +0.01% SpillSGPRs: 139 -> 330 (+137.41%) CodeSize: 159472988 -> 157462856 (-1.26%); split: -1.27%, +0.00% MaxWaves: 1571492 -> 1571412 (-0.01%) Instrs: 30665685 -> 30302076 (-1.19%); split: -1.21%, +0.02% Latency: 127385148 -> 126723891 (-0.52%); split: -0.55%, +0.03% InvThroughput: 21096298 -> 20773069 (-1.53%); split: -1.53%, +0.00% VClause: 514792 -> 511231 (-0.69%); split: -0.83%, +0.13% SClause: 713959 -> 679556 (-4.82%); split: -4.84%, +0.02% Copies: 2975106 -> 2828185 (-4.94%); split: -5.39%, +0.45% Branches: 1201921 -> 1152766 (-4.09%) PreSGPRs: 1753786 -> 1892848 (+7.93%); split: -0.00%, +7.93% PreVGPRs: 1590522 -> 1583574 (-0.44%); split: -0.44%, +0.00% Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	fc1fabbabf	ac/nir: Analyze culling shaders to remember which inputs are used when. These will be useful for some optimizations. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	faf766b864	ac/nir: Reuse the repacked output positions of culling shaders. The position outputs are stored into LDS and reloaded after repacking, therefore the repacked position values can be reused in the bottom part of the shader. Fossil DB results on Sienna Cichlid (with NGG culling on): Totals from 9016 (7.01% of 128647) affected shaders: VGPRs: 372472 -> 347560 (-6.69%); split: -6.82%, +0.13% SpillSGPRs: 437 -> 87 (-80.09%) CodeSize: 32359340 -> 30441692 (-5.93%); split: -5.93%, +0.00% MaxWaves: 222030 -> 238970 (+7.63%); split: +7.83%, -0.20% Instrs: 6207833 -> 5834149 (-6.02%); split: -6.02%, +0.00% Latency: 27626263 -> 27890632 (+0.96%); split: -5.34%, +6.29% InvThroughput: 4792958 -> 4361336 (-9.01%); split: -9.01%, +0.00% VClause: 144385 -> 139586 (-3.32%); split: -9.29%, +5.97% SClause: 141350 -> 129875 (-8.12%); split: -8.57%, +0.45% Copies: 580017 -> 568916 (-1.91%); split: -3.60%, +1.68% Branches: 209067 -> 209154 (+0.04%); split: -0.24%, +0.28% PreSGPRs: 281320 -> 277814 (-1.25%) PreVGPRs: 290040 -> 273861 (-5.58%) Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	d18920e03a	radv: Run algebraic optimizations before NGG lowering. This makes culling shaders more efficient because they split the shader in two parts. It is better to optimize before this split happens. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	f30e4351de	radv: Support NGG culling with new perftest environment variable. Currently we don't enable it on any chip by default, but we plan to enable it soon on GFX10.3 when we are comfortable with its performance. RADV_PERFTEST=nggc environment variable enables it on GFX10+ GPUs. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	182d9b1e60	aco: Implement NGG culling related intrinsics. These are very straightforward as they just copy data from the newly added shader arguments. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	9a95f5487f	radv: New shader args for NGG culling settings and viewport. Add new shader arguments in RADV for: - NGG culling settings - Viewport transform These will be used by NGG culling shaders. Additionally, some tweaks are made to some config registers in order to make culling shaders more efficient. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	ed163a44b6	radv: Expose radv_get_viewport_xform in radv_private.h We need to emit viewport transform information for culling shaders. This is used for small primitive culling. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	e97f0463a8	ac/nir: Implement NGG deferred attribute culling in NIR. Culling is traditionally done by the rasterizer, but that can be a bottleneck when an app creates a large number of primitives. E.g. a lot of tiny triangles reduce the rasterization efficiency. NGG makes it possible for the shader to check primitives and delete those that it can prove are not needed. After this is done, we have to repack the surviving invocations so they remain compact. This also saves bandwidth, because some memory loads are only executed by those vertices that survived the culling. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	556a690bac	ac/nir: Use a ballot that matches the wave size during NGG lowering. This generates slightly more efficient code in Wave32 mode. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Timur Kristóf	651a3da1b5	ac/nir: Add a NIR port of ac_llvm_cull. The algorithms were originally implemented by Marek Olšák, hence the copyright to AMD. This commit just ports the LLVM based implementation to NIR, using the new intrinsics added earlier. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10525>	2021-07-13 23:56:33 +00:00
Tony Wasserka	f438cbc23e	aco: Remove deprecated Operand constructors Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11653>	2021-07-13 17:43:26 +00:00
Tony Wasserka	cfd866ed42	aco: Clean up unneeded literal casts These were only needed to select the appropriate Operand constructor before. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11653>	2021-07-13 17:43:26 +00:00
Tony Wasserka	66e51dc474	aco: Remove use of deprecated Operand constructors This migration was done with libclang-based automatic tooling, which performed these replacements: * Operand(uint8_t) -> Operand::c8 * Operand(uint16_t) -> Operand::c16 * Operand(uint32_t, false) -> Operand::c32 * Operand(uint32_t, bool) -> Operand::c32_or_c64 * Operand(uint64_t) -> Operand::c64 * Operand(0) -> Operand::zero(num_bytes) Casts that were previously used for constructor selection have automatically been removed (e.g. Operand((uint16_t)1) -> Operand::c16(1)). Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11653>	2021-07-13 17:43:26 +00:00
Tony Wasserka	76554419b3	aco: Remove use of deprecated Operand constructors in aco_builder.h Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11653>	2021-07-13 17:43:26 +00:00
Tony Wasserka	4e33688f23	aco: Remove use of deprecated Operand constructors in test_to_hw_instr.cpp Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11653>	2021-07-13 17:43:26 +00:00
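
Commit 3914bd457b in the list above describes a field conflict that arises when two definitions of the same register field have different widths, so that a merged macro would write into neighbouring fields. The following is a minimal sketch of that idea, not the code from the Mesa register scripts: the function names, the layout representation (a dict mapping field name to an inclusive bit range), and the field names `SPLIT`, `MODE` and `NEXT` are all hypothetical and chosen only for illustration.

```python
from typing import Dict, Tuple

# Hypothetical representation: field name -> (lowest bit, highest bit), inclusive.
Layout = Dict[str, Tuple[int, int]]


def bit_mask(low: int, high: int) -> int:
    """Return an integer with bits low..high (inclusive) set."""
    return ((1 << (high - low + 1)) - 1) << low


def merged_field_conflicts(name: str, old: Layout, new: Layout) -> bool:
    """Check whether one merged macro for `name` would touch other fields.

    The merged macro is assumed to cover the union of the old and new bit
    ranges. That union is checked against every other field in both layouts,
    so the test is symmetric: it catches a new definition that is wider
    than the old one and also one that is narrower.
    """
    merged = bit_mask(*old[name]) | bit_mask(*new[name])
    for layout in (old, new):
        for other, bits in layout.items():
            if other != name and merged & bit_mask(*bits):
                return True
    return False


# New definition is narrower: the old 8-bit mask would spill into NEXT.
old_layout = {"MODE": (0, 7), "SPLIT": (8, 15)}
new_layout = {"MODE": (0, 7), "SPLIT": (8, 12), "NEXT": (13, 15)}
narrower_conflicts = merged_field_conflicts("SPLIT", old_layout, new_layout)

# New definition is wider, but the extra bits are unused in the old layout.
old_small = {"MODE": (0, 7), "SPLIT": (8, 12)}
new_wide = {"MODE": (0, 7), "SPLIT": (8, 15)}
wider_conflicts = merged_field_conflicts("SPLIT", old_small, new_wide)
```

In this sketch `narrower_conflicts` is true because the merged mask overlaps `NEXT`, while `wider_conflicts` is false because no other field occupies the extra bits. How the real scripts report or resolve such a conflict is not shown in the log and is not modelled here.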


7623 commits