fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 00:18:09 +02:00

Author	SHA1	Message	Date
Bas Nieuwenhuizen	ccf0a69e05	radv: Make the number of internal nodes be written on the GPU. Opens the door of algorithms with a variable number of nodes. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19292>	2022-10-30 19:48:46 +00:00
Bas Nieuwenhuizen	0e23df959e	radv: Add BVH IR header. To include GPU state passed between stages but not in a node. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19292>	2022-10-30 19:48:46 +00:00
Friedrich Vock	37525c11d1	radv: Rename emulated float helpers Use only conversion functions now. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19292>	2022-10-30 19:48:46 +00:00
Marek Olšák	0ac37b595a	nir: add nir_intrinsic_optimization_barrier_vgpr_amd for LLVM We need this for the MSAA resolve shader. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19243>	2022-10-29 18:38:33 +00:00
Rhys Perry	7fa50ced14	aco: insert waitcnt before/after ds_ordered_count The LLVM backend does this when lowering ordered_xfb_counter_add_amd. I guess there is some missing dependency checking or something. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19345>	2022-10-28 21:50:05 +00:00
Rhys Perry	ea8ddf5c26	aco: add storage_gds Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19345>	2022-10-28 21:50:05 +00:00
Samuel Pitoiset	eae2867122	radv: move nir_opt_idiv_const/nir_lower_idiv after NGG lowering NGG streamout lowering creates some idiv instructions that need to be lowered. No fossil-db results because it's currently broken. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19364>	2022-10-28 18:19:57 +00:00
Samuel Pitoiset	e2fcbd4a37	radv/llvm: fix dual source blending on GFX11 Untested but this should be similar to RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19367>	2022-10-28 17:03:37 +00:00
Samuel Pitoiset	d172fc1fca	radv: fix VRS limit when attachmentFragmentShadingRate is disabled Can be reproduced on GFX10.3 with RADV_DEBUG=nohiz. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19374>	2022-10-28 16:30:29 +00:00
Samuel Pitoiset	c41997f29f	radv: fix suspending/resuming pipeline statistics queries with GDS This probably doesn't fix anything in practice because GDS is only used for the number of generated primitives by GS and meta operations don't use GS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19348>	2022-10-28 08:12:30 +00:00
Samuel Pitoiset	cf687e88ce	ac/nir/ngg: fix emitting streamout output by using packed location In RadeonSI, they are packed but not in RADV, so don't rely on driver locations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19365>	2022-10-28 07:49:32 +00:00
Alyssa Rosenzweig	941c37c085	nir/lower_idiv: Remove imprecise_32bit_lowering NIR has two implementations of lower_idiv, keyed on the imprecise_32bit_lowering flag. This flag is misleading: the results when setting this flag "imprecise", they're completely wrong for some values. If a backend has a native implementation of umul_high, the correct path isn't that much more expensive. If it doesn't, it's substantially slower for highp integer divison... but in practice, non-constant highp integer division is pretty rare. After a painful migration of the tree, this code path has no more users. Remove it so nobody else gets the bright idea of using it again. Closes: #6555 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19303>	2022-10-27 19:37:14 +00:00
Rhys Perry	93fb84237f	ac/nir: add ac_nir_lower_ngg_options These signatures were getting ridiculous. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19340>	2022-10-27 13:31:40 +00:00
Rhys Perry	21a319851a	ac/nir: micro-optimize boolean expression Ignoring SCC spilling, the old version is probably faster because this mixes uniform and divergent booleans. fossil-db (navi21): Totals from 61167 (45.10% of 135636) affected shaders: Instrs: 29961899 -> 29932551 (-0.10%) CodeSize: 157407028 -> 157289636 (-0.07%) Latency: 139671953 -> 139625186 (-0.03%); split: -0.03%, +0.00% InvThroughput: 21221097 -> 21220756 (-0.00%) SClause: 750438 -> 750439 (+0.00%) Copies: 2672846 -> 2582332 (-3.39%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19340>	2022-10-27 13:31:40 +00:00
Daniel Schürmann	c80137fcba	radv/rt: overwrite hit args with undef in case of a miss This helps some variable coalescing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19188>	2022-10-27 09:45:39 +00:00
Daniel Schürmann	f4270b7659	radv/rt: create traversal shader independent from main shader Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19188>	2022-10-27 09:45:39 +00:00
Iago Toral Quiroga	9deef4cde6	vulkan/runtime: include robustness info when hashing a shader stage Suggested by Jason Ekstrand. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18883>	2022-10-27 08:17:11 +00:00
Qiang Yu	bfb6a5fef1	ac/nir/ngg: add one odd dword to nogs culling pervertex lds radeonsi use like this. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Qiang Yu	13fb7f8f2c	ac/nir/ngg,ac/llvm,aco: save nogs ngg culling one lds dword TES rel patch id is <256, so we can use an existing unused LDS byte instead of extra dword. To ease the programing, change the index of repacked_arg_vars for these variables. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Qiang Yu	66d1fa9666	ac/nir/ngg: save and restore no_varying/no_sysval_output These are used by radeonsi for param export count, should be saved and restore. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Qiang Yu	b197dd0d15	ac/nir/ngg: allow passthrough with vs primitive id output vertex primtive id and passthrough are not exclusive, just need to get correct vertex index when passthrough. radeonsi won't disable passthrough when vs primitive id output, this is also for fixing the crash of the assertion. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Qiang Yu	e536d0fe4b	ac/nir/ngg,radv: move LDS layout calculation out of nir ngg lowering Use lds base load intrinsics in nir ngg lowering to get layout, left its calulation to driver. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Qiang Yu	54eea0e393	ac/nir/ngg: pass primitive_id_location as param for nogs lower radeonsi need to use packed driver location for all outputs, while radv need to use VARYING_SLOT_*. To meet both drivers' needs. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Qiang Yu	d82b668bc6	ac/nir/ngg: support user edge flags for ngg lower Pack user edge flag into arg code is ported from radeonsi gfx10_ngg_build_export_prim. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Qiang Yu	238eeeacb2	ac/llvm: get back intrinsics used by NGG Will be used by radeonsi. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Lionel Landwerlin	53a0804146	radv: tweak lower_shader_calls parameters On Q2RTX shaders : MaxWaves: 62 -> 69 (+11.29%) Instrs: 41626 -> 41575 (-0.12%); split: -0.27%, +0.15% CodeSize: 224960 -> 223740 (-0.54%); split: -0.62%, +0.08% VGPRs: 800 -> 704 (-12.00%) Scratch: 75776 -> 70656 (-6.76%) Latency: 922219 -> 977997 (+6.05%) InvThroughput: 212154 -> 201746 (-4.91%); split: -5.54%, +0.64% VClause: 1120 -> 1155 (+3.12%); split: -1.88%, +5.00% SClause: 1148 -> 1144 (-0.35%); split: -0.70%, +0.35% Copies: 5840 -> 5788 (-0.89%); split: -0.94%, +0.05% PreVGPRs: 753 -> 651 (-13.55%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16556>	2022-10-26 12:53:26 +00:00
Lionel Landwerlin	1d10d17817	nir/lower_shader_calls: add an option structure for future optimizations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16556>	2022-10-26 12:53:25 +00:00
Samuel Pitoiset	db573f7362	aco: add support for device clock on GFX11 According to LLVM, s_sendmsg_rtn(GET_REALTIME) should be used instead of s_memrealtime. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19267>	2022-10-25 20:23:08 +02:00
Samuel Pitoiset	c481978ac2	aco: split the sendmsg enumeration into sendmsg_rtn Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19267>	2022-10-25 20:23:07 +02:00
Samuel Pitoiset	6630b6e2aa	aco: add support for s_sendmsg_rtn_b{32,64} Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19267>	2022-10-25 20:23:05 +02:00
Samuel Pitoiset	3a3df9acda	ac/llvm: add support for device clock on GFX11 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19267>	2022-10-25 20:22:48 +02:00
Rhys Perry	1c005e72f4	ac/nir: add legacy streamout and GS copy shader helpers Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19302>	2022-10-25 17:35:08 +00:00
Rhys Perry	382831c986	radv,nir: add intrinsics for streamout and GS copy shaders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19302>	2022-10-25 17:35:08 +00:00
Qiang Yu	cf74cf3901	radeonsi: implement nir shader query enabled intrinsics Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17457>	2022-10-25 12:58:43 +00:00
Qiang Yu	540eafada1	ac/nir/ngg: add streamout emitted primitive query For radeonsi to implement GL_TRANSFORM_FEEDBACK_PRIMITIVES_WRITTEN. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17457>	2022-10-25 12:58:43 +00:00
Qiang Yu	188a7f9226	ac/nir/ngg: add query param to ac_nir_lower_ngg_gs radeonsi may disable it. gfx_level will also be used by latter vertex param export when gfx11. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17457>	2022-10-25 12:58:43 +00:00
Qiang Yu	a119a6464f	nir,ac,radv: add primitive count add intrinsics radeonsi use shader buffer, but radv use gds for the query result storage. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17457>	2022-10-25 12:58:43 +00:00
Samuel Pitoiset	e18f76d890	radv: disable dual source blending in more situations According to PAL, there is more restrictions that RADV doesn't have. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19278>	2022-10-25 12:22:34 +00:00
Pierre-Eric Pelloux-Prayer	8034a71430	radeonsi/sqtt: re-export shaders in a single bo RGP expects a pipeline's shaders to be all stored sequentially, eg: [vs][ps][gs] As such, it assumes a single bo is dumped to the .rgp file, with the following info: * va of the bo * offset to each shader inside the bo For radeonsi, the shaders are stored individually, so we may have a big gap between the shaders forming a pipeline => we can produce very large file because the layout in the file must match the one in memory (see the warning in ac_rgp_file_write_elf_text). This commit implements a workaround: gfx shaders are re-exported as a pipeline. To update the shader address, a new state is created (sqtt_pipeline), which will overwrite the needed _PGM_LO_* registers. This reduces DeuxEX rgp captures from 150GB+ to less than 100MB. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18865>	2022-10-25 11:58:07 +00:00
Daniel Schürmann	a36e27e507	aco: change thread_local memory resource to pointer Apparently the TLS constructor doesn't work well if RADV is instantiated multiple times and/or used by a program with already existing threads. Fixes: `a128d444cb` ('aco: use monotonic_buffer_resource for instructions') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19219>	2022-10-25 09:08:08 +00:00
Qiang Yu	7ee0b8b8df	ac/nir/ngg,radv: use different counters for shader queries VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT should count for each stream. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7409 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19015>	2022-10-25 02:42:52 +00:00
Qiang Yu	83643e4dc8	nir,ac/nir/ngg,radv: split shader_query_enabled_amd For used by different counter. Vulkan: 1. VK_QUERY_PIPELINE_STATISTIC_GEOMETRY_SHADER_PRIMITIVES_BIT, sum generated primitives of all 4 streams when GS. 2. VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT, count generated primitives for all 4 streams when VS/TES/GS. 3. VK_QUERY_TYPE_TRANSFORM_FEEDBACK_STREAM_EXT, count generated and streamout primitives for all 4 streams when VS/TES/GS. OpenGL: 1. GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB, sum generated primitives for all 4 streams when GS. 2. GL_PRIMITIVES_GENERATED, count generated primitives for all 4 streams when VS/TES/GS. 3. GL_TRANSFORM_FEEDBACK_PRIMITIVES_WRITTEN, count streamout primitives for all 4 streams when VS/TES/GS. pipeline_stat_query_enabled_amd is for Vulkan 1 and OpenGL 1. xfb_query_enabled_amd is for Vulkan 2/3 and OpenGL 2/3. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19015>	2022-10-25 02:42:52 +00:00
Qiang Yu	1dcbf25757	radv: split active_pipeline_gds_queries For different enabling of pipeline stat and prims gen. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19015>	2022-10-25 02:42:52 +00:00
Qiang Yu	0bbe8029b6	radv: count gen_prims_queries_enabled User can enable/disable multi VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT queries with same or different index. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19015>	2022-10-25 02:42:52 +00:00
Timur Kristóf	a17e801a9c	aco: Add ACO_DEBUG=novalidateir option. This disables IR validation in debug/debugoptimized builds. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18103>	2022-10-24 20:14:16 +00:00
Timur Kristóf	0cceab788e	aco: Move is_dead to aco_ir.h to allow it to get inlined. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18103>	2022-10-24 20:14:16 +00:00
Timur Kristóf	36bc3afb8b	aco/optimizer_postRA: Delete dead instructions more efficiently. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18103>	2022-10-24 20:14:16 +00:00
Timur Kristóf	7263a29794	aco/optimizer_postRA: Properly handle vccz/execz/scc in reset_block. Fixes: `a8dd07518c` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18103>	2022-10-24 20:14:16 +00:00
Timur Kristóf	75967a4814	aco/optimizer_postRA: Speed up reset_block() with predecessors. Copy the information from the first predecessor then check whether it matches other predecessors and modify the data accordingly. Marked for backporting to stable to make it possible to also backport fixes based on this. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18103>	2022-10-24 20:14:16 +00:00
Timur Kristóf	b542ab0243	aco/optimizer_postRA: Use unique_ptr + array for instruction indices. According to perf, this roughly halves the impact of the post-RA optimizer in ACO's compile times. Measurement was taken using a debug optimized build using NIR_DEBUG=novalidate RADV_DEBUG=nocache and replaying the Fossil DB from the Doom Eternal shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18103>	2022-10-24 20:14:16 +00:00

1 2 3 4 5 ...

10415 commits