fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 15:18:09 +02:00

Author	SHA1	Message	Date
Marek Olšák	0e0cc12de6	nir/opt_vectorize: don't ralloc the set Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:48 +00:00
Marek Olšák	f7ca848ad5	nir/remove_dead_variables: don't ralloc the set Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:48 +00:00
Marek Olšák	68b80e4d25	nir/instr_set: don't ralloc the set Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:48 +00:00
Marek Olšák	c1ae58d479	nir/lower_vars_to_ssa: don't ralloc sets reducing ralloc overhead Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:48 +00:00
Marek Olšák	3aadae22ad	nir: make nir_block::predecessors & dom_frontier sets non-malloc'd We can just place the set structures inside nir_block. This reduces the number of ralloc calls by 6.7% when compiling Heaven shaders with radeonsi+ACO using a release build (i.e. not including nir_validate set allocations, which are also removed). Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:48 +00:00
Marek Olšák	81cb571642	nir/dominance: eliminate ralloc overhead for allocating dom_children This is only 1% of all ralloc calls of Unigine Heaven with the gallium noop driver, but it's an easy one to get rid of. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:48 +00:00
Marek Olšák	aeed2cc19d	nir/dominance: don't allocate 0-sized dom_children 86% of all ralloc calls for dom_children in Unigine Heaven + Superposition had size == 0. It was only allocating the ralloc header. It was 6.1% of all ralloc calls with the gallium noop driver. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:48 +00:00
Job Noorman	30716cc524	nir/lower_explicit_io: add support for offset_shift The goal here is to generate addresses that are a right-shifted version of the actual byte address and record the shift amount in the offset_shift index. While we could just insert a ushr at the end of deref chains, this will prevent the shift from being optimized away in many cases. Instead, we try to extract the shift from the array strides and struct offsets that make up the deref chain, and only insert a ushr when absolutely necessary (i.e., for casts). This means we have to walk the entire deref chain at once for accesses that support offset_shift and we don't use the standard algorithm of replacing each deref one at a time. To be able to legally right-shift casts, we use the alignment information and never shift more than what the alignment could support. It should also be noted that casts generally have two sources: something provided by the driver (e.g., a Vulkan resource index) or a variable pointer coming from a phi/bcsel. For the latter, the entire access chain consists of multiple parts that are ended by either a phi/bcsel or an access. Only the part that ends in an access is handled by this new algorithm; the other parts are handled as usual. This is necessary because we have no way to encode the offset shift or to even know how much we would be able to shift without knowing how it is accessed. This commit adds the general implementation for lowering accesses using offset_shift and adds a compiler option for drivers to enable it for SSBO accesses. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	1406eafbcd	nir/lower_explicit_io: add alignment parameters to address builder We will need this when building shifted addresses. Since adding these parameters has a lot of code churn which would distract from the main changes, it is split-off in a separate commit. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	553a439b54	nir/lower_explicit_io: use nir_io_offset to pass around addresses We will add support for shifted addresses; this commit makes sure the APIs of the functions already support passing shifts. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	4c9afbd01d	nir/lower_explicit_io: add helper to build address The helper is used to build the address passed to build_explicit_io_load/store. For now, it simply takes care of adding the component offset when scalarizing. In the future, this can be used to do more complex address manipulations, like calculating the full deref chain address. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	1fffba12a0	nir/lower_explicit_io: make offset calculation reusable nir_explicit_io_address_from_deref implicitly builds the offset but only makes the full address available. Split out the offset calculation into a separate function so we can reuse it elsewhere. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	b0bc97cb43	nir/opt_load_store_vectorize: fix wrap check for scaled offsets Hardware will typically do bounds checking on the final scaled address so the wrap check should do the same. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	cb773dec8c	nir/opt_load_store_vectorize: add support for offset_shift Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	249e27c9c7	nir/opt_load_store_vectorize: allow per-instruction offset scaling We currently support offset scaling on a per-intrinsic type basis. Since the introduction of the offset_shift index, different instantiations of the same type can now have a different scale. Add support for this by calculating the offset scale on the fly for instructions that have offset_shift. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	7fe484e373	nir/lower_mem_access_bit_sizes: add partial support for offset_shift Note: this was implemented and tested for ir3. The code paths that are never used there [1] seem non-trivial to implement. Since they cannot be easily tested, asserts and TODOs are added to ensure we don't accidentally hit them for intrinsics with offset_shift. [1]: these paths are never used on ir3 since lower_mem_access_bit_sizes is only used for SSBO accesses to lower 64b accesses (which are 64b aligned) to 32b ones. So we'll never request an increase of alignment. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	cd72d8e366	nir/opt_shrink_vectors: add support for offset_shift Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	f876adc372	nir/lower_wrmasks: add support for offset_shift Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	b85c379945	nir/lower_wrmasks: don't adjust BASE The immediate addition can easily be handled by nir_opt_offsets, which will also take any driver limits into account. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	dd296a6d80	nir/lower_io_to_scalar: add support for offset_shift Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	59eb95cd2f	nir/lower_atomics: add support for offset_shift Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	513412c893	nir,ir3: add offset_shift index to SSBO access intrinsics In ir3, SSBO offsets are in units of the accessed type size so we want to start using the new offset_shift index. Even though the shift is implicit for the ir3 intrinsics, we use nir_intrinsic_copy_const_indices when creating them so we need to make sure our indices match the ones used by the generic intrinsics. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	355c9b88f7	nir: add some helpers for dealing with offset_shift For intrinsics supporting offset_shift, dealing with their offset is a bit tricky as we cannot simply add a byte offset to it anymore (which is what most passes want to do). This commit adds some helpers to add byte offsets (and adjust offset_shift accordingly) so that individual passes don't have to worry about this. Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	7cc09e9952	nir: add offset_shift intrinsic index For load/store intrinsics that take an offset, this specifies the amount the offset is shifted left to calculate the final offset: offset = (offset_src + base) << offset_shift This is useful for backends that have memory operations that use offset units other than bytes (i.e., where the shift is implicit). Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Job Noorman	ebea9ce825	nir: add nir_src_is_deref helper Signed-off-by: Job Noorman <jnoorman@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35092>	2025-08-20 07:51:30 +00:00
Karol Herbst	83cf765f8e	nak: run nir_opt_move nir_move_load_ubo Usually we can fold most ldc and ldcx into the instruction using it, however there are a couple of cases where we can't, e.g. when there is an indirect offset. Moving the ldc(x) down to the consumer leads to increased value ranges for uniform registers, but lowers them for normal registers. Totals: CodeSize: 914650304 -> 914469536 (-0.02%); split: -0.05%, +0.03% Number of GPRs: 3879754 -> 3863818 (-0.41%); split: -0.42%, +0.01% Static cycle count: 1073273107 -> 1073101189 (-0.02%); split: -0.09%, +0.08% Spills to reg: 67219 -> 67707 (+0.73%); split: -0.10%, +0.83% Fills from reg: 79733 -> 80456 (+0.91%); split: -0.10%, +1.01% Max warps/SM: 3666036 -> 3672668 (+0.18%); split: +0.18%, -0.00% Totals from 24235 (27.66% of 87622) affected shaders: CodeSize: 444747392 -> 444566624 (-0.04%); split: -0.11%, +0.07% Number of GPRs: 1360384 -> 1344448 (-1.17%); split: -1.20%, +0.03% Static cycle count: 806310857 -> 806138939 (-0.02%); split: -0.12%, +0.10% Spills to reg: 35826 -> 36314 (+1.36%); split: -0.19%, +1.55% Fills from reg: 31863 -> 32586 (+2.27%); split: -0.26%, +2.53% Max warps/SM: 911328 -> 917960 (+0.73%); split: +0.74%, -0.01% Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36536>	2025-08-19 17:29:07 +00:00
Georg Lehmann	de3d04dd72	nir/uub: guard against division by 0 Fixes: `8ee5440073` ("nir/uub: improve ishl/imul with constant sources") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36805>	2025-08-19 15:49:57 +00:00
Daniel Schürmann	8c8fc7d058	nir/opt_load_store_vectorize: don't vectorize large shared2_amd loads for performance reasons. Totals from 180 (0.23% of 79839) affected shaders: (Navi48) Instrs: 288089 -> 289937 (+0.64%); split: -0.00%, +0.64% CodeSize: 1515884 -> 1527936 (+0.80%); split: -0.00%, +0.80% VGPRs: 10740 -> 10704 (-0.34%) Latency: 1477965 -> 1478591 (+0.04%); split: -0.09%, +0.14% InvThroughput: 467449 -> 467885 (+0.09%); split: -0.02%, +0.11% VClause: 5012 -> 5010 (-0.04%); split: -0.08%, +0.04% SClause: 6509 -> 6512 (+0.05%); split: -0.02%, +0.06% Copies: 20815 -> 20923 (+0.52%); split: -0.28%, +0.80% Branches: 6019 -> 6018 (-0.02%) PreSGPRs: 7670 -> 7669 (-0.01%) PreVGPRs: 7239 -> 7192 (-0.65%) VALU: 151763 -> 152011 (+0.16%); split: -0.04%, +0.20% SALU: 39199 -> 39202 (+0.01%) VOPD: 877 -> 861 (-1.82%); split: +0.57%, -2.39% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36133>	2025-08-19 14:28:14 +00:00
Daniel Schürmann	957b271a9f	nir/opt_load_store_vectorize: only attempt to vectorize shared2 after exhausting other possibilities Totals from 249 (0.31% of 79839) affected shaders: (Navi48) Instrs: 276401 -> 275918 (-0.17%); split: -0.29%, +0.11% CodeSize: 1477072 -> 1474440 (-0.18%); split: -0.26%, +0.08% VGPRs: 12748 -> 12760 (+0.09%); split: -0.28%, +0.38% Latency: 1397959 -> 1398846 (+0.06%); split: -0.10%, +0.16% InvThroughput: 424767 -> 424496 (-0.06%); split: -0.09%, +0.02% VClause: 5183 -> 5186 (+0.06%); split: -0.10%, +0.15% SClause: 6537 -> 6538 (+0.02%); split: -0.05%, +0.06% Copies: 21295 -> 21098 (-0.93%); split: -1.21%, +0.29% Branches: 4324 -> 4325 (+0.02%) PreSGPRs: 9719 -> 9717 (-0.02%) PreVGPRs: 8857 -> 8847 (-0.11%); split: -0.24%, +0.12% VALU: 144514 -> 144334 (-0.12%); split: -0.20%, +0.07% SALU: 38970 -> 38944 (-0.07%); split: -0.08%, +0.01% VOPD: 884 -> 898 (+1.58%); split: +1.92%, -0.34% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36133>	2025-08-19 14:28:14 +00:00
Gert Wollny	8c65da0c9d	r600/sfn: cleanup GS shader emission Now that we lower all load_per_vertex_input to r600_load_per_vertex_input we can remove some dead code and also change the intrinsic to use only one source value. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36488>	2025-08-12 14:30:17 +00:00
Georg Lehmann	8818d7367d	nir/opt_load_skip_helpers: optionally handle intrinsics Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36610>	2025-08-12 08:56:37 +00:00
Georg Lehmann	cd687e277f	nir: add access for scratch loads To be able to use ACCESS_SKIP_HELPERS. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36610>	2025-08-12 08:56:37 +00:00
Georg Lehmann	2d16f457c5	nir: add ACCESS_SKIP_HELPERS Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36610>	2025-08-12 08:56:37 +00:00
Georg Lehmann	91572a99bb	nir: rename to nir_opt_load_skip_helpers and add options struct Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36610>	2025-08-12 08:56:37 +00:00
Georg Lehmann	fbae0893a6	nir: print skip_helpers for tex instrs Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36610>	2025-08-12 08:56:37 +00:00
Georg Lehmann	6577f68ad4	nir/opt_tex_skip_helpers: never require helpers for stores/atomics Helpers never execute stores/atomics. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36610>	2025-08-12 08:56:37 +00:00
Georg Lehmann	26e6c4c092	nir/opt_tex_skip_helpers: don't skip helpers for terminate_if source Helpers must be terminated correctly. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36610>	2025-08-12 08:56:37 +00:00
Qiang Yu	bfd7f498a5	nir/opt_varying: remove assert for mesh shader crash This assert does not hold for mesh shaders. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36596>	2025-08-11 01:44:45 +00:00
Alyssa Rosenzweig	8566a566e6	nir: plumb ballot options glsl needs to plumb this from the backend. we should clean up nir_lower_subgroups to use this later but I don't have time to churn everything right now. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36649>	2025-08-08 20:51:03 +00:00
Alyssa Rosenzweig	1af0897452	nir/lower_subgroups: add lower_fp64 option This is needed for doubles lowering to do the right thing. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36649>	2025-08-08 20:51:03 +00:00
John Anthony	000bd3046d	nir,spirv: Add support for SPV_ARM_core_builtins Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36019>	2025-08-07 11:46:33 +02:00
John Anthony	a68a825aad	nir,agx: unvendor core_id_agx core_id will be used by SPV_ARM_core_builtins Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36019>	2025-08-07 11:46:33 +02:00
Qiang Yu	c135ed1eb9	all: rename gl_shader_stage_name to mesa_shader_stage_name Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:41 +08:00
Qiang Yu	807d693421	compiler: rename gl_shader_stage_is_callable to mesa_shader_stage_is_callable Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:41 +08:00
Qiang Yu	4847e0b380	all: rename gl_shader_stage_uses_workgroup to mesa_shader_stage_uses_workgroup Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:41 +08:00
Qiang Yu	7a91473192	all: rename gl_shader_stage_is_compute to mesa_shader_stage_is_compute Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:41 +08:00
Qiang Yu	196569b1a4	all: rename gl_shader_stage to mesa_shader_stage It's not only for GL, change to a generic name. Use command: find . -type f -not -path '/.git/' -exec sed -i 's/\bgl_shader_stage\b/mesa_shader_stage/g' {} + Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:40 +08:00
Marek Olšák	fee8e92855	nir: use gc_ctx for nir_variable to reduce ralloc/malloc overhead gc_ctx uses a slab allocator. This reduces GLSL compile times by 1-3% with the gallium noop driver. This reduces the number of ralloc_size calls for Heaven shaders by 14.3%. Note that gc_ctx also uses ralloc_size, so the reduction is a net change. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36538>	2025-08-05 22:55:14 +00:00
Marek Olšák	44350bce1f	nir: add nir_variable_create_zeroed helper This will allow us to switch nir_variable from ralloc to gc_ctx, which uses a slab allocator. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36538>	2025-08-05 22:55:14 +00:00
Marek Olšák	b769d5dcde	nir: don't use variables as ralloc parents, use the shader instead so that we can switch variables to gc_ctx Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36538>	2025-08-05 22:55:13 +00:00
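
Commit 7cc09e9952 above gives the offset_shift formula as offset = (offset_src + base) << offset_shift, and 355c9b88f7 mentions helpers that add byte offsets while adjusting offset_shift. The C sketch below only illustrates that arithmetic on plain integers. The struct and both function names are hypothetical and are not NIR API; in particular, lowering the shift when the added byte count is not a multiple of the current unit is an assumption about one way such a helper could behave, not a description of what Mesa does. Integer overflow is ignored.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for an intrinsic's offset operands (not a NIR type). */
struct shifted_offset {
   uint32_t offset_src;   /* dynamic offset, in units of (1 << offset_shift) bytes */
   uint32_t base;         /* constant base, in the same units */
   unsigned offset_shift; /* log2 of the offset unit size in bytes */
};

/* Final byte offset, following the formula quoted in commit 7cc09e9952. */
static uint32_t
final_byte_offset(struct shifted_offset o)
{
   assert(o.offset_shift < 32);
   return (o.offset_src + o.base) << o.offset_shift;
}

/* Add a byte offset (assumed behavior, see lead-in). While `bytes` is not a
 * multiple of the current unit, halve the unit: double both operands and
 * decrement the shift, which leaves the final byte offset unchanged. Then
 * fold the now unit-aligned byte count into `base`.
 */
static struct shifted_offset
add_byte_offset(struct shifted_offset o, uint32_t bytes)
{
   assert(o.offset_shift < 32);
   while (o.offset_shift > 0 &&
          (bytes & ((1u << o.offset_shift) - 1)) != 0) {
      o.offset_src <<= 1;
      o.base <<= 1;
      o.offset_shift--;
   }
   o.base += bytes >> o.offset_shift;
   return o;
}
```

With offset_src = 3, base = 1 and offset_shift = 2, the final offset is 16 bytes. Adding 8 bytes keeps the shift at 2, while adding 2 bytes forces the shift down to 1 in this sketch.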


6518 commits