fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 15:38:19 +02:00

Author	SHA1	Message	Date
Faith Ekstrand	c859ea5783	nir: Add a sort_unstructured_blocks() helper Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28300>	2024-03-25 15:55:48 +00:00
Faith Ekstrand	a18a917237	nir: Improve the comment for nir_block::imm_dom Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28300>	2024-03-25 15:55:48 +00:00
Faith Ekstrand	a782809f81	nir/builder: Correctly handle decl_reg or undef as the first instruction These are both handled by inserting them directly at the top of the nir_function_impl. However, if the cursor is already at the top, it never gets updated so we end up inserting other stuff after the newly inserted undef or decl_reg. It's an odd edge case to be sure but I hit it with my new NIR CF pass for NAK. Fixes: `1be4c61c95` ("nir/builder: Add a helper for creating undefs") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28300>	2024-03-25 15:55:48 +00:00
Faith Ekstrand	b069151e62	nir/print: Inline print_ssa_use() It has one caller. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28300>	2024-03-25 15:55:48 +00:00
Faith Ekstrand	ba8860301f	nir: Take a nir_def in nir_goto_if() Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28300>	2024-03-25 15:55:48 +00:00
Jesse Natalie	d82ff37ed0	nir_tests: Add /bigobj when compiling with MSVC Otherwise the opt_varying tests fail to compile Fixes: `6dbd1dcd` ("nir/tests: add tests for nir_opt_varyings") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28345>	2024-03-24 15:50:18 +00:00
Marek Olšák	60231e6d6f	nir/validate: validate more fields of nir_io_semantics Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28312>	2024-03-22 22:39:50 +00:00
Marek Olšák	2034cf87c5	nir/lower_io: add nir_io_semantics::interp_explicit_strict This preserves the misnamed "per_vertex" flag in lowered IO. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28312>	2024-03-22 22:39:50 +00:00
Marek Olšák	f02678bb50	nir/validate: validate interp_mode of load_barycentric_* Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28312>	2024-03-22 22:39:50 +00:00
Marek Olšák	1585a5cc6d	nir,amd: add nir_intrinsic_load_debug_log_desc_amd and its use for shader debugging Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27952>	2024-03-22 21:58:02 +00:00
Marek Olšák	6773595ed0	nir: rename AMD XFB intrinsics to *_gfx11_amd to indicate it's only for gfx11. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27952>	2024-03-22 21:58:02 +00:00
Yonggang Luo	1ac1c0843f	treewide: Replace usage of macro DEBUG with MESA_DEBUG when possible This is achieved by the following steps: #ifndef DEBUG => #if !MESA_DEBUG defined(DEBUG) => MESA_DEBUG #ifdef DEBUG => #if MESA_DEBUG This is done by replace in vscode excludes docs,.rs,addrlib,src/imgui,.sh,src/intel/vulkan/grl/gpu These are safe because those files should keep DEBUG macro is already excluded; and not directly replace DEBUG, as we have some symbols around it. Use debug or NDEBUG instead of DEBUG in comments when proper This for reduce the usage of DEBUG, so it's easier migrating to MESA_DEBUG These are found when migrating DEBUG to MESA_DEBUG, these are all comment update, so it's safe Replace comment /* DEBUG / and / !DEBUG / with proper / MESA_DEBUG / or / !MESA_DEBUG */ manually DEBUG \|\| !NDEBUG -> MESA_DEBUG \|\| !NDEBUG !DEBUG && NDEBUG -> !(MESA_DEBUG \|\| !NDEBUG) Replace the DEBUG present in comment with proper new MESA_DEBUG manually Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28092>	2024-03-22 18:22:34 +00:00
Marek Olšák	9b819adbd8	glsl/linker,st/mesa: enable nir_opt_varyings and lower IO in the linker The varying linker isn't changed. The passes are executed after linking varyings and before linking uniforms if nir->options->lower_io_variables is true. nir_opt_varyings can move uniforms between shaders and cause them to be DCE'd. It requires moving IO deref lowering from st/mesa into the GLSL linker and nir_opt_varyings should be added at the same time because IO deref lowering alone would disable IO optimizations in st/mesa such as compaction. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26819>	2024-03-15 19:55:46 +00:00
Marek Olšák	6dbd1dcd93	nir/tests: add tests for nir_opt_varyings Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26819>	2024-03-15 19:55:46 +00:00
Marek Olšák	c66967b5cb	nir: add nir_opt_varyings, new pass optimizing and compacting varyings Highlights: - all shader stages and all input/output types are handled, including inputs and outputs with multiple vertices - the optimizations performed are: unused input/output removal, constant and uniform propagation, output deduplication, inter-shader code motion, and compaction - constant and uniform propagation and output deduplication work even if a shader contains multiple stores of the same output, e.g. in GS - the same optimizations are also performed between output stores and output loads (for TCS) - FS inputs are packed agressively. Only flat, interp FP32, and interp FP16 can't be in the same vec4. Also, if an output value is non-divergent within a primitive, the corresponding FS input is opportunistically promoted to flat. The big comment at the beginning of nir_opt_varyings.c has a detailed explanation, which is the same as: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8841 dEQP and GLCTS have incorrect tests that fail with this, see: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10361 Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26819>	2024-03-15 19:55:46 +00:00
Marek Olšák	ba54099dce	nir: add a utility computing post-dominance of SSA uses The massive comment explains it. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26819>	2024-03-15 19:55:46 +00:00
Faith Ekstrand	b68f2e747c	zink: Rework sparse texture lowering Instead of the previous fragile attempt to handle sparse_resident_and by crawling deref chains, we now insert an is_sparse_resident_zink intrinsic immediately after the tex or sparse_load intrinsic and define Zink's sparse resident codes to always be 0/1. Then sparse_resident_and becomes iand and is_sparse_texels_resident becomes != 0 and everything is well-defined and robust. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28123>	2024-03-14 22:37:51 +00:00
Alyssa Rosenzweig	49a89911c4	nir/print: do not print empty lists on intrinsics before: 32 %0 = @load_vertex_id () () after: 32 %0 = @load_vertex_id Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27925>	2024-03-12 19:00:26 +00:00
Alyssa Rosenzweig	a6123a80da	nir/opt_shrink_vectors: shrink some intrinsics from start If the backend supports it, intrinsics with a component() are straightforward to shrink from the start. Notably helps vectorized I/O. v2: add an option for this and enable only on grown up backends, because some backends ignore the component() parameter. RADV GFX11: Totals from 921 (1.16% of 79439) affected shaders: Instrs: 616558 -> 615529 (-0.17%); split: -0.30%, +0.14% CodeSize: 3099864 -> 3095632 (-0.14%); split: -0.25%, +0.11% Latency: 2177075 -> 2160966 (-0.74%); split: -0.79%, +0.05% InvThroughput: 299997 -> 298664 (-0.44%); split: -0.47%, +0.02% VClause: 16343 -> 16395 (+0.32%); split: -0.01%, +0.32% SClause: 10715 -> 10714 (-0.01%) Copies: 24736 -> 24701 (-0.14%); split: -0.37%, +0.23% PreVGPRs: 30179 -> 30173 (-0.02%) VALU: 353472 -> 353439 (-0.01%); split: -0.03%, +0.02% SALU: 40323 -> 40322 (-0.00%) VMEM: 25353 -> 25352 (-0.00%) AGX: total instructions in shared programs: 2038217 -> 2038049 (<.01%) instructions in affected programs: 10249 -> 10081 (-1.64%) total alu in shared programs: 1593094 -> 1592939 (<.01%) alu in affected programs: 7145 -> 6990 (-2.17%) total fscib in shared programs: 1589254 -> 1589102 (<.01%) fscib in affected programs: 7217 -> 7065 (-2.11%) total bytes in shared programs: 13975666 -> 13974722 (<.01%) bytes in affected programs: 65942 -> 64998 (-1.43%) total regs in shared programs: 592758 -> 591187 (-0.27%) regs in affected programs: 6936 -> 5365 (-22.65%) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> (v1) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28004>	2024-03-12 18:17:17 +00:00
Alyssa Rosenzweig	aa99753a28	nir/opt_shrink_vectors: hoist alu helpers to be used earlier in the file in the next commit Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28004>	2024-03-12 18:17:17 +00:00
Marek Olšák	813f37a8ed	nir: add nir_block::divergent to indicate a divergent entry condition to be used by nir_opt_varyings Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	936690f733	nir: print nir_io_semantics::invariant this was missing Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	867a0a7db9	nir/divergence_analysis: handle derefs of system values needed by GLSL compiler optimizations that have unlowered sysvals Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	eb670d6eaf	nir/divergence_analysis: load_instance_id is convergent within a primitive Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	310b13b7f0	nir/divergence_analysis: load_primitive_id is convergent within a primitive Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	1621d4a0d3	nir/divergence_analysis: change function prototypes for following commits Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Juan A. Suarez Romero	62e1dff256	v3d: add load_fep_w_v3d intrinsic This intrinsic helps to read the W coordinate stored in the QPU register when initializing the input data for the fragment shaders. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28072>	2024-03-11 12:42:49 +00:00
Jesse Natalie	cda6877cb6	nir_lower_tex_shadow: For old-style shadows, use vec4(result, 0, 0, 1) If the app requests a swizzle on the shadow sampler which doesn't just return the red channel or literal 0s/1s, we'll crash attempting to build the result vector. Use something that's probably valid. Cc: mesa-stable Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28001>	2024-03-07 01:15:46 +00:00
Rhys Perry	beb07fafba	nir/search: fix nir_replace_instr() debug code Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	a93bd52f4f	nir/lower_int64: allow 64-bit comparisons when lowering minmax RADV doesn't need these to be lowered. fossil-db (navi31): Totals from 1 (0.00% of 79242) affected shaders: Instrs: 28 -> 26 (-7.14%) CodeSize: 140 -> 128 (-8.57%) Latency: 605 -> 604 (-0.17%) Copies: 5 -> 6 (+20.00%) VALU: 14 -> 13 (-7.14%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	b37804c8de	nir/algebraic: optimize 64-bit comparisons with zero'd halves to 32-bit These expect nir_lower_int64 to replace u2u64 to pack_64_2x32_split(, 0). fossil-db (navi31): Totals from 149 (0.19% of 79242) affected shaders: Instrs: 433095 -> 431830 (-0.29%); split: -0.29%, +0.00% CodeSize: 2165980 -> 2160284 (-0.26%); split: -0.27%, +0.00% SpillSGPRs: 689 -> 688 (-0.15%) Latency: 3801497 -> 3799901 (-0.04%); split: -0.05%, +0.01% InvThroughput: 1547916 -> 1546567 (-0.09%); split: -0.09%, +0.01% VClause: 4698 -> 4693 (-0.11%) SClause: 9981 -> 9977 (-0.04%); split: -0.05%, +0.01% Copies: 66148 -> 65431 (-1.08%); split: -1.09%, +0.01% PreSGPRs: 6732 -> 6729 (-0.04%) PreVGPRs: 7976 -> 7945 (-0.39%) VALU: 252936 -> 252336 (-0.24%) SALU: 51794 -> 51274 (-1.00%); split: -1.03%, +0.02% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	417eb390c6	nir/algebraic: remove duplicated iand(ien, ine)/ior(ieq, ieq) patterns These don't seem useful, since they're already done in the early optimizations. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	6952bb359c	nir/algebraic: don't create 64-bit min/max/ior if lowered fossil-db (navi31): Totals from 58 (0.07% of 79242) affected shaders: Instrs: 11692 -> 11304 (-3.32%) CodeSize: 65836 -> 62412 (-5.20%) VGPRs: 1320 -> 1344 (+1.82%) Latency: 51712 -> 50234 (-2.86%) InvThroughput: 10190 -> 10160 (-0.29%) Copies: 460 -> 688 (+49.57%) VALU: 6130 -> 5897 (-3.80%) SALU: 1231 -> 1284 (+4.31%); split: -0.32%, +4.63% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Georg Lehmann	1d8b2b159e	nir/divergence_analysis: fix subgroup mask These depend on the subgroup invocation id, so they are divergent. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Fixes: `df86c5ffb3` ("nir: add divergence analysis pass.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27962>	2024-03-05 14:52:17 +00:00
Georg Lehmann	230743da2e	nir: remove rotate scope All other subgroup operations do not have a scope in NIR, so for consistency rotate shouldn't have one either. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27964>	2024-03-05 14:12:21 +00:00
Timothy Arceri	edf242f825	nir: add some nir_parameter fields These will be used in future to do more validation on functions as the glsl nir linker is expanded. The first use is in the following patch. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27841>	2024-03-04 05:40:55 +00:00
Timothy Arceri	7afce96b80	glsl_to_nir: merge function param handling Here we remove the special handling for input params that was hard to work with and unite it with the output and inout params. Here a mediump test needs to be updated to what is a more expected outcome anyway. We also need to update the code that inserts software f64 to the new way input params are handled. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27108>	2024-03-04 11:31:20 +11:00
Job Noorman	96c2fe3e1a	nir: add search helper is_only_used_by_if Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>	2024-03-01 13:45:11 +00:00
Lionel Landwerlin	259cdc5496	nir: add additional flag to resource_intel for embedded samplers This will enable specific lowering of embedded samplers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22151>	2024-02-29 07:05:06 +00:00
Faith Ekstrand	f4fb5277c3	nir: Add an imad opcode Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27159>	2024-02-27 21:51:30 -06:00
Ian Romanick	a2292f53b5	nir: Optimize uniform vote_all and vote_any No shader-db changes on any Intel platform. fossil-db: All Ice Lake and newer platforms had similar results. (Ice Lake) Totals: Instrs: 165513303 -> 165511820 (-0.00%) Cycles: 15125314947 -> 15125211500 (-0.00%); split: -0.00%, +0.00% Totals from 82 (0.01% of 656120) affected shaders: Instrs: 544627 -> 543144 (-0.27%) Cycles: 22616493 -> 22513046 (-0.46%); split: -0.46%, +0.00% No fossil-db changes on Gfx9. Suggested-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:32 -08:00
Ian Romanick	535caaf3e0	nir: Optimize uniform iadd, fadd, and ixor reduction operations This adds optimizations for iadd, fadd, and ixor with reduce, inclusive scan, and exclusive scan. NOTE: The fadd and ixor optimizations had no shader-db or fossil-db changes on any Intel platform. NOTE 2: This change "fixes" arb_compute_variable_group_size-local-size and base-local-size.shader_test on DG2 and MTL. This is just changing the code path taken to not use whatever path was not working properly before. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. The min, max, iand, and ior exclusive_scan optimizations are not implemented. Broadwell on shader-db is not happy. I have not investigated. v2: Silence some warnings about discarding const. v3: Rename mbcnt to count_active_invocations. Add a big comment explaining the differences between the two paths. Suggested by Rhys. shader-db: All Gfx9 and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20300384 -> 20299545 (<.01%) instructions in affected programs: 19167 -> 18328 (-4.38%) helped: 35 / HURT: 0 total cycles in shared programs: 842809750 -> 842766381 (<.01%) cycles in affected programs: 2160249 -> 2116880 (-2.01%) helped: 33 / HURT: 2 total spills in shared programs: 4632 -> 4626 (-0.13%) spills in affected programs: 206 -> 200 (-2.91%) helped: 3 / HURT: 0 total fills in shared programs: 5594 -> 5581 (-0.23%) fills in affected programs: 664 -> 651 (-1.96%) helped: 3 / HURT: 1 fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Totals: Instrs: 165551893 -> 165513303 (-0.02%) Cycles: 15132539132 -> 15125314947 (-0.05%); split: -0.05%, +0.00% Spill count: 45258 -> 45204 (-0.12%) Fill count: 74286 -> 74157 (-0.17%) Scratch Memory Size: 2467840 -> 2451456 (-0.66%) Totals from 712 (0.11% of 656120) affected shaders: Instrs: 598931 -> 560341 (-6.44%) Cycles: 184650167 -> 177425982 (-3.91%); split: -3.95%, +0.04% Spill count: 983 -> 929 (-5.49%) Fill count: 2274 -> 2145 (-5.67%) Scratch Memory Size: 52224 -> 35840 (-31.37%) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:11 -08:00
Ian Romanick	f10d1ef372	nir: Initial framework for optimizing uniform subgroup operations The first commit just optimizes operation where the result of the subgroup operation is the same as each of the individual channel results. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:38:31 -08:00
Ian Romanick	75de4458a1	nir: Mark nir_intrinsic_load_global_block_intel as divergent This is divergent because it specifically loads sequential values into successive SIMD lanes. No shader-db or fossil-db changes on any Intel platform. Fixes: `9f44a26462` ("nir/divergence: handle load_global_block_intel") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:42 -08:00
Ian Romanick	5da5106727	nir: Add documentation for subgroup_.._mask v2: Fix reference to GL_ARB_shader_ballot. Noticed by Lionel. Suggested-by: Lionel Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:09 -08:00
Sagar Ghuge	30ead72e80	nir: Allow nir_texop_tg4 in implicit derivative This allow us to invoke the quad helper. v2: (Georg) - Add check for is_gather_implicit_lod Fixes: `48158636bf` ("nir: add is_gather_implicit_lod") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27447>	2024-02-27 00:22:46 +00:00
Alyssa Rosenzweig	6825902bb6	treewide: use ralloc_memdup @@ expression memctx, dst, src, size; @@ -dst = ralloc_size(memctx, size); -memcpy(dst, src, size); +dst = ralloc_memdup(memctx, src, size); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27762>	2024-02-26 15:37:58 +00:00
Timur Kristóf	cc1501628f	nir: Clean up divergence analysis for TES patch input loads. Just make the code a little bit easier to follow. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27680>	2024-02-26 14:53:23 +00:00
Timur Kristóf	870a2e4197	nir: Cleanup divergence analysis for mesh shaders. 1. Mesh shaders don't have inputs (only task payload), so remove them from handling load_input. 2. Clarify in comments that loading any mesh shader output is an NV_mesh_shader only feature. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27680>	2024-02-26 14:53:23 +00:00
Timur Kristóf	9553d67373	nir: Fix divergence analysis of load_patch_vertices_in. load_patch_vertices_in can only occur in tessellation shaders, and contains the number of vertices in an input patch. * TCS: patch_vertices_in is equal to the input patch size * TES: patch_vertices_in is equal to the TCS output patch size The patch sizes may be set by a pipeline or dynamic states, however in both cases it is definitely uniform within a subgroup. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27680>	2024-02-26 14:53:23 +00:00

1 2 3 4 5 ...

5186 commits