fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-24 11:00:11 +01:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	a6123a80da	nir/opt_shrink_vectors: shrink some intrinsics from start If the backend supports it, intrinsics with a component() are straightforward to shrink from the start. Notably helps vectorized I/O. v2: add an option for this and enable only on grown up backends, because some backends ignore the component() parameter. RADV GFX11: Totals from 921 (1.16% of 79439) affected shaders: Instrs: 616558 -> 615529 (-0.17%); split: -0.30%, +0.14% CodeSize: 3099864 -> 3095632 (-0.14%); split: -0.25%, +0.11% Latency: 2177075 -> 2160966 (-0.74%); split: -0.79%, +0.05% InvThroughput: 299997 -> 298664 (-0.44%); split: -0.47%, +0.02% VClause: 16343 -> 16395 (+0.32%); split: -0.01%, +0.32% SClause: 10715 -> 10714 (-0.01%) Copies: 24736 -> 24701 (-0.14%); split: -0.37%, +0.23% PreVGPRs: 30179 -> 30173 (-0.02%) VALU: 353472 -> 353439 (-0.01%); split: -0.03%, +0.02% SALU: 40323 -> 40322 (-0.00%) VMEM: 25353 -> 25352 (-0.00%) AGX: total instructions in shared programs: 2038217 -> 2038049 (<.01%) instructions in affected programs: 10249 -> 10081 (-1.64%) total alu in shared programs: 1593094 -> 1592939 (<.01%) alu in affected programs: 7145 -> 6990 (-2.17%) total fscib in shared programs: 1589254 -> 1589102 (<.01%) fscib in affected programs: 7217 -> 7065 (-2.11%) total bytes in shared programs: 13975666 -> 13974722 (<.01%) bytes in affected programs: 65942 -> 64998 (-1.43%) total regs in shared programs: 592758 -> 591187 (-0.27%) regs in affected programs: 6936 -> 5365 (-22.65%) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> (v1) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28004>	2024-03-12 18:17:17 +00:00
Alyssa Rosenzweig	aa99753a28	nir/opt_shrink_vectors: hoist alu helpers to be used earlier in the file in the next commit Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28004>	2024-03-12 18:17:17 +00:00
Timothy Arceri	182bff5c05	glsl: remove unrequired do_lower_jumps() call We were using this to remove unreachable instructions following jumps. The previous patch allowed glsl to nir to handle these instructions so this call is no longer needed. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27288>	2024-03-12 01:43:03 +00:00
Timothy Arceri	1391bc3721	glsl_to_nir: never convert instructions after jump Unlike in GLSL IR it is illegal to add an instruction to a block following a jump in NIR. Here we add code to the glsl_to_ir pass to remove any such instructions before they are processed i.e. we remove them as soon as we process the jumps. Handling this in glsl to nir allows us to avoid depending on the lower_jumps() pass being called directly before glsl to nir when it otherwise doesn't need to be called an additional time. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27288>	2024-03-12 01:43:03 +00:00
Timothy Arceri	f06aed8e1d	glsl: make an explicitly safe version of visit_exec_list() visit_exec_list() has always called foreach_in_list_safe() here were rename that version to visit_exec_list_safe() and create a version that calls the non-safe foreach call. There are only 2 users of visit_exec_list() we change lower_jumps to use the renamed version and leave glsl_to_nir() to use the non-safe version as it never deletes the current instruction and in the following patch we will add code that may delete the next instruction meaning the safe version would be unsafe to use. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27288>	2024-03-12 01:43:03 +00:00
Marek Olšák	813f37a8ed	nir: add nir_block::divergent to indicate a divergent entry condition to be used by nir_opt_varyings Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	936690f733	nir: print nir_io_semantics::invariant this was missing Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	867a0a7db9	nir/divergence_analysis: handle derefs of system values needed by GLSL compiler optimizations that have unlowered sysvals Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	eb670d6eaf	nir/divergence_analysis: load_instance_id is convergent within a primitive Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	310b13b7f0	nir/divergence_analysis: load_primitive_id is convergent within a primitive Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Marek Olšák	1621d4a0d3	nir/divergence_analysis: change function prototypes for following commits Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28049>	2024-03-12 00:29:03 +00:00
Iván Briano	e1b66f9707	compiler/types: fix serialization of cooperative matrix Encoding of cmat_desc is overwriting the base_type with the type of the elements of the matrix. Fixes: `2d0f4f2c17` ("compiler/types: Add support for Cooperative Matrix types") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28086>	2024-03-11 20:35:16 +00:00
Juan A. Suarez Romero	62e1dff256	v3d: add load_fep_w_v3d intrinsic This intrinsic helps to read the W coordinate stored in the QPU register when initializing the input data for the fragment shaders. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28072>	2024-03-11 12:42:49 +00:00
Timothy Arceri	981900055c	glsl: remove now unused glsl ir lower discard pass Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28005>	2024-03-07 04:02:45 +00:00
Timothy Arceri	8ceb10a1bd	glsl: make use of nir lower discard flow Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28005>	2024-03-07 04:02:45 +00:00
Timothy Arceri	8317a37ea7	glsl: implement nir version of lower discard flow Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28005>	2024-03-07 04:02:45 +00:00
Jesse Natalie	cda6877cb6	nir_lower_tex_shadow: For old-style shadows, use vec4(result, 0, 0, 1) If the app requests a swizzle on the shadow sampler which doesn't just return the red channel or literal 0s/1s, we'll crash attempting to build the result vector. Use something that's probably valid. Cc: mesa-stable Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28001>	2024-03-07 01:15:46 +00:00
Rhys Perry	beb07fafba	nir/search: fix nir_replace_instr() debug code Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	a93bd52f4f	nir/lower_int64: allow 64-bit comparisons when lowering minmax RADV doesn't need these to be lowered. fossil-db (navi31): Totals from 1 (0.00% of 79242) affected shaders: Instrs: 28 -> 26 (-7.14%) CodeSize: 140 -> 128 (-8.57%) Latency: 605 -> 604 (-0.17%) Copies: 5 -> 6 (+20.00%) VALU: 14 -> 13 (-7.14%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	b37804c8de	nir/algebraic: optimize 64-bit comparisons with zero'd halves to 32-bit These expect nir_lower_int64 to replace u2u64 to pack_64_2x32_split(, 0). fossil-db (navi31): Totals from 149 (0.19% of 79242) affected shaders: Instrs: 433095 -> 431830 (-0.29%); split: -0.29%, +0.00% CodeSize: 2165980 -> 2160284 (-0.26%); split: -0.27%, +0.00% SpillSGPRs: 689 -> 688 (-0.15%) Latency: 3801497 -> 3799901 (-0.04%); split: -0.05%, +0.01% InvThroughput: 1547916 -> 1546567 (-0.09%); split: -0.09%, +0.01% VClause: 4698 -> 4693 (-0.11%) SClause: 9981 -> 9977 (-0.04%); split: -0.05%, +0.01% Copies: 66148 -> 65431 (-1.08%); split: -1.09%, +0.01% PreSGPRs: 6732 -> 6729 (-0.04%) PreVGPRs: 7976 -> 7945 (-0.39%) VALU: 252936 -> 252336 (-0.24%) SALU: 51794 -> 51274 (-1.00%); split: -1.03%, +0.02% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	417eb390c6	nir/algebraic: remove duplicated iand(ien, ine)/ior(ieq, ieq) patterns These don't seem useful, since they're already done in the early optimizations. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Rhys Perry	6952bb359c	nir/algebraic: don't create 64-bit min/max/ior if lowered fossil-db (navi31): Totals from 58 (0.07% of 79242) affected shaders: Instrs: 11692 -> 11304 (-3.32%) CodeSize: 65836 -> 62412 (-5.20%) VGPRs: 1320 -> 1344 (+1.82%) Latency: 51712 -> 50234 (-2.86%) InvThroughput: 10190 -> 10160 (-0.29%) Copies: 460 -> 688 (+49.57%) VALU: 6130 -> 5897 (-3.80%) SALU: 1231 -> 1284 (+4.31%); split: -0.32%, +4.63% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27335>	2024-03-06 15:23:18 +00:00
Georg Lehmann	1d8b2b159e	nir/divergence_analysis: fix subgroup mask These depend on the subgroup invocation id, so they are divergent. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Fixes: `df86c5ffb3` ("nir: add divergence analysis pass.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27962>	2024-03-05 14:52:17 +00:00
Georg Lehmann	230743da2e	nir: remove rotate scope All other subgroup operations do not have a scope in NIR, so for consistency rotate shouldn't have one either. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27964>	2024-03-05 14:12:21 +00:00
Christian Gmeiner	516a2a3a0e	isaspec: encode: Constify bitset_params Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27965>	2024-03-05 07:29:08 +00:00
Christian Gmeiner	381d19d138	isaspec: encode: Constify encode.type Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27965>	2024-03-05 07:29:08 +00:00
Oskar Viljasaar	f9acfeeb59	compiler/types: Fix glsl_dvec*_type() helpers Commit `90e364edb0` contained a typo in the glsl_dvec4_type() helper, instead returning a glsl_ivec4_type. As an ivec4 is 2x smaller than a dvec4, this also broke piglit sanity on crocus/hsw. This also fixes the dvec2 helper, though it has not been specifically tested anywhere. Fixes: `90e364edb0` ("compiler/types: Add a few more helpers to get builtin types") Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27917>	2024-03-04 15:22:32 +00:00
Corentin Noël	bc11e6ee8d	glsl: Ensure that we are dealing with ir_variable and ir_rvalue Use the built-in function from ir_instruction to make sure that we are actually not casting to anther type by mistake. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27114>	2024-03-04 08:03:55 +00:00
Erik Faye-Lund	d795bd380a	glsl: Make error_value a real ir_rvalue type It exposes a type so let it be a real ir_rvalue instead of abusing ir_type_unset. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27114>	2024-03-04 08:03:55 +00:00
Timothy Arceri	eefd836ebc	glsl: make use of nir recursion detection Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27841>	2024-03-04 05:40:55 +00:00
Timothy Arceri	38eb850883	glsl: move function inlining out of glsl_to_nir() This will allow us to do more of the function linking work in nir in the future. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27841>	2024-03-04 05:40:55 +00:00
Timothy Arceri	f7a664754f	glsl: add nir version of function recursion detection Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27841>	2024-03-04 05:40:55 +00:00
Timothy Arceri	eecd7504a8	glsl: add missing define to linker_util.h Avoids compiler warning in files that use linker_util.h but not the set util. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27841>	2024-03-04 05:40:55 +00:00
Timothy Arceri	edf242f825	nir: add some nir_parameter fields These will be used in future to do more validation on functions as the glsl nir linker is expanded. The first use is in the following patch. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27841>	2024-03-04 05:40:55 +00:00
Timothy Arceri	39052dabf9	glsl: don't inline functions in glsl ir Everthing is now in place for nir and glsl to nir to handle this stuff for us. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27108>	2024-03-04 11:31:21 +11:00
Timothy Arceri	c6c150b4cd	glsl_to_nir: support conversion of opaque function params Here we can assume anything that is not an input is bindless. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27108>	2024-03-04 11:31:21 +11:00
Timothy Arceri	de7574f70a	glsl_to_nir: support conversion of struct/array function returns This adds support for array and struct function returns in the glsl to nir pass allowing us to avoid extra calls to the glsl IR optimisation loop. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27108>	2024-03-04 11:31:20 +11:00
Timothy Arceri	fac9b1c594	glsl_to_nir: support conversion of struct/array function params This adds support for array and struct function params in the glsl to nir pass allowing us to avoid extra calls to the glsl IR optimisation loop. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27108>	2024-03-04 11:31:20 +11:00
Timothy Arceri	7afce96b80	glsl_to_nir: merge function param handling Here we remove the special handling for input params that was hard to work with and unite it with the output and inout params. Here a mediump test needs to be updated to what is a more expected outcome anyway. We also need to update the code that inserts software f64 to the new way input params are handled. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27108>	2024-03-04 11:31:20 +11:00
Job Noorman	96c2fe3e1a	nir: add search helper is_only_used_by_if Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>	2024-03-01 13:45:11 +00:00
Lionel Landwerlin	259cdc5496	nir: add additional flag to resource_intel for embedded samplers This will enable specific lowering of embedded samplers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22151>	2024-02-29 07:05:06 +00:00
Faith Ekstrand	f4fb5277c3	nir: Add an imad opcode Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27159>	2024-02-27 21:51:30 -06:00
Ian Romanick	a2292f53b5	nir: Optimize uniform vote_all and vote_any No shader-db changes on any Intel platform. fossil-db: All Ice Lake and newer platforms had similar results. (Ice Lake) Totals: Instrs: 165513303 -> 165511820 (-0.00%) Cycles: 15125314947 -> 15125211500 (-0.00%); split: -0.00%, +0.00% Totals from 82 (0.01% of 656120) affected shaders: Instrs: 544627 -> 543144 (-0.27%) Cycles: 22616493 -> 22513046 (-0.46%); split: -0.46%, +0.00% No fossil-db changes on Gfx9. Suggested-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:32 -08:00
Ian Romanick	535caaf3e0	nir: Optimize uniform iadd, fadd, and ixor reduction operations This adds optimizations for iadd, fadd, and ixor with reduce, inclusive scan, and exclusive scan. NOTE: The fadd and ixor optimizations had no shader-db or fossil-db changes on any Intel platform. NOTE 2: This change "fixes" arb_compute_variable_group_size-local-size and base-local-size.shader_test on DG2 and MTL. This is just changing the code path taken to not use whatever path was not working properly before. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. The min, max, iand, and ior exclusive_scan optimizations are not implemented. Broadwell on shader-db is not happy. I have not investigated. v2: Silence some warnings about discarding const. v3: Rename mbcnt to count_active_invocations. Add a big comment explaining the differences between the two paths. Suggested by Rhys. shader-db: All Gfx9 and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20300384 -> 20299545 (<.01%) instructions in affected programs: 19167 -> 18328 (-4.38%) helped: 35 / HURT: 0 total cycles in shared programs: 842809750 -> 842766381 (<.01%) cycles in affected programs: 2160249 -> 2116880 (-2.01%) helped: 33 / HURT: 2 total spills in shared programs: 4632 -> 4626 (-0.13%) spills in affected programs: 206 -> 200 (-2.91%) helped: 3 / HURT: 0 total fills in shared programs: 5594 -> 5581 (-0.23%) fills in affected programs: 664 -> 651 (-1.96%) helped: 3 / HURT: 1 fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Totals: Instrs: 165551893 -> 165513303 (-0.02%) Cycles: 15132539132 -> 15125314947 (-0.05%); split: -0.05%, +0.00% Spill count: 45258 -> 45204 (-0.12%) Fill count: 74286 -> 74157 (-0.17%) Scratch Memory Size: 2467840 -> 2451456 (-0.66%) Totals from 712 (0.11% of 656120) affected shaders: Instrs: 598931 -> 560341 (-6.44%) Cycles: 184650167 -> 177425982 (-3.91%); split: -3.95%, +0.04% Spill count: 983 -> 929 (-5.49%) Fill count: 2274 -> 2145 (-5.67%) Scratch Memory Size: 52224 -> 35840 (-31.37%) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:11 -08:00
Ian Romanick	f10d1ef372	nir: Initial framework for optimizing uniform subgroup operations The first commit just optimizes operation where the result of the subgroup operation is the same as each of the individual channel results. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:38:31 -08:00
Ian Romanick	75de4458a1	nir: Mark nir_intrinsic_load_global_block_intel as divergent This is divergent because it specifically loads sequential values into successive SIMD lanes. No shader-db or fossil-db changes on any Intel platform. Fixes: `9f44a26462` ("nir/divergence: handle load_global_block_intel") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:42 -08:00
Ian Romanick	5da5106727	nir: Add documentation for subgroup_.._mask v2: Fix reference to GL_ARB_shader_ballot. Noticed by Lionel. Suggested-by: Lionel Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:09 -08:00
Sagar Ghuge	30ead72e80	nir: Allow nir_texop_tg4 in implicit derivative This allow us to invoke the quad helper. v2: (Georg) - Add check for is_gather_implicit_lod Fixes: `48158636bf` ("nir: add is_gather_implicit_lod") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27447>	2024-02-27 00:22:46 +00:00
Alyssa Rosenzweig	6825902bb6	treewide: use ralloc_memdup @@ expression memctx, dst, src, size; @@ -dst = ralloc_size(memctx, size); -memcpy(dst, src, size); +dst = ralloc_memdup(memctx, src, size); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27762>	2024-02-26 15:37:58 +00:00
Timur Kristóf	cc1501628f	nir: Clean up divergence analysis for TES patch input loads. Just make the code a little bit easier to follow. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27680>	2024-02-26 14:53:23 +00:00

1 2 3 4 5 ...

9117 commits