fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 22:28:06 +02:00

Author	SHA1	Message	Date
Iván Briano	7fce39484e	nir: add pass to convert ViewIndex to DeviceIndex Used to implement VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT_KHR. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30329>	2024-08-07 19:09:55 +00:00
Georg Lehmann	b6d3f666ab	nir/peephole_select: ignore masked/quad swizzle without fetch_inactive Without fetch_inactive, these instructions need to return 0 for inactive lanes and peephole_select changes which instructions are inactive. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30540>	2024-08-07 20:21:05 +02:00
Zan Dobersek	7fd5f76393	nir/lower_vars_to_scratch: calculate threshold-limited variable size separately ir3's lowering of variables to scratch memory has to treat 8-bit values as 16-bit ones when comparing such value's size against the given threshold since those values are handled through 16-bit half-registers. But those values can still use natural 8-bit size and alignment for storing inside scratch memory. nir_lower_vars_to_scratch now accepts two size-and-alignment functions, one used for calculating the variable size and the other for calculating the size and alignment needed for storing inside scratch memory. Non-ir3 uses of this pass can just duplicate the currently-used function. ir3 provides a separate variable-size function that special-cases 8-bit types. Signed-off-by: Zan Dobersek <zdobersek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29875>	2024-08-07 14:32:28 +00:00
Alyssa Rosenzweig	796b3ab23d	nir/opt_peephole_select: allow speculatable load constant this is useful on AGX when soft fault is enabled. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30501>	2024-08-06 20:01:37 +00:00
Alyssa Rosenzweig	340831dbcc	nir/divergence_analysis: handle AGX stuff bunch of vendor intrinsics, plus some standard intrinsics used in weird shader stages. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30488>	2024-08-06 11:48:18 -04:00
Alyssa Rosenzweig	d99c2ef059	nir/opt_uniform_atomics: add fs atomics predicated? flag on agx (and mali), we predicate atomics on "if (!helper)", so doing so again in this pass is redundant. and would cause a problem since we'd then have to lower the "is helper inv?" flag late. so just skip the extra lowering code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30488>	2024-08-06 11:48:17 -04:00
Rhys Perry	810808b778	nir/opt_uniform_atomics: require block index metadata is_atomic_already_optimized() uses this. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30518>	2024-08-06 15:04:21 +00:00
Karol Herbst	14ea102175	nir: add load_global_size intrinsic There is no need to compute it in the shader as the result is known at runtime already. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Tested-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30467>	2024-08-01 17:43:42 +00:00
Timothy Arceri	298633e365	nir: set disallow_undef_to_nan for legacy ARB asm programs Reviewed-by: Marek Olšák <marek.olsak@amd.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11389 Fixes: `861d274453` ("nir: replace undef only used by ALU opcodes with 0 or NaN") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30419>	2024-08-01 02:28:24 +00:00
Christian Gmeiner	26474f8d4a	nir_lower_mem_access_bit_sizes: Support load_kernel_input Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30407>	2024-07-30 06:51:22 +00:00
Timothy Arceri	017770ff14	nir: add nir_tex_src_{sampler,texture}_deref_intrinsic To be used as a placeholder until after function inlining so we can replace function params with bindless handles if needed. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30315>	2024-07-29 00:06:10 +00:00
Timothy Arceri	ef13ff00d1	nir: create validate_tex_src_texture_deref() helper Will be used in a following patch. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30315>	2024-07-29 00:06:10 +00:00
Christian Gmeiner	c33d2db06a	meson: Add missing inc's to idep_nir_headers nir.h includes: - "compiler/glsl_types.h" -> inc_src is needed - "util/u_atomic.h" -> "no_extern_c.h" -> inc_include needed This makes it possible to use rust's bindgen with only nir.h as specified include. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30359>	2024-07-25 05:51:19 +00:00
Marek Olšák	d90080b51b	nir/opt_vectorize_io: optionally don't vectorize IO with different types Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11443 Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:17 +00:00
Marek Olšák	9bfea3183a	nir/opt_varyings: improve convergent input handling to fix data corruption Backward inter-shader code motion can move any code into the previous shader if it only uses convergent inputs. The problem is the final input type can end up being integer or FP64, which is incompatible with the assumption that convergent inputs can always be interpolated. If such a case occurs and the type is integer or FP64, either don't do any code motion, or if the driver exposes the new flag, rewrite convergent loads to use load_input. If the new flag is supported, all convergent loads are rewritten to use load_input, and flat varyings are allowed to be classified as convergent, which means they are packed into interpolated vec4 slots if there are unused components. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Marek Olšák	b2d32ae246	nir: add nir_intrinsic_load_per_primitive_input, split from io_semantics flag Instead of having 1 bit in nir_io_semantics indicating a per-primitive FS input, add a dedicated intrinsic for it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Marek Olšák	ecfefe823e	nir/opt_algebraic: use fmulz for fpow lowering to fix incorrect rendering The original implementation in all radeon drivers had this behavior. Fixes: `9bc1fb4c07` - ac/llvm,radeonsi: lower nir_fpow for aco and llvm Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11464 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30069>	2024-07-23 15:23:27 +00:00
Rhys Perry	3aa29c47b9	nir/instr_set: hash tex sources commutatively I'm not sure if two otherwise equal texture instructions ever have sources in different orders, but they should be considered equal. ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 6.586801 6.718673 6.682875 6.6621411 0.047817119 + 9 6.519098 6.609235 6.552997 6.5605604 0.028879587 Difference at 95.0% confidence -0.101581 +/- 0.0394755 -1.52475% +/- 0.585928% (Student's t, pooled s = 0.0395) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Rhys Perry	b7ceb9d327	nir/instr_set: stop sorting phi sources This is faster. ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 6.724212 6.84511 6.788336 6.7873378 0.034363882 + 9 6.586801 6.718673 6.682875 6.6621411 0.047817119 Difference at 95.0% confidence -0.125197 +/- 0.0416115 -1.84456% +/- 0.609248% (Student's t, pooled s = 0.0416374) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Rhys Perry	8b328443e3	nir/instr_set: combine XXH32 calls ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 7.393408 7.490593 7.434056 7.4338972 0.028150325 + 9 6.724212 6.84511 6.788336 6.7873378 0.034363882 Difference at 95.0% confidence -0.646559 +/- 0.0313916 -8.69745% +/- 0.407925% (Student's t, pooled s = 0.0314111) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Ian Romanick	faee9426ab	nir/algebraic: Optimize some masking of extract_u8 operations I observed this pattern in several Red Dead Redemption 2 shaders. No shader-db changes on any Intel platform. v2: Remove duplicated patterns. Noticed by Georg. fossil-db: All Intel platforms had similar results. (Meteor Lake shown) Totals: Instrs: 151519393 -> 151507192 (-0.01%); split: -0.01%, +0.00% Cycle count: 17208246858 -> 17177437340 (-0.18%); split: -0.25%, +0.07% Spill count: 80830 -> 80759 (-0.09%); split: -0.09%, +0.00% Fill count: 152754 -> 152179 (-0.38%); split: -0.40%, +0.02% Totals from 7531 (1.20% of 630198) affected shaders: Instrs: 12606141 -> 12593940 (-0.10%); split: -0.10%, +0.00% Cycle count: 5466605514 -> 5435795996 (-0.56%); split: -0.79%, +0.22% Spill count: 25251 -> 25180 (-0.28%); split: -0.29%, +0.01% Fill count: 45143 -> 44568 (-1.27%); split: -1.36%, +0.08% Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30158>	2024-07-20 00:19:05 +00:00
Ian Romanick	1c7e35d4e0	nir/algebraic: Optimize some bit operation nonsense observed in some shaders In updates (not posted at the time of this writing) to !29884, a change caused many spill and fill regressions in a shader for OpenGL Tomb Raider. While looking at that shader, I noticed some odd patterns. I initially added these patterns to counteract the regressions caused by the other change, but I had no luck. On Ice Lake... this cuts 99 instructions from the shader. shader-db: All Intel platforms had similar results. (Meteor Lake shown) total instructions in shared programs: 19732341 -> 19732295 (<.01%) instructions in affected programs: 1744 -> 1698 (-2.64%) helped: 1 / HURT: 0 total cycles in shared programs: 916273716 -> 916273068 (<.01%) cycles in affected programs: 14266 -> 13618 (-4.54%) helped: 1 / HURT: 0 fossil-db: All Intel platforms had similar results. (Meteor Lake shown) Totals: Instrs: 151519575 -> 151519393 (-0.00%) Cycle count: 17208402120 -> 17208246858 (-0.00%); split: -0.00%, +0.00% Totals from 159 (0.03% of 630198) affected shaders: Instrs: 51970 -> 51788 (-0.35%) Cycle count: 11474176 -> 11318914 (-1.35%); split: -1.36%, +0.01% Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30158>	2024-07-20 00:19:05 +00:00
Ian Romanick	92befad89f	nir/range_analysis: Fix errors in fmin and fmax tables fmin(x, 0.0) must at least be le_zero, and fmax(x, 0.0) must at least be ge_zero. shader-db: All Intel platforms had similar results. (Meteor Lake shown) total instructions in shared programs: 19733226 -> 19731919 (<.01%) instructions in affected programs: 196415 -> 195108 (-0.67%) helped: 615 / HURT: 0 total cycles in shared programs: 916277979 -> 916265288 (<.01%) cycles in affected programs: 2482535 -> 2469844 (-0.51%) helped: 346 / HURT: 178 LOST: 2 GAINED: 1 fossil-db: All Intel platforms had similar results. (Meteor Lake shown) Totals: Instrs: 151531355 -> 151519575 (-0.01%); split: -0.01%, +0.00% Cycle count: 17209372399 -> 17208402120 (-0.01%); split: -0.01%, +0.01% Max live registers: 32016490 -> 32016514 (+0.00%) Totals from 4307 (0.68% of 630198) affected shaders: Instrs: 4179418 -> 4167638 (-0.28%); split: -0.28%, +0.00% Cycle count: 1063492212 -> 1062521933 (-0.09%); split: -0.24%, +0.15% Max live registers: 359250 -> 359274 (+0.01%) Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30158>	2024-07-20 00:19:05 +00:00
Daniel Stone	e05415a82e	format: Generate endian-independent format aliases Instead of having a hardcoded list of endian-independent format aliases in the header, generate them from the format definitions. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29649>	2024-07-19 13:50:42 +00:00
Georg Lehmann	aa6d363634	nir: constant fold inverse_ballot Foz-DB Navi21: Totals from 210 (0.26% of 79395) affected shaders: Instrs: 79583 -> 78892 (-0.87%) CodeSize: 435636 -> 431680 (-0.91%) VGPRs: 7208 -> 7224 (+0.22%) Latency: 660376 -> 658808 (-0.24%); split: -0.38%, +0.14% InvThroughput: 127489 -> 127544 (+0.04%); split: -0.35%, +0.39% VClause: 1503 -> 1504 (+0.07%) SClause: 3970 -> 3947 (-0.58%) Copies: 4932 -> 4682 (-5.07%); split: -5.17%, +0.10% Branches: 2411 -> 2406 (-0.21%); split: -0.33%, +0.12% PreSGPRs: 6395 -> 6434 (+0.61%); split: -0.31%, +0.92% PreVGPRs: 4100 -> 4103 (+0.07%) VALU: 48484 -> 48145 (-0.70%); split: -0.70%, +0.00% SALU: 12499 -> 12202 (-2.38%); split: -2.41%, +0.03% SMEM: 6448 -> 6420 (-0.43%) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30235>	2024-07-19 07:24:34 +00:00
Georg Lehmann	2d3f536174	aco,nir: add dpp16_shift_amd intrinsic Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24650>	2024-07-17 15:04:38 +00:00
Faith Ekstrand	bbccbd8d50	nir,nak: Add a nir_op_prmt_nv We have this in hardware since forever and it's really useful. May as well add it to NIR so we can use it in various lowerings. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Alyssa Rosenzweig	9f1d1c4fc8	nir/opt_constant_folding: fix array size define, pt 2 In practice these are equal but the old code was semantically wrong: that dimension is "sources" not "components". Use the correct #define. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30214>	2024-07-16 17:38:16 +00:00
Daniel Schürmann	ffef3d1709	nir/opt_sink: ignore loops without backedge Loops without backedge should not be considered loops. For RADV, 2069 (2.61% of 79395) affected shaders. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28783>	2024-07-16 12:29:08 +00:00
Daniel Schürmann	540ee1c81a	nir: implement loop invariant code motion (LICM) pass This simple LICM pass hoists all loop-invariant instructions from the loops' top-level control flow, skipping any nested CF. The hoisted instructions are placed right before the loop. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28783>	2024-07-16 12:29:08 +00:00
Alyssa Rosenzweig	d238d766c6	nir: add lower_fminmax_signed_zero This implements IEEE-754-2019 signed zero semantics for fmin/fmax, as now required by NIR, for hardware that has busted signed zero behaviour for fmin/fmax. Ian expressed interest in this for Intel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	0e46f7b39a	nir/lower_alu: remove dead #define Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	4ab3d95c11	nir/lower_double_ops: handle signed zero with min/max Ensure the following identities hold to match IEEE-754-2019 and upcoming NIR: min(-0, +0) = -0 min(+0, -0) = -0 max(-0, +0) = +0 max(+0, -0) = +0 NVK uses this lowering. In a simple compute shader using fmin64 on an SSBO with signed zero preserve required, testing the effect of this patch, the instruction count goes from 47->52. Obviously I'm not thrilled by that but I also couldn't find any obvious way of mitigating the issue. (Maybe NVIDIA has special hardware support here. By instruction count, lowering all the way to int64 is a loss, though I don't know how to count cycles on NVIDIA.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	6f48fa4ebe	nir: strengthen fmin/fmax definitions with signed zero SPIR-V strengthened the semantics around signed zero, requiring fmin(-0, +0) = -0. Since nir_op_fmin is commutative, we must also require fmin(+0, -0) = -0 to match, although it's unclear if SPIR-V requires that. We must strengthen NIR's definitions accordingly. This strengthening is additionally motivated by the existing nir_opt_algebraic rule like: (('fmin', a, ('fneg', a)), ('fneg', ('fabs', a))), With the strengthened new definition, this transform is clearly exact. With the weaker definition, the transform could change the sign of zero based on implementation-defined behaviours which ... while, not exactly unsound, is undesirable semantically. ... This is probably technically a bug fix, but I'm not convinced it's worth its weight in backporting. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	7fc5a2296b	nir: use MIN2/MAX2 opcodes for imin/umax folding This is more idiomatic and already #include'd. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	e8db5759b8	nir/search: use ALU float control helpers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	d4c6fbc4a7	nir: add nir_alu_instr float controls queries These are helpful now that float_controls2 exists, these are common patterns worth factoring out into helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
M Henning	e506955056	nir: Handle texop_*_nv in nir_tex_instr_is_query Fixes: `aa1f00cf` ("nir/gather_info: handle uses_fbfetch_output for texture operations") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11505 Tested-by: Thomas H.P. Andersen <phomes@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30166>	2024-07-13 15:36:29 +00:00
Marek Olšák	1b2cd628b8	nir: rename ordered_xfb_counter_add_gfx12_amd -> ordered_add_loop_gfx12_amd because it can also be used by compute. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30063>	2024-07-13 01:32:48 +00:00
Samuel Pitoiset	aa1f00cf5c	nir/gather_info: handle uses_fbfetch_output for texture operations Like nir_texop_txf_ms. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30109>	2024-07-12 09:33:51 +00:00
Samuel Pitoiset	0d0b949cd7	nir/gather_info: handle uses_fbfetch_output for sparse image loads Looks like this was missing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30109>	2024-07-12 09:33:51 +00:00
Christian Gmeiner	87786a7a7e	nak: Move imad late optimization to nir It is more or less just a code move, but I touched is_only_used_by_iadd(..) to match the style of the other functions in that file. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30099>	2024-07-12 05:54:46 +00:00
Rhys Perry	c4706c6177	nir/linking_helpers: remove nested IF Just add a && to the condition. This is more readable to me. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	525aacd9d7	nir/linking_helpers: remove varying accesses in nir_remove_unused_io_vars interp_deref_at_sample of a nir_var_shader_temp is nonsensical and might be ignored by later passes, instead of removed. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7818 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10588 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	bcd98e091a	nir/linking_helpers: remove special case for read mesh outputs Only VK_NV_mesh_shader allows this kind of access, and no driver advertises that extension anymore. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Connor Abbott	45a57fa735	ir3: Plumb through descriptor prefetch intrinsics Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	ccf88d940b	nir/instr_set: Don't remove matching instruction We currently assume that the instruction is already inserted and we are optimizing it away, but in the use case I have where we are hoisting instructions into a preamble and deduplicating as we go along, that isn't the case. Move this responsibility onto the caller, which also makes it a bit clearer what's going on and turns this into something more similar to an actual set. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	cda7d9c971	nir/instr_set: Return the matching instruction This allows use cases where we copy over expression trees and deduplicate as we go along. We can use the matching instruction to build up the rest of the expression tree. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Alyssa Rosenzweig	0ce2e6594d	nir/opt_constant_folding: fix array size define In practice these are equal but the old code was semantically wrong: that dimension is "sources" not "components". Use the correct #define. This came up when reviewing https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29994 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30066>	2024-07-08 14:34:29 +00:00
Konstantin Seurer	d9e41e8a8c	nir: Stop using "capture : true" for nir_opt_algebraic "capture : true" is suboptimal and prevents the script from writing multiple files in one go. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30041>	2024-07-06 15:51:06 +00:00


5480 commits