fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 07:38:14 +02:00

Author	SHA1	Message	Date
Daniel Stone	e05415a82e	format: Generate endian-independent format aliases Instead of having a hardcoded list of endian-independent format aliases in the header, generate them from the format definitions. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29649>	2024-07-19 13:50:42 +00:00
Georg Lehmann	aa6d363634	nir: constant fold inverse_ballot Foz-DB Navi21: Totals from 210 (0.26% of 79395) affected shaders: Instrs: 79583 -> 78892 (-0.87%) CodeSize: 435636 -> 431680 (-0.91%) VGPRs: 7208 -> 7224 (+0.22%) Latency: 660376 -> 658808 (-0.24%); split: -0.38%, +0.14% InvThroughput: 127489 -> 127544 (+0.04%); split: -0.35%, +0.39% VClause: 1503 -> 1504 (+0.07%) SClause: 3970 -> 3947 (-0.58%) Copies: 4932 -> 4682 (-5.07%); split: -5.17%, +0.10% Branches: 2411 -> 2406 (-0.21%); split: -0.33%, +0.12% PreSGPRs: 6395 -> 6434 (+0.61%); split: -0.31%, +0.92% PreVGPRs: 4100 -> 4103 (+0.07%) VALU: 48484 -> 48145 (-0.70%); split: -0.70%, +0.00% SALU: 12499 -> 12202 (-2.38%); split: -2.41%, +0.03% SMEM: 6448 -> 6420 (-0.43%) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30235>	2024-07-19 07:24:34 +00:00
Georg Lehmann	2d3f536174	aco,nir: add dpp16_shift_amd intrinsic Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24650>	2024-07-17 15:04:38 +00:00
Faith Ekstrand	bbccbd8d50	nir,nak: Add a nir_op_prmt_nv We have this in hardware since forever and it's really useful. May as well add it to NIR so we can use it in various lowerings. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Alyssa Rosenzweig	9f1d1c4fc8	nir/opt_constant_folding: fix array size define, pt 2 In practice these are equal but the old code was semantically wrong: that dimension is "sources" not "components". Use the correct #define. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30214>	2024-07-16 17:38:16 +00:00
Daniel Schürmann	ffef3d1709	nir/opt_sink: ignore loops without backedge Loops without backedge should not be considered loops. For RADV, 2069 (2.61% of 79395) affected shaders. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28783>	2024-07-16 12:29:08 +00:00
Daniel Schürmann	540ee1c81a	nir: implement loop invariant code motion (LICM) pass This simple LICM pass hoists all loop-invariant instructions from the loops' top-level control flow, skipping any nested CF. The hoisted instructions are placed right before the loop. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28783>	2024-07-16 12:29:08 +00:00
Alyssa Rosenzweig	d238d766c6	nir: add lower_fminmax_signed_zero This implements IEEE-754-2019 signed zero semantics for fmin/fmax, as now required by NIR, for hardware that has busted signed zero behaviour for fmin/fmax. Ian expressed interest in this for Intel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	0e46f7b39a	nir/lower_alu: remove dead #define Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	4ab3d95c11	nir/lower_double_ops: handle signed zero with min/max Ensure the following identities hold to match IEEE-754-2019 and upcoming NIR: min(-0, +0) = -0 min(+0, -0) = -0 max(-0, +0) = +0 max(+0, -0) = +0 NVK uses this lowering. In a simple compute shader using fmin64 on an SSBO with signed zero preserve required, testing the effect of this patch, the instruction count goes from 47->52. Obviously I'm not thrilled by that but I also couldn't find any obvious way of mitigating the issue. (Maybe NVIDIA has special hardware support here. By instruction count, lowering all the way to int64 is a loss, though I don't know how to count cycles on NVIDIA.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	6f48fa4ebe	nir: strengthen fmin/fmax definitions with signed zero SPIR-V strengthened the semantics around signed zero, requiring fmin(-0, +0) = -0. Since nir_op_fmin is commutative, we must also require fmin(+0, -0) = -0 to match, although it's unclear if SPIR-V requires that. We must strengthen NIR's definitions accordingly. This strengthening is additionally motivated by the existing nir_opt_algebraic rule like: (('fmin', a, ('fneg', a)), ('fneg', ('fabs', a))), With the strengthened new definition, this transform is clearly exact. With the weaker definition, the transform could change the sign of zero based on implementation-defined behaviours which ... while, not exactly unsound, is undesireable semantically. ... This is probably technically a bug fix, but I'm not convinced it's worth it's weight in backporting. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	7fc5a2296b	nir: use MIN2/MAX2 opcodes for imin/umax folding This is more idiomatic and already #include'd. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	e8db5759b8	nir/search: use ALU float control helpers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	d4c6fbc4a7	nir: add nir_alu_instr float controls queries These are helpful now that float_controls2 exists, these are common patterns worth factoring out into helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
M Henning	e506955056	nir: Handle texop_*_nv in nir_tex_instr_is_query Fixes: `aa1f00cf` ("nir/gather_info: handle uses_fbfetch_output for texture operations") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11505 Tested-by: Thomas H.P. Andersen <phomes@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30166>	2024-07-13 15:36:29 +00:00
Marek Olšák	1b2cd628b8	nir: rename ordered_xfb_counter_add_gfx12_amd -> ordered_add_loop_gfx12_amd because it can also be used by compute. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30063>	2024-07-13 01:32:48 +00:00
Samuel Pitoiset	aa1f00cf5c	nir/gather_info: handle uses_fbfetch_output for texture operations Like nir_texop_txf_ms. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30109>	2024-07-12 09:33:51 +00:00
Samuel Pitoiset	0d0b949cd7	nir/gather_info: handle uses_fbfetch_output for sparse image loads Looks like this was missing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30109>	2024-07-12 09:33:51 +00:00
Christian Gmeiner	87786a7a7e	nak: Move imad late optimization to nir It is more or less just a code move, but I touched is_only_used_by_iadd(..) to match the style of the other functions in that file. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30099>	2024-07-12 05:54:46 +00:00
Rhys Perry	c4706c6177	nir/linking_helpers: remove nested IF Just add a && to the condition. This is more readable to me. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	525aacd9d7	nir/linking_helpers: remove varying accesses in nir_remove_unused_io_vars interp_deref_at_sample of a nir_var_shader_temp is nonsensical and might be ignored by later passes, instead of removed. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7818 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10588 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	bcd98e091a	nir/linking_helpers: remove special case for read mesh outputs Only VK_NV_mesh_shader allows this kind of access, and no driver advertises that extension anymore. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Connor Abbott	45a57fa735	ir3: Plumb through descriptor prefetch intrinsics Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	ccf88d940b	nir/instr_set: Don't remove matching instruction We currently assume that the instruction is already inserted and we are optimizing it away, but in the use case I have where we are hoisting instructions into a preamble and deduplicating as we go along, that isn't the case. Move this responsibility onto the caller, which also makes it a bit clearer what's going on and turns this into something more similar to an actual set. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	cda7d9c971	nir/instr_set: Return the matching instruction This allows use cases where we copy over expression trees and deduplicate as we go along. We can use the matching instruction to build up the rest of the expression tree. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Alyssa Rosenzweig	0ce2e6594d	nir/opt_constant_folding: fix array size define In practice these are equal but the old code was semantically wrong: that dimension is "sources" not "components". Use the correct #define. This came up when reviewing https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29994 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30066>	2024-07-08 14:34:29 +00:00
Konstantin Seurer	d9e41e8a8c	nir: Stop using "capture : true" for nir_opt_algebraic "calture : true" is suboptimal and and prevents the script from writing multiple files in one go. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30041>	2024-07-06 15:51:06 +00:00
Qiang Yu	a071929f8d	nir: consider more deref types when fixup deref Fix ANV and virpipe CI test fail when nir_fixup_deref_types is used in nir_vectorize_tess_levels by later commits. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
Qiang Yu	f9ed3158b4	nir: nir_vectorize_tess_levels support indirect access Replace the implementation with nir_lower_array_deref_of_vec. This will be used by compact_array=false drivers to lower indirect tess levels array access to direct vector access too. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
Qiang Yu	3151f5ec47	nir: add filter parameter to nir_lower_array_deref_of_vec To be used by latter commits to limit the lowering to specific variables. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29799>	2024-07-03 02:06:56 +00:00
David Heidelberg	68215332a8	build: pass licensing information in SPDX form Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Dylan Baker <dylan.c.baker@intel.com> Acked-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29972>	2024-06-29 12:42:49 -07:00
Jesse Natalie	c2b53d7bd0	nir: Remove assert-only variable by inlining its single use Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29970>	2024-06-28 20:44:36 +00:00
Alyssa Rosenzweig	30db807f79	nir/algebraic: explicitly suffix constants Make our intentions super duper clear. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29952>	2024-06-28 19:53:36 +00:00
Alyssa Rosenzweig	270446ee21	nir: fix miscompiles with rules with INT32_MIN `812b3415` added rules for upcasts with comparisons with a variety of types. The float & unsigned rules should be ok, but the signed integer rules are unsound as currently implemented. This can cause end-to-end miscompiles. I originally hit this issue while debugging a large real world OpenCL kernel. I found the bug symptoms changed when disabling loop unrolling, which tipped me off to a compiler bug. I've reduced it to a minimal test case. Imagine my surprise when I find out the NIR my backend ingested was already constant folded to be wrong. In the minimal test case, during optimization we have NIR: 32 %6 = .... 64 %9 = i2i64 %6 64 %44 = load_const (0x0000000000000001) 1 %45 = ilt %9, %44 (0x1) This is a simple check (int64_t)%6 < 1. nir_opt_algebraic turns this into: 32 %6 = ... 64 %9 = i2i64 %6 64 %44 = load_const (0x0000000000000001) 64 %55 = load_const (0x0000000080000000 = 2147483648) 1 %56 = ilt %55 (0x80000000), %44 (0x1) 64 %57 = load_const (0x000000007fffffff = 2147483647) 1 %58 = ilt %57 (0x7fffffff), %44 (0x1) 32 %59 = i2i32 %44 (0x1) 1 %60 = ilt %6, %59 1 %61 = ior %58, %60 1 %62 = iand %56, %61 This pile of math constant-folds to an unconditional "false"! The problem is %56. At first glance, INT32_MIN < 1 is true so %56 should be true. Indeed, it should. But here's the kicker: both constants are 64-bit here, so the ilt operation is a 64-bit comparison -- that left-hand side is INT32_MIN zero-extended to 64-bit for the signed comparison at 64-bit. So in fact, it evaluates to false, causing the whole expression to go false. If we're going to do a 64-bit comparison for %56, then we need to sign-extend the bound. So we'll just adjust the Python and be on our way, right? Unfortunately the issue is deeper. According to the comment in the generated nir_opt_algebraic.c file, the guilty algebraic rule is: ('ilt', ('i2i64', 'a@32'), '#b') => ('iand', ('ilt', -2147483648, 'b'), ('ior', ('ilt', 2147483647, 'b'), ('ilt', 'a', ('i2i32', 'b')))) From a Python perspective? That rule is correct. -2147483648 < 1 is a true statement. Adjusting the Python rule is not the appropriate solution here, since the issue is more fundamental and might affect other rules. The real problem is the translation of that Python replacement tree into C, incorrectly zero-extending -2147483648 into 0x0000000080000000 instead of sign-extending to 0xffffffff80000000. Crawling down the rabbit hole of the generated algebraic file, we see the constant encoded as: { .constant = { { nir_search_value_constant, 64 }, nir_type_int, { -0x80000000 /* -2147483648 */ }, } }, NIR correctly translates the negative constant to a C level negate operation of its absolute value. This maps to the correct sign-extension... ...for all constants except for INT_MIN. Because that constant lacks a ULL suffix, it is a 32-bit integer. And for this integer (only), negating it hits signed integer overflow (UB!) and then we end up with an effective zero-extension when going to 64-bit. This patch fixes the end-to-end miscompile. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Closes: #11402 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29952>	2024-06-28 19:53:36 +00:00
Georg Lehmann	3e86d2452f	nir/opt_algebraic: add various unordered/ordered patterns from aco Foz-DB Navi21: Totals from 6747 (8.50% of 79395) affected shaders: MaxWaves: 134646 -> 134642 (-0.00%) Instrs: 7830299 -> 7828851 (-0.02%); split: -0.03%, +0.01% CodeSize: 43045532 -> 43010260 (-0.08%); split: -0.09%, +0.00% VGPRs: 378960 -> 378968 (+0.00%) SpillSGPRs: 1209 -> 1208 (-0.08%) Latency: 74667977 -> 74670405 (+0.00%); split: -0.02%, +0.02% InvThroughput: 20124981 -> 20124768 (-0.00%); split: -0.02%, +0.02% VClause: 162870 -> 162868 (-0.00%); split: -0.00%, +0.00% SClause: 277280 -> 277315 (+0.01%); split: -0.00%, +0.02% Copies: 528627 -> 528667 (+0.01%); split: -0.00%, +0.01% PreSGPRs: 319526 -> 319508 (-0.01%) PreVGPRs: 334264 -> 334265 (+0.00%); split: -0.00%, +0.00% VALU: 5485412 -> `5485408` (-0.00%); split: -0.02%, +0.02% SALU: 743882 -> 742301 (-0.21%); split: -0.21%, +0.00% Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Georg Lehmann	434dfb51ca	nir/opt_algebraic: optimize cmp(fneg(a), #b) and feq with fabs Foz-DB Navi21: Totals from 2483 (3.13% of 79395) affected shaders: Instrs: 4067533 -> 4067756 (+0.01%); split: -0.00%, +0.01% CodeSize: 22525156 -> 22499904 (-0.11%); split: -0.12%, +0.01% Latency: 51967223 -> 51963654 (-0.01%); split: -0.01%, +0.00% InvThroughput: 16685020 -> 16683045 (-0.01%); split: -0.01%, +0.00% SClause: 131890 -> 131907 (+0.01%) Copies: 402557 -> 402510 (-0.01%); split: -0.01%, +0.00% Branches: 146962 -> 146958 (-0.00%) PreSGPRs: 118404 -> 118401 (-0.00%) PreVGPRs: 123791 -> 123787 (-0.00%) VALU: 2709846 -> 2710174 (+0.01%); split: -0.00%, +0.01% SALU: 565883 -> 565786 (-0.02%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Georg Lehmann	98cc57bccb	nir/optimize cmp(a, -0.0) +0.0 can use an inline constant for AMD hardware, -0.0 needs a literal. Foz-DB Navi21: Totals from 1014 (1.28% of 79395) affected shaders: Instrs: 3037490 -> 3036849 (-0.02%); split: -0.02%, +0.00% CodeSize: 17060228 -> 17051276 (-0.05%); split: -0.05%, +0.00% Latency: 45916788 -> 45916600 (-0.00%); split: -0.00%, +0.00% InvThroughput: 12982201 -> 12982187 (-0.00%); split: -0.00%, +0.00% VClause: 79475 -> 79478 (+0.00%) SClause: 119935 -> 119934 (-0.00%); split: -0.00%, +0.00% Copies: 301641 -> 300964 (-0.22%); split: -0.23%, +0.00% PreSGPRs: 59155 -> 59144 (-0.02%) VALU: 2032016 -> 2032034 (+0.00%) SALU: 386424 -> 385729 (-0.18%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Georg Lehmann	8e6bf596cb	nir/opt_algebraic: look through fabs/fneg when matching fmulz/ffmaz Prevents regressions when removing input modifiers from a == 0.0. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:30 +00:00
Georg Lehmann	99372c1ed7	nir: add ford, funord, fneo, fequ, fltu, fgeu Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29467>	2024-06-27 08:12:29 +00:00
Alyssa Rosenzweig	dd85b50d18	treewide: use nir_break_if Via Coccinelle patch and some manual hunk editing: @@ expression b, E; @@ -nir_push_if(b, E); -{ -nir_jump(b, nir_jump_break); -} -nir_pop_if(b, NULL); +nir_break_if(b, E); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29877>	2024-06-26 19:07:35 +00:00
Alyssa Rosenzweig	d57934fdec	nir: add nir_break_if helper I see people open-coding this all over the tree and it makes nir_builder loops really annoying. Make them slightly less annoying with a helper. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29877>	2024-06-26 19:07:35 +00:00
Karol Herbst	3482ea599b	nir/schedule: add write dep also for shared_atomic Otherwise it might change the order between a load_shared and a shared_atomic on the same location. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29918>	2024-06-26 18:20:14 +00:00
Connor Abbott	ec37e65a2d	ir3: Introduce elect_any_ir3 For preambles, we don't actually care which invocation we get, so we don't have to enable helper invocations when the preamble uses "getone." Introduce a new intrinsic with the right semantics and plumb it through. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29914>	2024-06-26 17:40:15 +00:00
Karol Herbst	d5da434851	nir/opt_sink: add load_kernel_input Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25362>	2024-06-26 10:04:02 +00:00
Karol Herbst	535e617ccd	nir/lower_alu: support 8 and 16 bit bit_count Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25362>	2024-06-26 10:04:02 +00:00
Qiang Yu	93f790b04a	nir: fix clip cull distance lowering metadata preserve indirect store lowering will use if/else which changes the control flow of the shader. Fixes: `110887de2b` ("nir: Add a new pass to lower array dereferences on vectors") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29894>	2024-06-26 01:22:12 +00:00
Qiang Yu	09b4ba27a3	nir: fix lower array to vec metadata preserve indirect store lowering will change control flow, so we should not preserve control flow metadate when it's present. Fixes: `35b8f6f40b` ("nir: Add a new pass to lower array dereferences on vectors") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29894>	2024-06-26 01:22:12 +00:00
Ian Romanick	6b678d32cb	nir: dpas_intel second source can have different number of components The number of components for the second source is -1 to avoid validation of its value. Some supported configurations will have the component count of that matrix different than the others. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28834>	2024-06-25 14:17:47 -07:00
Juan A. Suarez Romero	60e7cb7654	nir: use unsigned types when performing bitshifting Ensure unsigned integers are used instead of signed ones when performing left bit shifts. This has been detected by the Undefined Behaviour Sanitizer (UBSan). Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29772>	2024-06-21 21:07:05 +00:00
Juan A. Suarez Romero	e43cc49806	nir: fix overflow when negating maxint in constant expressions Undefined Behaviour Sanitizer (UBSan) detected the following when running testing `dEQP-VK.graphicsfuzz.cov-fold-negate-min-int-value`: `negation of -2147483648 cannot be represented in type 'int'; cast to an unsigned type to negate this value to itself` SPIR-V spec states that OpSNegate(0x80000000) has to return 0x80000000; in our case, -2147483648 should be -2147483648. While this is not causing any issue because compilers seem to be behaving like that, it is still undefined behaviour, so it expects to be this handled explicitly, which is the purpose of this commit. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29772>	2024-06-21 21:07:05 +00:00

1 2 3 4 5 ...

5457 commits