fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 09:18:10 +02:00

Author	SHA1	Message	Date
Christian Gmeiner	c33d2db06a	meson: Add missing inc's to idep_nir_headers nir.h includes: - "compiler/glsl_types.h" -> inc_src is needed - "util/u_atomic.h" -> "no_extern_c.h" -> inc_include needed This makes it possible to use rust's bindgen with only nir.h as specified include. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30359>	2024-07-25 05:51:19 +00:00
Marek Olšák	d90080b51b	nir/opt_vectorize_io: optionally don't vectorize IO with different types Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11443 Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:17 +00:00
Marek Olšák	9bfea3183a	nir/opt_varyings: improve convergent input handling to fix data corruption Backward inter-shader code motion can move any code into the previous shader if it only uses convergent inputs. The problem is the final input type can end up being integer or FP64, which is incompatible with the assumption that convergent inputs can always be interpolated. If such a case occurs and the type is integer or FP64, either don't do any code motion, or if the driver exposes the new flag, rewrite convergent loads to use load_input. If the new flag is supported, all convergent loads are rewritten to use load_input, and flat varyings are allowed to be classified as convergent, which means they are packed into interpolated vec4 slots if there are unused components. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Marek Olšák	b2d32ae246	nir: add nir_intrinsic_load_per_primitive_input, split from io_semantics flag Instead of having 1 bit in nir_io_semantics indicating a per-primitive FS input, add a dedicated intrinsic for it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Marek Olšák	ecfefe823e	nir/opt_algebraic: use fmulz for fpow lowering to fix incorrect rendering The original implementation in all radeon drivers had this behavior. Fixes: `9bc1fb4c07` - ac/llvm,radeonsi: lower nir_fpow for aco and llvm Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11464 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30069>	2024-07-23 15:23:27 +00:00
Dylan Baker	e5b53d9408	compilers/clc: Add missing break statements. fixes: `c0cf7f578a` Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30301>	2024-07-22 21:55:49 +00:00
Karol Herbst	bad67ee77c	spirv: handle function parameters passed by value Cc: mesa-stable Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29896>	2024-07-22 21:16:58 +00:00
Karol Herbst	9b55dcca54	spirv: initial parsing of function parameter decorations It doesn't do anything substantial yet, but it ignores enough so internal shaders won't generate warnings. I've also added ByVal parsing, because I need this one to actually fix a correctness issue in a later patch. Cc: mesa-stable Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29896>	2024-07-22 21:16:58 +00:00
Karol Herbst	90db6c729d	spirv: generate info for FunctionParameterAttribute Cc: mesa-stable Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29896>	2024-07-22 21:16:58 +00:00
Rhys Perry	3aa29c47b9	nir/instr_set: hash tex sources commutatively I'm not sure if two otherwise equal texture instructions ever have sources in different orders, but they should be considered equal. ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 6.586801 6.718673 6.682875 6.6621411 0.047817119 + 9 6.519098 6.609235 6.552997 6.5605604 0.028879587 Difference at 95.0% confidence -0.101581 +/- 0.0394755 -1.52475% +/- 0.585928% (Student's t, pooled s = 0.0395) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Rhys Perry	b7ceb9d327	nir/instr_set: stop sorting phi sources This is faster. ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 6.724212 6.84511 6.788336 6.7873378 0.034363882 + 9 6.586801 6.718673 6.682875 6.6621411 0.047817119 Difference at 95.0% confidence -0.125197 +/- 0.0416115 -1.84456% +/- 0.609248% (Student's t, pooled s = 0.0416374) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Rhys Perry	8b328443e3	nir/instr_set: combine XXH32 calls ministat of nir_opt_cse: N Min Max Median Avg Stddev x 9 7.393408 7.490593 7.434056 7.4338972 0.028150325 + 9 6.724212 6.84511 6.788336 6.7873378 0.034363882 Difference at 95.0% confidence -0.646559 +/- 0.0313916 -8.69745% +/- 0.407925% (Student's t, pooled s = 0.0314111) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30145>	2024-07-22 11:04:01 +00:00
Ian Romanick	faee9426ab	nir/algebraic: Optimize some masking of extract_u8 operations I observed this pattern in several Red Dead Redemption 2 shaders. No shader-db changes on any Intel platform. v2: Remove duplicated patterns. Noticed by Georg. fossil-db: All Intel platforms had similar results. (Meteor Lake shown) Totals: Instrs: 151519393 -> 151507192 (-0.01%); split: -0.01%, +0.00% Cycle count: 17208246858 -> 17177437340 (-0.18%); split: -0.25%, +0.07% Spill count: 80830 -> 80759 (-0.09%); split: -0.09%, +0.00% Fill count: 152754 -> 152179 (-0.38%); split: -0.40%, +0.02% Totals from 7531 (1.20% of 630198) affected shaders: Instrs: 12606141 -> 12593940 (-0.10%); split: -0.10%, +0.00% Cycle count: 5466605514 -> 5435795996 (-0.56%); split: -0.79%, +0.22% Spill count: 25251 -> 25180 (-0.28%); split: -0.29%, +0.01% Fill count: 45143 -> 44568 (-1.27%); split: -1.36%, +0.08% Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30158>	2024-07-20 00:19:05 +00:00
Ian Romanick	1c7e35d4e0	nir/algebraic: Optimize some bit operation nonsense observed in some shaders In updates (not post at the time of this writing) to !29884, a change caused many spill and fill regressions shader for OpenGL Tomb Raider. While looking at that shader, I noticed some odd patterns. I initially added these patterns to counteract the regressions caused by the other change, but I had no luck. On Ice Lake... this cuts 99 instructions from the shader. shader-db: All Intel platforms had simliar results. (Meteor Lake shown) total instructions in shared programs: 19732341 -> 19732295 (<.01%) instructions in affected programs: 1744 -> 1698 (-2.64%) helped: 1 / HURT: 0 total cycles in shared programs: 916273716 -> 916273068 (<.01%) cycles in affected programs: 14266 -> 13618 (-4.54%) helped: 1 / HURT: 0 fossil-db: All Intel platforms had similar results. (Meteor Lake shown) Totals: Instrs: 151519575 -> 151519393 (-0.00%) Cycle count: 17208402120 -> 17208246858 (-0.00%); split: -0.00%, +0.00% Totals from 159 (0.03% of 630198) affected shaders: Instrs: 51970 -> 51788 (-0.35%) Cycle count: 11474176 -> 11318914 (-1.35%); split: -1.36%, +0.01% Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30158>	2024-07-20 00:19:05 +00:00
Ian Romanick	92befad89f	nir/range_analysis: Fix errors in fmin and fmax tables fmin(x, 0.0) must at least be le_zero, and fmax(x, 0.0) be at least be ge_zero. shader-db: All Intel platforms had similar results. (Meteor Lake shown) total instructions in shared programs: 19733226 -> 19731919 (<.01%) instructions in affected programs: 196415 -> 195108 (-0.67%) helped: 615 / HURT: 0 total cycles in shared programs: 916277979 -> 916265288 (<.01%) cycles in affected programs: 2482535 -> 2469844 (-0.51%) helped: 346 / HURT: 178 LOST: 2 GAINED: 1 fossil-db: All Intel platforms had similar results. (Meteor Lake shown) Totals: Instrs: 151531355 -> 151519575 (-0.01%); split: -0.01%, +0.00% Cycle count: 17209372399 -> 17208402120 (-0.01%); split: -0.01%, +0.01% Max live registers: 32016490 -> 32016514 (+0.00%) Totals from 4307 (0.68% of 630198) affected shaders: Instrs: 4179418 -> 4167638 (-0.28%); split: -0.28%, +0.00% Cycle count: 1063492212 -> 1062521933 (-0.09%); split: -0.24%, +0.15% Max live registers: 359250 -> 359274 (+0.01%) Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30158>	2024-07-20 00:19:05 +00:00
Daniel Stone	e05415a82e	format: Generate endian-independent format aliases Instead of having a hardcoded list of endian-independent format aliases in the header, generate them from the format definitions. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29649>	2024-07-19 13:50:42 +00:00
Timothy Arceri	355a1f2058	glsl: remove out of date comment GLSL 4.40 changed the relevant language in Section 8.13.2 (Interpolation Functions) to: "Component selection operators (e.g., .xy) may be used when specifying interpolant." Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30239>	2024-07-19 10:59:11 +00:00
Georg Lehmann	aa6d363634	nir: constant fold inverse_ballot Foz-DB Navi21: Totals from 210 (0.26% of 79395) affected shaders: Instrs: 79583 -> 78892 (-0.87%) CodeSize: 435636 -> 431680 (-0.91%) VGPRs: 7208 -> 7224 (+0.22%) Latency: 660376 -> 658808 (-0.24%); split: -0.38%, +0.14% InvThroughput: 127489 -> 127544 (+0.04%); split: -0.35%, +0.39% VClause: 1503 -> 1504 (+0.07%) SClause: 3970 -> 3947 (-0.58%) Copies: 4932 -> 4682 (-5.07%); split: -5.17%, +0.10% Branches: 2411 -> 2406 (-0.21%); split: -0.33%, +0.12% PreSGPRs: 6395 -> 6434 (+0.61%); split: -0.31%, +0.92% PreVGPRs: 4100 -> 4103 (+0.07%) VALU: 48484 -> 48145 (-0.70%); split: -0.70%, +0.00% SALU: 12499 -> 12202 (-2.38%); split: -2.41%, +0.03% SMEM: 6448 -> 6420 (-0.43%) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30235>	2024-07-19 07:24:34 +00:00
Georg Lehmann	2d3f536174	aco,nir: add dpp16_shift_amd intrinsic Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24650>	2024-07-17 15:04:38 +00:00
Faith Ekstrand	bbccbd8d50	nir,nak: Add a nir_op_prmt_nv We have this in hardware since forever and it's really useful. May as well add it to NIR so we can use it in various lowerings. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Timothy Arceri	c9f26a9995	glsl: fix cross validate globals We want to skip the validation of compiler added global temps. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30199>	2024-07-17 05:19:03 +00:00
Timothy Arceri	dde1a69929	glsl: set how_declared to hidden for compiler temps This will be useful for the nir linker that otherwise cant detect these compiler temps created in glsl ir. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30199>	2024-07-17 05:19:03 +00:00
Caio Oliveira	d202f24698	spirv: Don't warn about FPFastMathMode if not OpenCL This decoration can now be used in Vulkan with VK_KHR_shader_float_controls2. Acked-by: Iván Briano <ivan.briano@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30191>	2024-07-16 19:14:21 +00:00
Alyssa Rosenzweig	9f1d1c4fc8	nir/opt_constant_folding: fix array size define, pt 2 In practice these are equal but the old code was semantically wrong: that dimension is "sources" not "components". Use the correct #define. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Suggested-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30214>	2024-07-16 17:38:16 +00:00
Daniel Schürmann	ffef3d1709	nir/opt_sink: ignore loops without backedge Loops without backedge should not be considered loops. For RADV, 2069 (2.61% of 79395) affected shaders. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28783>	2024-07-16 12:29:08 +00:00
Daniel Schürmann	540ee1c81a	nir: implement loop invariant code motion (LICM) pass This simple LICM pass hoists all loop-invariant instructions from the loops' top-level control flow, skipping any nested CF. The hoisted instructions are placed right before the loop. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28783>	2024-07-16 12:29:08 +00:00
Alyssa Rosenzweig	d238d766c6	nir: add lower_fminmax_signed_zero This implements IEEE-754-2019 signed zero semantics for fmin/fmax, as now required by NIR, for hardware that has busted signed zero behaviour for fmin/fmax. Ian expressed interest in this for Intel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	0e46f7b39a	nir/lower_alu: remove dead #define Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	4ab3d95c11	nir/lower_double_ops: handle signed zero with min/max Ensure the following identities hold to match IEEE-754-2019 and upcoming NIR: min(-0, +0) = -0 min(+0, -0) = -0 max(-0, +0) = +0 max(+0, -0) = +0 NVK uses this lowering. In a simple compute shader using fmin64 on an SSBO with signed zero preserve required, testing the effect of this patch, the instruction count goes from 47->52. Obviously I'm not thrilled by that but I also couldn't find any obvious way of mitigating the issue. (Maybe NVIDIA has special hardware support here. By instruction count, lowering all the way to int64 is a loss, though I don't know how to count cycles on NVIDIA.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	26de3d5366	glsl/float64: handle signed zero with min/max Ensure the following identities hold to match IEEE-754-2019 and upcoming NIR: min(-0, +0) = -0 min(+0, -0) = -0 max(-0, +0) = +0 max(+0, -0) = +0 To implement, we specialize a version of flt64_nonnan. The regular flt64 has extra logic to handle signed zero, so this version is actually simpler. So in addition to the bug fix, this is an optimization. Compute shaders from KHR-GL46.gpu_shader_fp64.builtin.max_dvec4 before and after: before: 136 inst, 122 alu, 122 fscib, 4 ic, 1006 bytes, 39 regs, 28 uniforms after: 104 inst, 90 alu, 90 fscib, 4 ic, 766 bytes, 39 regs, 28 uniforms I will happy take a 24% reduction in instruction count as the cost of standards conformance ^_^ Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	6f48fa4ebe	nir: strengthen fmin/fmax definitions with signed zero SPIR-V strengthened the semantics around signed zero, requiring fmin(-0, +0) = -0. Since nir_op_fmin is commutative, we must also require fmin(+0, -0) = -0 to match, although it's unclear if SPIR-V requires that. We must strengthen NIR's definitions accordingly. This strengthening is additionally motivated by the existing nir_opt_algebraic rule like: (('fmin', a, ('fneg', a)), ('fneg', ('fabs', a))), With the strengthened new definition, this transform is clearly exact. With the weaker definition, the transform could change the sign of zero based on implementation-defined behaviours which ... while, not exactly unsound, is undesireable semantically. ... This is probably technically a bug fix, but I'm not convinced it's worth it's weight in backporting. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	7fc5a2296b	nir: use MIN2/MAX2 opcodes for imin/umax folding This is more idiomatic and already #include'd. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	e8db5759b8	nir/search: use ALU float control helpers Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Alyssa Rosenzweig	d4c6fbc4a7	nir: add nir_alu_instr float controls queries These are helpful now that float_controls2 exists, these are common patterns worth factoring out into helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:29:00 +00:00
Konstantin Seurer	43dadbd2fa	nir: Add FLOAT_CONTROLS_.*_PRESERVE A logical or of all bit sizes. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30075>	2024-07-15 19:28:59 +00:00
M Henning	e506955056	nir: Handle texop_*_nv in nir_tex_instr_is_query Fixes: `aa1f00cf` ("nir/gather_info: handle uses_fbfetch_output for texture operations") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11505 Tested-by: Thomas H.P. Andersen <phomes@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30166>	2024-07-13 15:36:29 +00:00
Marek Olšák	1b2cd628b8	nir: rename ordered_xfb_counter_add_gfx12_amd -> ordered_add_loop_gfx12_amd because it can also be used by compute. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30063>	2024-07-13 01:32:48 +00:00
Samuel Pitoiset	aa1f00cf5c	nir/gather_info: handle uses_fbfetch_output for texture operations Like nir_texop_txf_ms. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30109>	2024-07-12 09:33:51 +00:00
Samuel Pitoiset	0d0b949cd7	nir/gather_info: handle uses_fbfetch_output for sparse image loads Looks like this was missing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30109>	2024-07-12 09:33:51 +00:00
Christian Gmeiner	87786a7a7e	nak: Move imad late optimization to nir It is more or less just a code move, but I touched is_only_used_by_iadd(..) to match the style of the other functions in that file. Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30099>	2024-07-12 05:54:46 +00:00
Rhys Perry	c4706c6177	nir/linking_helpers: remove nested IF Just add a && to the condition. This is more readable to me. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	525aacd9d7	nir/linking_helpers: remove varying accesses in nir_remove_unused_io_vars interp_deref_at_sample of a nir_var_shader_temp is nonsensical and might be ignored by later passes, instead of removed. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7818 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10588 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	bcd98e091a	nir/linking_helpers: remove special case for read mesh outputs Only VK_NV_mesh_shader allows this kind of access, and no driver advertises that extension anymore. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	57080749f7	gallium: remove PIPE_CAP_SHADER_CAN_READ_OUTPUTS nir_lower_io_to_temporaries is now done for all stages except TCS, and nir_lower_io_to_temporaries with a TCS is a no-op. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Rhys Perry	767ea18517	glsl: always lower non-TCS outputs to temporaries It seems only radeonsi and v3d sets CAN_READ_OUTPUTS/SupportsReadingOutputs, and v3d has lower_all_io_to_temps=true. It looks like radeonsi basically lowers the outputs to temporaries in the backend. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25590>	2024-07-10 19:11:38 +00:00
Connor Abbott	45a57fa735	ir3: Plumb through descriptor prefetch intrinsics Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	ccf88d940b	nir/instr_set: Don't remove matching instruction We currently assume that the instruction is already inserted and we are optimizing it away, but in the use case I have where we are hoisting instructions into a preamble and deduplicating as we go along, that isn't the case. Move this responsibility onto the caller, which also makes it a bit clearer what's going on and turns this into something more similar to an actual set. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Connor Abbott	cda7d9c971	nir/instr_set: Return the matching instruction This allows use cases where we copy over expression trees and deduplicate as we go along. We can use the matching instruction to build up the rest of the expression tree. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29873>	2024-07-10 11:54:15 +00:00
Alyssa Rosenzweig	0ce2e6594d	nir/opt_constant_folding: fix array size define In practice these are equal but the old code was semantically wrong: that dimension is "sources" not "components". Use the correct #define. This came up when reviewing https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29994 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30066>	2024-07-08 14:34:29 +00:00
Timothy Arceri	d1767ddd13	glsl/tests: fix test_gl_lower_mediump This fixes test_gl_lower_mediump to properly test linking, which also means we can drop all the custom nir calls as we are now simply passing the tests directly through the real nir linking code. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30034>	2024-07-08 06:38:19 +00:00

1 2 3 4 5 ...

9540 commits