fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-25 21:18:26 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	fbaa5ad0c3	iris: Implement force_dual_color_blend_by_location via NIR We can just have iris look at its own program key and change the fragment shader output variable's location/index in the NIR. By doing this before lowering fragment shader outputs, the rest of the output lowering does the right thing, and the backend no longer has to consider hacks for broken OpenGL apps. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Alyssa Rosenzweig	5636a57f60	jay/lower_scoreboard: use SYNC.allrd/allwr This collapses piles of silliness. Totals: CodeSize: 71626288 -> 70710000 (-1.28%) Totals from 1634 (61.73% of 2647) affected shaders: CodeSize: 66319376 -> 65403088 (-1.38%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:26 +00:00
Alyssa Rosenzweig	c1dc9d3b1a	jay/lower_scoreboard: be the sole emitter of SYNC this gets closer to something we can schedule and avoids some pointless syncs. Totals from 491 (18.55% of 2647) affected shaders: Instrs: 602994 -> 602946 (-0.01%) CodeSize: 9063888 -> 9015904 (-0.53%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:26 +00:00
Alyssa Rosenzweig	0885ed10f5	jay/lower_scoreboard: use .src annotations This is less heavy handed, avoiding unnecessary stalls after SENDs in a bunch of common cases. The stats (SIMD32) are: Totals: CodeSize: 70345392 -> 71674272 (+1.89%) Totals from 1774 (67.02% of 2647) affected shaders: CodeSize: 67359248 -> 68688128 (+1.97%) What's happening here is we are inserting extra SYNC.nop instructions in a bunch of cases for the .src preceding the eventual .dst. However, putting aside the i-cache impact for a moment, this is showing the optimization doing what it should (deferring dst syncs and inserting cheaper src syncs first). So this should be positive in reality despite the negative stat impact. The most hurt shaders are pooling up SYNC.nop's at the end of blocks due to local-only SWSB and lack of SYNC.allwr optimization. The latter is added later in this MR. The former is planned. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	130e724d5e	jay/lower_scoreboard: refactor SYNC.nop insertion for next commit Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	1ecd75a397	jay/lower_scoreboard: fix tracking for A@* and *@7 update the tracking with what we actually waited on, not what we ideally wanted to wait on. reduces extra annotations in some cases. SIMD32: Totals from 194 (7.33% of 2647) affected shaders: CodeSize: 14473840 -> 14469088 (-0.03%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	93edf9a3fd	jay/lower_scoreboard: refactor wait pipe code for next commit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	18e09858eb	jay/lower_scoreboard: elide more dependencies IGC does these optimizations and I think they should be safe given my mental model. Given a sequence like: r0 = add.f32 r1, r2 r1 = add.f32 r3, r4 Each ALU pipe is pipelined but in-order. Therefore, the second add cannot possibly complete before the first add, so it cannot write r1 before the first add reads r1, so we can elide the write-after-read dependency. That in term avoids a pipeline bubble between the two instructions. Ditto for write-after-write. Similarly if the distance is too great within an in-order pipe since there is a maximum pipeline length, it's not infinite. Note that if there was cross-pipe dependencies we do need the annotation since the pipes themselves are parallel. SIMD32: Totals from 58 (2.19% of 2647) affected shaders: CodeSize: 3316592 -> 3315056 (-0.05%); split: -0.05%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	e4dc161277	jay: assign accumulators post-RA Greedy post-RA substitution pass, similar to IGC's AccSubstitution pass. Stats together with the previous commits. SIMD16: Totals from 2209 (83.45% of 2647) affected shaders: Instrs: 2701029 -> 2696350 (-0.17%) CodeSize: 39166720 -> 40372272 (+3.08%); split: -0.36%, +3.44% SIMD32: Totals from 2211 (83.53% of 2647) affected shaders: Instrs: 4691165 -> 4641188 (-1.07%) CodeSize: 69365792 -> 69341616 (-0.03%); split: -0.50%, +0.47% The instruction count reduction is from RA shuffle code getting coalesced via accumulators. The code size changes are from: * Fewer moves from the instr count reduction (helped) * Smaller MADs encoded as MACs (helped) * Fewer SYNC.nop due to fewer scoreboarding annotations (helped) * Less compaction due to explicit accumulator operands (hurt) I expect significant cycle count changes from this but we don't have a cycle model wired up yet, so reading the assembly will have to do. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	8b324591d1	jay: move simd32 deswizzling to float pipe for more accumulator usage. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	712719a2ae	jay: do moves on the float pipe where possible this allows us to use accumulators more. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	6f2b1cece6	jay: model MAC Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	b6e88ab904	jay/to_binary: fix packing of simd-split accumulators Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Caio Oliveira	1ebc14bcb9	brw: Stop tracking inline parameter usage in prog_key/prog_data Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Since inline parameter is the last field of the thread payload, the backend can always assume they may exist. They won't affect the position of other payload fields and the register allocator will reuse any unused space. In Anv, also update EmitInlineParameter for Task/Mesh/CS to reflect previous changes in inline parameter setup. Remove/Update some stale comments since we are here. Finally, remove the prog_key/prog_data bits that tracked whether inline data or a push address was needed. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41230>	2026-04-30 16:39:22 +00:00
Alyssa Rosenzweig	a78634ccb0	jay/to_binary: rename grf -> phys_reg Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details since it covers accumulators to Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	ab87a035c9	jay: drop a bunch of stale TODO and XXX These are either done, or never going to be done, or otherwise stale or silly or unnecessary. Drop a bunch. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	70d09d97ef	jay: predicate NoMask instructions in uniform IF's Totals: Instrs: 4742391 -> 4742257 (-0.00%) CodeSize: 70245120 -> 70243520 (-0.00%); split: -0.00%, +0.00% Totals from 81 (3.06% of 2647) affected shaders: Instrs: 337727 -> 337593 (-0.04%) CodeSize: 4992992 -> 4991392 (-0.03%); split: -0.03%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	f199f00564	jay: adjust flag replication Now instructions still read/write UFLAG, which preserves the information about lane 0 we need for proper predication etc. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	930d36b54a	jay: smarten predication pass Merge the empty else optimization, the then-block predication, and the break-while fusion into a unified "try to predicate each side of an if, peephole optimizing control flow" optimization. This is simpler and more general. Totals: Instrs: 4783809 -> 4775647 (-0.17%) CodeSize: 70766656 -> 70674064 (-0.13%); split: -0.13%, +0.00% Totals from 1109 (41.90% of 2647) affected shaders: Instrs: 4130644 -> 4122482 (-0.20%) CodeSize: 61180848 -> 61088256 (-0.15%); split: -0.15%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	80081ef7b2	jay: check for inverse-ballots in jay_uses_flag Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	86f19bc983	jay: propagate inverse-ballots only locally Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	d7283a25d7	jay: do not copyprop ballots globally Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	5828b66b65	jay: convert to LCSSA for correctness with loops. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	fed6b7bea0	jay: drop UGPR->UMEM spilling path This is totally broken now that we have a physical CFG for UGPRs. And of course, UGPRs generally were totally broken without the physical CFG. So I conclude this code basically never worked. Which is good because it was also basically always dead too. Just delete it and replace with a clear error message, instead of pretending it works and either randomly splatting validation or just straight up miscompiling silently or whatever. We might need an alternative UGPR->GPR spill path some day but that day is not today. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	ad040f2fbb	jay: introduce a physical control flow graph Consider: u0 = foo() if (divergent) { u0 = bar() r0 = baz(u0) } else { r0 = quux(u0) } Logically, this is fine, there is no interference between bar() and u0. But physically, both sides of the if execute so the bar() write to u0 overwrites the variable the else reads. So this is a miscompile. The solution is to model the extra edges in the physical control flow graph, which lives next to the existing logical control flow graph. Liveness for UGPRs now follows the physical CFG, while liveness for GPRs continues to follow the logical CFG. That models the interference properly, while still allowing phis to work as before (since phis writing UGPRs follow uniform bits of control flow that are necessarily critical edge free for the same reason the logical CFG is). Because our RA copies shuffled registers back at block ends (following Colombet), there's no issue with live range splits here (unlike aco which inserts phis for this case and then needs to worry about critical edges around those phis). There might still be an extremely-challenging-to-hit bug here with UGPR spilling which I need to think more about. It might be fine as-is? Not convinced though. But this is big enough and strictly less broken than what we have right now and the full solution will build on this, so here we are. Fixes artefating in SuperTuxKart and Celestia knows what else. Totals: Instrs: 2770938 -> 2771269 (+0.01%); split: -0.00%, +0.02% CodeSize: 40133712 -> 40138480 (+0.01%); split: -0.01%, +0.02% Totals from 158 (5.97% of 2647) affected shaders: Instrs: 514523 -> 514854 (+0.06%); split: -0.02%, +0.09% CodeSize: 7603040 -> 7607808 (+0.06%); split: -0.03%, +0.09% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	fadb826515	jay/opt_propagate: disable f64 opts for now could be done but would need more work. No stats change. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	8e4145948f	jay/opt_propagate: fold uflag copies Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	b9f8f2477e	jay: inline jay_control() This accessor is more opaque imho. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	978d20e5fe	jay: drop jay_exec_mask this strategy is panning out nicely. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	238c4ecf40	jay: fix 16-bit predicated compares Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	0bd4f1b874	jay: consolidate file prefixes Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	15365f8ea2	jay: jayize swsb print Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	fccd68625c	jay: shrink stack allocation Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Kenneth Graunke	0a5c748e19	jay: Don't forget UACCUM! Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	3308626e12	jay/assign_flags: don't burn a flag for ballots Increases GPR pressure somehow but it's obviously the right thing to do. SIMD16: Totals: Instrs: 2767536 -> 2767381 (-0.01%); split: -0.01%, +0.00% CodeSize: 44323392 -> 40075680 (-9.58%); split: -9.58%, +0.00% Totals from 2147 (81.11% of 2647) affected shaders: Instrs: 2704498 -> 2704343 (-0.01%); split: -0.01%, +0.00% CodeSize: 43477568 -> 39229856 (-9.77%); split: -9.77%, +0.00% SIMD32: Totals: Instrs: 4731031 -> 4746775 (+0.33%); split: -0.33%, +0.67% CodeSize: 76609152 -> 70004080 (-8.62%); split: -8.68%, +0.06% Number of spill instructions: 50110 -> 50187 (+0.15%); split: -0.00%, +0.16% Number of fill instructions: 51341 -> 51804 (+0.90%); split: -0.00%, +0.91% Totals from 2136 (80.70% of 2647) affected shaders: Instrs: 4666677 -> 4682421 (+0.34%); split: -0.34%, +0.67% CodeSize: 75735136 -> 69130064 (-8.72%); split: -8.78%, +0.06% Number of spill instructions: 50108 -> 50185 (+0.15%); split: -0.00%, +0.16% Number of fill instructions: 51339 -> 51802 (+0.90%); split: -0.00%, +0.91% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	2c77717e5c	jay/assign_flags: don't burn a null flag SIMD32: Totals from 423 (15.98% of 2647) affected shaders: Instrs: 740042 -> 736360 (-0.50%); split: -1.25%, +0.75% CodeSize: 11984176 -> 11925888 (-0.49%); split: -1.23%, +0.74% Number of spill instructions: 4675 -> 4676 (+0.02%) Number of fill instructions: 5698 -> 5684 (-0.25%); split: -0.28%, +0.04% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	796886f72c	jay/assign_flags: refactor for next commit Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41215>	2026-04-28 23:13:50 +00:00
Alyssa Rosenzweig	fd46a48ccc	jay/ra: only use stride=4 temps Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details SIMD16: Totals from 56 (2.12% of 2647) affected shaders: Instrs: 541831 -> 542004 (+0.03%); split: -0.40%, +0.44% CodeSize: 8597680 -> 8597248 (-0.01%); split: -0.45%, +0.44% SIMD32: Totals: Instrs: 4858179 -> 4734713 (-2.54%); split: -2.78%, +0.24% CodeSize: 78651424 -> 76667440 (-2.52%); split: -2.76%, +0.24% Totals from 1108 (41.86% of 2647) affected shaders: Instrs: 4241312 -> 4117846 (-2.91%); split: -3.18%, +0.27% CodeSize: 68753152 -> 66769168 (-2.89%); split: -3.16%, +0.27% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:12 +00:00
Alyssa Rosenzweig	1f62da938b	jay/ra: drop memory copy reordering No shader-db changes, and no longer required for correctness. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:12 +00:00
Alyssa Rosenzweig	45845ea7f2	jay/ra: use accumulator for stride=4 swaps SIMD16: Totals: Instrs: 2767930 -> 2767190 (-0.03%) CodeSize: 44327408 -> 44312304 (-0.03%); split: -0.04%, +0.00% Totals from 142 (5.36% of 2647) affected shaders: Instrs: 658928 -> 658188 (-0.11%) CodeSize: 10514512 -> 10499408 (-0.14%); split: -0.16%, +0.01% SIMD32: Totals: Instrs: 4884039 -> 4858179 (-0.53%) CodeSize: 79079008 -> 78651424 (-0.54%); split: -0.54%, +0.00% Totals from 761 (28.75% of 2647) affected shaders: Instrs: 3803274 -> 3777414 (-0.68%) CodeSize: 61707728 -> 61280144 (-0.69%); split: -0.70%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:12 +00:00
Alyssa Rosenzweig	489f883277	jay/ra: use accumulator for memory swaps SIMD1: Totals from 34 (1.28% of 2647) affected shaders: Instrs: 427731 -> 434349 (+1.55%); split: -0.03%, +1.58% CodeSize: 6773248 -> 6881136 (+1.59%); split: -0.04%, +1.63% Number of spill instructions: 1833 -> 1700 (-7.26%) Number of fill instructions: 2095 -> 1944 (-7.21%) SIMD32: Totals from 621 (23.46% of 2647) affected shaders: Instrs: 3663406 -> 3739089 (+2.07%); split: -0.62%, +2.68% CodeSize: 59392464 -> 60624704 (+2.07%); split: -0.61%, +2.68% Number of spill instructions: 52115 -> 50109 (-3.85%); split: -3.90%, +0.05% Number of fill instructions: 53864 -> 51355 (-4.66%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:11 +00:00
Alyssa Rosenzweig	2e5fd6da42	jay/ra: use accumulator for memory copies SIMD16: Totals from 34 (1.28% of 2647) affected shaders: Instrs: 424527 -> 427731 (+0.75%); split: -0.03%, +0.78% CodeSize: 6720896 -> 6773248 (+0.78%); split: -0.04%, +0.82% Number of spill instructions: 1967 -> 1833 (-6.81%) Number of fill instructions: 2247 -> 2095 (-6.76%) SIMD32: Totals: Instrs: 4691989 -> 4808356 (+2.48%); split: -0.46%, +2.94% CodeSize: 76011248 -> 77884320 (+2.46%); split: -0.46%, +2.92% Number of spill instructions: 54223 -> 52115 (-3.89%); split: -4.08%, +0.19% Number of fill instructions: 56519 -> 53864 (-4.70%) Totals from 606 (22.89% of 2647) affected shaders: Instrs: 3509511 -> 3625878 (+3.32%); split: -0.61%, +3.93% CodeSize: 56909488 -> 58782560 (+3.29%); split: -0.61%, +3.90% Number of spill instructions: 54223 -> 52115 (-3.89%); split: -4.08%, +0.19% Number of fill instructions: 56519 -> 53864 (-4.70%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:11 +00:00
Alyssa Rosenzweig	7d2a88a9e5	jay/ra: don't reserve registers when not spilling No changes at SIMD16. At SIMD32: Totals: Instrs: 4691895 -> 4691989 (+0.00%); split: -0.03%, +0.03% CodeSize: 76010880 -> 76011248 (+0.00%); split: -0.03%, +0.03% Number of spill instructions: 54369 -> 54223 (-0.27%) Number of fill instructions: 56668 -> 56519 (-0.26%) Totals from 71 (2.68% of 2647) affected shaders: Instrs: 75963 -> 76057 (+0.12%); split: -1.67%, +1.79% CodeSize: 1229792 -> 1230160 (+0.03%); split: -1.71%, +1.74% Number of spill instructions: 146 -> 0 (-inf%) Number of fill instructions: 149 -> 0 (-inf%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:11 +00:00
Alyssa Rosenzweig	e5bf153d4f	jay/lower_post_ra: drop old 2<-->8 lowering this XOR based lowering is no longer needed. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:10 +00:00
Alyssa Rosenzweig	915af8e121	jay/lower_post_ra: remove SWAP macro Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:10 +00:00
Alyssa Rosenzweig	4c5ad7a832	jay/register_allocate: start using accumulators this lets us lower away 8<-->2 copies/swaps in a faster, more straightforward way by (ab)using accumulators. I think as an edge case this plays nicely enough with my plans to profit from accs for normal fma-heavy code. SIMD16: Totals: Instrs: 2761525 -> 2758108 (-0.12%) CodeSize: 44222384 -> 44167168 (-0.12%) Totals from 33 (1.25% of 2647) affected shaders: Instrs: 422130 -> 418713 (-0.81%) CodeSize: 6713680 -> 6658464 (-0.82%) SIMD32: Totals: Instrs: 4911601 -> 4691895 (-4.47%) CodeSize: 79553984 -> 76010880 (-4.45%) Totals from 947 (35.78% of 2647) affected shaders: Instrs: 4143501 -> 3923795 (-5.30%) CodeSize: 67174592 -> 63631488 (-5.27%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:10 +00:00
Alyssa Rosenzweig	53c1c076a8	jay: validate non-SSA accumulators just enough for us to do parallel copy lowering with them. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:09 +00:00
Alyssa Rosenzweig	28cf0f52c1	jay/to_binary: handle packing accumulators Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:09 +00:00
Alyssa Rosenzweig	aa37d8b248	jay/print: deal with bare r0 copies Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:09 +00:00
Kenneth Graunke	e55af8793f	jay: Add missing ROR case Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41064>	2026-04-20 22:32:09 +00:00

1 2

97 commits