fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 06:18:10 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	a590500802	jay: Add a GPR_FROM_UGPRS opcode Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	4555cd23c6	jay: Set Dispatch GRF Start Register in jay_setup_payload() We want it to be set to wherever the push constants ended up. Setting it close to the setup_payload_push() call makes this easier. We'll also be adding some extra UGPRs for the fragment shader payload soon, and the partitioning code will just have one big UGPR partition for payload fields, push constants, and general purpose UGPRs, so it really won't know how to do this very well without duplicating a bunch of information. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	0670b40013	jay: Add comments summarizing the PS thread payload layout The documentation is large and hard to follow due to all the optional fields and the SIMD16 vs. SIMD32 split for barycentrics. This quick summary helps clarify what fields exist, which are split for SIMD32 or kept together, and which pairs of registers are involved for splits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	6c142f7edc	jay: Implement sample mask writes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	49299050ea	jay: Implement fragment shader stencil writes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	b01d286083	jay: Move render target store payload/descriptor construction to backend Constructing the render target store payload is more complex than we can reasonably handle at the NIR level. The main reason is that samplemask and stencil are packed 16-bit and 8-bit parameters, respectively, which are intermixed with other values that are 32-bit. In SIMD32 mode, the packed sub-32-bit values take up fewer registers than normal values. Currently we also don't specialize the NIR for each FS dispatch width, and we can't construct the message descriptor without knowing it. So, we alter nir_intrinsic_store_render_target_intel to take each of the expected parameters - colour, depth, stencil, samplemask, src0_alpha, and discard predicate. We construct the payloads and descriptors in the backend. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	bc22a37d98	jay: schedule for pressure Implement a simple pre-RA bottom-up list scheduler with the goal of decreasing register pressure. On Xe2, this significantly reduces spilling. SSA form allows us to estimate register demand cheaply and accurately, which theoretically [1] gives this algorithm the two Hippocratic properties: 1. Shaders with low register pressure are unaffected. 2. Register pressure can only be decreased, never increased. In other words: first, do no harm. The heuristic itself is very simple: greedily choose instructions that decrease liveness using a backwards list scheduler. This is far from optimal! But thanks to the above properties, even a heuristic that picked random instructions would be a win overall - by construction, we can only ever win. In other words: this scheduler is your older brother powering off the game console any time he's about to lose a game, maintaining a 100% win rate. [1] In reality, neither property is strictly satisfied due to the messy details of mapping our clean logical model onto Intel's many weird physical register files. Nevertheless, the algorithm is well-motivated and the empirical results on Xe2 are excellent. SIMD16: Totals: Instrs: 2754194 -> 2753957 (-0.01%); split: -0.23%, +0.22% CodeSize: 41094768 -> 41092768 (-0.00%); split: -0.23%, +0.23% Number of spill instructions: 1724 -> 1129 (-34.51%) Number of fill instructions: 1912 -> 1119 (-41.47%) Totals from 168 (6.35% of 2647) affected shaders: Instrs: 850994 -> 850757 (-0.03%); split: -0.75%, +0.73% CodeSize: 12825680 -> 12823680 (-0.02%); split: -0.74%, +0.73% Number of spill instructions: 1724 -> 1129 (-34.51%) Number of fill instructions: 1912 -> 1119 (-41.47%) SIMD32: Totals: Instrs: 4688858 -> 4557800 (-2.80%); split: -3.53%, +0.74% CodeSize: 70177200 -> 68214816 (-2.80%); split: -3.53%, +0.74% Number of spill instructions: 50316 -> 45795 (-8.99%); split: -9.56%, +0.57% Number of fill instructions: 51526 -> 45075 (-12.52%); split: -13.23%, +0.71% Totals from 819 (30.94% of 2647) affected shaders: Instrs: 3810182 -> 3679124 (-3.44%); split: -4.35%, +0.91% CodeSize: 57044000 -> 55081616 (-3.44%); split: -4.35%, +0.91% Number of spill instructions: 49264 -> 44743 (-9.18%); split: -9.76%, +0.58% Number of fill instructions: 50182 -> 43731 (-12.86%); split: -13.58%, +0.73% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	81e21a8756	jay: factor jay_op_(starts,ends)_block queries Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	e72ffb0046	jay: annotate pure sends for scheduling, CSE, etc Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	c069b7e47c	jay/opt_propagate: avoid branching on poison logically it doesn't matter because we'll bail on a later check, but this is still UB and therefore releases nasal demons. i am jealous of Faith's Rust compilers. there, I said it. ==107281== Conditional jump or move depends on uninitialised value(s) ==107281== at 0x7069768: propagate_backwards (jay_opt_propagate.c:327) ==107281== by 0x7069768: jay_opt_propagate_backwards (jay_opt_propagate.c:367) ==107281== by 0x7058960: jay_compile (jay_from_nir.c:2677) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	4b0c3f5c32	jay/lower_scoreboard: add asserts on key bounds if these are botched you get UB (-: Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	4c97493b69	jay/lower_scoreboard: handle accumulator hazard Challenging to hit but fixes dEQP-GLES3.functional.shaders.swizzle_math_operations.vector_multiply.mediump_ivec4_wzyx_zyxw_fragment with scheduling changes. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	9a68101bc2	jay/liveness: drop redundant source filtering Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	9b68b4e7a1	jay/liveness: speed up physical CFG merging on top of scheduler changes, compile-time of shaders/blender/1017.shader_test: Difference at 95.0% confidence -0.00173202 +/- 0.00116931 -0.791537% +/- 0.532384% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	1b50d3eed2	jay/liveness: remove pointless bitset init dup initializes it. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	5da3b57605	jay: insert simd32 deswizzle in a dedicated pass we don't actually need the DESWIZZLE pseudo instruction, and the pseudo op complicates pre-RA scheduling. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	47c6601d5e	jay: relax fragment payload layout this isn't optimal but it should unblock bring up. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Co-authored-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Karol Herbst	e9c1cce35f	nir: remove ffma_old Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41165>	2026-05-19 18:13:42 +00:00
Karol Herbst	6208a590cb	intel/jay: support nir_op_ffma Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41165>	2026-05-19 18:13:32 +00:00
Karol Herbst	a9b18f8607	nir: rename ffma to ffma_old We'll get three new opcodes to properly model float multiply-add. ffma_old is temporary and will be deleted at the end of this series. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41165>	2026-05-19 18:13:27 +00:00
Lionel Landwerlin	682dc50776	brw/jay: move sample_mask_in handling to NIR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41529>	2026-05-14 14:05:06 +00:00
Lionel Landwerlin	df5a6d7b87	brw/jay: move some coarse lowering to NIR We add a pass to allow testing partially known fs config bits (main user is DX11 always disabling VRS/coarse). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41529>	2026-05-14 14:05:06 +00:00
Kenneth Graunke	f6debb842d	jay: Gripe more clearly about dual source blending Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41535>	2026-05-13 23:03:15 +00:00
Kenneth Graunke	4f26c6b682	jay: Add a TODO for coarse pixel shading This is a less obtuse error message for why things break. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41535>	2026-05-13 23:03:15 +00:00
Kenneth Graunke	4b4aad7c44	jay: Include depth and stencil on all MRT stores The hardware expects it to be present for every colour target. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41535>	2026-05-13 23:03:15 +00:00
Alyssa Rosenzweig	db95df3da4	jay/opt_propagate: propagate undefs Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details allows deleting piles of moves & pressure. simd16 results: Totals: Instrs: 2759547 -> 2753358 (-0.22%); split: -0.29%, +0.06% CodeSize: 41141280 -> 41071072 (-0.17%); split: -0.23%, +0.06% Totals from 332 (12.54% of 2647) affected shaders: Instrs: 648080 -> 641891 (-0.95%); split: -1.23%, +0.28% CodeSize: 9782272 -> 9712064 (-0.72%); split: -0.97%, +0.25% simd32 is a loss because of RA being stupid. again, this is obviously the right thing to do so we're doing it. stats are just a hint. Totals: Instrs: 4683556 -> 4689193 (+0.12%); split: -0.25%, +0.37% CodeSize: 70072256 -> 70171920 (+0.14%); split: -0.23%, +0.38% Number of spill instructions: 50320 -> 50316 (-0.01%) Number of fill instructions: 51530 -> 51526 (-0.01%) Totals from 351 (13.26% of 2647) affected shaders: Instrs: 1349954 -> 1355591 (+0.42%); split: -0.86%, +1.28% CodeSize: 20484224 -> 20583888 (+0.49%); split: -0.80%, +1.29% Number of spill instructions: 21762 -> 21758 (-0.02%) Number of fill instructions: 26328 -> 26324 (-0.02%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:36 +00:00
Alyssa Rosenzweig	21e527ceec	jay/opt_propagate: fix NOT propagation and add a test for it. oops. Totals: Instrs: 4700885 -> 4683707 (-0.37%); split: -1.36%, +1.00% CodeSize: 70551872 -> 70285088 (-0.38%); split: -1.35%, +0.97% Number of spill instructions: 50325 -> 50320 (-0.01%) Number of fill instructions: 51541 -> 51530 (-0.02%) Totals from 1261 (47.64% of 2647) affected shaders: Instrs: 3932922 -> 3915744 (-0.44%); split: -1.63%, +1.19% CodeSize: 59196320 -> 58929536 (-0.45%); split: -1.60%, +1.15% Number of spill instructions: 47901 -> 47896 (-0.01%) Number of fill instructions: 48420 -> 48409 (-0.02%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:36 +00:00
Alyssa Rosenzweig	5cbf0002c4	jay/register_allocate: tweak roundrobin heuristic Totals: Instrs: 4706214 -> 4700132 (-0.13%); split: -1.03%, +0.90% CodeSize: 70628880 -> 70540336 (-0.13%); split: -1.02%, +0.89% Totals from 2084 (78.73% of 2647) affected shaders: Instrs: 4515981 -> 4509899 (-0.13%); split: -1.08%, +0.94% CodeSize: 67822800 -> 67734256 (-0.13%); split: -1.06%, +0.93% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:35 +00:00
Alyssa Rosenzweig	37e4144693	jay/register_allocate: set num_regs[MEM] properly this is both a correctness fix (insufficient MEM registers reserved in some cases) and a performance fix (unnecessary allocations & zeroing in the RA when we don't spill). fixes dEQP-VK.dgc.ext.compute.misc.scratch_space stats are noise but positive i guess. Totals from 35 (1.32% of 2647) affected shaders: Instrs: 396770 -> 396690 (-0.02%) CodeSize: 6040832 -> 6039600 (-0.02%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:35 +00:00
Alyssa Rosenzweig	d67e37a24c	jay/lower_scoreboard: use sbid syncs to elide regdist deps Totals from 1522 (57.50% of 2647) affected shaders: CodeSize: 65268400 -> 65056176 (-0.33%); split: -0.33%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:35 +00:00
Alyssa Rosenzweig	89e33407e4	jay/lower_scoreboard: use CFG for RegDist scoreboarding this is now properly global. Totals from 558 (21.08% of 2647) affected shaders: CodeSize: 42098496 -> 42078256 (-0.05%); split: -0.05%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:35 +00:00
Alyssa Rosenzweig	c2a423b5b5	jay/lower_scoreboard: rename gpr_range -> key for clarity since UGPRs are here too. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:34 +00:00
Alyssa Rosenzweig	d549fb9c04	jay/lower_scoreboard: compact inst_exec_pipe Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:34 +00:00
Alyssa Rosenzweig	adaae3baf1	jay/lower_scoreboard: control flow is int pipe according to IGC output. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:34 +00:00
Alyssa Rosenzweig	039b76d07c	jay/lower_scoreboard: factor regdist logic out no change, just hoisting the loop & reindenting. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:33 +00:00
Alyssa Rosenzweig	a7b8395c15	jay/lower_scoreboard: run RegDist globally poking around, it seems branches stall the pipelines so we don't need to do any dataflow analysis, but we do need to fall through for correctness. just keep going across block boundaries. this isn't optimal yet but it reduces a pile of A@1's already. Totals from 1389 (52.47% of 2647) affected shaders: CodeSize: 56385376 -> 56325776 (-0.11%); split: -0.13%, +0.03% -- this also fixes issues where the first instruction of a block is a SEND that has an unmet register dependency, since the old code was fundamentally broken. oops. lol. fixes dEQP-VK.compute.pipeline.workgroup_memory_explicit_layout.zero.uint8_t_array_to_uint_array_1 among many others. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:33 +00:00
Alyssa Rosenzweig	52224bb597	jay/lower_scoreboard: refactor no functional change, just reshuffling code for next commit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:33 +00:00
Alyssa Rosenzweig	3a7baf2cde	jay/lower_scoreboard: fix trivial scheduling Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:32 +00:00
Alyssa Rosenzweig	7ba6e9810a	jay: clarify development model Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:32 +00:00
Alyssa Rosenzweig	45d63539a6	jay: have proper UNDEF matches NIR's broken semantics but allows more opts later. just a rename here. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:32 +00:00
Alyssa Rosenzweig	c2911dd688	jay: fix comment Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:32 +00:00
Alyssa Rosenzweig	3d94ba1d20	jay: make indirect push data blow up more obviously fail to crash: dEQP-VK.spirv_assembly.instruction.compute.untyped_pointers.glsl_memory_model.basic_usecase.load.push_constant.int32 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:31 +00:00
Alyssa Rosenzweig	b10c0d95a8	jay: optimize pack_32_2x16_split(#0 , x) Kinda pointless but whatever. Totals from 10 (0.38% of 2647) affected shaders: Instrs: 6846 -> 6830 (-0.23%) CodeSize: 95728 -> 95520 (-0.22%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:31 +00:00
Alyssa Rosenzweig	5ebf0c9161	jay: elide atomic dests simd16 results. kinda noisy but obviously the right thing to do. Totals from 45 (1.70% of 2647) affected shaders: Instrs: 59182 -> 59194 (+0.02%); split: -0.11%, +0.14% CodeSize: 905200 -> 904752 (-0.05%); split: -0.17%, +0.12% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:31 +00:00
Alyssa Rosenzweig	b3fe01e2c1	jay: fix bfn with 0xffff constant awkward. Totals from 128 (4.84% of 2647) affected shaders: Instrs: 258121 -> 257970 (-0.06%); split: -0.07%, +0.01% CodeSize: 3662400 -> 3661792 (-0.02%); split: -0.14%, +0.12% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:30 +00:00
Alyssa Rosenzweig	c5cee5d973	jay: add JAY_DEBUG=noacc option can help when debugging RA. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:30 +00:00
Alyssa Rosenzweig	9dbaaecb74	jay: swap predication/acc pass order Lets us use more accumulators, I think this is well motivated. Saw this in a test shader. Totals from 242 (9.14% of 2647) affected shaders: Instrs: 1365060 -> 1365035 (-0.00%); split: -0.00%, +0.00% CodeSize: 20678592 -> 20680096 (+0.01%); split: -0.01%, +0.02% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:30 +00:00
Kenneth Graunke	fbaa5ad0c3	iris: Implement force_dual_color_blend_by_location via NIR We can just have iris look at its own program key and change the fragment shader output variable's location/index in the NIR. By doing this before lowering fragment shader outputs, the rest of the output lowering does the right thing, and the backend no longer has to consider hacks for broken OpenGL apps. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Alyssa Rosenzweig	5636a57f60	jay/lower_scoreboard: use SYNC.allrd/allwr This collapses piles of silliness. Totals: CodeSize: 71626288 -> 70710000 (-1.28%) Totals from 1634 (61.73% of 2647) affected shaders: CodeSize: 66319376 -> 65403088 (-1.38%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:26 +00:00
Alyssa Rosenzweig	c1dc9d3b1a	jay/lower_scoreboard: be the sole emitter of SYNC this gets closer to something we can schedule and avoids some pointless syncs. Totals from 491 (18.55% of 2647) affected shaders: Instrs: 602994 -> 602946 (-0.01%) CodeSize: 9063888 -> 9015904 (-0.53%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:26 +00:00

1 2 3

144 commits