fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 21:28:10 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	9b68b4e7a1	jay/liveness: speed up physical CFG merging on top of scheduler changes, compile-time of shaders/blender/1017.shader_test: Difference at 95.0% confidence -0.00173202 +/- 0.00116931 -0.791537% +/- 0.532384% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	1b50d3eed2	jay/liveness: remove pointless bitset init dup initializes it. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	5da3b57605	jay: insert simd32 deswizzle in a dedicated pass we don't actually need the DESWIZZLE pseudo instruction, and the pseudo op complicates pre-RA scheduling. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	47c6601d5e	jay: relax fragment payload layout this isn't optimal but it should unblock bring up. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Co-authored-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	cb75c9f962	brw: Lower sample_pos for non-per-sample shaders in NIR We generalize the sample_mask_in lowering to handle this too. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:45 +00:00
Jordan Justen	28f6a442c6	brw/compact: Precompact using 2src fields on 3src instructions In shader-db, with `-p skl`, shaders/0ad/12.shader_test does not compact an instruction because precompact overwrites portions of the instruction. (Treating the three source instruction as a two source when accessing instruction fields.) This instruction could be compacted: mad(8) g65<1>F g61<4,4,1>F g64<4,4,1>F -g17<4,4,1>F { align16 1Q }; But, since precompact erroneously sets bits, the instruction isn't compacted. Fossil testing: * Tested with `0a3f3fd193` ("brw: drop unused color_outputs_valid key") reverted, as fossils are currently producing inconsitent results otherwise. * Tested skl, icl, dg2, mtl, lnl, bmg and ptl. Only skl had a change. SKL: Totals: CodeSize: 8335219296 -> 8320248992 (-0.18%) Totals from 359508 (14.42% of 2492689) affected shaders: CodeSize: 2838254352 -> 2823284048 (-0.53%) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41588>	2026-05-20 11:52:52 -07:00
José Roberto de Souza	180d8cb544	intel/brw: Fix nir_intrinsic_load_inline_data_intel register offset calculation Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details In case of nir_intrinsic_load_inline_data_intel it was not using base_offset to create the uniform, instead it was using only the special BRW_INLINE_PARAM_REG value that later will be replaced by the inline_data fixed register. So here using base_offset for both intrinsics, adding BRW_INLINE_PARAM_REG if nir_intrinsic_load_inline_data_intel and then in brw_shader::assign_curb_setup checking for inst->src[i].nr >= BRW_INLINE_PARAM_REG and adjusting brw_reg by the remaining of the subtraction with BRW_INLINE_PARAM_REG. Fixes: `7f19814414` ("brw/nir: handle inline_data_intel more like push_data_intel") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41607>	2026-05-19 19:30:18 +00:00
Karol Herbst	e9c1cce35f	nir: remove ffma_old Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41165>	2026-05-19 18:13:42 +00:00
Karol Herbst	a9206a271a	intel/brw: port over to nir_op_ffma Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41165>	2026-05-19 18:13:33 +00:00
Karol Herbst	6208a590cb	intel/jay: support nir_op_ffma Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41165>	2026-05-19 18:13:32 +00:00
Karol Herbst	df69364e69	intel/elk: port over to nir_op_ffma Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41165>	2026-05-19 18:13:32 +00:00
Karol Herbst	a9b18f8607	nir: rename ffma to ffma_old We'll get three new opcodes to properly model float multiply-add. ffma_old is temporary and will be deleted at the end of this series. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41165>	2026-05-19 18:13:27 +00:00
Calder Young	f60749ff3c	brw: Add support for ACCESS_CAN_REORDER memory ordering Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Passes the ACCESS_CAN_REORDER flag from NIR on to the backend so that we can lower the loads to a non-volatile SEND. This allows the scheduler to freely reorder them around stores or fences. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41008>	2026-05-17 19:03:24 +00:00
Calder Young	bb4878b203	brw: Allow instruction reordering around memory writes Our scheduler is overly conservative about reordering instructions around memory writes or fences. Fortunately, there are several simple assumptions we can make about our IR to schedule these things a lot more fluidly: * Unless its an EOT, a SEND instruction's side effects will only be observed through other SEND instructions * The effects of workgroup barriers, memory fences, and BRW_OPCODE_SYNC, are only used in the IR to synchronize SEND instructions * All other scheduler dependencies related to memory access are already expressed through the source and destination operands Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41008>	2026-05-17 19:03:24 +00:00
Lionel Landwerlin	682dc50776	brw/jay: move sample_mask_in handling to NIR Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41529>	2026-05-14 14:05:06 +00:00
Lionel Landwerlin	df5a6d7b87	brw/jay: move some coarse lowering to NIR We add a pass to allow testing partially known fs config bits (main user is DX11 always disabling VRS/coarse). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41529>	2026-05-14 14:05:06 +00:00
Lionel Landwerlin	dfa7e15f7c	brw: simplify VF component packing code We can determine used components earlier. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41501>	2026-05-14 10:39:25 +00:00
Caio Oliveira	771714a0ce	brw/tests: Stop using regions/type for non-null SEND sources in tests SEND operands don't have regions or types, hardware don't use those bits except for possibly an old workaround. So from the perspective of assembler, we shouldn't need to add them. For now brw_asm grammar requires at least a type, so normalize to UD. This will make easier to swap the parser syntax and code later. Assisted-by: Pi coding agent (opus-4.7) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41456>	2026-05-14 01:29:13 +00:00
Caio Oliveira	08d805e03b	brw/tests: Stop using regions/type for null in assembler tests From the perspective of assembler, regions and types for ARF null are not relevant -- so ignore them. We still have some validation relying on the byte-stride of the destination, so keep those for now. In the long run, if a certain Gfx version HW requires some specific matching, the encoder (or the parser) should take care of it. This change will make easier to swap the parser syntax and code later. Assisted-by: Pi coding agent (opus-4.7) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41456>	2026-05-14 01:29:13 +00:00
Caio Oliveira	7a12758b8c	brw/tests: Remove redundant parser test Same test a couple of lines above. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41456>	2026-05-14 01:29:12 +00:00
Kenneth Graunke	f6debb842d	jay: Gripe more clearly about dual source blending Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41535>	2026-05-13 23:03:15 +00:00
Kenneth Graunke	4f26c6b682	jay: Add a TODO for coarse pixel shading This is a less obtuse error message for why things break. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41535>	2026-05-13 23:03:15 +00:00
Kenneth Graunke	4b4aad7c44	jay: Include depth and stencil on all MRT stores The hardware expects it to be present for every colour target. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41535>	2026-05-13 23:03:15 +00:00
Kenneth Graunke	faede3c3c1	intel/nir: Only add an explicit LOD 0 when lod/bias don't already exist When lowering tg4 sparse testing to a non-gather opcode, we were adding an explicit LOD 0 parameter. But we might already have a LOD or bias. Fixes tests like: dEQP-VK.glsl.texture_gather.basic.2d.rgba8.base_level.sparse_level_1_amd_lod dEQP-VK.glsl.texture_gather.basic.2d.rgba8.base_level.sparse_level_1_amd_bias Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41535>	2026-05-13 23:03:14 +00:00
Alyssa Rosenzweig	db95df3da4	jay/opt_propagate: propagate undefs Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details allows deleting piles of moves & pressure. simd16 results: Totals: Instrs: 2759547 -> 2753358 (-0.22%); split: -0.29%, +0.06% CodeSize: 41141280 -> 41071072 (-0.17%); split: -0.23%, +0.06% Totals from 332 (12.54% of 2647) affected shaders: Instrs: 648080 -> 641891 (-0.95%); split: -1.23%, +0.28% CodeSize: 9782272 -> 9712064 (-0.72%); split: -0.97%, +0.25% simd32 is a loss because of RA being stupid. again, this is obviously the right thing to do so we're doing it. stats are just a hint. Totals: Instrs: 4683556 -> 4689193 (+0.12%); split: -0.25%, +0.37% CodeSize: 70072256 -> 70171920 (+0.14%); split: -0.23%, +0.38% Number of spill instructions: 50320 -> 50316 (-0.01%) Number of fill instructions: 51530 -> 51526 (-0.01%) Totals from 351 (13.26% of 2647) affected shaders: Instrs: 1349954 -> 1355591 (+0.42%); split: -0.86%, +1.28% CodeSize: 20484224 -> 20583888 (+0.49%); split: -0.80%, +1.29% Number of spill instructions: 21762 -> 21758 (-0.02%) Number of fill instructions: 26328 -> 26324 (-0.02%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:36 +00:00
Alyssa Rosenzweig	21e527ceec	jay/opt_propagate: fix NOT propagation and add a test for it. oops. Totals: Instrs: 4700885 -> 4683707 (-0.37%); split: -1.36%, +1.00% CodeSize: 70551872 -> 70285088 (-0.38%); split: -1.35%, +0.97% Number of spill instructions: 50325 -> 50320 (-0.01%) Number of fill instructions: 51541 -> 51530 (-0.02%) Totals from 1261 (47.64% of 2647) affected shaders: Instrs: 3932922 -> 3915744 (-0.44%); split: -1.63%, +1.19% CodeSize: 59196320 -> 58929536 (-0.45%); split: -1.60%, +1.15% Number of spill instructions: 47901 -> 47896 (-0.01%) Number of fill instructions: 48420 -> 48409 (-0.02%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:36 +00:00
Alyssa Rosenzweig	5cbf0002c4	jay/register_allocate: tweak roundrobin heuristic Totals: Instrs: 4706214 -> 4700132 (-0.13%); split: -1.03%, +0.90% CodeSize: 70628880 -> 70540336 (-0.13%); split: -1.02%, +0.89% Totals from 2084 (78.73% of 2647) affected shaders: Instrs: 4515981 -> 4509899 (-0.13%); split: -1.08%, +0.94% CodeSize: 67822800 -> 67734256 (-0.13%); split: -1.06%, +0.93% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:35 +00:00
Alyssa Rosenzweig	37e4144693	jay/register_allocate: set num_regs[MEM] properly this is both a correctness fix (insufficient MEM registers reserved in some cases) and a performance fix (unnecessary allocations & zeroing in the RA when we don't spill). fixes dEQP-VK.dgc.ext.compute.misc.scratch_space stats are noise but positive i guess. Totals from 35 (1.32% of 2647) affected shaders: Instrs: 396770 -> 396690 (-0.02%) CodeSize: 6040832 -> 6039600 (-0.02%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:35 +00:00
Alyssa Rosenzweig	d67e37a24c	jay/lower_scoreboard: use sbid syncs to elide regdist deps Totals from 1522 (57.50% of 2647) affected shaders: CodeSize: 65268400 -> 65056176 (-0.33%); split: -0.33%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:35 +00:00
Alyssa Rosenzweig	89e33407e4	jay/lower_scoreboard: use CFG for RegDist scoreboarding this is now properly global. Totals from 558 (21.08% of 2647) affected shaders: CodeSize: 42098496 -> 42078256 (-0.05%); split: -0.05%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:35 +00:00
Alyssa Rosenzweig	c2a423b5b5	jay/lower_scoreboard: rename gpr_range -> key for clarity since UGPRs are here too. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:34 +00:00
Alyssa Rosenzweig	d549fb9c04	jay/lower_scoreboard: compact inst_exec_pipe Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:34 +00:00
Alyssa Rosenzweig	adaae3baf1	jay/lower_scoreboard: control flow is int pipe according to IGC output. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:34 +00:00
Alyssa Rosenzweig	039b76d07c	jay/lower_scoreboard: factor regdist logic out no change, just hoisting the loop & reindenting. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:33 +00:00
Alyssa Rosenzweig	a7b8395c15	jay/lower_scoreboard: run RegDist globally poking around, it seems branches stall the pipelines so we don't need to do any dataflow analysis, but we do need to fall through for correctness. just keep going across block boundaries. this isn't optimal yet but it reduces a pile of A@1's already. Totals from 1389 (52.47% of 2647) affected shaders: CodeSize: 56385376 -> 56325776 (-0.11%); split: -0.13%, +0.03% -- this also fixes issues where the first instruction of a block is a SEND that has an unmet register dependency, since the old code was fundamentally broken. oops. lol. fixes dEQP-VK.compute.pipeline.workgroup_memory_explicit_layout.zero.uint8_t_array_to_uint_array_1 among many others. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:33 +00:00
Alyssa Rosenzweig	52224bb597	jay/lower_scoreboard: refactor no functional change, just reshuffling code for next commit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:33 +00:00
Alyssa Rosenzweig	3a7baf2cde	jay/lower_scoreboard: fix trivial scheduling Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:32 +00:00
Alyssa Rosenzweig	7ba6e9810a	jay: clarify development model Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:32 +00:00
Alyssa Rosenzweig	45d63539a6	jay: have proper UNDEF matches NIR's broken semantics but allows more opts later. just a rename here. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:32 +00:00
Alyssa Rosenzweig	c2911dd688	jay: fix comment Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:32 +00:00
Alyssa Rosenzweig	3d94ba1d20	jay: make indirect push data blow up more obviously fail to crash: dEQP-VK.spirv_assembly.instruction.compute.untyped_pointers.glsl_memory_model.basic_usecase.load.push_constant.int32 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:31 +00:00
Alyssa Rosenzweig	b10c0d95a8	jay: optimize pack_32_2x16_split(#0 , x) Kinda pointless but whatever. Totals from 10 (0.38% of 2647) affected shaders: Instrs: 6846 -> 6830 (-0.23%) CodeSize: 95728 -> 95520 (-0.22%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:31 +00:00
Alyssa Rosenzweig	5ebf0c9161	jay: elide atomic dests simd16 results. kinda noisy but obviously the right thing to do. Totals from 45 (1.70% of 2647) affected shaders: Instrs: 59182 -> 59194 (+0.02%); split: -0.11%, +0.14% CodeSize: 905200 -> 904752 (-0.05%); split: -0.17%, +0.12% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:31 +00:00
Alyssa Rosenzweig	b3fe01e2c1	jay: fix bfn with 0xffff constant awkward. Totals from 128 (4.84% of 2647) affected shaders: Instrs: 258121 -> 257970 (-0.06%); split: -0.07%, +0.01% CodeSize: 3662400 -> 3661792 (-0.02%); split: -0.14%, +0.12% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:30 +00:00
Alyssa Rosenzweig	c5cee5d973	jay: add JAY_DEBUG=noacc option can help when debugging RA. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:30 +00:00
Alyssa Rosenzweig	9dbaaecb74	jay: swap predication/acc pass order Lets us use more accumulators, I think this is well motivated. Saw this in a test shader. Totals from 242 (9.14% of 2647) affected shaders: Instrs: 1365060 -> 1365035 (-0.00%); split: -0.00%, +0.00% CodeSize: 20678592 -> 20680096 (+0.01%); split: -0.01%, +0.02% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41510>	2026-05-12 22:46:30 +00:00
Ian Romanick	907cc49c32	brw: Calcuate divergence before brw_from_nir Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We were previously assuming that potentially stale divergence data was valid. On some paths the register pressure estimator would recalculate this, but, as is obvious from the results, not always. v2: Add an assertion in brw_from_nir_emit_impl to ensure we don't end up in this situation again. v3: Call nir_divergence_analysis from brw_nir_lower_deferred_urb_writes. This fixes assertion failures (the assertion added in v2) in basically every graphics shader. The altnerative was to call it from brw_compile_vs, brw_compile_gs, and brw_compile_tes. shader-db: All Intel platformms had similar results. (Lunar Lake shown) total instructions in shared programs: 17050403 -> 17054033 (0.02%) instructions in affected programs: 296344 -> 299974 (1.22%) helped: 0 / HURT: 376 total cycles in shared programs: 876063126 -> 875817316 (-0.03%) cycles in affected programs: 78627328 -> 78381518 (-0.31%) helped: 91 / HURT: 276 LOST: 1 GAINED: 10 fossil-db: All Intel platformms had similar results. (Lunar Lake shown) Totals: Instrs: 913770429 -> 916075391 (+0.25%); split: -0.00%, +0.26% CodeSize: 14647414640 -> 14726176320 (+0.54%); split: -0.02%, +0.56% Cycle count: 102308091527 -> 102290664775 (-0.02%); split: -0.26%, +0.24% Spill count: 3469632 -> 3469124 (-0.01%); split: -0.08%, +0.07% Fill count: 5007038 -> 4998674 (-0.17%); split: -0.51%, +0.34% Max live registers: 192568853 -> 192595355 (+0.01%); split: -0.00%, +0.02% Max dispatch width: 48713168 -> 48712880 (-0.00%); split: +0.00%, -0.00% Non SSA regs after NIR: 140252767 -> 140253718 (+0.00%) Totals from 223099 (11.11% of 2007586) affected shaders: Instrs: 314077245 -> 316382207 (+0.73%); split: -0.01%, +0.75% CodeSize: 5335583824 -> 5414345504 (+1.48%); split: -0.06%, +1.54% Cycle count: 45868025821 -> 45850599069 (-0.04%); split: -0.58%, +0.54% Spill count: 2062649 -> 2062141 (-0.02%); split: -0.14%, +0.11% Fill count: 3343019 -> 3334655 (-0.25%); split: -0.76%, +0.51% Max live registers: 36762498 -> 36789000 (+0.07%); split: -0.02%, +0.09% Max dispatch width: 5542224 -> 5541936 (-0.01%); split: +0.03%, -0.03% Non SSA regs after NIR: 43727142 -> 43728093 (+0.00%) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> [v1] Fixes: `1bff4f93ca` ("brw: Basic infrastructure to store convergent values as scalars") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41370>	2026-05-11 21:03:19 +00:00
Caio Oliveira	d08d345686	brw: Remove references to SIMD4x2 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details In Gfx9 the enum value was changed to mean SIMD8 double precision, so drop the old unused enum. At least on Gfx9 there is an extension bit to set to use the old SIMD4x2 mode, we can recover if we ever need this in the future. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41457>	2026-05-11 20:16:02 +00:00
Iván Briano	2ad92e3ea4	anv/brw: handle FullyCoveredEXT Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Caleb Callaway <caleb.callaway@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38879>	2026-05-11 18:15:50 +00:00
Iván Briano	58006eaaa4	anv/brw: add conservative raster on/off to FS_CONFIG FullyCovered will need to know if conservative rasterization is enabled, so pass it on to the shader. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Caleb Callaway <caleb.callaway@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38879>	2026-05-11 18:15:50 +00:00

1 2 3 4 5 ...

5257 commits