fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 18:08:15 +02:00

Author	SHA1	Message	Date
Paulo Zanoni	ff5b909511	anv/sparse: bring back our (limited) support for depth/stencil Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The ambiguity of the Vulkan spec was clarified, and we don't need to support sparse depth/stencil with exactly the same number of samples as non-sparse. If you want to pass CTS, you'll need VK-GL-CTS commit 03976477f521 ("Don't require more than VK_SAMPLE_COUNT_1_BIT for non-color sparse resident images"). This is essentially a revert of `d5da6980d3` ("anv/sparse: don't support depth/stencil with sparse") and `7b337e214d` ("anv: remove dead code"). Thanks to Iván Briano for working with Khronos to get clarification on the spec and for implementing the VK-GL-CTS fix. Reviewed-by: Iván Briano <ivan.briano@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37423>	2026-05-07 23:47:52 +00:00
Paulo Zanoni	7eab94d542	intel/nir: fix sparse shadow comparison for BRW While Jay overwrites sparse_tex->op with the newer opcodes that only return red and the sparse stuff, BRW keeps using the original opcode of the cloned instruction, so it can't change def->num_components. This was not previously detectable since we did not have sparse enabled for depth/stencil on Anv for a while. A patch to re-enable that was proposed a while ago (MR !37423), never merged, but then a recent attempt to try to merge it (by me) detected this regression. Let's fix the regression first, then we can finally re-enable sparse depth/stencil support in Anv, hopefully. Fixes: `7468261d3d` ("intel/nir: Make intel_nir_lower_sparse work for either brw or jay") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37423>	2026-05-07 23:47:51 +00:00
Tapani Pälli	c540405ca3	anv: use INTEL_NEEDS_WA_14025112257 define for workaround Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41281>	2026-05-07 16:20:29 +00:00
Tapani Pälli	c381b4fdd4	intel/dev: update mesa_defs.json from workaround database This removes 18042479026 as we don't utilize BRW_AOP_MOV in compiler and adds missing xe2 entries for 14025112257. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41281>	2026-05-07 16:20:29 +00:00
Lionel Landwerlin	62b890046f	anv: remove old entrypoints Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40387>	2026-05-07 15:49:20 +00:00
Lionel Landwerlin	f123030dcd	anv: implement VK_KHR_device_address_commands Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40387>	2026-05-07 15:49:20 +00:00
Lionel Landwerlin	7adece7ce0	anv: fixup null address check Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40387>	2026-05-07 15:49:19 +00:00
Kenneth Graunke	2729b1608f	brw: Limit SIMD width based on NIR rather than first backend compile Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details I originally added this mechanism to have the first (SIMD8) compile note that certain features were in use which would prevent SIMD16/32 from compiling, so we could skip the work of trying those. But these days, there aren't many cases, and the ones we have are easily detectable based on the NIR. We can detect it earlier without even having to do the SIMD8 compile. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Kenneth Graunke	c5928d40ae	brw: Drop dead code from dispatch limit check for dual source blending We checked that ver is 11 or 12. It can't be >= 20. This is dead code. Dual source blending on Xe2 does not have native SIMD32 RT write message support, but SIMD splitting is currently lowering it to low/high SIMD16 message pairs when using SIMD32 dispatch. I'm not aware of any of the hardware errata from previous platform still applying. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Kenneth Graunke	599d26db00	brw: Set prog_data::dual_src_blend from NIR outputs written bitfield Simpler and set earlier. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Kenneth Graunke	afb97ff2af	brw: Switch FS outputs to semantic IO and FRAG_RESULT_DUAL_SRC_BLEND The new FRAG_RESULT_DUAL_SRC_BLEND option is easier to work with than looking for FRAG_RESULT_DATA0 with an index of 1. This also means we no longer care about the dual source blend index, and can just use the FRAG_RESULT location. That cascades to meaning we no longer have to store a tuple in driver_location. And, if we just need location, we can avoid populating that at all and use nir_io_semantics to get it. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Kenneth Graunke	fbaa5ad0c3	iris: Implement force_dual_color_blend_by_location via NIR We can just have iris look at its own program key and change the fragment shader output variable's location/index in the NIR. By doing this before lowering fragment shader outputs, the rest of the output lowering does the right thing, and the backend no longer has to consider hacks for broken OpenGL apps. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Calder Young	efc6a3053d	anv: Fix some usage flags not propagated to ISL for explicit layouts Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Some vulkancts tests rely on vkGetImageMemoryRequirements to return the same exact size after exporting and importing an image. This broke when we started adding padding to sampled surfaces to manage overfetch, because the texture usage flag does not get applied to the ISL surface when the image is recreated using an explicit layout. Fixes: `8d13628f7` ("isl: Add additional alignment/padding requirements to prevent overfetch") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41376>	2026-05-07 00:02:43 +00:00
Alyssa Rosenzweig	5636a57f60	jay/lower_scoreboard: use SYNC.allrd/allwr This collapses piles of silliness. Totals: CodeSize: 71626288 -> 70710000 (-1.28%) Totals from 1634 (61.73% of 2647) affected shaders: CodeSize: 66319376 -> 65403088 (-1.38%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:26 +00:00
Alyssa Rosenzweig	c1dc9d3b1a	jay/lower_scoreboard: be the sole emitter of SYNC this gets closer to something we can schedule and avoids some pointless syncs. Totals from 491 (18.55% of 2647) affected shaders: Instrs: 602994 -> 602946 (-0.01%) CodeSize: 9063888 -> 9015904 (-0.53%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:26 +00:00
Alyssa Rosenzweig	0885ed10f5	jay/lower_scoreboard: use .src annotations This is less heavy handed, avoiding unnecessary stalls after SENDs in a bunch of common cases. The stats (SIMD32) are: Totals: CodeSize: 70345392 -> 71674272 (+1.89%) Totals from 1774 (67.02% of 2647) affected shaders: CodeSize: 67359248 -> 68688128 (+1.97%) What's happening here is we are inserting extra SYNC.nop instructions in a bunch of cases for the .src preceding the eventual .dst. However, putting aside the i-cache impact for a moment, this is showing the optimization doing what it should (deferring dst syncs and inserting cheaper src syncs first). So this should be positive in reality despite the negative stat impact. The most hurt shaders are pooling up SYNC.nop's at the end of blocks due to local-only SWSB and lack of SYNC.allwr optimization. The latter is added later in this MR. The former is planned. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	130e724d5e	jay/lower_scoreboard: refactor SYNC.nop insertion for next commit Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	1ecd75a397	jay/lower_scoreboard: fix tracking for A@* and *@7 update the tracking with what we actually waited on, not what we ideally wanted to wait on. reduces extra annotations in some cases. SIMD32: Totals from 194 (7.33% of 2647) affected shaders: CodeSize: 14473840 -> 14469088 (-0.03%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	93edf9a3fd	jay/lower_scoreboard: refactor wait pipe code for next commit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	18e09858eb	jay/lower_scoreboard: elide more dependencies IGC does these optimizations and I think they should be safe given my mental model. Given a sequence like: r0 = add.f32 r1, r2 r1 = add.f32 r3, r4 Each ALU pipe is pipelined but in-order. Therefore, the second add cannot possibly complete before the first add, so it cannot write r1 before the first add reads r1, so we can elide the write-after-read dependency. That in term avoids a pipeline bubble between the two instructions. Ditto for write-after-write. Similarly if the distance is too great within an in-order pipe since there is a maximum pipeline length, it's not infinite. Note that if there was cross-pipe dependencies we do need the annotation since the pipes themselves are parallel. SIMD32: Totals from 58 (2.19% of 2647) affected shaders: CodeSize: 3316592 -> 3315056 (-0.05%); split: -0.05%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	e4dc161277	jay: assign accumulators post-RA Greedy post-RA substitution pass, similar to IGC's AccSubstitution pass. Stats together with the previous commits. SIMD16: Totals from 2209 (83.45% of 2647) affected shaders: Instrs: 2701029 -> 2696350 (-0.17%) CodeSize: 39166720 -> 40372272 (+3.08%); split: -0.36%, +3.44% SIMD32: Totals from 2211 (83.53% of 2647) affected shaders: Instrs: 4691165 -> 4641188 (-1.07%) CodeSize: 69365792 -> 69341616 (-0.03%); split: -0.50%, +0.47% The instruction count reduction is from RA shuffle code getting coalesced via accumulators. The code size changes are from: * Fewer moves from the instr count reduction (helped) * Smaller MADs encoded as MACs (helped) * Fewer SYNC.nop due to fewer scoreboarding annotations (helped) * Less compaction due to explicit accumulator operands (hurt) I expect significant cycle count changes from this but we don't have a cycle model wired up yet, so reading the assembly will have to do. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	8b324591d1	jay: move simd32 deswizzling to float pipe for more accumulator usage. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	712719a2ae	jay: do moves on the float pipe where possible this allows us to use accumulators more. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	6f2b1cece6	jay: model MAC Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Alyssa Rosenzweig	b6e88ab904	jay/to_binary: fix packing of simd-split accumulators Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41398>	2026-05-06 23:25:25 +00:00
Lionel Landwerlin	718a5d48b8	anv: add an option to disable push constant space reallocation Already called in genX(batch_emit_push_constants_alloc) above. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39584>	2026-05-06 22:12:39 +00:00
Lionel Landwerlin	a21da01994	anv: rename push constant allocation helper The name was confusing emission & allocation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39584>	2026-05-06 22:12:39 +00:00
Lionel Landwerlin	696163d0e2	anv/iris: stop using 3DSTATE_PUSH_CONSTANT_ALLOC_PS on Gfx12.5 According to documents linked in HSD 1209977789, the push constant allocation for PS stage is not applicable on Gfx12.5+ (removed). The documents says push constant data is fetched by SBE in URB. The HW must still parse the command and do nothing with it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39584>	2026-05-06 22:12:39 +00:00
Lionel Landwerlin	85c4c87a58	anv: group all performance drirc together Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39584>	2026-05-06 22:12:38 +00:00
Rhys Perry	ec59b59b97	nir: rename nir_src_parent_instr to nir_src_use_instr sed -i "s/nir_src_parent_instr/nir_src_use_instr/" `find ./ -type f` sed -i "s/nir_src_parent_if/nir_src_use_if/" `find ./ -type f` sed -i "s/nir_src_set_parent/nir_src_set_use/" `find ./ -type f` There are two kinds of "parent" in relation to a src/def: - the instruction where the def or src's def is defined - the instruction which the src is a part of and where the def is used Clarify that the parent here is where the src's def is used, not where it's defined. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41344>	2026-05-06 17:09:22 +00:00
Lionel Landwerlin	0d39d4e99e	anv: expose VK_KHR_maintenance11 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41334>	2026-05-06 11:21:28 +00:00
Samuel Pitoiset	9764225ff1	vulkan: replace VK_SHADER_CREATE_INDEPENDENT_SETS_BIT_MESA with the maint11 flag Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41377>	2026-05-06 10:43:56 +00:00
Lionel Landwerlin	fee5106b53	anv: add Gfx9 support VK_EXT_device_generated_commands This platform just needs a bit more care around vertex buffer state emission. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:52 +00:00
Lionel Landwerlin	afabf6e350	anv: add a device generated command debug option It prints out the constant of the generated commands. $ ANV_DEBUG=dgc-dump ./deqp-vk -n dEQP-VK.dgc.ext.compute.smoke.4_sequences_device_local_from_host_preprocess_state_same_universal_queue Test case 'dEQP-VK.dgc.ext.compute.smoke.4_sequences_device_local_from_host_preprocess_state_same_universal_queue'.. call from 0xffffeffeffe04694 0x0000000400000000: MI_STORE_DATA_IMM 0x10000403 0x00000178 0x00000004 0xffe047b8 0xffffeffe 0x0000000400000014: MI_BATCH_BUFFER_START 0x18800101 0x00000020 0x00000004 0x0000000400000020: MI_ARB_CHECK 0x02800100 0x0000000400000024: MEDIA_CURBE_LOAD 0x70010002 0x00000000 0x00000020 0x40000180 0x0000000400000034: GPGPU_WALKER 0x7105000d 0x00000000 0x00000000 0x00000000 0x40000003 0x00000000 0x00000000 0x00000001 0x00000000 0x00000000 0x0000004c 0x00000000 0x00000001 0x0000ffff 0xffffffff 0x0000000400000070: MEDIA_STATE_FLUSH 0x70040000 0x00000000 0x0000000400000078: MEDIA_CURBE_LOAD 0x70010002 0x00000000 0x00000020 0x40001400 0x0000000400000088: GPGPU_WALKER 0x7105000d 0x00000000 0x00000000 0x00000000 0x40000003 0x00000000 0x00000000 0x00000017 0x00000000 0x00000000 0x00000001 0x00000000 0x00000001 0x0000ffff 0xffffffff 0x00000004000000c4: MEDIA_STATE_FLUSH 0x70040000 0x00000000 0x00000004000000cc: MEDIA_CURBE_LOAD 0x70010002 0x00000000 0x00000020 0x40002680 0x00000004000000dc: GPGPU_WALKER 0x7105000d 0x00000000 0x00000000 0x00000000 0x40000003 0x00000000 0x00000000 0x00000001 0x00000000 0x00000000 0x00000001 0x00000000 0x000000d5 0x0000ffff 0xffffffff 0x0000000400000118: MEDIA_STATE_FLUSH 0x70040000 0x00000000 0x0000000400000120: MEDIA_CURBE_LOAD 0x70010002 0x00000000 0x00000020 0x40003900 0x0000000400000130: GPGPU_WALKER 0x7105000d 0x00000000 0x00000000 0x00000000 0x40000003 0x00000000 0x00000000 0x00000001 0x00000000 0x00000000 0x000000dc 0x00000000 0x00000001 0x0000ffff 0xffffffff Pass (Pass) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:52 +00:00
Lionel Landwerlin	50aee34651	anv: expose VK_EXT_device_generated_commands by default on Gfx12.5+ Prior generations are kept under experimental until we implement a more memory efficient preprocess buffer solution. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/14890 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/12380 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:51 +00:00
Lionel Landwerlin	e69062f8c9	anv: track generated commands work with perfetto Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:50 +00:00
Lionel Landwerlin	badcfc164d	anv: handle preprocess buffer creation on <= Gfx12.0 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:48 +00:00
Lionel Landwerlin	d1ef313466	anv: add barrier flags handling for preprocess buffers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:48 +00:00
Lionel Landwerlin	71732d79ac	anv: implement generated preprocess & execute Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:48 +00:00
Lionel Landwerlin	80bb2ddb77	anv: handle descriptor binding with DGC Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:47 +00:00
Lionel Landwerlin	08c5e2854a	anv: enable generation shader calls Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:47 +00:00
Lionel Landwerlin	5c3deebd6f	anv: allow simple shader spilling for complex ones Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:46 +00:00
Lionel Landwerlin	068351f848	anv: add unspecified internal kernel send count support Some kernel will be to large and potentially change too often to really have a consistent count. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:46 +00:00
Lionel Landwerlin	fb26ed6bf7	anv: add indirect command layout support Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:45 +00:00
Lionel Landwerlin	68885511d2	anv: add support for indirect execution set Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:45 +00:00
Lionel Landwerlin	35b0d3569e	anv: program relative push set offset for descriptor buffers device bindable shaders Up to now all push descriptor accesses where going through the binding table. That's not going to be the case anymore with descriptor buffers device bindable shaders. Those will do A64 messages to read the descriptor buffer (for example when build a bounded 64bit address for storage buffers, or 64bit image format atomic emulation, etc...) We need to have the offset relative to the push descriptor heap (internal_state_heap in this case). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:45 +00:00
Lionel Landwerlin	4960e103ef	anv: add a helper to flush the descriptors for indirect compute execution When we don't know what shader is executed. We'll still have the bind map from the indirect execution set. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:44 +00:00
Lionel Landwerlin	6f5d30c0a2	anv: add apply_layout support for device bindable shaders/pipelines We consider them like bindless stages (no binding table) as much as possible. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:44 +00:00
Lionel Landwerlin	af8c85b5bd	anv/apply_layout: use the resource index to compute descriptor buffer addresses Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:44 +00:00
Lionel Landwerlin	1281e2b9a0	anv/intel: add device generated commands shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31384>	2026-05-06 09:49:43 +00:00

1 2 3 4 5 ...

16038 commits