fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-28 09:40:21 +01:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	e3328dfa2f	brw: only initialize sample mask flag if needed This is a refinement of `7c129d9365` ("intel/brw/xe2+: Keep PS sample mask in the f1.0 register whether or not kill is used."). Rather than always insert this move, do so only when we'll actually read the register: for memory writes and for discards. This deletes an instruction from piles of fragment shaders. shader-db on LNL: total instructions in shared programs: 17134031 -> 17042706 (-0.53%) instructions in affected programs: 9065743 -> 8974418 (-1.01%) helped: 65045 HURT: 0 helped stats (abs) min: 1.0 max: 3.0 x̄: 1.40 x̃: 1 helped stats (rel) min: <.01% max: 50.00% x̄: 3.06% x̃: 1.64% 95% mean confidence interval for instructions value: -1.41 -1.40 95% mean confidence interval for instructions %-change: -3.10% -3.03% Instructions are helped. total cycles in shared programs: 885172098 -> 884835306 (-0.04%) cycles in affected programs: 590294230 -> 589957438 (-0.06%) helped: 53636 HURT: 4500 helped stats (abs) min: 2.0 max: 1126.0 x̄: 8.02 x̃: 4 helped stats (rel) min: <.01% max: 50.00% x̄: 1.24% x̃: 0.24% HURT stats (abs) min: 2.0 max: 7706.0 x̄: 20.77 x̃: 6 HURT stats (rel) min: <.01% max: 82.06% x̄: 1.09% x̃: 0.54% 95% mean confidence interval for cycles value: -6.15 -5.43 95% mean confidence interval for cycles %-change: -1.10% -1.02% Cycles are helped. LOST: 385 GAINED: 47 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38665>	2025-11-26 16:53:36 +00:00
Lionel Landwerlin	5324712952	anv: remove errors on format queries Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It's pretty spammy and since the whole purpose of queries is to report support, why bother with errors? Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38661>	2025-11-26 16:06:57 +00:00
Kenneth Graunke	3182deaae1	brw: Combine output stores for TCS outputs even when unlinked Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Otherwise we get a lot of individual x/y/z stores to tesslevels when we should really just be storing the whole thing at once. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:03 +00:00
Kenneth Graunke	7e02738b63	brw: Drop check for legacy tess levels from remap_patch_urb_offsets The newly rewritten remap_tess_levels_legacy will have already lowered anything it cares about to URB intrinsics. So the generic remapping pass won't see them, as it operates on generic input/output intrinsics. This also drops some of the callback boilerplate we needed temporarily. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:03 +00:00
Kenneth Graunke	d95a9714c2	brw: Rewrite legacy tess level remapping This unifies the dynamic (SSO) and fixed (linked together) versions. We emit piles of NIR as if we were doing the dynamic version, but replace the tess config field access with constant values. It all should optimize away back to something reasonable. We lower these directly to URB read/write intrinsics. It also rewrites the dynamic version to directly read/write the URB rather than going through temporaries. The old version was broken in that tessellation control shader invocations can technically use the shared output area for cross-invocation data sharing with barriers, although doing so using the built-in tesslevel patch outputs is very unlikely. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:03 +00:00
Kenneth Graunke	ee407481c2	brw: Switch to URB intrinsics for TCS inputs Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:02 +00:00
Kenneth Graunke	943b2acf02	brw: Switch to NIR URB intrinsics for TES inputs Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:01 +00:00
Kenneth Graunke	c0d69b2faf	brw: Switch to NIR URB intrinsics for TCS outputs Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:01 +00:00
Kenneth Graunke	9aff3cac3c	brw: Add infrastructure for lowering to URB intrinsics Based on earlier code by Lionel Landwerlin. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:00 +00:00
Kenneth Graunke	13acc889af	brw: Use io_sem.location instead of base to get varying slots Alyssa noted we can be using semantic IO here rather than relying on bases not having been remapped. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:59 +00:00
Kenneth Graunke	96d331766a	brw: Generalize read_attribute_payload_intel to handle more cases We were using this for indirect loads of the shader input thread payload, but there's no reason we can't use it for constant access too. In this case we can just MOV from the ATTR file directly without a special opcode that turns into MOV_INDIRECT later. We also allow it to load multiple components now. This is useful for say, returning vec4 pushed inputs. And, we allow it in more stages than just the fragment stage. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:59 +00:00
Kenneth Graunke	792762617a	brw: Rename read_attribute_payload_intel to load_attribute_payload_intel We're going to change the intrinsic to a load(...) which puts "load" in the name. Also, it's just more consistent with our usual terminology. We also rename the corresponding backend opcode so they remain matched. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:58 +00:00
Kenneth Graunke	0f7590af81	brw, anv, iris: Switch to reversed patch header layouts These are a ton more convenient. When the TCS and TES were linked together, the legacy layouts were a hassle, but didn't impose any significant cost. With unlinked TCS and TES, the legacy layouts involve significant runtime code for scrambling the data, whereas the reversed layouts are substantially less overhead. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:57 +00:00
Kenneth Graunke	7d1dfc3468	brw: Lower tesslevel vars to vectors even for unlinked TCS/TES st/nir lowers this for iris, and brw_link_shaders lowers this for anv, but for unlinked tessellation control / evaluation shaders, the lowering was not happening for TCS. Just do it unconditionally when lowering TCS outputs and TES inputs. This lets the remapping code just assume vectors all the time, rather than getting single component stores with nir_intrinsic_component set (which came from nir_lower_io lowering compact arrays). This also requires changes to the dynamic unlinked TCS/TES lowering to temporaries, which needs to use vectors rather than arrays with this change. That code is going away in future patches anyway, but this keeps it going for now to avoid interim breakage. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Kenneth Graunke	7736e693b1	brw: Pass devinfo into remap_patch_urb_offsets Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Kenneth Graunke	4dc6413de8	brw: Rename remap_non_header_patch_values to remap_patch_values See rationale in the previous patch. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Kenneth Graunke	2b51963b8c	brw: Remap tesslevels before other patch remapping We now call remap_tess_levels before remap_non_header_patch_urb_offsets. The latter already excludes tess levels anyway, so the order shouldn't matter. This paves the way for remap_tess_levels to skip handling some header values in certain cases, because with reversed layouts, many of them no longer need any special handling and we can just let the generic pass handle them. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Kenneth Graunke	e8669a8333	brw: Rework the tess level remapping interface Just have a single remap_tess_levels that does either the statically-known-primitive or the dynamic (unlinked) mode. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Kenneth Graunke	1995c879a9	brw: Flip the TESS_LEVEL_INNER/OUTER vue map slot assignments Our current legacy patch header layout handling doesn't actually care which is which slot, and remaps everything to its correct spot anyway. For using the newer "reversed" patch header layouts, it will be more convenient to have outer as slot 0, and inner as slot 1, as that just works with no special remapping needed for both quads and triangles (but unfortunately isolines are still a pain). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Kenneth Graunke	e5c1d00faf	brw: Pass devinfo to brw_nir_lower_tes_inputs This will be useful for using reversed patch header layouts. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Kenneth Graunke	a1c7ae9d15	brw: Implement URB handle intrinsics for TCS and TES stages Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Lionel Landwerlin	e290f9641d	brw: Implement load/store URB intrinsics These work the same regardless of stage. v2 (Ken): Rebase, move from mesh to all stages, add reorderable load variant, allow channel masks to be non-constant even on Xe2. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:55 +00:00
Lionel Landwerlin	0d8ee4ed23	brw: use default builder for urb handle adjustment Be consistent with lowering that happens after, so that it gets a full vector register and can stride into it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:55 +00:00
Felix DeGrood	406e6e094a	anv/rt: avoid out of bound access by clamping global id Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Felix DeGrood <felix.j.degrood@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `cff9d82c66` ("anv/rt: rewrite encode.comp for better performance") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38636>	2025-11-25 19:59:42 +00:00
Lionel Landwerlin	b1e74a1bb1	anv: shrink image opaque data Noticed renderdoc complaining about our size : RDOC 692028: [18:08:18] vk_core.cpp(2272) - Warning - VkPhysicalDeviceDescriptorBufferPropertiesEXT.imageCaptureReplayDescriptorDataSizeis too large at 32 (must be <= 16), can't support capture of VK_EXT_descriptor_buffer Since we only need 2 pointers (main + private), we can shrink this to 16bytes. The 1/2 planes have a relative offset from the base. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38625>	2025-11-25 19:38:53 +00:00
Lionel Landwerlin	7e72d392d7	brw: switch to load_(pixel_coord\|frag_coord_z\|frag_coord_w) intrinsics Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Allows us to better determine if we need Z/W payload delivery. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36392>	2025-11-25 15:50:48 +00:00
Lionel Landwerlin	6d3be477ab	anv: enable application shader printfs with debug option Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38638>	2025-11-25 14:18:42 +00:00
Lionel Landwerlin	4c3bf04dd0	anv: enable mesh/task shader hashes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38638>	2025-11-25 14:18:42 +00:00
Kenneth Graunke	e49418744a	brw: Set extended_bindless_surface_offset to true for Gfx12.5+ anv sets device->uses_ex_bso on verx10 >= 125 and then sets the compiler->extended_bindless_surface_offset to that. iris was not setting anything. However, LSC_ADDR_SURFTYPE_SS used for scratch on Gfx12.5 is bindless, and Xe2 uses ExBSO for all UGM access, so we need to be setting this. Just set it in the compiler so both drivers have it set. Fixes piglit arb_tessellation_shader-tes-gs-max-output -small -scan 1 50 on iris. Fixes: `80c89909f3` ("brw: fixup immediate bindless surface handling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38645>	2025-11-25 08:21:30 +00:00
Simon McVittie	b860ae309a	vulkan: Optionally share one JSON manifest per driver between architectures If the library_path is just a basename like `libvulkan_lvp.so`, then we can share the same JSON manifest like `lvp_icd.json` between all of the architectures, like we already do for Vulkan layers. The library will be looked up in the dynamic linker's default search path in this case, and in practice will be found in `${libdir}`. This is how the Mesa's EGL driver and Vulkan layers work, how Mesa is packaged in Debian 13, and also how the Nvidia proprietary driver works; it makes installation simpler for distros, especially on multiarch systems like Debian and the freedesktop.org SDK. However, if we want a separate manifest per architecture in order to be able to write the full path into it, we still need per-architecture filename disambiguation like `lvp_icd.x86_64.json`. We presumably still want a separate per architecture on Windows, because the concept of a single monolithic `${libdir}` is less common there, and it can also be helpful during development when setting `$VK_DRIVER_FILES` to force the use of a specific driver installed in a non-default location. Use the following parameter to passed to vk_icd_gen: '--icd-lib-path', vulkan_icd_lib_path, '--icd-filename', icd_file_name, output : 'virtio_icd.' + vulkan_manifest_suffix, and the output is passed by '--out', '@OUTPUT@', so we can detect vulkan_manifest_per_architecture from the --out parameter in script. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13745 Signed-off-by: Simon McVittie <smcv@collabora.com> Co-authored-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37314>	2025-11-24 19:05:57 +00:00
Lionel Landwerlin	d51c0b8988	brw: fix SS surfaces usage In `80c89909f3` ("brw: fixup immediate bindless surface handling") I forgot that we have a special usage for the only _SS surface (the scratch surface). Because it's only delivered in the 31:10 bits of R0 and because we want to minimize the amount of shader instructions for scratch messages, the surface offset in shifted right by the driver to align things properly for the 31:6 extended descriptor format. This is unfortunately incompatible with the full 32bit format of ExBSO. So this surface type currently cannot be considered bindless. We might revisit later if we start using _SS surfaces for other things. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `80c89909f3` ("brw: fixup immediate bindless surface handling") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38618>	2025-11-24 16:12:27 +00:00
Hyunjun Ko	01de6ac134	vulkan/video: Fix H.265 long-term reference handling Without these fixes, H.265 streams using long-term references would fail to decode correctly as the decoder wouldn't distinguish between short-term and long-term reference frames. Fixes: `896f95a37e` ("vulkan/video: fix h265 decoding with LT enabled.") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38571>	2025-11-24 12:58:02 +00:00
Lionel Landwerlin	8f9acc0150	brw: compute final copy propagation resulting source Fixes this test on Xe2+: INTEL_DEBUG=no32 ./deqp-vk -n dEQP-VK.spirv_assembly.instruction.maint9_vectorization.bit_field_u_extract.result_v16i-base_v16i-offset_s64u-count_s16i Generate invalid code for that platform: and(16) g37<1>UW g65<16,4,4>UW 0x000fUW { align1 1H I@5 }; ERROR: Invalid register region for source 0. See special restrictions section. Several helpers like has_subdword_integer_region_restriction() do not see the final type of the source, so compute it early. Maybe new_src could be used in more cases. Being conservative for now. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38548>	2025-11-24 10:14:32 +00:00
Andy Hsu	2ee6b4d96e	intel/decoder: make libvulkan_intel to depend on stub decoder when buildtyle=release. The libvulkan_intel does not need the decoder when buildtype=release where the debugging is disabled. However, the decoder implementation is decided by the dep_expat which may be turned on by like -Dtools=intel and the binary size of libvulkan_intel increase unexpectedly. This change creates the stub dependency and decide the exact decoder dependency of libvulkan_intel by the buildtype. Test: meson setup builddir -D build-tests=true -Dbuildtype=release --reconfigure && ninja -C builddir && cd builddir && meson test Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Andy Hsu <hwandy@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38569>	2025-11-24 16:40:02 +08:00
Dylan Baker	1737638c98	meson: make dep_lua a disabler There are cases where the freedreno `crashdec` program will not be built, but will still be used. By making dep_lua a disabler, we move closer to being able to have those tests automatically disabled when crashdec isn't built. Acked-by: Rob Clark <rob.clark@oss.qualcomm.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38579>	2025-11-21 21:48:57 +00:00
Lionel Landwerlin	7c193ffef1	anv: put more readable PIPE_CONTROL reasons Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38542>	2025-11-21 21:45:18 +02:00
Lionel Landwerlin	6d98fdb3ec	anv: avoid pipe control reason tracking in emit_pipe_control This is the last level layer of emission, we want the tracking to be added above that, so that when flushing of previously accumulated reasons happens, another pointless reason isn't added. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38542>	2025-11-21 21:45:18 +02:00
Kenneth Graunke	3160c516ca	brw: Delete input_slots_valid from brw_wm_prog_key Nothing in the compiler seems to use this anymore. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38556>	2025-11-20 14:10:39 -08:00
Kenneth Graunke	868377e4c7	brw: Delete program_string_id from brw program keys This is strictly a GL thing. iris can manage it in its own program keys without polluting the compiler with stuff nobody else cares about. We can also drop a lot of padding that was introduced in commit `a18835a9ca` which doesn't appear to be necessary. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38556>	2025-11-20 14:10:38 -08:00
Lionel Landwerlin	07b7de35cc	anv: Wa_18040903259 only applies to RCS when in GPGPU mode Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Sadly this probably won't change anything in terms of perf as the CCS engine has a bunch of other restrictions. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `243c01c703` ("anv/iris: implement Wa_18040903259") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38484>	2025-11-20 08:17:35 +00:00
Marek Olšák	9e339f4b32	nir: rename nir_lower_indirect_derefs -> nir_lower_indirect_derefs_to_if_else_trees This describes better what it does. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38471>	2025-11-20 05:42:11 +00:00
Sagar Ghuge	f0aad5bd7e	anv: Convert indirect to direct dispatch Saves unncessary PC and stall during encode phase. Thanks to Felix for pointing out that CCS always needs a CS stall once we add a pipe control, that will kill the performance for BVH construction. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38513>	2025-11-20 03:11:55 +00:00
Felix DeGrood	15ffe6c524	anv/perfetto: include all pc reasons Up to 4 reasons can be saved and displayed. Previously, we were only displaying one reason for Perfetto. Co-authored-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38500>	2025-11-20 02:53:53 +00:00
Connor Abbott	3b3954e2b8	util/glsl2spirv: Use better glslang flag for -Olib --create-unlinked also creates entrypoints for the functions, and obviates the need to create a dummy entrypoint. This is one step closer to removing glsl2spirv and aligns us with other users of glslang. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38088>	2025-11-20 02:14:50 +00:00
Lionel Landwerlin	6fe2035065	anv: bump maxTessellationControlTotalOutputComponents Our backend compiler explains the limits as : 32 bytes for the patch header (tessellation factors) 480 bytes for per-patch varyings (a varying component is 4 bytes and gl_MaxTessPatchComponents = 120) 16384 bytes for per-vertex varyings (a varying component is 4 bytes, gl_MaxPatchVertices = 32 and gl_MaxTessControlOutputComponents = 128) In all that's : * 32 patches * 128 components (counting tessellation factors) * 32 vertices * 128 components 8192 total components. I'm not sure why the limit was set so low, maybe leftover from older platforms? Bump the limit to something like competition. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38523>	2025-11-19 22:44:54 +00:00
Hyunjun Ko	9a9342e4aa	anv/video: handling segmentations features for vp9 decoding Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38418>	2025-11-19 15:54:47 +00:00
Hyunjun Ko	1479e1ef82	anv/video: rework for handling alternative quantizer for vp9 decoding. including prep-work for handling segmentation features. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38418>	2025-11-19 15:54:47 +00:00
Lionel Landwerlin	049adad4f4	anv: split non binding related intrinsics from apply_layout Trying to cut down apply_pipeline_layout a bit and also allowing some reuse for a new extension. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38495>	2025-11-19 10:27:27 +00:00
Felix DeGrood	198537039a	anv/rt: reduce writes to block_incr_and_start_prim Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36937>	2025-11-18 22:41:21 +00:00
Felix DeGrood	768bb1c7a3	anv/rt: multithread writing of invalid leaves Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36937>	2025-11-18 22:41:21 +00:00

1 2 3 4 5 ...

15033 commits