fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-08 09:08:10 +02:00

Author	SHA1	Message	Date
Adam Jackson	ca9f95f783	i915/corm: add shrink_vectors to reduce cross-register vec construction nir_opt_shrink_vectors narrows vector widths when only a subset of components are consumed, which eliminates unnecessary cross-register vec constructions. Follow with copy_prop + dce to clean up. shader-db (I915_FS=nir): 272/403 compiled, 4388 alu shader-db (I915_FS=both): nir won 272 (26 identical, 1 tied, 239 better, 6 only), 18 TGSI, 113 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	6e38f519e0	i915/corm: add late scalarization as variant dimension Some shaders produce better code when fully scalarized after optimization: vec3(a, b, 1.0) feeding a dot product creates a cross-register vec construction, but scalarizing the fmul exposes 1.0*1.0 to constant folding, eliminating the vec entirely. Other shaders are worse fully scalar because corm's vec construction handles same_reg vecs at zero cost. Add late_scalar as a variant dimension so the multi-variant framework picks whichever is better per shader. shader-db (I915_FS=nir): 254/403 compiled, 4063 alu shader-db (I915_FS=both): nir won 254 (26 identical, 1 tied, 221 better, 6 only), 36 TGSI, 113 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	bfbba3f3b4	i915/corm: extend ALU consumer fusion to ffma and 3-input ops Generalize the binary ALU consumer fusion to handle ffma (MAD) and any number of inputs. When a vec's only consumer is an ALU op where the vec occupies one source slot and all other sources are single registers, emit the ALU op per register group with partial writemasks. shader-db (I915_FS=nir): 252/403 compiled, 3618 alu shader-db (I915_FS=both): nir won 252 (26 identical, 1 tied, 219 better, 6 only), 38 TGSI, 113 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	2a2ef36852	i915/corm: use utemp for vec texcoord to avoid phase boundaries When a vec construction feeds a single-use tex instruction, use a utemp (unpreserved temp) instead of an R-file temp for the vec dest. R-file temps written by ALU trigger tex indirect phase boundaries when read by subsequent texld instructions; utemps do not. Preserve the utemp allocation across i915_release_utemps so the value survives until the texld consumer reads it. shader-db (I915_FS=nir): 249/403 compiled, 3495 alu shader-db (I915_FS=both): nir won 249 (26 identical, 1 tied, 217 better, 5 only), 40 TGSI, 114 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	800375c3c4	i915/corm: fuse binary ALU ops through vec construction When a vec's only consumer is a binary ALU op (MUL, ADD, MIN, MAX) and the other source is a single register, emit the ALU op directly per register group with partial writemasks instead of building the vec with MOVs and then applying the ALU op. For example, fmul(vec4(a.zw, b.xy), tex) becomes: MUL oC.xy, a.zw, tex MUL oC.zw, b.xy, tex instead of: MOV R.xy, a.zw MOV R.zw, b.xy MUL oC, R, tex shader-db (I915_FS=nir): 248/403 compiled, 3544 alu shader-db (I915_FS=both): nir won 248 (26 identical, 1 tied, 218 better, 3 only), 39 TGSI, 116 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	595b9850e0	i915/corm: multi-component ALU dest folding in vec construction Generalize the scalar ALU dest fold to handle multi-component results. When a vec source covers contiguous channels with identity swizzle and all uses of the source come from this vec, patch the ALU instruction to write directly into the vec's dest register with the appropriate channel mask. This eliminates redundant MOVs for patterns like vec4(%result.x, %result.y, %result.z, %other) where %result is a vec3 ALU output — the ALU instruction now writes directly to the output register's .xyz channels. shader-db (I915_FS=nir): 233/403 compiled, 3328 alu shader-db (I915_FS=both): nir won 233 (26 identical, 1 tied, 203 better, 3 only), 54 TGSI, 116 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	879cf1bd74	i915/corm: direct output writes for vec construction When a vec2/3/4 construction has a single consumer that is a store_output, emit the partial-writemask MOVs directly to the output register (oC/oD) instead of building in a temp and copying. Skip this for the same_reg case which already collapses to a zero-instruction swizzle alias. Also fix TGSI-win reporting: preserve loser stats before freeing so corm_win_reason shows the actual delta instead of "only". shader-db (I915_FS=nir): 214/403 compiled, 3231 alu shader-db (I915_FS=both): nir won 214 (26 identical, 1 tied, 184 better, 3 only), 73 TGSI, 116 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	17b699ae24	i915/corm: use hardware swizzle constants for vec3/4 load_const When a vec3 or vec4 load_const has all components from {0, 1, -1}, emit a swizzle+negate alias using the hardware ZERO/ONE source constants instead of allocating a constant register via i915_emit_const4fv. This matches what the TGSI path does through its immediate recognition. Saves a constant register slot per qualifying load_const and converts 32 of 33 previous ties to identical output. shader-db (I915_FS=nir): 212/403 compiled, 3227 alu shader-db (I915_FS=both): nir won 212 (26 identical, 1 tied, 182 better, 3 only), 75 TGSI, 116 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	4885eb02ab	i915/corm: optimize seq/sne against zero to 2 instructions When opts.seq_sne_opt is set and one operand is zero, use the abs+compare pattern: x == 0 becomes -abs(x) >= 0, and x != 0 becomes -abs(x) < 0. This reduces from 3 ALU instructions to 2. This is a variant dimension because it can increase register pressure in some shaders; the multi-variant framework picks the winner per-shader. shader-db (I915_FS=nir): 212/403 compiled, 3228 alu shader-db (I915_FS=both): nir won 212 (26 identical, 16 tied, 167 better, 3 only), 75 TGSI, 116 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	75ef9f6d65	i915/corm: deferred constant allocation with per-channel UBO mixing When opts.deferred_const is set, defer scalar load_const allocation until the consuming ALU instruction. coalesce_constants resolves deferred constants with a preferred register hint so co-occurring constants pack into the same CONST register, avoiding dual-constant MOV penalties. Also fix per-channel UBO constant flags: mark only the actually loaded channels with I915_CONSTFLAG_USER_CH(comp+i) instead of setting all user bits, leaving free channels for immediates. shader-db (I915_FS=nir): 210/403 compiled, 3202 alu shader-db (I915_FS=both): nir won 210 (26 identical, 16 tied, 165 better, 3 only), 77 TGSI, 116 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	28400d7c6c	i915/corm: add fsat folding, output dest folding, and vec dest folding Add the def_csr mechanism: track the instruction cursor position for each single-instruction SSA def so we can retroactively patch it. fsat folding: when a single-use SSA def feeds into fsat, fold A0_DEST_SATURATE into the previous instruction instead of emitting a separate MOV. Output dest folding: when store_output consumes a single-use temp, patch the previous instruction to write directly to the output register (OC/OD). Includes vec look-through for the identity-swizzle case where a vec was collapsed to a register alias. Vec dest folding: single-use scalar ALU results feeding a vec component get patched to write directly into the vec dest register. shader-db (I915_FS=nir): 209/403 compiled, 3157 alu shader-db (I915_FS=both): nir won 209 (26 identical, 16 tied, 164 better, 3 only), 78 TGSI, 116 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	ed934ae17b	i915/corm: add vec construction optimizations Optimize vec2/3/4 construction with several strategies: - same_reg: when all components come from the same register, collapse to a single swizzle+negate alias (zero instructions) - const-swizzle piggybacking: ZERO/ONE sources share a MOV with real-register sources from the same register - per-channel negate: preserve per-channel negate bits through the swizzle path instead of emitting separate negation shader-db (I915_FS=nir): 130/403 compiled, 1614 alu shader-db (I915_FS=both): nir won 130 (26 identical, 16 tied, 86 better, 2 only), 156 TGSI, 117 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	9a88dff9f4	i915/corm: add temporary register tracking and dead temp release Track the last use of each SSA def and release temporary registers as soon as they're dead, allowing more aggressive temp reuse. Includes the register aliasing fix for mov/fneg: these ops alias the def to the source register, so the source's lifetime must be extended to match the def's to prevent premature release. shader-db (I915_FS=nir): 52/403 compiled, 231 alu shader-db (I915_FS=both): nir won 52 (26 identical, 16 tied, 9 better, 1 only), 233 TGSI, 118 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	c6be264c2d	i915/corm: add copy propagation before algebraic late nir_lower_io introduces identity vec instructions that block nir_opt_algebraic_late's fadd+fmul→ffma fusion pattern. Adding nir_opt_copy_prop + nir_opt_dce before algebraic cleans these up, enabling ffma fusion and eliminating redundant vec construction. shader-db (I915_FS=nir): 48/403 compiled, 62 alu shader-db (I915_FS=both): nir won 48 (26 identical, 16 tied, 6 better), 236 TGSI, 119 neither Assisted-by: Claude	2026-05-07 16:12:48 -04:00
Adam Jackson	3d3b557780	i915/corm: add NIR fragment shader backend Bare-minimum NIR-to-i915 fragment shader compiler with multi-variant framework, lexicographic cost metric (ALU > tex_indirect > temps > consts), and winner-tagged stats output. Stats are emitted once per shader with [NIR] or [TGSI] tag indicating which backend won. The corm_compile_opts struct is available for multi-variant compilation (currently empty). Assisted-by: Claude shader-db (I915_FS=nir): 48/403 compiled, 65 alu shader-db (I915_FS=both): nir won 48 (26 identical, 16 tied, 6 better), 236 TGSI, 119 neither	2026-05-07 16:12:48 -04:00
Adam Jackson	4087e3b7ef	i915: refactor constant and compiler infrastructure for NIR backend Rework the constant register encoding to track per-channel ownership (I915_CONSTFLAG_IMM / I915_CONSTFLAG_USER_CH) instead of whole-register flags, allowing compiler immediates and user UBO values to share a constant register on different channels. Update emit_constants() to handle per-channel source selection at upload time. Add i915_emit_const1f_prefer() for packing scalar constants into a preferred register, reducing dual-constant conflicts. Move i915_program_error(), i915_use_passthrough_shader(), and negate() from i915_fpc_translate.c to shared locations (i915_fpc_emit.c / i915_fpc.h) so the NIR backend can use them. Fix i915_emit_texld() to use a utemp instead of a temp register for texcoord swizzle copies, avoiding unnecessary tex indirect phase boundaries. Add a fallback path that copies to a utemp when bumping the phase count would exceed the hardware limit. Add nr_alu_insn, nr_tex_insn, nr_tex_indirect, nr_temps, writes_z, and input semantic tracking to i915_fragment_shader for use by the NIR backend's multi-variant comparison framework. Assisted-by: Claude	2026-05-07 16:12:12 -04:00
Adam Jackson	badd52c7d5	i915: fix incorrect texcoord optimization in TGSI compiler i915_fpc_optimize_mov_before_tex replaces a MOV+TEX pair with a direct TEX from the input register when the MOV copies from the input with identity swizzle. But it only checked the source swizzle, not the MOV's writemask. When the MOV wrote a subset of the channels the TEX reads (e.g., MOV TEMP.y, IN.y before a 2D TEX that reads XY), the optimization replaced the TEX source with IN, losing the X channel that was set by a different MOV. This caused incorrect texture sampling coordinates in shaders with multi-MOV texcoord construction (blur filters, shadow maps, etc.). Fix: verify the MOV's dest writemask covers all channels the TEX instruction reads before applying the optimization. Assisted-by: Claude	2026-05-07 16:09:50 -04:00
Adam Jackson	b1e709384b	i915: emit passthrough for empty TGSI fragment shaders The TGSI compiler rejected empty fragment shaders (num_instructions == 1, just TGSI_END) as errors. Instead, emit a passthrough program. Assisted-by: Claude	2026-05-07 16:09:22 -04:00
Adam Jackson	5e1ada315c	i915: improve shader-db stats reporting Report actual instruction counts (alu+tex) instead of program dwords/3, add a separate "alu" field for the 64-instruction bottleneck metric, and fix "temps" to use the actual temp register count instead of util_last_bit (highest register number). Before: "69 inst, 2 tex, 3 tex_indirect, 4 temps, 5 const" After: "21 instructions, 19 alu, 2 tex, 2 tex_indirect, 16 temps, 3 const" Assisted-by: Claude	2026-05-07 16:09:18 -04:00
Adam Jackson	2f5686e212	i915: pass NIR to draw instead of pre-converted TGSI draw_vs.c already handles the non-native-integer NIR→TGSI conversion internally, so i915 doesn't need to do it. keep nir_lower_point_size (i915-specific lowering) and pass the result to draw as NIR. Assisted-by: Claude	2026-05-07 16:09:07 -04:00
Samuel Pitoiset	470897f946	radv: allow DGC+multiview by default Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details It's now allowed in Vulkan. Fixes: `e47d584fed` ("radv: re-introduce DGC+multiview support and enable it for vkd3d-proton only") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41395>	2026-05-07 17:08:22 +00:00
Skyth	ce4e54f7a0	spirv2dxil: Replace UAV_FENCE_THREAD_GROUP usage with UAV_FENCE_GLOBAL. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41425>	2026-05-07 16:51:25 +00:00
Tapani Pälli	c540405ca3	anv: use INTEL_NEEDS_WA_14025112257 define for workaround Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41281>	2026-05-07 16:20:29 +00:00
Tapani Pälli	c6f503bed6	iris: use INTEL_NEEDS_WA_14025112257 define for workaround Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41281>	2026-05-07 16:20:29 +00:00
Tapani Pälli	c381b4fdd4	intel/dev: update mesa_defs.json from workaround database This removes 18042479026 as we don't utilize BRW_AOP_MOV in compiler and adds missing xe2 entries for 14025112257. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41281>	2026-05-07 16:20:29 +00:00
Lionel Landwerlin	62b890046f	anv: remove old entrypoints Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40387>	2026-05-07 15:49:20 +00:00
Lionel Landwerlin	f123030dcd	anv: implement VK_KHR_device_address_commands Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40387>	2026-05-07 15:49:20 +00:00
Lionel Landwerlin	7adece7ce0	anv: fixup null address check Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40387>	2026-05-07 15:49:19 +00:00
Lionel Landwerlin	3ffca66b3c	vulkan/runtime: fix invalid address flags value for CmdCopyBufferToImage2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `a8e49be9d9` ("vulkan/runtime: add implementation of older entrypoints using KHR_DAC") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40387>	2026-05-07 15:49:19 +00:00
Icenowy Zheng	5d2cc50247	pvr: add dri options used by common WSI code Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The common Mesa Vulkan WSI code checks some DRI options. Add them to the option list of the PVR driver. Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41197>	2026-05-07 15:33:01 +00:00
Icenowy Zheng	41ed38615b	pvr: prohibit clang-format from touching the dri options list The DRI options list is formatted specically and clang-format cannot handle it properly. Disable clang-format for this snippet. Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41197>	2026-05-07 15:33:01 +00:00
hmtheboy154	88f5e5986b	pvr: add support for driconf for the Vulkan driver Bringing force_vk_vendor as the first option, force_vk_devicename will be added later Signed-off-by: hmtheboy154 <buingoc67@gmail.com> Reviewed-by: Simon Perretta <simon.perretta@imgtec.com> [Icenowy: rebased on top of main] Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41197>	2026-05-07 15:33:01 +00:00
Christoph Pillmayer	109af1b077	pan/kmod: Fix uninitialized timestamp info The kernel looks at drm_panthor_timestamp_info::flags, so it can't be uninitialized. Fixes: `302127fe` ("pan/kmod: Add timestamp uapi support") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41418>	2026-05-07 13:37:25 +00:00
Faith Ekstrand	4714395eb8	pan/bi: Drop lower_index_to_offset from preprocess Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Ryan Mckeever <ryan.mckeever@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41320>	2026-05-07 13:11:08 +00:00
Faith Ekstrand	dd2d30656d	panfrost: Handle pre-Valhall images and texel buffers in lower_res_indices There's no point in having these as separate passes that live in the compiler. We already have lower_res_indices(), which is panfrost's equivalent to panvk's descriptor lowering. We can just do it there. Reviewed-by: Ryan Mckeever <ryan.mckeever@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41320>	2026-05-07 13:11:08 +00:00
Faith Ekstrand	d0bdd18212	panfrost: Prefix valhall bits of lower_res_indices Reviewed-by: Ryan Mckeever <ryan.mckeever@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41320>	2026-05-07 13:11:08 +00:00
Faith Ekstrand	4412f3bad7	panfrost: Take texture/sampler_index into account in lower_res_indices We currently rely on nir_lower_tex_options::lower_index_to_offset but there's really no reason for this. Our pan_nir_res_handle() helper can already take both an immediate and a dynamic index. Reviewed-by: Ryan Mckeever <ryan.mckeever@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41320>	2026-05-07 13:11:08 +00:00
Faith Ekstrand	0a69efb22b	panvk: Add MAX_VS_ATTRIBS to image indices in panvk_nir_lower_descriptors It's only a couple lines of code since we're already doing this for UBOs. It doesn't need to be a separate pass. Reviewed-by: Ryan Mckeever <ryan.mckeever@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41320>	2026-05-07 13:11:08 +00:00
Faith Ekstrand	c73d4e14f9	pan/nir/tex: Support full index+offset Previously, we only supported one of the index or the offset source and relied on lower_index_to_offset to ensure we only had one or the other. However, now that we're doing things in NIR, it's trivial to support the full index+offset form. Reviewed-by: Ryan Mckeever <ryan.mckeever@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41320>	2026-05-07 13:11:08 +00:00
Pierre-Eric Pelloux-Prayer	61acf0e781	radeonsi: add tests subfolder and move AMD_TEST code inside And move the exit(0) code to the si_tests function. Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:21 +02:00
Pierre-Eric Pelloux-Prayer	61ae8f60d1	radeonsi/gfx: move static inline helpers to si_gfx.h Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:19 +02:00
Pierre-Eric Pelloux-Prayer	b833aeb9de	radeonsi/gfx: remove unnecessary u_stub usage Everything inside the gfx folder isn't built when HAVE_GFX_COMPUTE isn't present so we don't need to stub these methods. Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:17 +02:00
Pierre-Eric Pelloux-Prayer	4c080ae32e	radeonsi: move function prototypes from si_pipe.h to si_gfx.h Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:15 +02:00
Pierre-Eric Pelloux-Prayer	714d3eb0b4	radeonsi: move more code to gfx subfolder Anything related to shaders, compute, mesh, nir should be inside this folder. Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:09 +02:00
Pierre-Eric Pelloux-Prayer	79b09571bb	radeonsi: move all multimedia files to mm Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:08 +02:00
Pierre-Eric Pelloux-Prayer	e4def8a61c	radeonsi: add si_context.c And move out most si_context code to the new file. Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:06 +02:00
Pierre-Eric Pelloux-Prayer	b2db3e1ddc	radeonsi: add si_gfx_context.c and move code from si_pipe.c Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:04 +02:00
Pierre-Eric Pelloux-Prayer	a335f4be7a	radeonsi/gfx: move code from si_get to si_gfx_screen These functions can be moved to the gfx subfolder and made static. Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:18:02 +02:00
Pierre-Eric Pelloux-Prayer	d1c57f742e	radeonsi/gfx: add si_gfx_screen.c And move code specific to gfx/compute from radeonsi_screen_create_impl there. ac_init_llvm_once has to stay in si_pipe.c because it has to be called very early to avoid conflicts with u_queue initialisation. Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:17:59 +02:00
Pierre-Eric Pelloux-Prayer	5f56a0e057	radeonsi: add si_resource_copy_buffer Same as si_resource_copy_resource except it only supports buffers. Also make sure that si_compute_clear_copy_buffer doesn't do anything when has_gfx_compute is false. Reviewed-by: David Rosca <david.rosca@amd.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41133>	2026-05-07 14:17:56 +02:00

1 2 3 4 5 ...

222246 commits