fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-06-10 09:58:18 +02:00

Author	SHA1	Message	Date
Konstantin Seurer	c9b60cfa4f	radv: Move a whole bunch of debug/profiling related into a subdir For slightly better separation of debug and driver code. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41044>	2026-05-29 10:59:38 +00:00
Konstantin Seurer	23677605f2	radv: Move debug options to radv_instance.h Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41044>	2026-05-29 10:59:38 +00:00
Frank Binns	c501f3b4c9	pvr: re-enable {EXT,KHR}_index_type_uint8 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This was originally disabled by `a22ad99bdd` ("pvr: set device features/props/extensions to Vulkan 1.0 minimums (unless implemented)") in order to concentrate efforts on passing "base" Vulkan conformance before layering on additional functionality. The driver is now Vulkan 1.2 conformant. As the functionality is already implemented, simply enable the extension. Signed-off-by: Frank Binns <frank.binns@imgtec.com> Reviewed-by: Ella Stanforth <ella@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41859>	2026-05-29 09:56:28 +01:00
Samuel Pitoiset	76fdc606c8	ac/nir,radv: lower task payload to zeroes when the mesh shader has no task Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The Vulkan spec allows this to be used in a mesh shader as long as it's not accessed, so it can be eliminated. This fixes dEQP-VK.mesh_shader.ext.misc.payload_not_accessed. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41828>	2026-05-29 07:44:40 +02:00
Faith Ekstrand	932ee0f806	etnaviv: Remove f32_to_f16_fallback() in favor of float16::F16 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41375>	2026-05-29 05:13:24 +00:00
Faith Ekstrand	b60694b91e	compiler/rust: Add a float16 wrapper This adds an F16 struct which provides a 16-bit float type using Mesa's existing half-precision support internally. Right now, it only contains the basics but it could be expanded if needed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41375>	2026-05-29 05:13:24 +00:00
Faith Ekstrand	7e2f41bd8f	nouveau: Don't build NAK tests on Android Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41375>	2026-05-29 05:13:24 +00:00
Caio Oliveira	d6ce874220	anv: Use empty_vs_input for default VERTEX_ELEMENT_STATE Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41790>	2026-05-29 04:48:20 +00:00
Caio Oliveira	c9dceab876	anv: Fill VERTEX_ELEMENT_STATE before further emissions genX(batch_emit_vertex_input) reserves 3DSTATE_VERTEX_ELEMENTS and then writes into that reserved memory. Any later anv_batch_emit() may allocate a new batch and finalize the previous one, running the valgrind defined-memory check over it. Fill the draw-parameter and dynamic VERTEX_ELEMENT_STATE entries before emitting 3DSTATE_VF_INSTANCING. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41790>	2026-05-29 04:48:20 +00:00
Timothy Arceri	26280cbc23	mesa: flush bitmap cache when scissor box changes The glBitmap calls will flush the cache when scissors are enabled or disabled but doesn't handle a change in the scissor box itself. Fixes: `33ca53ac79` ("st/mesa: fix the glBitmap cache wrt FS, scissor, and clamp color changes") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/work_items/15505 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41825>	2026-05-29 03:38:55 +00:00
Gurchetan Singh	eb17e7b102	util: rust: minor fixups * Use little-endian since all hardware is basically little-endian * Fix dead_code warning on non-Linux platforms Reviewed-by: David Gilhooley <djgilhooley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41754>	2026-05-29 03:07:52 +00:00
Gurchetan Singh	5be5cca792	util: rust: sync error handling fixes from downstream Useful for parse ints and using the ? operator. Reviewed-by: David Gilhooley <djgilhooley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41754>	2026-05-29 03:07:52 +00:00
Val Packett	10620bdf21	util: rust: Add a way to create a Tube from an existing OwnedFd Signed-off-by: Val Packett <val@invisiblethingslab.com> Signed-off-by: Gurchetan Singh <gurchetan.singh.foss@gmail.com> Reviewed-by: David Gilhooley <djgilhooley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41754>	2026-05-29 03:07:52 +00:00
Val Packett	4f13941e44	util: rust: Support detecting socket file descriptors Useful for Pipewire passthrough. Signed-off-by: Val Packett <val@invisiblethingslab.com> Signed-off-by: Gurchetan Singh <gurchetan.singh.foss@gmail.com> Reviewed-by: David Gilhooley <djgilhooley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41754>	2026-05-29 03:07:52 +00:00
Val Packett	118e48402c	util: rust: align API with real eventfd capabilities Linux eventfds contain a 64-bit value which can be increased by arbitrary numbers, and waiting returns a numeric value that consumers might need to actually read. Also, reading/waiting does mutate kernel state, so make it &mut self like reading on std::fs::File is. Signed-off-by: Val Packett <val@invisiblethingslab.com> Signed-off-by: Gurchetan Singh <gurchetan.singh.foss@gmail.com> Reviewed-by: David Gilhooley <djgilhooley@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41754>	2026-05-29 03:07:51 +00:00
Mike Blumenkrantz	1f866164d7	zink: be more conservative about query pool sizing Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details 500 can be pretty big if lots of queries are in flight, especially if all the queries are TIMESTAMP. instead, start smaller and scale dynamically up to the previous threshold Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41874>	2026-05-29 02:46:35 +00:00
Danylo Piliaiev	c5fb1c3265	freedreno/a7xx: Don't force enable IJ_LINEAR_PIXEL for FragFace/FragCoord Apparently GRAS_CL_INTERP_CNTL has two fields FACENESS and CENTERRHW which allows us to not enable IJ_LINEAR_PIXEL input, which can improve performance in trivial cases by ~50%. Mirrors Turnip change. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41848>	2026-05-29 01:15:47 +00:00
Danylo Piliaiev	cbd00c6675	tu/a7xx: Don't force enable IJ_LINEAR_PIXEL for FragFace/FragCoord Apparently GRAS_CL_INTERP_CNTL has two fields FACENESS and CENTERRHW which allows us to not enable IJ_LINEAR_PIXEL input, which can improve performance in trivial cases by ~50%. Found via gpu-ratemeter bench: vk.pix.noaa.1flat.face Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41848>	2026-05-29 01:15:47 +00:00
Dylan Baker	4381ac9a91	intel/brw: Add assert for error case Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Coverity notices that there is an error case where `nir_get_io_data_src_number` could return `-1`, and that is then used to index into an array. Given that that is an exceptional case, we can just assert here. CID: 1681480 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40146>	2026-05-28 23:50:39 +00:00
Sid Pranjale	825df24023	nvk: Implement VK_EXT_shader_atomic_float Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41867>	2026-05-28 23:04:32 +00:00
Paulo Zanoni	c5942b4efb	brw: evict memory for workgroup scope in Xe2 and newer Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details On Xe2 and Xe3, the flushing is necessary due to aliasing of TGM data in L1 memory (HSD 14020414266). On newer platforms, it is necessary for proper post-format data conversion handling (HSD 22020984324). See the Instruction_Fence page (63969) for documentation on the fact that the threadgroup scope ignores flushes. Thanks to Francisco Jerez and Kenneth Graunke on their help for this patch. v2: restrict the flushing to TGM (Lionel). Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40732>	2026-05-28 22:07:01 +00:00
Dhruv Mark Collins	23f94c692c	freedreno: Update A6XX_PC_MODE_CNTL definition and values This register seems to be fairly critical on A7XX for vertex processing performance, and was set to an unoptimal value for the A730/A735/A740 which has now been updated to a value that maximizes performance and aligns with the proprietary driver. Fixes #15411 Signed-off-by: Dhruv Mark Collins <mark@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41451>	2026-05-28 21:44:42 +00:00
Job Noorman	360650d189	ir3/spill: extract child intervals for live-in reloads When reloading live-ins, child intervals need to be extracted to ensure we can add live-in phi nodes for them. Fixes asserts with spillall for a bunch of ray_query and ray_tracing_pipeline CTS tests: src/freedreno/ir3/ir3_spill.c: add_live_in_phi: Assertion `entry' failed. Signed-off-by: Job Noorman <jnoorman@igalia.com> Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41756>	2026-05-28 21:19:01 +00:00
Danylo Piliaiev	727a30f4c7	tu: Change of disable_fs state should force depth state recalculation tu6_build_depth_plane_z_mode depends on state.disable_fs. Fixes: `be481e6615` ("tu: Disable FS in certain cases even if FS is not empty") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41856>	2026-05-28 20:47:31 +00:00
Danylo Piliaiev	930f7f4a1b	tu: Start/End occlusion query should force depth state recalculation tu6_build_depth_plane_z_mode has a dependency on occlusion_query_may_be_running. Fixes: `8f5d433840` ("tu: Occlusion query counting should happen after FS that kills") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41856>	2026-05-28 20:47:31 +00:00
José Roberto de Souza	9f4358ee77	intel/dev: Add a Xe2+ table of URB min and max entries Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Some values were wrong, so here adding the whole table with all fixed values. Just to make easier to read and compare I have added all shader stages to XEHP_URB_MIN_MAX_ENTRIES. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41789>	2026-05-28 20:27:23 +00:00
José Roberto de Souza	c1bb5e101e	intel/dev: Use URB mesh/task min/max values in intel_device_info Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41789>	2026-05-28 20:27:23 +00:00
José Roberto de Souza	45d9c2c14b	intel/dev: Add URB max entries values Right now this value is not use but it will in the next patch. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41789>	2026-05-28 20:27:23 +00:00
José Roberto de Souza	0cc89ca03a	intel/dev: Remove unused intel_get_device_info_for_build() function Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41789>	2026-05-28 20:27:23 +00:00
Eric Engestrom	97391328a3	drirc: move 00-$drv-defaults.conf to each driver's folder Makes more send to put them there, and it means that modifying them will only trigger the CI jobs for that driver and not everything. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41866>	2026-05-28 19:55:05 +00:00
Eric Engestrom	3d10616389	util/meson: simplify list of per-driver drirc files Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41866>	2026-05-28 19:55:05 +00:00
Danylo Piliaiev	6de00252ff	tu: Fix TU_CMD_DIRTY_DRAW_STATE value collision Fixes: `789e765161` ("turnip, ir3: Use shader for vertex input count") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41855>	2026-05-28 19:29:54 +00:00
Georg Lehmann	dea444f80f	nir/deref: consider atomics that store derefs as complex use src[1] or src[2] would mean that the atomic uses the deref as data for the op, we only want to allow address source uses. Fixes: `bb311ce370` ("nir: Allow atomics as non-complex uses for var-splitting passes") Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41818>	2026-05-28 18:58:33 +00:00
Nanley Chery	6d0b1ebd3d	intel/isl: Define a CMF for ASTC formats These formats are not supported natively on gfx20+. However, with a driconf option enabled, we do create surfaces with these formats and use them for transfer and decompression operations. Provide a CMF for these formats to avoid hitting the unreachable in isl_get_render_compression_format(). Fixes: `27d515772e` ("intel/isl: Replace mc_format with aux_format") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/15547 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41830>	2026-05-28 18:31:00 +00:00
Samuel Pitoiset	f7321be05a	radv/amdgpu: stop deduplicating winsys Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This is no longer needed because allocated memory is tracked separately now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41805>	2026-05-28 18:08:54 +00:00
Samuel Pitoiset	a0a9f9dda1	radv/amdgpu: rework tracking allocated memory for budget Deduplicating the winsys just for budget looks more like a hack than a real implementation. Reworking tracking allocated memory to remove the dedup. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41805>	2026-05-28 18:08:54 +00:00
Alyssa Rosenzweig	feb9ac168b	jay: allow npot operands in RA As long as we round up the /alignments/ in RA, and pad to power-of-two when calculating partitions (trivially true now, this informs future work though), this is fine. SIMD16: Totals from 1001 (37.82% of 2647) affected shaders: Instrs: 1897734 -> 1896157 (-0.08%); split: -0.25%, +0.16% CodeSize: 28330256 -> 28315472 (-0.05%); split: -0.30%, +0.25% Number of spill instructions: 1003 -> 999 (-0.40%) Number of fill instructions: 990 -> 986 (-0.40%) SIMD32: Totals from 1230 (46.47% of 2647) affected shaders: Instrs: 3284649 -> 3277437 (-0.22%); split: -1.18%, +0.96% CodeSize: 48977696 -> 48907376 (-0.14%); split: -1.10%, +0.96% Number of spill instructions: 41004 -> 40582 (-1.03%); split: -1.05%, +0.02% Number of fill instructions: 39298 -> 38572 (-1.85%); split: -1.91%, +0.06% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:33 +00:00
Alyssa Rosenzweig	30c392fc55	jay: simplify uniformity checks Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:33 +00:00
Alyssa Rosenzweig	a39a4b189e	jay: renumber reg files predictably Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:33 +00:00
Alyssa Rosenzweig	ac00e526ea	jay: gate early EOT code behind =strict Works fine on Xe2 Linux and fixes us for Xe3. simd16: Totals from 2635 (99.55% of 2647) affected shaders: Instrs: 2722355 -> 2719307 (-0.11%) CodeSize: 40641680 -> 40605472 (-0.09%) simd32: Totals from 2629 (99.32% of 2647) affected shaders: Instrs: 4052325 -> 4049578 (-0.07%); split: -0.07%, +0.00% CodeSize: 60360016 -> 60329392 (-0.05%); split: -0.05%, +0.00% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:33 +00:00
Alyssa Rosenzweig	0c376dde89	jay: hide MAD->MAC behind !JAY_DEBUG=strict in case this comes back to bite us. nice to keep our nonconformances programatically triaged. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:32 +00:00
Alyssa Rosenzweig	b2edda3037	jay: allow SIMD1 imageStore() Totals: Instrs: 2756797 -> 2755967 (-0.03%); split: -0.04%, +0.01% CodeSize: 41155232 -> 41141520 (-0.03%); split: -0.04%, +0.00% Totals from 130 (4.91% of 2647) affected shaders: Instrs: 40095 -> 39265 (-2.07%); split: -2.46%, +0.39% CodeSize: 618032 -> 604320 (-2.22%); split: -2.42%, +0.20% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:32 +00:00
Alyssa Rosenzweig	31e1e98ae3	jay: avoid overflow affinities with large UGPR vecs Totals: Instrs: 2759295 -> 2759035 (-0.01%); split: -0.01%, +0.00% CodeSize: 41189376 -> 41185024 (-0.01%); split: -0.01%, +0.00% Totals from 186 (7.03% of 2647) affected shaders: Instrs: 422705 -> 422445 (-0.06%); split: -0.09%, +0.03% CodeSize: 6313712 -> 6309360 (-0.07%); split: -0.09%, +0.03% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:32 +00:00
Alyssa Rosenzweig	67e3ee6fa9	jay: allow simd32 gl_SamplePosition this should work fine now. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:31 +00:00
Alyssa Rosenzweig	c78c2e03f4	jay: remove send split hack The new partitioning code behaves sensibly here now. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:31 +00:00
Alyssa Rosenzweig	04236b3912	jay: merge partition blocks The way we construct partitions, it is tricky to avoid empty blocks sometimes leading to mergeable adjacent blocks. Simplify this at runtime. Stats are kind of a mixed bag but this is the obvious right thing to do. simd16: Totals: Instrs: 2752919 -> 2752284 (-0.02%); split: -0.03%, +0.01% CodeSize: 41105312 -> 41096640 (-0.02%); split: -0.03%, +0.01% Totals from 236 (8.92% of 2647) affected shaders: Instrs: 740015 -> 739380 (-0.09%); split: -0.12%, +0.04% CodeSize: 11076496 -> 11067824 (-0.08%); split: -0.11%, +0.04% simd32: Totals: Instrs: 4121085 -> `4121638` (+0.01%); split: -0.01%, +0.02% CodeSize: 61394672 -> 61404720 (+0.02%); split: -0.00%, +0.02% Totals from 145 (5.48% of 2647) affected shaders: Instrs: 1104897 -> 1105450 (+0.05%); split: -0.02%, +0.07% CodeSize: 16845728 -> 16855776 (+0.06%); split: -0.02%, +0.08% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:31 +00:00
Alyssa Rosenzweig	598931f653	jay: rewrite partition handling Jay's novel SSA-based register allocator relies on a fixed partition of Intel GRFs mapping to logical GPRs. Previously, Jay used a simple partitioning scheme, which was good enough for simple compute and fragment shaders, but has both limitations preventing new feature bring-up and performance issues. Here we rewrite the Jay partitioning code at the heart of the Jay RA in order to lift these restrictions and allow fully flexible partitions. This should be easier to reason about, fix a bunch of issues around simd32 payloads, enable better performance, etc. The # of stride 16 GRFs reserved is halved in simd32 mode here to match how multisampling stuff works, which explains the large simd32-only instruction count reduction. While churning all this code, I took the opportunity to break off jay_partition.c... I think that is better organized and the diff was garbage otherwise. SIMD16: Totals from 2189 (82.70% of 2647) affected shaders: Instrs: `2702159` -> 2670951 (-1.15%); split: -1.41%, +0.26% CodeSize: 40296128 -> 39850304 (-1.11%); split: -1.40%, +0.30% SIMD32: Totals from 2373 (89.65% of 2647) affected shaders: Instrs: 4559418 -> 4072897 (-10.67%); split: -10.77%, +0.10% CodeSize: 68185488 -> 60635616 (-11.07%); split: -11.17%, +0.09% Number of spill instructions: 44069 -> 44055 (-0.03%) Number of fill instructions: 43292 -> 43278 (-0.03%) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:31 +00:00
Alyssa Rosenzweig	1860e7af30	jay: drop unneeded #include Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:30 +00:00
Alyssa Rosenzweig	fc3fb1c3f9	jay/validate_ra: use jay_def_stride higher level than jay_gpr_to_stride Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:30 +00:00
Alyssa Rosenzweig	805fe361a6	jay: stop asking for stride=4 ugpr's that doesn't make any sense, and will assert with later changes. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41808>	2026-05-28 16:23:30 +00:00

1 2 3 4 5 ...

223266 commits