fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 21:38:18 +02:00

Author	SHA1	Message	Date
Icenowy Zheng	d445a950fd	docs/zink: add maint5 to the list of required extensions Zink now unconditionally requires VK_KHR_maintenance5 to run. Add it to the required extension list of Zink documentation. Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41729>	2026-05-21 16:08:52 +00:00
Icenowy Zheng	8dd2f46b86	zink: move maint5 to gl21_baseline capabilities set VK_KHR_maintenance5 is now unconditionally required by Zink. Move it to the gl21_baseline capabilities set to make it required by every Zink profile. Signed-off-by: Icenowy Zheng <zhengxingda@iscas.ac.cn> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41729>	2026-05-21 16:08:52 +00:00
Alyssa Rosenzweig	3a447b4065	jay: use new fs payload variable more blow up harder if we try to load stuff in the wrong stage Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	ababf12b04	jay: add a hack until we munge barycentrics dynamically Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	a56aa9547b	jay: Call constant folding before collecting FS outputs Fixes "multiple stores to the same location" assertions in tests like dEQP-VK.pipeline.monolithic.color_write_enable_maxa.cwe_after_bind.attachments3_more0 In that case, the stores were actually to different locations, but some constant additions hadn't been folded into the location field yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	23884ee02c	jay: Prohibit JAY_STRIDE_8 for EXPAND_QUAD No idea why we're getting a stride 8 here, but we can't handle it. Fixes baldurs_gate_3.vk.foz --graphics-pipeline-range 2248 2249. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	a9525f4b44	jay: hack for sample position Adding this to the list of design constraints for the next RA rework. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	1e31be0e52	jay: fix omask on single sample dEQP-GLES31.functional.shaders.sample_variables.sample_mask.discard_half_per_pixel.singlesample_rbo Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	6a02e228bc	jay: Implement load_fs_config_intel We could lower this in to load_push_data_intel in NIR, but it's trivial, and probably less code just to implement it directly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	3d91cb9d1e	jay: Implement coverage mask This is the actual MSAA coverage mask. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	35622f165f	jay, nir: Make a dispatch_mask_intel intrinsic jay is trying to use the fragment shader dispatch mask for helper invocation lowering, but it was using load_sample_mask_in for that (now load_coverage_mask_intel). But this isn't the MSAA coverage mask, the two are different payload fields. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	0f3a311591	jay: Implement sample position Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	a590500802	jay: Add a GPR_FROM_UGPRS opcode Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	4555cd23c6	jay: Set Dispatch GRF Start Register in jay_setup_payload() We want it to be set to wherever the push constants ended up. Setting it close to the setup_payload_push() call makes this easier. We'll also be adding some extra UGPRs for the fragment shader payload soon, and the partitioning code will just have one big UGPR partition for payload fields, push constants, and general purpose UGPRs, so it really won't know how to do this very well without duplicating a bunch of information. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	0670b40013	jay: Add comments summarizing the PS thread payload layout The documentation is large and hard to follow due to all the optional fields and the SIMD16 vs. SIMD32 split for barycentrics. This quick summary helps clarify what fields exist, which are split for SIMD32 or kept together, and which pairs of registers are involved for splits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	6c142f7edc	jay: Implement sample mask writes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	49299050ea	jay: Implement fragment shader stencil writes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	b01d286083	jay: Move render target store payload/descriptor construction to backend Constructing the render target store payload is more complex than we can reasonably handle at the NIR level. The main reason is that samplemask and stencil are packed 16-bit and 8-bit parameters, respectively, which are intermixed with other values that are 32-bit. In SIMD32 mode, the packed sub-32-bit values take up fewer registers than normal values. Currently we also don't specialize the NIR for each FS dispatch width, and we can't construct the message descriptor without knowing it. So, we alter nir_intrinsic_store_render_target_intel to take each of the expected parameters - colour, depth, stencil, samplemask, src0_alpha, and discard predicate. We construct the payloads and descriptors in the backend. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	bc22a37d98	jay: schedule for pressure Implement a simple pre-RA bottom-up list scheduler with the goal of decreasing register pressure. On Xe2, this significantly reduces spilling. SSA form allows us to estimate register demand cheaply and accurately, which theoretically [1] gives this algorithm the two Hippocratic properties: 1. Shaders with low register pressure are unaffected. 2. Register pressure can only be decreased, never increased. In other words: first, do no harm. The heuristic itself is very simple: greedily choose instructions that decrease liveness using a backwards list scheduler. This is far from optimal! But thanks to the above properties, even a heuristic that picked random instructions would be a win overall - by construction, we can only ever win. In other words: this scheduler is your older brother powering off the game console any time he's about to lose a game, maintaining a 100% win rate. [1] In reality, neither property is strictly satisfied due to the messy details of mapping our clean logical model onto Intel's many weird physical register files. Nevertheless, the algorithm is well-motivated and the empirical results on Xe2 are excellent. SIMD16: Totals: Instrs: 2754194 -> 2753957 (-0.01%); split: -0.23%, +0.22% CodeSize: 41094768 -> 41092768 (-0.00%); split: -0.23%, +0.23% Number of spill instructions: 1724 -> 1129 (-34.51%) Number of fill instructions: 1912 -> 1119 (-41.47%) Totals from 168 (6.35% of 2647) affected shaders: Instrs: 850994 -> 850757 (-0.03%); split: -0.75%, +0.73% CodeSize: 12825680 -> 12823680 (-0.02%); split: -0.74%, +0.73% Number of spill instructions: 1724 -> 1129 (-34.51%) Number of fill instructions: 1912 -> 1119 (-41.47%) SIMD32: Totals: Instrs: 4688858 -> 4557800 (-2.80%); split: -3.53%, +0.74% CodeSize: 70177200 -> 68214816 (-2.80%); split: -3.53%, +0.74% Number of spill instructions: 50316 -> 45795 (-8.99%); split: -9.56%, +0.57% Number of fill instructions: 51526 -> 45075 (-12.52%); split: -13.23%, +0.71% Totals from 819 (30.94% of 2647) affected shaders: Instrs: 3810182 -> 3679124 (-3.44%); split: -4.35%, +0.91% CodeSize: 57044000 -> 55081616 (-3.44%); split: -4.35%, +0.91% Number of spill instructions: 49264 -> 44743 (-9.18%); split: -9.76%, +0.58% Number of fill instructions: 50182 -> 43731 (-12.86%); split: -13.58%, +0.73% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	81e21a8756	jay: factor jay_op_(starts,ends)_block queries Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	e72ffb0046	jay: annotate pure sends for scheduling, CSE, etc Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	c069b7e47c	jay/opt_propagate: avoid branching on poison logically it doesn't matter because we'll bail on a later check, but this is still UB and therefore releases nasal demons. i am jealous of Faith's Rust compilers. there, I said it. ==107281== Conditional jump or move depends on uninitialised value(s) ==107281== at 0x7069768: propagate_backwards (jay_opt_propagate.c:327) ==107281== by 0x7069768: jay_opt_propagate_backwards (jay_opt_propagate.c:367) ==107281== by 0x7058960: jay_compile (jay_from_nir.c:2677) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	4b0c3f5c32	jay/lower_scoreboard: add asserts on key bounds if these are botched you get UB (-: Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	4c97493b69	jay/lower_scoreboard: handle accumulator hazard Challenging to hit but fixes dEQP-GLES3.functional.shaders.swizzle_math_operations.vector_multiply.mediump_ivec4_wzyx_zyxw_fragment with scheduling changes. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	9a68101bc2	jay/liveness: drop redundant source filtering Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	9b68b4e7a1	jay/liveness: speed up physical CFG merging on top of scheduler changes, compile-time of shaders/blender/1017.shader_test: Difference at 95.0% confidence -0.00173202 +/- 0.00116931 -0.791537% +/- 0.532384% Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	1b50d3eed2	jay/liveness: remove pointless bitset init dup initializes it. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	5da3b57605	jay: insert simd32 deswizzle in a dedicated pass we don't actually need the DESWIZZLE pseudo instruction, and the pseudo op complicates pre-RA scheduling. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Alyssa Rosenzweig	47c6601d5e	jay: relax fragment payload layout this isn't optimal but it should unblock bring up. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Co-authored-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:46 +00:00
Kenneth Graunke	cb75c9f962	brw: Lower sample_pos for non-per-sample shaders in NIR We generalize the sample_mask_in lowering to handle this too. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41688>	2026-05-21 15:34:45 +00:00
Mike Blumenkrantz	58308b7580	zink: add another anv/adl flake Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41728>	2026-05-21 14:58:52 +00:00
Mike Blumenkrantz	64be743fbe	zink: fix unbinding vertex buffers from null VS state num_bindings doesn't encompass all the bound buffers if bindings reuse the same buffers Fixes: `f8c96df9d2` ("zink: move vbo unbind to bind_vertex_state") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41728>	2026-05-21 14:58:52 +00:00
Samuel Pitoiset	07754c960a	radv: validate drirc option names at compile time This would prevent any typos or if something is backported incorrectly in the future. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41700>	2026-05-21 14:26:28 +00:00
Samuel Pitoiset	ccb669a05f	util: add very basic way to validate drirc files This just checks for option names that don't exist. This is something that already happened in the past with RADV. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41700>	2026-05-21 14:26:28 +00:00
Samuel Pitoiset	e685f8d6aa	radv/ci: cleanup list of expected failures Triage invalid tests to make it easier to see real failures. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41717>	2026-05-21 14:03:22 +00:00
Samuel Pitoiset	91cf0a6e6d	radv: use the new generation script for drirc Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41634>	2026-05-21 12:57:43 +00:00
Samuel Pitoiset	bf787fd91b	radv: rename few drirc options for consistency So that the option name matches everywhere. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41634>	2026-05-21 12:57:41 +00:00
Lucas Francisco Fryzek	7b84183201	util/u_trace: Don't use empty initializer list Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Modify empty initializer list to use a zero initializer so we aren't relying on the gnu extension. Fixes: `690d9b0d00` ("util/u_trace: Rework resource management") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41715>	2026-05-21 12:18:07 +00:00
Benjamin Gaignard	0e91cf34af	pan/format: Advertise support for AFBC(32x8,sparse) Some video decoders spit out AFBC(32x8,sparse) images. Advertise support for this modifier so we can import such images. Signed-off-by: Benjamin Gaignard <benjamin.gaignard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40886>	2026-05-21 11:50:16 +00:00
Daniel Stone	4203b770b4	pan/afbc: Properly validate format/parameter combinations AFBC has a number of superblock sizes and valid layouts, with differing combinations allowed. It's quite clear that 16x16 is ambivalent about whether or not block-split mode is used. 64x4 prohibits block-split mode, and 32x8 either requires or prohibits it depending on the format. Add proper handling so we filter out the right combinations. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40886>	2026-05-21 11:50:16 +00:00
Daniel Stone	c5415c7aed	pan/mod: Reorder linear modifier checks As with AFBC, split the checks into 'can this ever work' vs. 'can this work for what I want it to?'. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40886>	2026-05-21 11:50:16 +00:00
Daniel Stone	4364f5352a	pan/mod: Protect against no usage flags for 64k This doesn't happen now, but it will later. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40886>	2026-05-21 11:50:15 +00:00
Daniel Stone	0fb529053b	pan/afbc: Code motion for split modifier queries Reorder the AFBC modifier checking code to first query whether the device can do the mode at all, then to query whether or not the format + modifier is supported at all, then to query whether the specific image usage is OK, then to query whether or not it's optimal. This will come in useful later when we want to split modifier queries into: can this modifier ever be used, what can this modifier be used for, and is this the best modifier for this usage. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40886>	2026-05-21 11:50:15 +00:00
Karol Herbst	48ec237bf9	zink: proper advertise keep_weak_ffma for fp16 Zink never sets the fp16 screen cap, but the caps also are initialized after zink_screen_init_compiler. So just replicate the check to be safe here. Fixes: `2146e09962` ("zink: keep ffma_weak and use GLSLstd450Fma for it") Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41722>	2026-05-21 10:50:32 +00:00
squidbus	b1c72223af	kk: Support VK_KHR_unified_image_layouts Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Metal has no concept of image layouts, and we don't care about them. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41613>	2026-05-21 09:59:38 +00:00
squidbus	f52f7bf8d5	kk: Support attachment feedback loop extensions Metal GPU image optimization is disabled for attachment feedback usage since it causes some CTS flakes. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41613>	2026-05-21 09:59:38 +00:00
squidbus	2a119991f6	kk: Support VK_KHR_shader_fma Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41692>	2026-05-21 09:36:35 +00:00
squidbus	33ce3040e6	kk: Support VK_EXT_host_image_copy Metal provides straightforward ways to copy an image to/from memory, and image-to-image copies can be implemented by chaining them. Note that host copy of combined depth-stencil is not supported, as Metal does not allow CPU copy for these formats. Additionally, GPU optimized contents are not allowed with host image copy usage; CTS directly initializes the raw memory of optimized images to random invalid data, which appears to decompress differently on GPU vs CPU and fail. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41714>	2026-05-21 02:06:46 -07:00
squidbus	76125cb7af	kk: Separate linear and GPU optimized image layout properties `linear` controls whether the created image is in linear layout, and `optimized_layout` controls only the `allowGPUOptimizedContents` Metal property. Reviewed-by: Aitor Camacho <aitor@lunarg.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41714>	2026-05-21 02:06:46 -07:00
Valentine Burley	8cc3ca6231	turnip/ci: Add nightly Android CTS job Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The job runs the following modules with ANGLE: - CtsGraphicsTestCases - CtsNativeHardwareTestCases - CtsSkQPTestCases Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41440>	2026-05-21 06:21:02 +00:00

1 2 3 4 5 ...

222881 commits