fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 16:18:06 +02:00

Author	SHA1	Message	Date
Iván Briano	58006eaaa4	anv/brw: add conservative raster on/off to FS_CONFIG FullyCovered will need to know if conservative rasterization is enabled, so pass it on to the shader. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Caleb Callaway <caleb.callaway@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38879>	2026-05-11 18:15:50 +00:00
Kenneth Graunke	2729b1608f	brw: Limit SIMD width based on NIR rather than first backend compile I originally added this mechanism to have the first (SIMD8) compile note that certain features were in use which would prevent SIMD16/32 from compiling, so we could skip the work of trying those. But these days, there aren't many cases, and the ones we have are easily detectable based on the NIR. We can detect it earlier without even having to do the SIMD8 compile. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Kenneth Graunke	c5928d40ae	brw: Drop dead code from dispatch limit check for dual source blending We checked that ver is 11 or 12. It can't be >= 20. This is dead code. Dual source blending on Xe2 does not have native SIMD32 RT write message support, but SIMD splitting is currently lowering it to low/high SIMD16 message pairs when using SIMD32 dispatch. I'm not aware of any of the hardware errata from previous platform still applying. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Kenneth Graunke	599d26db00	brw: Set prog_data::dual_src_blend from NIR outputs written bitfield Simpler and set earlier. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41122>	2026-05-07 08:29:40 +00:00
Lionel Landwerlin	fab6f84126	brw: make the program key available on pass_tracker Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40631>	2026-04-03 12:17:01 +00:00
Iván Briano	fd556e54f6	brw: do not omit RT writes if dual_src_blend is on Dual source blending when one of the sources is not written to leaves those values undefined, but the other should still be valid. By omitting unwritten outputs, we ended up not writing anything at all for the case that OUT1 is written to but OUT0 is undefined. Fixes new CTS tests: dEQP-VK.pipeline..blend.dual_source.undefined_output.first Cc: mesa-stable Signed-off-by: Iván Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40357>	2026-03-19 23:38:40 +00:00
Kenneth Graunke	4a9aa3ecc4	brw: Combine brw_assign_*_urb_setup() into one function They all do exactly the same thing, except that GS multiplies by an extra factor, and TCS has urb_read_length == 0 so it skips one line. No need for four copies. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40328>	2026-03-12 21:40:37 +00:00
Kenneth Graunke	9933882182	brw: Purge source_depth_to_render_target This was used for Gfx4-5. Since then, we're just passing around a boolean that nobody wants. Even if someone did, a better plan is to just check nir->info directly. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40328>	2026-03-12 21:40:37 +00:00
José Roberto de Souza	91c5744e25	intel/brw: Use computed push constants size in brw_assign_urb_setup() It was already computed in brw_shader::assign_curb_setup() so we can use it in brw_assign_urb_setup(). There was a mismatch between assign_curb_setup() and brw_assign_urb_setup() when push_sizes were not multiple of REG_SIZE, the first one was aligning every push_sizes before sum it, while brw_assign_urb_setup() was only aligning the sum of all push_size. By luck the only places that did not had a push_size aligned to REG_SIZE only had one push_size, so this was not an issue. So here also fixing this mismatch and adding an assert to caught any future mismatch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39817>	2026-02-19 16:53:03 +00:00
Kenneth Graunke	19d9e10f4d	brw: Drop VUE header values and position from wm_prog_data->inputs The FS doesn't read these from the VUE so we don't care about them. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:36 -08:00
Kenneth Graunke	5e48094d72	brw: Drop BRW_VARYING_SLOT_PAD and brw_varying_slot enum In elk, we tried to store our own "driver" enum values after Mesa's VARYING_SLOT_MAX. In brw, we eliminated all of these except for an unnecessary "BRW_VARYING_SLOT_PAD" value. This was used for empty slots, so vue_map::slot_to_varying[] could store something. This patch replaces BRW_VARYING_SLOT_PAD with -1. Our "driver" enum values overlapped with VARYING_SLOT_PATCH0, leading to unnecessary headaches. Now gl_varying_slot_name_for_stage will do the right thing for both regular and patch varyings. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38121>	2026-02-16 15:15:35 -08:00
Kenneth Graunke	3b4af8907f	brw: Delete wm_prog_data::urb_setup_channel[] The entire array is always initialized to zero and never modified. Cuts the size of brw_wm_prog_data by 32%. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39791>	2026-02-09 21:56:04 +00:00
Kenneth Graunke	c5859b2d40	intel: Rename wm_prog_key to fs_prog_key This is the shader key for the fragment shader. Nobody even knows what the windowizer/masker unit is or does anymore. Even on Gen4-6, "fs" is still clearer. This makes the codebase easier to read. This is only about 15 years overdue. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39748>	2026-02-06 20:52:01 -08:00
Kenneth Graunke	56e638be81	intel: Rename wm_prog_data to fs_prog_data This is the program data for the fragment shader. Nobody even knows what the windowizer/masker unit is or does anymore. Even on Gen4-6, "fs" is still clearer. This makes the codebase easier to read. This is only about 15 years overdue. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39748>	2026-02-06 20:51:59 -08:00
Kenneth Graunke	beb4b78fe7	intel: Rename intel_msaa_flags to intel_fs_config This started out as dynamic configuration for MSAA related state, but has since expanded to cover many dynamic fragment shader options. We rename it to intel_fs_config, similar to intel_tess_config, to better indicate its purpose. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39748>	2026-02-06 20:51:43 -08:00
Caio Oliveira	354dbbe3ae	brw: Use the "early break" loop macros when possible This macro will stop the loop early if there's no chance to make further progress. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39504>	2026-01-28 19:52:02 +00:00
Caio Oliveira	da80122257	brw: Include backend NIR passes in mda files Add a pass tracker struct that can live the whole lifetime of brw_compile() functions, it will keep track of the debug_archiver and also store some metadata that allow us to name the passes. With that, we can also embed the loop tracking in the same struct, so that is free for any loop to use the "early break" optimization. There are other brw_nir_* passes that are called in the pre-processing phase. These are not currently included in the mda yet. Will be handled when we hook debug_archiver or similar to the runtime/driver. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39504>	2026-01-28 19:52:02 +00:00
Lionel Landwerlin	79aff6e274	brw: use fp64 to compute coarse_z For some reason we cannot get the precision needed from the HW at fp32. LNL internal fossildb changes : Totals from 7226 (0.76% of 947978) affected shaders: Instrs: 5512598 -> 5586086 (+1.33%); split: -0.00%, +1.33% Cycle count: 153836056 -> 155079472 (+0.81%); split: -0.77%, +1.58% Spill count: 2025 -> 2021 (-0.20%); split: -0.35%, +0.15% Fill count: 3139 -> 3112 (-0.86%); split: -1.12%, +0.25% Max live registers: 1034601 -> 1034632 (+0.00%); split: -0.00%, +0.00% Max dispatch width: 207296 -> 207264 (-0.02%); split: +0.02%, -0.03% Non SSA regs after NIR: 1147942 -> 1109326 (-3.36%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12726 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	a19e949824	brw: move coarse_z computation to NIR So that we can print it easily with debug printfs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	89a53f048a	brw: make coarse pixel bit available to NIR lowering Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:51 +00:00
Lionel Landwerlin	e3fd1b0ac0	brw: populate wm_prog_data earlier So that we can put the coarse_pixel_dispatch value available to NIR lowering. LNL internal fossildb changes: Totals from 40 (0.01% of 490838) affected shaders: Instrs: 33321 -> 33311 (-0.03%); split: -0.04%, +0.01% Cycle count: 780136 -> 779936 (-0.03%); split: -0.03%, +0.00% Max live registers: 5292 -> 5298 (+0.11%) Non SSA regs after NIR: 26638 -> 26464 (-0.65%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:51 +00:00
Lionel Landwerlin	faa857a061	intel: rework push constant handling nr_params & params array are gone. brw_ubo_range is not stored on the prog_data structure anymore (Anv already stored a copy of that with its own additional information) The backend now only deals with load_push_data_intel. load_uniform & load_push_constant have to be lowered by the driver. Pre Gfx12.5 platforms have to provide a subgroup_id_param to specify where the subgroup_id value is located in the push constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:52 +00:00
Iván Briano	094f8f041f	anv: enable fragmentShadingRateWithShaderSampleMask on Xe2+ Before DG2, the value the HW gives us seems to be backwards, but since DG2 this is supposed to be supported just fine. However, due to Wa_22012766191, enable it only for Xe2 and up. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38641>	2025-12-11 22:50:10 +00:00
Lionel Landwerlin	a4e9e660d4	brw/iris: remove fs key for coherent_fb_fetch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38737>	2025-12-02 12:44:35 +00:00
Lionel Landwerlin	515d8f8e3a	brw: fix sample mask flag emission It's also used for testing helper invocations. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e3328dfa2f` ("brw: only initialize sample mask flag if needed") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38699>	2025-11-27 15:59:35 +00:00
Alyssa Rosenzweig	e3328dfa2f	brw: only initialize sample mask flag if needed This is a refinement of `7c129d9365` ("intel/brw/xe2+: Keep PS sample mask in the f1.0 register whether or not kill is used."). Rather than always insert this move, do so only when we'll actually read the register: for memory writes and for discards. This deletes an instruction from piles of fragment shaders. shader-db on LNL: total instructions in shared programs: 17134031 -> 17042706 (-0.53%) instructions in affected programs: 9065743 -> 8974418 (-1.01%) helped: 65045 HURT: 0 helped stats (abs) min: 1.0 max: 3.0 x̄: 1.40 x̃: 1 helped stats (rel) min: <.01% max: 50.00% x̄: 3.06% x̃: 1.64% 95% mean confidence interval for instructions value: -1.41 -1.40 95% mean confidence interval for instructions %-change: -3.10% -3.03% Instructions are helped. total cycles in shared programs: 885172098 -> 884835306 (-0.04%) cycles in affected programs: 590294230 -> 589957438 (-0.06%) helped: 53636 HURT: 4500 helped stats (abs) min: 2.0 max: 1126.0 x̄: 8.02 x̃: 4 helped stats (rel) min: <.01% max: 50.00% x̄: 1.24% x̃: 0.24% HURT stats (abs) min: 2.0 max: 7706.0 x̄: 20.77 x̃: 6 HURT stats (rel) min: <.01% max: 82.06% x̄: 1.09% x̃: 0.54% 95% mean confidence interval for cycles value: -6.15 -5.43 95% mean confidence interval for cycles %-change: -1.10% -1.02% Cycles are helped. LOST: 385 GAINED: 47 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38665>	2025-11-26 16:53:36 +00:00
Kenneth Graunke	792762617a	brw: Rename read_attribute_payload_intel to load_attribute_payload_intel We're going to change the intrinsic to a load(...) which puts "load" in the name. Also, it's just more consistent with our usual terminology. We also rename the corresponding backend opcode so they remain matched. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:58 +00:00
Lionel Landwerlin	7e72d392d7	brw: switch to load_(pixel_coord|frag_coord_z|frag_coord_w) intrinsics Allows us to better determine if we need Z/W payload delivery. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36392>	2025-11-25 15:50:48 +00:00
Yonggang Luo	ecb0ccf603	treewide: Replace calling to function ALIGN with align This is done by grep ALIGN( to align( docs,*.xml,blake3 is excluded Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38365>	2025-11-12 21:58:40 +00:00
Kenneth Graunke	73cbb35442	brw: Move into a new src/intel/compiler/brw subdirectory This keeps the directory structure a bit more organized: - brw specific code - elk specific code - common NIR passes that could be used in both places It also means that you can now 'git grep' in the brw directory without finding a bunch of elk code, or having to "grep thing b*". Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37755>	2025-10-09 07:01:47 +00:00

30 commits
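
Note on commit 91c5744e25: the message describes a mismatch between aligning each push size to REG_SIZE before summing and aligning only the sum. The arithmetic can be illustrated with a small sketch. This is not Mesa code: the helper names and the REG_SIZE value of 32 bytes are assumptions chosen for illustration, and the real functions operate on compiler state rather than plain lists.

```python
# Illustrative only: REG_SIZE and both helpers are hypothetical stand-ins,
# not the actual brw implementation.
REG_SIZE = 32  # assumed register size in bytes


def align(value, alignment):
    """Round value up to the next multiple of alignment (a power of two)."""
    return (value + alignment - 1) & ~(alignment - 1)


def total_align_each(push_sizes):
    """Align every push size to REG_SIZE, then sum the aligned sizes."""
    return sum(align(size, REG_SIZE) for size in push_sizes)


def total_align_sum(push_sizes):
    """Sum the raw push sizes, then align only the total."""
    return align(sum(push_sizes), REG_SIZE)


if __name__ == "__main__":
    for sizes in ([40], [32, 64], [40, 40]):
        print(sizes, total_align_each(sizes), total_align_sum(sizes))
```

With a single unaligned size the two totals agree, which matches the commit's remark that the mismatch did not cause problems in practice; with two unaligned sizes they diverge.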