fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 22:18:18 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	dbb24ff56b	brw: Assert that urb_vec4_intel stores only have 4/8 components vec1-3, 5-7, and 9+ are not supported. Only vec4 and vec8. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39250>	2026-01-27 16:08:36 +00:00
Caio Oliveira	74f1d4f47b	intel/compiler: Use SPDX annotations Minor adjustments to formatting of the copyright line, but keep dates and holders. "Authors" entries that could be obtained via Git logs were also removed. The license in brw_disasm.c and elk_disasm.c don't match directly any SPDX pattern I could find, so kept as is. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39503>	2026-01-24 20:37:31 +00:00
Caio Oliveira	dc352f3d7c	brw: Don't increment block loads addresses unless needed Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39447>	2026-01-24 04:38:23 +00:00
Lionel Landwerlin	a19e949824	brw: move coarse_z computation to NIR So that we can print it easily with debug printfs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	3d2a696763	brw: treat inline parameters like UNIFORM Makes a bunch of copy propagation and other passes work much better. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:53 +00:00
Caio Oliveira	b542ac4ca0	brw: Fix and properly use increment_a64_address() Since the move to MEMORY__LOGICAL the result value was being ignored, so change to use that. Since the conversion to use new registers, some issues were introduced: - Even with `has_64bit_int` ADD with 64-bit immediate value is not supported; - `dst_high` was not being filled if there was no overflow; - Only `dst_low` returned. Found when writing some new code involving large block loads. Fixes: `b79e85a93f` ("brw: always use new registers for load address increments") Fixes: `b55f77161d` ("intel/brw: Switch to emitting MEMORY__LOGICAL opcodes") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39282>	2026-01-15 19:47:23 +00:00
Lionel Landwerlin	fd744b0c8a	brw: switch buffer/image size intrinsics lowering to NIR Fossil-db DG2: Totals from 127 (0.01% of 1799288) affected shaders: Instrs: 60593 -> 60508 (-0.14%); split: -0.15%, +0.01% Cycle count: 7099635 -> 7116148 (+0.23%); split: -0.12%, +0.35% Spill count: 468 -> 466 (-0.43%) Fill count: 224 -> 222 (-0.89%) Max live registers: 6418 -> 6424 (+0.09%); split: -0.06%, +0.16% Non SSA regs after NIR: 11228 -> 11220 (-0.07%); split: -0.20%, +0.12% Fossil-db LNL: Totals from 135 (0.01% of 1573226) affected shaders: Instrs: 55173 -> 55143 (-0.05%); split: -0.07%, +0.01% Cycle count: 9178338 -> 9157052 (-0.23%); split: -0.32%, +0.09% Spill count: 454 -> 452 (-0.44%) Fill count: 181 -> 179 (-1.10%) Max live registers: 12915 -> 12919 (+0.03%); split: -0.06%, +0.09% Non SSA regs after NIR: 10860 -> 10852 (-0.07%); split: -0.20%, +0.13% shader-db LNL: total instructions in shared programs: 16911578 -> 16911566 (<.01%) instructions in affected programs: 1602 -> 1590 (-0.75%) helped: 7 HURT: 0 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.71 x̃: 2 helped stats (rel) min: 0.48% max: 1.10% x̄: 0.75% x̃: 0.74% 95% mean confidence interval for instructions value: -2.17 -1.26 95% mean confidence interval for instructions %-change: -0.96% -0.55% Instructions are helped. total loops in shared programs: 5168 -> 5168 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total cycles in shared programs: 848964184 -> 848955094 (<.01%) cycles in affected programs: 1528020 -> 1518930 (-0.59%) helped: 9 HURT: 6 helped stats (abs) min: 2.0 max: 8484.0 x̄: 1212.89 x̃: 20 helped stats (rel) min: 0.02% max: 3.23% x̄: 0.57% x̃: 0.11% HURT stats (abs) min: 2.0 max: 1608.0 x̄: 304.33 x̃: 15 HURT stats (rel) min: <.01% max: 0.59% x̄: 0.19% x̃: 0.07% 95% mean confidence interval for cycles value: -1875.18 663.18 95% mean confidence interval for cycles %-change: -0.75% 0.23% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 3345 -> 3345 (0.00%) spills in affected programs: 0 -> 0 helped: 0 HURT: 0 total fills in shared programs: 1777 -> 1777 (0.00%) fills in affected programs: 0 -> 0 helped: 0 HURT: 0 total sends in shared programs: 869299 -> 869299 (0.00%) sends in affected programs: 0 -> 0 helped: 0 HURT: 0 LOST: 0 GAINED: 0 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39258>	2026-01-14 10:37:32 +00:00
Lionel Landwerlin	c3bd1a1688	brw: handle layer_id only through system value Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39259>	2026-01-12 19:53:36 +00:00
Lionel Landwerlin	081c5bc6a5	brw: fix derivatives on non 32bit floats Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14600 Meh'd-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39226>	2026-01-12 15:18:46 +00:00
Lionel Landwerlin	b996b03f21	brw: enable topology opcodes in SIMD32 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	faa857a061	intel: rework push constant handling Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details nr_params & params array are gone. brw_ubo_range is not stored on the prog_data structure anymore (Anv already stored a copy of that with its own additional information) The backend now only deals with load_push_data_intel. load_uniform & load_push_constant have to be lowered by the driver. Pre Gfx12.5 platforms have to provide a subgroup_id_param to specify where the subgroup_id value is located in the push constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:52 +00:00
Kenneth Graunke	d83c699045	brw: Convert GS pulled inputs to use URB intrinsics We leave GS pushed inputs using load_per_vertex_input for now - they're relatively simple, and using load_attribute_payload doesn't work well since it's assumed to be convergent (for TES, FS inputs) while GS inputs are divergent. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38990>	2025-12-18 06:39:02 +00:00
Caio Oliveira	47d8ed1177	brw: Move PLN/LINE normalization Add validation for Source 0 and move the normalization into the code producing the instruction. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38877>	2025-12-16 01:34:44 +00:00
Kenneth Graunke	d831f38d11	brw: Delete all the old backend mesh/task URB handling code This has all been replaced by NIR lowering to URB intrinsics. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38918>	2025-12-16 00:58:46 +00:00
Lionel Landwerlin	d3053fb3d2	brw: Implement URB handle intrinsics for task/mesh stages (Split by Ken from a larger patch originally written by Lionel.) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38918>	2025-12-16 00:58:40 +00:00
Iván Briano	094f8f041f	anv: enable fragmentShadingRateWithShaderSampleMask on Xe2+ Before DG2, the value the HW gives us seems to be backwards, but since DG2 this is supposed to be supported just fine. However, due to Wa_22012766191, enable it only for Xe2 and up. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38641>	2025-12-11 22:50:10 +00:00
Lionel Landwerlin	86419dd519	brw: remove driver specific load_num_workgroup lowering Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38735>	2025-12-02 22:44:05 +00:00
Lionel Landwerlin	a4e9e660d4	brw/iris: remove fs key for coherent_fb_fetch Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38737>	2025-12-02 12:44:35 +00:00
Kenneth Graunke	ee407481c2	brw: Switch to URB intrinsics for TCS inputs Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:02 +00:00
Kenneth Graunke	943b2acf02	brw: Switch to NIR URB intrinsics for TES inputs Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:01 +00:00
Kenneth Graunke	c0d69b2faf	brw: Switch to NIR URB intrinsics for TCS outputs Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:44:01 +00:00
Kenneth Graunke	96d331766a	brw: Generalize read_attribute_payload_intel to handle more cases We were using this for indirect loads of the shader input thread payload, but there's no reason we can't use it for constant access too. In this case we can just MOV from the ATTR file directly without a special opcode that turns into MOV_INDIRECT later. We also allow it to load multiple components now. This is useful for say, returning vec4 pushed inputs. And, we allow it in more stages than just the fragment stage. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:59 +00:00
Kenneth Graunke	792762617a	brw: Rename read_attribute_payload_intel to load_attribute_payload_intel We're going to change the intrinsic to a load(...) which puts "load" in the name. Also, it's just more consistent with our usual terminology. We also rename the corresponding backend opcode so they remain matched. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:58 +00:00
Kenneth Graunke	a1c7ae9d15	brw: Implement URB handle intrinsics for TCS and TES stages Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:56 +00:00
Lionel Landwerlin	e290f9641d	brw: Implement load/store URB intrinsics These work the same regardless of stage. v2 (Ken): Rebase, move from mesh to all stages, add reorderable load variant, allow channel masks to be non-constant even on Xe2. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:55 +00:00
Lionel Landwerlin	0d8ee4ed23	brw: use default builder for urb handle adjustment Be consistent with lowering that happens after, so that it gets a full vector register and can stride into it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38482>	2025-11-25 22:43:55 +00:00
Lionel Landwerlin	7e72d392d7	brw: switch to load_(pixel_coord\|frag_coord_z\|frag_coord_w) intrinsics Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Allows us to better determine if we need Z/W payload delivery. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36392>	2025-11-25 15:50:48 +00:00
Yonggang Luo	ecb0ccf603	treewide: Replace calling to function ALIGN with align This is done by grep ALIGN( to align( docs,*.xml,blake3 is excluded Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38365>	2025-11-12 21:58:40 +00:00
Yonggang Luo	db767eb7e0	brw: Do not use align as variable name, as it's a function in u_math.h and will be used Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38365>	2025-11-12 21:58:38 +00:00
Konstantin Seurer	de32f9275f	treewide: add & use parent instr helpers We add a bunch of new helpers to avoid the need to touch >parent_instr, including the full set of: * nir_def_is_* * nir_def_as__or_null nir_def_as_* [assumes the right instr type] * nir_src_is_* * nir_src_as_* * nir_scalar_is_* * nir_scalar_as_* Plus nir_def_instr() where there's no more suitable helper. Also an existing helper is renamed to unify all the names, while we're churning the tree: * nir_src_as_alu_instr -> nir_src_as_alu ..and then we port the tree to use the helpers as much as possible, using nir_def_instr() where that does not work. Acked-by: Marek Olšák <maraeo@gmail.com> --- To eliminate nir_def::parent_instr we need to churn the tree anyway, so I'm taking this opportunity to clean up a lot of NIR patterns. Co-authored-by: Konstantin Seurer <konstantin.seurer@gmail.com> Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38313>	2025-11-12 21:22:13 +00:00
Kenneth Graunke	9ffae42975	brw: Store brw_urb_inst::offset in bytes on Xe2 Xe2 uses byte offsets rather than OWord offsets. We've been storing the per-slot offsets in bytes on Xe2 for a while, but kept the global offset immediate in OWords for some reason, choosing to lower it during logical send lowering. This patch makes both offsets (global immediate, per-slot) in the same units, so they could be added together if necessary without scaling. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38343>	2025-11-11 10:55:44 +00:00
Kenneth Graunke	439c156831	brw: Add an assertion that writemasks can be fully ignored I noticed that our backend was completely ignoring writemasks, despite them appearing on many of the intrinsics we're implementing. Rhys Perry pointed out that nir_lower_mem_access_bitsizes is removing all non-trivial writemasking today, so ssbo/global/shared/scratch/etc. stores should only ever see all components enabled. Which means what we're doing is legitimate, if non-obvious. Add an assert to make it obvious. Thanks a lot to Rhys for helping me rediscover what made this work. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38343>	2025-11-11 10:55:42 +00:00
Iván Briano	aa97c23484	brw: shut -Wmaybe-uninitialized up Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Release builds are noisy about flush_type and scope being used uninitialized, even though they are always set. Initialize them to the final else values to make GCC happy. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38357>	2025-11-10 21:06:50 +00:00
Lionel Landwerlin	51893699a2	brw: stop emitting flush operations for begin/end interlock Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details NIR barrier intrinsics are already added for required flushing. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38242>	2025-11-06 09:33:25 +02:00
Lionel Landwerlin	c5d313a2a8	brw: handling dynamic programmable offsets pre-Xe2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37929>	2025-10-21 06:13:10 +00:00
Lionel Landwerlin	b722e17203	brw: get rid of GET_BUFFER_SIZE opcode Rely on RESINFO which is what was used already. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37171>	2025-10-16 12:08:16 +00:00
Lionel Landwerlin	bcffd839aa	brw: new Xe2 sampler opcodes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37171>	2025-10-16 12:08:16 +00:00
Lionel Landwerlin	efcba73b49	brw: switch to new sampler payload description scheme Instead of having abstracted opcodes, we target directly the HW format at the NIR translation. The payload description gives us the order of the payload sources (we can use that for pretty printing) and we don't have to have a complicated scheme in the logical send lowering for the ordering. All we have to do is build the header if needed as well as the descriptors. PTL Fossil-db stats: Totals from 66759 (13.54% of 492917) affected shaders: Instrs: 44289221 -> 43957404 (-0.75%); split: -0.81%, +0.06% Send messages: 2050378 -> 2042607 (-0.38%) Cycle count: 3878874713 -> 3712848434 (-4.28%); split: -4.44%, +0.16% Max live registers: 8773179 -> 8770104 (-0.04%); split: -0.06%, +0.03% Max dispatch width: 1677408 -> 1707952 (+1.82%); split: +1.85%, -0.03% Non SSA regs after NIR: 11407821 -> 11421041 (+0.12%); split: -0.03%, +0.15% GRF registers: 5686983 -> 5838785 (+2.67%); split: -0.24%, +2.91% LNL Fossil-db stats: Totals from 57911 (15.72% of 368381) affected shaders: Instrs: 39448036 -> 38923650 (-1.33%); split: -1.41%, +0.08% Subgroup size: 1241360 -> 1241392 (+0.00%) Send messages: 1846696 -> 1845137 (-0.08%) Cycle count: 3834818910 -> 3784003027 (-1.33%); split: -2.33%, +1.00% Spill count: 21866 -> 22168 (+1.38%); split: -0.07%, +1.45% Fill count: 59324 -> 60339 (+1.71%); split: -0.00%, +1.71% Scratch Memory Size: 1479680 -> 1483776 (+0.28%) Max live registers: 7521376 -> 7447841 (-0.98%); split: -1.04%, +0.06% Non SSA regs after NIR: 9744605 -> 10113728 (+3.79%); split: -0.01%, +3.80% Only 2 titles negatively impacted (spilling) : - Shadow of the Tomb Raider - Red Dead Redemption 2 All impacted shaders were already spilling. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37171>	2025-10-16 12:08:15 +00:00
Lionel Landwerlin	7c77c4768a	brw: add a new sampler payload parameter description Centralize all the information in one place and also make the mapping decision from nir_tex_instr -> HW opcode much earlier. This will help knowning exactly what the payload looks like early in the backend IR and when it needs to lowered to a smaller SIMD size due to HW limits. It will also allow NIR lowering to know when to combine parameters into a single packed component. Finally, this also reduces the amount of LOAD_PAYLOAD we need to carry in the backend IR, because we don't have to generate VEC() LOAD_PAYLOAD() for coordinates etc... Those are useless if there is any other parameter in the payload and we need need to add one more LOAD_PAYLOAD() when doing the logical send lowering. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37171>	2025-10-16 12:08:14 +00:00
Ian Romanick	1dea86f773	brw: Don't do non-obvious things with BFN parameter ordering Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Somehow dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1.atan_frag was able to generate a bitfield_select with a constant first parameter. That makes the big comment here completely false. Don't be clever. If the constant is in the wrong place, commute_immediates during copy propagation will fix it. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37891>	2025-10-16 00:37:30 +00:00
Lionel Landwerlin	49226692e5	brw: fix invalid sparse bitfield offset computation Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details dest_size is the number of outputs to be provided into the IR, but the location of the sparse bitfield in the dst temporary SEND destination might be different (shorter due to masking of unused components computed above). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14094 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37876>	2025-10-15 14:42:51 +00:00
Ian Romanick	ca493b5c45	brw: elk: Fix name of function in comment Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Trivial. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37186>	2025-10-10 17:25:11 +00:00
Ian Romanick	b948e6d503	brw: Use BFN to implement nir_opt_bitfield_select shader-db: Lunar Lake, Meteor Lake, and DG2 had similar results. (Lunar Lake shown) total instructions in shared programs: 17181559 -> 17181254 (<.01%) instructions in affected programs: 250921 -> 250616 (-0.12%) helped: 303 / HURT: 0 total cycles in shared programs: 888542568 -> 888543370 (<.01%) cycles in affected programs: 49861772 -> 49862574 (<.01%) helped: 181 / HURT: 110 fossil-db: Lunar Lake, Meteor Lake, and DG2 had similar results. (Lunar Lake shown) Totals: Instrs: 233260591 -> 233260196 (-0.00%); split: -0.00%, +0.00% Cycle count: 32754501248 -> 32754567116 (+0.00%); split: -0.00%, +0.00% Max live registers: 71738476 -> 71738442 (-0.00%) Non SSA regs after NIR: 67837262 -> 67837108 (-0.00%); split: -0.00%, +0.00% Totals from 226 (0.03% of 790721) affected shaders: Instrs: 382227 -> 381832 (-0.10%); split: -0.15%, +0.05% Cycle count: 72863878 -> 72929746 (+0.09%); split: -0.65%, +0.74% Max live registers: 36557 -> 36523 (-0.09%) Non SSA regs after NIR: 60427 -> 60273 (-0.25%); split: -0.26%, +0.00% No shader-db or fossil-db changes on any previous Intel platforms. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37186>	2025-10-10 17:25:11 +00:00
Ian Romanick	4193895145	brw/cmod: Enable limited cmod propagation for BFN cmod propagation needs more work. Since the result type is always UD, BRW_CONDITION_G should be able to substitute for NZ. Either that or users of the condition could be rewritten to use an inverted condition. v2: Add a couple more unit tests. Suggested by Matt. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37186>	2025-10-10 17:25:11 +00:00
Ian Romanick	65e8220180	brw: Enable saturating float to integer conversion opcodes Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37186>	2025-10-10 17:25:06 +00:00
Lionel Landwerlin	37a9c5411f	brw: serialize messages on Gfx12.x if required The Intel EU fusion feature needs to be disabled on SEND messages where either the texture handle, sampler handle, sampler header is not identical on fused threads. This is the case in particular with accesses on non-uniform texture/sampler handles but could also strike with dynamic programmable offsets (currently disabled). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Anne Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37394>	2025-10-10 11:19:39 +00:00
Kenneth Graunke	dd9e002129	brw: Fix mesh shader asserts in clip/cull distance setting mesh doesn't use brw_vue_prog_data. Also, I had been catching TCS shaders here, and shouldn't. Fixes: `bf76e86bc8` ("brw: Refactor clip/cull distance mask setting into a helper") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37809>	2025-10-10 09:51:26 +00:00
Kenneth Graunke	bb096b0f12	brw: Use BITFIELD_{MASK,RANGE} in clip/cull distance mask handling code Suggested by Alyssa. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37784>	2025-10-09 13:20:04 -07:00
Kenneth Graunke	bf76e86bc8	brw: Refactor clip/cull distance mask setting into a helper This was copy pasted between 4 different stages. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37784>	2025-10-09 13:20:03 -07:00
Kenneth Graunke	73cbb35442	brw: Move into a new src/intel/compiler/brw subdirectory Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This keeps the directory structure a bit more organized: - brw specific code - elk specific code - common NIR passes that could be used in both places It also means that you can now 'git grep' in the brw directory without finding a bunch of elk code, or having to "grep thing b*". Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37755>	2025-10-09 07:01:47 +00:00

50 commits