fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 04:48:07 +02:00

Author	SHA1	Message	Date
Calder Young	895ff7fe92	Revert "anv,brw: Allow multiple ray queries without spilling to a shadow stack" Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This optimization doesn't work when the ray query index isn't uniform across the subgroup, which is something the spec allows. While there are some smart ways to fix this and still avoid unnecessary spilling, its not worth investing the time until we find a realtime raytracing workload that actually needs to use multiple live ray queries for something. Fixes: `1f1de7eb` ("anv,brw: Allow multiple ray queries without spilling to a shadow stack") Acked-by: Sagar Ghuge <sagar.ghuge@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39445>	2026-01-23 21:33:55 +00:00
Sagar Ghuge	6aa3b70382	anv: Mark RootNodeOffset at 256B always This commit change the BVH layout a little so that we can load the BVH offset as constant rather than reading from memory. We have to force the instance leaves pointer at the end which gets used in copy.comp shader. Totals: Instrs: 54798 -> 54728 (-0.13%) Send messages: 3854 -> 3847 (-0.18%) Cycle count: 1915106 -> 1913954 (-0.06%); split: -0.07%, +0.01% Non SSA regs after NIR: 18594 -> 18575 (-0.10%) Totals from 7 (7.37% of 95) affected shaders: Instrs: 5532 -> 5462 (-1.27%) Send messages: 367 -> 360 (-1.91%) Cycle count: 132592 -> 131440 (-0.87%); split: -1.01%, +0.14% Non SSA regs after NIR: 1989 -> 1970 (-0.96%) PERCENTAGE DELTAS Shaders Instrs Send messages Cycle count Non SSA regs after NIR q2rtx-rt-pipeline 95 -0.13% -0.18% -0.06% -0.10% -------------------------------------------------------------------------------------- All affected 7 -1.27% -1.91% -0.87% -0.96% -------------------------------------------------------------------------------------- Total 95 -0.13% -0.18% -0.06% -0.10% Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39106>	2026-01-22 23:20:04 +00:00
Caio Oliveira	c8375c0f71	brw/scoreboard: Support local implicit out-of-order dependencies In software scoreboard (Gfx12+) use information from previous instructions to trim out-of-order dependencies. For example, in send g1, g2 ($1) mov g3, g1 ($1.dst) // Depends on g1 (destination of $1) mov g4, g2 ($1.src) // Depends on g2 (source of $1) mov g5, g1 ($1.dst) // Depends on g1 (destination of $1) only the first `mov` needs to be annotated, because the execution will stall until that dependency is fulfilled, which in this case means the `send` is done and `g1` was already written. Note that while `$x.dst` implies `$x.src`, the reverse is not true, so if the first `mov` did not exist, both second and third `mov` in the example would have to keep their annotations. This patch add resolution of implicit out-of-order dependencies that are visible inside a block. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3526>	2026-01-21 22:29:28 +00:00
Caio Oliveira	ba317e14a0	brw: Provide ~ and &= operators for tgl_sbid_mode Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3526>	2026-01-21 22:29:28 +00:00
Caio Oliveira	2ebacbc78d	brw/scoreboard: Add tests showing implicit unordered dependencies in SWSB Mark tests as disabled for now. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3526>	2026-01-21 22:29:28 +00:00
Caio Oliveira	423916152e	brw/scoreboard: Use std::vector when applicable There's agreement now these are helpful and widely supported. We can always fallback to a custom vector class later if necessary. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3526>	2026-01-21 22:29:27 +00:00
Lionel Landwerlin	79aff6e274	brw: use fp64 to compute coarse_z For some reason we cannot get the precision needed from the HW at fp32. LNL internal fossildb changes : Totals from 7226 (0.76% of 947978) affected shaders: Instrs: 5512598 -> 5586086 (+1.33%); split: -0.00%, +1.33% Cycle count: 153836056 -> 155079472 (+0.81%); split: -0.77%, +1.58% Spill count: 2025 -> 2021 (-0.20%); split: -0.35%, +0.15% Fill count: 3139 -> 3112 (-0.86%); split: -1.12%, +0.25% Max live registers: 1034601 -> 1034632 (+0.00%); split: -0.00%, +0.00% Max dispatch width: 207296 -> 207264 (-0.02%); split: +0.02%, -0.03% Non SSA regs after NIR: 1147942 -> 1109326 (-3.36%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12726 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	a19e949824	brw: move coarse_z computation to NIR So that we can print it easily with debug printfs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	89a53f048a	brw: make coarse pixel bit available to NIR lowering Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:51 +00:00
Lionel Landwerlin	e3fd1b0ac0	brw: populate wm_prog_data earlier So that we can put the coarse_pixel_dispatch value available to NIR lowering. LNL internal fossildb changes: Totals from 40 (0.01% of 490838) affected shaders: Instrs: 33321 -> 33311 (-0.03%); split: -0.04%, +0.01% Cycle count: 780136 -> 779936 (-0.03%); split: -0.03%, +0.00% Max live registers: 5292 -> 5298 (+0.11%) Non SSA regs after NIR: 26638 -> 26464 (-0.65%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:51 +00:00
Lionel Landwerlin	6a7ff83874	brw: set nir_shader_compiler_options::has_pixel_coord Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:50 +00:00
Lionel Landwerlin	3d2a696763	brw: treat inline parameters like UNIFORM Makes a bunch of copy propagation and other passes work much better. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:53 +00:00
Lionel Landwerlin	1d1866a84b	brw: apply same workaround to spawn than trace opcode Working around BRW's limitations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:52 +00:00
Lionel Landwerlin	0e9453291c	brw: improve push constant loading using base offsets Xe2+ only Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:52 +00:00
Lionel Landwerlin	c1ef494b08	brw: add missing base offset decoding Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:52 +00:00
Georg Lehmann	050507ab81	brw: make sure nir_opt_algebraic_late was called after late brw_nir_optimize Not only is it questionable for code quality to not call nir_opt_algebraic_late after nir_opt_algebraic, it also breaks correctness for late lowerings. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:28 +00:00
Alyssa Rosenzweig	a11aa3fc4e	brw: combine peephole select calls Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39361>	2026-01-16 21:24:15 +00:00
Calder Young	d69daf28d0	anv,brw: Add helper to get stack ids per dss for ray queries Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38778>	2026-01-16 09:21:50 +00:00
Calder Young	1f1de7ebd6	anv,brw: Allow multiple ray queries without spilling to a shadow stack Allows a shader to have multiple ray queries without spilling them to a shadow stack. Instead, the driver provides the shader with an array of multiple RTDispatchGlobals structs to give each query its own dedicated stack. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38778>	2026-01-16 09:21:50 +00:00
Caio Oliveira	b542ac4ca0	brw: Fix and properly use increment_a64_address() Since the move to MEMORY__LOGICAL the result value was being ignored, so change to use that. Since the conversion to use new registers, some issues were introduced: - Even with `has_64bit_int` ADD with 64-bit immediate value is not supported; - `dst_high` was not being filled if there was no overflow; - Only `dst_low` returned. Found when writing some new code involving large block loads. Fixes: `b79e85a93f` ("brw: always use new registers for load address increments") Fixes: `b55f77161d` ("intel/brw: Switch to emitting MEMORY__LOGICAL opcodes") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39282>	2026-01-15 19:47:23 +00:00
Lionel Landwerlin	fd744b0c8a	brw: switch buffer/image size intrinsics lowering to NIR Fossil-db DG2: Totals from 127 (0.01% of 1799288) affected shaders: Instrs: 60593 -> 60508 (-0.14%); split: -0.15%, +0.01% Cycle count: 7099635 -> 7116148 (+0.23%); split: -0.12%, +0.35% Spill count: 468 -> 466 (-0.43%) Fill count: 224 -> 222 (-0.89%) Max live registers: 6418 -> 6424 (+0.09%); split: -0.06%, +0.16% Non SSA regs after NIR: 11228 -> 11220 (-0.07%); split: -0.20%, +0.12% Fossil-db LNL: Totals from 135 (0.01% of 1573226) affected shaders: Instrs: 55173 -> 55143 (-0.05%); split: -0.07%, +0.01% Cycle count: 9178338 -> 9157052 (-0.23%); split: -0.32%, +0.09% Spill count: 454 -> 452 (-0.44%) Fill count: 181 -> 179 (-1.10%) Max live registers: 12915 -> 12919 (+0.03%); split: -0.06%, +0.09% Non SSA regs after NIR: 10860 -> 10852 (-0.07%); split: -0.20%, +0.13% shader-db LNL: total instructions in shared programs: 16911578 -> 16911566 (<.01%) instructions in affected programs: 1602 -> 1590 (-0.75%) helped: 7 HURT: 0 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.71 x̃: 2 helped stats (rel) min: 0.48% max: 1.10% x̄: 0.75% x̃: 0.74% 95% mean confidence interval for instructions value: -2.17 -1.26 95% mean confidence interval for instructions %-change: -0.96% -0.55% Instructions are helped. total loops in shared programs: 5168 -> 5168 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total cycles in shared programs: 848964184 -> 848955094 (<.01%) cycles in affected programs: 1528020 -> 1518930 (-0.59%) helped: 9 HURT: 6 helped stats (abs) min: 2.0 max: 8484.0 x̄: 1212.89 x̃: 20 helped stats (rel) min: 0.02% max: 3.23% x̄: 0.57% x̃: 0.11% HURT stats (abs) min: 2.0 max: 1608.0 x̄: 304.33 x̃: 15 HURT stats (rel) min: <.01% max: 0.59% x̄: 0.19% x̃: 0.07% 95% mean confidence interval for cycles value: -1875.18 663.18 95% mean confidence interval for cycles %-change: -0.75% 0.23% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 3345 -> 3345 (0.00%) spills in affected programs: 0 -> 0 helped: 0 HURT: 0 total fills in shared programs: 1777 -> 1777 (0.00%) fills in affected programs: 0 -> 0 helped: 0 HURT: 0 total sends in shared programs: 869299 -> 869299 (0.00%) sends in affected programs: 0 -> 0 helped: 0 HURT: 0 LOST: 0 GAINED: 0 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39258>	2026-01-14 10:37:32 +00:00
Alyssa Rosenzweig	c339b55f92	brw/nir_lower_fs_load_output: unify texture builders Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39271>	2026-01-14 08:18:15 +00:00
Lionel Landwerlin	0a3f3fd193	brw: drop unused color_outputs_valid key Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39264>	2026-01-12 20:21:48 +00:00
Lionel Landwerlin	c3bd1a1688	brw: handle layer_id only through system value Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39259>	2026-01-12 19:53:36 +00:00
Lionel Landwerlin	081c5bc6a5	brw: fix derivatives on non 32bit floats Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14600 Meh'd-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39226>	2026-01-12 15:18:46 +00:00
Lionel Landwerlin	a97b01801a	brw: enable SIMD32 compute shaders with ray queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11020 Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	527ae448e5	brw/nir/rt: ensure we can load 2 RT_DISPATCH_GLOBALS Each group of 16 lanes inside a SIMD32 shader will load different globals. In SIMD8/16 shaders, the divergence analysis will turn this load into nir_load_global_constant_uniform_block_intel. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	b996b03f21	brw: enable topology opcodes in SIMD32 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	286073f6eb	brw: handle lowering of a couple of opcodes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	2fa09500a2	brw: enable ray query spilling in SIMD32 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	6d19b898e7	anv/brw: prep work for SIMD32 ray queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Alyssa Rosenzweig	43efc1cc7e	brw: use nir_is_shared_access Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39219>	2026-01-09 20:51:12 +00:00
Caio Oliveira	d160b7726a	brw/scoreboard: Disable nomask workaround for Xe2+ Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The issue was caused by fused EU feature that is not used in Xe2+ anymore. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36659>	2026-01-09 17:25:00 +00:00
Caio Oliveira	47a6ef3fef	brw/scoreboard: Use a predicate helper for the nomask workaround If it wasn't for the workaround, it wouldn't be necessary to track the whether instructions are exec_all or not. The workaround affects results when mixing a dep and inst with different exec_all. Add the predicate so that, when the workaround is disabled, none of the effects of having different exec_all will kick in, all them will be considered `exec_all = true`. This patch don't change any behavior, just adds the predicate. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36659>	2026-01-09 17:25:00 +00:00
Lionel Landwerlin	faa857a061	intel: rework push constant handling Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details nr_params & params array are gone. brw_ubo_range is not stored on the prog_data structure anymore (Anv already stored a copy of that with its own additional information) The backend now only deals with load_push_data_intel. load_uniform & load_push_constant have to be lowered by the driver. Pre Gfx12.5 platforms have to provide a subgroup_id_param to specify where the subgroup_id value is located in the push constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:52 +00:00
Lionel Landwerlin	60e359412d	iris: manage TBIMR null push constant wa in driver Anv already manages this itself. This allows removing the logic from the compiler. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:52 +00:00
Lionel Landwerlin	f4a0e05970	anv/brw/iris: get rid of param array on prog_data Drivers can do all the lowering to push constants to find the only value useful in that array (subgroup_id). Then drivers call into brw_cs_fill_push_const_info() to get the cross/per thread constant layout computed in the prog_data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:51 +00:00
Lionel Landwerlin	ec456e99f2	brw: add a pass to lower ubo to push constant data Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:49 +00:00
Lionel Landwerlin	2c7254c131	brw: invert condition to reduce code nesting Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:48 +00:00
Caio Oliveira	dcefa0e6b3	brw: Rework UIP and JIP setting code Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The current code walks the instructions, and when needed, it will scan to find the next "end of scope" and sometimes the next "end of block". It also has a separate patching logic for HALTs. The new code collects the necessary scope information up front, then walks the instruction backwards, making avoiding the need to scan for the end of scope. It will also walk only the relevant instructions that were previously collected. It also replaces the previous HALT-specific patching logic. With this new change, many cases that were jumping to intermediate HALTs, will now jump straight to the end of scope (or the "end of the program" section). E.g. in ``` if ... (...) HALT ... (...) HALT endif ``` both HALTs now will jump to the end of the scope, instead of the first HALT jumping into the second one. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38914>	2026-01-08 22:01:45 +00:00
Caio Oliveira	c939744d2d	brw: Consolidate generator code for emitting "regular" instructions Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Most of instructions follow the basic formats (1, 2 and 3 src), so consolidate their emission code in generator. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38878>	2026-01-08 16:47:02 +00:00
Caio Oliveira	e1e055f23f	brw: Move LRP related validation Move validation, noting that LRP only supports BRW_TYPE_F -- the previous assert had DF because it also was used by MAD in the past. With that change, ALU3F can be replaced by ALU3 for LRP. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38878>	2026-01-08 16:47:02 +00:00
Caio Oliveira	68e1a07181	brw: Move normalization of 3-src instructions swizzles to a single place When repctrl is used, the swizzle/chansel is ignored. Instead of setting a swizzle that has all zeros and encode that, don't encode anything. For context see `e7598c5a62` ("intel/compiler: Set swizzle to BRW_SWIZZLE_XXXX for scalar region"). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38878>	2026-01-08 16:47:01 +00:00
José Roberto de Souza	0cc73385e6	intel/brw: Document UBO_START Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39175>	2026-01-07 14:25:42 +00:00
José Roberto de Souza	961ca451e0	intel/brw: Add comment to ubo_ranges Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39175>	2026-01-07 14:25:42 +00:00
Georg Lehmann	eb4737a1dd	nir: add nir_alu_instr_is_exact helper Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39103>	2026-01-07 09:40:57 +00:00
Marek Olšák	1912a00a91	ALL: use SHA1_DIGEST_LENGTH etc. instead of hardcoding the numbers only build_id is switched to use literal 20 instead of SHA1_DIGEST_LENGTH because we will increase SHA1_DIGEST_LENGTH to BLAKE3_KEY_LEN Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39110>	2026-01-07 08:32:33 +00:00
José Roberto de Souza	6f031a98e0	intel/brw: Nuke brw_inst::is_volatile() There is no users for that function, is_volatile is only used in brw_opt_cse.cpp is_expression() but it access the information using brw_send_inst struct. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39104>	2026-01-05 14:11:47 +00:00
Georg Lehmann	f3290219ab	nir: use a seperate enum for per alu floating point math control We don't need one bit per bitsize per instruction if only one actually matters in the end. First step towards moving NIR in the direction of full float_controls2 only. Also rename this from fp_fast_math, because that name implied that 0 is the no fast math mode, while the opposite was the case. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39026>	2025-12-29 10:57:05 +00:00
Sushma Venkatesh Reddy	d1d4e3d530	brw: Add EU assembler support for float8 Decode logic in Gfx12+ has become complex with the new types, so Caio suggested that we move to the table like other gens. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39007>	2025-12-19 00:09:53 +00:00

1 2 3 4 5 ...

4875 commits