fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-06 03:40:35 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	440e2e9200	genxml: fix 3DSTATE_TE definition on Gfx12.[05] Since Gfx12+ the instruction is 5 dwords. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36146>	2025-07-16 01:01:11 +00:00
Lionel Landwerlin	ac78693b6a	intel/genxml: rename body field So that the body field has the same name in COMPUTE_WALKER & EXECUTE_INDIRECT_DISPATCH. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36146>	2025-07-16 01:01:11 +00:00
Ian Romanick	b57bad1fd7	brw/reg_allocate: Check source / destination hazard for all larger SIMD Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details All platforms needs this check for SIMD32. Xe2+ do not need this for SIMD16. Also... delete some really stale comments about Gfx4/Gfx5. This compiler doesn't even support those platforms. No shader-db changes on any pre-Xe2 Intel platforms: shader-db: Lunar Lake total instructions in shared programs: 17108867 -> 17108855 (<.01%) instructions in affected programs: 35211 -> 35199 (-0.03%) helped: 19 / HURT: 6 total cycles in shared programs: 885026794 -> 885805580 (0.09%) cycles in affected programs: 140449880 -> 141228666 (0.55%) helped: 903 / HURT: 1142 LOST: 0 GAINED: 25 fossil-db: Lunar Lake Totals: Instrs: 208578317 -> 208574097 (-0.00%); split: -0.00%, +0.00% Cycle count: 31268800798 -> 31259914590 (-0.03%); split: -0.10%, +0.07% Spill count: 504472 -> 504102 (-0.07%); split: -0.09%, +0.02% Fill count: 606581 -> 606079 (-0.08%); split: -0.13%, +0.05% Scratch Memory Size: 35001344 -> 34957312 (-0.13%) Totals from 60714 (8.59% of 706970) affected shaders: Instrs: 48923370 -> 48919150 (-0.01%); split: -0.01%, +0.01% Cycle count: 11830486210 -> 11821600002 (-0.08%); split: -0.27%, +0.20% Spill count: 397150 -> 396780 (-0.09%); split: -0.12%, +0.02% Fill count: 469651 -> 469149 (-0.11%); split: -0.17%, +0.06% Scratch Memory Size: 25971712 -> 25927680 (-0.17%) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:44 +00:00
Ian Romanick	7e98ca89f2	brw/reg_allocate: Adjust source / destination hazard conditions for broadcast Broadcast selects one lane from the source to write to all the lanes of the destination. This makes it possible for the first half to overwrite the source used by the second half. No shader-db changes on any Intel platform. fossil-db: Lunar Lake Totals: Instrs: 208705405 -> 208705374 (-0.00%); split: -0.00%, +0.00% Cycle count: 31274597098 -> 31273711544 (-0.00%); split: -0.00%, +0.00% Totals from 77 (0.01% of 707133) affected shaders: Instrs: 220177 -> 220146 (-0.01%); split: -0.02%, +0.00% Cycle count: 461694212 -> 460808658 (-0.19%); split: -0.33%, +0.14% No fossil-db changes on any other Intel platforms. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:44 +00:00
Ian Romanick	67dc02acc2	brw/reg_allocate: Only add interference for the source with the hazard shader-db: Lunar Lake total instructions in shared programs: 17105892 -> 17105732 (<.01%) instructions in affected programs: 55720 -> 55560 (-0.29%) helped: 29 / HURT: 24 total cycles in shared programs: 884342344 -> 884663448 (0.04%) cycles in affected programs: 154776382 -> 155097486 (0.21%) helped: 719 / HURT: 761 total spills in shared programs: 3278 -> 3262 (-0.49%) spills in affected programs: 320 -> 304 (-5.00%) helped: 4 /HURT: 0 total fills in shared programs: 1632 -> 1616 (-0.98%) fills in affected programs: 368 -> 352 (-4.35%) helped: 4 / HURT: 0 LOST: 3 GAINED: 4 No shader-db changes on any other Intel platforms. fossil-db: Lunar Lake Totals: Instrs: 208696275 -> 208692511 (-0.00%); split: -0.00%, +0.00% Cycle count: 31325252074 -> 31274118190 (-0.16%); split: -0.27%, +0.11% Spill count: 504809 -> 504472 (-0.07%); split: -0.07%, +0.01% Fill count: 607047 -> 606581 (-0.08%); split: -0.08%, +0.01% Scratch Memory Size: 35037184 -> 35001344 (-0.10%); split: -0.11%, +0.01% Totals from 44135 (6.24% of 707112) affected shaders: Instrs: 39570465 -> 39566701 (-0.01%); split: -0.01%, +0.00% Cycle count: 11140437886 -> 11089304002 (-0.46%); split: -0.76%, +0.30% Spill count: 279756 -> 279419 (-0.12%); split: -0.13%, +0.01% Fill count: 354706 -> 354240 (-0.13%); split: -0.14%, +0.01% Scratch Memory Size: 18758656 -> 18722816 (-0.19%); split: -0.20%, +0.01% Meteor Lake, DG2, Tiger Lake, Ice Lake, and Skylake had similar results. (Meteor Lake shown) Totals: Cycle count: 25377247343 -> 25377246251 (-0.00%); split: -0.00%, +0.00% Totals from 11 (0.00% of 806166) affected shaders: Cycle count: 899080 -> 897988 (-0.12%); split: -0.48%, +0.36% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:43 +00:00
Ian Romanick	4e05de7c3d	brw/reg_allocate: Require SIMD32 for destination / source interference on Xe2 No platforms other than Lunar Lake were affected in shader-db or fossil-db for obvious reasons. shader-db: Lunar Lake total instructions in shared programs: 17070074 -> 17069908 (<.01%) instructions in affected programs: 151939 -> 151773 (-0.11%) helped: 61 / HURT: 60 total cycles in shared programs: 891338314 -> 880188516 (-1.25%) cycles in affected programs: 550482120 -> 539332322 (-2.03%) helped: 8053 / HURT: 7183 total spills in shared programs: 3294 -> 3278 (-0.49%) spills in affected programs: 138 -> 122 (-11.59%) helped: 8 / HURT: 0 total fills in shared programs: 1653 -> 1632 (-1.27%) fills in affected programs: 212 -> 191 (-9.91%) helped: 8 / HURT: 0 LOST: 96 GAINED: 70 fossil-db: Lunar Lake Totals: Instrs: 208555066 -> 208509387 (-0.02%); split: -0.03%, +0.00% Cycle count: 31487691872 -> 31318442816 (-0.54%); split: -0.88%, +0.34% Spill count: 508701 -> 504809 (-0.77%); split: -0.86%, +0.10% Fill count: 612583 -> 607047 (-0.90%); split: -1.03%, +0.13% Scratch Memory Size: 35311616 -> 35037184 (-0.78%); split: -0.81%, +0.04% Totals from 214417 (30.33% of 706852) affected shaders: Instrs: 123732970 -> 123687291 (-0.04%); split: -0.04%, +0.01% Cycle count: 27410928904 -> 27241679848 (-0.62%); split: -1.01%, +0.39% Spill count: 452458 -> 448566 (-0.86%); split: -0.97%, +0.11% Fill count: 550991 -> 545455 (-1.00%); split: -1.15%, +0.14% Scratch Memory Size: 31138816 -> 30864384 (-0.88%); split: -0.92%, +0.04% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:43 +00:00
Ian Romanick	e9ae997ffc	brw: Only apply GRF 127 send workaround to Gfx9 The portion of the Bspec dedicated to Gfx6-Gfx11 says that this workaround applies to "Pre-CNL" (with CNL being Gfx10). There is no mention of this workaround in the sections for Xe or Xe2. No shader-db or fossil-db changes on Skylake or older Intel platforms. shader-db: Lunar Lake, Meteor Lake, DG2, Tiger Lake, and Ice Lake (Lunar Lake shown) total instructions in shared programs: 17107031 -> 17107027 (<.01%) instructions in affected programs: 32182 -> 32178 (-0.01%) helped: 16 / HURT: 14 total cycles in shared programs: 895016760 -> 894975410 (<.01%) cycles in affected programs: 312774834 -> 312733484 (-0.01%) helped: 9279 / HURT: 8091 LOST: 40 GAINED: 33 The pre-Xe2 platforms had a lot more lost / gained shaders. This appears to be due to churn in the cycle counts and the SIMD32 heuristic. fossil-db: Lunar Lake Totals: Instrs: 208667436 -> 208671853 (+0.00%); split: -0.00%, +0.01% Subgroup size: 14241168 -> 14241200 (+0.00%) Cycle count: 31495149690 -> 31481397970 (-0.04%); split: -0.17%, +0.13% Spill count: 508467 -> 508701 (+0.05%); split: -0.10%, +0.14% Fill count: 611979 -> 612583 (+0.10%); split: -0.07%, +0.17% Scratch Memory Size: 35288064 -> 35311616 (+0.07%); split: -0.07%, +0.14% Totals from 205773 (29.10% of 707019) affected shaders: Instrs: 103153541 -> 103157958 (+0.00%); split: -0.01%, +0.01% Subgroup size: 4563584 -> 4563616 (+0.00%) Cycle count: 12979963010 -> 12966211290 (-0.11%); split: -0.42%, +0.32% Spill count: 494741 -> 494975 (+0.05%); split: -0.10%, +0.15% Fill count: 597988 -> 598592 (+0.10%); split: -0.07%, +0.17% Scratch Memory Size: 33351680 -> 33375232 (+0.07%); split: -0.08%, +0.15% Meteor Lake and DG2 had similar results. (Meteor Lake shown) Totals: Instrs: 233063764 -> 233057897 (-0.00%); split: -0.01%, +0.00% Subgroup size: 9892840 -> 9892856 (+0.00%) Cycle count: 25387597341 -> 25373885583 (-0.05%); split: -0.36%, +0.31% Spill count: 518469 -> 517940 (-0.10%); split: -0.19%, +0.09% Fill count: 559444 -> 558537 (-0.16%); split: -0.29%, +0.13% Scratch Memory Size: 19694592 -> 19658752 (-0.18%); split: -0.21%, +0.03% Max dispatch width: 7135248 -> 7131672 (-0.05%); split: +0.13%, -0.18% Totals from 301996 (37.49% of 805603) affected shaders: Instrs: 144535999 -> 144530132 (-0.00%); split: -0.01%, +0.01% Subgroup size: 3768528 -> 3768544 (+0.00%) Cycle count: 18687102311 -> 18673390553 (-0.07%); split: -0.50%, +0.42% Spill count: 515687 -> 515158 (-0.10%); split: -0.20%, +0.09% Fill count: 557638 -> 556731 (-0.16%); split: -0.29%, +0.13% Scratch Memory Size: 18662400 -> 18626560 (-0.19%); split: -0.22%, +0.03% Max dispatch width: 2029872 -> 2026296 (-0.18%); split: +0.44%, -0.62% Tiger Lake Totals: Instrs: 238813279 -> 238766482 (-0.02%); split: -0.04%, +0.02% Subgroup size: 9851320 -> 9851328 (+0.00%) Cycle count: 23668877036 -> 23646286421 (-0.10%); split: -0.51%, +0.42% Spill count: 559060 -> 554241 (-0.86%); split: -1.12%, +0.26% Fill count: 595926 -> 591843 (-0.69%); split: -1.46%, +0.78% Scratch Memory Size: 19929088 -> 19764224 (-0.83%); split: -1.19%, +0.36% Max dispatch width: 7102184 -> 7101840 (-0.00%); split: +0.13%, -0.13% Totals from 284125 (35.42% of 802235) affected shaders: Instrs: 144695094 -> 144648297 (-0.03%); split: -0.06%, +0.03% Subgroup size: 3567312 -> 3567320 (+0.00%) Cycle count: 11303753658 -> 11281163043 (-0.20%); split: -1.07%, +0.87% Spill count: 554624 -> 549805 (-0.87%); split: -1.13%, +0.26% Fill count: 592252 -> 588169 (-0.69%); split: -1.47%, +0.78% Scratch Memory Size: 19553280 -> 19388416 (-0.84%); split: -1.21%, +0.37% Max dispatch width: 1895488 -> 1895144 (-0.02%); split: +0.48%, -0.50% Ice Lake Totals: Instrs: 239034316 -> 239049108 (+0.01%); split: -0.03%, +0.04% Subgroup size: 9926440 -> 9926448 (+0.00%) Cycle count: 24944253156 -> 24919967386 (-0.10%); split: -0.25%, +0.15% Spill count: 575498 -> 571612 (-0.68%); split: -1.18%, +0.51% Fill count: 709760 -> 716665 (+0.97%); split: -1.31%, +2.28% Scratch Memory Size: 20699136 -> 20599808 (-0.48%); split: -1.45%, +0.97% Max dispatch width: 7140856 -> 7143568 (+0.04%); split: +0.15%, -0.12% Totals from 233451 (29.01% of 804669) affected shaders: Instrs: 127440610 -> 127455402 (+0.01%); split: -0.07%, +0.08% Subgroup size: 2835784 -> 2835792 (+0.00%) Cycle count: 11818511030 -> 11794225260 (-0.21%); split: -0.53%, +0.32% Spill count: 559557 -> 555671 (-0.69%); split: -1.22%, +0.52% Fill count: 694460 -> 701365 (+0.99%); split: -1.34%, +2.33% Scratch Memory Size: 19774464 -> 19675136 (-0.50%); split: -1.52%, +1.02% Max dispatch width: 1602736 -> 1605448 (+0.17%); split: +0.69%, -0.52% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:42 +00:00
Calder Young	3c7a834ebc	anv: Add support for AV1 video decoding on Gfx125 and Xe2 Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36015>	2025-07-15 01:21:53 +00:00
Calder Young	3456a65619	intel/genxml: Update AVP instructions for Gfx125 and Xe2 Acked-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36015>	2025-07-15 01:21:53 +00:00
Caio Oliveira	f8db53ccae	brw: Fix comparison with unordered_mode when making baked dependency The unordered mode stored in dependencies might be a bitmask and not only a single mode. In practice, only the "stronger" mode will stick. Make sure that the code testing for the mode uses "&" instead of "==", to avoid prevent some valid combinations to happen, e.g. ``` // ... add(16) g104<1>F g94<1,1,0>F g34<1,1,0>F { align1 1H @7 $7.dst compacted }; ``` which without the fix ends up as ``` // ... sync nop(1) null<0,1,0>UB { align1 WE_all 1N F@7 }; add(16) g104<1>F g94<1,1,0>F g34<1,1,0>F { align1 1H $7.dst compacted }; ``` Enables two tests for the scoreboard pass that illustrate this case. For measuring the effect, re-enabled the sync.nop accounting on total of instructions and got the following results. ``` Totals: Instrs: 322041261 -> 321748285 (-0.09%) Cycle count: 22864587567 -> 22863073741 (-0.01%) Max dispatch width: 7989040 -> 7989024 (-0.00%); split: +0.00%, -0.00% Totals from 88212 (9.78% of 902056) affected shaders: Instrs: 102282050 -> 101989074 (-0.29%) Cycle count: 12787629859 -> 12786116033 (-0.01%) Max dispatch width: 525336 -> 525320 (-0.00%); split: +0.01%, -0.01% ``` Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36096>	2025-07-14 20:28:54 +00:00
Caio Oliveira	1e18a2d1a8	brw: Add scoreboard test for edge case involving baked dependency This is disable because it is adding a `sync.nop` instead of baking together both "@3 $0.dst". Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36096>	2025-07-14 20:28:54 +00:00
jhananit	a74ac59220	anv: Remove NIR_PASS_V usage Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> anv: Fix for metadata failure Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35889>	2025-07-14 19:25:52 +00:00
jhananit	debd903a00	intel: Update all NIR_PASS_V to NIR_PASS Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35889>	2025-07-14 19:25:52 +00:00
Jordan Justen	f19e2e69e9	anv: Set Xe3 as supported Backport-to: 25.1 Ref: `16a835ed3d` ("anv: Drop "not yet supported" warning for Xe2") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31893>	2025-07-14 18:53:48 +00:00
Valentine Burley	84923ccfe9	iris/ci: Lower concurrency of iris-cml-traces Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36074>	2025-07-14 08:15:25 +00:00
Valentine Burley	2b50f93fb0	iris/ci: Add a performance traces job on ADL Add a new `iris-adl-traces-performance` job, which runs the same set of traces as the `zink-anv-adl-traces-performance` job. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36074>	2025-07-14 08:15:25 +00:00
Valentine Burley	7d298e3c4b	iris/ci: Simplify performance trace template The `.profile-traces` template was nearly identical to `.piglit-performance-base`, differing only by one additional variable. Since all jobs extending `.piglit-performance-base` were already using `EGL_PLATFORM: surfaceless`, that setting has been moved into the base template, allowing `.profile-traces` to be simplified. This also hides the performance traces jobs from non-Marge pipelines, as intended. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36074>	2025-07-14 08:15:25 +00:00
Sagar Ghuge	36172c41dc	intel/compiler: Drop unused param from set_memory_address Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36092>	2025-07-14 03:46:21 +00:00
Caio Oliveira	887642b0f2	intel: Add INTEL_DEBUG=no-vrt Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Add support for disabling the VRT (Variable Register Thread) feature. The strategy here is to force the old BRW_MAX_GRF limit for the register allocator (locks the upper limit) and make sure ptl_register_blocks() always return that amount of blocks (locks the lower limit). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35781>	2025-07-13 21:11:02 +00:00
Yiwei Zhang	b2a880b85e	hasvk: adopt wsi_common_get_memory Similar to anv. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36095>	2025-07-13 07:49:10 +00:00
Yiwei Zhang	c647c422db	hasvk: avoid leaking private binding for aliased wsi image This time for hasvk and is the same with https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35893 Aliased wsi image has to share the same private binding with the original wsi image for memory consistency. If the private binding exists, it needs to be released before being overridden. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36095>	2025-07-13 07:49:10 +00:00
Yiwei Zhang	002235f64c	anv: adopt wsi_common_get_memory It's non-trivial to drop the private binding or transfer ownership to the bound memory. So we track the image in the device memory for dedicated allocation so that wsi image alias can find the original wsi image from the wsi memory. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36095>	2025-07-13 07:49:09 +00:00
Sagar Ghuge	e761c45390	anv: Set TG size based on number of threads Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Series shows improvement on TotalWarPharaoh-trace-dx11-1440p-ultra-n=2080 title by 0.96% (not a lot but still it's improvement, so will take that.) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35904>	2025-07-10 22:08:36 +00:00
Sagar Ghuge	5f1f67358c	blorp: Set TG size based on number of threads Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35904>	2025-07-10 22:08:36 +00:00
Sagar Ghuge	0c4e1c9efc	intel/common: Add helper for compute thread group dispatch size The recommended settings is just a guidance and not a programming requirement as per the Bspec. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35904>	2025-07-10 22:08:36 +00:00
José Roberto de Souza	59019a05f6	anv: Program DispatchWalkOrder and ThreadGroupBatchSize with optimized values for regular computer walkers It was only added to indirect compute walkers while HSD don't say anything about this optimization be specific to indirect compute walkers. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36058>	2025-07-10 20:54:30 +00:00
José Roberto de Souza	aea519cbc2	intel/blorp: Program DispatchWalkOrder and ThreadGroupBatchSize with optimized values for regular computer walkers It was only added to indirect compute walkers while HSD don't say anything about this optimization be specific to indirect compute walkers. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36058>	2025-07-10 20:54:30 +00:00
Eric Engestrom	89403487b1	hasvk/ci: disable jobs on anholt farm Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36024>	2025-07-10 18:15:36 +00:00
José Roberto de Souza	7aba9b3ebe	anv: Decode and print async submit batch when debug flag is set Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35986>	2025-07-10 16:21:05 +00:00
Ian Romanick	5adab50283	brw/nir: Use nir_opt_reassociate_matrix_mul This needs to be called before intel_nir_opt_peephole_ffma, so I arbitrarilly decided to call it right before. All Intel platforms had similar results. (Lunar Lake shown) total instructions in shared programs: 17120227 -> 17118227 (-0.01%) instructions in affected programs: 5854 -> 3854 (-34.16%) helped: 51 / HURT: 0 total cycles in shared programs: 895497762 -> 894733940 (-0.09%) cycles in affected programs: 4603518 -> 3839696 (-16.59%) helped: 95 / HURT: 21 LOST: 1 GAINED: 0 Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35925>	2025-07-09 19:28:49 +00:00
Yiwei Zhang	374d97f24c	hasvk: use AHARDWAREBUFFER_USAGE_CAMERA_MASK Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35785>	2025-07-09 03:47:07 +00:00
Yiwei Zhang	e394d29a75	hasvk: use common ANB swapchain gralloc usage query The usage bits issue probably isn't worth a separate backport for hasvk. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35785>	2025-07-09 03:47:07 +00:00
Yiwei Zhang	4f80b14d0c	anv: use AHARDWAREBUFFER_USAGE_CAMERA_MASK now that AHB header has it defined. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35785>	2025-07-09 03:47:07 +00:00
Yiwei Zhang	eb567fefc9	anv: use common ANB swapchain gralloc usage query Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35785>	2025-07-09 03:47:07 +00:00
Yiwei Zhang	8f4c938c1e	anv: fix ANB gralloc usage query to not append display usage bits The consumer of the Android surface may or may not be display. e.g. it can also be a media encoder. When BufferQueue makes the allocation, it takes the gralloc usage bits from both the client API (EGL/Vulkan) and the consumer side. Cc: mesa-stable Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35785>	2025-07-09 03:47:06 +00:00
Sviatoslav Peleshko	8d22eb960b	brw/disasm: Fix Gfx11 3src-instructions dst register disassembly The conversion from bit value to register file type is already done by the brw_eu_inst_3src_a1_dst_reg_file in the FFC macro now, so doing it again produced incorrect results. Fixes: `e7179232` ("intel/brw: Move encoding of Gfx11 3-src inside the inst helpers") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13141 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35960>	2025-07-08 19:49:09 +00:00
Daniel Schürmann	2c51a8870d	nir: add nir_vectorize_cb callback parameter to nir_lower_phis_to_scalar() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Similar to nir_lower_alu_width(), the callback can return the desired number of components for a phi, or 0 for no lowering. The previous behavior of nir_lower_phis_to_scalar() with lower_all=true can be elicited via nir_lower_all_phis_to_scalar() while the previous behavior with lower_all=false now corresponds to nir_lower_phis_to_scalar() with NULL callback. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35783>	2025-07-08 15:33:59 +00:00
Marek Olšák	8def3f865d	agx,freedreno,intel,lima,panfrost,svga,virgl,zink: fix supports_indirect_inputs The GLSL compiler always lowers inputs to temps for VS and GS, so exclude them from driver support because the GLSL compiler will no longer do that unconditionally. Thus, indirect VS and GS inputs are completely untested and broken in a lot of drivers. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35945>	2025-07-08 06:11:42 +00:00
Lionel Landwerlin	67e452669e	anv: do not rely on sampler objects for pipeline compilation Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Descriptor set layout lifetime can be shorter than what the implementation requires. One example is : * create descriptor set layout * create graphics pipeline library * destroy descriptor set layout * link optimize library in a final pipeline The last step might need the descriptor set layout information again. We've so far worked around this by taking a reference on the descriptor set layout in the pipelines. But we forgot that descriptor set layouts have pointers to samplers (for immutable & embedded samplers). We could take a reference to samplers but that sucks for various reasons : - it consumes dynamic state heap space - it could cause issues with capture-replay placement So instead we copy the information from the samplers that might be needed in cases like link optimization. This includes : - ycbcr conversion state (used for NIR lowering) - embedded sampler data (to recreate the sampler) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35955>	2025-07-07 18:53:53 +00:00
Lionel Landwerlin	98bc185376	anv: rework embedded sampler hashing Create a hashing key on all samplers so we can just copy that anywhere we need it. That key already contains the needed parameters for embedded samplers, so the sha1 stuff can go away. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35955>	2025-07-07 18:53:53 +00:00
Sushma Venkatesh Reddy	fa0232d961	intel/executor: Add missing dependency to fix intermittent build failures The executor build was failing randomly due to a missing dependency on `idev_intel_dev`. This patch adds the required dependency to the `meson.build` file to ensure consistent and reliable builds across different configurations. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35928>	2025-07-07 18:35:56 +00:00
Sushma Venkatesh Reddy	29fc96cb80	anv: Add GPU breakpoint before/after specific compute dispatch call Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13089 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35353>	2025-07-07 17:43:41 +00:00
Sushma Venkatesh Reddy	172e475705	intel: Add env variable to add break point on/before compute dispatch Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13089 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35353>	2025-07-07 17:43:40 +00:00
Alyssa Rosenzweig	d31cb824df	treewide: use VARYING_BIT_* Some checks failed macOS-CI / macOS-CI (dri) (push) Has been cancelled Details macOS-CI / macOS-CI (xlib) (push) Has been cancelled Details Via Coccinelle patch generated by the following Python: varys = [ "POS", "COL0", "COL1", "FOGC", "TEX0", "TEX1", "TEX2", "TEX3", "TEX4", "TEX5", "TEX6", "TEX7", "PSIZ", "BFC0", "BFC1", "EDGE", "CLIP_VERTEX", "CLIP_DIST0", "CLIP_DIST1", "CULL_DIST0", "CULL_DIST1", "PRIMITIVE_ID", "PRIMITIVE_COUNT", "LAYER", "VIEWPORT", "FACE", "PRIMITIVE_SHADING_RATE", "PNTC", "TESS_LEVEL_OUTER", "TESS_LEVEL_INNER", "PRIMITIVE_INDICES", "BOUNDING_BOX0", "BOUNDING_BOX1", "VIEWPORT_MASK", "CULL_PRIMITIVE" ] t = """ @@ @@ -(1 << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -(1ull << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD64_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} """ for v in varys: from mako.template import Template print(Template(t).render(V = v)) Closes: #13453 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> [panfrost, common] Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> [broadcom] Reviewed-by: Corentin Noël <corentin.noel@collabora.com> [virgl] Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> [zink] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35917>	2025-07-04 19:01:04 +00:00
Mike Blumenkrantz	956d3f1562	mesa/st: handle renderbuffer with null zsbuf this matches cbuf handling Fixes: `2eb45daa9c` ("gallium: de-pointerize pipe_surface") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35941>	2025-07-04 17:36:40 +00:00
Yiwei Zhang	b21e62b71a	anv: avoid leaking private binding for aliased wsi image Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Aliased wsi image has to share the same private binding with the original wsi image for memory consistency. If the private binding exists, it needs to be released before being overridden. Fixes: `d85a9d658f` ("anv/image: Call into WSI to create swapchain images") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35893>	2025-07-03 17:40:31 +00:00
José Roberto de Souza	4830aec8ad	anv: Reduce compiled code for Wa_16018063123 Wa_16018063123 is not a workaround that depends on stepping, so we can use the INTEL_WA_16018063123_GFX_VER macro to reduce code generate for non affected platforms. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:13 +00:00
José Roberto de Souza	926e6a94ad	anv: Do not emit batch_emit_fast_color_dummy_blit() for video engine Wa_16018063123 don't apply to video engine also video engine don't support XY_FAST_COLOR_BLT. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Fixes: `ec43c20182` ("anv: implement dummy blit for Wa_16018063123") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00
José Roberto de Souza	4618a99a4c	anv: Flush before invalidate aux map in copy and video engines BSpec: 43904 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `46f5359238` ("anv: Invalidate aux map for copy/video engine") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00
José Roberto de Souza	e68f81eaf6	anv: Read the correct register for aux table invalidation when in GPGPU mode in render engine For 3D or GPGPU modes the same render engine should be used, CCS register should only be used when using compute engine. Fixes: `46f5359238` ("anv: Invalidate aux map for copy/video engine") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35700>	2025-07-03 14:09:12 +00:00

1 2 3 4 5 ...

14260 commits