fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-08 23:40:12 +01:00

Author	SHA1	Message	Date
Lionel Landwerlin	9371e8d370	brw: fixup coarse_z computation The delivered values in the coarse pixel size are 0 when coarse pixel dispatch is disabled and that is screwing up our half pixel offset adjustment. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36457>	2025-07-30 07:57:19 +00:00
Lionel Landwerlin	9dac7dda87	brw: fixup source depth enabling with coarse pixel shading Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36457>	2025-07-30 07:57:18 +00:00
Lionel Landwerlin	68c50d129e	brw: fix NIR metadata invalidation with closest-hit shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36457>	2025-07-30 07:57:18 +00:00
Lionel Landwerlin	9430a1ed00	anv: remove unused helper arguments Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36457>	2025-07-30 07:57:18 +00:00
Lionel Landwerlin	fcd06aa8c3	anv: reuse runtime descriptor set layout base object Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36457>	2025-07-30 07:57:18 +00:00
Sagar Ghuge	3a9157a10b	anv: Use thread group preemption granularity Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36337>	2025-07-29 22:47:56 +00:00
Sagar Ghuge	9ae09d521c	intel/genxml: Update CS_CHICKEN1 register field Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36337>	2025-07-29 22:47:56 +00:00
Lucas Fryzek	ab52889d28	anv: Enable compression on astc emulation plane Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11108 Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30592>	2025-07-29 19:10:34 +00:00
José Roberto de Souza	07f5b53dd7	intel/brw: Remove duplicated implementation of brw_imm_uq/brw_imm_u64() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36448>	2025-07-29 16:05:54 +00:00
José Roberto de Souza	14386eb7e5	intel/brw: Add comment to reg_unit() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36448>	2025-07-29 16:05:54 +00:00
José Roberto de Souza	7981a18df2	intel/brw: Nuke unused brw_message_desc_header_present() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36448>	2025-07-29 16:05:53 +00:00
Antonio Ospite	6ffe6c2aef	anv: fix returning _Bool instead of pointer When building for C23 the compiler warns about returning a boolean when a different type is expected instead. Change the code to return NULL instead of false, fixing the following error: ----------------------------------------------------------------------- ../src/intel/vulkan_hasvk/anv_nir_apply_pipeline_layout.c: In function ‘nir_deref_find_descriptor’: ../src/intel/vulkan_hasvk/anv_nir_apply_pipeline_layout.c:230:14: error: incompatible types when returning type ‘_Bool’ but ‘nir_intrinsic_instr *’ {aka ‘struct nir_intrinsic_instr *’} was expected 230 \| return false; \| ^~~~~ ----------------------------------------------------------------------- Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36323>	2025-07-29 14:07:06 +00:00
Juston Li	e1ca09317e	anv/android: refactor anb resolve to fix align assertion Retrieving memory requirement size and alignment via anv_image_get_memory_requirements() returns 0 before surfaces are added by resolve_anb_image() and will assert in align64() when align is 0: Abort message: '../src/util/u_math.h:713: uint64_t align64(uint64_t, uint64_t): assertion "util_is_power_of_two_nonzero64(alignment)" failed' Refactor out anv_image_bind_from_gralloc() into resolve_anb_image() so the checks are performed after the surface is added. Resolving also requires API 29 so return VK_ERROR_EXTENSION_NOT_PRESENT without it. Fixes: `43cb986d9e` ("anv/android: resolve ANB swapchain images on bind") Signed-off-by: Juston Li <justonli@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36060>	2025-07-28 18:54:08 +00:00
Ian Romanick	fa74c31b22	brw: Allow additional flags registers on Xe2+ Xe2 adds two more flags registers. We barely use the second flags register on previous platforms, so the omission was not previously noticed. There are several efforts in progress that will add using of more flags registers. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35415>	2025-07-24 23:08:08 +00:00
Ian Romanick	1279f12c84	brw: Implement Wa_22012725308 for flags via SWSB too At this point, using the per-register granularity will only help in conjunction with fragment shader discard (which is implemented using f1). v2: Loop restructuring and code cleanups. Suggested by Curro. v3: Only apply Wa on Gfx12.5+. Suggested by Curro. v4: Also apply to implicit flag reads. Suggested by Curro. This version affects a lot more shaders (10,936 on Meteor Lake shader-db versus 4,482 before). The results are still very much in the 🤷 territory. v5: Add missing dependency. I thought I got them all the previous time. :( Noticed by Curro. shader-db: Lunar Lake total cycles in shared programs: 886315282 -> 886391040 (<.01%) cycles in affected programs: 204907250 -> 204983008 (0.04%) helped: 1 / HURT: 6716 LOST: 0 GAINED: 1 Meteor Lake and DG2 had similar results. (Meteor Lake shown) total cycles in shared programs: 883774789 -> 883921507 (0.02%) cycles in affected programs: 481836784 -> 481983502 (0.03%) helped: 4 / HURT: 10936 LOST: 3 GAINED: 7 fossil-db: Lunar Lake Totals: Cycle count: 32600441334 -> 32601862658 (+0.00%); split: -0.00%, +0.00% Totals from 90283 (11.44% of 789260) affected shaders: Cycle count: 17265933202 -> 17267354526 (+0.01%); split: -0.00%, +0.01% Meteor Lake and DG2 had similar results. (Meteor Lake shown) Totals: Cycle count: 26477292677 -> 26480321805 (+0.01%); split: -0.00%, +0.01% Max dispatch width: 8010440 -> 8010984 (+0.01%) Totals from 132952 (14.71% of 903925) affected shaders: Cycle count: 15349555348 -> 15352584476 (+0.02%); split: -0.00%, +0.02% Max dispatch width: 1085416 -> 1085960 (+0.05%) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35415>	2025-07-24 23:08:07 +00:00
Ian Romanick	1fdcc9039b	brw: Add and use brw_reg_is_arf to test for a specific ARF Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35415>	2025-07-24 23:08:07 +00:00
Mike Blumenkrantz	8a4ef5977e	anv: fix format compatibility check typo the view format is above this Fixes: `03cdb3078a` ("anv: Support multi-planar formats in anv_formats_are_compatible") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36224>	2025-07-24 00:11:29 +00:00
Caio Oliveira	8783828f3d	intel/genxml: Remove support for start/end attributes Keep the support in gen_sort_xml.py to allow it to still convert old MRs into the new format. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36138>	2025-07-23 16:02:14 +00:00
Caio Oliveira	c418cb85f7	intel/genxml: Convert field format from start/end to dword/bits And change the gen_sort_xml.py script to default to the new format. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36138>	2025-07-23 16:02:14 +00:00
Caio Oliveira	fb8f14820a	intel/genxml: Add support for dword/bits in fields to rest of the code Change code to temporarily support both the start/end old format and the dword/bits new format. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36138>	2025-07-23 16:02:14 +00:00
Caio Oliveira	001f207ee0	intel/genxml: Add support for dword/bits in fields to gen_sort_tags.py script Add a `--bits-format` argument to normalize the output to either of the formats described below. For now, defaults to the old format. The documentation in PRMs and BSpec describe the fields with the dword and the bit range. Using the same convention makes easier to spot issues. Old format: ``` <field name="Disable SLM Read Merge Optimization" start="38" end="38" type="bool" /> <field name="Pixel Async Compute Thread Limit" start="39" end="41" type="uint" prefix="PACTL"> ``` New format: ``` <field name="Disable SLM Read Merge Optimization" dword="1" bits="6:6" type="bool" /> <field name="Pixel Async Compute Thread Limit" dword="1" bits="9:7" type="uint" prefix="PACTL"> ``` For Groups, we store the dword and if needed a offset_bits, in case a group starts in a non-aligned position. Size and count for groups are not changed. Do this first for gen_sort_tags.py in case is convenient to have for the stable tree to convert future patches from the new back into the old format. Later patches will add support to the rest of the code. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36138>	2025-07-23 16:02:13 +00:00
Caio Oliveira	395672b013	intel/decoder/tests: Sort gentest.xml file Avoid noise when changing to the new GenXML field format -- which would try to also sort it. Also add this file to be checked as part of tests. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36138>	2025-07-23 16:02:13 +00:00
Jianxun Zhang	8d98bf289d	anv: Fix PAT entry in importing (xe2) If a compressed bo is imported, we should set the corresponding compressed PAT. Fixes video corruption in gamescope: gamescope --force-composition -- vkcube Close: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13442 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36275>	2025-07-22 20:18:44 +00:00
Jianxun Zhang	c766586957	anv: No compression on host memory allocation (xe2) The Xe kernel driver doesn't allow vm_bind on compressed bo if it has user pointer. And we probably shouldn't enable CCS compression on memory in any case. This change is necessary to prevent failures once we adjust the priority of compression PAT entries in a following commit: Vulkan CTS: dEQP-VK.api.buffer_marker.compute.external_host_mem.top_of_pipe. memory_dep.buffer_copy dEQP-VK.memory.external_memory_host.simple_allocation. minImportedHostPointerAlignment_x3 anv_kmd_backend.c:308: xe_vm_bind_op: Assertion `errno_ != EINVAL' failed. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36275>	2025-07-22 20:18:44 +00:00
Jordan Justen	bca1acbb42	intel/dev: Add WCL PCI IDs Tested with: commit 3a252ff9d8b6dc22b20463bfcb31a4e8992b0e8f Merge: 9800bf6fae3b 11895f375939 Author: Simona Vetter <simona.vetter@ffwll.ch> Date: Fri Jul 11 11:25:34 2025 +0200 Note that the kernel treats WCL similar to PTL, so 94de1dfd4729 ("drm/xe/ptl: Drop force_probe requirement") also removed the force_probe for WCL. Backport-to: 25.1 Ref: 3c0f211bc8fc ("drm/xe: Add Wildcat Lake device IDs to PTL list") Ref: 94de1dfd4729 ("drm/xe/ptl: Drop force_probe requirement") Ref: drm/drm-next 3a252ff9d8b6dc22b20463bfcb31a4e8992b0e8f Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36148>	2025-07-21 21:22:05 +00:00
Jordan Justen	8b771e8937	intel/dev: Add WCL device info Backport-to: 25.1 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36148>	2025-07-21 21:22:05 +00:00
Jordan Justen	ffabca9101	intel/dev/mesa_defs.json: Add WCL WA entries Backport-to: 25.1 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36148>	2025-07-21 21:22:05 +00:00
Jordan Justen	23bc204faa	intel/dev: Add WCL platform enum Backport-to: 25.1 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36148>	2025-07-21 21:22:05 +00:00
Nanley Chery	4de638ae1e	intel: Enable CCS_E on linear surfaces on Xe2+ Allow CCS for non-display linear surfaces in isl_surf_supports_ccs(). We're going to rely more on the helper to determine CCS-enabling for Xe2 on iris. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32120>	2025-07-21 18:36:31 +00:00
Nanley Chery	33795589ec	intel/isl: Only set CMF on renderable views on Xe2+ The compression format is only used during rendering. This prevents drivers from hitting an unreachable when we start enabling CCS on linear surfaces which may have non-renderable and non-pow2 formats. For now, continue to use the surface format instead of the view format to look up the CMF. This strategy should return the optimal CMF for compressed surfaces that undergo redescription during copies. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32120>	2025-07-21 18:36:31 +00:00
Nanley Chery	dd3b0de42c	anv: Disable fast-clears on linear surfaces Bspec 57340 does not have a fast-clear rectangle for linear surfaces. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32120>	2025-07-21 18:36:31 +00:00
Nanley Chery	93dbfea720	anv: Disable CCS if image bound to wrong heap on Xe2+ Avoids HIZ + CCS flushes and helps debug. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32120>	2025-07-21 18:36:31 +00:00
Alyssa Rosenzweig	6b34e2174e	nir: introduce ergonomic tex builder for intrinsics, we have these really nice builders using designated initializers + macros to specify optional indices. texture instrs have even more craziness involved, but we can do the same trick. this commit takes the existing "fixed form" deref-centric tex builders and generalizes them to work with non-deref textures, making it useful also for GL and late VK passes, while providing an API that strives to be ergonomic and consistent. this series only implements a subset of possible texture operations for now, but more generalizing could be added as people have need. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36050>	2025-07-21 12:11:41 +00:00
Alyssa Rosenzweig	ecfca8ec6f	util: crib SWAP macro from freedreno we have a bunch of copies across the tree, unify them. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36257>	2025-07-21 11:42:18 +00:00
Caio Oliveira	3c7dd0ccf1	brw: Make brw_builder() shader constructor use CFG if available Properly pick the end of the last block as a cursor. Also remove the default constructor since is not needed anymore. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34681>	2025-07-19 17:49:48 +00:00
Caio Oliveira	ab8af62745	brw: Use a builder to track position in lower_simd Removes brw_builder::at() since it is now unused, replaced by various other helpers. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34681>	2025-07-19 17:49:48 +00:00
Caio Oliveira	8826b1e680	brw: Use a more specific builder helper in combine constants Also remove commentary about older Gfx versions that don't apply anymore. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34681>	2025-07-19 17:49:47 +00:00
Caio Oliveira	ac2b072312	brw: Add more specific brw_builder helpers Replace uses of brw_builder::at() with various more descriptive variants. Use block pointer from instruction when possible. A couple of special cases remained and will be handled in separate patches. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34681>	2025-07-19 17:49:47 +00:00
Caio Oliveira	6c5132ec9a	brw: Move insert/remove code to the block Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34681>	2025-07-19 17:49:46 +00:00
Caio Oliveira	2dfd4dcbc5	brw: Fix cmat conversion between bfloat16 and non-float32 The HW only supports converting BRW_TYPE_BF values to/from BRW_TYPE_F, so intermediate conversion is needed. Move the intermediate conversion to the implementation of `@convert_cmat_intel` and simplify the brw_nir_lower_cooperative_matrix pass. This has two positive effects - Fixes conversion between BF and integer type cooperative matrices, that was still using the old emit_alu1 approach instead of the new code for `@convert_cmat_intel`. - Guarantee the intermediate conversion will result in a valid layout for conversions involved USE_B matrices. If we instead used the intrinsic twice in brw_nir_lower_cooperative_matrix.c, a matrix with invalid layout would be visible at NIR level and we wouldn't be able to keep the current assertion for USE_B case. Due to the configurations we have exposed, we still don't need to write a more complex USE_B conversion -- they are all between same size types (and, consequently, packing factors), so no shuffling of data is needed to respect the USE_B layout. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36185>	2025-07-18 21:55:43 +00:00
Ian Romanick	2594fcadd4	brw: Split virtual GRFs again at the end of optimizations Logical sends and load_payload can have large VGRFs that cannot be split. Once all of the lowering passes and optimization passes that might eliminate any of those instructions have completed, try to split larger VGRFs one last time. Register allocation can only handle VGRFs up to a certain size, so this is the last opportunity to prevent later failures due to VGRFs that are too large. Closes: #13239 shader-db: Lunar Lake, Meteor Lake, DG2, and Tiger Lake had similar results. (Lunar Lake shown) total instructions in shared programs: 17114494 -> 17114496 (<.01%) instructions in affected programs: 2790 -> 2792 (0.07%) helped: 2 / HURT: 4 total cycles in shared programs: 886617364 -> 886315282 (-0.03%) cycles in affected programs: 4067540 -> 3765458 (-7.43%) helped: 48 / HURT: 9 Ice Lake and Skylake had similar results. (Ice Lake shown) total instructions in shared programs: 20799801 -> 20799691 (<.01%) instructions in affected programs: 1210 -> 1100 (-9.09%) helped: 1 / HURT: 0 total cycles in shared programs: 865495386 -> 865498990 (<.01%) cycles in affected programs: 60132 -> 63736 (5.99%) helped: 2 / HURT: 1 total spills in shared programs: 3987 -> 3981 (-0.15%) spills in affected programs: 24 -> 18 (-25.00%) helped: 1 / HURT: 0 total fills in shared programs: 3535 -> 3519 (-0.45%) fills in affected programs: 36 -> 20 (-44.44%) helped: 1 / HURT: 0 fossil-db: All Intel platforms had similar results. 
(Lunar Lake shown) Totals: Instrs: 208647246 -> 208646499 (-0.00%); split: -0.00%, +0.00% Cycle count: 31257819536 -> 31263957016 (+0.02%); split: -0.02%, +0.04% Max live registers: 66160877 -> 66155728 (-0.01%) Totals from 34703 (4.91% of 707053) affected shaders: Instrs: 13766639 -> 13765892 (-0.01%); split: -0.02%, +0.01% Cycle count: 3693572086 -> 3699709566 (+0.17%); split: -0.15%, +0.32% Max live registers: 4843852 -> 4838703 (-0.11%) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36202>	2025-07-18 19:04:01 +00:00
Ian Romanick	f6da6399d7	brw/reg_allocate: Don't access out of bounds in non-debug builds In debug builds, the assertion should be preferred as it will highlight the actual problem. In non-debug builds, it is possible to fail register allocation more gracefully. If the problem only occurs in, for example, a SIMD32 version of a shader, the application may even continue to function. Closes: #13239 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36202>	2025-07-18 19:04:01 +00:00
Calder Young	389ce946fd	isl: Set tiling requirements for video surfaces Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35927>	2025-07-16 04:08:16 +00:00
Calder Young	07097c84ca	anv: Fix tiling for H.265 and VP9 video surfaces on GFX 12.5+ Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35927>	2025-07-16 04:08:16 +00:00
Lionel Landwerlin	440e2e9200	genxml: fix 3DSTATE_TE definition on Gfx12.[05] Since Gfx12+ the instruction is 5 dwords. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36146>	2025-07-16 01:01:11 +00:00
Lionel Landwerlin	ac78693b6a	intel/genxml: rename body field So that the body field has the same name in COMPUTE_WALKER & EXECUTE_INDIRECT_DISPATCH. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36146>	2025-07-16 01:01:11 +00:00
Ian Romanick	b57bad1fd7	brw/reg_allocate: Check source / destination hazard for all larger SIMD All platforms needs this check for SIMD32. Xe2+ do not need this for SIMD16. Also... delete some really stale comments about Gfx4/Gfx5. This compiler doesn't even support those platforms. No shader-db changes on any pre-Xe2 Intel platforms: shader-db: Lunar Lake total instructions in shared programs: 17108867 -> 17108855 (<.01%) instructions in affected programs: 35211 -> 35199 (-0.03%) helped: 19 / HURT: 6 total cycles in shared programs: 885026794 -> 885805580 (0.09%) cycles in affected programs: 140449880 -> 141228666 (0.55%) helped: 903 / HURT: 1142 LOST: 0 GAINED: 25 fossil-db: Lunar Lake Totals: Instrs: 208578317 -> 208574097 (-0.00%); split: -0.00%, +0.00% Cycle count: 31268800798 -> 31259914590 (-0.03%); split: -0.10%, +0.07% Spill count: 504472 -> 504102 (-0.07%); split: -0.09%, +0.02% Fill count: 606581 -> 606079 (-0.08%); split: -0.13%, +0.05% Scratch Memory Size: 35001344 -> 34957312 (-0.13%) Totals from 60714 (8.59% of 706970) affected shaders: Instrs: 48923370 -> 48919150 (-0.01%); split: -0.01%, +0.01% Cycle count: 11830486210 -> 11821600002 (-0.08%); split: -0.27%, +0.20% Spill count: 397150 -> 396780 (-0.09%); split: -0.12%, +0.02% Fill count: 469651 -> 469149 (-0.11%); split: -0.17%, +0.06% Scratch Memory Size: 25971712 -> 25927680 (-0.17%) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:44 +00:00
Ian Romanick	7e98ca89f2	brw/reg_allocate: Adjust source / destination hazard conditions for broadcast Broadcast selects one lane from the source to write to all the lanes of the destination. This makes it possible for the first half to overwrite the source used by the second half. No shader-db changes on any Intel platform. fossil-db: Lunar Lake Totals: Instrs: 208705405 -> 208705374 (-0.00%); split: -0.00%, +0.00% Cycle count: 31274597098 -> 31273711544 (-0.00%); split: -0.00%, +0.00% Totals from 77 (0.01% of 707133) affected shaders: Instrs: 220177 -> 220146 (-0.01%); split: -0.02%, +0.00% Cycle count: 461694212 -> 460808658 (-0.19%); split: -0.33%, +0.14% No fossil-db changes on any other Intel platforms. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:44 +00:00
Ian Romanick	67dc02acc2	brw/reg_allocate: Only add interference for the source with the hazard shader-db: Lunar Lake total instructions in shared programs: 17105892 -> 17105732 (<.01%) instructions in affected programs: 55720 -> 55560 (-0.29%) helped: 29 / HURT: 24 total cycles in shared programs: 884342344 -> 884663448 (0.04%) cycles in affected programs: 154776382 -> 155097486 (0.21%) helped: 719 / HURT: 761 total spills in shared programs: 3278 -> 3262 (-0.49%) spills in affected programs: 320 -> 304 (-5.00%) helped: 4 /HURT: 0 total fills in shared programs: 1632 -> 1616 (-0.98%) fills in affected programs: 368 -> 352 (-4.35%) helped: 4 / HURT: 0 LOST: 3 GAINED: 4 No shader-db changes on any other Intel platforms. fossil-db: Lunar Lake Totals: Instrs: 208696275 -> 208692511 (-0.00%); split: -0.00%, +0.00% Cycle count: 31325252074 -> 31274118190 (-0.16%); split: -0.27%, +0.11% Spill count: 504809 -> 504472 (-0.07%); split: -0.07%, +0.01% Fill count: 607047 -> 606581 (-0.08%); split: -0.08%, +0.01% Scratch Memory Size: 35037184 -> 35001344 (-0.10%); split: -0.11%, +0.01% Totals from 44135 (6.24% of 707112) affected shaders: Instrs: 39570465 -> 39566701 (-0.01%); split: -0.01%, +0.00% Cycle count: 11140437886 -> 11089304002 (-0.46%); split: -0.76%, +0.30% Spill count: 279756 -> 279419 (-0.12%); split: -0.13%, +0.01% Fill count: 354706 -> 354240 (-0.13%); split: -0.14%, +0.01% Scratch Memory Size: 18758656 -> 18722816 (-0.19%); split: -0.20%, +0.01% Meteor Lake, DG2, Tiger Lake, Ice Lake, and Skylake had similar results. (Meteor Lake shown) Totals: Cycle count: 25377247343 -> 25377246251 (-0.00%); split: -0.00%, +0.00% Totals from 11 (0.00% of 806166) affected shaders: Cycle count: 899080 -> 897988 (-0.12%); split: -0.48%, +0.36% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:43 +00:00
Ian Romanick	4e05de7c3d	brw/reg_allocate: Require SIMD32 for destination / source interference on Xe2 No platforms other than Lunar Lake were affected in shader-db or fossil-db for obvious reasons. shader-db: Lunar Lake total instructions in shared programs: 17070074 -> 17069908 (<.01%) instructions in affected programs: 151939 -> 151773 (-0.11%) helped: 61 / HURT: 60 total cycles in shared programs: 891338314 -> 880188516 (-1.25%) cycles in affected programs: 550482120 -> 539332322 (-2.03%) helped: 8053 / HURT: 7183 total spills in shared programs: 3294 -> 3278 (-0.49%) spills in affected programs: 138 -> 122 (-11.59%) helped: 8 / HURT: 0 total fills in shared programs: 1653 -> 1632 (-1.27%) fills in affected programs: 212 -> 191 (-9.91%) helped: 8 / HURT: 0 LOST: 96 GAINED: 70 fossil-db: Lunar Lake Totals: Instrs: 208555066 -> 208509387 (-0.02%); split: -0.03%, +0.00% Cycle count: 31487691872 -> 31318442816 (-0.54%); split: -0.88%, +0.34% Spill count: 508701 -> 504809 (-0.77%); split: -0.86%, +0.10% Fill count: 612583 -> 607047 (-0.90%); split: -1.03%, +0.13% Scratch Memory Size: 35311616 -> 35037184 (-0.78%); split: -0.81%, +0.04% Totals from 214417 (30.33% of 706852) affected shaders: Instrs: 123732970 -> 123687291 (-0.04%); split: -0.04%, +0.01% Cycle count: 27410928904 -> 27241679848 (-0.62%); split: -1.01%, +0.39% Spill count: 452458 -> 448566 (-0.86%); split: -0.97%, +0.11% Fill count: 550991 -> 545455 (-1.00%); split: -1.15%, +0.14% Scratch Memory Size: 31138816 -> 30864384 (-0.88%); split: -0.92%, +0.04% Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35903>	2025-07-15 19:35:43 +00:00
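
The start/end to dword/bits field conversion described in commit 001f207ee0 can be illustrated with a small sketch. This is not the code from gen_sort_tags.py or gen_sort_xml.py; the function name `start_end_to_dword_bits` is hypothetical, and the handling of fields wider than one dword (bit positions counted relative to the starting dword) is an assumption. It only reproduces the two examples given in the commit message.

```python
# Hypothetical helper, not taken from Mesa: converts an absolute bit range
# (old "start"/"end" attributes) into a dword index plus a "high:low" bit
# range relative to that dword (new "dword"/"bits" attributes).
DWORD_BITS = 32


def start_end_to_dword_bits(start: int, end: int) -> tuple[int, str]:
    if start < 0 or end < start:
        raise ValueError(f"invalid bit range: start={start}, end={end}")
    dword = start // DWORD_BITS        # dword that contains the first bit
    low = start - dword * DWORD_BITS   # lowest bit, relative to that dword
    # Assumption: for fields that cross a dword boundary, the high bit is
    # still counted from the starting dword (so it may exceed 31).
    high = end - dword * DWORD_BITS
    return dword, f"{high}:{low}"


if __name__ == "__main__":
    # The two fields quoted in the commit message.
    for name, start, end in [
        ("Disable SLM Read Merge Optimization", 38, 38),
        ("Pixel Async Compute Thread Limit", 39, 41),
    ]:
        dword, bits = start_end_to_dword_bits(start, end)
        print(f'{name}: dword="{dword}" bits="{bits}"')
```

For the fields quoted in that commit, start="38" end="38" maps to dword="1" bits="6:6", and start="39" end="41" maps to dword="1" bits="9:7", which matches the old and new format examples shown in the message.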


14304 commits