fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-09 08:58:02 +02:00

Author	SHA1	Message	Date
Marc Herbert	75fb91501e	docs: show which pkg-config Fedora uses for cross-compilation Learned the hard way. Only tested on Fedora but other RPM-based distros are likely to be the same. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31945>	2025-01-30 05:13:29 +00:00
Marc Herbert	0051807e7e	docs: show how to use ccache when cross-compiling On my desktop system, this gets compilation of (a subset of) 32bits Mesa from 2.5 minutes down to 15 seconds - and most of what left is linking. Also show that absolute paths are not required. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31945>	2025-01-30 05:13:29 +00:00
Marc Herbert	eaabf0e5f0	docs: cross-compile: add useful "apt" and "dnf" builddep commands These have been tested only with x86_64->i686 cross-compilation but I don't see why they couldn't work with other architectures if/when Linux distributions support them. For at least x86_64->i686 cross-compilation, these commands save an enormous amount of time. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31945>	2025-01-30 05:13:29 +00:00
Marc Herbert	56081c0b22	docs: add "apt-get build-dep" and "dnf buildep" So much easier and faster than installing every dependency one by one. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31945>	2025-01-30 05:13:28 +00:00
Caio Oliveira	f18dee3618	intel/brw: Fallback to SEND from SEND_GATHER if possible After optimization happen, if the sources are still in one or two contigous spans for some reason (e.g. some data read from memory now being written), it is beneficial to just use regular SEND and avoid having to set the ARF scalar instruction. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32410>	2025-01-30 04:43:58 +00:00
Caio Oliveira	b6b32933ad	intel/brw: Use SHADER_OPCODE_SEND_GATHER in Xe3 Add an optimization pass to turn regular SENDs into SEND_GATHERs. This allows the payload to be "broken" into smaller pieces that can be further optimized, which _may_ result in - less register pressure (no need to contiguous space), and - less instructions (no need to MOV to such space). For debugging, the INTEL_DEBUG=no-send-gather option skips this optimization, and reporting how many opportunities were missed. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32410>	2025-01-30 04:43:58 +00:00
Caio Oliveira	26d4d04d63	intel/brw: Add lowering for SHADER_OPCODE_SEND_GATHER Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32410>	2025-01-30 04:43:58 +00:00
Caio Oliveira	650ec7169d	intel/brw: Add SHADER_OPCODE_SEND_GATHER Starting in Xe3, there's a variant of SEND that take the register numbers from the ARF scalar register, and don't require them to be contiguous. The new opcode added here represents that kind of SEND. To make the original sources still reachable, we keep them around during the IR, just ignoring them at generator time. This allow software scoreboard to properly reason the dependencies without trying to decode the contents of ARF scalar register being used. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32410>	2025-01-30 04:43:58 +00:00
Caio Oliveira	2fca22347c	intel/brw: Plumb through generator whether SEND is gather variant Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32410>	2025-01-30 04:43:58 +00:00
Caio Oliveira	00fac79f99	intel/brw: Add scoreboard support for scalar register Xe3 adds a new pipe that handles only MOVs from immediate into the scalar register. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <None> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32410>	2025-01-30 04:43:57 +00:00
Daniel Schürmann	3868102a04	nir/loop_analyze: stack-allocate loop_info_state Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:36 +00:00
Daniel Schürmann	fbaabcfb0a	nir/loop_analyze: store nir_loop_induction_variable hash table in loop_info No need to create a separate array. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:36 +00:00
Daniel Schürmann	f327ece9bf	nir/loop_analyze: re-use the same nir_loop_variable struct before and after the increment The information is redundant. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:36 +00:00
Daniel Schürmann	de30bfd5b8	nir/loop_analyze: ignore terminating induction variable in guess_loop_limit() The array access might be using a different induction variable. Totals from 23 (0.03% of 79395) affected shaders: (Navi31) Instrs: 113742 -> 121017 (+6.40%) CodeSize: 592152 -> 636228 (+7.44%) Latency: 439244 -> 426784 (-2.84%) InvThroughput: 36264 -> 35199 (-2.94%) SClause: 3048 -> 3426 (+12.40%) Copies: 10630 -> 10733 (+0.97%) Branches: 3774 -> 4310 (+14.20%) PreSGPRs: 1683 -> 1696 (+0.77%) PreVGPRs: 1230 -> 1232 (+0.16%) VALU: 51026 -> 55912 (+9.58%) SALU: 15270 -> 15638 (+2.41%) SMEM: 4456 -> 5149 (+15.55%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:36 +00:00
Daniel Schürmann	7eb2e96d16	nir/loop_analyze: insert only induction vars into hash map Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:35 +00:00
Daniel Schürmann	f0fd04327f	nir/loop_analyze: replace nir_loop_variable array with hash table Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:35 +00:00
Daniel Schürmann	642a980c9e	nir/loop_analyze: don't initialize nir_loop_variable separately Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:35 +00:00
Daniel Schürmann	f11edceae3	nir/loop_analyze: directly record induction variables into nir_loop_info Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:35 +00:00
Daniel Schürmann	e639c4d74f	nir/loop_analyze: remove nir_loop_variable::in_loop This information is redundant. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:35 +00:00
Daniel Schürmann	7f244ced10	nir/loop_analyze: remove nir_loop_variable::in_if_branch and nir_loop_variable::in_nested_loop This information is redundant. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:35 +00:00
Daniel Schürmann	83f395a7ce	nir/loop_analyze: only iterate loop header phis in compute_induction_information() Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33131>	2025-01-30 03:48:35 +00:00
Daniel Schürmann	65f95ae74e	aco/insert_NOPs: implement VALU -> VALU case for VALUReadSGPRHazard on GFX12 Totals from 36918 (46.50% of 79395) affected shaders: (GFX1200) Instrs: 34997889 -> 35296429 (+0.85%); split: -0.00%, +0.85% CodeSize: 186161112 -> 187334364 (+0.63%); split: -0.00%, +0.63% Latency: 250265551 -> 250330784 (+0.03%); split: -0.00%, +0.03% InvThroughput: 41185298 -> 41192503 (+0.02%); split: -0.00%, +0.02% Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32682>	2025-01-30 03:13:16 +00:00
Daniel Schürmann	6c7355f0e6	aco/insert_NOPs: refactor VALUReadSGPRHazard detection Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32682>	2025-01-30 03:13:16 +00:00
Mike Blumenkrantz	4b0f2d1a2b	zink: refcount needs_present resource it's theoretically possible that this resource could be destroyed between flush_resource and flush...maybe Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33285>	2025-01-30 01:38:32 +00:00
Mike Blumenkrantz	c1e09c7309	zink: add zink_resource_reference() util function same as pipe version but using different types Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33285>	2025-01-30 01:38:32 +00:00
Mike Blumenkrantz	2d630952b0	zink: check for bound gfx stages before dereferencing this avoids a null deref in a pattern like bind TES->unbind TES with the same descriptor bound cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33285>	2025-01-30 01:38:32 +00:00
Caio Oliveira	fbacf3761f	intel: Add meson option -Dintel-elk Defaults to true. When set to false Iris and various tools can be built without ELK support. In both cases this means supporting only Gfx9+. This option must be true to build Crocus or Hasvk. This allows skipping re-building ELK when developing for newer platforms with tools/tests enabled. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11575 Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33054>	2025-01-30 00:45:59 +00:00
Caio Oliveira	31e5d909e7	intel/tools: Merge libaub into libintel_tools Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33054>	2025-01-30 00:45:59 +00:00
Caio Oliveira	ec2d20a70d	intel/tools: Add helpers for decoder_init/disasm Isolate the BRW/ELK differences in a single place. The way is done now, we are not reusing the isa_info between calls. For the tools here this is probably fine, if its someday this gets in the way, we can add an opaque pointer to store the right data. This intentionally is not used in Iris, since there the driver need more detailed view into BRW/ELK and we don't want to create an all encompassing abstraction for that. Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33054>	2025-01-30 00:45:59 +00:00
Caio Oliveira	aa2bd16dec	intel/tools: Use idep_libintel_common in meson Since the internal dependency object exists and is already used in some cases, let's be consistent. Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33054>	2025-01-30 00:45:59 +00:00
Francisco Jerez	d455d5d86c	anv/xe3+: Enable VRT. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	9c13506dd9	iris/xe3+: Enable VRT. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	dd1712515b	anv/xe3+: Set RegistersPerThread for bindless shader dispatch. v2: Use MOV and wrap in conditional during BTD spawn header setup (Lionel). Remove references to SIMD8 (Tapani). v3: Update brw_bsr() to specify number of registers per thread, don't initialize Registers Per Thread on BTD spawn header (Lionel). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	b25d0f899b	anv/xe3+: Set RegistersPerThread during shader state setup based on prog_data. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	7537f8edee	intel/blorp/xe3+: Set RegistersPerThread during shader state setup based on prog_data. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	2a12ea3df0	iris/xe3+: Set RegistersPerThread during shader state setup based on prog_data. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	f6a1c51de7	intel/genxml/xe3+: Update definitions for shader state setup. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	fb40b449cd	intel/brw: Define ptl_register_blocks() helper. Since this calculation will be needed in many places to set up the state of each shader stage. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	70fecb1483	intel/brw: Report number of GRF registers used in brw_stage_prog_data. This is similar to what we used to do on pre-SNB platforms, the number of GRF registers used by the shader will be used on Xe3+ to adjust the trade-off between thread-level parallelism and size of the GRF file. Plumb the value through prog_data so the driver can set up the hardware state accordingly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	6513bf65c3	intel/brw/xe3+: Optimize CS/TASK/MESH compile time optimistically assuming SIMD32. This is similar in principle to the previous commit "intel/brw/xe3+: brw_compile_fs() implementation for Xe3+." but applied to compute-like shader stages. It changes the implementation of brw_compile_cs/task/mesh() to reduce compile time and take advantage of wider dispatch modes more aggressively than the original logic, since as of Xe3 SIMD32 builds succeed without spills in most cases thanks to VRT. The new "optimistic" SIMD selection logic starts with the SIMD width that is potentially highest performance and only compiles additional narrower variants if that fails (typically due to spilling), while the old "pessimistic" logic did the opposite: It started with the narrowest SIMD width and compiled additional variants with increasing register pressure until one of them failed to compile. In typical non-spilling cases where we formerly compiled SIMD16 and SIMD32 variants of the same compute shader, this change will halve the number of backend compilations required to build it. XXX - Possibly don't do this in cases with variable workgroup size until effect on runtime performance can be measured directly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> v2: Don't do this for now in cases with variable workgroup size, still compile every possible variant in such cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Sagar Ghuge	7e1362e9c0	intel/brw/xe3+: Don't compile SIMD32 if there is ray queries Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	5b6906076e	intel/brw/xe3+: brw_compile_fs() implementation for Xe3+. This reworks the implementation of brw_compile_fs() to reduce compile time and take advantage of wider dispatch modes more aggressively than the original logic. The new "optimistic" PS compilation logic starts with the SIMD width that is potentially highest performance and only compiles additional narrower variants if that fails (typically due to spilling or hardware restrictions), while the old "pessimistic" logic did the opposite: It started with the narrowest SIMD width and compiled additional variants with increasing register pressure until one of them failed to compile. The main disadvantage of this is that selectively throwing away some of the compiled variants based on the static analysis of their performance behavior will no longer be possible, however this is expected to be less useful on Xe3+ since the GRF space allocated to a thread can be scaled up or down, which leads to less dramatic differences in scheduling between SIMD variants. In typical non-spilling cases where we formerly compiled SIMD16 and SIMD32 variants of the same fragment shader, this change will halve the number of backend compilations required to build a shader. With multi-polygon PS dispatch enabled (which is disabled by default right now) this has an even more dramatic effect since the number of compiler iterations can be reduced down to a fifth in the best case scenario. Even though in most cases we will only attempt to return a single binary from the pixel shader compilation, the hardware allows a pair of PS kernels to be specified, and we'll still take advantage of this when the multi-polygon PS kernel has the potential to have worse performance than the single-polygon shader because only the latter register-allocates successfully at SIMD32 -- Only in such case (SIMD2x8 multi-polygon, SIMD32 single-polygon) we'll continue programming both so the hardware will chose one or the other at runtime depending on the SIMD fullness and number of polygons it can buffer at runtime. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	1b2bd1fcb8	intel/brw: Exit early from run_fs() if compilation failed before optimization loop. This avoids running the optimizer uselessly if compilation of the current kernel failed due to some hardware (e.g. SIMD-width) restriction. This isn't only inefficient but it can break assumptions throughout the compiler which would lead to crashes on Xe3 when this arises during translation from NIR. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	afff3eb95e	intel/brw: Indent conditional block from brw_compile_fs() not applicable to Xe2+. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	d7d08ec2e2	intel/brw: Indent body of brw_compile_fs() not applicable to xe3+. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	d03eac3133	intel/brw/xe3+: Disable round-robin allocation heuristic on Xe3+. Xe3+ benefits from packing register allocations tightly in order to make optimal use of the GRF space. The round-robin heuristic previously in use often causes the whole GRF space to be used even if register pressure is substantially lower, which would severely decrease thread-level parallelism on Xe3+. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	a67ff3e7e3	intel/brw/xe3+: Bump number of SBID tokens for Xe3. Xe3 supports 32 SBID tokens per thread regardless of the number of register blocks allocated per thread. Take advantage of the increased number of SBIDs in the scoreboard pass to reduce the frequency of false dependencies on Xe3+. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	8d2331fe4b	intel/brw/xe3: Extend regalloc sets to maximum Xe3 GRF size. Extend our regalloc sets to 256 registers to match the maximum capacity of the GRF file on Gfx30. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	ca1636d457	intel/brw/xe3: Define XE3_MAX_GRF. Gfx30 supports up to 256 (512b) GRFs which requires a max GRF define of 512 in REG_SIZE units. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00
Francisco Jerez	67cb23a4b1	intel/common/xe2+: Allow SIMD32 PS for all multisample cases. These don't seem to be disallowed by recent hardware anymore. Stop disabling SIMD32 due to hardware restrictions of multisample rasterization, since it should have better performance, and on Xe3+ there may be no shader variant available other than SIMD32. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32664>	2025-01-29 23:39:32 +00:00

1 2 3 4 5 ...

201009 commits