fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 02:48:06 +02:00

Author	SHA1	Message	Date
Rohan Garg	3e46ee61d5	intel/fs/xe2+: Lift CPS dispatch width restrictions on Xe2+. These restrictions don't seem to be applicable anymore, and limiting to SIMD8 wouldn't work since we're no longer building shaders with that dispatch width. [ Francisco: This one-liner change was squashed by Rohan Garg into a previous version of my patch "Stop building SIMD8 programs", but it makes more sense as a separate commit -- Formatted as a separate patch. ] Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26605>	2023-12-22 10:37:00 -08:00
Ian Romanick	84b53e1a54	intel/fs/xe2+: Pass correct dispatch_width to fs_generator for geometry-processing stages. Instead of hard-coding a dispatch_width value which is no longer correct on Xe2+. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26605>	2023-12-22 10:37:00 -08:00
Francisco Jerez	3f92dde55e	intel/fs/xe2+: Stop building SIMD8 shaders for geometry stages (VS/TCS/TES/GS). They are no longer suppored by the fixed-function hardware. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26605>	2023-12-22 10:37:00 -08:00
Francisco Jerez	6877916155	intel/fs/xe2+: Stop building SIMD8 fragment shaders. They are no longer suppored by the fixed-function hardware. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26605>	2023-12-22 10:37:00 -08:00
Francisco Jerez	7397ba61c2	intel/fs/xe2+: Stop building SIMD8 compute-like shaders (CS/BS/TS/MS). SIMD8 kernels are no longer able to utilize the ALUs efficiently, since they have twice the vector width as previous platforms. However even though there aren't many reasons to use it, SIMD8 is still supported by the instruction set technically, and it will still be used for some SIMD-lowering sequences. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26605>	2023-12-22 10:37:00 -08:00
Francisco Jerez	69cc72e50a	anv/gfx12: Hook up dual-SIMD8 fragment shader dispatch. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	4ec54e84da	iris/gfx12: Hook up dual-SIMD8 fragment shader dispatch. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	ccb5795938	intel/gfx12: Enable SIMD8 dispatch in 3DSTATE_PS for FS multipolygon dispatch. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	4868408e6e	intel/genxml: Add 3DSTATE_PS definitions needed for dual-SIMD8 dispatch on Gfx12+. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	1f2c44dc21	intel/compiler: Attempt to build dual-SIMD8 variant of fragment shaders on gfx12+ platforms. Similar to other FS dispatch modes, attempt to build a dual-SIMD8 program if the regular SIMD8 program didn't spill and doubling the amount of space for varyings doesn't cause us to go over the thread payload limit. Dual-SIMD8 builds in combination with coarse pixel shading are currently not handled. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	261d07f398	intel: Add debug flag for enabling dual-SIMD8 fragment shader dispatch. Note that this option isn't enabled by default yet pending additional performance evaluation. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	28aec45eed	intel/fs/gfx12: Implement multi-polygon format of render target array index in PS payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	5b1ab77423	intel/fs/gfx12: Implement multi-polygon format of back/front-facing flag in PS payload. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	4672fcbc76	intel/fs: Fix PS thread payload setup for depth_w_coef_reg. It's not replicated per SIMD16 half of a SIMD32 thread on the PS payload. Make fs_visitor::payload::depth_w_coef_reg a scalar rather than an array. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	09ea840987	intel/fs: No need to copy null destinations in lower_simd_width. The copy would be discarded immediately. Until now we were relying on DCE to eliminate these, but it seems like in some cases MOVs into the null register emitted by lower_simd_width() are never eliminated, likely because a lower_simd_width() call has been introduced close to the bottom of optimize() which isn't follow by any additional DCE passes. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	5e0760a993	intel/fs/gfx12: Don't consider multipolygon PS to have packed dispatch. This fixes a number of regressions and hangs in multipolygon fragment shaders that have FIND_LIVE_CHANNEL sequences which would otherwise lead to access of a dead channel. Note that the failures don't seem to be reproducible in simulation. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	8f92baa5d3	intel/fs/gfx12+: Don't set nir_divergence_single_prim_per_subgroup option for fragment shaders. Flat-shaded inputs and other per-primitive values can no longer be considered to be uniform across fragment shader subgroups due to multipolygon dispatch. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	6bf99e6a45	intel/compiler: Don't change types for copies from ATTR file. Since the <8;8,0> regions they use in multipolygon mode could violate regioning restrictions in some cases, depending on the execution type of the instruction. Note that the assertion is removed from try_copy_propagate() since a more accurate check is used within that function than what fs_inst::can_change_types() can do. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	2ed36050fb	intel/fs: Don't copy-propagate ATTR registers in multi-polygon FS shaders when invalid. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Jordan Justen	3f89fa63e6	intel/compiler: Pass max_polygons to copy-prop from fs_visitor. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	b62ad4e028	intel/fs: Rework layout of FS vertex setup data in ATTR file to support multi-polygon dispatch. The updated layout includes one copy of each plane parameter per channel of the SIMD thread, in order to allow channels to process different polygons. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	a844c0b185	intel/fs: Fix fs_reg::component_size() to handle two-dimensional register regions. Add code to calculate the size in bytes of arbitrary two-dimensional regions for FIXED_GRF and ARF registers. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:31 +00:00
Francisco Jerez	83a0252e8d	intel/fs: Pass builder to per_primitive_reg(). Matches prototype of interp_reg(), will be useful in a subsequent commit. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	8e9f09dbe5	intel/fs: Provide component index explicitly to interp_reg(). Main motivation is that for multipolygon PS shaders the i-th plane parameter for the j-th input attribute will no longer necessarily be a scalar, since different channels may be processing different polygons with different input plane parameters, so simply taking a component() of the result of interp_reg() will no longer work. Instead of duplicating the multipolygon handling logic in every caller of interp_reg(), fold the component() call into interp_reg() so we can replace it with multipolygon-correct code more easily. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	742a575bd6	intel/fs: Consider ATTR registers with different fs_reg::nr as belonging to disjoint register spaces. Instead of treating fs_reg::nr as an offset for ATTR registers simply consider different indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. See "intel/fs: Map all GS input attributes to ATTR register number 0." for the rationale. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	2d26ed6688	intel/fs: Assert fs_reg::nr is always zero for ATTR registers in geometry stages. Instead of treating fs_reg::nr as an offset for ATTR registers simply consider different indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. See "intel/fs: Map all GS input attributes to ATTR register number 0." for the rationale. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	b26cf8b189	intel/fs: Map all TES input attributes to ATTR register number 0. Instead of treating fs_reg::nr as an offset for ATTR registers simply consider different indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. See "intel/fs: Map all GS input attributes to ATTR register number 0." for the rationale. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	ef12565a37	intel/fs: Map all VS input attributes to ATTR register number 0. Instead of treating fs_reg::nr as an offset for ATTR registers simply consider different indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. See "intel/fs: Map all GS input attributes to ATTR register number 0." for the rationale. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	1d22721b5a	intel/fs: Map all GS input attributes to ATTR register number 0. The fs_reg::nr field currently has a somewhat inconsistent meaning for ATTR registers depending on the shader stage. In geometry stages it has a similar effect as fs_reg::offset except it's expressed in 32B units instead of B units. In the PS however it's expressed in units of logical scalar attributes (16B on present platforms), which isn't currently handled correctly throughout the back-end since some places assume 32B units in all cases. The different format of the PS setup data in multi-polygon dispatch modes would make its behavior even more irregular, which would be worsened further (for both geometry and pixel stages) by the register size changes coming up on Xe2, particularly in brw_ir_fs.h helpers where neither the devinfo struct nor the shader stage are available. Instead of treating it as an offset simply consider different fs_reg::nr indices as denoting disjoint spaces that can never be accessed simultaneously by a single region. From now on geometry stages will just use ATTR #0 for everything and select specific attributes via offset() with the native dispatch width of the program, which should work on current platforms as well as on Xe2+. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	e4aca2ebaa	intel/fs: Add separate constructor of fs_visitor for fragment shaders. To allow specifying the number of polygons that will be processed per SIMD thread. Rework: * Jordan: Add needs_register_pressure following `09cdb77a92` ("intel/fs: report max register pressure in shader stats") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	1eff2fcb62	intel/compiler: Add polygon count statistic to brw_compile_stats. And use it in ANV in order to return a "SIMDNxM" name from vkGetPipelineExecutablePropertiesKHR. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	ccf9174655	intel/compiler: Add multipolygon dispatch fields to brw_wm_prog_data. Add fields that track the number of polygons processed per PS SIMD thread (note that this might be lower than the value that was specified to the compiler via brw_compile_fs_params if compilation at the desired polygon count wasn't possible), and the dispatch width of the multi-polygon PS kernel. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Francisco Jerez	e7b1993376	intel/compiler: Add max_polygons FS compilation parameter. Add a brw_compile_fs_params parameter that specifies to the compiler the maximum number of polygons that may be processed in parallel per PS SIMD thread. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26585>	2023-12-22 18:05:30 +00:00
Caio Oliveira	6fccacda1e	compiler/types: Use a typedef for glsl_type Most of the code now will see `const glsl_type ` instead of `const struct glsl_type `. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26708>	2023-12-22 07:53:25 -08:00
Caio Oliveira	550fdc2026	compiler/types: Remove glsl_type C++ helpers All code now use the C functions. Remove glsl_type_impl.h that contained the inline C++ wrappers around those. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:51:01 -08:00
Caio Oliveira	d06f0305f6	glsl: Use glsl_type C helpers Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:51:01 -08:00
Caio Oliveira	db5f73dc9f	compiler/types: Add a few more glsl_type C helpers These will be used once the C++ ones are removed. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Caio Oliveira	6af93b1801	lima: Use glsl_type C helpers Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Caio Oliveira	7d0d4a494e	mesa: Use glsl_type C helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Caio Oliveira	582c20c431	nir: Use glsl_type C helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Caio Oliveira	cc809d4de9	nouveau: Use glsl_type C helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Caio Oliveira	2cbc318193	r600/sfn: Use glsl_type C helpers In one case, just used glsl_without_array instead of checking if its an array to decide to use. Using that helper with a non-array type just returns the type. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Caio Oliveira	55cde229d5	intel/compiler: Use glsl_type C helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26707>	2023-12-22 06:44:23 -08:00
Yonggang Luo	1e6fcd6a61	dzn: Remove #if D3D12_SDK_VERSION blocks now that 611 is required Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26794>	2023-12-22 13:35:35 +00:00
Rob Clark	8023ede00a	ci: Remove per-driver wayland-dEQP-EGL xfails Since these are not driver specific and have been added to all-skips.txt, remove them from per-driver CI expectations. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26779>	2023-12-22 11:13:23 +00:00
Rob Clark	c2bb95653b	ci: Add wayland-dEQP-EGL.functional.render.* skips These appear to be failing in the same way as the color_clears tests. Same results on llvmpipe and freedreno (as with the color_clears tests) so it does not appear to be a driver specific issue. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26779>	2023-12-22 11:13:23 +00:00
Rob Clark	4261621a7e	ci: List specific color_clears skips No need to throw the baby out with the bathwater. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26779>	2023-12-22 11:13:23 +00:00
Rob Clark	dbe5b8b5a4	ci: More context for color_clear skips for Wayland Add some more notes, so context isn't lost to time when someone gets around to digging deeper. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26779>	2023-12-22 11:13:23 +00:00
Dmitry Baryshkov	f49624fc97	freedreno/drm: fallback to default BO allocation if heap alloc fails Allow fd_bo_heap_alloc() to return NULL if the heap is exausted (or fragmented) instead of segfaulting. Then handle the error properly in bo_new(). Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26787>	2023-12-22 10:48:53 +00:00
Tapani Pälli	9e88c711a3	drirc/anv: disable FCV optimization for Baldur's Gate 3 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26754>	2023-12-22 09:47:19 +00:00

1 2 3 4 5 ...

182548 commits