fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-25 00:00:11 +01:00

Author	SHA1	Message	Date
Ian Romanick	a2292f53b5	nir: Optimize uniform vote_all and vote_any No shader-db changes on any Intel platform. fossil-db: All Ice Lake and newer platforms had similar results. (Ice Lake) Totals: Instrs: 165513303 -> 165511820 (-0.00%) Cycles: 15125314947 -> 15125211500 (-0.00%); split: -0.00%, +0.00% Totals from 82 (0.01% of 656120) affected shaders: Instrs: 544627 -> 543144 (-0.27%) Cycles: 22616493 -> 22513046 (-0.46%); split: -0.46%, +0.00% No fossil-db changes on Gfx9. Suggested-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:32 -08:00
Ian Romanick	535caaf3e0	nir: Optimize uniform iadd, fadd, and ixor reduction operations This adds optimizations for iadd, fadd, and ixor with reduce, inclusive scan, and exclusive scan. NOTE: The fadd and ixor optimizations had no shader-db or fossil-db changes on any Intel platform. NOTE 2: This change "fixes" arb_compute_variable_group_size-local-size and base-local-size.shader_test on DG2 and MTL. This is just changing the code path taken to not use whatever path was not working properly before. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. The min, max, iand, and ior exclusive_scan optimizations are not implemented. Broadwell on shader-db is not happy. I have not investigated. v2: Silence some warnings about discarding const. v3: Rename mbcnt to count_active_invocations. Add a big comment explaining the differences between the two paths. Suggested by Rhys. shader-db: All Gfx9 and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20300384 -> 20299545 (<.01%) instructions in affected programs: 19167 -> 18328 (-4.38%) helped: 35 / HURT: 0 total cycles in shared programs: 842809750 -> 842766381 (<.01%) cycles in affected programs: 2160249 -> 2116880 (-2.01%) helped: 33 / HURT: 2 total spills in shared programs: 4632 -> 4626 (-0.13%) spills in affected programs: 206 -> 200 (-2.91%) helped: 3 / HURT: 0 total fills in shared programs: 5594 -> 5581 (-0.23%) fills in affected programs: 664 -> 651 (-1.96%) helped: 3 / HURT: 1 fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Totals: Instrs: 165551893 -> 165513303 (-0.02%) Cycles: 15132539132 -> 15125314947 (-0.05%); split: -0.05%, +0.00% Spill count: 45258 -> 45204 (-0.12%) Fill count: 74286 -> 74157 (-0.17%) Scratch Memory Size: 2467840 -> 2451456 (-0.66%) Totals from 712 (0.11% of 656120) affected shaders: Instrs: 598931 -> 560341 (-6.44%) Cycles: 184650167 -> 177425982 (-3.91%); split: -3.95%, +0.04% Spill count: 983 -> 929 (-5.49%) Fill count: 2274 -> 2145 (-5.67%) Scratch Memory Size: 52224 -> 35840 (-31.37%) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 09:44:11 -08:00
Ian Romanick	c63ea755fe	intel/fs: Use nir_opt_uniform_subgroup shader-db: All Skylake and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20300435 -> 20300384 (<.01%) instructions in affected programs: 303 -> 252 (-16.83%) helped: 2 / HURT: 0 total cycles in shared programs: 842810326 -> 842809750 (<.01%) cycles in affected programs: 8374 -> 7798 (-6.88%) helped: 2 / HURT: 0 fossil-db: All Intel platforms (note below) had similar results. (Ice Lake shown) Instrs: 165559735 -> 165551893 (-0.00%) Cycles: 15133083961 -> 15132539132 (-0.00%); split: -0.00%, +0.00% Spill count: 45262 -> 45258 (-0.01%) Fill count: 74293 -> 74286 (-0.01%) Totals from 854 (0.13% of 656120) affected shaders: Instrs: 3461998 -> 3454156 (-0.23%) Cycles: 154252729 -> 153707900 (-0.35%); split: -0.36%, +0.01% Spill count: 2655 -> 2651 (-0.15%) Fill count: 3881 -> 3874 (-0.18%) DG2 did not see changes in spills or fills. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:38:45 -08:00
Ian Romanick	f10d1ef372	nir: Initial framework for optimizing uniform subgroup operations The first commit just optimizes operation where the result of the subgroup operation is the same as each of the individual channel results. This is a subset of the things optimized by ACO. See also https://gitlab.freedesktop.org/mesa/mesa/-/issues/3731#note_682802. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:38:31 -08:00
Ian Romanick	8fb37ef985	intel/fs: Add fast path for ballot(true) This doesn't help very much now. A later commit adds a NIR optimization pass, tentatively called nir_opt_uniform_subgroup, that converts many kinds of subgroup operations to things involving bitCount(ballot(true)). This commit makes a huge difference in the results of that later commit. No shader-db changes on any Intel platform. Fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Totals: Instrs: 165558033 -> 165557519 (-0.00%) Cycles: 15156188362 -> 15156178922 (-0.00%); split: -0.00%, +0.00% Totals from 299 (0.05% of 656117) affected shaders: Instrs: 88293 -> 87779 (-0.58%) Cycles: 3709498 -> 3700058 (-0.25%); split: -0.28%, +0.03% v2: Rebase on splitting ELK from BRW. Remove devinfo->ver >= 8 check. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:37:46 -08:00
Ian Romanick	c42830c64a	intel/fs: Use constant of same type to write flag Otherwise the compiler generates an extra MOV to load the constant into a register first because reasons. 🤷 vote_any, vote_all, vote_ieq, and vote_feq handling already do this. No shader-db changes on any Intel plaform. Fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Totals: Instrs: 165592451 -> 165557937 (-0.02%) Cycles: 15133282615 -> 15133059360 (-0.00%); split: -0.00%, +0.00% Totals from 33779 (5.15% of 656115) affected shaders: Instrs: 4396576 -> 4362062 (-0.79%) Cycles: 86867412 -> 86644157 (-0.26%); split: -0.37%, +0.11% Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:37:15 -08:00
Ian Romanick	b22fff90d5	intel/fs: Enable nir_opt_uniform_atomics in all shader stages The problem seems to have been related to nir_intrinsic_load_global_block_intel being marked as non-divergent. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on splitting ELK from BRW. Remove devinfo->ver >= 8 check. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:37:05 -08:00
Ian Romanick	75de4458a1	nir: Mark nir_intrinsic_load_global_block_intel as divergent This is divergent because it specifically loads sequential values into successive SIMD lanes. No shader-db or fossil-db changes on any Intel platform. Fixes: `9f44a26462` ("nir/divergence: handle load_global_block_intel") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:42 -08:00
Ian Romanick	56a3f031f4	intel/fs: Delete stale comment in nir_intrinsic_ballot implementation Discard actually uses f1.x, so this implementation of ballot is fine. Trivial. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:34 -08:00
Ian Romanick	5da5106727	nir: Add documentation for subgroup_.._mask v2: Fix reference to GL_ARB_shader_ballot. Noticed by Lionel. Suggested-by: Lionel Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:09 -08:00
Samuel Pitoiset	727170fee0	radv/ci: enable RADV_PERFTEST=shader_object for vkcts-tahiti-valve Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27812>	2024-02-27 14:28:54 +00:00
Martin Roukala (né Peres)	83efcd6a72	radv/ci: add a vkcts-tahiti job Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27812>	2024-02-27 14:28:54 +00:00
Sebastian Wick	0fe7e06f5f	vulkan/wsi/wayland: Use dispatch_queue_timeout in acquire_next_image Signed-off-by: Sebastian Wick <sebastian.wick@redhat.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27511>	2024-02-27 13:10:13 +00:00
Sebastian Wick	a00f9c401b	loader/wayland: Add fallback wl_display_dispatch_queue_timeout This is almost a 1:1 copy of the same function in libwayland. If the version with the symbol propagates far enough the fallback can be removed again. Signed-off-by: Sebastian Wick <sebastian.wick@redhat.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27511>	2024-02-27 13:10:13 +00:00
Sebastian Wick	df801bec1e	util: Add timespec_sub_saturate to avoid negative time for deadlines Signed-off-by: Sebastian Wick <sebastian.wick@redhat.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27511>	2024-02-27 13:10:13 +00:00
Connor Abbott	a80a23dc49	tu: Enable UBWC for storage images on a7xx I'm not sure exactly when this was introduced. It doesn't work on a650, but does work on a7xx, I'm not sure whether it works on the a660 generation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27506>	2024-02-27 07:34:15 -05:00
Connor Abbott	b9e04f8293	tu: Enable UBWC for SNORM formats on a740+ The fast-clear value is now the same for SNORM and UNORM, so our trick of reinterpreting SNORM as UNORM when copying now works with UBWC. We can also freely reinterpret UNORM, SNORM, and UINT formats, as tested by dEQP-VK.image.mutable.*. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27506>	2024-02-27 07:33:59 -05:00
Connor Abbott	4529b2ea54	tu: Reenable MSAA UBWC on a6xx gen1 This passes a full CTS run now, probably due to other fixes in the meantime. Closes: #7438 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27506>	2024-02-27 07:17:29 -05:00
Daniel Stone	dcbf61f5df	egl/dri: Use pipe_format instead of DRI_IMAGE_FORMAT Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	68eaf820e1	st/dri: Use pipe_format from config directly The driver already stores the pipe_format within the mode, so we can just reuse that rather than trying to re-derive it back from a hardcoded set of masks. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	832d814852	egl: Remove shifts/sizes from dri2_add_config argument All the callers now do their own filtering on driver configs to decide whether or not to add them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	e66305d681	egl/x11: Compare config shifts/sizes locally Rather than getting the egl_dri2 core to compare our shift/size arrays, do it in the caller, as X11 is the only caller that constructs configs like this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	273e54391a	egl/android: Remove hard-coded color-channel data We don't need to write out tables of the shift/size of every colour channel, because we already have that as part of the format description if we use pipe_format. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	d8d153d4c2	gbm: Remove hardcoded color-channel data We don't need to write out the shifts and sizes everywhere; we already have them as part of the format description. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	a10f10c4ce	gbm/dri: Query DRIImage for FourCC directly No need to go through the DRIImage format query when we can just get the FourCC directly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	6b04b64a21	egl: Use pipe_format for pbuffer configs There's no need to pass shifts and masks now; we can just take a list of pbuffer configs to add. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	630fda2492	egl: Automatically set EGLConfig ID There's no reason to make the users go through and count all their configs; just set it ourselves in the core. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	487016796b	egl/{surfaceless,device}: Remove check for EGLConfig presence We now check in the EGL core that we actually have some configs for the display, so no need to do it in the platform. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	e12a55c979	egl/android: Remove check for EGLConfig presence We now check in the EGL core that we actually have some configs for the display, so no need to do it in the platform. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	ac304d0393	egl/x11: Remove check for EGLConfig presence We now check in the EGL core that we actually have some configs for the display, so no need to do it in the platform. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	6770921918	egl/gbm: Remove check for EGLConfig presence We now check in the EGL core that we actually have some configs for the display, so no need to do it in the platform. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	bf7ebe6ff8	egl/wayland: Remove check for EGLConfig presence We now check in the EGL core that we actually have some configs for the display, so no need to do it in the platform. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	cbb6f62135	egl: Fail display creation if no EGLConfigs created If we failed to create any EGLConfigs, throw an error and refuse to create the display. This lets us eliminate a bunch of copy-and-wasted code throughout the drivers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	97c435c926	egl/wayland: Remove shift/size masks We don't need these anymore. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	7889c9dc92	egl/wayland: Use pipe_format for format names We don't need to carry our own strings. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	84bb6d0e0a	egl/wayland: Remove WL_SHM_* format listings We don't need to type all of these out, because they're exactly the same as the DRM ones, with the exception of ARGB8888 and XRGB8888 which carry their own format codes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	5ac70182e8	egl/wayland: Simplify alternate-format fallback for configs Rather than doing everything to add the config twice, we can simply reuse the helpers to check if the different formats are supported, then add the config only once. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	ee04cc5d2b	egl/wayland: Eliminate double loop for configs We used to loop like this: foreach (config_supported_by_driver) { foreach (config_wl_knows_about) { dri2_add_config(wl_config.rgba_masks, wl_config.rgba_shifts)) { if (wl_config.rgba_masks != driver_config.rgba_masks \|\| wl_config.rgba_shifts != driver_config.rgba_shifts) { return NULL; /* driver config != wl config */ } } } } This is a pretty painful way to discover the relationship between the different sets of configs, especially as we can just look up our Wayland visual entry directly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	f63e676370	egl/wayland: Use helper to look up visual No functional change. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	ebdf9ad85c	egl/wayland: Add helper to check server format support Often when we look up a dri2_wl_visual, the only thing we want to do with it is check whether or not the server supports it. Add a helper for this common pattern. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	a34d3c290e	egl/wayland: Use FourCC to look up wl_buffer support No point in using a DRI_IMAGE_FORMAT, since we'll need to get a FourCC for dmabuf/wl_drm support anyway. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	6a084e2b08	egl/wayland: Use pipe_format to look up configs Instead of using the sizes and shifts to look up a dri2_wl_visual from a config, just use the pipe_format we stash in the config. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	9ea9a963aa	egl/wayland: Fix EGL_EXT_present_opaque This extension has been broken ever since the initial commit. It created an XRGB DRIImage for the driver to render to, so whilst the presentation was opaque, the buffer also completely lacked an alpha channel. Fix it by making sure we only modify the FourCC we send to the Wayland server when creating a buffer. Closes: mesa/mesa#5886 Fixes: `9aee7855d2` ("egl: implement EGL_EXT_present_opaque on wayland") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	c74f480391	egl/wayland: Add opaque-equivalent FourCCs Add a mapping for the opaque version of an alphaful format (e.g. ARGB8888 -> XRGB8888) to better support EGL_EXT_present_opaque. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	084fea1e7d	egl/wayland: s/DRI_IMAGE_FORMAT/pipe_format/g All the DRI_IMAGE_FORMATs we use are now pipe_formats, so eliminate the old type names. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	ba3b867552	egl/wayland: Query image FourCC for linear copies This entire pattern really wants to be a shared helper, to allocate a shadow linear image from another device and then import it across into the rendering GPU. Querying the FourCC from the DRIImage makes it easier to pull out into shared code. This temporarily makes the implementation more ugly, however it's already pretty hard on the eyes, so probably no great loss. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	57c3343c2d	dri/kopper: Flatten pipe_format/DRIImage/FourCC conversion No need to go through DRI_IMAGE_FORMAT_* tokens; we can just directly convert from pipe_format to FourCC. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	3c46252570	dri/kopper: Reorder format tables Just make it more obvious that they're identical. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	d60415bad5	dri/kopper: Add translations for sRGB formats There was DRI_IMAGE_FORMAT -> FourCC format translation, but this was unreachable as we only ever called image_format_to_fourcc() on the result of get_dri_format(). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00
Daniel Stone	ca0f94c805	dri/kopper: Move format -> FourCC translation up a level There's no reason to use DRIImage formats here, given that it's only ever used to immediately convert to a FourCC. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27709>	2024-02-27 10:51:03 +00:00

1 2 3 4 5 ...

185352 commits