fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 22:28:04 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	8cd7e94eca	iris: Add a separate PIPE_CONTROL_L3_READ_ONLY_CACHE_INVALIDATE bit This will let us use it without performing a VF cache invalidation, should we want to do that. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	b92cd58508	iris: Add an iris_is_domain_l3_coherent helper. The render, depth, sampler, and data (HDC) caches are all coherent with L3. We consider OTHER_READ and OTHER_WRITE to be non-coherent, as they're kitchen-sink domains which include non-L3-clients. Starting with Tigerlake, the VF cache is coherent with L3 (because we set the L3BypassDisable bit in the vertex/index buffer packets). Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	536eee31d0	iris: Fix UBO cache tracking for the !indirect_ubos_use_sampler case On Tigerlake, we use the data cache for reading indirect UBOs instead of the sampler. But we still use the constant cache for direct UBO access, so unfortunately we may access it through two different domains. To work around this, we add a new domain for pull constants (UBOs), which will be either constant+texture or constant+data. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	d39bd7ba70	iris: Split out an IRIS_DOMAIN_SAMPLER_READ domain from OTHER_READ The bulk of IRIS_DOMAIN_OTHER_READ domain usage was the 3D sampler, but there were also a few oddball cases like command streamer reads, blitter access, and so on. The sampler is definitely L3 coherent, but some off the more esoteric reads may not be, so I'd like to separate them, so that OTHER_READ can become a non-L3-coherent kitchen-sink domain. The sampler cases only need TEXTURE_CACHE_INVALIDATE, and can skip the CONSTANT_CACHE_INVALIDATE we had on IRIS_DOMAIN_OTHER_READ. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	8e0ff0275d	iris: Use IRIS_DOMAIN_DEPTH_WRITE for read only depth/stencil. We were using IRIS_DOMAIN_OTHER_READ for read-only depth/stencil access in an attempt to avoid unnecessary flushing; IRIS_DOMAIN_DEPTH_WRITE could indicate read-write access. However, IRIS_DOMAIN_OTHER_READ is clearly the wrong domain. Depth and stencil data is read via the depth cache, while IRIS_DOMAIN_OTHER_READ currently corresponds to the sampler cache and constant cache together (although this will change in future patches). It's unclear whether this hack was useful. For now, just drop it and use the correct depth cache domain, even if it's marked as read-write. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Gert Wollny	6a264e7024	virgl: Apply integer op fix only for ALU ops and clear modifiers For texture fetches and buffer load the fix is not needed, and the override creates faulty TGSI. In addition remove all modifiers from the src in the additional mov instruction. Fixes: `d1c7a7b131` virgl: Add an extra mov for int outputs from constant and immediate inputs v2: Move workaround after the use of virgl_tgsi_rewrite_src_for_input_temp (Emma) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15896>	2022-04-13 08:56:47 +00:00
Gert Wollny	29564031cf	r600: Assign shader type when creating a new CS state Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15898>	2022-04-13 08:48:13 +00:00
Kenneth Graunke	68ef895674	st/mesa: Transcode ASTC to BC7 (BPTC) where possible This patch adds support for transcoding ASTC to BC7 (BPTC) and prefers it over BC3 (DXT5) when hardware supports that format. BC7 is a much newer format (~2009 vs. ~1999) and offers higher quality than the older BC3 format. Furthermore, our encoder seems to be faster. Tapani put together a small benchmark for transcoding a 1024x1024 ASTC texture, and switching from BC3 to BC7 improves performance of that microbenchmark by 25% on my Tigerlake NUC (with hardware ASTC disabled so we can test this path). Presumably, this isn't fundamental to the formats, but rather reflects the speed of our in-tree compressors. So, we should use BC7 where possible. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15875>	2022-04-13 07:58:11 +00:00
Kenneth Graunke	d4521a2515	st/mesa: Make transcode_astc also check for non-SRGB format support This is probably unnecessary in that all drivers which support the sRGB format likely also support the non-sRGB format. But we may as well check both the formats we use, for documentation if nothing else. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15875>	2022-04-13 07:58:11 +00:00
Tomeu Vizoso	7d474c100e	ci: Move most stuff out of root .gitlab-ci.yml This file was getting a bit hard to navigate. Split container, build and test jobs to their own files. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	2a578c6505	ci: Allow local installations to build additional stuff into the rootfs This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	e81693a1b4	ci: Add env var to add packages to install in debian/arm_build image This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	79aef41881	ci: Add env var to add packages to install in rootfs This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	b46000f076	ci: Allow specifying a different kernel in LAVA jobs To make it possible to use a kernel different from that built along with the rootfs. This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	f7713b0af0	ci: Use CI_PROJECT_NAME instead of hardcoding 'mesa' This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Lionel Landwerlin	3394680368	nir/lower_shader_calls: name resume shaders Helpful when lost in a sea of NIR :) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15887>	2022-04-13 06:59:29 +00:00
Tomeu Vizoso	8506c2b7ee	ci: Disable Google's lab The runner is down and pipelines are being stuck. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15909>	2022-04-13 08:11:07 +02:00
Mike Blumenkrantz	c3ad1331be	zink: rework choose_pdev to (finally) be competent now zink will init using a priority system if multiple devices are available multiple devices will ONLY be available if: * the user does not specify VK_ICD_FILENAMES as they should * the user does not specify LIBGL_ALWAYS_SOFTWARE * multiple drivers exist I've prioritized the virtualized gpu here with the assumption that if such a thing is detected, the environment is most likely virtualized Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15857>	2022-04-13 00:14:57 +00:00
Mike Blumenkrantz	0c0ff57c61	aux/trace: clean up some zink+lavapipe tracing awfulness now that it's easier to determine whether zink is being used (mostly), this whole thing can be simplified Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15857>	2022-04-13 00:14:57 +00:00
Mike Blumenkrantz	d5ff82df38	zink: ZINK_USE_LAVAPIPE -> LIBGL_ALWAYS_SOFTWARE this is a documented variable, so reuse it Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15857>	2022-04-13 00:14:57 +00:00
Mike Blumenkrantz	42ff02de14	egl: don't make LIBGL_ALWAYS_SOFTWARE and MESA_LOADER_DRIVER_OVERRIDE=zink exclusive Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15857>	2022-04-13 00:14:57 +00:00
Indrajit Kumar Das	3abc66dc9f	ac/gpu_info: disallow displayable DCC for Navi12 and Navi14 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15813>	2022-04-12 23:52:24 +00:00
Jason Ekstrand	69b5424ea4	intel/nir: Lower 8 and 16-bit bitwise unops Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Jason Ekstrand	a482877c70	intel/fs: Implement 16-bit [ui]mul_high Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Jason Ekstrand	d0ace28790	nir/lower_int64: Fix [iu]mul_high handling `e551040c60`, which added a new mechanism for 64-bit imul which is more efficient on BDW and later Intel hardware also introduced a bug where we weren't properly walking both X and Y. No idea how testing didn't find this. Fixes: `e551040c60` ("nir/glsl: Add another way of doing lower_imul64 for gen8+" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6306 Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Mike Blumenkrantz	48ae404b42	kopper: print better error message if loader not detected silently failing on release builds is annoying Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15851>	2022-04-12 21:34:30 +00:00
Erico Nunes	cf1390e1b8	lima: fix vector const src referenced multiple times It can happen that a single vector constant is referenced multiple times by the same node, with different swizzles. This needs to be taken into account by checking and updating the swizzles for all the srcs of a target node when inserting the const node to the same instruction. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15726>	2022-04-12 20:07:32 +00:00
Mike Blumenkrantz	19a22ae110	features: mark off ARB_seamless_cubemap_per_texture for zink forgot to do this with the MR Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15902>	2022-04-12 19:12:57 +00:00
Gert Wollny	c3096e562d	ntt: translate nir_intrinsic_shader_clock Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15889>	2022-04-12 18:47:08 +00:00
Mike Blumenkrantz	dea65ae590	zink: finish up radv piglit baseline updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15900>	2022-04-12 14:00:47 -04:00
Konstantin Seurer	521492e8b1	radv: Refactor ray tracing support checks Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15860>	2022-04-12 16:13:38 +00:00
Konstantin Seurer	a9fce44dd6	radv: Refactor radv_tex_aniso_filter Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15860>	2022-04-12 16:13:38 +00:00
Mike Blumenkrantz	6b65d4234c	radv: set read/write without format flags for supported texel buffers if the storage case is supported, this should be supported too Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15826>	2022-04-12 15:52:03 +00:00
Samuel Pitoiset	2b688942c1	Revert "radv: Disable NGG for GS with suboptimal output vertex count." It breaks too many things and shouldn't have been merged. The fix isn't trivial and it will probably not be backported because it's intrusive. It will be re-applied later when everything will work. This reverts commit `94706601fa`. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15882>	2022-04-12 12:26:32 +00:00
Gert Wollny	e466d73368	r600: make r600_load_ar available to driver code This is needed for the new NIR assembler Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	050e05db22	r600: Set the last bit if an alu group is split by kcache allocation Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	d920200ad6	r600: Force last instruction of group when starting a new CF When emitting the AR forces splitting an ALU group, and at the same time a new CF instruction is started, then the last instrcution in the finished CF block might not have the "last" bit set, which results in an invalid shader that might hang, or crash SB. So when a new CF is started, force the last bit in the last ALU instruction. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	04fd9a6488	r600: don't reschedule INTERP_LOAD_P0 With the NIR code, we have instructions groups that use INTERP_LOAD_P0 that don't fill all slots. Just make sure the backend scheduler doesn't fill in INTERP_LOAD_P0 instructions with a different LDS location. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	3c4644afb0	r600: ignore dest sel for non-write targets when counting registers Since the value is not written, there is no need to allocate a register for it, so don't take it into account. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	67d145d9ab	r600: Don't limit scheduling of PARAM_SRC values ALU_SRC_PARAM_BASE is an inline constant that defines the address for pulling data from LDS memory for interpolation and not a value from the kcache, so there is no need to take these values into account when allocating kcache load slots. v2: Fix the constant range check to not exclude the translated ranges for kcache banks 2 and 3. v3: limit range check to only include kcache values and and rename relevant function (Emma). Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Rhys Perry	f6262804af	radv: increase inline push constant limit if we can inline all constants fossil-db (Sienna Cichlid): Totals from 665 (0.49% of 134627) affected shaders: CodeSize: 4519620 -> 4491724 (-0.62%); split: -0.62%, +0.01% Instrs: 842745 -> 837313 (-0.64%); split: -0.66%, +0.01% Latency: 7289925 -> 7279661 (-0.14%); split: -0.30%, +0.16% InvThroughput: 1240770 -> 1240639 (-0.01%); split: -0.01%, +0.00% VClause: 15799 -> 15772 (-0.17%) SClause: 33773 -> 32604 (-3.46%); split: -3.66%, +0.20% Copies: 67695 -> 64992 (-3.99%); split: -4.49%, +0.50% PreSGPRs: 38597 -> 38640 (+0.11%); split: -0.14%, +0.25% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12145>	2022-04-12 11:44:30 +00:00
Rhys Perry	773c7cbcbc	radv,aco: implement 64-bit inline push constants fossil-db (Sienna Cichlid): Totals from 21 (0.02% of 134621) affected shaders: CodeSize: 1932 -> 1560 (-19.25%) Instrs: 357 -> 303 (-15.13%) Latency: 6576 -> 5883 (-10.54%) InvThroughput: 26304 -> 23532 (-10.54%) SClause: 42 -> 24 (-42.86%) Copies: 90 -> 105 (+16.67%); split: -10.00%, +26.67% PreSGPRs: 144 -> 201 (+39.58%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12145>	2022-04-12 11:44:30 +00:00
Rhys Perry	7f6262bb85	radv: allow holes in inline push constants Use a dword mask instead of a range to track which push constants to inline. fossil-db (Sienna Cichlid): Totals from 5724 (4.25% of 134621) affected shaders: CodeSize: 20894044 -> 20815748 (-0.37%); split: -0.39%, +0.02% Instrs: 4002568 -> 3988385 (-0.35%); split: -0.38%, +0.02% Latency: 29285060 -> 29224414 (-0.21%); split: -0.22%, +0.01% InvThroughput: 5529700 -> 5526893 (-0.05%); split: -0.05%, +0.00% VClause: 78093 -> 78240 (+0.19%); split: -0.23%, +0.41% SClause: 135495 -> 131027 (-3.30%); split: -3.30%, +0.00% Copies: 330856 -> 324552 (-1.91%); split: -2.37%, +0.46% PreSGPRs: 226031 -> 224778 (-0.55%); split: -0.61%, +0.05% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12145>	2022-04-12 11:44:30 +00:00
Rhys Perry	72cf6cca91	radv: allow inline push constants in more situations We don't need to disable this path if there are indirect or 8/16/64-bit push constant loads. We can just use the default path for them. fossil-db (Sienna Cichlid): Totals from 21 (0.02% of 134621) affected shaders: CodeSize: 2028 -> 1884 (-7.10%) Instrs: 366 -> 363 (-0.82%); split: -2.46%, +1.64% Latency: 6630 -> 6579 (-0.77%) InvThroughput: 26520 -> 26316 (-0.77%) Copies: 84 -> 102 (+21.43%) PreSGPRs: 141 -> 222 (+57.45%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12145>	2022-04-12 11:44:30 +00:00
Mykhailo Skorokhodov	9c7e750ffe	intel/fs: Enable b2f(inot(a)) and b2i(inot(a)) optimization for Gfx12+ The commit enables the optimization for Intel Gfx12+ graphics. Tigerlake ``` total instructions in shared programs: 1289326 -> 1289015 (-0.02%) instructions in affected programs: 37841 -> 37530 (-0.82%) helped: 78 HURT: 9 helped stats (abs) min: 1 max: 26 x̄: 4.69 x̃: 3 helped stats (rel) min: 0.10% max: 12.50% x̄: 2.07% x̃: 1.21% HURT stats (abs) min: 1 max: 18 x̄: 6.11 x̃: 4 HURT stats (rel) min: 0.16% max: 1.95% x̄: 0.94% x̃: 0.61% 95% mean confidence interval for instructions value: -4.95 -2.20 95% mean confidence interval for instructions %-change: -2.34% -1.18% Instructions are helped. total cycles in shared programs: 105606388 -> 105606442 (<.01%) cycles in affected programs: 620119 -> 620173 (<.01%) helped: 49 HURT: 28 helped stats (abs) min: 2 max: 3618 x̄: 228.63 x̃: 12 helped stats (rel) min: 0.02% max: 23.31% x̄: 4.60% x̃: 1.11% HURT stats (abs) min: 1 max: 2142 x̄: 402.04 x̃: 29 HURT stats (rel) min: 0.01% max: 36.42% x̄: 5.01% x̃: 0.46% 95% mean confidence interval for cycles value: -151.80 153.20 95% mean confidence interval for cycles %-change: -3.00% 0.79% Inconclusive result (value mean confidence interval includes 0). ``` Related-to: `7725d60938` Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14017>	2022-04-12 10:55:05 +00:00
Gert Wollny	d1c7a7b131	virgl: Add an extra mov for int outputs from constant and immediate inputs virglrenderer doesn't properly emit the conversion code when the source is a integer value and the output is also integer. Fixes on NTT: dEQP-GLES31.functional.shaders.sample_variables.sample_mask.inverse_per_* v2: fix typo (Emma) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15836>	2022-04-12 10:44:17 +00:00
Gert Wollny	a083ae818a	virgl: Always make some extra temps available for transformations The host driver will optimize unused variables away, and checking thoroughly whether we may need an extra temp is just uselessly costly. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15836>	2022-04-12 10:44:17 +00:00
Gert Wollny	a4a34cd323	virgl: Propagate precice flag through moves NIR doesn't propagate precise through moves, and with NTT the last output is usually preceded by a move, so that we no longer see that the evaluation of some value is supposed to be exact, and, hence we can't decorate the outputs accordingly. Fixes with NTT: dEQP-GLES31.functional.tessellation.common_edge. triangles_equal_spacing_precise triangles_fractional_odd_spacing_precise triangles_fractional_even_spacing_precise quads_equal_spacing_precise quads_fractional_odd_spacing_precise quads_fractional_even_spacing_precise v2: Don't clear the precise flag when we hit a mov, because we may hit a if/else construct like below and we don't track branches IF X TEMP[0] = OP_PRECICE ... ELSE TEMP[0] = MOV CONST[] ENDIF Thanks Emma for pointing out the problem. v2: allocate precise handling flags to transform_prolog (Emma) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15836>	2022-04-12 10:44:17 +00:00
Juan A. Suarez Romero	0439f0e9fc	ci: add Broadcom CI maintainer Include in the CODEOWNERS file who to ping in case of issues with the Broadcom (V3D/V3DV/VC4) CI. v2: - Add Chema (Chema) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15858>	2022-04-12 10:42:31 +00:00
Juan A. Suarez Romero	18c4ad6e3b	CODEOWNERS: add Broadcom maintainers v2: - Add more maintainers (Iago) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15858>	2022-04-12 10:42:31 +00:00

1 2 3 4 5 ...

152338 commits