fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 21:38:18 +02:00

Author	SHA1	Message	Date
Tomeu Vizoso	eaecd0ffd6	etnaviv/ml: Adapt to changes in teflon regarding multiple inputs The Gallium API that Teflon uses now supports a variable number of inputs per operation. Adapt to this change without any change in functionality. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32105>	2024-11-15 16:41:05 +00:00
Tomeu Vizoso	986f8c7ff2	teflon: Support multiple graph inputs and outputs Operations other than tensor addition will also need to be able to handle multiple inputs, and a variable number of them. And for testing individual operations, we also need to support models with multiple inputs. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32105>	2024-11-15 16:41:04 +00:00
Tomeu Vizoso	f6c3544392	etnaviv/ml: Zero all BOs A few bugs due to uninitialized buffers have cropped up. For now let's zero them all and see if we want to do something else when we get concerned about compilation times. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32104>	2024-11-15 15:35:32 +00:00
Karol Herbst	a5149f3fef	rusticl/kernel: fix kernel variant selection Apparently I messed up enough so that the optimized kernel variant was almost never selected. This fixes that :) Fixes: `f098620c21` ("rusticl/kernel: add optimized Kernel variant") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32139>	2024-11-15 14:21:36 +00:00
Corentin Noël	a7c8677241	virgl: Simply loop over the resources to figure-out if it is already added There is not that many resources added to a command buffer to justify the resource id being cached. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32134>	2024-11-15 11:33:52 +00:00
Robert Mader	758941ab0c	v3d: Support SAND128 base modifier The BROADCOM_SAND128 modifier is usually used with an extra parameter to pass in the stride via a side channel. Quoting from drm_fourcc.h: > The pitch between the start of each column is set to optimally > switch between SDRAM banks. This is passed as the number of lines > of column width in the modifier (we can't use the stride value due > to various core checks that look at it , so you should set the > stride to width*cpp). So apparently this is just a workaround for limitations in some kernel APIs. DRM modifiers, however, are arguably a bad fit for extra parameters that aren't known in advance. In the Wayland/KMS ecosystem many components depend on being able to treat modifiers as opaque, e.g. for negotiations etc. In practice the current approach requires various software components to manually use the `DRM_FORMAT_MOD_BROADCOM_SAND128_COL_HEIGHT()` macro - using the `DRM_FORMAT_MOD_BROADCOM_SAND128` modifier directly with formats like `NV12` results in a rejection in the KMS driver and corrupted output in Mesa (because we'd bail out early in `v3d_sand8_blit()`). Fortunately the stride check limitations mentioned above don't seem to apply to Mesa though. Thus we can just add support for the base modifier and stride (coming from V4L2), allowing various toolkits, Wayland compositors and V4L2 decoder implementations to support e.g. `NV12` + `DRM_FORMAT_MOD_BROADCOM_SAND128` (`NC12` in V4L2) in a generic way. Notes: 1. Wayland compositors trying to offload composition to KMS will still fail when doing a test commit. 2. There is another limitation - in the V4L2 MPLANE API - that requires userspace to know the correct offset of the second plane. That's a known API limitation though and only affects V4L2 decoder implementations. Cc: mesa-stable Signed-off-by: Robert Mader <robert.mader@collabora.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32033>	2024-11-15 11:09:02 +00:00
Yinjie Yao	19c4b734f2	radeonsi/vcn: Fix compile warnings with previously uninitialized variables. Signed-off-by: Yinjie Yao <yinjie.yao@amd.com> Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32126>	2024-11-14 17:01:54 +00:00
Yinjie Yao	03462aff8f	radeonsi/vcn: Indentation fix Signed-off-by: Yinjie Yao <yinjie.yao@amd.com> Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32126>	2024-11-14 17:01:54 +00:00
David Heidelberg	d21f7f75ff	llvmpipe: align with u_cpu_detect struct changes Cc: mesa-stable # 24.3 Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31998>	2024-11-13 23:33:15 +00:00
David Heidelberg	a78c2bf2a4	util: Remove MMX/MMXext detection code Currently pointless, Pentium II or Celeron and later has SSE. Cc: mesa-stable # 24.3 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Signed-off-by: David Heidelberg <david@ixit.cz> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31998>	2024-11-13 23:33:15 +00:00
Rhys Perry	45c1280d2c	nir_lower_mem_access_bit_sizes: pass access to callback Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Rhys Perry	61752152f7	nir_lower_mem_access_bit_sizes: add nir_mem_access_shift_method Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Eric Engestrom	234b9c72f9	nvk/ci: document flakes seen recently Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32080>	2024-11-13 12:26:50 +00:00
Eric Engestrom	6018d15f32	radv/ci: document flakes seen recently Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32080>	2024-11-13 12:26:49 +00:00
Erik Faye-Lund	62da644221	panfrost: use mesa_log infra instead of stdio It's generally useful to use mesa_log for error messages etc. This makes it easier to forward diagnostics into the right logs etc. So let's be more consistent about where we're logging things. Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32094>	2024-11-13 09:15:05 +00:00
Tomeu Vizoso	936da3eb9c	etnaviv/ml: Zero out the NN config As some bits were being left unitialized and causing flakiness. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	459da82db6	etnaviv/ml: Make use of the new depthwise support in V8 The V8 hardware supports a faster way of executing depthwise convolutions, instead of having to fully lower them to regular convolutions. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	63a10f770c	etnaviv/ml: Only reshuffle when needed on V8 Because of how depthwise convolutions are implemented on V8, we sometimes don't need reshuffling the input with strided convolutions. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	93298a873b	etnaviv/ml: Fix reshuffle TP jobs on V8 What we had didn't work on V8, but with these fixes for V8, these jobs still run well on V7. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	f186844545	etnaviv/ml: Disable caching on V8 The assumptions we make on V7 doesnt work as-is on V8. Revisit this later. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	41a9540ab9	etnaviv/ml: Set two bits in the NN instruction for V8 Not sure why they have to be set, but they are always on V8. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	bb06e082f8	etnaviv/ml: Implement tiling for V8 Have had to tweak the code to stay safe on the i.MX8MP. Also, we are for now being very conservative with tiling to prevent underruns. In the future, we may want to consider testing different possibilities during compilation and choosing the optimal one. Also maybe detecting underruns by checking whether the NPU hung with a given combination. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	0ef5aa5fb6	etnaviv/ml: Fix padding for convolutions in V8 Two bits that aren't used in V7 seem to be used for this in V8. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	b4ba62fcda	etnaviv/ml: Add encoding of coefficients for V8 In V8 the weights and biases of convolution operations are encoded with a totally different scheme. The initial reverse engineering and implementation was done by: Philipp Zabel <p.zabel@pengutronix.de> Support for zero run length encoding and average bias is not implemented yet. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	f3d765ed5d	etnaviv/ml: Split V7 coefficient encoding to a new file In preparation for V8 support, which uses a completely different encoding. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	88b5b998d2	etnaviv/ml: Rework the dumping of tensors Name the file dumps after the operation and suboperation they belong to. Also dump the command stream for each operation. Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	f9bb9aa7d5	etnaviv/nn: Fix use of etna_core_info Right now we were retrieving the properties of the NPU from the etna_core_info of the GPU. Fixes: `92a6f697d5` ("etnaviv: npu: Switch to use etna_core_info") Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Tomeu Vizoso	70bff0c971	etnaviv/ml: Fix includes etnaviv_ml.h uses dynarray, but the u_inlines.h header is needed by some of the files that include it. Fixes: `d6473ce28e` ("etnaviv: Use NN cores to accelerate convolutions") Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842>	2024-11-13 07:39:35 +00:00
Peyton Lee	79b34a6539	frontends/va: add support for VAProcColorStandardExplicit for video post processing, add support for VAProcColorStandardExplicit Signed-off-by: Peyton Lee <peytolee@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32087>	2024-11-13 06:52:39 +00:00
Peyton Lee	a9e4461c26	frontends/va: add support for VAProcColorStandardExplicit for video post processing, add support for VAProcColorStandardExplicit Signed-off-by: Peyton Lee <peytolee@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32087>	2024-11-13 06:52:39 +00:00
Jose Maria Casanova Crespo	5b951bcdd7	v3d: Enable Early-Z with discards when depth updates are disabled The Early-Z optimization is disabled when there is a discard instruction in the shader used in the draw call. But if discard is the only reason to disable Early-Z, and at draw call time the updates in the draw call are disabled we can enable Early-Z using a shader variant. If there are occlussion queries active we also need to disable Early-z optimization. So this patch enables Early-Z in this scenario. The performance improvement is significant when running gfxbench benchmark showing an average improvement of 11.15% fps_avg helped: gl_gfxbench_aztec_high.trace: 3.13 -> 3.73 (19.13%) fps_avg helped: gl_gfxbench_aztec.trace: 4.82 -> 5.68 (17.88%) fps_avg helped: gl_gfxbench_manhattan31.trace: 5.10 -> 6.00 (17.59%) fps_avg helped: gl_gfxbench_manhattan.trace: 7.24 -> 8.36 (15.52%) fps_avg helped: gl_gfxbench_trex.trace: 19.25 -> 20.17 ( 4.81%) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32028>	2024-11-12 13:26:38 +00:00
Alyssa Rosenzweig	7e57e0aa7d	asahi: factor out more compiled shader Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:02 +00:00
Alyssa Rosenzweig	f36ea1818b	asahi: drop dead param Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:02 +00:00
Alyssa Rosenzweig	e7f100013f	asahi: don't take compiled_shader in agx_build_internal_usc Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:02 +00:00
Alyssa Rosenzweig	8d73a3ae40	asahi: assert/cse resource valid Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:02 +00:00
Alyssa Rosenzweig	b9429930b9	asahi: correct core count, max freq fixes clinfo. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:02 +00:00
Alyssa Rosenzweig	2963cd900f	libagx: don't key unroll to index size Probably a premature optimization, it's annoying for precomp and for DGC. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:02 +00:00
Mary Guillemard	1a621a6967	agx: Add support for EGL_NV_context_priority_realtime Signed-off-by: Mary Guillemard <mary@mary.zone> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:02 +00:00
Asahi Lina	85c5a25ec3	asahi: In-place decompress shared resources for feedback loops Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:01 +00:00
Asahi Lina	f04387a415	asahi: Introduce batch->feedback to disable compression in PBE Used for RTs that have feedback with in-place decompression. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:01 +00:00
Asahi Lina	9288a3a583	asahi: Extract agx_decompress_inplace() Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:01 +00:00
Asahi Lina	f28a1b3fcf	asahi: Add PIPE_BIND_SHARED to imported resources Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:01 +00:00
Asahi Lina	59501af723	asahi: Add pipe bind flags to resource debug Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32081>	2024-11-11 14:33:01 +00:00
Konstantin Seurer	69ebba82d4	aco: Pass debug information to the driver Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29298>	2024-11-11 08:39:13 +00:00
Martin Roukala (né Peres)	dc1fe83aa5	zink/ci: document new-ish vangogh flakes Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32071>	2024-11-10 07:21:41 +02:00
Marek Olšák	1299f5c50a	gallium/radeon: import libdrm_radeon source code, drop the dependency Only radeon_surface.h/c is used from libdrm and radeon_drm.h is imported too. This code doesn't change anymore. We don't need the dependency. Acked-by: Pavel Ondračka <pavel.ondracka@gmail.com> Acked-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31827>	2024-11-10 00:52:18 +00:00
Alyssa Rosenzweig	0a81434adf	agx: rewrite address mode lowering AGX load/stores supports a single family of addressing modes: 64-bit base + sign/zero-extend(32-bit) << (format shift + optional shift) This is a base-index addressing mode, where the index is minimally in elements (never bytes, unless we're doing 8-bit load/stores). Both base and the resulting address must be aligned to the format size; the mandatory shift means that alignment of base is equivalent to alignment of the final address, which is taken care of by lower_mem_access_bit_size anyhow. The other key thing to note is that this is a 64-bit shift, after the sign- or zero-extension of the 32-bit index. That means that AGX does NOT implement 64-bit base + sign/zero-extend(32-bit << shift) This has sweeping implications. For addressing math from C-based languages (including OpenCL C), the AGX mode is more helpful, since we tend to get 64-bit shifts instead of 32-bit shifts. However, for addressing math coming from GLSL, the AGX mode is rather annoying since we know UBOs/SSBOs are at most 4GB so nir_lower_io & friends are all 32-bit byte indexing. It's tricky to teach them to do otherwise, and would not be optimal either since 64-bit adds&shifts are usually much more expensive than 32-bit on AGX except for when fused into the load/store. So we don't want 32-bit NIR, since then we can't use the hardware addressing mode at all. We also don't want 64-bit NIR, since then we have excessive 64-bit math resulting from deep deref chains from complex struct/array cases. Instead, we want a middle ground: 32-bit operations that are guaranteed not to overflow 32-bit and can therefore be losslessly promoted to 64-bit. We can make that no-overflow guarantee as a consequence of the maximum UBO/SSBO size, and indeed Mesa relies on this already all over the place. So, in this series, we use relaxed amul opcodes for addressing math. Then, we rewrite our address mode pattern matching to fuse AGX address modes. The actual pattern matching is rewritten. The old code was brittle handwritten nir_scalar chasing, based on a faulty model of the hardware (with the 32-bit shift). We delete it all, it's broken. In the new approach, we add some NIR pseudo-opcodes for address math (ulea_agx/ilea_agx) which we pattern match with NIR algebraic rules. Then the chasing required to fuse LEA's into load/stores is trivial because we never go deeper than 1 level. After fusing, we then lower the leftover lea/amul opcodes and let regular nir_opt_algebraic take it from here. We do need to be very careful around pass order to make sure things like load/store vectorization still happen. Some passes are shuffled in this commit to make this work. We also need to cleanup amul before fusing since we specifically do not have nir_opt_algebraic do so - the entire point of the pseudo-opcodes is to make nir_opt_algebraic ignore the opcodes until we've had a chance to fuse. If we simply used the .nuw bit on iadd/imul, nir_opt_algebraic would "optimize" things and lose the bit and then we would fail to fuse addressing modes, which is a much more expensive failure case than anything nir_opt_algebraic can do for us. I don't know what the "optimal" pass order for AGX would look like at this point, but what we have here is good enough for now and is a net positive for shader-db. That all ends up being much less code and much simpler code, while fixing the soundness holes in the old code, and also optimizing a significantly richer set of addressing calculations. Now we don't juts optimize GL/VK modes, but also CL. This is crucial even for GL/VK performance, since we rely on CL via libagx even in graphics shaders. Terraintessellation is up 10% to ~310fps, which is quite nice. The following stats are for the end of the series together, including this change + libagx change + the NIR changes building up to this... but not including the SSBO vectorizer stats or the IC modelling fix. In other words, these are the stats for "rewriting address mode handling". This is on OpenGL, and since the old code was targeted at GL, anything that's not a loss is good enough - we need this for the soundness fix regardless. total instructions in shared programs: 2751356 -> 2750518 (-0.03%) instructions in affected programs: 372143 -> 371305 (-0.23%) helped: 715 HURT: 75 Instructions are helped. total alu in shared programs: 2279559 -> 2278721 (-0.04%) alu in affected programs: 304170 -> 303332 (-0.28%) helped: 715 HURT: 75 Alu are helped. total fscib in shared programs: 2277843 -> 2277008 (-0.04%) fscib in affected programs: 304167 -> 303332 (-0.27%) helped: 715 HURT: 75 Fscib are helped. total ic in shared programs: 632686 -> 621886 (-1.71%) ic in affected programs: 113078 -> 102278 (-9.55%) helped: 1159 HURT: 82 Ic are helped. total bytes in shared programs: 21489034 -> 21477530 (-0.05%) bytes in affected programs: 3018456 -> 3006952 (-0.38%) helped: 751 HURT: 107 Bytes are helped. total regs in shared programs: 865148 -> 865114 (<.01%) regs in affected programs: 1603 -> 1569 (-2.12%) helped: 10 HURT: 9 Inconclusive result (value mean confidence interval includes 0). total uniforms in shared programs: 2120735 -> 2120792 (<.01%) uniforms in affected programs: 22752 -> 22809 (0.25%) helped: 76 HURT: 49 Inconclusive result (value mean confidence interval includes 0). total threads in shared programs: 27613312 -> 27613504 (<.01%) threads in affected programs: 1536 -> 1728 (12.50%) helped: 3 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31964>	2024-11-08 21:15:42 -04:00
Alyssa Rosenzweig	3c222da6c0	agx: vectorize SSBOs this was missed due to the lowering, and mitigates a lot of stats weirdness with the address mode rework. total instructions in shared programs: 2755170 -> 2751399 (-0.14%) instructions in affected programs: 16323 -> 12552 (-23.10%) helped: 71 HURT: 0 helped stats (abs) min: 10 max: 178 x̄: 53.11 x̃: 42 helped stats (rel) min: 2.04% max: 50.00% x̄: 34.73% x̃: 40.79% 95% mean confidence interval for instructions value: -60.94 -45.28 95% mean confidence interval for instructions %-change: -37.81% -31.65% Instructions are helped. total alu in shared programs: 2169888 -> 2168281 (-0.07%) alu in affected programs: 9547 -> 7940 (-16.83%) helped: 71 HURT: 0 helped stats (abs) min: 5 max: 90 x̄: 22.63 x̃: 16 helped stats (rel) min: 1.02% max: 43.33% x̄: 25.39% x̃: 29.41% 95% mean confidence interval for alu value: -26.33 -18.93 95% mean confidence interval for alu %-change: -27.91% -22.87% Alu are helped. total fscib in shared programs: 2165597 -> 2163990 (-0.07%) fscib in affected programs: 9547 -> 7940 (-16.83%) helped: 71 HURT: 0 helped stats (abs) min: 5 max: 90 x̄: 22.63 x̃: 16 helped stats (rel) min: 1.02% max: 43.33% x̄: 25.39% x̃: 29.41% 95% mean confidence interval for fscib value: -26.33 -18.93 95% mean confidence interval for fscib %-change: -27.91% -22.87% Fscib are helped. total bytes in shared programs: 21517750 -> 21489352 (-0.13%) bytes in affected programs: 126270 -> 97872 (-22.49%) helped: 71 HURT: 0 helped stats (abs) min: 80 max: 1084 x̄: 399.97 x̃: 324 helped stats (rel) min: 1.77% max: 50.57% x̄: 35.07% x̃: 42.31% 95% mean confidence interval for bytes value: -455.66 -344.28 95% mean confidence interval for bytes %-change: -38.34% -31.79% Bytes are helped. total regs in shared programs: 864490 -> 865162 (0.08%) regs in affected programs: 4567 -> 5239 (14.71%) helped: 4 HURT: 61 helped stats (abs) min: 6 max: 6 x̄: 6.00 x̃: 6 helped stats (rel) min: 4.51% max: 5.13% x̄: 4.82% x̃: 4.82% HURT stats (abs) min: 2 max: 24 x̄: 11.41 x̃: 12 HURT stats (rel) min: 1.98% max: 82.35% x̄: 21.05% x̃: 16.00% 95% mean confidence interval for regs value: 8.52 12.16 95% mean confidence interval for regs %-change: 14.91% 24.00% Regs are HURT. total threads in shared programs: 27613056 -> 27613312 (<.01%) threads in affected programs: 3200 -> 3456 (8.00%) helped: 4 HURT: 0 helped stats (abs) min: 64 max: 64 x̄: 64.00 x̃: 64 helped stats (rel) min: 7.69% max: 8.33% x̄: 8.01% x̃: 8.01% 95% mean confidence interval for threads value: 64.00 64.00 95% mean confidence interval for threads %-change: 7.42% 8.60% Threads are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31964>	2024-11-08 21:15:42 -04:00
Alyssa Rosenzweig	b593a6aa98	rusticl: respect late_lower_int64 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31964>	2024-11-08 21:15:42 -04:00
Eric Engestrom	7e0e433482	radv+zink/ci: add flakes seen recently Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32066>	2024-11-08 22:49:21 +00:00

1 2 3 4 5 ...

67311 commits