fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 09:08:07 +02:00

Author	SHA1	Message	Date
Juston Li	34031e3e3b	anv/android: remove unneeded ANB implicit import flags ANB is only used by Android WSI which uses explicit sync so these flags can be dropped. Signed-off-by: Juston Li <justonli@google.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29883>	2024-07-30 09:27:28 -07:00
Jianxun Zhang	c5ee7e9bdc	anv: Disable legacy CCS setup in binding (xe2) The condition of flat ccs and vram_only checker causes different aux usage at binding stage. The current design is reusing CCS_E on Xe2, so we want both Xe2 integrated and discreted GPUs behave the same way. Xe2 shouldn't need any special setup of CCS in the loop. Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30111>	2024-07-29 01:42:27 +00:00
Jianxun Zhang	e054068787	anv: Disable compression on legacy modifiers (xe2) On pre-Xe2 platforms, the compression on these modifiers that don't support compression are enabled. The compressed will be resolved when needed. On Xe2+ we haven't support explicit resolve, so all the paths to resolves are prohibited now. But the code is still doing it, causing an assertion failure: Fixes: vkcube src/intel/vulkan/anv_private.h:5467: anv_image_get_fast_clear_type_addr: Assertion `device->info->ver < 20' failed. Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30111>	2024-07-29 01:42:27 +00:00
Jianxun Zhang	49c91a4ea0	anv: Fix assertion failures on BMG (xe2) Fixes: `beb0ea2469` ("anv: Disable tracking fast clear and aux state (xe2)") crucible run func.first dEQP-VK.api.copy_and_blit.core.image_to_image. all_formats.color.2d_to_2d.a1r5g5b5_unorm_pack16. r16_uint.optimal_optimal dEQP-VK.pipeline.monolithic.multisample.misc.clear_attachments. r8g8b8a8_unorm_r16g16b16a16_sfloat_r16g16b16a16_sint_d32_sfloat_ s8_uint.16x.ds_resolve_sample_zero.whole_framebuffer src/intel/vulkan/anv_private.h:5491: anv_image_get_compression_state_addr: Assertion `device->info->ver < 20' failed. Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30111>	2024-07-29 01:42:26 +00:00
Ian Romanick	fdb6afe71e	intel/elk: Fix undefined left shift of negative value in elk_texture_offset Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:18:08 -07:00
Ian Romanick	f3f4a057b9	intel/elk: Fix undefined left shift of large UW value in elk_imm_uw Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:18:06 -07:00
Ian Romanick	0e5ac7d6b0	intel/elk: Fix undefined left shift of negative value in update_uip_jip v2: Add comment and assertion to explain why the shift is safe. Suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:18:04 -07:00
Ian Romanick	c2dda8c8e7	intel/elk: Fix undefined shift by 64 of uint64_t in elk_compute_first_urb_slot_required Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:18:01 -07:00
Ian Romanick	e6669467b8	intel/brw: Fix undefined left shift of negative value in brw_texture_offset When -fsanitize=shift is used, many instances of the following are produced: src/intel/compiler/brw_fs_nir.cpp:114:30: runtime error: left shift of negative value -1 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:17:59 -07:00
Ian Romanick	4f24c2707f	intel/brw: Fix undefined left shift of large UW value in brw_imm_uw When -fsanitize=shift is used, 'ninja test' would fail in several Intel assembly tests (mul.asm and and.asm) with: src/intel/compiler/brw_reg.h:703:22: runtime error: left shift of 65532 by 16 places cannot be represented in type 'int' Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:17:56 -07:00
Ian Romanick	abb7c012ff	intel/brw: Fix undefined left shift of negative value in update_uip_jip When -fsanitize=shift is used, many instances of the following are produced: src/intel/compiler/brw_eu_compact.c:2244:50: runtime error: left shift of negative value -306 v2: Add comment and assertion to explain why the shift is safe. Suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:17:53 -07:00
Ian Romanick	228e049db6	intel/brw: Fix undefined shift by 64 of uint64_t in brw_compute_first_urb_slot_required When -fsanitize=shift is used, many instances of the following are produced: src/intel/compiler/brw_compiler.h:1661:44: runtime error: shift exponent 64 is too large for 64-bit type 'long long unsigned int' I think this is an actual bug. It should check the sentinel value, but the sentinel value is 64. The shift by 64 is treated as a shift by 0. The varying 0 is explicitly filtered by the rest of the if-test. How does this work? Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30333>	2024-07-26 17:17:15 -07:00
Sushma Venkatesh Reddy	455deacbce	intel/brw: Fix DEBUG_OPTIMIZER Due to recent regression, adding INTEL_DEBUG=optimizer is dumping shader optimization pass details to console rather than to respective files. Thank you, Kenneth W Graunke for helping me figure this out. Fixes: `17b7e49089` ("intel/brw: Move out of fs_visitor and rename print instructions") Signed-off-by: Sushma Venkatesh Reddy <sushma.venkatesh.reddy@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30389>	2024-07-26 22:22:58 +00:00
José Roberto de Souza	eb5a3617e2	anv: Handle internal shader compilation failure Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30380>	2024-07-26 21:58:21 +00:00
José Roberto de Souza	196b3d7b5b	anv: Improve error message when pipeline creation fails during shader compilation Due the lack of SIMD8 in Xe2 platforms we are not able to compile a shader for dEQP-VK.protected_memory.stack.stacksize_1024 that fits into scratch space. So before this patch when such failure happened it would return VK_ERROR_OUT_OF_HOST_MEMORY error. So here when available include the compiler error string to better inform what the actual failure. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30380>	2024-07-26 21:58:21 +00:00
Jianxun Zhang	349e7a2919	intel/common: Remove blank lines in intel_set_ps_dispatch_state() (xe2) Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29907>	2024-07-26 21:02:24 +00:00
Jianxun Zhang	cb7f816fc4	intel/common: Ensure SIMD16 for fast-clear kernel (xe2) Add a restriction on SIMD mode for fast-clear pixel shader according to the Bspec. Backport-to: 24.2 Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29907>	2024-07-26 21:02:24 +00:00
José Roberto de Souza	5fdacb56ed	anv: Propagate protected information to blorp_batch_isl_copy_usage() This fixes protected tests that uses vkCmdCopyBuffer(). Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30369>	2024-07-26 20:36:32 +00:00
José Roberto de Souza	79f95a3711	isl: Fix Xe2 protected mask BSpec 71045 and 57023 still points that protected/encrypted bit is still bit 0, bit 1 should not be set or undesired MOCS index could be set. Fixes: `7be8bc2c97` ("isl: Add mocs for xe2") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30369>	2024-07-26 20:36:32 +00:00
Lionel Landwerlin	d5b0526507	anv: propagate protected information for blorp operations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29982>	2024-07-26 18:15:43 +00:00
Lionel Landwerlin	8d9cc6aa23	anv: properly flag image/imageviews for ISL protection Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29982>	2024-07-26 18:15:43 +00:00
Lionel Landwerlin	4eab285d4a	isl: account for protection in base usage checks Only Cc stable because it's needed for the next patches. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29982>	2024-07-26 18:15:43 +00:00
Caio Oliveira	23b0798551	intel/brw: Move interp_reg and per_primitive_reg out of fs_visitor Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	a5cc8c4807	intel/brw: Move VARYING_PULL_CONSTANT_LOAD from fs_visitor to fs_builder Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	8a39231e4f	intel/brw: Move calculate_cfg out of fs_visitor Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	b98930c770	intel/brw: Move regalloc and scheduling functions out of fs_visitor Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	5cb1f46fd1	intel/brw: Remove workgroup_size() helper from fs_visitor Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	17b7e49089	intel/brw: Move out of fs_visitor and rename print instructions They use the brw_print prefix now. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	bb7f2db5a2	intel/brw: Move printing functions to its own file Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	cdbee4156e	intel/brw: Reduce scope of some MESH specific functions Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	67ead4edff	intel/brw: Reduce scope of some TES specific functions Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	f9ddf51b70	intel/brw: Reduce scope of some TCS specific functions Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	47b9dc9070	intel/brw: Reduce scope of some GS specific functions Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	28858b3ad1	intel/brw: Reduce scope of some FS specific functions Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	a8b4b9dd51	intel/brw: Reduce scope of some VS specific functions Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	fdb029fe1b	intel/brw: Move and reduce scope of run_*() functions Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Caio Oliveira	c92b8a802e	intel/brw: Move remaining compile stages to their own files Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30169>	2024-07-25 15:37:13 +00:00
Matt Turner	a3714b55f4	intel/elk: Use REG_CLASS_COUNT Fixes: `d44462c08d` ("intel/elk: Fork Gfx8- compiler by copying existing code") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30314>	2024-07-25 14:55:09 +00:00
Matt Turner	5e24c21625	intel/brw: Use REG_CLASS_COUNT Fixes: `5d87f41a54` ("intel/fs/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30314>	2024-07-25 14:55:09 +00:00
Paulo Zanoni	dd5362c78a	anv/xe: try harder when the vm_bind ioctl fails From all the many possible errors returned by the vm_bind ioctl, some can actually happen in the wild when the system is under memory pressure. Thomas Hellström pointed to us that, due to its asynchronous nature, the vm_bind ioctl itself has to pin some memory, so if the number of bind operations passed is too big, there is a probability that it may run out of memory. Previously the Kernel would return ENOMEM when this condition happened. Since commit e8babb280b5e ("drm/xe: Convert multiple bind ops into single job") the Kernel has started returning ENOBUFS when it doesn't have enough memory to do what it wants but thinks we'd succeed if we tried to do one bind operation at a time (instead of doing multiple operations in the same ioctl), and ENOMEM in some other situations. Still-uncommitted commit "drm/xe: Return -ENOBUFS if a kmalloc fails which is tied to an array of binds" proposes converting a few more ENOMEM cases no ENOBUFS. Still, even ENOMEM situations could in theory be possible to recover from, because if we wait some amount of time, resources that may have been consuming memory could end up being freed by other threads or processes, allowing the operations to succeed. So our main idea in this patch is that we treat both ENOMEM and ENOBUFS in the same way, so our implementation can work with any xe.ko driver regardless of having or not having the commits mentioned above. So in this patch, when we detect the system is under memory pressure (i.e., the vm_bind() function returns VK_ERROR_OUT_OF_HOST_MEMORY), we throw away our performance expectations and try to go slowly and steady. First we wait everything we're supposed to wait (hoping that this alone could also help to alleviate the memory pressure), and then we synchronously bind one piece at a time (as this will ensure ENOBUFS can't be returned), hoping that this won't cause the Kernel to try to reserve too much memory. All this while also hoping that whatever thing that may be eating all the memory goes away in the meantime. If even this fails, we give up and hope the upper layer will be able to figure out what to do. This fixes a bunch of LNL failures and flaky tests (as LNL is our first officially supported xe.ko platform). This can be seen in dEQP but only if multiple tests are being run parallel. Happens in multiple tests, some of which may include: - dEQP-VK.sparse_resources.image_sparse_binding.2d_array.rgba8_snorm.1024_128_8 - dEQP-VK.sparse_resources.image_sparse_binding.3d.rgba16_snorm.1024_128_8 - dEQP-VK.sparse_resources.image_sparse_binding.3d.rgba16ui.512_256_6 I don't ever see these errors when running Alchemist/DG2 with xe.ko. Fixes: `e9f63df2f2` ("intel/dev: Enable LNL PCI IDs without INTEL_FORCE_PROBE") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30276>	2024-07-24 23:18:36 +00:00
Matt Turner	aae82061af	intel/clc: Free disk_cache Fixes: `c15bf88f01` ("intel: Add a little OpenCL C compiler binary") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30313>	2024-07-24 20:46:28 +00:00
Matt Turner	1574372de4	intel/clc: Free parsed_spirv_data This declaration shadowed a variable by the same type and name in an outer scope. That variable is passed to clc_free_parsed_spirv(). Fixes: `4fd7495c69` ("intel/clc: add ability to output NIR") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30313>	2024-07-24 20:46:28 +00:00
José Roberto de Souza	945564e498	anv: Wait for Xe exec queue to be idle before destroying it Paulo reported that Vulkan is also affected by the drop of permanent exec queues in Xe KMD, Iris already have this handling. So here using the special DRM_IOCTL_XE_EXEC with num_batch_buffer == 0 to get a syncobj signaled when the last DRM_IOCTL_XE_EXEC is completed. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30156>	2024-07-24 16:27:30 +00:00
Yiwei Zhang	6d0273f67a	anv: improve vma usage for descriptor buffer The dynamic visible memory type (or the prior descriptor buffer memory type) doesn't need special aux-tt alignment or additional ccs space. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30317>	2024-07-23 17:56:20 +00:00
Marek Olšák	b2d32ae246	nir: add nir_intrinsic_load_per_primitive_input, split from io_semantics flag Instead of having 1 bit in nir_io_semantics indicating a per-primitive FS input, add a dedicated intrinsic for it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29895>	2024-07-23 16:13:16 +00:00
Kenneth Graunke	c429d5025e	intel/brw: Don't force g1's live range to be the entire program The idea here was that pixel shader framebuffer writes used the g0 and g1 thread payload register values to construct the message header. However, most messages are headerless and don't use either. There's a 2012-era comment that the simulator at one point had a bug where certain headerless messages would incorrectly take the values from the g0/g1 register contents rather than using sideband. But, that was likely fixed eons ago. So we really don't need to do this. Furthermore, there are many more shader stages these days: - VS: r1 contains output URB handles - TCS: r1 contains ICP handles - TES: r1 contains gl_TessCoord.x (r4 contains output URB handles) - GS: r1 contains output URB handles - CS: r1 contains LocalID.X on DG2+ but nothing on older hardware - Task/Mesh: r1 contains LocalID.X - BS: r1 contains bindless stack handles Vertex and geometry aren't likely to benefit here because r1 is needed for their output messages, which are also what terminate the shader. TES will definitely benefit because we were making a value pointlessly live for the whole program. Same for TCS, to a lesser extent. Compute prior to DG2 was the worst, as g1 literally has no meaningful content, so there is no point to keeping it live. fossil-db on Alchemist shows substantial spill/fill improvements: Totals: Instrs: 148782351 -> 148741996 (-0.03%); split: -0.03%, +0.01% Cycles: 12602907531 -> 12605795191 (+0.02%); split: -0.70%, +0.72% Subgroup size: 7518608 -> 7518632 (+0.00%) Send messages: 7341727 -> 7341762 (+0.00%) Spill count: 54633 -> 52575 (-3.77%) Fill count: 104694 -> 100680 (-3.83%) Scratch Memory Size: 3375104 -> 3287040 (-2.61%) Totals from 301172 (48.21% of 624670) affected shaders: Instrs: 95531927 -> 95491572 (-0.04%); split: -0.05%, +0.01% Cycles: 9643531593 -> 9646419253 (+0.03%); split: -0.91%, +0.94% Subgroup size: 4492512 -> 4492536 (+0.00%) Send messages: 4399737 -> 4399772 (+0.00%) Spill count: 20034 -> 17976 (-10.27%) Fill count: 41530 -> 37516 (-9.67%) Scratch Memory Size: 1522688 -> 1434624 (-5.78%) Assassin's Creed Odyssey in particular has 20% fewer fills. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30146>	2024-07-23 02:26:52 +00:00
Michael Cheng	60c73e09c6	anv: Remove extra hdc_flush from Perfetto Remove extra reporting of hdc_flush when viewing a Perfetto trace for anv. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30312>	2024-07-23 01:57:59 +00:00
Caio Oliveira	8ba8e33c39	intel/brw: Simplify @file annotations Doxygen documentation says > If the file name is omitted (i.e. the line after \file is left > blank) then the documentation block that contains the \file command will > belong to the file it is located in. so we can omit the filename itself when using the annotation. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30168>	2024-07-22 22:48:03 +00:00
Lionel Landwerlin	1908d2c171	anv: split image view from anv_image.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00
Lionel Landwerlin	eff01c46d8	anv: split buffer view from anv_image.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30285>	2024-07-22 18:46:05 +00:00

1 2 3 4 5 ...

12429 commits