fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-15 22:58:05 +02:00

Author	SHA1	Message	Date
Paulo Zanoni	b94d7dbe66	anv/sparse: join multiple NULL binds when possible When it's a NULL bind we always set the bo_offset (aka memory offset) to zero, so we have to avoid the "bind.offset == prev.offset + size" check. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26036>	2023-11-04 02:06:52 +00:00
Paulo Zanoni	2fc0bbe814	anv/sparse: join multiple bind operations when possible If the next bind is just an extension of the previous one, join both in the same bind operation. Due to how mip levels are laid in memory, this can only happen for mip level 0. As of today xe.ko doesn't try to join contiguous operations for us. Due to how rebinds happen each additional rebind operation may end up resulting in many extra things done, so these simple checks end up saving us a lot of cycles the Kernel would otherwise waste. This will be true even after we issue all binds in a single ioctl. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26036>	2023-11-04 02:06:52 +00:00
Paulo Zanoni	2883c6ddaa	anv: alloc client visible addresses at the bottom of vma_hi Kill vma_cva and just toggle heap->alloc_high instead. This way, client visible addresses will remain isolated in their own little corner, except we have one less vma to deal with. For TR-TT we'll need a special vma, and if we don't use the trick above we'll need yet another trtt_cva_vma, increasing complexity even more. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26036>	2023-11-04 02:06:52 +00:00
Paulo Zanoni	e1b50074fe	anv: don't forget to destroy device->vma_mutex This actually doesn't fix any bugs or leaks, because according to the man page: "In the LinuxThreads implementation, no resources are associated with mutex objects, thus pthread_mutex_destroy actually does nothing except checking that the mutex is unlocked. still, it's better to have it than not to have it, especially since other implementations may do something. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26036>	2023-11-04 02:06:52 +00:00
Jesse Natalie	228329f4da	vulkan: Consolidate common ICD methods Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25998>	2023-11-03 20:01:14 +00:00
Jesse Natalie	32f0034ec9	vulkan: Remove no-longer-needed prototypes for ICD entrypoints The comment around these is no longer true, vk_icd.h does in fact have prototypes for these functions. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25998>	2023-11-03 20:01:14 +00:00
Mark Janes	a1e6879021	anv: make shader cache content deterministic Pointer values in shader cache data generate binary differences for functionally identical shader content. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25923>	2023-11-02 02:53:41 +00:00
Felix DeGrood	aa23120e4f	anv: remove CS_FLUSH from query regression Fixes performance regression introduced by prior refactoring of pipe control code that unnecessarily added CS_FLUSH to query start and end. Issue was diagnosed by Ben L (thank you!) Confirmed this restores performance on: * Borderlands3 +2% * Payday +3% * Factorio +3% * HogwartsLegacy +4% * Ghostrunner +7% Fixes: `6dc95685` (convert genX_query pipe controls to use pc helper) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25983>	2023-11-02 02:28:02 +00:00
Lionel Landwerlin	cdca0b2ce4	anv: fix corner case of mutable descriptor pool creation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `63e91148b7` ("anv: Enable VK_VALVE_mutable_descriptor_type") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10065 Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25958>	2023-10-30 18:29:46 +00:00
Lionel Landwerlin	e64a97694a	anv: use anv_state_pool_state_address for blorp vertex buffer address Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25955>	2023-10-30 14:47:18 +00:00
Lionel Landwerlin	8d813a90d6	anv: fail pool allocation when over the maximal size Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25955>	2023-10-30 14:47:18 +00:00
Lionel Landwerlin	8fc42d83be	anv: make sure pools can handle more than 2Gb Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25955>	2023-10-30 14:47:18 +00:00
Lionel Landwerlin	cc67bd48d9	anv: add max_size argument for block & state pools Not enforced yet. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25955>	2023-10-30 14:47:18 +00:00
Lionel Landwerlin	b30428416a	anv: deal with state stream allocation failures In case we run out of space, all the parts of the driver that rely on this should deal with failure. The helpers will set the batch in error state so that it cannot be submitted by the application. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25955>	2023-10-30 14:47:18 +00:00
Lionel Landwerlin	ed83d1415c	anv: rename internal heaps Some of the names are a bit confusing. The main change is to introduce the "indirect_" prefix. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25955>	2023-10-30 14:47:18 +00:00
Lionel Landwerlin	f9753488ec	blorp: handle binding table & surface state allocation failures The embedding driver could be failing the allocation for whatever reason, in which case we should skip the surface state writes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25955>	2023-10-30 14:47:18 +00:00
Jordan Justen	9bd47aabaf	anv: Add more space for init_render_queue_state() batch (MTL regression) It may be some MTL specific code paths, but `7cdacaf493` is triggering anvil to run out of space when initializing the render batch. Fixes: `7cdacaf493` ("intel/xehp: Adjust TBIMR performance chicken bits.") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25949>	2023-10-30 10:05:10 +00:00
Francisco Jerez	57decad976	intel/xehp: Enable TBIMR by default. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:50:42 -07:00
Francisco Jerez	ed9886321c	intel/xehp+: Use TBIMR tile box check in order to avoid performance regressions. This allows the hardware to behave as if TBIMR was disabled until a polygon is processed which spans at least one tile. This is a rather heavy-handed heuristic meant to prevent regressions in heavily geometry-bound workloads that render large numbers of tiny primitives much smaller than a TBIMR tile. A particularly bad example of this was observed in SoTR, where certain draw calls with a long-running VS and a mostly trivial PS render more triangles than pixels, filling up the URB and TBIMR batch pretty quickly, which causes EU utilization to tank (since once the URB has filled up the parallelism of the VS is limited by the number of polygons that fit in a TBIMR batch at the completion of each tile walk, which isn't a lot in relation to the total EU count of a DG2), and causes the bottleneck to be the rate at which the tile sequencer performs additional tile passes, each one processing a small number (<1024 polygons) of the hundreds of thousands of triangles of the draw call. Enabling this heuristic seems effective at avoiding that scenario in SoTR among other titles (e.g. Total War Warhammer 3), but it's a bit of a compromise since one could imagine cases where TBIMR is helpful even if the geometry doesn't pass the box check, so a better heuristic or a driconf rule may be useful in the future. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:50:42 -07:00
Francisco Jerez	f0d24b155b	intel/xehp+: Adjust TBIMR batch size based on slice count. This programs a TBIMR batch size equal to 128 polygons per slice in order to match the hardware spec recommendation (BSpec 68436). This has been confirmed to improve performance slightly relative to the hardware default batch size of 256 polygons. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:50:42 -07:00
Francisco Jerez	7cdacaf493	intel/xehp: Adjust TBIMR performance chicken bits. This enables a couple of TBIMR performance tunables in CHICKEN_RASTER_2 that default to disabled. TBIMR fast clip appears to help slightly with some geometry-bound workloads. TBIMR open batch allows the rasterizer to start working immediately on the first tile of the framebuffer, even before the batch has been closed, which helps reduce the latency cost of the tile walk. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:50:42 -07:00
Francisco Jerez	08fd259b5b	anv/xehp+: Enable TBIMR in generated draw calls. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:50:42 -07:00
Francisco Jerez	65bbe58b25	anv/xehp: Implement TBIMR tile pass setup and pipeline bandwidth estimation. This sets up the basic parameters needed for tiled rendering based on a back-of-the-envelope estimate of the amount of memory used by the pixel pipeline during the tile pass. The actual cache footprint of a tile can vary wildly based on runtime factors which aren't easily predictable based on static analysis, so this is only intended to provide a rough approximation within the right order of magnitude. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:50:42 -07:00
Francisco Jerez	694d64188b	intel/xehp+: Define driconf option for selectively disabling TBIMR. This may help debugging performance problems in the possible case that TBIMR negatively impacts the performance of some application. It could also allow applying application-specific band-aid fixes in the XML file until a more general workaround is implemented. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:29 -07:00
Francisco Jerez	da28582eec	intel/xehp+: Add dynamic state flags controlling whether TBIMR is enabled during 3D primitives. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:29 -07:00
Francisco Jerez	6b9583734b	intel/l3: Set up L3FullWayAllocationEnable config if ALL partition has over 126 ways. L3 configurations with an ALL partition of 128 ways per bank or more cannot be represented with the normal L3ALLOC partitioning mechanism since the "All L3 client pool" field would overflow, instead the L3FullWayAllocationEnable bit has to be set, which causes the whole L3 to be used in a unified cache configuration. That's precisely the configuration we're currently using on recent platforms, but previously we were relying on the L3 config tables being empty and the selected L3 configuration being a NULL pointer to detect this condition. This is about change, the L3 configuration structure will be defined for gfx12.5+ platforms since they provide useful information about the cache hierarchy to the drivers. Instead of checking whether the pointer is NULL in order to apply a unified L3 cache configuration, use it when there is a single ALL partition larger than can be represented via L3ALLOC. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:28 -07:00
Caio Oliveira	9d73bfc9cd	anv: Fix leak when compiling internal kernels Cc: mesa-stable Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25928>	2023-10-27 18:01:24 +00:00
Lionel Landwerlin	24631d308c	anv: ensure we reapply always pipeline dynamic state in runtime state Doing something like this is allowed : vkCreateGraphicsPipeline(.., scissorState, &pipeline); vkCmdBindPipeline(pipeline); vkCmdSetScissor(...) vkCmdBindPipeline(pipeline) If we don't reapply the pipeline dynamic state, the command buffer runtime state will keep the dynamic state set in between the 2 binds. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25915>	2023-10-26 18:02:53 +00:00
Lionel Landwerlin	ce5472137f	anv/meson: add missing dependency on the interface header Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `db335d9b73` ("anv: factor out host/gpu internal shaders interfaces") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25905>	2023-10-26 12:26:05 +00:00
Tapani Pälli	c945e0777d	anv: add required PC for Wa_14014966230 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25671>	2023-10-26 11:51:47 +00:00
Tapani Pälli	2254eaa3ae	anv: add current_pipeline for batch_emit_pipe_control This way we can implemented workarounds depending on the pipeline. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25671>	2023-10-26 11:51:47 +00:00
Yonggang Luo	43715516fc	treewide: Merge num_mesh_vertices_per_primitive and u_vertices_per_prim into mesa_vertices_per_prim Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25880>	2023-10-26 09:35:04 +00:00
Lionel Landwerlin	a97065adab	anv: fix uninitialized use of compute initialization batch We sometimes fail initialization. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `09d12e6727` ("anv: Add support for I915_ENGINE_CLASS_COMPUTE in init_device_state()") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25891>	2023-10-25 19:27:23 +00:00
Lionel Landwerlin	3de5da7a5d	anv: fixup 32bit build of internal shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `11b4c23d19` ("anv: add ring buffer mode to generated draw optimization") Fixes: `db335d9b73` ("anv: factor out host/gpu internal shaders interfaces") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10037 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25870>	2023-10-25 11:47:40 +00:00
Chia-I Wu	b653669fc5	anv: add gen9 astc workaround gen9 does not handle denorms in void extent blocks correctly. We need to flush them to zero. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25800>	2023-10-25 00:06:04 +00:00
Chia-I Wu	c42b1a5a74	anv: prep for gen9 astc workaround We will reuse astc emu for gen9 astc workaround. This commit contains minor cleanups and has no functional change. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25800>	2023-10-25 00:06:04 +00:00
Rohan Garg	3bf1b7deba	anv: selectively enable FCV optimization for DG2 Enabling FCV on MTL breaks a number of games and benchmarks. Let's disable it for now till we can root cause the issue. Closes: #9987 Fixes: 26c2c9 ('anv: enable FCV for Gen12.5') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25863>	2023-10-24 19:27:14 +00:00
Rohan Garg	25a232238f	anv: turn off non zero fast clears for CCS_E This helps fix a performance regression on games such as F1 22 and RDR2. Turning on non zero fast clears causes additional partial resolves for these games that degrades performance. Let's turn off non zero fast clears till we can eliminate the partial resolves. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25863>	2023-10-24 19:27:14 +00:00
Rohan Garg	f85d8d908c	anv: cleanup includes Signed-off-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25766>	2023-10-24 10:33:57 +00:00
José Roberto de Souza	bd546f9e54	anv: Switch Xe KMD vm bind to sync It was never actually async as it was doing a DRM_IOCTL_SYNCOBJ_WAIT right after DRM_IOCTL_XE_VM_BIND but it was required to allow the partial binds required by sparse. But it is now fixed and we can switch back to sync vm bind. In future we will switch back to async vm bind to improve performance but this time it will be properly implemented. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25300>	2023-10-23 23:24:26 +00:00
José Roberto de Souza	531605accf	intel: Sync xe_drm.h Sync xe_drm.h with commit xxxxx ("drm/xe/uapi: Fix naming of XE_QUERY_CONFIG_MAX_EXEC_QUEUE_PRIORITY"). One not so straght forward change is that sync VM binds now don't require a syncobj anymore, the uAPI will return as soon the VM bind operations are done. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25300>	2023-10-23 23:24:26 +00:00
Nanley Chery	9e402e93d2	anv: Delete implicit CCS code Stop allocating CCS at the end of some BOs. Anv no longer uses that memory range. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	4cdd3178fb	anv: Meet CCS alignment reqs with dedicated allocs At image bind time, we require BOs to meet aux-map alignment requirements in order to enable CCS on images. This is a heuristic controlled by anv_bo_allows_aux_map(). To improve the chances of getting a properly aligned BO, we make use of the dedicated allocation extension. Firstly, we report to applications a preference for dedicated memory if an image would like to use the aux map. Secondly, we align the VMA for dedicated allocations to meet aux-map requirements. To make enabling modifiers much easier on integrated gfx12, report dedicated allocations as a requirement for modifiers which specify CCS. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v1) Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	2cbec81041	anv: Loosen anv_bo_allows_aux_map Instead of requiring that a BO has the has_implicit_ccs flag set, simply require that the BO is aligned according to aux-map requirements. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	ee6e2bc4a3	anv: Place images into the aux-map when safe to do so At image bind time, if an image's addresses can be placed into the aux-map without causing conflicts with a pre-existing mapping, do so. The code aux management code in the binding function operates on a per-plane basis. So, use the per-plane CCS memory range from the image rather than the CCS memory region for the entire BO. Another way to avoid aux-map conflicts is to rely solely on having a dedicated allocation for an image. Unfortunately, not all workloads change their behavior when drivers report a preference for dedicated allocations. In particular, 3DMark Wild Life Extreme does not make more dedicated allocations and such a solution was measured to perform ~16% worse than this solution. With this solution, I did not measure a loss of CCS on that benchmark. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6304 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v1) Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	207db22117	anv: Refactor CCS disabling at image bind time Split out the discrete and integrated implicit CCS cases. We'll do more work in the integrated case in a future commit. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	d31c62f384	anv: Wrap aux surface image binding queries Add and use anv_image_get_aux_memory_range. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	cd12eec496	anv: Allocate space for aux-map CCS in image bindings This makes images a bit larger by reserving space to store the compression control surface when the device uses an aux-map. This space is not used currently because anv still maps main surface addresses to space at the end of the anv_bo. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	5e07255148	anv: Move scope of CCS binding determination Move the determination of the image binding for CCS to a larger scope, so that it can be reused for other aux usages in add_aux_surface_if_supported(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	b1a14fe923	intel: Return a bool from intel_aux_map_add_mapping Make intel_aux_map_add_mapping return false if a mapping is attempted that would conflict with an existing one. If this function doesn't return false, it will either fail to return or return true. The Vulkan driver will make use of this feature to opportunistically enable CCS if a BO's VMA range has not been already mapped. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00

1 2 3 4 5 ...

5036 commits