fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 05:18:12 +02:00

Author	SHA1	Message	Date
Francisco Jerez	694d64188b	intel/xehp+: Define driconf option for selectively disabling TBIMR. This may help debugging performance problems in the possible case that TBIMR negatively impacts the performance of some application. It could also allow applying application-specific band-aid fixes in the XML file until a more general workaround is implemented. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:29 -07:00
Francisco Jerez	da28582eec	intel/xehp+: Add dynamic state flags controlling whether TBIMR is enabled during 3D primitives. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:29 -07:00
Francisco Jerez	622c2498d4	intel/xehp+: Import algorithm for TBIMR tiling parameter calculation. This implements a minimalistic algorithm that can be used to obtain an approximate solution for the integer programming problem of finding the optimal tile dimensions based on an estimate of the tile cache consumption per pixel of the current graphics pipeline -- Including the TC footprint of render targets, depth and stencil buffers and their auxiliary surfaces. Considering other (less local) memory accesses performed by the pipeline (like texturing and shader storage) would be useful (and could be considered by this algorithm with little modification), but it would be pretty difficult to estimate the L3 cache consumption per pixel of such accesses based on static analysis of the pipeline state alone without some sort of dynamic feedback. The present algorithm returns a config with tile area large enough to utilize a target fraction of the L3, which can be adjusted to obtain greater/lower utilization of the L3 at the cost of higher/lower risk of L3 cache thrashing respectively. The aspect ratio of the tile layout returned attempts to minimize the number of poorly utilized tiles around the boundaries of the framebuffer (due to partial coverage), since having the tile sequencer process additional tiles comes at a cost due to the latency of the additional passes, even if they're mostly empty. Finally, among the solutions with satisfactory cache footprint and tile count, the tile aspect ratio closest to 1 is returned where possible, since tiles with very high aspect ratios can have a negative impact on cache locality. The algorithm is primarily intended for TBIMR, but it could be used for PTBR as well with little modifications, since the TBIMR-specific assumptions are few and noted in comments below. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:29 -07:00
Francisco Jerez	cec5541b02	intel/xehp+: Add TBIMR-related genxml definitions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:29 -07:00
Francisco Jerez	3e3fd921ac	intel/mtl: Import L3 cache configurations. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:28 -07:00
Francisco Jerez	468904e833	intel/dg2: Import L3 cache configurations. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:28 -07:00
Jordan Justen	524996106c	intel/l3: Use devinfo->urb.size when cfg urb-size is 0. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:28 -07:00
Anuj Phogat	ed5ff8f297	intel/l3: Adjust URB weight calculation for gfx12.5+. Gfx12.5+ devices use special-purpose memory for the URB instead of requiring a portion of the L3 to be carved out. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:28 -07:00
Francisco Jerez	6b9583734b	intel/l3: Set up L3FullWayAllocationEnable config if ALL partition has over 126 ways. L3 configurations with an ALL partition of 128 ways per bank or more cannot be represented with the normal L3ALLOC partitioning mechanism since the "All L3 client pool" field would overflow, instead the L3FullWayAllocationEnable bit has to be set, which causes the whole L3 to be used in a unified cache configuration. That's precisely the configuration we're currently using on recent platforms, but previously we were relying on the L3 config tables being empty and the selected L3 configuration being a NULL pointer to detect this condition. This is about change, the L3 configuration structure will be defined for gfx12.5+ platforms since they provide useful information about the cache hierarchy to the drivers. Instead of checking whether the pointer is NULL in order to apply a unified L3 cache configuration, use it when there is a single ALL partition larger than can be represented via L3ALLOC. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:28 -07:00
Francisco Jerez	f36027f389	intel/l3: Define helper for obtaining the size of an L3 partition in KB. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:28 -07:00
Francisco Jerez	19e62e8fba	intel/l3/gfx11+: Add tile cache partition to intel_l3_config struct. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25493>	2023-10-27 14:48:28 -07:00
Caio Oliveira	9d73bfc9cd	anv: Fix leak when compiling internal kernels Cc: mesa-stable Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25928>	2023-10-27 18:01:24 +00:00
Lionel Landwerlin	7cff4cc9c8	intel/fs: Xe2 fix for ExBSO on UGM Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> BSpec: 56890 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25506>	2023-10-27 10:58:12 +03:00
Alyssa Rosenzweig	c8192c1c93	hasvk: Support builiding on non-Intel Should help Eric build test releases on their MacBook :-) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Cc: mesa-stable Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25882>	2023-10-26 19:48:19 +00:00
Lionel Landwerlin	24631d308c	anv: ensure we reapply always pipeline dynamic state in runtime state Doing something like this is allowed : vkCreateGraphicsPipeline(.., scissorState, &pipeline); vkCmdBindPipeline(pipeline); vkCmdSetScissor(...) vkCmdBindPipeline(pipeline) If we don't reapply the pipeline dynamic state, the command buffer runtime state will keep the dynamic state set in between the 2 binds. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25915>	2023-10-26 18:02:53 +00:00
Jani Nikula	ae74d486ad	docs/isl: use hawkmoth instead of doxygen Use the hawkmoth c:auto* directives to incorporate isl documentation. Convert @param style parameter descriptions to rst info field lists. Add static stubs for generated headers. Fix a lot of references, in particular the symbols are now in the Sphinx C domain, not C++ domain. Tweak syntax here and there. Based on the earlier work by Erik Faye-Lund <kusmabite@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24507>	2023-10-26 16:13:26 +00:00
Jani Nikula	0ed5b8af01	isl: drop < style documentation comments Prepare for using Hawkmoth. Hawkmoth does not support trailing comments using /< ... */ syntax. Replace with regular documentation comments. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24507>	2023-10-26 16:13:25 +00:00
Lionel Landwerlin	ce5472137f	anv/meson: add missing dependency on the interface header Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `db335d9b73` ("anv: factor out host/gpu internal shaders interfaces") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25905>	2023-10-26 12:26:05 +00:00
Tapani Pälli	c945e0777d	anv: add required PC for Wa_14014966230 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25671>	2023-10-26 11:51:47 +00:00
Tapani Pälli	2254eaa3ae	anv: add current_pipeline for batch_emit_pipe_control This way we can implemented workarounds depending on the pipeline. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25671>	2023-10-26 11:51:47 +00:00
Tapani Pälli	3cf71ddfac	intel/dev: provide intel_device_info_is_adln helper Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25671>	2023-10-26 11:51:47 +00:00
Yonggang Luo	43715516fc	treewide: Merge num_mesh_vertices_per_primitive and u_vertices_per_prim into mesa_vertices_per_prim Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25880>	2023-10-26 09:35:04 +00:00
Lionel Landwerlin	439b0e8688	intel/fs: fix dynamic interpolation mode selection We can end up in situation where we are dispatched with a multisample framebuffer but not at per-sample. In this case we would request the at_sample value with the wrong message configuration. Relying on the BRW_WM_MSAA_FLAG_MULTISAMPLE_FBO flag superseeds BRW_WM_MSAA_FLAG_PERSAMPLE_DISPATCH. Fixes piglit tests : spec@arb_gpu_shader5@arb_gpu_shader5-interpolateatsample* With Zink on Anv Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `68027bd38e` ("intel/fs: implement dynamic interpolation mode for dynamic persample shaders") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25854>	2023-10-25 21:15:48 +00:00
Lionel Landwerlin	a97065adab	anv: fix uninitialized use of compute initialization batch We sometimes fail initialization. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `09d12e6727` ("anv: Add support for I915_ENGINE_CLASS_COMPUTE in init_device_state()") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25891>	2023-10-25 19:27:23 +00:00
Lionel Landwerlin	3de5da7a5d	anv: fixup 32bit build of internal shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `11b4c23d19` ("anv: add ring buffer mode to generated draw optimization") Fixes: `db335d9b73` ("anv: factor out host/gpu internal shaders interfaces") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10037 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25870>	2023-10-25 11:47:40 +00:00
Tapani Pälli	d52c39a6cd	intel/dev: expand existing fix for all gfx12 with small EU count Commit `7db1b94e07` added a fix for ADL-N but this issue has been reproduced also on RPL-S and is likely common with all gfx12 variants with a small EU count. cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25861>	2023-10-25 05:15:47 +00:00
Chia-I Wu	b653669fc5	anv: add gen9 astc workaround gen9 does not handle denorms in void extent blocks correctly. We need to flush them to zero. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25800>	2023-10-25 00:06:04 +00:00
Chia-I Wu	c42b1a5a74	anv: prep for gen9 astc workaround We will reuse astc emu for gen9 astc workaround. This commit contains minor cleanups and has no functional change. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25800>	2023-10-25 00:06:04 +00:00
Caio Oliveira	b91ed68fa0	intel/compiler: Don't emit calls to validate() in release build While the fs_visitor::validate() implementation is empty in release build, we still emit calls to it since it is defined in a separate compilation unit than its callers. To fix this, just expose an inline empty function in the header for the release mode. Fossil run time differences in TGL laptop (difference at 95.0% confidence): ``` Rise of The Tomb Rider (Native) [n=7] -0.482857 +/- 0.010932 -1.60608% +/- 0.0363621% Cyberpunk 2077 (DXVK) [n=7] -0.987143 +/- 0.0904516 -0.82996% +/- 0.076049% Batman Arkham City (DXVK) [n=7] -7.74857 +/- 0.329561 -1.46298% +/- 0.0622231% ``` Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25847>	2023-10-24 21:10:35 +00:00
Rohan Garg	3bf1b7deba	anv: selectively enable FCV optimization for DG2 Enabling FCV on MTL breaks a number of games and benchmarks. Let's disable it for now till we can root cause the issue. Closes: #9987 Fixes: 26c2c9 ('anv: enable FCV for Gen12.5') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25863>	2023-10-24 19:27:14 +00:00
Rohan Garg	25a232238f	anv: turn off non zero fast clears for CCS_E This helps fix a performance regression on games such as F1 22 and RDR2. Turning on non zero fast clears causes additional partial resolves for these games that degrades performance. Let's turn off non zero fast clears till we can eliminate the partial resolves. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25863>	2023-10-24 19:27:14 +00:00
Rohan Garg	f85d8d908c	anv: cleanup includes Signed-off-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25766>	2023-10-24 10:33:57 +00:00
José Roberto de Souza	bd546f9e54	anv: Switch Xe KMD vm bind to sync It was never actually async as it was doing a DRM_IOCTL_SYNCOBJ_WAIT right after DRM_IOCTL_XE_VM_BIND but it was required to allow the partial binds required by sparse. But it is now fixed and we can switch back to sync vm bind. In future we will switch back to async vm bind to improve performance but this time it will be properly implemented. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25300>	2023-10-23 23:24:26 +00:00
José Roberto de Souza	531605accf	intel: Sync xe_drm.h Sync xe_drm.h with commit xxxxx ("drm/xe/uapi: Fix naming of XE_QUERY_CONFIG_MAX_EXEC_QUEUE_PRIORITY"). One not so straght forward change is that sync VM binds now don't require a syncobj anymore, the uAPI will return as soon the VM bind operations are done. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25300>	2023-10-23 23:24:26 +00:00
Nanley Chery	d57611fe25	intel/isl: Add scores for GEN12_RC_CCS and MTL_RC_CCS Now that these CCS-enabled modifiers have non-zero scores, anv is enabled to use them. We found this to improve the performance of Borderlands 3 by 18.73%. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6701 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Tested-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	9e402e93d2	anv: Delete implicit CCS code Stop allocating CCS at the end of some BOs. Anv no longer uses that memory range. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	4cdd3178fb	anv: Meet CCS alignment reqs with dedicated allocs At image bind time, we require BOs to meet aux-map alignment requirements in order to enable CCS on images. This is a heuristic controlled by anv_bo_allows_aux_map(). To improve the chances of getting a properly aligned BO, we make use of the dedicated allocation extension. Firstly, we report to applications a preference for dedicated memory if an image would like to use the aux map. Secondly, we align the VMA for dedicated allocations to meet aux-map requirements. To make enabling modifiers much easier on integrated gfx12, report dedicated allocations as a requirement for modifiers which specify CCS. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v1) Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	2cbec81041	anv: Loosen anv_bo_allows_aux_map Instead of requiring that a BO has the has_implicit_ccs flag set, simply require that the BO is aligned according to aux-map requirements. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	ee6e2bc4a3	anv: Place images into the aux-map when safe to do so At image bind time, if an image's addresses can be placed into the aux-map without causing conflicts with a pre-existing mapping, do so. The code aux management code in the binding function operates on a per-plane basis. So, use the per-plane CCS memory range from the image rather than the CCS memory region for the entire BO. Another way to avoid aux-map conflicts is to rely solely on having a dedicated allocation for an image. Unfortunately, not all workloads change their behavior when drivers report a preference for dedicated allocations. In particular, 3DMark Wild Life Extreme does not make more dedicated allocations and such a solution was measured to perform ~16% worse than this solution. With this solution, I did not measure a loss of CCS on that benchmark. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6304 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v1) Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	207db22117	anv: Refactor CCS disabling at image bind time Split out the discrete and integrated implicit CCS cases. We'll do more work in the integrated case in a future commit. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	d31c62f384	anv: Wrap aux surface image binding queries Add and use anv_image_get_aux_memory_range. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	cd12eec496	anv: Allocate space for aux-map CCS in image bindings This makes images a bit larger by reserving space to store the compression control surface when the device uses an aux-map. This space is not used currently because anv still maps main surface addresses to space at the end of the anv_bo. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	5e07255148	anv: Move scope of CCS binding determination Move the determination of the image binding for CCS to a larger scope, so that it can be reused for other aux usages in add_aux_surface_if_supported(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Nanley Chery	b1a14fe923	intel: Return a bool from intel_aux_map_add_mapping Make intel_aux_map_add_mapping return false if a mapping is attempted that would conflict with an existing one. If this function doesn't return false, it will either fail to return or return true. The Vulkan driver will make use of this feature to opportunistically enable CCS if a BO's VMA range has not been already mapped. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25003>	2023-10-23 21:37:24 +00:00
Lionel Landwerlin	454870dd5f	anv: merge gfx9/11 indirect draw generation shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25361>	2023-10-20 13:07:53 +00:00
Lionel Landwerlin	11b4c23d19	anv: add ring buffer mode to generated draw optimization When the number of draw calls is very large, instead of allocating large amounts of batch buffer space for the draws, use a ring buffer and process the draw calls by batches. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8645 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25361>	2023-10-20 13:07:53 +00:00
Lionel Landwerlin	718e77eee5	anv: index indirect data buffer with absolute offset This will help for a follow up change where we will respawn the shader multiple times in a loop and the base offset will be edited by the shader itself. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25361>	2023-10-20 13:07:53 +00:00
Lionel Landwerlin	db335d9b73	anv: factor out host/gpu internal shaders interfaces This will prevent host/gpu structure definitions to go out of sync. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25361>	2023-10-20 13:07:53 +00:00
Lionel Landwerlin	c700d47c56	anv: move generation batch fields to a sub-struct Just tyding things a bit since we're about to add more. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25361>	2023-10-20 13:07:53 +00:00
Lionel Landwerlin	2e0ff4c551	anv: avoid MI commands to copy draw indirect count We can just make the address of the count available to the generation shader. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25361>	2023-10-20 13:07:53 +00:00

1 2 3 4 5 ...

10480 commits