fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 22:00:13 +01:00

Author	SHA1	Message	Date
Lionel Landwerlin	108e79db1a	anv: factor out some more gpu_memcpy setup We want to have all the setup/workaround in a single spot. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29297>	2024-06-05 15:22:25 +00:00
Lionel Landwerlin	63676ed502	anv: fix Wa_16013994831 macros The commit that switched to the WA framework forgot to update one of the ifdef section. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e6e320fc79` ("anv: make Wa_16013994831 to use intel_needs_workaround") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27676>	2024-02-19 12:48:33 +00:00
Tapani Pälli	1693d0b857	anv: implement Wa_16014912113 When URB state for DS changes, we need to emit URB setup for VS with 256 handles and 0 for rest, commit this using a HDC flush before setting real values. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26920>	2024-02-05 13:50:58 +00:00
Tapani Pälli	36f428f1de	anv: check for wa 16013994831 in emit_so_memcpy_end We are toggling preemption on/off during streamout, this is also happening on gfx12 platforms, not just dg2. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27002>	2024-01-15 08:36:29 +00:00
Ian Romanick	b741a9a851	anv: Set PIPELINE_SELECT systolic mode enable flag Set the flag on compute shaders when the application has enabled the cooperative matrix feature. We might still want to enable this only when DPAS is actually used. The current method is based on many suggestions from Lionel. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25994>	2023-12-29 20:28:54 -08:00
Rohan Garg	de6653dc5d	anv: WA 16014538804 for DG2, MTL A0 Send empty/dummy PIPE_CONTROL after every third 3DPRIMITIVE command. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25039>	2023-11-08 11:00:55 +00:00
Tapani Pälli	2254eaa3ae	anv: add current_pipeline for batch_emit_pipe_control This way we can implemented workarounds depending on the pipeline. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25671>	2023-10-26 11:51:47 +00:00
Tapani Pälli	8d2dcd55d7	anv: refactor to fix pipe control debugging While earlier changes to pipe control emission allowed debug dump of each pipe control, they also changed debug output to almost always print same reason/function for each pc. These changes fix the output so that we print the original function name where pc is emitted. As example: pc: emit PC=( +depth_flush +rt_flush +pb_stall +depth_stall ) reason: gfx11_batch_emit_pipe_control_write pc: emit PC=( ) reason: gfx11_batch_emit_pipe_control_write changes back to: pc: emit PC=( +depth_flush +rt_flush +pb_stall +depth_stall ) reason: gfx11_emit_apply_pipe_flushes pc: emit PC=( ) reason: cmd_buffer_emit_depth_stencil Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25282>	2023-09-20 06:04:37 +00:00
Lionel Landwerlin	50f6903bd9	anv: add new low level emission & dirty state tracking A single Vulkan state can map to multiple fields in different GPU instructions. This change introduces the bottom half of a simplified emission mechanism where we do the following : Vulkan runtime state \| V Intermediate driver state \| V Instruction programming This way we can detect that the intermediate state didn't change and avoid HW instruction emission. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24536>	2023-09-06 20:07:02 +00:00
Tapani Pälli	71a2d651c1	anv: refactor batch_set_preemption to use batch_emit_pipe_control This makes it easier to hook workarounds for this pipe control. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24690>	2023-08-17 16:07:59 +00:00
Sagar Ghuge	49eabb9ea6	anv: Add GPU breakpoint before/after specific draw call This change allow us to insert the MI_SEMAPHORE_WAIT before/after specific draw call. With GTX tool, we can always update the memory address to unblock spinning wait. v2: - Make sure draw_call_count is thread-safe (Lionel) - Add static inline helper (Lionel) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24308>	2023-08-08 17:36:19 +00:00
Marcin Ślusarz	87dd96bbbe	anv: drop support for VK_NV_mesh_shader Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24071>	2023-07-14 08:27:14 +00:00
Jordan Justen	492b07625d	anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS Cc: 23.2 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23823>	2023-07-12 23:47:25 -07:00
Tapani Pälli	6a7dcd3e12	anv: change pipe controls in genX_gpu_memcpy to use pc helper Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23583>	2023-06-16 08:04:20 +00:00
Lionel Landwerlin	e9c1eaa535	anv: only disable mesh when enabled at the VkDevice level Saving ourselves some instructions since it's not going to get used. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23074>	2023-06-14 09:43:56 +03:00
Rohan Garg	d0e0ba897f	anv: split ANV_PIPE_RENDER_TARGET_BUFFER_WRITES for finer grained flushing split ANV_PIPE_RENDER_TARGET_BUFFER_WRITES into separate CS_STALL, RT_FLUSH & TILE_FLUSH flags in order to have finer control over cache coherency. Tigerlake CS has it's own cache fetching directly from the memory controller, so we need to do a tile flush to ensure the query data is visible. This fixes test_resolve_non_issued_query_data in vkd3d on TGL. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Fixes: `3c4c18341a` ("anv: narrow flushing of the render target to buffer writes") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23500>	2023-06-12 14:46:44 +00:00
Tapani Pälli	e6e320fc79	anv: make Wa_16013994831 to use intel_needs_workaround Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22560>	2023-06-06 12:06:22 +00:00
Lionel Landwerlin	7381405095	anv: fixup workaround 16011411144 We're missing it for the memcpy with streamout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5cc4075f95` ("anv, iris: Add Wa_16011411144 for DG2") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22930>	2023-05-11 15:24:03 +03:00
Lionel Landwerlin	6f02f9d108	anv: fix preemption enable emission in gpu_memcpy This has to be before the MI_BATCH_BUFFER_END otherwise it has no effect. This also was messing around with you batch length alignment. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b9aa66d5d0` ("anv: disable preemption for 3DPRIMITIVE during streamout") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20802>	2023-01-20 22:35:41 +02:00
Lionel Landwerlin	9a16effeac	anv: record secondaries' traces into primaries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16655>	2023-01-13 01:22:15 +00:00
Tapani Pälli	97f2b60833	anv: implement Wa_14015814527 for task shaders After using task shader, we need to emit a zero URB state and a nullprim (empty pipe control) before rendering with primitives. After this, a normal URB state needs to be returned, this will happen when pipeline batch is emitted during pipeline switch. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20334>	2023-01-03 12:44:08 +00:00
José Roberto de Souza	c6d1f76da2	anv: Add and use emit_pipeline_select() To avoid the replication of code to properly emit PIPELINE_SELECT. init_compute_queue_state() had a different emit of PIPELINE_SELECT but as there is no compute engine in GFX VER 11 we are safe with the differences. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20444>	2022-12-29 08:34:15 -08:00
Tapani Pälli	b9aa66d5d0	anv: disable preemption for 3DPRIMITIVE during streamout This is required by Wa_16013994831. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20438>	2022-12-27 15:53:42 +00:00
Iván Briano	766508f56a	Revert "anv: Refactor anv_pipeline to use the anv_pipeline_type" This reverts commit `b1126abb38`. This breaks all hell at least on DG2, as there are several cases left where current_pipeline gets checked against GPGPU to decide what to do, and the value doesn't match that of ANV_HW_PIPELINE_STATE_COMPUTE. On top of that, it also misses checking for ANV_HW_PIPELINE_STATE_RAYTRACING. Then there's the fact that in some cases, current_pipeline will be UINT32_MAX, because it's the original undefined state and also used after executing a secondary command buffer because we are not tracking on which pipeline did the secondary left us. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7910 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20349>	2022-12-16 06:39:32 +00:00
Lionel Landwerlin	b21cd1ee1b	anv: fixup another dirty issue with gpu_memcpy Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20335>	2022-12-15 17:30:55 +00:00
Rohan Garg	b1126abb38	anv: Refactor anv_pipeline to use the anv_pipeline_type Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20316>	2022-12-15 16:38:18 +00:00
Jason Ekstrand	c70ef757e6	anv: Use extended parameters on Gen11+ Gen11 added a nifty feature where we have three custom system-generated values called extended parameters that we can set to any 32-bit values we want. These work just like vertex and instance ID and are controlled in the pipeline by the 3DSTATE_SGVS_2 packet. They are provided to the draw call either by extra DWORDs on the end of 3DSTATE_PRIMITIVE or by storing values to more state registers. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20295>	2022-12-13 19:22:02 +00:00
Lionel Landwerlin	324d945589	anv: disable mesh in memcpy We can't have streamout and mesh enabled at the same time. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ef04caea9b` ("anv: Implement Mesh Shading pipeline") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19323>	2022-10-26 19:55:11 +00:00
Lionel Landwerlin	54bc34f70a	anv: comment out the Gfx8/9 VB cache key workaround for newer Gens This code shows up a little on profiling on Gfx12 and since it's only a gfx8/9 workaround we might as well ifdef it out. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19050>	2022-10-14 23:03:16 +00:00
Tapani Pälli	f2645229c2	anv: implement Wa_14016118574 After each 3DPRIMITIVE, we need to send a dummy post sync op if point or line list was used or if had only 1 or 2 vertices per primitive. v2: add missing _3DPRIM_POINTLIST_BF (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18746>	2022-09-23 12:27:05 +00:00
Tapani Pälli	85fc1decf0	anv: remove primitive_topology from 3DPRIMITIVE calls Field is ignored on BDW+, 3DSTATE_VF_TOPOLOGY is used to set topology. We still want to preserve topology information in state because of other upcoming changes that require it. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18698>	2022-09-21 04:42:42 +00:00
Tapani Pälli	f32ac1d30b	anv: implement Wa_14015946265 for DG2 SOL unit issues, wa is to send PC with CS stall after SO_DECL. v2: emit also in genX_gpu_memcpy (Lionel) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18409>	2022-09-07 04:38:05 +00:00
Kenneth Graunke	215b1b69cb	anv: Delete use_relocations flag There are no relocations. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18208>	2022-09-02 09:40:46 +00:00
Kenneth Graunke	3daeb22735	anv: Drop checks for version 8 or 9 anv no longer supports versions below this. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18208>	2022-09-02 09:40:46 +00:00
Lionel Landwerlin	a659819f79	anv: remove unused gfx7 code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Jason Ekstrand <jason.ekstrand@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18208>	2022-09-02 09:40:46 +00:00
José Roberto de Souza	356a60bd6c	anv: Do not duplicate intel_device_info memory in each logical device Each logical device can point to its physical device intel_device_info saving at least one intel_device_info. This also allow us to set 'const' to avoid values in intel_device_info being changed by mistake. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17897>	2022-08-19 16:29:58 +00:00
Jason Ekstrand	7d25c04236	anv: Switch to using common dynamic state tracking Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17564>	2022-07-19 23:16:45 +00:00
Lionel Landwerlin	4efc997472	anv: fix invalid utrace memcpy l3 config on gfx < 11 device->l3_config is only valid on Gfx11+ This only fixes using GPU_TRACE=1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `02a4d622ed` ("anv: expose a couple of emit helper to build utrace buffer copies") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16291>	2022-05-03 13:18:48 +00:00
Lionel Landwerlin	f4f350a06c	anv: reemit 3DSTATE_STREAMOUT after memcpy This doesn't fix anything because memcpy is only used before secondary buffer execution and we dirty everything after that. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16189>	2022-04-27 18:43:00 +00:00
Lionel Landwerlin	02a4d622ed	anv: expose a couple of emit helper to build utrace buffer copies We'll want to copy timestamp buffers when commands buffers are resubmitted multiple times. v2: Merge a couple of #if GFX_VER >= 8 (Rohan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Felix DeGrood	6c345ddbe4	anv: Cache VB/IB in L3$ for Gfx12 Gfx12 enables caching of Vertex and Index Buffers in L3. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9834>	2021-06-15 12:57:42 +00:00
Anuj Phogat	b75f095bc7	intel: Rename genx keyword to gfxx in source files Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "gen[[:digit:]]+" -rIl $SEARCH_PATH \| xargs sed -ie "s/gen$[[:digit:]]\+$/gfx\1/g" Exclude pack.h and xml changes in this patch: grep -E "gfx[[:digit:]]+_pack\.h" -rIl $SEARCH_PATH \| xargs sed -ie "s/gfx$[[:digit:]]\+_pack\.h$/gen\1/g" grep -E "gfx[[:digit:]]+\.xml" -rIl $SEARCH_PATH \| xargs sed -ie "s/gfx$[[:digit:]]\+\.xml$/gen\1/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	9da8a55b08	intel: Rename GEN_GEN macro to GFX_VER Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "GEN_GEN" -rIl $SEARCH_PATH \| xargs sed -ie "s/GEN_GEN/GFX_VER/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:06 +00:00
Anuj Phogat	692472a376	intel: Rename "gen_" prefix used in common code to "intel_" This patch renames functions, structures, enums etc. with "gen_" prefix defined in common code. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9413>	2021-03-10 22:23:51 +00:00
Anuj Phogat	733b0ee8cb	intel: Rename files with gen_ prefix in common code to intel_ Changes in this patch include: - Rename all files in src/intel/common path - Update the filenames used in source and build files Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9413>	2021-03-10 22:23:51 +00:00
Kenneth Graunke	02fe825a61	isl, anv, iris: Add a centralized helper to select MOCS based on usage On Gen12+, we can enable additional caches in certain usage situations. This routes that decision making to a central place in ISL, based on surface usage flags, and updates both drivers to use it. (i965 doesn't need to change because it doesn't support Gen12.) We continue handling the "external" decision via an anv_mocs() wrapper for now, since we store that flag in anv_bo, which isl doesn't know about. (We could introduce an ISL_SURF_USAGE_EXTERNAL, but I'm not actually sure that would be cleaner.) This patch should not have any functional nor performance effects, as we continue selecting the exact same MOCS values for now. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7104>	2020-10-19 19:18:11 +00:00
Jason Ekstrand	164aed6c81	anv:gpu_memcpy: Emit 3DSTATE_VF_INDEXING on Gen8+ If this gets run right after something which uses VK_VERTEX_INPUT_RATE_INSTANCE on its first vertex binding, we could end up in serious trouble. Fixes: `3d9747780b` "anv: Add a helper for doing buffer copies with..." Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5090>	2020-05-18 21:42:05 +00:00
Caio Marcelo de Oliveira Filho	cf54785239	anv/gen12: Lower VK_KHR_multiview using Primitive Replication Identify if view_index is used only for position calculation, and use Primitive Replication to implement Multiview in Gen12. This feature allows storing per-view position information in a single execution of the shader, treating position as an array. The shader is transformed by adding a for-loop around it, that have an iteration per active view (in the view_mask). Stores to the position now store into the position array for the current index in the loop, and load_view_index() will return the view index corresponding to the current index in the loop. The feature is controlled by setting the environment variable ANV_PRIMITIVE_REPLICATION_MAX_VIEWS, which defaults to 2 if unset. For pipelines with view counts larger than that, the regular instancing will be used instead of Primitive Replication. To disable it completely set the variable to 0. v2: Don't assume position is set in vertex shader; remove only stores for position; don't apply optimizations since other passes will do; clone shader body without extract/reinsert; don't use last_block (potentially stale). (Jason) Fix view_index immediate to contain the view index, not its order. Check for maximum number of views supported. Add guard for gen12. v3: Clone the entire shader function and change it before reinsert; disable optimization when shader has memory writes. (Jason) Use a single environment variable with _DEBUG on the name. v4: Change to use new nir_deref_instr. When removing stores, look for mode nir_var_shader_out instead of the walking the list of outputs. Ensure unused derefs are removed in the non-position part of the shader. Remove dead control flow when identifying if can use or not primitive replication. v5: Consider all the active shaders (including fragment) when deciding that Primitive Replication can be used. Change environment variable to ANV_PRIMITIVE_REPLICATION. Squash the emission of 3DSTATE_PRIMITIVE_REPLICATION into this patch. Disable Prim Rep in blorp_exec_3d. v6: Use a loop around the shader, instead of manually unrolling, since the regular unroll pass will kick in. Document that we don't expect to see copy_deref or load_deref involving the position variable. Recover use_primitive_replication value when loading pipeline from the cache. Set VARYING_SLOT_LAYER to 0 in the shader. Earlier versions were relying on ForceZeroRTAIndexEnable but that might not be sufficient. Disable Prim Rep in cmd_buffer_so_memcpy. v7: Don't use Primitive Replication if position is not set, fallback to instancing; change environment variable to be ANV_PRIMITVE_REPLICATION_MAX_VIEWS and default it to 2 based on experiments. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2313> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2313>	2020-04-07 17:16:09 +00:00
Jason Ekstrand	e6b39850f0	anv: Plumb deref block size through to 3DSTATE_SF Cc: "20.0" mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3454>	2020-01-30 18:46:28 -06:00
Jason Ekstrand	46af0ecc1d	anv: Use PIPE_CONTROL flushes to implement the gen8 VF cache WA Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-12-05 10:59:10 -06:00

1 2

72 commits