fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 01:08:05 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	caf2389bc5	anv: use a single generation shader for indirect draws The indirect draw count shader can be used as a more generic case of the indirect draw one. We'll never enter the last condition of the shader (writing the MI_BATCH_BUFFER_START) with non count variants of draws. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	897a92f576	anv: remove MI_NOOPs at the end of the generation batch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	aa18d52728	anv: make sure mi_memcpy lands before push constant loads Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e2dc32d755` ("anv: move functions around to plan for generated draws") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	e68615aeaa	anv: fix indirect draws VF cache tracking of index buffer Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e2dc32d755` ("anv: move functions around to plan for generated draws") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	1454b789b1	anv: fix 3DSTATE_PS emission in generation shaders We have to use the helper and also were missing the vector mask programming. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	8f16ca8741	anv: remove commented code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	f5dc88910f	anv: remove pre hasvk split assert With softpin we should not always expect a BO in addresses. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	ae398284c9	anv: limit push constant dirtyness with generation shaders We only use the fragment shader push constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	2ea106e758	anv: correctly program 3DSTATE_SF in generation shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	e698040061	anv: remove BTI related flush in generation shaders Earlier versions of the generation shaders were using the binding table. We since switch to A64 messages. So the flush can be removed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	1dcb536acd	anv: remove copied code from generation shader Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	63fa6d9f49	anv: fix generated forward jump with more than 67M draws The issue here is that for draw indirect count variants, we want to jump after the last generated draw call to the next location where commands are. But if we have more than 67M draws (8k * 8k chunks), we only know the location once we've generated each of the 8k * 8k chunks. This change adds a CPU side pointer in the push constant struct so that we can create a single linked list of chunks to edit and go through to write the correct jump address after all the generated space has been allocated. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	c1c680c08b	anv: correctly reset generation address on command buffer reset Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	4246a519f3	anv: fix incorrect parameter cmd_buffer_update_dirty_vbs_for_gfx8_vb_flush takes a value RANDOM/SEQUENTIAL. Not a boolean. Fortunately this worked okay because true == RANDOM Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20497>	2023-03-03 11:30:54 +00:00
Lionel Landwerlin	6ee7a2ecfa	anv: pull Wa_14016118574 out of some loop not changing state The WA is meant to be here to apply some state that is not propagated properly inside the HW. But if you have a loop like : for ( ... ) { emit(3DPRIMITIVE, some param); } You're not really changing any state, just push more draws into the pipeline. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f2645229c2` ("anv: implement Wa_14016118574") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21660>	2023-03-03 09:34:16 +00:00
Lionel Landwerlin	d82e8e01c8	anv: fixup condition for Wa_14016118574 We don't want the WA to kick-in if it's not point/line topology. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f2645229c2` ("anv: implement Wa_14016118574") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21660>	2023-03-03 09:34:16 +00:00
Samuel Pitoiset	f775873f81	ci: uprev CTS to 1.3.5.0 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21390>	2023-03-03 08:23:21 +00:00
José Roberto de Souza	a24d93aa89	intel/dev: Query and compute hardware topology for Xe Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21368>	2023-03-03 05:25:35 +00:00
José Roberto de Souza	4b81a80f55	intel/dev: Implement Xe functions to handle hwconfig Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21368>	2023-03-03 05:25:35 +00:00
José Roberto de Souza	bc24091c52	intel/dev: Implement Xe functions to fill intel_device_info Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21368>	2023-03-03 05:25:35 +00:00
José Roberto de Souza	545d7e07ca	intel/dev: Add INTEL_KMD_TYPE_XE As mentioned in the previous patch, if intel-xe-kmd is disabled it will fail to detected in run time but it will still compile all Xe files. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21368>	2023-03-03 05:25:35 +00:00
Mark Janes	276f4a9d8c	intel/dev: Print required workarounds with intel_dev_info With the addition of workarounds, the output from this tool is more verbose than some users will want. Provide optional parameters for enabling hwconfig and workaround details. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21639>	2023-03-03 04:55:08 +00:00
Faith Ekstrand	9a4641cf6b	intel/nir: Limit unaligned loads to vec4 This probably doesn't affect Vulkan or GL because they can't have anything bigger than a vec4 anyway unless it's a u64vec4 and those have to be at least 8B aligned. This may affect CL apps if they use __attribute__((packed)) on something with big vectors, depending on how LLVM decides to translate that. Fixes: `f8aa83f0c8` ("intel/nir: Use nir_lower_mem_access_bit_sizes()") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	eb9a56b6ca	nir: Rename nir_mem_access_size_align::align_mul to align It's a simple alignment so calling it align_mul is a bit misleading. Suggested-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	ca4d73ba36	nir: Add a combined alignment helper Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@colllabora.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Faith Ekstrand	116a851264	nir: Add mode filtering to lower_mem_access_bit_sizes Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21524>	2023-03-03 02:00:39 +00:00
Lionel Landwerlin	f1e4d5c910	anv: fix scratch buffer reloc in 3DSTATE_HS We need to have the scratch buffer added to the pipeline BO tracking list, so it's added to the batch buffer and finally to the execbuffer list. Otherwise we pagefault (or read the default scratch page on i915). Fixes dEQP-VK.subgroups.ballot_broadcast.graphics.subgroupbroadcast_u16vec4 on CI (and probably other tests). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2028f1caa3` ("anv: emit 3DSTATE_HS in cmd_buffer_flush_gfx_state") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21653>	2023-03-02 17:51:41 +00:00
Väinö Mäkelä	e509afacf3	hasvk: Disable non-zero fast clears for 8xMSAA images Using texelFetch to read samples from an 8xMSAA fast cleared image on Haswell can read transparent black pixels around triangles from where there should be none. This issue isn't present when using sample shading, resolving the image using vkCmdResolveImage or in a copy the image. The easiest way to fix this is by just disabling non-zero fast clears for 8xMSAA images. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7587 Cc: mesa-stable Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21444>	2023-03-02 17:26:09 +00:00
Lionel Landwerlin	c914e70bce	anv/hasvk: speed up null image/view descriptor writes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Chuansheng Liu <chuansheng.liu@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21642>	2023-03-02 15:03:25 +00:00
Tapani Pälli	207eb94445	intel/compiler: add comment about workaround on simd width Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21619>	2023-03-02 14:06:36 +00:00
Mark Janes	3c9a8f7a6d	intel/dev: generate helpers to identify platform workarounds Workarounds for defects in Intel silicon have been manually implemented: - consult defect database for the current platform - add workaround code behind platform ifdef or devinfo->ver checks Some bugs have occurred due to the manual process. Typical failure modes: - defect database is updated after a platform is enabled - version checks are overly broad (eg gfx11+) for defects that were fixed (eg in gfx12) - version checks are too narrow for defects that were extended to subsequent platforms. - missed workarounds This commit automates workaround handling: - Internal automation queries the defect database to collate and summarize defect documentation in json. - mesa_defs.json describes all public defects and impacted platforms. Defects which are extended to subsequent platforms are listed under the original defect. - gen_wa_helpers.py generates workaround helpers to be called in place of version checks: - NEEDS_WORKAROUND_{ID} provides a compile time check suitable for use in genX routines. - intel_device_info_needs_wa() provides a more precise runtime check, differentiating platforms within a generation and platform steppings. Internal automation will generate new mesa_defs.json as needed. Workarounds enabled with these helpers will apply correctly based on updated information in Intel's defect database. Reviewed-by: Dylan Baker <dylan@pnwbakers> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20825>	2023-03-02 00:01:27 +00:00
Dylan Baker	a0fa31bcdd	intel/dev: create a helper dependency for libintel_dev This ensures that users of libintel_dev.a won't be compiled until include files are generated, and that they are recompiled when the header changes. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20825>	2023-03-02 00:01:27 +00:00
Iván Briano	4887b88d22	anv: use the parameter passed to the macro The two points calling this macro pass dyn->rs.provoking_vertex to it, which is why it works, but it's cleaner to use the parameter instead. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21613>	2023-03-01 19:07:41 +00:00
Dylan Baker	a8691f916b	intel/mi: use 64bit constant for bitshift Coverity complains that we could end up rolling over on a 32bit platform, which isn't really true because of the assertion, but there's also no harm in ensuring that we have exactly the same behavior for both 32 bit and 64 bit platforms. CID: 1515989 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21572>	2023-03-01 18:42:25 +00:00
Timothy Arceri	d75a36a9ee	glsl: remove do_copy_propagation_elements() optimisation pass Since `13b859de` do_copy_propagation_elements() has a flaw where the time it takes to complete grows exponentially slowers as the number of nested loops increases. It can also hurt rather than help verses just letting NIR optimise the code. So if the NIR linker is enabled we let it handle it instead. shader-db results Iris (BDW): total instructions in shared programs: 11177181 -> 11199739 (0.20%) instructions in affected programs: 119424 -> 141982 (18.89%) helped: 109 HURT: 65 total cycles in shared programs: 368946819 -> 372277173 (0.90%) cycles in affected programs: 116539428 -> 119869782 (2.86%) total spills in shared programs: 3983 -> 8785 (120.56%) spills in affected programs: 2072 -> 6874 (231.76%) helped: 0 HURT: 6 total fills in shared programs: 2016 -> 6068 (200.99%) fills in affected programs: 230 -> 4282 (1761.74%) helped: 0 HURT: 6 LOST: 85 GAINED: 77 freedreno results: total instructions in shared programs: 11011122 -> 11011620 (<.01%) instructions in affected programs: 939829 -> 940327 (0.05%) total full in shared programs: 762725 -> 762674 (<.01%) full in affected programs: 1096 -> 1045 (-4.65%) total constlen in shared programs: 1772092 -> 1771596 (-0.03%) constlen in affected programs: 2780 -> 2284 (-17.84%) total stp in shared programs: 4040 -> 4058 (0.45%) stp in affected programs: 3656 -> 3674 (0.49%) total ldp in shared programs: 2160 -> 2178 (0.83%) ldp in affected programs: 1748 -> 1766 (1.03%) stp HURT: shaders/robclark-shaders/gfxbench5/gl_5_high_off/13.shader_test CL: 1231 -> 1234 (0.24%) stp HURT: shaders/robclark-shaders/gfxbench5/gl_5_normal_off/13.shader_test CL: 1231 -> 1234 (0.24%) stp HURT: shaders/robclark-shaders/gfxbench5/gl_5_high_off/15.shader_test CL: 453 -> 456 (0.66%) stp HURT: shaders/robclark-shaders/gfxbench5/gl_5_normal_off/15.shader_test CL: 453 -> 456 (0.66%) stp HURT: shaders/robclark-shaders/gfxbench5/gl_5_high_off/17.shader_test CL: 144 -> 147 (2.08%) stp HURT: shaders/robclark-shaders/gfxbench5/gl_5_normal_off/17.shader_test CL: 144 -> 147 (2.08%) however, those stp counts are misleading -- gfxbench gl-5-normal actually gets its scratch (ldp/stp) stored as 16 bits instead of 32 thanks to better NIR copy prop, and the result is 2.64398% +/- 0.0991923% perf improvement! i915 results: total instructions in shared programs: 510528 -> 510489 (<.01%) instructions in affected programs: 3303 -> 3264 (-1.18%) total tex_indirect in shared programs: 16708 -> 16717 (0.05%) tex_indirect in affected programs: 134 -> 143 (6.72%) total temps in shared programs: 30181 -> 30169 (-0.04%) temps in affected programs: 1268 -> 1256 (-0.95%) LOST: 0 GAINED: 1 i915 highlights: instructions HURT: shaders/closed/steam/legend-of-grimrock/47.shader_test FS: 141 -> 144 (2.13%) instructions HURT: shaders/closed/steam/steamworld-dig/22.shader_test FS: 84 -> 108 (28.57%) temps HURT: shaders/closed/steam/left-4-dead-2/medium/3682.shader_test FS: 7 -> 13 (85.71%) r300 results: total instructions in shared programs: 1340439 -> 1340845 (0.03%) instructions in affected programs: 32354 -> 32760 (1.25%) total temps in shared programs: 179394 -> 179329 (-0.04%) temps in affected programs: 1505 -> 1440 (-4.32%) total consts in shared programs: 1177742 -> 1177885 (0.01%) consts in affected programs: 1107 -> 1250 (12.92%) total lits in shared programs: 24992 -> 25019 (0.11%) lits in affected programs: 138 -> 165 (19.57%) instructions HURT: shaders/closed/steam/legend-of-grimrock/26.shader_test FS: 47 -> 52 (10.64%) instructions HURT: shaders/closed/steam/sanctum-2/6072.shader_test FS: 43 -> 48 (11.63%) instructions HURT: shaders/closed/steam/champions-of-regnum/2378.shader_test VS: 35 -> 40 (14.29%) Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13288>	2023-03-01 16:09:25 +00:00
Lionel Landwerlin	42e8a2c1d6	genxml: fix border color offset field on Gfx12+ I wonder if the docs are correct for Gfx11 because this is the generation that gave us the Bindless Sampler Heap of 4Gb. So it would make sense that the border colors can also be placed anywhere in that 4Gb heap. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21600>	2023-03-01 08:45:11 +00:00
Lionel Landwerlin	58b687d77b	genxml: Fix STATE_BASE_ADDRESS::BindlessSurfaceStateSize field size BSpec 44507 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21600>	2023-03-01 08:45:11 +00:00
Dave Airlie	1f0fdcb619	anv: always pick graphics queue to execute prime blits on. This will change when we get transfer queues but this should avoid video queues being picked by accident. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21204>	2023-03-01 03:37:36 +00:00
Lionel Landwerlin	672b2f9ad1	anv: remove more Gfx7 code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21599>	2023-02-28 23:49:27 +00:00
Lionel Landwerlin	3cd72a2840	anv: fixup Wa_16011107343 for Gfx12 only Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `75968398f3` ("anv: emit 3DSTATE_HS for each primitive on gfx12") Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21605>	2023-02-28 23:03:21 +00:00
Marcin Ślusarz	e74a3284f5	anv: halve the push constants space in mesh pipelines It's only used by fragment shaders, so halving it matches the size used in the most optimal primitive pipeline (VS + FS). This change frees some URB space for mesh and task shaders and as a result improves vk_meshlet_cadscene performance by up to 2%, depending on the model. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21559>	2023-02-28 20:18:01 +00:00
David Heidelberg	baea3b328b	intel/vulkan: add missing dependency on generated headers Adding correct dependencies prevents occasional build flakes with parallel builds. ``` FAILED: src/intel/vulkan/libanv_common.a.p/anv_generated_indirect_draws.c.o ccache cc -Isrc/intel/vulkan/libanv_common.a.p -Isrc/intel/vulkan -I../src/intel/vulkan -Iinclude -I../include -Isrc -I../src -Isrc/mapi -I../src/mapi -Isrc/mesa -I../src/mesa -I../src/gallium/include -Isrc/intel -I../src/intel -Isrc/compiler -I../src/compiler -Isrc/compiler/nir -I../src/compiler/nir -Isrc/vulkan/util -I../src/vulkan/util -Isrc/vulkan/runtime -I../src/vulkan/runtime -Isrc/vulkan/wsi -I../src/vulkan/wsi -Isrc/intel/genxml -Isrc/intel/vulkan/shaders -Isrc/intel/ds -I/usr/local/include -I/usr/local/include/libdrm -fvisibility=hidden -fdiagnostics-color=always -D_FILE_OFFSET_BITS=64 -Wall -Winvalid-pch -Werror -std=c11 -O2 -g -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS '-DPACKAGE_VERSION="23.1.0-devel"' '-DPACKAGE_BUGREPORT="https://gitlab.freedesktop.org/mesa/mesa/-/issues"' -DHAVE_OPENGL=1 -DHAVE_OPENGL_ES_1=1 -DHAVE_OPENGL_ES_2=1 -DHAVE_SWRAST -DHAVE_VIRGL -DHAVE_RADEONSI -DHAVE_ZINK -DHAVE_CROCUS -DHAVE_IRIS -DHAVE_I915 -DVIDEO_CODEC_VC1DEC=1 -DVIDEO_CODEC_H264DEC=1 -DVIDEO_CODEC_H264ENC=1 -DVIDEO_CODEC_H265DEC=1 -DVIDEO_CODEC_H265ENC=1 -DHAVE_X11_PLATFORM -DHAVE_SURFACELESS_PLATFORM -DHAVE_DRM_PLATFORM -DHAVE_XCB_PLATFORM -DHAVE_ST_VDPAU -DENABLE_ST_OMX_BELLAGIO=0 -DENABLE_ST_OMX_TIZONIA=0 -DGLX_INDIRECT_RENDERING -DGLX_DIRECT_RENDERING -DGLX_USE_DRM -DALLOW_KCMP -DENABLE_SHADER_CACHE -DHAVE___BUILTIN_BSWAP32 -DHAVE___BUILTIN_BSWAP64 -DHAVE___BUILTIN_CLZ -DHAVE___BUILTIN_CLZLL -DHAVE___BUILTIN_CTZ -DHAVE___BUILTIN_EXPECT -DHAVE___BUILTIN_FFS -DHAVE___BUILTIN_FFSLL -DHAVE___BUILTIN_POPCOUNT -DHAVE___BUILTIN_POPCOUNTLL -DHAVE___BUILTIN_UNREACHABLE -DHAVE___BUILTIN_TYPES_COMPATIBLE_P -DHAVE_FUNC_ATTRIBUTE_CONST -DHAVE_FUNC_ATTRIBUTE_FLATTEN -DHAVE_FUNC_ATTRIBUTE_MALLOC -DHAVE_FUNC_ATTRIBUTE_PURE -DHAVE_FUNC_ATTRIBUTE_UNUSED -DHAVE_FUNC_ATTRIBUTE_WARN_UNUSED_RESULT -DHAVE_FUNC_ATTRIBUTE_WEAK -DHAVE_FUNC_ATTRIBUTE_FORMAT -DHAVE_FUNC_ATTRIBUTE_PACKED -DHAVE_FUNC_ATTRIBUTE_RETURNS_NONNULL -DHAVE_FUNC_ATTRIBUTE_ALIAS -DHAVE_FUNC_ATTRIBUTE_NORETURN -DHAVE_FUNC_ATTRIBUTE_VISIBILITY -DHAVE_UINT128 -DHAVE_REALLOCARRAY -D_GNU_SOURCE -DUSE_SSE41 -DUSE_GCC_ATOMIC_BUILTINS -DUSE_X86_64_ASM -DMAJOR_IN_SYSMACROS -DHAS_SCHED_H -DHAS_SCHED_GETAFFINITY -DHAVE_LINUX_FUTEX_H -DHAVE_ENDIAN_H -DHAVE_DLFCN_H -DHAVE_SYS_SHM_H -DHAVE_CET_H -DHAVE_SYS_INOTIFY_H -DHAVE_STRTOF -DHAVE_MKOSTEMP -DHAVE_TIMESPEC_GET -DHAVE_MEMFD_CREATE -DHAVE_RANDOM_R -DHAVE_FLOCK -DHAVE_STRTOK_R -DHAVE_GETRANDOM -DHAVE_GNU_QSORT_R -DHAVE_STRUCT_TIMESPEC -DHAVE_PROGRAM_INVOCATION_NAME -DHAVE_ISSIGNALING -DHAVE_POSIX_MEMALIGN -DHAVE_DIRENT_D_TYPE -DHAVE_STRTOD_L -DHAVE_DLADDR -DHAVE_DL_ITERATE_PHDR -DSUPPORT_INTEL_INTEGRATED_GPUS -DHAVE_ZLIB -DHAVE_COMPRESSION -DHAVE_PTHREAD -DHAVE_PTHREAD_SETAFFINITY -DHAVE_LIBDRM -DLLVM_AVAILABLE '-DMESA_LLVM_VERSION_STRING="13.0.1"' -DLLVM_IS_SHARED=1 -DDRAW_LLVM_AVAILABLE -DUSE_LIBELF -DMESA_EXECMEM -DHAVE_LIBUNWIND -DHAVE_OPENMP -DHAVE_DRI -DHAVE_DRI2 -DHAVE_DRI3 -DHAVE_DRI3_MODIFIERS -DHAVE_DRISW_KMS -DHAVE_PERFETTO -mtls-dialect=gnu2 -Werror=implicit-function-declaration -Werror=missing-prototypes -Werror=return-type -Werror=empty-body -Werror=incompatible-pointer-types -Werror=int-conversion -Wimplicit-fallthrough -Wmisleading-indentation -Wno-missing-field-initializers -Wno-format-truncation -Wno-nonnull-compare -fno-math-errno -fno-trapping-math -fno-common -Wno-unused-function -Werror=format -Wformat-security -ffunction-sections -fdata-sections -fPIC -DVK_USE_PLATFORM_XCB_KHR -DVK_USE_PLATFORM_XLIB_KHR -DVK_USE_PLATFORM_DISPLAY_KHR -DVK_USE_PLATFORM_XLIB_XRANDR_EXT -Wno-override-init -DANV_SUPPORT_RT=0 -MD -MQ src/intel/vulkan/libanv_common.a.p/anv_generated_indirect_draws.c.o -MF src/intel/vulkan/libanv_common.a.p/anv_generated_indirect_draws.c.o.d -o src/intel/vulkan/libanv_common.a.p/anv_generated_indirect_draws.c.o -c ../src/intel/vulkan/anv_generated_indirect_draws.c ../src/intel/vulkan/anv_generated_indirect_draws.c:34:10: fatal error: shaders/generated_draws_spv.h: No such file or directory 34 \| #include "shaders/generated_draws_spv.h" \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ compilation terminated. ``` Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21592>	2023-02-28 17:09:32 +01:00
Tapani Pälli	75968398f3	anv: emit 3DSTATE_HS for each primitive on gfx12 This is Wa_16011107343, same workaround as commit `880a3efe6c` but for gfx12. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21551>	2023-02-28 08:07:01 +00:00
Emma Anholt	d246948ce3	anv: Skip BTI RT flush if we're doing an op that doesn't use render targets. rt_flushes emitted on zink sauer.trace --loop=500 -2.02118% +/- 1.15992% (n=8). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21508>	2023-02-27 21:44:56 +00:00
Emma Anholt	2bd304bc8f	anv: Skip the RT flush when doing depth-only rendering. The spec citation says it's just for when the RT write message BTI might point to a different RT, and if we don't have any color attachments then we won't have one of those at all. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21508>	2023-02-27 21:44:56 +00:00
Caio Oliveira	c80268a20d	intel/compiler: Mark various memory barriers intrinsics unreachable Now that both SPIR-V and GLSL are using scoped barriers, we can stop handling the specialized ones. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3339>	2023-02-27 20:24:01 +00:00
Yonggang Luo	669a68489d	meson: Use sse2_arg and sse2_args to replace usage of c and c_sse2_args And now c_sse2_arg and c_sse2_args are remvoed Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21375>	2023-02-27 13:50:11 +00:00
Mike Blumenkrantz	7c8a5f6e37	vulkan/wsi: switch to using an options struct for last param this makes adding values easier since the drivers won't need to be updated Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21447>	2023-02-27 13:21:21 +00:00
Francisco Jerez	4420251947	intel/rt: Fix L3 bank performance bottlenecks due to SW stack stride alignment. Power-of-two SW stack sizes are prone to causing collisions in the hashing function used by the L3 to map memory addresses to banks, which can cause stack accesses from most DSSes to bottleneck on a single L3 bank. Fix it by padding the SW stack stride by a single cacheline if it was a power of two. This has been reported by Felix DeGrood to improve Quake2 RTX performance by ~30% on DG2-512 in combination with other RT patches Lionel Landwerlin has been working on. Many thanks to Felix DeGrood for doing much of the legwork and providing several iterations of Q2RTX performance counter dumps which eventually prompted me to consider the hash collision theory and motivated this patch, and for providing additional performance counter dumps confirming that there is no longer an appreciable imbalance in traffic across L3 banks after this change. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21461>	2023-02-26 11:48:33 -08:00
David Heidelberg	b52917f9fc	intel: use c_see2_arg instead of explicit -msse2 This allows us to also inherit `-mfpmath=sse` added in previous commit. Acked-by: Yonggang Luo <luoyonggang@gmail.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21371>	2023-02-25 15:34:33 +01:00

1 2 3 4 5 ...

9153 commits