fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 11:48:06 +02:00

Author	SHA1	Message	Date
Jesse Natalie	733264bd7c	microsoft/compiler: Fix codegen when a loop ends in a jump Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7792 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20255>	2022-12-12 17:18:45 +00:00
Jesse Natalie	16c4c1a549	microsoft/compiler: Handle holes in driver_location when adding sysvals All of the full runtime+compiler stacks reassign these driver_location values to compact them and sort between shader stages, but for the spirv2dxil tool, we leave the original shader's "location" intact. That means that there can be holes in the driver_location space, and simply counting how many inputs there are can lead to collisions. So instead place all sysvals after the last-used driver_location. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7811 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20253>	2022-12-12 16:45:46 +00:00
Corentin Noël	1071d33c37	ci: Bump virglrenderer version Update virglrenderer to the latest version on time. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20277>	2022-12-12 15:49:08 +00:00
Danylo Piliaiev	0d34df0e6c	ir3/freedreno: Find regs for FS inputs when printing info FS inputs are not directly loaded into regs, but require additional instruction to do so. So in order to print in which reg the input is loaded we have to scan the shader for the instruction which loads the input. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20247>	2022-12-12 15:25:00 +00:00
Mikhail Korolev	c147a35644	radv: fix assertion on gpu hang detection fixes assert in RADV_DECL_PIPELINE_DOWNCAST when bound pipline is a compute pipeline Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20276>	2022-12-12 12:55:07 +00:00
Caio Oliveira	e9efd05af5	intel/compiler: Remove leftover declarations of old NIR passes Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19805>	2022-12-12 10:03:04 +00:00
Lionel Landwerlin	6106396825	intel/nir/rt: fixup primitive id There is a delta index value in the hit structure, we forgot to add it to the base value. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `0465714790` ("intel/nir/rt: add more helpers for ray queries") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7565 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19346>	2022-12-12 10:16:21 +02:00
Samuel Pitoiset	13f39da71a	radv: fix hashing descriptor set layout Shouldn't have pointers. Fixes: `19f8d33876` ("radv: Use vk_descriptor_set_layout") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20250>	2022-12-12 07:33:21 +00:00
Friedrich Vock	e20564cfdb	nir/lower_shader_calls: Remove phis after dead control flow This potentially gets rid of some more phis without sources. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19960>	2022-12-11 22:13:32 +00:00
Friedrich Vock	a54c2c8289	nir: Do not consider phis with incompatible dests equal CSE tries to collapse equal instructions, and collapsing two phis with incompatible dests is illegal. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `6bdce55c` ("nir: Add a basic CSE pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19960>	2022-12-11 22:13:32 +00:00
Eric Engestrom	c9c44d63da	docs/release-calendar: add 22.3.x dates Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20131>	2022-12-11 22:06:49 +00:00
Emma Anholt	110d550941	zink: Don't set dynamic color attachment state for 0 attachments. Fixes some validation failures like: VUID-vkCmdSetColorBlendEquationEXT-attachmentCount-arraylength(ERROR / SPEC): msgNum: -175001922 - Validation Error: [ VUID-vkCmdSetColorBlendEquationEXT-attachmentCount-arraylength ] Object 0: handle = 0xaaaae7632fa0, type = VK_OBJECT_TYPE_DEVICE; \| MessageID = 0xf591aebe \| vkCmdSetColorBlendEquationEXT: parameter attachmentCount must be greater than 0. The Vulkan spec states: attachmentCount must be greater than 0 (https://www.khronos.org/registry/vulkan/specs/1.3-extensions/html/vkspec.html#VUID-vkCmdSetColorBlendEquationEXT-attachmentCount-arraylength) However, we still have some around dynamic color attachment state: Objects: 1 [0] 0xaaaafcab4150, type: 6, name: NULL VUID_Undefined(ERROR / SPEC): msgNum: 2044605652 - Validation Error: [ VUID_Undefined ] Object 0: handle = 0xaaaafcab4150, type = VK_OBJECT_TYPE_COMMAND_BUFFER; \| MessageID = 0x79de34d4 \| VkCommandBuffer 0xaaaafcab4150[]: Dynamic color blend enable state not set for this command buffer. Objects: 1 [0] 0xaaaafcab4150, type: 6, name: NULL VUID_Undefined(ERROR / SPEC): msgNum: 2044605652 - Validation Error: [ VUID_Undefined ] Object 0: handle = 0xaaaafcab4150, type = VK_OBJECT_TYPE_COMMAND_BUFFER; \| MessageID = 0x79de34d4 \| VkCommandBuffer 0xaaaafcab4150[]: Dynamic color blend equation state not set for this command buffer. Objects: 1 [0] 0xaaaafcab4150, type: 6, name: NULL VUID_Undefined(ERROR / SPEC): msgNum: 2044605652 - Validation Error: [ VUID_Undefined ] Object 0: handle = 0xaaaafcab4150, type = VK_OBJECT_TYPE_COMMAND_BUFFER; \| MessageID = 0x79de34d4 \| VkCommandBuffer 0xaaaafcab4150[]: Dynamic color write mask state not set for this command buffer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20186>	2022-12-11 21:05:43 +00:00
Bas Nieuwenhuizen	efa4e9568b	radv: Use correct watermark for early loop exit. The previous check assumed the stack starts at offset=0, which isn't necessarily true for ray queries. Note that this didn't cause correctness issues, just made an optimization not apply. Found when I accidentally made this load-bearing in a refactor. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20265>	2022-12-11 18:51:29 +00:00
Bas Nieuwenhuizen	f0d6a1a685	radv: Rename stack_base to stack_low_watermark. Better covers the purpose. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20265>	2022-12-11 18:51:29 +00:00
Gert Wollny	b0a6e0e174	Revert "r600/sfn: Make use of variable length DOT" This reverts commit `fcafe1ffc8`. Variable length DOT products are not supported for pre EG cards, and the read port evaluation is not correctly checked, so that scheduling might fail. Revert for now to fix the issues below and get gack with a better implementation later. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7876 Related: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7878 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20268>	2022-12-11 18:10:27 +01:00
Marek Olšák	c9b13a9338	cso: remove cso_draw_vbo from all draws, call the driver or u_vbuf directly Instead of calling like this: st_draw_gallium -> cso_draw_vbo -> driver_draw_vbo Do it like this: st_draw_gallium -> driver_draw_vbo OR st_draw_gallium -> u_vbuf_draw_vbo It's accomplished by adding a draw_vbo function pointer into cso_context. The pointer is equal to pipe_context::draw_vbo when needed, so there is no call overhead from this if cso's draw_vbo callback is indeed equal to driver_draw_vbo. We just call cso_context_base::draw_vbo to jump into the driver directly, or u_vbuf if needed. The cso function with the indirect function call is inlined, so draws don't actually visit any cso_context function. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20025>	2022-12-11 14:37:27 +00:00
Marek Olšák	85f01982a0	cso: add a base class cso_context_base holding pipe_context* We'll add more stuff there. The first change is that we need pipe_context* there. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20025>	2022-12-11 14:37:27 +00:00
Marek Olšák	37e89b41f1	cso: unify cso_draw_vbo and cso_multi_draw This is going to be inlined. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20025>	2022-12-11 14:37:27 +00:00
Marek Olšák	8b4201d6bd	gallium/u_vbuf: change u_vbuf_draw_vbo to accept pipe_context as first param This makes the parameters equal to pipe_context::draw_vbo. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20025>	2022-12-11 14:37:27 +00:00
Marek Olšák	4a92492a8a	gallium: add the u_vbuf pointer into pipe_context This will allow removing the draw_vbo wrapping in cso_context. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20025>	2022-12-11 14:37:27 +00:00
Alyssa Rosenzweig	a9934a9f64	asahi: Implement occlusion queries While the hardware supports both counter and boolean occlusion queries, the programming model is quite different from OpenGL. In AGX (and in Metal), there is a single "visibility result buffer" associated with the render pass. Each draw that uses occlusion queries writes into this render pass global visibility result buffer at a particular index. By contrast, the OpenGL occlusion query model supposes that each query has independent state that can be mixed and matched within a render pass. We can't simply allocate backing memory for a query and write to it from a job. We can't allocate visibility result buffers for each batch up front and statically assign OpenGL queries to indices, because the OpenGL query can span multiple batches. Finally we can't use a global visibility result buffer without introducing additional synchronization, given that we now support multiple batches in-flight at once. In this patch, I've elected to use a simple solution: allocate visibility result buffers and indices on the fly as needed, and accumulate the results on the CPU at the end of the render pass. When we have proper synchronization we'll want to revisit this, but as everything is stalling at submit time now, I'm not inclined to "optimize" something I can't test. Passes dEQP-GLES3.functional.occlusion_query.* and the relevant piglit tests. The piglits are considerably more thorough, checking lots of "we hate tilers" conditions that dEQP skips over. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:55:30 -05:00
Alyssa Rosenzweig	4dabbb761b	asahi: Move query functions to agx_query.c New file. They're just stubs now but will get nontrivial in a moment. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:05 -05:00
Alyssa Rosenzweig	7a5f88cb38	asahi: Don't upload samplers for clears/stores Unlikely to help but makes the traces neater. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:05 -05:00
Alyssa Rosenzweig	d2f27d282f	asahi: Avoid reloads with staging blits Noticed by inspection. Not likely to matter unless these staging blits are in a hot path, but it's an easy win. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:05 -05:00
Alyssa Rosenzweig	dc4cf64a76	asahi: Don't reload uninitialized surfaces Pointless. This should save some bandwidth in some cases (possibly mipmap generation?) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:05 -05:00
Alyssa Rosenzweig	949a760c9f	asahi: Fix Z32S8 harder Fixes dEQP-GLES3.functional.texture.format.sized.2d.depth32f_stencil8_pot after stencil texturing broke it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	0c2500168d	asahi: Don't shadow idle resources Pointless allocation+memcpy. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	c9144eff48	asahi: Model alignment of occlusion query indices 8-byte offsets. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	3a318e4265	asahi: Identify some more fields used with layered These values depend on the framebuffer width/height and maybe other stuff. Maybe strides. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	c3eb81fd16	asahi: Identify XML for anisotropic filtering Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	7f247743a3	asahi: Check-box implement rasterizer discard Passes dEQP-GLES3.functional.rasterizer_discard.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	d2a2d1997e	asahi: Wire in 1D (array) textures Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	5612d2cbeb	asahi: Dirty track VS/FS key updates drawoverhead 1 score doubled to 7668. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:51:04 -05:00
Alyssa Rosenzweig	37feaf9c0c	asahi: Separate VS/FS shader keys First remove agx_shader_key from asahi_shader_key. It's trivial. agx_shader_key is going to go away soon now that we lower everything in NIR. Then everything else is mutually exclusive between stages. That means much less to hash. drawoverhead test 1 from 2331 to 3443. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:52 -05:00
Alyssa Rosenzweig	720ff76de4	asahi: Implement invalidate_resource From Panfrost. This lets us avoid storing depth/stencil attachments at the end of the frame in GLES. On my 4K monitor, glmark2 -btexture at fullscreen goes from 705fps to 1150fps. I assume gains on real workloads will be smaller. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:46 -05:00
Alyssa Rosenzweig	28b652af80	asahi: Track batch masks on ZS/blend CSO Adapted from panfrost, with the work happening at CSO create time instead of draw time allowing us to do more sophisticated analysis. We'll use these for accurate masks in a moment. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Alyssa Rosenzweig	33b1876857	asahi: Dirty track blend state We'll want this to reduce variant lookups eventually. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Alyssa Rosenzweig	29e6c00e3c	asahi: Enable dirty tracking Whoops. drawoverhead test 1 score from 496 -> 2377. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Alyssa Rosenzweig	b28fe26d7c	ail: Save level_offsets_compressed_B So we can bind specific mip levels for rendering into compressed Z/S. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20123>	2022-12-10 21:50:45 -05:00
Aleksey Komarov	3895545b83	panfrost: implement clear_depth_stencil Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
David Heidelberg	b19a14a094	nine: enable on panfrost Also, enable required kmsro dependencies. Tested-by: Aleksey Komarov <q4arus@ya.ru> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
David Heidelberg	be841f0e78	panfrost: implement clear_render_target Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Co-authored-by: Aleksey Komarov <q4arus@ya.ru> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Aleksey Komarov <q4arus@ya.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
David Heidelberg	8560c7613d	panfrost: Handle resources without depth in batch_to_fb_info Prevent preloading data from resources which doesn't exist. Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Aleksey Komarov <q4arus@ya.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
David Heidelberg	d76d791565	panfrost: Implement GL_EXT_clip_control Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Co-authored-by: Aleksey Komarov <q4arus@ya.ru> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Aleksey Komarov <q4arus@ya.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20238>	2022-12-10 10:56:09 +00:00
Paulo Zanoni	a099d6ae4d	intel: add devinfo->has_64bit_float_via_math_pipe Unusual hardware features that require special hanlding usually get a devinfo field, so do this for MTL's unordered DF types. This will guarantee that any platform based on MTL (thus inheriting from MTL_FEATURES) will automatically be handled in these special cases. v2: s/has_unordered_64bit_float/has_64bit_float_via_math_pipe/ (Curro). Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	eac00f4ec7	intel/compiler: fix intel_swsb_decode for newer platforms In the previous patch we adjusted the scoreboard pass to take into consideration a new case of unordered operations for TGL. Fix the decoding as well. v2: use intel_device_info_is_mtl() (Curro, Jordan) v3: the part where we export num_sources_from_inst() is now a separate patch (Curro). v4: Work around false positive maybe-unitialized warning since Marge uses -Werror=maybe-uninitialized (Marge). Reviewed-by: Francisco Jerez <currojerez@riseup.net> (v3) Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	295c5f59e0	intel/compiler: export brw_num_sources_from_inst We want to call this from brw_disasm.c, so move it out to brw_eu.c since it's about to become more of a shared utility function than something specific to the EU validator. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	df50add27e	intel/compiler: avoid 64bit SEL_EXEC on MTL On MTL, instructions with DF type are unordered, executed in the math pipe. This means that they require different SWSB dependency handling, and also that in some cases such as MOVs it's generally faster to simply use 2 smaller ordered moves than a single unordered MOV. One problem we have with the current code is that generate_code() is not setting the proper SWSB dependencies for the generated DF MOVs, causing some tests to fail. One solution would be to fix generate_code() by making it set the appropriate dependencies. This was the first patch I wrote. Another solution to this problem, pointed to us by Curro, is to change required_exec_type() so we use UD instructions instead of DF, just like we do with platforms that don't have 64 bit instructions, which means there won't be anything to fix in generate_code(). The second solution is what this patch implements. This fixes at least: - dEQP-VK.subgroups.arithmetic.framebuffer.subgroupmin_double_vertex Thanks to Francisco Jerez for all the major help provided with this problem. Credits-to: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	951855c349	intel/compiler: avoid (RegDist, SBID) on DF instructions on MTL When we use this form there's no way to specify which pipe RegDist refers to, so there are a few rules to figure this out, which is what inferred_sync_pipe() implements. But for MTL there's no long pipe and the documentation does not explicitly explain what should be the inferred type for its long (DF) instructions - which are out-of-order, by the way. One way to interpret this is that such case should be avoided. So add the extra check to entirely avoid this case. Notice that this is not actually fixing any bug, since returning TGL_PIPE_LONG (what we do today) will actually make these DF instructions incompatible with every in-order instruction, so we'll never opt to use the (RegDist, SBID) form anyway. But still, it's better to have this case explicitly documented instead of having it covered by a semi coincidence. v2: use intel_device_info_is_mtl() (Curro, Jordan) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00
Paulo Zanoni	16b9f87104	intel/compiler: on MTL, DF instructions run in the math pipe Adjust the scoreboard code to take that into account. Fixes at least: - dEQP-VK.glsl.builtin.precision_double.refract.compute.vec3 - dEQP-VK.glsl.builtin.precision_double.matrixcompmult.compute.mat4 v2: use intel_device_info_is_mtl() (Curro, Jordan) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20072>	2022-12-10 03:59:19 +00:00

1 2 3 4 5 ...

164155 commits