fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-01 03:10:09 +01:00

Author	SHA1	Message	Date
Jason Ekstrand	d65dbe8018	anv: Allow MSAA resolve with different numbers of planes The Vulkan spec for VK_KHR_depth_stencil_resolve allows a format mismatch between the primary attachment and the resolve attachment within certain limits. In particular, VUID-VkSubpassDescriptionDepthStencilResolve-pDepthStencilResolveAttachment-03181 If pDepthStencilResolveAttachment is not NULL and does not have the value VK_ATTACHMENT_UNUSED and VkFormat of pDepthStencilResolveAttachment has a depth component, then the VkFormat of pDepthStencilAttachment must have a depth component with the same number of bits and numerical type VUID-VkSubpassDescriptionDepthStencilResolve-pDepthStencilResolveAttachment-03182 If pDepthStencilResolveAttachment is not NULL and does not have the value VK_ATTACHMENT_UNUSED, and VkFormat of pDepthStencilResolveAttachment has a stencil component, then the VkFormat of pDepthStencilAttachment must have a stencil component with the same number of bits and numerical type So you can resolve from a depth/stencil format to a depth-only or stencil-only format so long as the number of bits matches. Unfortunately, this has never been tested because the CTS tests which purport to test this are broken and actually test with a destination combined depth/stencil format. Fixes: `5e4f9ea363` ("anv: Implement VK_KHR_depth_stencil_resolve") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15333>	2022-03-11 22:25:42 +00:00
Mike Blumenkrantz	ffd67b39e7	zink: remove flake this had already been resolved by the time the flake was added Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15350>	2022-03-11 14:46:41 -05:00
Jason Ekstrand	95a44a5b09	lavapipe: Use the auto-generated vk_enqueue_BeginRendering Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15325>	2022-03-11 11:40:41 -06:00
Jason Ekstrand	f76621f719	vulkan/cmd_queue: Properly support non-array pointer members Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5440 Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15325>	2022-03-11 11:36:53 -06:00
Thong Thai	027f1302fc	radeonsi: add option to disable EFC Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Thong Thai	23e5b910c5	radeon: add EFC support to only VCN2.0 devices Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5228 Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Thong Thai	9fa6ab962a	frontends/va: zero-copy efc Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Thong Thai	9602526568	frontends/va: add encoder format conversion (EFC) support Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Thong Thai	73153746d5	gallium: add parameters for encoder format conversion (EFC) support Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Tapani Pälli	adea096029	ci: update various ci result files Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12936>	2022-03-11 09:58:28 +00:00
Tapani Pälli	2a14baab85	mesa: check for valid internalformat with glTex[Sub]Image This changes our error handling to be compatible with upcoming specification change that unifies glTex[Sub]Image error handling between OpenGL ES 2.0 vs ES 3.0+ specifications, see: https://gitlab.khronos.org/opengl/API/-/issues/147 OpenGL ES 2.0.25 spec states: "Specifying a value for internalformat that is not one of the above values generates the error INVALID_VALUE. If internalformat does not match format, the error INVALID_OPERATION is generated." This fixes following new tests: KHR-GLES31.core.compressed_format.* KHR-GLES32.core.compressed_format.* v2: GL_INVALID_OPERATION -> GL_INVALID_VALUE in extension checks, remove (now overlapping) extension checks from _mesa_gles_error_check_format_and_type (Eric Anholt) v3: take GLES version in to account in internalformat checks Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12936>	2022-03-11 09:58:28 +00:00
Lionel Landwerlin	6cea8a43fa	anv: silence compiler warning Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	90000aea9b	anv: make a couple of descriptor function private Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	e12698724e	anv: rename host only descriptor internal flag We add an assert to verify that those are not bound. v2: Drop != 0 (Tapani) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	87f59b18cf	anv: don't lazy allocate surface states in descriptor sets In `4001d9ce1a` we started lazily allocating surface states in the descriptor sets rather than upfront in the descriptor pool. This was to workaround vkd3d-proton allocating more than we could handle at the HW level. The issue introduced in that change is that we didn't protect the descriptor pool free list as well as the anv_state_stream which are now potentially used from different threads through the descriptor set write functions. This reverts the lazy allocation part of that change. Host only descriptor sets changes remain. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4001d9ce1a` ("anv: Handle VK_DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT_VALVE for descriptor sets") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	71cd6a7b84	anv: fix acceleration structure descriptor copies We're not supposed to have a VkWriteDescriptorSetAccelerationStructureKHR when doing a copy. We should instead get the acceleration structure object from the source descriptor. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `03e1e19246` ("anv: Refactor descriptor copy") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Pierre-Eric Pelloux-Prayer	968d68125c	radeonsi: don't clear framebuffer.state before dcc decomp This causes inconsistencies between sctx->framebuffer.state and other sctx->framebuffer properties (like compressed_cb_mask). The point of this code was to fix an issue with vi_separate_dcc_stop_query, which was removed by `804e292440` we can safely drop it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6099 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15261>	2022-03-11 08:31:36 +00:00
Kenneth Graunke	01442cf4d4	iris: Restore flagging of dirty bindings in binder_realloc When I switched iris over to use 3DSTATE_BINDING_TABLE_POOL_ALLOC, I stopped flagging things dirty when allocating a new binder, because the contents of the binding table were still valid, thanks to us not having to subtract Surface State Base Address anymore. This unfortunately missed the point that the old binding table is in the old buffer, which is no longer what the binder pool base address points to. So we'd either need to copy it over, or just flag it dirty and re-emit it on the next draw. Fixes misrendering in Ryujinx. Fixes: `8b9045e7a4` ("intel: Use 3DSTATE_BINDING_TABLE_POOL_ALLOC exclusively on Gfx11+") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15314>	2022-03-11 07:59:18 +00:00
Samuel Pitoiset	b366fef091	radv: optimize the number of loaded components for VS inputs in NIR fossils-db (Sienna Cichlid): Totals from 3691 (2.74% of 134913) affected shaders: VGPRs: 121368 -> 121584 (+0.18%); split: -0.36%, +0.54% CodeSize: 7597912 -> 7561140 (-0.48%); split: -0.66%, +0.18% MaxWaves: 104706 -> 104772 (+0.06%) Instrs: 1441229 -> 1437652 (-0.25%); split: -0.53%, +0.28% Latency: 5500766 -> 5482101 (-0.34%); split: -0.45%, +0.11% InvThroughput: 804401 -> 797178 (-0.90%); split: -1.09%, +0.20% VClause: 25185 -> 25143 (-0.17%); split: -0.50%, +0.33% SClause: 27486 -> 27445 (-0.15%); split: -0.57%, +0.42% Copies: 143816 -> 147900 (+2.84%); split: -0.54%, +3.38% PreSGPRs: 109584 -> 110396 (+0.74%); split: -0.04%, +0.79% PreVGPRs: 95541 -> 94583 (-1.00%); split: -1.12%, +0.12% fossils-db (Polaris10): Totals from 1773 (1.30% of 135960) affected shaders: SGPRs: 80848 -> 80864 (+0.02%); split: -0.14%, +0.16% VGPRs: 56424 -> 55600 (-1.46%); split: -1.47%, +0.01% CodeSize: 1732588 -> 1696840 (-2.06%); split: -2.07%, +0.01% MaxWaves: 12103 -> 12106 (+0.02%) Instrs: 347684 -> 341597 (-1.75%); split: -1.76%, +0.01% Latency: 2542840 -> 2523946 (-0.74%); split: -0.95%, +0.21% InvThroughput: 924601 -> 905102 (-2.11%); split: -2.13%, +0.02% VClause: 9565 -> 9545 (-0.21%); split: -0.51%, +0.30% SClause: 10587 -> 10333 (-2.40%); split: -2.82%, +0.43% Copies: 19321 -> 20307 (+5.10%); split: -0.78%, +5.88% PreSGPRs: 30879 -> 30875 (-0.01%); split: -0.20%, +0.18% PreVGPRs: 41211 -> 41270 (+0.14%); split: -0.73%, +0.87% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15317>	2022-03-11 07:40:10 +00:00
Dave Airlie	1ec4e568de	radv: abstract queue family away from queue family index. If we introduce another queue type (video decode) we can have a disconnect between the RADV_QUEUE_ enum and the API queue_family_index. currently the driver has GENERAL, COMPUTE, TRANSFER which would end up at QFI 0, 1, <nothing> since we don't create transfer. Now if I add VDEC we get GENERAL, COMPUTE, TRANSFER, VDEC at QFI 0, 1, <nothing>, 2 or if you do nocompute GENERAL, COMPUTE, TRANSFER, VDEC at QFI 0, <nothing>, <nothing>, 1 This means we have to add a remapping table between the API qfi and the internal qf. This patches tries to do that, in theory right now it just adds overhead, but I'd like to exercise these paths. v2: add radv_queue_ring abstraction, and pass physical device in, as it makes adding uvd later easier. v3: rename, and drop one direction as unneeded now, drop queue_family_index from cmd_buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13687>	2022-03-11 04:38:55 +00:00
Mike Blumenkrantz	afb4cced5c	lavapipe: more descriptor validation Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14656>	2022-03-11 04:26:28 +00:00
Mike Blumenkrantz	0c130d64d3	lavapipe: validate per-stage descriptor limits when creating pipeline layouts this is super annoying to track down later, so just crash early if it's seen Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14656>	2022-03-11 04:26:28 +00:00
Mike Blumenkrantz	21abb01fb9	lavapipe: make device limits a physical device struct it's useful to have this info around and a bit simpler to gather info on init Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14656>	2022-03-11 04:26:28 +00:00
Mike Blumenkrantz	5ab0e3f0bb	anv: fix some dynamic rasterization discard cases in pipeline construction cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15280>	2022-03-11 04:02:02 +00:00
Mike Blumenkrantz	1e3e7b3a4d	anv: fix CmdSetColorWriteEnableEXT for maximum rts Fixes: `b15bfe92f7` ("anv: implement VK_EXT_color_write_enable") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15280>	2022-03-11 04:02:02 +00:00
Mike Blumenkrantz	52f6978484	anv: fix xfb usage with rasterizer discard in the initial implementation, a stream like: * CmdBeginTransformFeedbackEXT * CmdSetRasterizerDiscardEnableEXT * CmdDraw * CmdEndTransformFeedbackEXT * CmdBeginTransformFeedbackEXT * CmdDraw * CmdEndTransformFeedbackEXT would never enable transform feedback, as it only checked for the change in rasterizer_discard state Fixes: `4d531c67df` ("anv: support rasterizer discard dynamic state") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15269>	2022-03-11 03:37:17 +00:00
Dave Airlie	e8c3be0eb8	crocus: don't map scanout buffers as write-back This essentially ports `6440523077` Author: Keith Packard <keithp@keithp.com> Date: Fri Aug 6 16:11:18 2021 -0700 iris: Map scanout buffers WC instead of WB [v2] to crocus. Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15313>	2022-03-11 03:26:41 +00:00
Mike Blumenkrantz	42e78ba125	llvmpipe: fix occlusion queries with early depth test for genuine early depth tests, the samplecount must be updated after depth test but before samplemask is applied for inferred-early or regular depth tests, the samplemask can be applied before the depth test Fixes: `d9276ae965` ("llvmpipe: handle gl_SampleMask writing.") fixes: dEQP-VK.fragment_operations.early_fragment.sample_count_early_fragment_tests_depth_samples_4 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15319>	2022-03-11 00:45:05 +00:00
Jason Ekstrand	f7175bf416	lavapipe: Use the common vk_enqueue_CmdBindDescriptorSets Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Jason Ekstrand	ac58e93633	lavapipe: Reference count pipeline layouts Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Jason Ekstrand	48a10c5dd3	lavapipe: Allocate descriptor set layouts with DEVICE scope Because they can come and go at any time, we can't use OBJECT scope because that might confuse the client allocator. Instead, use DEVICE scope and always allocate off the device allocator. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Jason Ekstrand	94ea3b9c03	vulkan/cmd_queue: Add a common vk_cmd_enqueue_CmdBindDescriptorSets In order for this to work, the driver must reference-count pipeline layouts so we can take a reference while the command is in the queue. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Jason Ekstrand	c1070556a0	vulkan/cmd_queue: Add a driver_free_cb hook If a driver sets driver_data but not driver_free_cb, driver_data will get freed along with the command. If a driver sets driver_free_cb, driver_data will not get automatically freed but the callback will get called before the rest of the data structure is freed. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Mike Blumenkrantz	2106c3bab6	lavapipe: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15322>	2022-03-10 20:40:56 +00:00
Mike Blumenkrantz	cf5c32a4b2	lavapipe: run nir_opt_copy_prop_vars during optimization loop this enables better elimination of operations fixes: dEQP-VK.graphicsfuzz.spv-stable-mergesort-flatten-selection-dead-continues fixes #5458 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15322>	2022-03-10 20:40:56 +00:00
Mike Blumenkrantz	c94c8a7029	lavapipe: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15320>	2022-03-10 20:24:01 +00:00
Mike Blumenkrantz	6a4c7ef728	lavapipe: skip format checks for EXTENDED_USAGE we can effectively skip any kind of checks here and just assume that one of two scenarios is in effect: * the user is about to attempt some incredibly illegal behavior that VVL will catch * the user is about to attempt a pro gamer move and we'll be fine in either case, it's EXTENDED_USAGE, so hopefully we're about to make a texture view from a compatible and supported format cc: mesa-stable fixes: dEQP-VK.image.extended_usage_bit_compatibility.image_format_properties* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15320>	2022-03-10 20:24:01 +00:00
Mike Blumenkrantz	c40dc39b5a	lavapipe: use the correct value for dynamic render resolve attachment indexing subpass->color_count is (obviously) not set yet, so this would just clobber the color attachments any time resolves were used Fixes: `8a6160a354` ("lavapipe: VK_KHR_dynamic_rendering") fixes: dEQP-VK.draw.dynamic_rendering.multiple_interpolation.structured.with_sample_decoration.4_samples Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15330>	2022-03-10 19:48:59 +00:00
Dave Airlie	938488f439	lavapipe: remove broken workaround for zink depth texturing. Cc: mesa-stable Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15297>	2022-03-11 04:58:34 +10:00
Dave Airlie	30cb63bead	zink: workaround depth texture mode alpha. Since spir-v only has single channel depth sampling, it breaks with the old school GL_ALPHA depth mode swizzle, so just detect that case and smash all the channels. Cc: mesa-stable Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15297>	2022-03-11 04:58:01 +10:00
Connor Abbott	cdee38a57b	tu: Expose subgroup arithmetic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	1a78604d20	ir3: Add support for subgroup arithmetic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	a433db60c1	ir3: Track physical edges when inserting (ss) for shared regs Normally this wouldn't matter, but it will matter for the upcoming scan macro because the running tally is communicated through a shared register across a physical edge. It may also matter if a live-range split occurs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	410e746198	util/bitset: Fix off-by-one in __bitset_set_range Fixes: `b3b03e33c9` ("util/bitset: add BITSET_SET_RANGE(..)") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	72b32d83fb	ir3/spill: Mark reload destination as early-clobber Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	2ff5826f09	ir3/ra: Add IR3_REG_EARLY_CLOBBER We'll need this to model the subgroup reduction macros. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	34803d15ab	ir3/ra: Add proper support for multiple destinations We weren't considering the other destinations when allocating a destination, so we could allocate overlapping destinations. This wasn't done before because we never had a need for it, but the subgroup reduction macros will need it. The trickiest part of this is that we have to rewrite the compress_regs_left fallback, because we may have to move around the other already-allocated destinations. We now have a list of destinations to (re)allocate in addition to the popped live intervals. For the rest of the destination handling, we can just bail out if the proposed spot for something overlaps another destination, but for the fallback we have to handle all the cases gracefully. I also added support for odd combinations of multiple destinations where some of them are tied, which we'll use in the next commit to handle early-clobber destinations and which will actually be used because one of the destinations of the subgroup reduction macro will be early-clobber. The result is that the order of intervals to allocate is now a lot more complicated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	ab0ed4ff3f	ir3/ra: Sanitize parallel copy flags better For pcopies we only care about the register's type, i.e. whether its a half-register and whether it's an array (plus its size). Copying over other flags like IR3_REG_RELATIV just leads to sadness and validator assertions. Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	0135660dfc	ir3/ra: Fix ra_foreach_dst_n Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	077d07a983	ir3/ra: Fix tied destination handling with multiple destinations Before, we were careful to 1. Get the source physreg. 2. Allocate the destination. 3. Insert a copy with the source being the physreg from step 1. and this guaranteed that if the tied source were moved in step 2 we'd still insert a copy from the correct place. However this won't work with multiple destinations because an earlier destination could've already moved the tied source around. Instead flip steps 2 and 3 (we'll insert the copy before we allocate the interval, but that's ok) and run the first two steps in a separate loop before any destinations are allocated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00

1 2 3 4 5 ...

151034 commits