fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-28 01:18:15 +02:00

Author	SHA1	Message	Date
Rob Clark	d2f4d332db	freedreno/ir3: new pre-RA scheduler This replaces the depth-first search scheduler with a more traditional ready-list scheduler. It primarily tries to reduce register pressure (number of live values), with the exception of trying to schedule kills as early as possible. (Earlier iterations of this scheduler had a tendency to push kills later, and in particular moving texture fetches which may not be necessary ahead of kills.) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4440>	2020-04-13 20:47:28 +00:00
Rob Clark	0f22f85fe7	freedreno/ir3: fix location of inserted mov's If the group pass must insert a mov to resolve conflicts, avoid the mov appearing after the meta:collect whose src it is. The current pre-RA scheduler doesn't really care about the initial instruction order, but the new one will in some cases. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4440>	2020-04-13 20:47:28 +00:00
Rob Clark	908044ef4b	freedreno/ir3: simplify grouping pass Since `bdf6b7018c` the logic only needs to handle grouping collect srcs, So remove the now unnecessary indirection. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4440>	2020-04-13 20:47:28 +00:00
Rob Clark	860f5981f0	freedreno/ir3: make falsedep use's optional Add option when collecting uses to control whether they include falsedeps or not. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4440>	2020-04-13 20:47:28 +00:00
Rob Clark	d09e3afdcc	freedreno/ir3: spiff out disasm a bit for verbose mode, print also the instruction "cycle" (which takes into account (rptN) and (nopN)) in addition to instruction offset. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4440>	2020-04-13 20:47:28 +00:00
Jonathan Marek	40ccbae622	freedreno/computerator: support bindless sampler instructions Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4526>	2020-04-13 20:15:48 +00:00
Jonathan Marek	bc9a28beed	freedreno/computerator: support nop prefix Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4526>	2020-04-13 20:15:48 +00:00
Eric Anholt	95d4a956c0	freedreno/ir3: CSE the up/downconversion of SEL's cond's size. Not many programs hit this, but if you were, say, selecting between vec4s, you'd convert the cond 4 times. instructions in affected programs: 2957 -> 2717 (-8.12%) nops in affected programs: 989 -> 899 (-9.10%) non-nops in affected programs: 1968 -> 1818 (-7.62%) dwords in affected programs: 3232 -> 2752 (-14.85%) last-baryf in affected programs: 102 -> 90 (-11.76%) full in affected programs: 5 -> 4 (-20.00%) sstall in affected programs: 329 -> 329 (0.00%) (ss) in affected programs: 86 -> 105 (22.09%) (sy) in affected programs: 14 -> 12 (-14.29%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4516>	2020-04-13 19:24:52 +00:00
Eric Anholt	82375ccaa4	freedreno/ir3: Stop doing b2n on the SEL condition. SEL_B32 (and presumably B16) checks for 0 or nonzero in the condition (tested by just stuffing a uniform's value into it), so there's no need to do ir3_b2n() on it, or any preceding ir3_n2b(). instructions in affected programs: 664444 -> 659927 (-0.68%) nops in affected programs: 267898 -> 266312 (-0.59%) non-nops in affected programs: 420260 -> 417329 (-0.70%) dwords in affected programs: 144032 -> 137568 (-4.49%) last-baryf in affected programs: 10801 -> 10321 (-4.44%) full in affected programs: 2003 -> 2002 (-0.05%) sstall in affected programs: 76670 -> 77405 (0.96%) (ss) in affected programs: 4515 -> 4525 (0.22%) (sy) in affected programs: 612 -> 604 (-1.31%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4516>	2020-04-13 19:24:52 +00:00
Eric Anholt	904d5d63b4	freedreno: Fix leak of binning shader variants. The v->binning variant is never added to shader->variants, so just free each one as we free the nonbinning variant. Noticed from drm-shim mode running out of open fds, since each bo ends up with an fd. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4502>	2020-04-10 18:42:20 +00:00
Kristian H. Kristensen	5ec1f264f1	freedreno/ir3: Fix sz vs class confusion Add bounds checking to make sure we don't silently access out of bounds again. Fixes: `90f7d12236` ("freedreno/ir3/ra: pick higher numbered scalars in first pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4503>	2020-04-10 10:24:14 -07:00
Connor Abbott	089e1fb287	tu: Implement descriptor set update templates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	e1595026f6	tu: Add missing code for immutable samplers Actually fill out the samplers, based on the radv implementation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	a07b55443b	tu: Emit CP_LOAD_STATE6 for descriptors This restores the pre-loading of descriptor state, using the new SS6_BINDLESS method that allows us to pre-load bindless resources. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	d37843fee1	tu: Switch to the bindless descriptor model Under the bindless model, there are 5 "base" registers programmed with a 64-bit address, and sam/ldib/ldc and so on each specify a base register and an offset, in units of 16 dwords. The base registers correspond to descriptor sets in Vulkan. We allocate a buffer at descriptor set creation time, hopefully outside the main rendering loop, and then switching descriptor sets is just a matter of programming the base registers differently. Note, however, that some kinds of descriptors need to be patched at command recording time, in particular dynamic UBO's and SSBO's, which need to be patched at CmdBindDescriptorSets time, and input attachments which need to be patched at draw time based on the the pipeline that's bound. We reserve the fifth base register (which seems to be unused by the blob driver) for these, creating a descriptor set on-the-fly and combining all the dynamic descriptors from all the different descriptor sets. This way, we never have to copy the rest of the descriptor set at draw time like the blob seems to do. I mostly chose to do this because the infrastructure was already there in the form of dynamic_descriptors, and other drivers (at least radv) don't cheat either when implementing this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	fc850080ee	ir3: Rewrite UBO push analysis to support bindless Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	274f3815a5	ir3: Plumb through bindless support Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	7d0bc13fca	ir3: LDC also has a destination Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	1842961e58	ir3: Also don't propagate immediate offset with LDC Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	de7d90ef53	ir3: Plumb through support for a1.x This will need to be used in some cases for the upcoming bindless support, plus ldc.k instructions which push data from a UBO to const registers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	c8b0f90439	ir3: Add bindless instruction encoding Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	122a900d7d	freedreno/a6xx: Add registers for the bindless model In Vulkan, descriptors for samplers, SSBO's, etc. are collected into descriptor sets, and shaders can use multiple descriptor sets. At command-recording time, users can swap out only some of the descriptor sets, and the driver is supposed to do the minimum amount necessary to update any internal binding tables, knowing that only some of the descriptors have changed. With the old binding model, focused on GL, where there are separate tables for each type of resource, we can do somewhat better than now by preserving descriptors from lower descriptor sets when switching higher descriptor sets. However we still have to copy around descriptors before each draw. At least for a6xx, qualcomm went further, essentially copying the Vulkan binding model as an alternate way to load resources. There's an array of registers (actually an array for compute and one for everything else), where each register holds a pointer to a descriptor set that can contain various different descriptor types. The descriptors are padded out to 16 dwords, so that every instruction can use an index instead of a dword offset. It's called "bindless", I think, because it can also be used to implement the old GL bindless extensions (presumably it allows more samplers and textures than the old model). This commit adds the register and cmdstream parts. Next up will be the instruction encoding. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	e088d82aa6	freedreno/a6xx: Add UBO size field Verified with the vulkan blob, which uses ldc and UBO descriptors, and turnip will too soon. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	d3b7681df2	tu: ir3: Emit push constants directly Carve out some space at the beginning for push constants, and push them directly, rather than remapping them to a UBO and then relying on the UBO pushing code. Remapping to a UBO is easy now, where there's a single table of UBO's, but with the bindless model it'll be a lot harder. I haven't removed all the code to move the remaining UBO's over by 1, though, because it's going to all get rewritten with bindless anyways. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Connor Abbott	63c2e8137d	tu: Dump out shader assembly when requested We don't use the ir3 variant machinery, so we have to do this ourselves. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4358>	2020-04-09 15:56:55 +00:00
Jonathan Marek	2e084c2cb3	turnip: new clear/blit implementation with shader path fallback The shader path is used to implement the following cases: * stencil aspect mask on D24S8 (for image_to_buffer,buffer_to_image) * clear/copy msaa destination (2D engine can't have msaa dest) Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	de6967488a	turnip: add vk_format_is_snorm/is_float Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	51fe52d2fd	turnip: rework format helpers * Take tile_mode as input directly * tu6_format_gmem to tu6_base_format, use may not be limited to GMEM * Add new helpers that will return the correct tile_mode as for image level as part of the format. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	009082dcff	turnip: use dirty bits for dynamic viewport/scissor state CmdClearAttachments shader path will overwrite this state, so it needs to be re-emitted with dirty bits in that case. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	ed83281f0c	turnip: save attachment samples in renderpass state This is needed to be able to know the number of samples during CmdClearAttachments which can be used while the framebuffer is unknown. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	0637eab678	turnip: disable 8x msaa Not everything supports 8x msaa, and the blob doesn't support it at all. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	f03e63cd99	turnip: fix nir validate failure from push constant lowering Fixes newly added checks in nir validate failing. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	86d1a4c907	turnip: split up gmem/tile alignment Note: the x1/y1 align in tu6_emit_blit_scissor was broken Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	f494799a7f	turnip: RB_CCU_CNTL fixes * Correct bypass value for a618 * Bypass value for blitter * Don't set RB_CCU_CNTL again unnecessarily in tu6_emit_binning_pass Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	e4c05a5335	freedreno/registers: add RB_CCU_CNTL bitfields Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3783>	2020-04-09 14:43:02 +00:00
Jonathan Marek	420ca1e4a1	turnip: use buffer size instead of bo size for VFD_FETCH_SIZE Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4224>	2020-04-09 02:05:52 +00:00
Jonathan Marek	e62f8ae15a	turnip: improve vertex input handling Emit vertexBindingDescriptionCount bindings, instead of one per attribute. Verified with dEQP-VK.pipeline.vertex_input.* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4224>	2020-04-09 02:05:52 +00:00
Jonathan Marek	d6a8591f72	turnip: fix compute shaders crashing after geometry shader change Fixes: `1af71bee73` ("turnip: Set has_gs in ir3_shader_key") Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4483>	2020-04-08 01:56:53 +00:00
Kristian H. Kristensen	4399cacaf0	turnip: Drop dep_llvm from dependencies Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4478> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4478>	2020-04-07 18:44:21 +00:00
Kristian H. Kristensen	5789505ab3	turnip: Make Android platform build We still don't have a way to keep this from breaking, but I don't think this ever built. Let's call it progress. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4478>	2020-04-07 18:44:21 +00:00
Kristian H. Kristensen	97578c69e8	turnip: Stub out VK_KHR_external_{fence,semaphore}_fd Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4478>	2020-04-07 18:44:21 +00:00
Kristian H. Kristensen	e99f6f2ea1	turnip: Add missing VKAPI_ATTR annotations Make sure the types match. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4478>	2020-04-07 18:44:21 +00:00
Eric Anholt	1618159772	freedreno/a6xx: Set a level's pitch based on minified level0 pitch, not width0. Found from piglit fbo-generatemipmaps failures, then tracked down with the texturator test. The piece that really revealed things was finding that 1024x1 linear RGBA8 on the older blob drivers would have a pitch of 5120 instead of 4096, and the following levels minified that pitch. Fixes ~124 piglit tests (~8.5% of piglit failures) on cheza. Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3987> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3987>	2020-04-07 18:02:56 +00:00
Eric Anholt	4b881d5270	freedreno: Add the outline of a test for a6xx texture layout. Trying to work out texture layout by remembering what things looked like in texturator is hard. Instead, let's use texture layouts from tracing the blob as a source of truth to make sure that we pick the same layouts they do (and don't break known-good ones). More testcases will be added as I fix layout bugs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3987>	2020-04-07 18:02:56 +00:00
Eric Anholt	9c6bfe8733	freedreno/a6xx: Drop the "alignment" layout temporary. It's just 1 for !3d, which means that the align we're doing in that case is pointless. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3987>	2020-04-07 18:02:56 +00:00
Eric Anholt	59a2220398	freedreno/a6xx: Remove the "aligned_height" temporary. Now that we're not incrementally minifying height, we can just modify it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3987>	2020-04-07 18:02:56 +00:00
Eric Anholt	cdff81fa9a	freedreno/a6xx: Sink the per-level size temps inside the loop. u_minify(n, 1) is no cheaper than u_minify(n, level), and this makes the logic a lot simpler to follow. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3987>	2020-04-07 18:02:56 +00:00
Jonathan Marek	a1727598a0	turnip: implement timestamp query Passes tests in: dEQP-VK.pipeline.timestamp.* Signed-off-by: Jonathan Marek <jonathan@marek.ca> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4027> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4027>	2020-04-07 14:58:47 +00:00
Brian Ho	d64a7d6e69	turnip: Enable geometryShader device feature Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4436> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4436>	2020-04-07 14:13:21 +00:00
Brian Ho	bdf6b481d8	turnip: Enable geometry shaders for CP_DRAWs Enable geometry shading on draw if the pipeline has a geometry stage. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4436>	2020-04-07 14:13:20 +00:00

1 2 3 4 5 ...

949 commits