fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 09:38:05 +02:00

Author	SHA1	Message	Date
Connor Abbott	2e2e6865b4	tu, freedreno/a6xx: Fix setting PC_XS_OUT_CNTL::PRIMITVE_ID This is supposed to be set when that stage needs the PrimID sysval preloaded, except for the VS which doesn't have this bit and instead infers it from the HS or GS bit (depending on whether tess/GS is enabled). Therefore for HS, GS, and DS we should set it whenever the corresponding sysval is there. This includes adding a missing PC_HS_OUT_CNTL, which I confirmed is set when the HS reads PrimID from the VS. Note that the DS sysval is currently always enabled whenever there's a GS, if we were to fix that then we should also change the logic here. This doesn't fix anything that I know of, but aligns us more with what the blob does. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12166>	2021-08-05 16:35:41 +00:00
Connor Abbott	8115cde3ba	tu, freedreno/a6xx, ir3: Rewrite tess PrimID handling The previous handling conflated RelPatchID and PrimID, which would result in incorrect gl_PrimitiveID when doing draw splitting and didn't work with PrimID passthrough which fills the VPC slot with the "correct" PrimID value from the tess factor BO which we left 0. Replace PrimID in the tess lowering pass with a new RelPatchID sysval, and relace PrimID with RelPatchID in the VS input code in turnip/freedreno at the same time so that there is no net change in the tess lowering code. However, now we have to add new mechanisms for getting the user-level PrimID: - In the TCS it comes from the VS, just like gl_PrimitiveIDIn in the GS. This means we have to add another register to our VS->TCS ABI. I decided to put PrimID in r0.z, after the TCS header and RelPatchID, because it might not be read in the TCS. - If any stage after the TCS uses PrimID, the TCS stores it in the first dword of the tess factor BO, and it is read by the fixed-function tessellator and accessed in the TES via the newly-uncovered DSPRIMID field. If we have tess and GS, the TES passes this value through to the GS in the same way as the VS does. PrimID passthrough for reading it in the FS when there's tess but no GS also "just works" once we start storing it in the TCS. In particular this fixes dEQP-VK.pipeline.misc.primitive_id_from_tess which tests exactly that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12166>	2021-08-05 16:35:41 +00:00
Connor Abbott	cd687c4e3b	freedreno: Rename and document tess primid-related sysvals DSPATCHID and HSPATCHID, which we mapped gl_PrimitiveID to, are actually relative to the current subdraw. Subdraws aren't supported yet by turnip but they are by freedreno for indirect draws. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12166>	2021-08-05 16:35:40 +00:00
Danylo Piliaiev	97b0981ed9	tu: disable gmem in primary cmdbuffer if secondary has it disabled If secondary command buffer is emitted within a subpass it may have barriers which forces us to disable gmem for current renderpass. Fixes: `20547a110e` "tu: delay decision of forcing sysmem due to subpass self-dependencies" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12219>	2021-08-05 18:51:51 +03:00
Antonio Caggiano	50e849fb94	freedreno/ci: Add a manual job for tracking performance Use Piglit's replay profile to measure and store the time that frames take to render in the GPU. This job won't run automatically in regular pipelines, but will be triggered automatically by a script for every successful pre-merge pipeline. This is because we want to generate performance data for every relevant commit merged in main, but we don't want to keep a device busy during the pre-merge run. Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12185>	2021-08-05 07:55:32 +00:00
Tomeu Vizoso	19670129c5	ci: Update canvas_text trace The previous one had all rendering and setup in a single frame, so repeatedly replaying it for performance tracking was reaching OOM due to the repeated creation of resources that weren't being released. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12195>	2021-08-05 05:44:57 +00:00
Emma Anholt	8cfa6b4d75	ci/freedreno: Mark border_color as passing on a5xx. They're marked as flaky, but they're all passing on my board, and maybe in CI too. Let's see if CI still sees them as flaky once the expectations are right. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12192>	2021-08-04 17:21:43 +00:00
Emma Anholt	1f66fcf5a6	freedreno/a5xx: Don't try to emit FS images in binning command streams. We don't have a compiled FS in that case, so the shader deref fails. We don't need the state, anyway. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12192>	2021-08-04 17:21:43 +00:00
Emma Anholt	c20287f8e4	freedreno/a5xx: Disable TF when pausing or transitioning to non-TF. This is not as clean as on a6xx where we just re-emit the SO state, but it cleans up our dEQP results a ton. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12192>	2021-08-04 17:21:43 +00:00
Timothy Arceri	a9ed4538ab	nir: add indirect loop unrolling to compiler options This is where it should be rather than having to pass it into the optimisation pass every time. It also allows us to call the loop analysis pass without having to duplicate these options which we will do later in this series. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12064>	2021-08-03 10:54:50 +00:00
Emma Anholt	c2a6143755	turnip: Fix assertions on checking mutable combined samplers support. We would determine that it was unsupported, then ask for the size and triggered the assertion checking that we never ask for the size of a combined sampler. Fixes: `ee3495e465` ("turnip: Add support for VK_VALVE_mutable_descriptor_type") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12148>	2021-08-02 17:41:54 +00:00
Danylo Piliaiev	ea7a42775b	turnip: reduce maxComputeWorkGroupSize Blob advertises { 1024, 1024, 64 }, but from tests they all could be 1024. Fixes tests: dEQP-VK.compute.basic.max_local_size_x dEQP-VK.compute.basic.max_local_size_y dEQP-VK.compute.basic.max_local_size_z Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9409>	2021-08-02 20:19:18 +03:00
Connor Abbott	b157a5d0d6	tu: Implement non-aligned multisample GMEM STORE_OP_STORE We have to a bit careful here when disabling draw states. This also necessitates moving the actual recording of the stores to the end so that we set the dirty flag correctly. Closes: #4462 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12102>	2021-08-02 11:20:25 +00:00
Connor Abbott	7948c4b0b4	tu: Make tile stores use a dedicated CS We were trying to calculate how much space they need, That was already difficult and one of the most opaque and hard-to-verify uses of sub_cs, but it will become even more difficult with the 3D path. What's worse is that sometimes we have to touch that path when we start touching registers that would affect rasterization, and there's no indication that you have to then recalculate the size etc. Just rip this out and start keeping a separate CS for it instead. Note that this adds a small amount of memory wastage and extra buffers (at worst one buffer per command buffer). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12102>	2021-08-02 11:20:25 +00:00
Juan A. Suarez Romero	dfccbdff98	ci: update to VK-GL-CTS 1.2.7.0 v2: - Read mustpass files from vk-default.txt (Matt) - Remove freedreno atomic geom tests from fail list (Emma) - Move freedreno flake to separated line (Emma) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12069>	2021-07-30 20:02:13 +00:00
Tomeu Vizoso	f376597d00	freedreno/ci: Move rules for restricted jobs to test-source-dep.yml GitLab doesn't merge the rules array from a job that is extended, so we were missing the changes rules. To avoid this, create a .freedreno-rules-restricted job that includes the changes rules and the restricted user checks. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Acked-by: Michel Dänzer <mdaenzer@redhat.com> Fixes: `92f9141f00` ("ci/freedreno: Test with non-redistributable traces") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5139 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12122>	2021-07-30 08:42:57 +02:00
Connor Abbott	d9a4a0aebd	tu: Handle multisample vkCmdCopyColorImage() There was a bit of code already to select the 3d path, but we actually need another shader variant for it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12080>	2021-07-29 23:54:29 +00:00
Connor Abbott	9b0a4cc893	ir3: Delete old packed struct encoding turnip clear/blit shaders were the last user of these. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12079>	2021-07-29 23:20:18 +00:00
Connor Abbott	fc0c0e9d45	tu: Use NIR for clear/blit shaders This is much more maintainable, extensible, and easy to read than hand-rolled structs approximating assembly. This also removes the last use of the old hand-written packing structs. There are a few minor differences: - The shaders are larger because ir3 currently doesn't support (rpt), which means that some shaders are larger than one instrlen and the current logic has to be extended to allow for that. This seems a small price to pay, ir3 will gain support for (rpt) eventually, and we shouldn't have limitations like this baked in anyway. For example some GL blob r8g8 <-> r16 copy shaders are apparently quite large. - Due to the inability to switch inputs/outputs on the fly, we need to split the VS into two variants. I made the layer-writing variant also used for other clears, because the old method of overloading c0.z/c1.z to mean both "src x coordinate" and "z clear value" in the same shader seemed too clever and I didn't want to add yet another variant. This means that non-layered clears will also write the layer (to 0), but that shouldn't be a big deal performance-wise. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12079>	2021-07-29 23:20:18 +00:00
Alejandro Piñeiro	476dc3c050	vulkan: add vk_spec_info_to_nir_spirv util method All vulkan drivers have been copying anv's code to convert VkSpecializationInfo into nir_spirv_specialization. Recently there was a Vulkan spec change on allowed values for VkSpecializationInfo, and all drivers got affected. This commits creates a new helper, and uses it on all Vulkan Mesa drivers. v2: use (uint8_t) castings, instead of void, to avoid C2036 with MSVC (detected by the CI, inspired on what radv was doing) Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12047>	2021-07-29 03:28:52 +00:00
Dylan Baker	097cf3952b	freedreno/ir3: Add build id to the disassembler test This is required (at least for me on x86) to get the tool to pass it's own test, otherwise it fails the build_id assertion. Fixes: `1462b00391` ("freedreno/ir3: Add a unit test for our disassembler.") Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12084>	2021-07-28 21:26:45 +00:00
Connor Abbott	7a14484bca	ir3: Preserve gl_ViewportIndex in the binning shader Fixes dEQP-VK.draw.shader_viewport_index.* with TU_DEBUG=forcebin. Fixes: `efff734220` ("turnip: multiViewport and VK_EXT_shader_viewport_index_layer") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12104>	2021-07-28 16:49:46 +00:00
Danylo Piliaiev	20547a110e	tu: delay decision of forcing sysmem due to subpass self-dependencies DXVK always inserts vertex stage subpass self-dependency for every subpass regardless of whether there actually would be a barrier. This effectively disabled gmem rendering with DXVK. Thus we delay the decision to disable gmem rendering until we see a barrier with vertex stages. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12038>	2021-07-28 15:35:44 +00:00
Antonio Caggiano	92f9141f00	ci/freedreno: Test with non-redistributable traces Add a job to test with traces that we cannot redistribute, listed in a separate file. Since those traces might not be accessible by everyone, this job is created only when the pipeline is triggered by `marge-bot`. This job is optional because otherwise it could be blocking a merge request of someone who cannot really debug the issue due to lack of access to these traces. The documentation available under `docs/ci` goes into more details explaining the rationale behind optional traces. Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6441>	2021-07-28 13:49:11 +00:00
Connor Abbott	d903f51631	tu: Triage some CTS failures These should be fixed by the linked VK-GL-CTS change. I couldn't reproduce the other renderpass failure on a650 though. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12065>	2021-07-27 13:55:46 +00:00
Eduardo Lima Mitev	ee3495e465	turnip: Add support for VK_VALVE_mutable_descriptor_type v1. Hyunjun Ko <zzoon@igalia.com> - Add to hanlde VK_DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT_VALVE - Don't support VK_DESCRIPTOR_TYPE_INPUT_ATTACHMENT and VK_DESCRIPTOR_TYPE_COMBINED_IMAGE_SAMPLER v2. Hyunjun Ko <zzoon@igalia.com> - Fix some indentations and nitpicks. - Add the extension to features.txt v3. Hyunjun Ko <zzoon@igalia.com> - Remove unnecessary asserts. Signed-off-by: Eduardo Lima Mitev <elima@igalia.com> Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9322>	2021-07-27 04:53:02 +00:00
Danylo Piliaiev	b45cddda18	tu: handle half-reg fs outputs This would allow to enable translation of RelaxedPrecision spirv variable decorator into mediump which for us means fp16. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12067>	2021-07-26 16:25:14 +00:00
Emma Anholt	b1dbbc3dc1	freedreno: Lock access to msm_pipe for RB object suballocation. Hopefully this fixes the flakes that have happened since the suballoc changes. Fixes: `737d4caa83` ("freedreno: Suballocate our long-lived ring objects.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12002>	2021-07-22 21:45:31 +00:00
Jason Ekstrand	17f7b4b83e	turnip: Replace tu_lower_image_size with nir_lower_image Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12005>	2021-07-22 14:22:35 -05:00
Rob Clark	215adc2f18	freedreno/registers: update dsi registers to support tpg Based-on: https://patchwork.freedesktop.org/patch/445672/?series=92870&rev=1 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12021>	2021-07-22 08:25:25 -07:00
Emma Anholt	bda26dfcfc	freedreno/ir3: Reduce choose_instr_dec() and _inc() overhead. If you didn't have a freed+ready instruction, you'd redo the live_effect and check_instr() logic multiple times per instr. Replace the multiple loops in each function with a ranking that I think is more readable, reducing the overhead in the process. debugoptimized dEQP-GLES31.functional.ubo.random.all_per_block_buffers.20 runtime goes from ~3.5s -> ~3.0s on my lazor. No shader-db change. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11855>	2021-07-19 23:16:54 +00:00
Danylo Piliaiev	53d4485a02	freedreno: fix wrong tile aligment for 3 CCU gpu Fixes: `78c8a8af80` "freedreno: Generate device-info tables at build time" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5060 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11928>	2021-07-16 15:02:27 +00:00
Rob Clark	86f09b14df	freedreno+turnip: Add a6xx gen4 support This adds support for a660 and a635. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	7c7722304b	freedreno+turnip: Get device name from device-info table Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	a4559c9550	freedreno+turnip: Add has_8bpp_ubwc Newer a6xx devices seem to drop 8b/pixel UBWC support. The turnip part was adapted from Jonathans patch on !10892 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	e552784e68	freedreno+turnip: Add has_cp_reg_write Newer a6xx devices drop this packet from the sqe firmware, and use direct (pkt4) register writes instead for the few cases that previously used CP_REG_WRITE. The turnip part was adapted from Jonathans patch on !10892 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	f74d0bf05e	turnip: Get has_sample_locations from fd_dev_info Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	3f1c4a86bb	turnip: Get has_tex_filter_cubic from fd_dev_info Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	64af60cfb3	turnip: Get indirect_draw_wfm_quirk from fd_dev_info At some point we might want to change this to minimum fw version, but for now it can be a bool. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	06000f42ed	turnip: Get storage_16bit from fd_dev_info Removing more gpu_id checks that will become bogus as we add more a6xx. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	948d87dab8	turnip: Drop unused vshs_workgroup param Unused since `d968995c67`, and this gets rid of one more gpu_id check. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	f4cfb5a61e	freedreno/ir3: Get reg_size_vec4 from fd_dev_info Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	4335f47f41	freedreno/ir3: Get tess_use_shared from fd_dev_info A step towards getting rid of checks for gpu_id sprinkled around. Checking major generation is ok, but checking for == or >= a specific gpu_id is going to start getting messy as we add more a6xx. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	c9bcd835fa	turnip: Convert fd_dev_info to const pointer Split out from earlier patch to reduce churn. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	78c8a8af80	freedreno: Generate device-info tables at build time This way we can make the tables const. At the same time, for a6xx, this introduces a "sub-generation template" to reduce the copy/paste for parameters which are keyed to the sub-generation. It also explicitly lists every supported GPU, to get rid of duplicate lists of supported gpus between the device-info and drivers. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Rob Clark	0eda0188aa	freedreno: Rename _dev_info Everywhere else symbols/types/etc are shortend to "fd_", so lets do the same here for consistency. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Jonathan Marek	1a6dd7f9b1	freedreno/common: unhardcode CCU color cache offset Replace it with a calculation which works for all current GPUs. Duplicated the calculation in both drivers because freedreno_dev_info isn't meant for derived parameters (and drivers might want to just calculate on the fly instead). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Jonathan Marek	d34b18a6ce	tu: remove workaround for conditional rendering + hw binning - It hurts users with newer firmware who don't need the workaround - Kernel now rejects older firmware due to security issues, so this will prevent users from using older firmware anyway. - Only whitelisting 650 enables the workaround by default for any new GPUs Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11790>	2021-07-14 01:58:00 +00:00
Emma Anholt	10d8e123c5	freedreno: Optimize duplicate obj-obj ring relocs. No need to include the same BO multiple times in the long-lived ringbuffer object's list of relocs to be added to the submit. Improves non-TC drawoverhead -test 9 (8 tex updates) throughput by 1.4901% +/- 0.8705% (n=20) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11697>	2021-07-13 22:12:56 +00:00
Emma Anholt	737d4caa83	freedreno: Suballocate our long-lived ring objects. On drawoverhead -test 9 (8 texture changes), this saves us 172kb of memory. That's only ~1% of the GEM memory while the test is running, but more importantly it saves us 29% of the gem BO allocations. non-TC drawoverhead -test 9 (8 texture change) throughput 0.449019% +/- 0.336296% (n=100), but this gets better as we get better suballocation density. Note that this means that all fd_ringbuffer_new_object calls can now return data aligned to 64 bytes, instead of 4k. We may find that we need to increase it if some of our objects (tex consts, sampler consts, etc.) require more alignment than that. But, this may help non-drawoverhead perf if any of our RB objects have a cache in front of them (indirect consts?) and we don't have most of our data in the same cache set any more. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11697>	2021-07-13 22:12:56 +00:00

1 2 3 4 5 ...

2453 commits