fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 09:28:07 +02:00

Author	SHA1	Message	Date
Karol Herbst	186a22712f	rusticl: allocate printf buffer as staging Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20161>	2023-02-22 14:20:21 +00:00
Karol Herbst	3abbeed74f	rusticl: no compute only This might be a good optimization, but we should figure first what drivers should be able to disable and what not first. Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20161>	2023-02-22 14:20:21 +00:00
Karol Herbst	ced9d5d635	rusticl/device: limit CL_DEVICE_MAX_CONSTANT_ARGS At the moment we implement constant memory as normal global memory, but we still should limit to the actual constant buffer cap once we properly use UBOs for that. Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20161>	2023-02-22 14:20:21 +00:00
Karol Herbst	bbea8761c2	rusticl/device: fix some device limits Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20161>	2023-02-22 14:20:21 +00:00
Karol Herbst	188c0d7c8f	llvmpipe/ci: increase deqp-runner timeout Some of the OpenCL tests are flaky, because they just take that long. Builtins can generated really complex code and if we are unlucky they can timeout. Proper support for functions would also solve the issue, probably, but for now increase the deqp-runner timeout so it's less of an annoyence. Signed-off-by: Karol Herbst <kherbst@redhat.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20161>	2023-02-22 14:20:21 +00:00
Samuel Pitoiset	7f2775bc8a	radv/ci: cleanup CI lists for dEQP-VK.memory.* tests that timeout These tests usually take more than 30s to complete, so exclude them completely instead. This should also make runs slightly faster. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21464>	2023-02-22 13:25:13 +00:00
Eric Engestrom	2809bb8499	broadcom/ci: re-enable egl on wayland Swap them, as suggested by @enunes, to avoid a ton of timeouts in wayland if the x11 tests ran first. Needs to be investigated, but at least like this we can get the CI coverage back. Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21453>	2023-02-22 12:59:04 +00:00
Eric Engestrom	67b2f48645	broadcom/ci: add two known failures The first one in a Pass on both the rpi and the simulator, while the second one is skipped as NotSupported on both. No idea why it's failing on the CI, but at least it's consistently failing... Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21453>	2023-02-22 12:59:04 +00:00
Eric Engestrom	50361c71f4	broadcom/ci: skip buffer_age.no_preserve and swap_buffers_with_damage on wayland They hang the runner, but they all pass fine on my rpi. Will need investigating, but for now skip them. Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21453>	2023-02-22 12:59:04 +00:00
Eric Engestrom	d01fded429	broadcom/ci: drop create_pixmap_surface from the fails; it passes now Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21453>	2023-02-22 12:59:04 +00:00
Eric Engestrom	ccc61357d5	broadcom/ci: add x11- prefix to x11 EGL tests Makes things more obvious now that we're enabling wayland as well. Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21453>	2023-02-22 12:59:04 +00:00
Erik Faye-Lund	176add76ad	zink: get rid of needless dependency We haven't needed this header since `7f56fd9655` ("zink: it's kopperin' time"), so let's get rid of it and the dependency that comes with it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21441>	2023-02-22 12:23:39 +00:00
Erik Faye-Lund	e02cdb397e	zink: prefer vulkan_core.h over vulkan.h There's just two places where we need any of the WSI specific vulkan includes, the rest of Zink should do just fine with vulkan_core.h. So let's include the win32-specific header explicitly in those two places, and reduce the need for WSI specifics inside zink itself. Kopper handles the rest of the WSI integration. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21441>	2023-02-22 12:23:39 +00:00
Konstantin Seurer	6242fe3923	anv: Use vk_acceleration_structure Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21047>	2023-02-22 11:58:57 +00:00
Konstantin Seurer	4d2a7ea146	radv: Use vk_acceleration_structure Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21047>	2023-02-22 11:58:57 +00:00
Konstantin Seurer	eb33a1adb0	vulkan: Add vk_acceleration_structure Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21047>	2023-02-22 11:58:57 +00:00
Sui Jingfeng	1ba3e83958	meson: add basic support for loongarch Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Signed-off-by: Sui Jingfeng <suijingfeng@loongson.cn> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21182>	2023-02-22 09:09:43 +00:00
Samuel Pitoiset	1583b150d6	radv: set VS_OUT_MISC_SIDE_BUS_ENA for clip distances on GFX10.3+ On GFX10.3, all auxiliary position exports are optimized, so set it for clip/cull distances. Both RadeonSI and llpc set it too. Suggested by Marek. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21439>	2023-02-22 07:17:33 +00:00
Mike Blumenkrantz	50a65e2e2b	driconf: add zink glthread disable for a game ref #8333 fixes #8328 cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21424>	2023-02-22 03:22:50 +00:00
Mike Blumenkrantz	e375fb0c5c	zink: make ZINK_DESCRIPTOR_MODE=db the default this has been getting beat up a lot lately by radv ci (and me), so it should be ready to turn on by default for even more testing in advance of the 23.1 release Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21398>	2023-02-22 02:58:19 +00:00
Mike Blumenkrantz	685bd95de9	zink: move db input attachment size check to screen init it'd be weird to crash later on after allowing db mode to init Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21398>	2023-02-22 02:58:19 +00:00
Danylo Piliaiev	e6f5480180	ir3: Add cat7 sleep instruction Has short and long variants, long seem to be ~20 times longer. The exact difference between it and a bunch of nops is unknown. The emission of this instruction were not observed in the wild. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14419>	2023-02-21 19:59:14 +00:00
Danylo Piliaiev	121e4ca87d	ir3: Add cat5/cat7 cache related instructions - tcinv - Likely Texture Cache Invalidate (unverified) - icinv - Mostly sure that it is Instruction Cache Invalidate - dccln - Data Cache Clean - dcinv - Data Cache Invalidate - dcflu - Data Cache Flush The emission of these instructions were not observed in the wild. TODO: find out the difference between .shr and .all modes of dccln, dcinv, dcflu. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14419>	2023-02-21 19:59:14 +00:00
Sviatoslav Peleshko	42dba8ebc5	driconf/anv: Apply limit_trig_input_range WA to Rise of the Tomb Raider During its Ambient Occlusion calculations the game ends up calculating sin/cos of some pretty big values, for which HW produces completely bogus results (e.g. cos(3929491.25) ~= -0.011, while correct would be ~0.923). Limit the arguments to the reasonable (-2Pi; 2Pi) range with the limit_trig_input_range WA. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8292 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21396>	2023-02-21 19:23:28 +00:00
Mike Blumenkrantz	b43d32b77d	kopper: fix loop iterating for msaa texture creation the pipe_resource template values need to always be initialized or else texture creation fails and rendering is broken fixes #8331 cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21450>	2023-02-21 18:57:52 +00:00
Charmaine Lee	1b9b060f0e	svga: use upload buffer if texture has pending changes When establishing a texture transfer map, if there is any pending changes on the texture, instead of trying direct map with DONTBLOCK first, just use the upload buffer path. Fixes piglit tests gen-teximages, arb_copy_images-formats Cc: mesa-stable Reviewed-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21393>	2023-02-21 09:48:23 -08:00
Charmaine Lee	3a359385cb	svga: fix compatible formats for shareable surfaces Add typeless format to the compatible format lists for shareable surfaces. Fixes webgl benchmark crash in eglCreateImage running from firefox on Fedora 37. Cc: mesa-stable Reviewed-by: Martin Krastev <krastevm@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21393>	2023-02-21 09:48:23 -08:00
Charmaine Lee	75b7296fc3	svga: fix resource_get_handle from resource created without SHARED bind flag When an EGLImage is created from a 2D texture and used for texture sharing, the texture surface might not have been created with the SHARED bind flag. To allow these surfaces for sharing, this patch sets the USAGE SHARED bit for surfaces that can be potentially used for sharing even when the SHARED bind flag is not originally set. Instead of unconditionally enabling the SHARED bind flag for all surfaces and unnecessarily bypass the surface cache optimization, this patch only enables the USAGE SHARED bit for surfaces that also have the RENDER TARGET bind flag. When the surface handle is inquired and if the surface is currently marked as cachable, we will need to unset the cachable bit so the surface handle will not be recycled again. This patch fixes an assertion in svga_resource_get_handle() when the EGL_MESA_image_dma_buf_export extension is used in webgl benchamrk running from firefox in Fedora 37. Cc: mesa-stable Reviewed-by: Martin Krastev <krastevm@vmware.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21393>	2023-02-21 09:48:23 -08:00
Roland Scheidegger	427ac51c10	llvmpipe: only use accurate_a0 hack if there are no textures bound This hack caused problems with some dx9 tests before (due to mipgen test using nearest filter sampling with tex coords exactly between two texels hence being extremely sensitive to arithmetic inaccuracies), and we can no longer distinguish this by using pixel_offset to not get it enabled. But to pass other tests we don't really need the hack when there's texture sampling involved anyway. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21407>	2023-02-21 17:15:34 +00:00
Eric Engestrom	9390766b83	broadcom/ci: use weston's xwayland instead of starting X as well Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21449>	2023-02-21 16:50:54 +00:00
Eric Engestrom	5425064c53	broadcom/ci: group x11 and wayland variant of the same test failing Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21449>	2023-02-21 16:50:54 +00:00
Konstantin Seurer	af19762935	radv/rt: Skip instances after loading the entire node This avoids waiting for instance_data which can improve performance: vk_ray_tracing_ao_KHR_app: 0.2% (The TLAS has 2 instances) Quake II RTX: 1% Control: 1% We also have to shuffle around some code to avoid increasing VGPR usage. That leaves us with the following stats: Quake II RTX: Totals from 7 (14.29% of 49) affected shaders: CodeSize: 165612 -> 165716 (+0.06%) Instrs: 31446 -> 31460 (+0.04%) Latency: 596709 -> 554292 (-7.11%) InvThroughput: 121998 -> 113327 (-7.11%) VClause: 596 -> 587 (-1.51%) Copies: 4664 -> 4646 (-0.39%) PreVGPRs: 620 -> 639 (+3.06%) Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21421>	2023-02-21 15:51:14 +00:00
Rhys Perry	ab3184c0a2	aco: don't apply modifiers through DPP to unsupported instructions Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21201>	2023-02-21 14:59:38 +00:00
Georg Lehmann	3bd5b583f9	aco: combine a ^ ~b and ~(a ^ b) to v_xnor_b32 Foz-DB Navi21: Totals from 13 (0.01% of 134913) affected shaders: CodeSize: 225432 -> 225180 (-0.11%) Instrs: 41973 -> 41908 (-0.15%) Latency: 297464 -> 297326 (-0.05%) InvThroughput: 82536 -> 82467 (-0.08%) Copies: 2452 -> 2440 (-0.49%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21410>	2023-02-21 13:35:31 +00:00
Illia Abernikhin	dea36fce6a	util: Extend vk_enum_to_str with bitmasks vk_enum_to_str only generates literals for enums with type: @type="enum", but many enums have type: @type="bitmask" and were not taken into account here. Main changes: Empty enums are now always skipped For bitmasks skipped *MAX_ENUM value Signed-off-by: Illia Abernikhin <illia.abernikhin@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8173 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21146>	2023-02-21 13:59:19 +02:00
Marcin Ślusarz	e29a964d02	intel/compiler/mesh: follow the type of offset variable This allows copy propagation to kick in, decreasing the overall number of generated instructions. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21098>	2023-02-21 11:10:24 +00:00
Marcin Ślusarz	15afb8dcc6	intel/compiler/mesh: apply URB payload mask once per program Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21098>	2023-02-21 11:10:23 +00:00
Daniel Schürmann	93a47bab04	nir: simplify nir_block_cf_tree_{next\|prev} Removes some case distinction by first checking if this is the first/last block of a cf_node. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	2e394b5cc1	nir/lower_continue_targets: only repair SSA when necessary Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	7fba5abfd7	nir/lower_continue_constructs: special-case Continue Constructs with zero or one predecessors If a loop has only a single continue, the control flow is already converged and we can inline the continue construct. If a loop has no continue statement at all, the Continue Construct is unreachable and can simply be deleted. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	f42eb71df9	spirv: use Loop Continue Construct to emit SPIR-V loops and lower after parsing Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	c20751d61d	nir: add lowering for Loop Continue Constructs This pass lowers Loop Continue Constructs to the previous solution by inserting it at the beginning of the loop: loop { if (i != 0) { continue construct } loop body } Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	312510448f	nir: create nir_push_continue() and related helpers nir_control_flow.h: void nir_loop_add_continue_construct(nir_loop loop); void nir_loop_remove_continue_construct(nir_loop loop); Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Daniel Schürmann	d4b97bf3fa	nir: add Continue Construct to nir_loop The added continue_list corresponds to the SPIR-V Continue Construct and serves as a converged control-flow construct and is executed after each continue statement and before the next iteration of the loop body. Also adds validation rules for loops with Continue Construct Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Caio Oliveira	e0c6ad1ce5	glsl: Account for unsized arrays in NIR linker Follow the same approach as the pre-NIR linker. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5891 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21401>	2023-02-21 09:08:18 +00:00
Gert Wollny	10f8240caf	zink/kopper: Add extra swapchain images for Venus Together with the previous patch that corrects the number of swapchain images on Xwayland this gives Zink/Venus a spead boost in a number of work loads and close the gap or even surpass VirGL when the benchmark is not GPU bound. Some numbers: zink (Virtio-GPU Venus (Host: RADV RENOIR)) / VirGL Benchmark VirGL baseline Zink/Venus +1 and Xwayland +1 ================================================================== OpenArena (FPS) 63.8 60.1 148.5 Unigine Sancuary (FPS) 129.1 121.4 164.7 Unigine Tropics (FPS) 107.2 85.7 114.3 Unigine Heaven (FPS) 48.5 48.0 51.5 Unigine Valley (FPS) 48.0 45.6 47.4 Xonotic (FPS) 90.5 59.4 89.2 GpuTest/Volcano (Points) 2960 2966 3013 zink (Virtio-GPU Venus (Host: Intel Xe TGL GT2)) / VirGL Benchmark VirGL baseline Zink/Venus +1 and Xwayland +1 =========================================================== OpenArena (FPS) 95.1 59.8 78.9 Unigine Sancuary (FPS) 85.5 76.6 81.8 Unigine Tropics (FPS) 66.0 59.8 62.7 Unigine Heaven (FPS) 28.8 28.7 28.0 Unigine Valley (FPS) 29.0 28.0 27.0 Xonotic (FPS) 64.2 49.4 51.1 GpuTest/Volcano (Points) 2855 2718 2747 v2: Fix limiting minImageCount (Mike) Signed-off-by: Gert Wollny <gert.wollny@collabora.co.uk> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21136>	2023-02-21 08:43:30 +00:00
Gert Wollny	04d654a5d0	vulkan/wsi: Take Xwayland into account for x11_min_image_count For wayland we report a minimum of four swapchain images, so for Xwayland we should report the same. v2: Fix typo (Eric) v3: Make that four images on Xwayland (Daniel) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21136>	2023-02-21 08:43:30 +00:00
Alyssa Rosenzweig	23ec0d6bb2	asahi: Make shader-db work again We need a nontrivial blend state otherwise the whole frag shader is optimized out. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21432>	2023-02-21 08:24:58 +00:00
Alyssa Rosenzweig	029c686c6d	asahi: Implement color masks with masked stores Blend states can require masking colour. Currently, this is handled by nir_lower_blend, which lowers masks to a read-modify-write operation as required on Mali hardware. However, our "tilebuffer store" instruction supports a write mask, allowing us to write only a subset of channels to the tilebuffer. It's more efficient to use that than to emit pointless tilebuffer loads. Note that even without tilebuffer loads, non-opaque masks don't work with opaque pass types. Here, we handle this with a translucent pass type, which gets HSR to do the right thing and is consistent with the pass type used previously. However, it's a bit heavy handed -- Apple manages to use an opaque pass type with masking but with some unknown HSR fields twiddled. IMO reverse-engineering those details shouldn't block this because this gets us closer to optimal (just not all the way there) and is strictly better than what we had before. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21431>	2023-02-21 08:10:15 +00:00

1 2 3 4 5 ...

167042 commits