mesa/src
Eric Anholt 80923e8d58 util/format: Add some NEON intrinsics-based u_format_unpack.
In looking at the profile of dEQP, GLES3 was spending 5-10% of its time in
ReadPixels, and almost all of that is b8g8r8a8_unorm8.  It's really slow
because we're getting about 47MB/s by doing uncached reads 32 bits at a
time in the code-generated unpack.  If we use NEON to generate larger bus
transactions, we can speed things up to 136MB/s.  In comparison, raw
ldr/str reads/writes with no byte swapping can hit a max of 216MB/s.

Reviewed-by: Jesse Natalie <jenatali@microsoft.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10014>
2021-04-19 17:59:44 +00:00
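
The idea in the commit message above (load many pixels per instruction instead of one 32-bit word at a time, then reorder the channels) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from the merge request: the function name `bgra8_to_rgba8_row` is hypothetical, the source is assumed to hold bytes in B, G, R, A order, and the destination is assumed to want R, G, B, A. `vld4q_u8` and `vst4q_u8` are standard NEON intrinsics; the actual commit may well use a different instruction sequence. On non-NEON targets the sketch falls back to the scalar loop.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#endif

/* Hypothetical helper (not Mesa's API): convert one row of pixels stored
 * as B, G, R, A bytes into R, G, B, A bytes. */
static void
bgra8_to_rgba8_row(uint8_t *dst, const uint8_t *src, size_t width)
{
   assert(width == 0 || (dst != NULL && src != NULL));
   size_t i = 0;

#if defined(__ARM_NEON) || defined(__ARM_NEON__)
   /* Process 16 pixels (64 bytes) per iteration.  vld4q_u8 de-interleaves
    * the four channels into four 16-byte registers, so swapping B and R is
    * just a register exchange before the interleaving store. */
   for (; i + 16 <= width; i += 16) {
      uint8x16x4_t px = vld4q_u8(src + i * 4);
      uint8x16_t b = px.val[0];
      px.val[0] = px.val[2];
      px.val[2] = b;
      vst4q_u8(dst + i * 4, px);
   }
#endif

   /* Scalar tail, and the full path on targets without NEON. */
   for (; i < width; i++) {
      const uint8_t *s = src + i * 4;
      uint8_t *d = dst + i * 4;
      uint8_t b = s[0], g = s[1], r = s[2], a = s[3];
      d[0] = r;
      d[1] = g;
      d[2] = b;
      d[3] = a;
   }
}
```

The wide loads are the point of the sketch: each iteration issues 64-byte transfers rather than sixteen separate 32-bit reads, which is the kind of larger bus transaction the commit message credits for the speedup on uncached memory. The throughput figures quoted above come from the commit author's measurements and are not reproduced by this example.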
..
amd aco: fix opquantize2f16 on GFX6-7 2021-04-19 16:33:37 +00:00
android_stub android: add some more stub functions for cross compilation 2021-03-25 06:06:16 +00:00
broadcom v3dv: better tracking of dirty push constant state 2021-04-16 12:29:11 +00:00
compiler nir: add lowering pass for helperInvocationEXT() 2021-04-19 17:11:36 +00:00
drm-shim drm-shim: report support for timeline semaphores 2021-02-09 21:08:52 +00:00
egl egl/android: include "util/compiler.h" for FALLTHROUGH macro 2021-04-16 16:42:20 +00:00
etnaviv etnaviv: tell the truth if alpha-test is supported 2021-04-08 19:25:26 +00:00
freedreno turnip: implement VK_KHR_shader_terminate_invocation 2021-04-19 17:11:36 +00:00
gallium util: Switch the non-block formats to unpacking rgba rows instead of rects. 2021-04-19 17:59:44 +00:00
gbm egl: fix software flag in _eglAddDevice call on DRM 2021-03-26 10:32:31 +00:00
getopt scons: Remove. 2021-03-20 10:38:55 +00:00
glx glx: Remove some ancient backwards-compatibility typedefs 2021-04-16 10:13:07 +00:00
gtest
hgl hgl: Major refactor and cleanup 2021-01-09 20:51:35 -06:00
imgui
intel anv: disable baked in pipeline bits from dynamic emission path 2021-04-16 11:53:25 +00:00
loader loader: prefer iris on Android 2021-03-31 06:28:18 +00:00
mapi scons: Remove. 2021-03-20 10:38:55 +00:00
mesa util: Switch the non-block formats to unpacking rgba rows instead of rects. 2021-04-19 17:59:44 +00:00
microsoft microsoft/clc: If local size isn't specified either in the shader or at runtime, set it to (1,1,1) 2021-04-19 15:38:57 +00:00
nouveau nouveau: add drm-shim support 2021-01-11 22:45:01 +00:00
panfrost nir: Generate load_ubo_vec4 directly for !PIPE_CAP_NATIVE_INTEGERS 2021-04-16 21:58:00 +00:00
util util/format: Add some NEON intrinsics-based u_format_unpack. 2021-04-19 17:59:44 +00:00
virtio venus: add vn_wsi_create_scanout_image 2021-04-16 17:29:38 -07:00
vulkan vulkan/util: cast enums to int64_t in switch 2021-04-19 06:14:11 +00:00
meson.build venus: add driver skeleton 2021-04-08 17:15:37 +00:00