mesa/src
Rhys Perry fc04a2fc31 aco: take LDS into account when calculating num_waves
pipeline-db (Vega):
SGPRS: 344 -> 344 (0.00 %)
VGPRS: 424 -> 524 (23.58 %)
Spilled SGPRs: 84 -> 80 (-4.76 %)
Spilled VGPRs: 0 -> 0 (0.00 %)
Private memory VGPRs: 0 -> 0 (0.00 %)
Scratch size: 0 -> 0 (0.00 %) dwords per thread
Code Size: 52812 -> 52484 (-0.62 %) bytes
LDS: 135 -> 135 (0.00 %) blocks
Max Waves: 56 -> 53 (-5.36 %)

v2: consider WGP, rework to be clearer and apply the
    "maximum 16 workgroups per CU" limit properly
v2: use "SIMD" instead of "EU"
v2: fix spiller by introducing "Program::max_waves"
v2: rename "lds_size" to "lds_limit"
v3: make max_waves actually independent of register usage
v3: fix issue where max_waves was way too high
v3: use DIV_ROUND_UP(a, b) instead of max(a / b, 1)
v3: rename "workgroups_per_cu" to "workgroups_per_cu_wgp"
v4: fix typo from "workgroups_per_cu" rename
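
The changelog above mentions three ideas: capping workgroups by LDS usage, the "maximum 16 workgroups per CU" limit, and using DIV_ROUND_UP(a, b) instead of max(a / b, 1). The following is a rough illustrative sketch of how they could fit together, not the code from this commit. The function name lds_limited_waves_per_simd and all of its parameters are hypothetical, and the sketch assumes the 16-workgroup cap applies unconditionally.

```c
#include <assert.h>

/* Round-up integer division (the usual definition of this macro). */
#define DIV_ROUND_UP(a, b) (((a) + (b) - 1) / (b))

static unsigned min_u(unsigned a, unsigned b) { return a < b ? a : b; }
static unsigned max_u(unsigned a, unsigned b) { return a > b ? a : b; }

/* The older form named in the changelog: truncating division clamped to 1.
 * It under-counts whenever a is larger than b but not a multiple of it. */
static unsigned div_clamped(unsigned a, unsigned b)
{
   assert(b != 0);
   return max_u(a / b, 1);
}

/* Hypothetical helper: an upper bound on waves per SIMD that depends only on
 * LDS usage and workgroup shape, not on register usage.
 *   waves_per_workgroup - waves in one workgroup
 *   lds_per_workgroup   - LDS used by one workgroup (0 if none)
 *   lds_limit           - LDS available per CU, in the same unit
 *   simd_per_cu         - number of SIMDs sharing that LDS */
static unsigned lds_limited_waves_per_simd(unsigned waves_per_workgroup,
                                           unsigned lds_per_workgroup,
                                           unsigned lds_limit,
                                           unsigned simd_per_cu)
{
   assert(simd_per_cu != 0);

   /* Start from the "maximum 16 workgroups per CU" limit. */
   unsigned workgroups_per_cu = 16;

   /* LDS can only lower that number. */
   if (lds_per_workgroup != 0)
      workgroups_per_cu = min_u(workgroups_per_cu, lds_limit / lds_per_workgroup);

   unsigned waves_per_cu = workgroups_per_cu * waves_per_workgroup;

   /* Waves are spread across the SIMDs, so round up rather than truncate. */
   return DIV_ROUND_UP(waves_per_cu, simd_per_cu);
}
```

With 5 waves per workgroup, 32768 bytes of LDS per workgroup, a 65536-byte limit and 4 SIMDs, two workgroups fit, giving 10 waves per CU. Rounding up yields 3 waves on the busiest SIMD, where max(10 / 4, 1) would report 2.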

Signed-off-by: Rhys Perry <pendingchaos02@gmail.com>
Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> (v3)
2019-10-23 19:11:21 +01:00
amd aco: take LDS into account when calculating num_waves 2019-10-23 19:11:21 +01:00
broadcom Revert "v3d: do not report alpha-test as supported" 2019-10-23 13:03:55 +02:00
compiler Revert "nir: drop support for using load_alpha_ref_float" 2019-10-23 13:03:52 +02:00
drm-shim meson: replace libmesa_util with idep_mesautil 2019-08-03 00:08:37 +00:00
egl egl/android: Remove our own reference to buffers. 2019-10-21 07:50:31 +00:00
etnaviv etnaviv: drm: add softpin interface 2019-10-18 19:05:21 +02:00
freedreno freedreno/ir3: fixup register footprint fixup 2019-10-22 17:46:19 +00:00
gallium Revert "vc4: do not report alpha-test as supported" 2019-10-23 13:03:59 +02:00
gbm gbm: Add GBM_MAX_PLANES definition 2019-10-18 13:18:28 +00:00
getopt meson: build getopt when using msvc 2019-09-10 20:36:47 +00:00
glx dri: Avoid swapbuffer throttling in glXCopySubBufferMESA 2019-10-05 13:19:37 +00:00
gtest
hgl haiku: fix Mesa build 2019-10-01 10:31:02 +00:00
imgui imgui: bump imgui memory editor copy 2019-06-18 15:34:13 +00:00
intel anv: implement VK_INTEL_performance_query 2019-10-23 05:41:15 +00:00
loader loader: Simplify handling of the radeonsi driver 2019-10-08 09:02:34 +00:00
mapi mesa: add EXT_dsa NamedCopyBufferSubDataEXT function 2019-10-18 10:26:26 +02:00
mesa intel/perf: move registers to their own header 2019-10-23 05:41:14 +00:00
panfrost nir/lower_idiv: add new llvm-based path 2019-10-21 18:49:46 +00:00
util util/u_atomic: fix return type of p_atomic_{inc,dec}_return() and p_atomic_{cmp,}xchg() 2019-10-16 19:41:47 +01:00
vulkan vulkan: Update the XML and headers to 1.1.125 2019-10-14 08:23:27 -07:00
meson.build meson: rename glvnd_missing_pc_files to not glvnd_has_headers_and_pc_files 2019-10-10 22:18:04 +01:00
SConscript