ir3: Fix vectorizer condition for SSBOs

SSBO access works very differently from UBO access. Straddling loads/stores aren't an issue; instead, loads/stores must be aligned to the element size and can have up to 4 components.

We support 16-bit access with SSBOs on a650+, and sometimes the vectorizer tries to create a misaligned 32-bit access when combining 32-bit and 16-bit accesses. The UBO-focused logic didn't reject this, which is now fixed.

This fixes a number of VK-CTS regressions on a650+.

Fixes: bf49d4a084 ("freedreno/ir3: Enable load/store vectorization for SSBO access, too.")
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17040>
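
For illustration, the non-UBO condition described above can be pulled out into a standalone predicate. This is only a sketch: the name `ssbo_should_vectorize` is hypothetical and does not exist in Mesa, and the reduced parameter list is an assumption made to keep the example self-contained. The body mirrors the expression added to `ir3_nir_should_vectorize_mem` in the diff below, plus a guard that `bit_size` is at least 8 so the modulo is well defined.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical standalone copy of the non-UBO branch of the check.
 * A combined access is accepted only if:
 *   - the element is at most 32 bits wide,
 *   - the known alignment covers at least one element,
 *   - the offset within that alignment is a multiple of the element size,
 *   - the result has at most 4 components.
 */
static bool
ssbo_should_vectorize(unsigned align_mul, unsigned align_offset,
                      unsigned bit_size, unsigned num_components)
{
   /* Assumption of this sketch: sub-byte sizes are never passed in. */
   assert(bit_size >= 8);

   unsigned byte_size = bit_size / 8;

   return bit_size <= 32 && align_mul >= byte_size &&
          align_offset % byte_size == 0 &&
          num_components <= 4;
}
```

With these inputs, a single 32-bit access at `align_mul = 4`, `align_offset = 2` is rejected because the offset is not a multiple of 4 bytes, while two 16-bit components at the same alignment are accepted because the offset is a multiple of 2 bytes. That is the misaligned 32-bit case the commit message describes.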
2026-02-24 11:40:25 +01:00 · 2022-06-14 20:47:29 +02:00 · 2022-06-14 20:47:29 +02:00 · 7d706af76b
commit 7d706af76b
parent 6fc2622abd
1 changed file with 8 additions and 1 deletion
--- a/src/freedreno/ir3/ir3_nir.c
+++ b/src/freedreno/ir3/ir3_nir.c
@@ -37,10 +37,17 @@ ir3_nir_should_vectorize_mem(unsigned align_mul, unsigned align_offset,
                             nir_intrinsic_instr *low,
                             nir_intrinsic_instr *high, void *data)
 {
+   unsigned byte_size = bit_size / 8;
+
+   if (low->intrinsic != nir_intrinsic_load_ubo) {
+      return bit_size <= 32 && align_mul >= byte_size &&
+         align_offset % byte_size == 0 &&
+         num_components <= 4;
+   }
+
   assert(bit_size >= 8);
   if (bit_size != 32)
      return false;
-   unsigned byte_size = bit_size / 8;

   int size = num_components * byte_size;