r600/sfn: Add some documentation

Signed-off-by: Gert Wollny <gert.wollny@collabora.com>
Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3225>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3225>
2026-05-05 03:08:05 +02:00 · 2019-12-28 18:23:19 +01:00 · 2019-12-28 18:23:19 +01:00 · 897a4a0041
commit 897a4a0041
parent 7413aab3c8
1 changed files with 69 additions and 0 deletions
--- a/src/gallium/drivers/r600/sfn/sfn_docu.txt
+++ b/src/gallium/drivers/r600/sfn/sfn_docu.txt
@@ -0,0 +1,69 @@
+# R600 shader from NIR
+
+This code is an attempt to implement a NIR backend for r600.
+
+## State
+
+The piglit tests for glsl-1.10 through 3.3 and for gl-1.*, gl-2.*, and gl-3.* mostly pass as they do
+with TGSI; there are some fixes but also a few regressions.
+
+## Currently missing features w.r.t. TGSI:
+
+ - Tessellation shaders
+ - compute shader support
+ - image load/store
+ - work group shared values
+ - SSBO atomics
+
+## Needed optimizations:
+
+  - Register allocator and scheduler (Could the sb allocator and scheduler
+    be ported?)
+
+  - peepholes:
+    - compare + set predicate
+
+  - copy propagation:
+    - Moves from inputs are usually not required; they could be forwarded
+    - texture operations often move additional parameters into extra registers,
+      but they are actually needed in the same registers they come from and
+      could just be swizzled into the right place
+      (lower in NIR, as is done e.g. in ETNAVIV)
+
+
+## Problems
+
+- figure out what is wrong with the texcoord semantics: disabling it results in
+  varyings beyond the supported VAR31, and enabling it makes some shaders that
+  use VAR0 fail.
+
+- UBOs behave strangely: with
+  glsl-1.50/uniform_buffer/gs-mat4x3.shader_test
+  on TGSI we have
+     ADD TEMP[1].xyz = CONST[1][0].xyzz CONST[1][1].xyzz
+  with NIR we have
+     vec4 ssa_12 = intrinsic load_ubo(_r600) (0, 0)(0 , 4 ,0)
+     vec4 ssa_13 = intrinsic load_ubo(_r600) (0, 1)(0 , 4 ,0)
+     vec3 ssa_14 = fadd ssa_12.xyw, ssa_13.xyw
+  so why is the "w" component emitted?
+
+## Unknowns
+
+- multi-function shaders: how to deal with them? fp64 seems to have lots
+  of them; one option is to inline them
+
+- can type information from variables be harvested?
+
+lowering passes in NIR:
+  - TESS IO address evaluation should be lowered
+
+## Work plan
+
+The idea is to create two conversions: one from NIR to a new R600 IR that
+can be used to run some finalizing optimizations (replacing the
+need for r600/sb), and one from that IR to binary code.
+
+The implementation uses C++ to separate the code for the different
+shader types and the byte code generation backends. The initial attempt
+will use the already available r600_asm code.
+
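The copy-propagation item under "Needed optimizations" notes that moves from inputs are usually not required and could be forwarded. A minimal sketch of that idea on a toy IR might look like the following. The `Instr` struct and the `forward_moves` function are hypothetical names invented for this illustration and are not part of the sfn code. The sketch assumes SSA-like input (each register is written at most once) and that the destination of a `mov` is a plain temporary, never a shader output.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical toy IR, not the sfn classes: every instruction writes one
// virtual register and reads zero or more registers.
struct Instr {
    std::string op;         // e.g. "mov", "add", "mul"
    int dest;               // register written
    std::vector<int> srcs;  // registers read
};

// Forward the source of each plain move into later users and drop the move.
// Assumption: each register is written at most once, so a recorded copy
// never goes stale, and mov destinations are temporaries only.
std::vector<Instr> forward_moves(const std::vector<Instr>& in)
{
    std::unordered_map<int, int> copy_of;  // mov dest -> original register
    std::vector<Instr> out;

    for (Instr ir : in) {
        // Rewrite every source that is known to be a copy.
        for (int& s : ir.srcs) {
            auto it = copy_of.find(s);
            if (it != copy_of.end())
                s = it->second;
        }

        if (ir.op == "mov") {
            assert(ir.srcs.size() == 1);
            // The source was already rewritten above, so chains of moves
            // collapse to the first register in the chain.
            copy_of[ir.dest] = ir.srcs[0];
            continue;  // the move itself is no longer needed
        }

        out.push_back(ir);
    }
    return out;
}
```

With the input `mov r10, r0; mov r11, r1; add r12, r10, r11`, the function returns the single instruction `add r12, r0, r1`. A real pass would also have to handle swizzles, write masks, and registers that are live out of the shader, none of which this sketch models.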