mesa/src/gallium/drivers
Kenneth Graunke 059ae38c11 iris: Check prog[] instead of uncompiled[] for BLORP state skipping
Huge thanks to Tapani Pälli for debugging this issue, figuring out
what was going wrong, proposing fixes, and walking me through where
things were going off the rails.

BLORP always disables tessellation and geometry shaders.  Our handling
tried to look at ice->shaders.uncompiled[] to determine whether the next
draw needed those shaders.  If not, we can leave BLORP's residual state
(with those stages disabled) in place, and skip flagging it dirty.

Unfortunately, predicting the future is a bit fraught, in part due to
the uncompiled[] and prog[] arrays being slightly out of sync at times.

Consider the following case:

1. Draw with tessellation shaders in place

   => uncompiled[TES] and prog[TES] will both point at valid shaders.

2. Gallium calls pipe->bind_tes_state(NULL).

   => This makes uncompiled[TES] point at NULL, and flags
      IRIS_STAGE_DIRTY_UNCOMPILED_TES.

      Because iris_update_compiled_shaders() hasn't happened yet,
      uncompiled[TES] is NULL but prog[TES] still holds the stale
      TES from the previous draw.

3. BLORP operations happen

   => BLORP sees uncompiled[TES] == NULL and decides that tessellation
      is off for the upcoming draws.  So it skips flagging tess state.

4. Gallium calls pipe->bind_tes_state(shader from step #1).

   => uncompiled[TES] points at the original shader.
      IRIS_STAGE_DIRTY_UNCOMPILED_TES gets flagged again.

5. Draw again

   => This calls iris_update_compiled_shaders(), which sees that
      a TES is bound, and calls iris_update_compiled_tes().  But
      because the same shader was bound as before, the program it
      comes up with is identical to the one already bound at
      ice->shaders.prog[TES].  So, it thinks it doesn't have to
      flag any tessellation state dirty because it was already
      set up for the last draw.

This random unbind and rebind between draws leads to a situation
where, at step #3, BLORP thinks it can skip flagging tessellation
state (nothing is bound), and at step #5, normal state handling
thinks it can skip flagging tessellation state (nothing changed
since last time).  So nobody does, and things break.

This unbind appears to be happening when st_release_variants()
decides it wants to free some shaders.  Then a rebind happens to
put back the actual shader for the draw.  So, it's not theoretical.

To fix this, we change BLORP to look at ice->shaders.prog[] rather
than uncompiled[].  This is equivalent to thinking about the previous
draw, rather than the next.  If the last draw had tessellation off,
then BLORP's disabling was a no-op, and the GPU is still in the same
state as the previous draw.  This is more reliable than predicting
the future.
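
The five steps above and the fix can be replayed in a small toy model.
This is only a sketch: struct toy_ctx and the toy_* functions are
hypothetical names invented for illustration, "compiling" is modelled as
the identity on a shader id, and none of this is the actual iris code.
The check_prog flag selects whether the BLORP step consults the
prog-like field (the fix) or the uncompiled-like field (the old check).

```c
#include <stdbool.h>

/* Toy model only: hypothetical names, not the real iris structures. */
struct toy_ctx {
   int uncompiled_tes;     /* bound shader id, 0 = NULL (like uncompiled[TES]) */
   int prog_tes;           /* program used by the last draw (like prog[TES]) */
   bool dirty_uncompiled;  /* like IRIS_STAGE_DIRTY_UNCOMPILED_TES */
   bool dirty_tess_state;  /* tessellation state must be re-emitted */
   bool hw_tess_enabled;   /* what the GPU is actually programmed with */
};

static void toy_bind_tes(struct toy_ctx *c, int shader)
{
   c->uncompiled_tes = shader;
   c->dirty_uncompiled = true;
}

static void toy_blorp(struct toy_ctx *c, bool check_prog)
{
   /* BLORP always disables tessellation on the GPU. */
   c->hw_tess_enabled = false;

   /* Old check: guess at the next draw.  New check: look at the last draw. */
   int tes = check_prog ? c->prog_tes : c->uncompiled_tes;
   if (tes != 0)
      c->dirty_tess_state = true;
}

/* Returns true if the GPU state matches what this draw needs. */
static bool toy_draw(struct toy_ctx *c)
{
   if (c->dirty_uncompiled) {
      /* Only flag tessellation state when the program actually changes. */
      if (c->uncompiled_tes != c->prog_tes) {
         c->prog_tes = c->uncompiled_tes;
         c->dirty_tess_state = true;
      }
      c->dirty_uncompiled = false;
   }

   if (c->dirty_tess_state) {
      c->hw_tess_enabled = (c->prog_tes != 0);
      c->dirty_tess_state = false;
   }

   return c->hw_tess_enabled == (c->prog_tes != 0);
}

/* Replays steps 1-5; returns true if both draws saw correct GPU state. */
bool toy_run_scenario(bool check_prog)
{
   struct toy_ctx c = {0};

   toy_bind_tes(&c, 1);
   bool ok = toy_draw(&c);      /* step 1: draw with a TES bound */
   toy_bind_tes(&c, 0);         /* step 2: unbind */
   toy_blorp(&c, check_prog);   /* step 3: BLORP operation */
   toy_bind_tes(&c, 1);         /* step 4: rebind the same shader */
   return ok && toy_draw(&c);   /* step 5: draw again */
}
```

In this model, toy_run_scenario(false) returns false because neither the
BLORP step nor the second draw flags tessellation state, while
toy_run_scenario(true) returns true because the BLORP step sees the
program from the previous draw and flags the state for re-emission.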

Cc: mesa-stable
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8308
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9678
Reviewed-by: Tapani Pälli <tapani.palli@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24880>
(cherry picked from commit d693027a00)
2023-09-04 11:30:26 +01:00
Name       Date                        Last commit
asahi      2023-08-16 11:49:09 +01:00  asahi,agx: Upload constant buffers immediately
crocus     2023-07-18 22:38:07 +01:00  crocus: Avoid fast-clear with incompatible view
d3d12      2023-07-18 22:38:06 +01:00  d3d12: Fix indexing of local_reference_state
etnaviv    2023-09-04 11:30:24 +01:00  etnaviv: Fully replicate back stencil config
freedreno  2023-08-10 15:59:37 +01:00  freedreno/a5xx: Skip emitting unused texture descriptors for images.
i915       2023-05-05 19:05:45 +01:00  i915: use util_unreference_framebuffer_state to unref fb state
iris       2023-09-04 11:30:26 +01:00  iris: Check prog[] instead of uncompiled[] for BLORP state skipping
lima       2023-07-21 13:40:20 +01:00  lima: update CI fixes and flakes
llvmpipe   2023-08-10 21:42:16 +01:00  llvmpipe: fix early depth + alpha2coverage + occlusion query interaction
nouveau    2023-08-18 12:11:11 +01:00  nv50: limit max code uploads to 0x8000
panfrost   2023-07-21 18:08:09 +01:00  panfrost: Fix transform feedback on v9 harder
r300       2023-04-26 17:37:27 +01:00  r300: fix unconditional KIL on R300/R400
r600       2023-08-07 18:53:55 +01:00  r600/sfn: Take source uses into account when switching channels
radeonsi   2023-07-18 22:48:05 +01:00  util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock
softpipe   2023-05-05 19:05:49 +01:00  softpipe: use util_unreference_framebuffer_state to unref fb state
svga       2023-05-05 19:05:54 +01:00  svga: use util_unreference_framebuffer_state to unref fb state
tegra      2023-03-31 13:39:05 +00:00  tegra: Add support for get_screen_fd
v3d        2023-05-05 19:05:50 +01:00  v3d: use util_unreference_framebuffer_state to unref fb state
vc4        2023-09-04 11:30:23 +01:00  vc4: drop duplicate .lower_ldexp
virgl      2023-09-04 11:30:23 +01:00  ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace
zink       2023-09-04 11:30:25 +01:00  zink: don't start multiple cache jobs for the same program