fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-27 12:18:11 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	cd2a4cc47c	pan/bi: Unit test message preloading optimization To make sure it is applied in the cases we expect it to be, to avoid code generation regressions. Functional regressions are expected to be caught by integration-testing, so that is not focused on here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 14:13:21 -05:00
Alyssa Rosenzweig	eb1479bda2	pan/bi: Support message preloading Preload LD_VAR_IMM or VAR_TEX instructions in the first block of fragment shaders on v7. Preloaded messages write to fixed registers; when replacing instructions we insert moves from the registers at the start of the program and hope coalescing goes to town. (Admittedly we don't do any coalescing yet...) The extra moves hurts instruction count in some cases; the win for cycle count should cancel this out. When we get smarter copy prop or RA, those moves should go away anyway. This optimization may hurt register pressure by extending the lifetime of up to eight registers written in the first block. This is expected to be acceptable: on a large shader-db, there are no additional spills/fills, and only two shaders are hurt on thread count. This optimization only applies to v7, as the hardware was not introduced on v6 and was removed for Valhall. total instructions in shared programs: 2451624 -> 2454286 (0.11%) instructions in affected programs: 909046 -> 911708 (0.29%) helped: 4719 HURT: 3341 helped stats (abs) min: 1.0 max: 10.0 x̄: 1.49 x̃: 1 helped stats (rel) min: 0.08% max: 33.33% x̄: 6.79% x̃: 3.92% HURT stats (abs) min: 1.0 max: 50.0 x̄: 2.90 x̃: 2 HURT stats (rel) min: 0.12% max: 66.67% x̄: 6.39% x̃: 3.45% 95% mean confidence interval for instructions value: 0.27 0.39 95% mean confidence interval for instructions %-change: -1.55% -1.11% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total tuples in shared programs: 1969529 -> 1963429 (-0.31%) tuples in affected programs: 601327 -> 595227 (-1.01%) helped: 5907 HURT: 1297 helped stats (abs) min: 1.0 max: 8.0 x̄: 1.41 x̃: 1 helped stats (rel) min: 0.07% max: 33.33% x̄: 7.25% x̃: 5.26% HURT stats (abs) min: 1.0 max: 40.0 x̄: 1.73 x̃: 1 HURT stats (rel) min: 0.16% max: 31.75% x̄: 3.38% x̃: 2.02% 95% mean confidence interval for tuples value: -0.88 -0.81 95% mean confidence interval for tuples %-change: -5.52% -5.15% Tuples are helped. total clauses in shared programs: 401689 -> 387830 (-3.45%) clauses in affected programs: 136944 -> 123085 (-10.12%) helped: 8427 HURT: 4 helped stats (abs) min: 1.0 max: 4.0 x̄: 1.65 x̃: 2 helped stats (rel) min: 0.49% max: 50.00% x̄: 19.88% x̃: 18.18% HURT stats (abs) min: 1.0 max: 4.0 x̄: 2.50 x̃: 2 HURT stats (rel) min: 1.96% max: 19.05% x̄: 14.18% x̃: 17.86% 95% mean confidence interval for clauses value: -1.66 -1.63 95% mean confidence interval for clauses %-change: -20.15% -19.58% Clauses are helped. total cycles in shared programs: 202735.83 -> 201862.21 (-0.43%) cycles in affected programs: 16295.46 -> 15421.83 (-5.36%) helped: 3349 HURT: 1962 helped stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.32 x̃: 0 helped stats (rel) min: 0.24% max: 100.00% x̄: 40.77% x̃: 33.33% HURT stats (abs) min: 0.041665999999999315 max: 1.5833329999999997 x̄: 0.10 x̃: 0 HURT stats (rel) min: 0.09% max: 31.40% x̄: 2.95% x̃: 1.94% 95% mean confidence interval for cycles value: -0.17 -0.16 95% mean confidence interval for cycles %-change: -25.48% -23.76% Cycles are helped. total arith in shared programs: 74665.50 -> 74920.00 (0.34%) arith in affected programs: 16059.92 -> 16314.42 (1.58%) helped: 860 HURT: 3409 helped stats (abs) min: 0.041665999999999315 max: 0.25 x̄: 0.06 x̃: 0 helped stats (rel) min: 0.24% max: 37.50% x̄: 4.73% x̃: 2.56% HURT stats (abs) min: 0.041665999999999315 max: 1.5833329999999997 x̄: 0.09 x̃: 0 HURT stats (rel) min: 0.09% max: 100.00% x̄: 8.99% x̃: 4.21% 95% mean confidence interval for arith value: 0.06 0.06 95% mean confidence interval for arith %-change: 5.83% 6.62% Arith are HURT. total texture in shared programs: 13083.50 -> 11877 (-9.22%) texture in affected programs: 1663 -> 456.50 (-72.55%) helped: 2377 HURT: 3 helped stats (abs) min: 0.5 max: 1.0 x̄: 0.51 x̃: 0 helped stats (rel) min: 6.25% max: 100.00% x̄: 87.12% x̃: 100.00% HURT stats (abs) min: 0.5 max: 0.5 x̄: 0.50 x̃: 0 HURT stats (rel) min: 0.00% max: 25.00% x̄: 16.67% x̃: 25.00% 95% mean confidence interval for texture value: -0.51 -0.50 95% mean confidence interval for texture %-change: -87.98% -86.00% Texture are helped. total vary in shared programs: 10220.62 -> 4183.88 (-59.06%) vary in affected programs: 10126.50 -> 4089.75 (-59.61%) helped: 8538 HURT: 0 helped stats (abs) min: 0.125 max: 1.0 x̄: 0.71 x̃: 0 helped stats (rel) min: 7.14% max: 100.00% x̄: 74.74% x̃: 87.50% 95% mean confidence interval for vary value: -0.71 -0.70 95% mean confidence interval for vary %-change: -75.32% -74.16% Vary are helped. total quadwords in shared programs: 1766717 -> 1757161 (-0.54%) quadwords in affected programs: 553801 -> 544245 (-1.73%) helped: 6760 HURT: 711 helped stats (abs) min: 1.0 max: 11.0 x̄: 1.58 x̃: 1 helped stats (rel) min: 0.09% max: 29.41% x̄: 5.31% x̃: 4.84% HURT stats (abs) min: 1.0 max: 33.0 x̄: 1.54 x̃: 1 HURT stats (rel) min: 0.10% max: 31.13% x̄: 2.53% x̃: 1.61% 95% mean confidence interval for quadwords value: -1.31 -1.25 95% mean confidence interval for quadwords %-change: -4.67% -4.46% Quadwords are helped. total threads in shared programs: 52899 -> 52897 (<.01%) threads in affected programs: 4 -> 2 (-50.00%) helped: 0 HURT: 2 total preloads in shared programs: 0 -> 116492 preloads in affected programs: 0 -> 116492 helped: 0 HURT: 8604 HURT stats (abs) min: 2.0 max: 24.0 x̄: 13.54 x̃: 14 HURT stats (rel) min: 0.00% max: 0.00% x̄: 0.00% x̃: 0.00% 95% mean confidence interval for preloads value: 13.45 13.63 95% mean confidence interval for preloads %-change: 0.00% 0.00% Preloads are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 14:09:14 -05:00
Alyssa Rosenzweig	c8437cd415	pan/bi: Account for message preloading in shaderdb If a message-passing instruction like LD_VAR is preloaded, it will no longer be counted in the shader cycle counts. Add a special message preload counter that approximates the cost of preloading, so this information doesn't get a lost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	19541dc8c8	pan/bi: Add bi_before_nonempty_block helper To be used in the message preloading pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	6618697e0e	panfrost: Pack message preloads from compiler Include full message preload descriptors in the RSD on v7, and do the obvious packing for fragment shader message preloads. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	bd06a26662	panfrost: Add an unpacked message preload struct The compiler will soon produce preloaded messages, but it should not pack them itself, as this would require depending on GenXML or handcoding bitfields / bit packs in the compiler. Instead, add a struct encoding the unpacked form of the message, used as ABI between the compiler and the common driver. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	2d0c4973dc	panfrost: Remove Message Preload Descriptor from v6.xml It is an anachronism, as this descriptor was added in v7 and, seemingly, removed immediately after. Good work. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:50:58 -05:00
Alyssa Rosenzweig	43bbe367ea	panfrost/ci: Move T860 flake to skip Actually an xfail but occassionally passes and gives us no new information, only noise. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-and-acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15154>	2022-02-24 14:51:31 +00:00
Alyssa Rosenzweig	5c07f7c427	panfrost/ci: Move T720 flakes to skips Doesn't seem like these will be resolved anytime soon.. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-and-acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15154>	2022-02-24 14:51:31 +00:00
Tomeu Vizoso	c0695bb473	ci: Allow disabling the whole of the Collabora farm Add a global-level variable that allows disabling all jobs that would have gone to the Collabora lab, to be used in case of outages. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15150>	2022-02-24 07:33:45 +01:00
Alyssa Rosenzweig	6b2eda6b72	pan/bi: Reorder pushed uniforms to avoid moves On Bifrost and Valhall, push uniforms are loaded into Fast Access Uniform Random Access Memory (FAU-RAM). FAU-RAM is organized as an array of 64-bit slots. A given tuple (Bifrost) or instruction (Valhall) may access at most a single 64-bit slot. If an instruction requires uniforms from multiple 64-bit slots, a uniform-to-register move must be inserted to avoid the hazard. However, if an instruction requires a pair of 32-bit uniforms from the same 64-bit slot, no move is required. To reduce the number of moves we emit, this commit adds an optimization pass that reorders pushed uniforms, trying to group uniforms used by the same instruction. The pass works by creating a graph of pushed uniforms, where edges denote the "both 32-bit uniforms required by the same instruction" relationship. We perform depth-first search on this graph to find the connected components, where each connected component is a cluster of uniforms that are used together. We then select pairs of uniforms from each connected component. The remaining unpaired uniforms (from components of odd sizes) are paired together arbitrarily. In principle, we should weight the graph by number of occurences and choose pairs that maximize the total selected edge weight. This is left for future work, as it is nontrivial -- selecting these edges optimally appears to be NP-hard at first blush. Implementation note: As position and varying shaders share FAU on Bifrost, extra care is taken with a `push_offset` shader stage info parameter that ensures varying shaders do not reorder uniforms selected by the previous position shader. total instructions in shared programs: 2503343 -> 2451758 (-2.06%) instructions in affected programs: 1553309 -> 1501724 (-3.32%) helped: 14256 HURT: 8 helped stats (abs) min: 1.0 max: 80.0 x̄: 3.62 x̃: 3 helped stats (rel) min: 0.06% max: 36.36% x̄: 7.31% x̃: 6.67% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.38 x̃: 1 HURT stats (rel) min: 1.30% max: 12.50% x̄: 4.99% x̃: 3.85% 95% mean confidence interval for instructions value: -3.66 -3.58 95% mean confidence interval for instructions %-change: -7.41% -7.20% Instructions are helped. total tuples in shared programs: `2008399` -> 1969627 (-1.93%) tuples in affected programs: 1146344 -> 1107572 (-3.38%) helped: 12867 HURT: 147 helped stats (abs) min: 1.0 max: 61.0 x̄: 3.03 x̃: 2 helped stats (rel) min: 0.17% max: 42.86% x̄: 6.79% x̃: 4.65% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.20 x̃: 1 HURT stats (rel) min: 0.29% max: 20.00% x̄: 2.12% x̃: 1.19% 95% mean confidence interval for tuples value: -3.03 -2.93 95% mean confidence interval for tuples %-change: -6.82% -6.57% Tuples are helped. total clauses in shared programs: 408005 -> 401708 (-1.54%) clauses in affected programs: 90760 -> 84463 (-6.94%) helped: 6006 HURT: 164 helped stats (abs) min: 1.0 max: 9.0 x̄: 1.08 x̃: 1 helped stats (rel) min: 0.45% max: 33.33% x̄: 12.44% x̃: 14.29% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 1.64% max: 25.00% x̄: 9.81% x̃: 5.26% 95% mean confidence interval for clauses value: -1.03 -1.01 95% mean confidence interval for clauses %-change: -12.03% -11.66% Clauses are helped. total cycles in shared programs: 203308.37 -> 202737.83 (-0.28%) cycles in affected programs: 19264.71 -> 18694.17 (-2.96%) helped: 3024 HURT: 41 helped stats (abs) min: 0.041665999999999315 max: 2.5416680000000014 x̄: 0.19 x̃: 0 helped stats (rel) min: 0.17% max: 33.33% x̄: 3.83% x̃: 2.83% HURT stats (abs) min: 0.041665999999999315 max: 0.125 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.30% max: 5.88% x̄: 1.41% x̃: 0.93% 95% mean confidence interval for cycles value: -0.19 -0.18 95% mean confidence interval for cycles %-change: -3.89% -3.64% Cycles are helped. total arith in shared programs: 76265.67 -> 74669.25 (-2.09%) arith in affected programs: 45001.50 -> 43405.08 (-3.55%) helped: 12945 HURT: 97 helped stats (abs) min: 0.041665999999999315 max: 2.5416680000000014 x̄: 0.12 x̃: 0 helped stats (rel) min: 0.17% max: 50.00% x̄: 8.06% x̃: 4.88% HURT stats (abs) min: 0.041665999999999315 max: 0.125 x̄: 0.05 x̃: 0 HURT stats (rel) min: 0.21% max: 33.33% x̄: 2.16% x̃: 0.96% 95% mean confidence interval for arith value: -0.12 -0.12 95% mean confidence interval for arith %-change: -8.16% -7.81% Arith are helped. total quadwords in shared programs: 1796563 -> 1766803 (-1.66%) quadwords in affected programs: 948830 -> 919070 (-3.14%) helped: 12078 HURT: 219 helped stats (abs) min: 1.0 max: 42.0 x̄: 2.49 x̃: 2 helped stats (rel) min: 0.10% max: 33.33% x̄: 5.57% x̃: 5.26% HURT stats (abs) min: 1.0 max: 4.0 x̄: 1.21 x̃: 1 HURT stats (rel) min: 0.33% max: 6.67% x̄: 2.00% x̃: 1.14% 95% mean confidence interval for quadwords value: -2.46 -2.38 95% mean confidence interval for quadwords %-change: -5.52% -5.36% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14163>	2022-02-24 01:35:33 +00:00
Alyssa Rosenzweig	31b7ebcbc7	pan/mdg: Fix overflow in intra-bundle interference There are up to 4 instructions in the latter stage (if a branch is included), not 3. Bump the limit to fix memory corruption. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15147>	2022-02-23 20:42:33 +00:00
Alyssa Rosenzweig	abb7f04674	panfrost: Inline pan_emit_sfbd_tiler Easier to read, the common code was already common. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	910d4f8245	panfrost: Remove pan_emit_fbd thunking Use a common interface. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	099d61c95d	panfrost: Use txl instead of tex in the blitter We always blit from a particular level, so it's a waste to compute the LOD. This corresponds to a simple texture instruction with implement 0 LOD, which is the optimal texturing path on Bifrost -- it maps to TEXS_2D but does not require helper invocations. Functional change on Bifrost: Blit shaders no longer set .computed_lod or shader_contains_barrier. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	5b1a00c565	panfrost: Inline pan_blit_emit_dcd Easier to follow the logic without having a million arguments passed around. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	c9784c9512	panfrost: Decouple tiler job and DCD emit We can share the "emit quad" logic, even though the DCDs differ. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	1eb3dbafdb	panfrost: Set defaults for deprecated DCD fields There are always set to true. Don't pollute the driver code with them, make their existence a local detail to pre-Valhall XML and that's it. Functional change: "four components per vertex" is now set on vertex job DCDs. This should be a no-op. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	bd3d7e33b6	panfrost: Use pan_shader_prepare_rsd in blitter This reduces code duplication and will ease Valhall porting. Functional changes on v7: * Shader contains barrier is now set (perf loss, fixed later in series) * Shader register allocation is now set (perf win) * Point sprite inverted, no-op for blit shaders Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	6fc81f163e	pan/mdg: Fix partial execution mode names cont -> skip, last -> kill, and fix the special case handling. It's just an enum. Makes the disassembly easier to read and closer to Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	2e86767370	pan/bi: Add BIFROST_MESA_DEBUG=nosb option To disable the new scoreboarding optimizations when debugging. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	c81c022e66	pan/bi: Implement basic scoreboarding pass Extend our existing bi_scoreboard infrastructure with a simple data flow analysis pass that calculates which dependency slots need waiting. We still lack a heuristic for selecting dependency slots. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	8f25d88d90	pan/bi: Print scoreboarding state Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	6ad9a7f650	pan/bi: Add scoreboard state to IR To a limited degree, scoreboarding must be global, so add the data structures for tracking this to the IR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	91c02893d8	pan/bi: Clean up nits in liveness analysis Fix minor silly things. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	734a8bdc5d	pan/bi: Use bi_exit_block The "generic" one is a vestige of Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	75406a561f	pan/bi: Add bi_{start, exit}_block helpers Useful for data flow analysis. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	e5423bb129	pan/bi: Do not cull post-RA staging writes Bifrost post-RA dead code elimination can cull the destinations of regular ALU instructions, by weakening from a register write to a temporary write. However, there is no way to suppress staging writes, so culling the destinations will result in invalid code generation. Fixes a regression in dEQP-GLES3.functional.shaders.switch.switch_in_for_loop_static_vertex with scoreboarding. The root cause there is the backend dead code elimination not being sufficiently aggressive in the presence of control flow. Usually this does not matter, since the backend optimizations are intended to be local with global optimizations happening in NIR. Unfortunately, our implementation of IDVS hits this hard. That will need to be optimized (probably by specializing IDVS shaders in NIR instead of the backend). In the mean time, let's fix the actual bug affecting scoreboarding. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	87d46f40c8	pan/bi: Cull DTSEL_IMM dests in post-RA DCE They are useless (given the semantics of DTSEL_IMM) and complicate scoreboarding. Just remove them in the pass that removes all the other silly register destinations. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	956b969616	pan/bi: Clarify requirement for barriers Barriers need to wait on all outstanding messages. This is more of an API requirement than a hardware requirement, but it's still an invariant the scoreboarding pass must respect. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	606ac8d61e	pan/bi: Enable nir_opt_shrink_vectors total instructions in shared programs: 1939513 -> 1935815 (-0.19%) instructions in affected programs: 809066 -> 805368 (-0.46%) helped: 3195 HURT: 865 helped stats (abs) min: 1.0 max: 15.0 x̄: 1.99 x̃: 1 helped stats (rel) min: 0.10% max: 25.00% x̄: 2.26% x̃: 1.28% HURT stats (abs) min: 1.0 max: 22.0 x̄: 3.09 x̃: 2 HURT stats (rel) min: 0.10% max: 83.33% x̄: 2.67% x̃: 1.39% 95% mean confidence interval for instructions value: -1.00 -0.82 95% mean confidence interval for instructions %-change: -1.34% -1.08% Instructions are helped. total tuples in shared programs: 1523194 -> 1521789 (-0.09%) tuples in affected programs: 745526 -> 744121 (-0.19%) helped: 2947 HURT: 1844 helped stats (abs) min: 1.0 max: 18.0 x̄: 2.06 x̃: 1 helped stats (rel) min: 0.15% max: 25.00% x̄: 2.65% x̃: 1.59% HURT stats (abs) min: 1.0 max: 29.0 x̄: 2.54 x̃: 1 HURT stats (rel) min: 0.09% max: 40.00% x̄: 2.32% x̃: 1.52% 95% mean confidence interval for tuples value: -0.39 -0.20 95% mean confidence interval for tuples %-change: -0.85% -0.62% Tuples are helped. total clauses in shared programs: 329158 -> 325350 (-1.16%) clauses in affected programs: 111654 -> 107846 (-3.41%) helped: 2787 HURT: 498 helped stats (abs) min: 1.0 max: 17.0 x̄: 1.57 x̃: 1 helped stats (rel) min: 0.76% max: 40.00% x̄: 6.92% x̃: 5.26% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.14 x̃: 1 HURT stats (rel) min: 0.87% max: 50.00% x̄: 4.73% x̃: 3.77% 95% mean confidence interval for clauses value: -1.21 -1.10 95% mean confidence interval for clauses %-change: -5.39% -4.93% Clauses are helped. total cycles in shared programs: 172084.50 -> 166827.62 (-3.05%) cycles in affected programs: 74698.83 -> 69441.96 (-7.04%) helped: 3706 HURT: 568 helped stats (abs) min: 0.041665999999999315 max: 19.0 x̄: 1.44 x̃: 1 helped stats (rel) min: 0.24% max: 75.00% x̄: 9.48% x̃: 6.90% HURT stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.15 x̃: 0 HURT stats (rel) min: 0.25% max: 50.00% x̄: 2.21% x̃: 1.42% 95% mean confidence interval for cycles value: -1.28 -1.18 95% mean confidence interval for cycles %-change: -8.18% -7.67% Cycles are helped. total arith in shared programs: 57145.04 -> 57211.37 (0.12%) arith in affected programs: 27595.12 -> 27661.46 (0.24%) helped: 1933 HURT: 2259 helped stats (abs) min: 0.041665999999999315 max: 0.75 x̄: 0.09 x̃: 0 helped stats (rel) min: 0.16% max: 33.33% x̄: 2.74% x̃: 1.52% HURT stats (abs) min: 0.04166399999999726 max: 1.3333329999999997 x̄: 0.11 x̃: 0 HURT stats (rel) min: 0.10% max: 100.00% x̄: 2.79% x̃: 1.62% 95% mean confidence interval for arith value: 0.01 0.02 95% mean confidence interval for arith %-change: 0.07% 0.40% Arith are HURT. total texture in shared programs: 12857 -> 12857 (0.00%) texture in affected programs: 0 -> 0 helped: 0 HURT: 0 total vary in shared programs: 11157.75 -> 10222 (-8.39%) vary in affected programs: 5643 -> 4707.25 (-16.58%) helped: 3196 HURT: 0 helped stats (abs) min: 0.125 max: 1.875 x̄: 0.29 x̃: 0 helped stats (rel) min: 2.78% max: 75.00% x̄: 18.49% x̃: 15.00% 95% mean confidence interval for vary value: -0.30 -0.29 95% mean confidence interval for vary %-change: -18.88% -18.11% Vary are helped. total ldst in shared programs: 146420 -> 140270 (-4.20%) ldst in affected programs: 66027 -> 59877 (-9.31%) helped: 2942 HURT: 10 helped stats (abs) min: 1.0 max: 19.0 x̄: 2.09 x̃: 2 helped stats (rel) min: 0.90% max: 100.00% x̄: 16.81% x̃: 8.33% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 2.22% max: 50.00% x̄: 13.03% x̃: 3.33% 95% mean confidence interval for ldst value: -2.15 -2.02 95% mean confidence interval for ldst %-change: -17.53% -15.89% Ldst are helped. total quadwords in shared programs: 1398329 -> 1392117 (-0.44%) quadwords in affected programs: 704641 -> 698429 (-0.88%) helped: 3677 HURT: 1299 helped stats (abs) min: 1.0 max: 26.0 x̄: 2.51 x̃: 1 helped stats (rel) min: 0.10% max: 26.92% x̄: 2.64% x̃: 1.89% HURT stats (abs) min: 1.0 max: 20.0 x̄: 2.31 x̃: 1 HURT stats (rel) min: 0.11% max: 44.44% x̄: 2.34% x̃: 1.55% 95% mean confidence interval for quadwords value: -1.34 -1.16 95% mean confidence interval for quadwords %-change: -1.44% -1.25% Quadwords are helped. total threads in shared programs: 35234 -> 35311 (0.22%) threads in affected programs: 119 -> 196 (64.71%) helped: 91 HURT: 14 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.60 0.87 95% mean confidence interval for threads %-change: 70.08% 89.92% Threads are helped. total loops in shared programs: 125 -> 125 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 149 -> 144 (-3.36%) spills in affected programs: 22 -> 17 (-22.73%) helped: 1 HURT: 0 total fills in shared programs: 966 -> 956 (-1.04%) fills in affected programs: 44 -> 34 (-22.73%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Alyssa Rosenzweig	e0e63c2a8e	pan/bi: Specialize IDVS in NIR It's a bit more code, but it's needed to chew through control flow since we don't have a backend version of dead_cf. Results are really good, meaning I really screwed this up the first time around (hence the cc mesa-stable). total instructions in shared programs: 1963576 -> 1939513 (-1.23%) instructions in affected programs: 671053 -> 646990 (-3.59%) helped: 4436 HURT: 729 helped stats (abs) min: 1.0 max: 43.0 x̄: 5.75 x̃: 6 helped stats (rel) min: 0.21% max: 100.00% x̄: 6.47% x̃: 5.17% HURT stats (abs) min: 1.0 max: 22.0 x̄: 2.01 x̃: 1 HURT stats (rel) min: 0.50% max: 50.00% x̄: 10.45% x̃: 9.09% 95% mean confidence interval for instructions value: -4.77 -4.55 95% mean confidence interval for instructions %-change: -4.36% -3.80% Instructions are helped. total tuples in shared programs: 1533335 -> 1523194 (-0.66%) tuples in affected programs: 483167 -> 473026 (-2.10%) helped: 3414 HURT: 1288 helped stats (abs) min: 1.0 max: 20.0 x̄: 3.73 x̃: 2 helped stats (rel) min: 0.27% max: 100.00% x̄: 4.87% x̃: 3.03% HURT stats (abs) min: 1.0 max: 19.0 x̄: 2.02 x̃: 1 HURT stats (rel) min: 0.24% max: 38.10% x̄: 8.10% x̃: 5.88% 95% mean confidence interval for tuples value: -2.28 -2.03 95% mean confidence interval for tuples %-change: -1.62% -1.02% Tuples are helped. total clauses in shared programs: 351432 -> 329158 (-6.34%) clauses in affected programs: 142237 -> 119963 (-15.66%) helped: 5328 HURT: 3 helped stats (abs) min: 1.0 max: 43.0 x̄: 4.18 x̃: 4 helped stats (rel) min: 0.74% max: 100.00% x̄: 19.44% x̃: 17.24% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 9.09% max: 12.50% x̄: 10.90% x̃: 11.11% 95% mean confidence interval for clauses value: -4.25 -4.11 95% mean confidence interval for clauses %-change: -19.72% -19.12% Clauses are helped. total cycles in shared programs: 202830.92 -> 172084.50 (-15.16%) cycles in affected programs: 117078.42 -> 86332 (-26.26%) helped: 5450 HURT: 1 helped stats (abs) min: 0.083333 max: 49.0 x̄: 5.64 x̃: 5 helped stats (rel) min: 1.42% max: 100.00% x̄: 27.94% x̃: 25.64% HURT stats (abs) min: 0.25 max: 0.25 x̄: 0.25 x̃: 0 HURT stats (rel) min: 2.46% max: 2.46% x̄: 2.46% x̃: 2.46% 95% mean confidence interval for cycles value: -5.74 -5.54 95% mean confidence interval for cycles %-change: -28.30% -27.58% Cycles are helped. total arith in shared programs: 57274.29 -> 57145.04 (-0.23%) arith in affected programs: 16418.33 -> 16289.08 (-0.79%) helped: 2442 HURT: 1784 helped stats (abs) min: 0.041665999999999315 max: 0.75 x̄: 0.14 x̃: 0 helped stats (rel) min: 0.23% max: 100.00% x̄: 5.51% x̃: 2.87% HURT stats (abs) min: 0.041665999999999315 max: 0.9166670000000003 x̄: 0.12 x̃: 0 HURT stats (rel) min: 0.00% max: 100.00% x̄: 25.13% x̃: 9.09% 95% mean confidence interval for arith value: -0.04 -0.03 95% mean confidence interval for arith %-change: 6.61% 8.24% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total texture in shared programs: 12857 -> 12857 (0.00%) texture in affected programs: 0 -> 0 helped: 0 HURT: 0 total vary in shared programs: 11157.75 -> 11157.75 (0.00%) vary in affected programs: 0 -> 0 helped: 0 HURT: 0 total ldst in shared programs: 177208 -> 146420 (-17.37%) ldst in affected programs: 117098 -> 86310 (-26.29%) helped: 5447 HURT: 0 helped stats (abs) min: 1.0 max: 49.0 x̄: 5.65 x̃: 5 helped stats (rel) min: 1.92% max: 100.00% x̄: 27.91% x̃: 25.64% 95% mean confidence interval for ldst value: -5.75 -5.55 95% mean confidence interval for ldst %-change: -28.27% -27.56% Ldst are helped. total quadwords in shared programs: 1436507 -> 1398329 (-2.66%) quadwords in affected programs: 515101 -> 476923 (-7.41%) helped: 5150 HURT: 111 helped stats (abs) min: 1.0 max: 39.0 x̄: 7.46 x̃: 6 helped stats (rel) min: 0.17% max: 100.00% x̄: 10.02% x̃: 8.24% HURT stats (abs) min: 1.0 max: 9.0 x̄: 2.01 x̃: 1 HURT stats (rel) min: 0.43% max: 21.62% x̄: 3.57% x̃: 1.94% 95% mean confidence interval for quadwords value: -7.41 -7.11 95% mean confidence interval for quadwords %-change: -9.98% -9.49% Quadwords are helped. total threads in shared programs: 35025 -> 35228 (0.58%) threads in affected programs: 218 -> 421 (93.12%) helped: 208 HURT: 5 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.91 0.99 95% mean confidence interval for threads %-change: 93.40% 99.55% Threads are helped. total loops in shared programs: 128 -> 125 (-2.34%) loops in affected programs: 3 -> 0 helped: 3 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% total spills in shared programs: 158 -> 149 (-5.70%) spills in affected programs: 15 -> 6 (-60.00%) helped: 9 HURT: 0 total fills in shared programs: 1133 -> 966 (-14.74%) fills in affected programs: 197 -> 30 (-84.77%) helped: 9 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Alyssa Rosenzweig	3c1021cd1e	panvk: Use more reliable assert for UBO pushing The important thing isn't the number of words pushed, it's that there are no UBOs required for us to upload. Check that instead. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Alyssa Rosenzweig	e392dd8237	pan/bi: Promote MUX to CSEL in the scheduler Helps scheduling, and makes scheduling more predictable when deciding between MUX and CSEL. total tuples in shared programs: 1523328 -> 1516256 (-0.46%) tuples in affected programs: 509800 -> 502728 (-1.39%) helped: 1977 HURT: 181 helped stats (abs) min: 1.0 max: 48.0 x̄: 3.71 x̃: 2 helped stats (rel) min: 0.04% max: 14.29% x̄: 1.98% x̃: 1.28% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.43 x̃: 1 HURT stats (rel) min: 0.14% max: 7.69% x̄: 1.40% x̃: 0.70% 95% mean confidence interval for tuples value: -3.47 -3.08 95% mean confidence interval for tuples %-change: -1.79% -1.60% Tuples are helped. total clauses in shared programs: 350552 -> 349906 (-0.18%) clauses in affected programs: 34839 -> 34193 (-1.85%) helped: 570 HURT: 49 helped stats (abs) min: 1.0 max: 16.0 x̄: 1.22 x̃: 1 helped stats (rel) min: 0.67% max: 20.00% x̄: 3.26% x̃: 2.22% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.92% max: 16.67% x̄: 4.31% x̃: 4.17% 95% mean confidence interval for clauses value: -1.13 -0.96 95% mean confidence interval for clauses %-change: -2.95% -2.38% Clauses are helped. total cycles in shared programs: 202589.37 -> 202512.25 (-0.04%) cycles in affected programs: 7644.46 -> 7567.33 (-1.01%) helped: 771 HURT: 147 helped stats (abs) min: 0.041665999999999315 max: 1.8333360000000027 x̄: 0.11 x̃: 0 helped stats (rel) min: 0.16% max: 14.29% x̄: 2.10% x̃: 1.35% HURT stats (abs) min: 0.041665999999999315 max: 0.3333340000000007 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.24% max: 7.41% x̄: 1.49% x̃: 1.11% 95% mean confidence interval for cycles value: -0.09 -0.07 95% mean confidence interval for cycles %-change: -1.69% -1.36% Cycles are helped. total arith in shared programs: 56755.96 -> 56585.50 (-0.30%) arith in affected programs: 18746.29 -> 18575.83 (-0.91%) helped: 1605 HURT: 352 helped stats (abs) min: 0.04166399999999726 max: 1.8333360000000027 x̄: 0.12 x̃: 0 helped stats (rel) min: 0.07% max: 20.00% x̄: 1.92% x̃: 1.12% HURT stats (abs) min: 0.041665999999999315 max: 0.3333340000000007 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.17% max: 33.33% x̄: 2.09% x̃: 1.08% 95% mean confidence interval for arith value: -0.09 -0.08 95% mean confidence interval for arith %-change: -1.34% -1.07% Arith are helped. total quadwords in shared programs: 1429737 -> 1424670 (-0.35%) quadwords in affected programs: 418175 -> 413108 (-1.21%) helped: 1682 HURT: 198 helped stats (abs) min: 1.0 max: 35.0 x̄: 3.17 x̃: 2 helped stats (rel) min: 0.04% max: 13.33% x̄: 1.72% x̃: 1.29% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.38 x̃: 1 HURT stats (rel) min: 0.15% max: 7.41% x̄: 1.30% x̃: 0.92% 95% mean confidence interval for quadwords value: -2.86 -2.53 95% mean confidence interval for quadwords %-change: -1.48% -1.32% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	a8418abd74	pan/bi: Revert "Fix load_const of 1-bit booleans" This reverts commit `29d319c767`. Now that we use nir_lower_bool_to_bitsize, we don't see 1-bit booleans anymore, so the issue this fixed doesn't apply. Actually, that issue was (in part) why I started looking into boolean handling in the first place. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	21bdee7bcc	pan/bi: Switch to lower_bool_to_bitsize Instead of ingesting 1-bit booleans and trying to force everything to be 16-bit, except when it isn't, and creating a mess in the backend... just use the NIR pass designed to select bitsize for booleans. Yes, this means we need to handle more NIR instructions, but the handling is easier and the conversion is more obvious (except for some edge cases like 16-bit vectorized b32csel). This generates noticeably better code, and the generated code will be easier to optimize. total instructions in shared programs: 90257 -> 88941 (-1.46%) instructions in affected programs: 49145 -> 47829 (-2.68%) helped: 201 HURT: 2 helped stats (abs) min: 1.0 max: 40.0 x̄: 6.57 x̃: 3 helped stats (rel) min: 0.29% max: 13.89% x̄: 2.57% x̃: 1.90% HURT stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 HURT stats (rel) min: 2.15% max: 2.74% x̄: 2.45% x̃: 2.45% 95% mean confidence interval for instructions value: -7.71 -5.26 95% mean confidence interval for instructions %-change: -2.84% -2.20% Instructions are helped. total tuples in shared programs: 73740 -> 72922 (-1.11%) tuples in affected programs: 36564 -> 35746 (-2.24%) helped: 184 HURT: 7 helped stats (abs) min: 1.0 max: 74.0 x̄: 4.49 x̃: 2 helped stats (rel) min: 0.30% max: 16.67% x̄: 2.86% x̃: 1.89% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.29 x̃: 1 HURT stats (rel) min: 0.12% max: 12.50% x̄: 4.26% x̃: 3.33% 95% mean confidence interval for tuples value: -5.29 -3.28 95% mean confidence interval for tuples %-change: -3.06% -2.13% Tuples are helped. total clauses in shared programs: 15993 -> 15928 (-0.41%) clauses in affected programs: 2464 -> 2399 (-2.64%) helped: 35 HURT: 16 helped stats (abs) min: 1.0 max: 27.0 x̄: 2.31 x̃: 1 helped stats (rel) min: 0.49% max: 18.88% x̄: 7.63% x̃: 5.88% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.79% max: 6.25% x̄: 1.91% x̃: 1.01% 95% mean confidence interval for clauses value: -2.46 -0.09 95% mean confidence interval for clauses %-change: -6.38% -2.90% Clauses are helped. total cycles in shared programs: 7622.13 -> 7594.75 (-0.36%) cycles in affected programs: 1078.67 -> 1051.29 (-2.54%) helped: 103 HURT: 4 helped stats (abs) min: 0.041665999999999315 max: 3.0833319999999986 x̄: 0.27 x̃: 0 helped stats (rel) min: 0.32% max: 21.05% x̄: 3.62% x̃: 2.44% HURT stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.05 x̃: 0 HURT stats (rel) min: 0.13% max: 7.14% x̄: 2.94% x̃: 2.25% 95% mean confidence interval for cycles value: -0.33 -0.19 95% mean confidence interval for cycles %-change: -4.14% -2.61% Cycles are helped. total arith in shared programs: 2762.46 -> 2728.08 (-1.24%) arith in affected programs: 1550.12 -> 1515.75 (-2.22%) helped: 197 HURT: 6 helped stats (abs) min: 0.041665999999999315 max: 3.0833319999999986 x̄: 0.18 x̃: 0 helped stats (rel) min: 0.32% max: 21.05% x̄: 2.93% x̃: 1.61% HURT stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.13% max: 20.00% x̄: 5.78% x̃: 3.37% 95% mean confidence interval for arith value: -0.21 -0.13 95% mean confidence interval for arith %-change: -3.20% -2.15% Arith are helped. total quadwords in shared programs: 68155 -> 67555 (-0.88%) quadwords in affected programs: 27944 -> 27344 (-2.15%) helped: 151 HURT: 9 helped stats (abs) min: 1.0 max: 52.0 x̄: 4.09 x̃: 3 helped stats (rel) min: 0.23% max: 12.35% x̄: 2.87% x̃: 2.17% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.89 x̃: 1 HURT stats (rel) min: 0.20% max: 6.76% x̄: 1.91% x̃: 1.13% 95% mean confidence interval for quadwords value: -4.67 -2.83 95% mean confidence interval for quadwords %-change: -2.99% -2.21% Quadwords are helped. total threads in shared programs: 2232 -> 2233 (0.04%) threads in affected programs: 1 -> 2 (100.00%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	a64534754d	pan/bi: Handle vectorized u2f16/i2f16 Will be useful when we enable int16, I guess... No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	6a05852f5b	pan/bi: Handle trivial i2i32 lower_bool_to_bitsize can generate i2i32 from a 32-bit source, which is trivial but needs to be handled explicitly to avoid going down the 8-bit conversion path. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	f7d44a46cd	pan/bi: Optimize replication Bifrost's 16-bit support comes in the form of vectorized instructions, so when we manipulate scalars, we usually replicate to both bottom and top halves of 32-bit registers. Add an analysis pass that detects replication. Then, use that replication pass to optimize out useless swizzle instructions (by changing them to plain moves, which can be copypropped). This optimization is a slight shader-db win on its own, and allows us to transition to lower_bool_to_bitsize without regressing shader-db. total instructions in shared programs: 90323 -> 90257 (-0.07%) instructions in affected programs: 2513 -> 2447 (-2.63%) helped: 20 HURT: 0 helped stats (abs) min: 1.0 max: 16.0 x̄: 3.30 x̃: 2 helped stats (rel) min: 1.25% max: 11.11% x̄: 4.80% x̃: 4.29% 95% mean confidence interval for instructions value: -5.05 -1.55 95% mean confidence interval for instructions %-change: -6.06% -3.54% Instructions are helped. total tuples in shared programs: 73769 -> 73740 (-0.04%) tuples in affected programs: 1611 -> 1582 (-1.80%) helped: 17 HURT: 0 helped stats (abs) min: 1.0 max: 9.0 x̄: 1.71 x̃: 1 helped stats (rel) min: 0.58% max: 16.67% x̄: 4.80% x̃: 3.33% 95% mean confidence interval for tuples value: -2.70 -0.71 95% mean confidence interval for tuples %-change: -7.06% -2.54% Tuples are helped. total clauses in shared programs: 15997 -> 15993 (-0.03%) clauses in affected programs: 27 -> 23 (-14.81%) helped: 4 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 7.69% max: 25.00% x̄: 18.17% x̃: 20.00% 95% mean confidence interval for clauses value: -1.00 -1.00 95% mean confidence interval for clauses %-change: -29.91% -6.44% Clauses are helped. total cycles in shared programs: 7623.13 -> 7622.13 (-0.01%) cycles in affected programs: 64.83 -> 63.83 (-1.54%) helped: 13 HURT: 0 helped stats (abs) min: 0.0416660000000002 max: 0.375 x̄: 0.08 x̃: 0 helped stats (rel) min: 1.02% max: 5.56% x̄: 2.82% x̃: 2.50% 95% mean confidence interval for cycles value: -0.13 -0.02 95% mean confidence interval for cycles %-change: -3.79% -1.85% Cycles are helped. total arith in shared programs: 2763.75 -> 2762.46 (-0.05%) arith in affected programs: 67.17 -> 65.88 (-1.92%) helped: 18 HURT: 0 helped stats (abs) min: 0.0416660000000002 max: 0.375 x̄: 0.07 x̃: 0 helped stats (rel) min: 1.02% max: 22.22% x̄: 5.68% x̃: 3.16% 95% mean confidence interval for arith value: -0.11 -0.03 95% mean confidence interval for arith %-change: -8.56% -2.80% Arith are helped. total quadwords in shared programs: 68173 -> 68155 (-0.03%) quadwords in affected programs: 1258 -> 1240 (-1.43%) helped: 14 HURT: 0 helped stats (abs) min: 1.0 max: 3.0 x̄: 1.29 x̃: 1 helped stats (rel) min: 0.42% max: 8.70% x̄: 3.88% x̃: 3.67% 95% mean confidence interval for quadwords value: -1.64 -0.93 95% mean confidence interval for quadwords %-change: -5.27% -2.49% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	35ff537814	pan/bi: Constant fold swizzles on constants This lets us avoid generating SWZ instructions. Those instructions could be constant folded but that complicates the replication analysis introduced in the next commit. Almost no shader-db changes. quadwords HURT: shaders/glmark/1-22.shader_test MESA_SHADER_FRAGMENT: 718 -> 722 (0.56%) total quadwords in shared programs: 68169 -> 68173 (<.01%) quadwords in affected programs: 718 -> 722 (0.56%) helped: 0 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	62533a6e64	pan/bi: Lower swizzles on MUX.v2i16 We'll generate this in a moment. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	8bd4976d98	pan/bi: Lower swizzles on CSEL.i32/MUX.i32 This is counter-intuitive, but required for correct operation when CSEL.i32 takes a 1-bit (stored 16-bit) boolean argument. The impedance mismatch ultimately is between CSEL.b32 (nir's bcsel, nonexistant in the hardware) and the lowering CSEL.i32. However, a similar problem exists even with MUX.i32 which lacks a good way of zero/sign-extending booleans. Cherry-picked from my Valhall branch though the issue also affects Bifrost. Fixes piglit shaders@glsl-vs-if-bool on Bifrost. Unfortunately, shader-db is quite unhappy :-( The proper fix is to use lower_bool_to_bitsize, but that can't be backported to mesa-stable. total instructions in shared programs: 157539 -> 158953 (0.90%) instructions in affected programs: 55621 -> 57035 (2.54%) helped: 2 HURT: 259 helped stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 helped stats (rel) min: 2.11% max: 2.67% x̄: 2.39% x̃: 2.39% HURT stats (abs) min: 1.0 max: 40.0 x̄: 5.47 x̃: 2 HURT stats (rel) min: 0.36% max: 16.13% x̄: 2.55% x̃: 1.59% 95% mean confidence interval for instructions value: 4.44 6.40 95% mean confidence interval for instructions %-change: 2.21% 2.82% Instructions are HURT. total tuples in shared programs: 132322 -> 132907 (0.44%) tuples in affected programs: 31806 -> 32391 (1.84%) helped: 5 HURT: 152 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.40 x̃: 1 helped stats (rel) min: 0.39% max: 3.03% x̄: 1.70% x̃: 1.61% HURT stats (abs) min: 1.0 max: 42.0 x̄: 3.89 x̃: 2 HURT stats (rel) min: 0.29% max: 18.18% x̄: 2.50% x̃: 1.79% 95% mean confidence interval for tuples value: 2.88 4.58 95% mean confidence interval for tuples %-change: 1.87% 2.85% Tuples are HURT. total clauses in shared programs: 28672 -> 28698 (0.09%) clauses in affected programs: 869 -> 895 (2.99%) helped: 1 HURT: 24 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 5.88% max: 5.88% x̄: 5.88% x̃: 5.88% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.12 x̃: 1 HURT stats (rel) min: 0.49% max: 33.33% x̄: 8.46% x̃: 3.59% 95% mean confidence interval for clauses value: 0.82 1.26 95% mean confidence interval for clauses %-change: 3.84% 11.93% Clauses are HURT. total cycles in shared programs: 15119.04 -> 15137.88 (0.12%) cycles in affected programs: 922.87 -> 941.71 (2.04%) helped: 4 HURT: 79 helped stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.05 x̃: 0 helped stats (rel) min: 0.40% max: 3.17% x̄: 1.57% x̃: 1.35% HURT stats (abs) min: 0.041665999999999315 max: 1.75 x̄: 0.24 x̃: 0 HURT stats (rel) min: 0.30% max: 20.00% x̄: 2.83% x̃: 2.12% 95% mean confidence interval for cycles value: 0.17 0.29 95% mean confidence interval for cycles %-change: 1.86% 3.37% Cycles are HURT. total arith in shared programs: 4922.71 -> 4947.71 (0.51%) arith in affected programs: 1423.79 -> 1448.79 (1.76%) helped: 5 HURT: 177 helped stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.06 x̃: 0 helped stats (rel) min: 0.40% max: 3.17% x̄: 1.82% x̃: 1.67% HURT stats (abs) min: 0.041665999999999315 max: 1.75 x̄: 0.14 x̃: 0 HURT stats (rel) min: 0.30% max: 22.22% x̄: 2.50% x̃: 1.52% 95% mean confidence interval for arith value: 0.11 0.17 95% mean confidence interval for arith %-change: 1.86% 2.90% Arith are HURT. total quadwords in shared programs: 120605 -> 120956 (0.29%) quadwords in affected programs: 26535 -> 26886 (1.32%) helped: 6 HURT: 143 helped stats (abs) min: 1.0 max: 7.0 x̄: 2.83 x̃: 1 helped stats (rel) min: 0.93% max: 6.33% x̄: 2.29% x̃: 1.71% HURT stats (abs) min: 1.0 max: 21.0 x̄: 2.57 x̃: 2 HURT stats (rel) min: 0.34% max: 13.79% x̄: 2.02% x̃: 1.22% 95% mean confidence interval for quadwords value: 1.86 2.86 95% mean confidence interval for quadwords %-change: 1.45% 2.24% Quadwords are HURT. total threads in shared programs: 4670 -> 4669 (-0.02%) threads in affected programs: 2 -> 1 (-50.00%) helped: 0 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	9168dcbbc1	pan/bi: Disambiguate IDVS variants in shader-db Label IDVS variants as being MESA_SHADER_{POSITION, VARYING} stages; reserve the MESA_SHADER_VERTEX label for non-IDVS shaders. This reduces confusion where a single shader compiles to two MESA_SHADER_VERTEX shaders with different stats. While we're at it, de-vendor the blend shader stage name; these stats are internal anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15086>	2022-02-19 00:01:07 +00:00
Alyssa Rosenzweig	8f4b3c749e	pan/bi: Test avoiding FADD.v2f16 hazards in scheduler There are many of them, and integration testing of the scheduler won't hit every case. Add targeted unit tests for the various scheduling hazards of this funny instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	9d95561c93	pan/bi: Test avoiding *FADD.v2f16 hazard in optimizer This hazard exists but is obscure enough to be missed on our existing test coverage (e.g the conformance tests). Add piles of unit tests for it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	24d2bdb1e0	pan/bi: Avoid *FADD.v2f16 hazard in scheduler Obscure encoding restriction. Fixes crash (assertion fail when instruction packing) in asphalt9/2659.shader_test on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	8e0eb592d5	pan/bi: Avoid *FADD.v2f16 hazard in optimizer This is a very obscure encoding restriction in the Bifrost ISA. Unknown if any real apps or tests hit this, but we still need to get it right sadly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	c2178d09d0	pan/va: Identify LEA_TEX_IMM table Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	839f15259a	pan/va: Fix conservative branch handling Mixed up lanes and conservative branch combine. Fix that. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	81a9c857c8	pan/va: Make subgroup 4-bits Future proofing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00

1 2 3 4 5 ...

3620 commits