fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-28 12:08:24 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	ba209fe493	agx: Implement formatted loads These will be generated by the UBO and VBO lowerings. (and eventually by other lowerings too?) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Alyssa Rosenzweig	580f25a266	agx: Add shift to device_load We'll use this as an optimization soon. This acts in addition to the format's shift. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Alyssa Rosenzweig	1555ac6f0b	agx: Clamp point sizes Fixes vs-point_size-zero. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20017>	2022-12-01 05:58:30 +00:00
Alyssa Rosenzweig	7108619c0d	agx: Handle 32-bit gl_FragCoord.zw The coefficient register is 16-bit so our builder will make the iter 16-bit too (maybe not the best design...), force fp32 to match the NIR intrinsic. Fixes glsl-fs-fragcoord-zw-ortho Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20017>	2022-12-01 05:58:30 +00:00
Alyssa Rosenzweig	eb4187b02d	agx: Handle large varying indices Fixes glsl-max-varyings. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20017>	2022-12-01 05:58:30 +00:00
Alyssa Rosenzweig	6de5bd5f41	agx: Fix signedness issues packing UBSan complains otherwise: ../src/asahi/compiler/agx_pack.c:701:21: runtime error: left shift of 1 by 31 places cannot be represented in type 'int' ../src/asahi/compiler/agx_pack.c:534:18: runtime error: left shift of 8 by 28 places cannot be represented in type 'int' Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	d608ca0363	agx: Handle vertex shaders that use <= 8 halfregs r5 and r6 are always getting lowered. Will prevent a regression with VBO lowering on a shader which has stride=0 and hence gets the vertex ID read optimized out with NIR: dEQP-GLES2.functional.draw.random.50 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	94124925ca	agx: Try to align sources of pack_64_2x32_split Helps with coalescing the pack. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Alyssa Rosenzweig	442e29890d	agx: Implement nir_op_pack_64_2x32_split This maps to a collect where the dest size is 64 and the src size is 32. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19971>	2022-11-24 23:37:48 +00:00
Yonggang Luo	40a9fc57aa	tree-wide: Use __func__ instead of __FUNCTION__ in non-gallium code Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19861>	2022-11-22 06:53:46 +00:00
Alyssa Rosenzweig	74e92274af	asahi,agx: Use new tilebuffer infrastructure Flag day change to replace the previous hardcoded background/end-of-tile shaders and the API-style load/store_output in fragment shaders with the generated shaders and lowered *_agx intrinsics. This gets us working non-UNORM8 render targets and working MRT. It's also a step in the direction of working MSAA but that needs a lot more work, since the multisampling programming model on AGX is quite different from any of the APIs (including Metal). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	4a166acc93	agx: Add block_image_store instruction This hw instruction writes out an entire block from the tilebuffer to an attached render target (PBE descriptor). It is used (only?) in end-of-tile shaders to implement write out. We need to handle it in the compiler as a prerequisite to compiling end-of-tile shaders ourselves, instead of hardcoding. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	0e106681e0	agx: Add helper to map pipe formats to agx_formats Or a restricted subset thereof anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	db0461a8d0	agx: Implement nir_texop_txf_ms Mutlisampled texture fetch (txf_ms) is encoded like regular txf. However, we now need to pack the multisample index in the right place, which we do by extending our existing NIR texture source lowering pass. 2D MS arrays use a new value of dim which requires tweaking the encoding slightly. Otherwise, everything is bog standard. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	6ee6cfec41	asahi: Use PIPE_FORMATs for driver-compiler ABI This avoids exposing the ISA-internal agx_format to the driver, instead hiding it behind a real PIPE_FORMAT. This lets us use real pipe formats in formatted intrinsics in NIR, which is convenient; it will allow us to simplify the compiler/driver ABI; and it lets us use common format helpers (e.g. util_format_get_blocksize) for the internal formats in driver lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19871>	2022-11-19 20:25:41 +00:00
Alyssa Rosenzweig	6a12d793d8	agx: Handle collects in backwards isel Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	3b9d271646	agx: Assert more invariants in RA Was helpful for debugging. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	c2159ce9e4	agx: Validate part of SSA form To debug backend pass problems. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	1110fcccc2	agx: Split off NIR preprocessing from compiling So we can specialize after lowering I/O. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	972354b5fd	agx: Handle scalar texture destinations Fixes dEQP-GLES3.functional.shaders.texture_functions.texturelod.sampler2dshadow_fragment. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	a92fb4f38c	agx: Don't depend on GenXML Separation of concerns, unused #include. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Alyssa Rosenzweig	3789dba5f6	agx: Lower packs/unpacks and bitfields Needed for GLES3. These could be optimized. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19811>	2022-11-19 04:27:10 +00:00
Yonggang Luo	e399dc3544	util: normalize include files under src/util/*.h with util/ prefix in mesa code base Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19546>	2022-11-10 06:27:25 +00:00
Alyssa Rosenzweig	35a531fcd4	agx: Don't assert on texop twice This is already asserted for lod modes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	ededb108d9	agx: Implement unary math ops Implement nir_op_bitfield_reverse, nir_op_bit_count, and nir_op_ufind_msb. These map to native instructions. With appropriate integer render target and multiple render target support, passes: dEQP-GLES31.functional.shaders.builtin_functions.integer.bitfieldreverse.vertex dEQP-GLES31.functional.shaders.builtin_functions.integer.bitfieldreverse.fragment dEQP-GLES31.functional.shaders.builtin_functions.integer.bitcount.vertex dEQP-GLES31.functional.shaders.builtin_functions.integer.bitcount.fragment dEQP-GLES31.functional.shaders.builtin_functions.integer.findLSB.vertex dEQP-GLES31.functional.shaders.builtin_functions.integer.findLSB.fragment dEQP-GLES31.functional.shaders.builtin_functions.integer.findMSB.vertex dEQP-GLES31.functional.shaders.builtin_functions.integer.findMSB.fragment Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	44ccdca768	agx: Implement {i,u}mul_2x32_64 With support for MRT in the driver (not included here), passes: dEQP-GLES31.functional.shaders.builtin_functions.integer.imulextended.int_highp_fragment dEQP-GLES31.functional.shaders.builtin_functions.integer.umulextended.int_highp_fragment Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	74a884f73c	agx: Implement nir_op_unpack_64_2x32_split_{x,y} Used in the umul_extended lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	ea88ebefb9	agx/ra: Remove index_to_reg Use stronger asserts instead. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	dea00bcc8f	agx: Add CSE optimization pass Ported from the Bifrost compiler, in turn based on the ir3 one. This cleans up a lot of junk we emit during NIR->AGX and will help with some SSA RA troubles. total instructions in shared programs: 34803 -> 34381 (-1.21%) instructions in affected programs: 18652 -> 18230 (-2.26%) helped: 198 HURT: 0 helped stats (abs) min: 1.0 max: 28.0 x̄: 2.13 x̃: 1 helped stats (rel) min: 0.31% max: 12.50% x̄: 3.94% x̃: 2.78% 95% mean confidence interval for instructions value: -2.45 -1.81 95% mean confidence interval for instructions %-change: -4.40% -3.48% Instructions are helped. total bytes in shared programs: 238094 -> 234824 (-1.37%) bytes in affected programs: 126472 -> 123202 (-2.59%) helped: 200 HURT: 0 helped stats (abs) min: 6.0 max: 168.0 x̄: 16.35 x̃: 8 helped stats (rel) min: 0.37% max: 17.65% x̄: 4.25% x̃: 3.38% 95% mean confidence interval for bytes value: -18.49 -14.21 95% mean confidence interval for bytes %-change: -4.67% -3.84% Bytes are helped. total halfregs in shared programs: 10078 -> 10107 (0.29%) halfregs in affected programs: 565 -> 594 (5.13%) helped: 22 HURT: 22 helped stats (abs) min: 1.0 max: 4.0 x̄: 1.23 x̃: 1 helped stats (rel) min: 5.71% max: 25.00% x̄: 23.38% x̃: 25.00% HURT stats (abs) min: 2.0 max: 4.0 x̄: 2.55 x̃: 2 HURT stats (rel) min: 4.44% max: 30.77% x̄: 15.61% x̃: 12.73% 95% mean confidence interval for halfregs value: 0.03 1.28 95% mean confidence interval for halfregs %-change: -10.17% 2.40% Inconclusive result (%-change mean confidence interval includes 0). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	4387d0886d	agx: Describe whether instructions may be reordered As per NIR, for the benefit of CSE. It is assumed that instructions that cannot be eliminated also cannot be reordered. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	27869f6966	agx: Add and use replace_src helper From Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	adf3cbc04c	agx: Use nir_opt_phi_precision No shader-db changes, but helped a custom shader I wrote to test loops. My shader-db is too small. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	98f0ebf264	agx: Pass agx_index to agx_copy More straightforward interface and will allow including immediates later if we want to. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	023f27fada	agx: Coalesce collects when possible Track collects and use them as affinities when choosing registers. On glmark2: total instructions in shared programs: 5498 -> 5388 (-2.00%) instructions in affected programs: 2748 -> 2638 (-4.00%) helped: 31 HURT: 0 helped stats (abs) min: 1.0 max: 12.0 x̄: 3.55 x̃: 3 helped stats (rel) min: 0.09% max: 57.14% x̄: 10.58% x̃: 5.97% 95% mean confidence interval for instructions value: -4.61 -2.49 95% mean confidence interval for instructions %-change: -15.16% -6.00% Instructions are helped. total bytes in shared programs: 37280 -> 36620 (-1.77%) bytes in affected programs: 18880 -> 18220 (-3.50%) helped: 31 HURT: 0 helped stats (abs) min: 6.0 max: 72.0 x̄: 21.29 x̃: 18 helped stats (rel) min: 0.07% max: 48.98% x̄: 9.16% x̃: 5.17% 95% mean confidence interval for bytes value: -27.64 -14.94 95% mean confidence interval for bytes %-change: -13.03% -5.29% Bytes are helped. total halfregs in shared programs: 1267 -> 1279 (0.95%) halfregs in affected programs: 37 -> 49 (32.43%) helped: 0 HURT: 9 HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.33 x̃: 1 HURT stats (rel) min: 16.67% max: 66.67% x̄: 35.58% x̃: 28.57% 95% mean confidence interval for halfregs value: 0.95 1.72 95% mean confidence interval for halfregs %-change: 21.50% 49.67% Halfregs are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	4cc2427ad6	agx: Introduce agx_foreach_ssa_{src,dest} macros These are convenient iterators especially in the register allocator. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	4971870441	agx/ra: Factor out assign_regs Prepare to record bookkeeping needed for live range splits. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	2b806b5cf8	agx/ra: Use BITSET_*_RANGE in some places A bit neater. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	be5357a353	agx: Free dests of splits that are never read Otherwise the registers "leak", bloating register pressure by arbitrarily large amounts. This is easy to handle in DCE by rewriting to a null destination, though we could use a sideband channel if we didn't want null destinations in the IR. glmark2 subset of shader-db is much improved: total instructions in shared programs: 7324 -> 7313 (-0.15%) instructions in affected programs: 483 -> 472 (-2.28%) helped: 5 HURT: 2 total bytes in shared programs: 42788 -> 42722 (-0.15%) bytes in affected programs: 2808 -> 2742 (-2.35%) helped: 5 HURT: 2 total halfregs in shared programs: 2421 -> 2058 (-14.99%) halfregs in affected programs: 1235 -> 872 (-29.39%) helped: 28 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	9a48c35668	agx: Refuse to handle discontiguous iter This will cause problems with register allocation. instructions HURT: shaders/glmark/1-24.shader_test MESA_SHADER_FRAGMENT: 135 -> 136 (0.74%) instructions HURT: shaders/glmark/1-8.shader_test MESA_SHADER_FRAGMENT: 84 -> 85 (1.19%) bytes HURT: shaders/glmark/1-24.shader_test MESA_SHADER_FRAGMENT: 914 -> 922 (0.88%) bytes HURT: shaders/glmark/1-8.shader_test MESA_SHADER_FRAGMENT: 574 -> 580 (1.05%) halfregs helped: shaders/glmark/1-8.shader_test MESA_SHADER_FRAGMENT: 20 -> 19 (-5.00%) halfregs helped: shaders/glmark/1-24.shader_test MESA_SHADER_FRAGMENT: 25 -> 23 (-8.00%) halfregs helped: shaders/glmark/7-3.shader_test MESA_SHADER_FRAGMENT: 11 -> 10 (-9.09%) halfregs helped: shaders/glmark/4-2.shader_test MESA_SHADER_FRAGMENT: 23 -> 19 (-17.39%) total instructions in shared programs: 5716 -> 5718 (0.03%) instructions in affected programs: 219 -> 221 (0.91%) helped: 0 HURT: 2 total bytes in shared programs: 38118 -> 38132 (0.04%) bytes in affected programs: 1488 -> 1502 (0.94%) helped: 0 HURT: 2 total halfregs in shared programs: 1639 -> 1631 (-0.49%) halfregs in affected programs: 79 -> 71 (-10.13%) helped: 4 HURT: 0 helped stats (abs) min: 1.0 max: 4.0 x̄: 2.00 x̃: 1 helped stats (rel) min: 5.00% max: 17.39% x̄: 9.87% x̃: 8.55% 95% mean confidence interval for halfregs value: -4.25 0.25 95% mean confidence interval for halfregs %-change: -18.31% -1.43% Inconclusive result (value mean confidence interval includes 0). Total CPU time (seconds): 11.41 -> 11.72 (2.72%) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Alyssa Rosenzweig	af2137883c	agx: Don't emit writeout 0xC200 Metal omits this in OpenGL mode, and since we have no clue what it does, I see no reason for us not to do the same. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19590>	2022-11-10 02:25:09 +00:00
Yonggang Luo	bfa3ce44a6	mesa: Move glheader.h from mesa/main/glheader.h to util/glheader.h So it's can be accessed in broader places Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Acked-by: Brian Paul brianp@vmware.com Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19472>	2022-11-03 16:07:31 +00:00
Alyssa Rosenzweig	3570e94bcc	agx: Use agx_nir_opt_preamble Now that everything is in place, we can actually take advantage of preambles. This wins us a crude form of UBO pushing (accounting for most of the win here), as well as its intended purpose of optimizing uniform-on-uniform arithmetic. shader-db results are excellent. The shader that's regressed for instruction count is a fragment shader that solely consists of `gl_FragColor = uniform`, which goes from a vectorized UBO load to four scalar moves. That's more instructions (and more bytes) but presumably faster, since ALU should be much cheaper than load/store. total instructions in shared programs: 6502 -> 5764 (-11.35%) instructions in affected programs: 5136 -> 4398 (-14.37%) helped: 60 HURT: 1 helped stats (abs) min: 2.0 max: 47.0 x̄: 12.33 x̃: 8 helped stats (rel) min: 0.84% max: 34.48% x̄: 18.69% x̃: 21.05% HURT stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 HURT stats (rel) min: 33.33% max: 33.33% x̄: 33.33% x̃: 33.33% 95% mean confidence interval for instructions value: -14.69 -9.51 95% mean confidence interval for instructions %-change: -20.49% -15.20% Instructions are helped. total bytes in shared programs: 42186 -> 38310 (-9.19%) bytes in affected programs: 33182 -> 29306 (-11.68%) helped: 60 HURT: 1 helped stats (abs) min: 10.0 max: 272.0 x̄: 64.83 x̃: 50 helped stats (rel) min: 0.72% max: 30.00% x̄: 15.16% x̃: 16.67% HURT stats (abs) min: 14.0 max: 14.0 x̄: 14.00 x̃: 14 HURT stats (rel) min: 31.82% max: 31.82% x̄: 31.82% x̃: 31.82% 95% mean confidence interval for bytes value: -77.73 -49.35 95% mean confidence interval for bytes %-change: -16.66% -12.11% Bytes are helped. total halfregs in shared programs: 2370 -> 1639 (-30.84%) halfregs in affected programs: 1804 -> 1073 (-40.52%) helped: 60 HURT: 0 helped stats (abs) min: 1.0 max: 40.0 x̄: 12.18 x̃: 8 helped stats (rel) min: 3.85% max: 72.73% x̄: 41.37% x̃: 36.17% 95% mean confidence interval for halfregs value: -14.77 -9.60 95% mean confidence interval for halfregs %-change: -46.00% -36.75% Halfregs are helped. Total CPU time (seconds): 2.71 -> 2.80 (3.32%) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 15:00:12 -04:00
Alyssa Rosenzweig	5e8b0289c3	agx: Add agx_nir_opt_preamble pass This pass creates preamble shaders. The heavylifting is done by nir_opt_preamble. We do need to define the cost model for nir_opt_preamble, set up 16-bit units for the register file, and scalarize the resulting load/store_preamble intrinsics. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 15:00:07 -04:00
Alyssa Rosenzweig	ec9eae99b1	agx: Report GPRs to the driver This needs to be passed onto the hardware. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 15:00:01 -04:00
Alyssa Rosenzweig	6e32826345	agx: Avoid reading high uniforms from device_load This does not seem to be possible architecturally. Exhaustively checked all bits of the encoding. Avoids regressing dEQP-GLES3.functional.texture.units.8_units.only_2d.0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:59:59 -04:00
Alyssa Rosenzweig	5bd245d2cd	agx: Handle 64-bit moves lower_resinfo generates some 64-bit math, so we need to handle it. Even though we don't have native 64-bit moves, it's convenient to pretend we do to avoid special cases in the IR. In particular, modelling 64-bit mov_imm in the IR means our existing small constant propagation code works, with zero-extension from 8->64. Fixes dEQP-GLES3.functional.texture.units.2_units.only_2d_array.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:59:52 -04:00
Alyssa Rosenzweig	1521d9c58c	agx: Restrict copyprop of uniforms Some instructions don't accept uniform registers as sources (yet?), avoid this hazard in the optimizer. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:59:51 -04:00
Alyssa Rosenzweig	cef13f8ab1	agx: Handle uniforms passed to COLLECT It's useful to be able to copyprop uniform registers into COLLECT. That requires handling of uniform registers in the parallel copy lowering, which isn't too hard to add. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:59:48 -04:00
Alyssa Rosenzweig	056280a4a1	agx: Implement scalar load/store_preamble These need to copy values between GPRs and uniform registers. This is pretty easy in either direction. This implements scalar versions of the intrinsics. A backend NIR pass will scalarize for us. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:59:46 -04:00
Alyssa Rosenzweig	14fe5bc598	agx: Strengthen assert for packing ld/st instructions We really need to autogenerate the packing code... It's on the todo list, currently in discussions on how to best go about this. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18813>	2022-10-22 14:59:44 -04:00

1 2 3 4 5 ...

329 commits