fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 14:58:10 +02:00

Author	SHA1	Message	Date
Pavel Ondračka	3fcdd9e4a7	nir/lower_bool: ntt: Generate a good opcode for bcsel This is heavily copy-pasted from a patch of Ian Romanick, including the commit message. Previously, this pass always generated fcsel for bcsel. This was the only place that generate fcsel, so various drivers assumed (and needed!) that src0 was a Boolean with 0.0 or 1.0 as the only values. Specifically, many DX9 / GL_ARB_vertex_program platforms lack a CMP instruction in vertex shaders. In those cases, they would use LRP to implement fcsel. The bummer is that many plaforms have a real fcsel instruction, and those platforms would benefit from other places generating that opcode. Instead of leaving assumptions in drivers about the sources of an opcode that they can't really support, allow them to control the way the lowering pass translates bcsel. Two flags are used to control this: - If the driver sets has_fused_comp_and_csel in nir_options, fcsel_gt will be used. Since the Boolean value is 0.0 or 1.0, this is equivalent to fcsel. - If the parameter has_fcsel_ne is set, fcsel will be used. This is the old path. - Otherwise, the lowering pass assumes we're on a crufty, old DX9 vertex program, and it emits flrp. With this, the assumptions about src0 of fcsel in NTT can be removed. If a platform can't handle fcsel, it should ensure that the lowering pass won't generate it. No change in shader-db. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20162>	2023-01-12 23:01:05 +00:00
Alyssa Rosenzweig	161e35e156	lima: Use u_default_set_debug_callback Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18351>	2022-09-01 14:50:24 +00:00
Jason Ekstrand	c5c90e6ea4	lima: Lower undefs to zero and run DCE after from_ssa Undefs can happen even in real GLSL shaders so it's best to handle them. Lowering to zero is a perfectly valid implementation. Also, run DCE because some of the undefs may be dead after from_ssa and there's no point in processing those in the back-end. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16817>	2022-08-01 22:13:24 +00:00
Vasily Khoruzhick	5ec0301ec7	lima: Lower nir_op_idiv with other scalar math operations. This will end up appearing in the backend once we move GLSL's idiv->fmul+frcp lowering to NIR. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16823>	2022-06-07 02:38:42 +00:00
Timothy Arceri	bb2f97eb27	lima: switch to NIR loop unrolling Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Timothy Arceri	9e1ce5de40	lima: fixup nir indirect unroll options to match gallium CAP Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Timothy Arceri	53df6dfb5b	lima: lower all undefs to zero in vs Otherwise we will later hit: gpir_error("nir_ssa_undef_instr is not supported\n"); Unfortunatly this causes a piglit failure due to increased register pressure in an unrealistic shader but since not doing this can result in hitting the not supported error in more relistic shaders this seems the right thing to do for now. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16366>	2022-06-04 16:11:49 +00:00
Timothy Arceri	d7a071a28f	gallium/drivers: set force_indirect_unrolling_sampler for all required drivers This is set to true for all drivers that have a GLSL level of support lower than 4.00. This matches the rule for setting the GLSL IR option EmitNoIndirectSampler. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16543>	2022-05-17 02:12:21 +00:00
Emma Anholt	536c8ee96d	nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. This controls the whole lowering of "make tex ops with implicit derivatives on non-implicit-derivative stages be tex ops with an explicit lod of 0 instead", but it's really hard to describe that in a git commit summary. All existing callers get it added except: - nir_to_tgsi which didn't want it. - nouveau, which didn't want it (fixes regressions in shadowcube and shadow2darray with NIR, since the shading languages don't expose txl of those sampler types and thus it's not supported in HW) - optional lowering passes in mesa/st (lower_rect, YUV lowering, etc) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16156>	2022-04-28 21:26:08 +00:00
Qiang Yu	aaf951c47f	lima: enable nir lower_varying_from_uniform Mali GPU pass varying by memory, so enable this optimization. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15341>	2022-03-22 01:33:23 +00:00
Vasily Khoruzhick	437b97de1c	lima: fix crash with sparse samplers Fixes following piglit tests: spec@arb_fragment_program@fp-fragment-position spec@arb_fragment_program@sparse-samplers Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13939>	2021-11-29 18:19:19 +00:00
Vasily Khoruzhick	764760314d	lima: add native txp support Currently lima uses generic TXP lowering that results in downgrading coords precision to FP16 since we have to do some calculations with coords instead of loading them directly from varying. Mali4x0 has native TXP support, however coords and projector have to come from a single source. Add NIR lowering pass that combines coords and projector into a single backend-specific source and use it instead of generic lowering. Unfortunately this change regresses one test, but it also fails in blob and disassembly is now identical. shader-db diff: total instructions in shared programs: 15623 -> 15603 (-0.13%) instructions in affected programs: 877 -> 857 (-2.28%) helped: 7 HURT: 0 helped stats (abs) min: 2 max: 8 x̄: 2.86 x̃: 2 helped stats (rel) min: 0.87% max: 10.53% x̄: 4.93% x̃: 1.85% 95% mean confidence interval for instructions value: -4.95 -0.76 95% mean confidence interval for instructions %-change: -9.31% -0.55% Instructions are helped. total loops in shared programs: 3 -> 3 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 136 -> 137 (0.74%) spills in affected programs: 0 -> 1 helped: 0 HURT: 1 total fills in shared programs: 598 -> 602 (0.67%) fills in affected programs: 0 -> 4 helped: 0 HURT: 1 Tested-by: Denis Pauk <pauk.denis@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13111>	2021-11-16 19:13:42 +00:00
Connor Abbott	3d957b40cc	lima: Add a NIR load duplicating pass and use it with vertex shaders. Note: The commit was originally authored by Connor Abbott <cwabbott@gmail.com> and was cherry-picked from <mesa/mesa!2315>. Apart from some changes, which were necessary due to rebasing, the following changes have been added: clone_intrinsic() was changed to use nir_instr_clone() instead of doing it manually. Tests against `src->parent_instr->type != nir_instr_type_phi` have been inserted, otherwise we may run into a nir validation error. Intrinsic load_input and load_uniform are not duplicated, if their source type is nir_instr_type_load_const. The above changes are Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7682>	2021-09-01 08:30:57 +00:00
Timothy Arceri	a9ed4538ab	nir: add indirect loop unrolling to compiler options This is where it should be rather than having to pass it into the optimisation pass every time. It also allows us to call the loop analysis pass without having to duplicate these options which we will do later in this series. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12064>	2021-08-03 10:54:50 +00:00
Rhys Perry	1cbcfb8b38	nir, nir/algebraic: add byte/word insertion instructions Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3151>	2021-06-08 08:57:42 +00:00
Connor Abbott	a40714abf7	nir/lower_phis_to_scalar: Add "lower_all" option We don't want to have to deal with vector phis in freedreno, because vectors are always split/unsplit around vectorized instructions anyways, and the stated reason for not scalarising them (it hurting coalescing) won't apply to us because we won't be using nir_from_ssa. Add this option so that we don't have to do the equivalent thing while translating from NIR. Reviewed-by: Rob Clark <robdclark@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10809>	2021-05-17 09:59:45 +00:00
Vasily Khoruzhick	bff7fa3fe3	lima: compute nir_sha1 for shader key even if disk cache is disabled We're using it for in-memory cache as well, so it needs to be computed unconditionally. Fixes: `bf09ba5385` ("lima: implement shader disk cache") Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9838>	2021-03-25 21:58:57 +00:00
Vasily Khoruzhick	bf09ba5385	lima: implement shader disk cache Wire up disk cache routines and change fs and vs keys to use nir_sha1 instead of pointer to uncompiled shader to be able to reuse them for disk cache. Tested-by: Emmanuel Gil Peyrot <linkmauve@linkmauve.fr> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9672>	2021-03-25 06:31:41 +00:00
Vasily Khoruzhick	3099ce7d02	lima: rename lima_{fs,vs}_bind_state to lima_{fs,vs}_uncompiled_shader Tested-by: Emmanuel Gil Peyrot <linkmauve@linkmauve.fr> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9672>	2021-03-25 06:31:41 +00:00
Vasily Khoruzhick	1261c06c68	lima: rename _shader_state to _compiled_shader Tested-by: Emmanuel Gil Peyrot <linkmauve@linkmauve.fr> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9672>	2021-03-25 06:31:41 +00:00
Vasily Khoruzhick	d2ca8be998	lima: add precompile debug flag This flag will be used by run from mesa-shader-db to trigger shader compilation with default settings. Tested-by: Emmanuel Gil Peyrot <linkmauve@linkmauve.fr> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9583>	2021-03-14 16:16:03 +00:00
Erico Nunes	c453921c91	lima: run nir dce after nir_lower_vec_to_movs Some of the 'vec*' nir instructions may hold references to dead code until the nir_lower_vec_to_movs pass runs. After nir_lower_vec_to_movs, that code can finally be cleaned by dce, so add an additional dce pass. This not only potentially further removes unneeded code from the nir representation but also prevents bugs with the compiler from special case unused code that is not expected (e.g. root undef type nodes). Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9507>	2021-03-12 11:46:54 +00:00
Eric Anholt	3bdd39f03c	lima: avoid stomping over bound shader state when creating new shaders It shouldn't affect bound program state, and the current context state shouldn't be relevant for shader creation precompiles anyway (level load isn't going to have the eventual set of sampler views bound when you go to draw with that shader). Closes: #4306 Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9089>	2021-03-04 18:34:35 +00:00
Eric Anholt	4ac3f85054	lima: upload the shader to a BO at shader creation No need to conditionally upload later. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9089>	2021-03-04 18:34:35 +00:00
Eric Anholt	5a550c8dc7	lima: don't look at dirty bits for setup of FS key You always have to populate the key with the right texture swizzles, even if textures haven't changed since binding a new shader. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9089>	2021-03-04 18:34:35 +00:00
Eric Anholt	d4f706389c	lima: stop encoding the texture format in the shader key We can compose the swizzles at sampler view creation time, saving recompiles on texture format changes. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9089>	2021-03-04 18:34:34 +00:00
Erico Nunes	83eebadf92	lima: implement GL_EXT_texture_swizzle This is a desktop OpenGL 2.1 extension that seems to be required by glamor to support glyph rendering acceleration with R8 textures. Implementation borrowed from vc4. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8969>	2021-02-13 18:57:43 +00:00
Erico Nunes	7c885ad6c0	lima: introduce fs and vs shader cache Some opengl features require shader recompilation to be implemented in the mali400 and can only be detected at draw time. In some applications, this could lead to recompiling shaders on every frame which could kill performance. Introduce a shader cache to prevent this issue. lima didn't have vertex shader recompilation yet but it is needed to implement followup features, so include it here too. Code heavily borrowed from the vc4 shader cache implementation. One notable difference between the vc4 reference implementation and lima is that lima still compiles the base shader at state creation time without deferring it to the next draw (so that it stays compatible with behaviour expected by unmodified shader-db, for example). Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8357>	2021-01-16 22:20:35 +00:00
Daniel Schürmann	bd8e84eb8d	nir: replace .lower_sub with .has_fsub and .has_isub This allows a more fine-grained control about whether a backend supports one of these instructions. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6597>	2021-01-11 19:13:51 +00:00
Erico Nunes	faaba0d6af	nir/lower_vec_to_movs: don't vectorize unsupports ops If the instruction being coalesced would be vectorized but the target doesn't support vectorizing that op, skip coalescing. Reuse the callbacks from alu_to_scalar to describe which ops should not be vectorized. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Eric Anholt <eric@anholt.net> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6506>	2021-01-11 13:13:30 +00:00
Kenneth Graunke	140f53e646	Revert "nir: replace lower_ffma and fuse_ffma with has_ffma" This reverts commit `939ddf3f67`. Intel has a separate pass for fusing FFMAs selectively. We split these flags in commit `1b72c31e1f` and the reasoning still stands. The patch being reverted was just a cleanup, so there should be no issue with reverting it. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6849>	2020-09-24 13:11:50 -07:00
Marek Olšák	939ddf3f67	nir: replace lower_ffma and fuse_ffma with has_ffma Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Marek Olšák	771aad3027	nir: split lower_ffma into lower_ffma16/32/64 AMD wants different behavior for each bit size Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Eric Anholt	f25e169897	nir/opt_vectorize: Add a callback for filtering of vectorizing. For NIR-to-TGSI, we don't want to revectorize 64-bit ops that we split to scalar beyond vec2 width. We even have some ops that we would rather retain as scalar due to TGSI opcodes being scalar, or having more unusual requirements. This could be used to do the vectorize_vec2_16bit filtering, but that shader compiler option is also used in algebraic so leave it in place for now. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6567>	2020-09-02 09:59:17 -07:00
Jason Ekstrand	4f521e596a	lima: Only call nir_lower_io on shader_in/out Gallium drivers should never see nir_var_uniform because gallium lowers regular uniforms to a UBO. No GL driver should ever see either nir_var_mem_shared because that's lowered in GLSL IR. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5418>	2020-07-06 19:54:30 +00:00
Timothy Arceri	04dbf709ed	nir: add callback to nir_remove_dead_variables() This allows us to do API specific checks before removing variable without filling nir_remove_dead_variables() with API specific code. In the following patches we will use this to support the removal of dead uniforms in GLSL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4797>	2020-06-03 02:22:23 +00:00
Axel Davy	522bd414f3	ttn: Add new allow_disk_cache parameter For now this parameter doesn't do anything. It means the implementation is allowed to use a cache on disk. Signed-off-by: Axel Davy <davyaxel0@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4993>	2020-05-13 19:43:05 +00:00
Erico Nunes	8c4157138f	lima/ppir: duplicate consts in nir Move the duplicate consts step to a nir pass. This makes the nir representation closer to what ppir will have in the result. Additionally, it handles the case where a const is used multiple times by a single node (which can happen in instructions like fcsel). The new implementation will only emit a single load const for that case. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Erico Nunes	5e6c386118	lima/ppir: duplicate intrinsics in nir Move the duplicate uniform and varying steps to a nir pass, along with some changes in the duplicating strategy. Node duplication is now done per user of the varying/uniform. This is inspired by what the offline shader compiler seems to usually do, and as usual aims to reduce register pressure and better utilize the ld_uni and ld_var instruction slots. It is worth noting that due to a bug/feature, ppir was already duplicating uniforms per successor in ppir_node_add_src even if the comment indicated it was meant to be per-block. Additionally, ppir was duplicating load uniform nodes twice for nodes that use the same uniform in more than one source, resulting in one unnecessary (and unpipelineable) load. This new implementation in nir only creates one load in that case. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4535>	2020-05-09 11:30:07 +00:00
Vasily Khoruzhick	1b49534df2	lima: add support for R and RG formats Unfortunately these are not supported natively for sampling so we have to lower them. Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4241> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4241>	2020-03-20 17:00:10 +00:00
Vasily Khoruzhick	2756b62917	lima/gpir: add better lowering for ftrunc GP doesn't support ftrunc natively and unfortunately one in generic opt_algebraic is not GP-friendly either. Introduce our own lowering that utilizes fsign() that GP supports: ftrunc(a) = fmul(fsign(a), ffloor(fmax(a, -a))) Tested-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4126>	2020-03-16 16:28:33 -07:00
Qiang Yu	6fc0890cd9	lima: rename lima_submit to lima_job Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3755> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3755>	2020-02-17 02:54:15 +00:00
Qiang Yu	a4b048c046	lima: move pp_max_stack_size to lima_submit pp_max_stack_size is preserved across draws. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3755>	2020-02-17 02:54:15 +00:00
Vasily Khoruzhick	fe5267d322	lima: disable early-z if fragment shader uses discard We have to disable early-z if fragment shader uses discard, otherwise we'll get misrendering. Reported-by: Icenowy Zheng <icenowy@aosc.io> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Tested-by: Marge Bot <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3570> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/merge_requests/3570>	2020-01-27 22:35:43 -08:00
Erico Nunes	7701b7b7ee	lima/ppir: enable lower_fdph Otherwise we may lower some fdot to fdph which is not implemented in pp. Fixes #2126 Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-12-11 15:55:48 +01:00
Erico Nunes	d939f5d463	lima: fix nir shader memory leak Fix memory leak on allocation for nir shader, reported by valgrind. 3,502 (480 direct, 3,022 indirect) bytes in 1 blocks are definitely lost in loss record 77 of 84 at 0x48483F8: malloc (in /usr/lib/valgrind/vgpreload_memcheck-arm64-linux.so) by 0x5750817: ralloc_size (ralloc.c:119) by 0x5750977: rzalloc_size (ralloc.c:151) by 0x575C173: nir_shader_create (nir.c:45) by 0x5763ACB: nir_shader_clone (nir_clone.c:728) by 0x55D5003: st_create_fp_variant (st_program.c:1242) by 0x55D789F: st_get_fp_variant (st_program.c:1522) by 0x55D789F: st_get_fp_variant (st_program.c:1507) by 0x56400C3: st_update_fp (st_atom_shader.c:163) by 0x563D333: st_validate_state (st_atom.c:261) by 0x55D07CB: prepare_draw (st_draw.c:132) by 0x55D08DF: st_draw_vbo (st_draw.c:184) by 0x55576CB: _mesa_draw_arrays (draw.c:374) by 0x55576CB: _mesa_draw_arrays (draw.c:351) Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-11-07 23:03:01 +00:00
Vasily Khoruzhick	65a5b24aee	lima: add support for gl_PointSize GP handles gl_PointSize similar to gl_Position, i.e. it needs separate buffer and it has special type in varying descriptors, also for indexed draw we need to emit special PLBU command to pass address of gl_PointSize buffer. Blob also clamps gl_PointSize to 1 .. 100 (as well as line width), so let's do the same. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-11-05 17:44:56 -08:00
Vasily Khoruzhick	6dd0ad66de	lima/ppir: add NIR pass to split varying loads NIR may emit a single instrinsic to load several packed varyings, but that's suboptimal for Utgard PP for several reasons: - varyings that are used as sampler inputs can be passed using pipeline register with increased precision - we have small number of regs, so using a vec4 regs for storing two vec2 varyings increases reg pressure. Add NIR pass to split a single load into several loads and utilize it in lima. Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-26 18:51:10 -07:00
Vasily Khoruzhick	d214778753	lima: implement BO cache Allocating BOs is expensive, so we should avoid doing that by caching freed BOs. BO cache is modelled after one in v3d driver and works as follows: - in lima_bo_create() check if we have matching BO in cache and return it if there's one, allocate new BO otherwise. - in lima_bo_unreference() (renamed from lima_bo_free()): put BO in cache instead of freeing it and remove all stale BOs from cache Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-22 19:20:59 -07:00
Vasily Khoruzhick	576341324d	lima: run opt_algebraic between int_to_float and boot_to_float for vs int_to_float emits ftrunc and ftrunc lowering generates bool ops. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-09-09 10:25:30 -07:00

1 2

85 commits