fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-28 01:18:15 +02:00

Author	SHA1	Message	Date
Vasily Khoruzhick	34a75ce15c	lima: fix blending with min/max ops It turns out that BLEND_MIN and BLEND_MAX in Utgard take blend factors into account. My guess is that actual equation looks like: OP(As * S + Ad * D, Ad) for alpha, and OP(Cs * S + Cd * D, Cd) for color. So we have to set S factor to 1 and D factor to 0 to be compliant with GL spec. Fixes following piglit tests: spec@!opengl 1.4@blendminmax spec@arb_blend_func_extended@arb_blend_func_extended-fbo-extended-blend (with patch my for ES2_compatibility and EXT_blend_func_extended) Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13873>	2021-11-29 19:31:59 +00:00
Vasily Khoruzhick	5f9434b611	lima: use 1 as blend factor for dst_alpha for SRC_ALPHA_SATURATE As per [1] alpha blend factors for Sa and Da should be 1 for SRC_ALPHA_SATURATE [1] https://www.khronos.org/registry/OpenGL/extensions/ARB/ARB_blend_func_extended.txt Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13873>	2021-11-29 19:31:59 +00:00
Vasily Khoruzhick	d1d3ebb48c	lima: implement dual source blend It was a bit trickier to RE, since blob doesn't expose this functionality at all, however we had a clue from the very beginning: lima_blend_factor is 3 bits, i.e. 8 values, but only 5 of them were used, it just waited till someone tried what 3 unused values do. Interestingly enough, it turns out "5" works just as "0" (which is PIPE_BLENDFACTOR_SRC_), but only if output register for gl_FragColor is $0, So it looks suspiciously similar with PIPE_BLENDFACTOR_SRC1_ behavior, and looks like secondary output is taken from $0. Since output regs for all other outputs are configured via RSW, there must be a field in RSW for output register for secondary color, it's likely 4 bits and it's currently set to 0 for reg $0. Then it was just a matter of brute-forcing various consecutive 4 bits in RSW - and indeed, setting top 4 bits of rsw->aux0 to the index of gl_FragColor output register fixes blending tests when we use "5" blend factor instead of "0". So it must be a register number for gl_SecondaryFragColor. Unlike gl_FragColor, the field is only repeated once in RSW. Wire it up in compiler, and piglit arb_blend_func_extended now passes. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13873>	2021-11-29 19:31:59 +00:00
Vasily Khoruzhick	b8f4d36ee4	lima: disasm: call util_cpu_detect() to init CPU caps It's needed by _mesa_half_to_float(), without this change it hits assertion failure in util_get_cpu_caps(). Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13968>	2021-11-29 18:34:58 +00:00
Vasily Khoruzhick	711a4ccddb	lima: disasm: use last argument as a filename Otherwise it fails to open a file. Fixes: `9660427ab7` ("lima: Print usage if --help is any of the arguments.") Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13968>	2021-11-29 18:34:58 +00:00
Vasily Khoruzhick	437b97de1c	lima: fix crash with sparse samplers Fixes following piglit tests: spec@arb_fragment_program@fp-fragment-position spec@arb_fragment_program@sparse-samplers Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13939>	2021-11-29 18:19:19 +00:00
Vasily Khoruzhick	3b15fb3575	lima/ppir: implement gl_FragDepth support Mali4x0 supports writing depth and stencil from fragment shader and we've been using it quite a while for depth/stencil buffer reload. The missing part was specifying output register for depth/stencil. To figure it out, I changed reload shader to use register $4 as output and poked RSW bits (or rather consecutive 4 bit groups) until tests that rely on reload started to pass again. It turns out that register number for gl_FragDepth/gl_FragStencil is in rsw->depth_test and register number for gl_FragColor is in rsw->multi_sample and it's repeated 4 times for some reason (likely for MSAA?) With this knowledge we now can modify ppir compiler to support multiple store_output intrinsics. To do that just add destination SSA for store_output to the registers list for regalloc and mark them explicitly as output. Since it's never read in shader we have to take care about it in liveness analysis - basically just mark it alive from the time when it's written to the end of the block. If it's live only in the last instruction, mark it as live_internal, so regalloc doesn't clobber it. Then just let regalloc do its job, and then copy register number to the shader state and program it in RSW. The tricky part is gl_FragStencil, since it resides in the same register as gl_FragDepth and with the current design of the compiler it's hard to merge them. However gl_FragStencil doesn't seem to be part of GL2 or GLES2, so we can just leave it not implemented. Also we need to take care of stop bit for instructions - now we can't just set it in every instruction that stores output, since there may be several outputs. So if there's any store_output instructions in the block just mark that block has a stop, and set stop bit in the last instruction in the block. The only exception is discard - we always need to set stop bit in discard instruction. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13830>	2021-11-24 02:26:08 +00:00
Vasily Khoruzhick	98a7c4c6f8	lima/ppir: check if mul node is a source of add node before inserting We can't insert mul node into add node instruction if it's a virtual dep (sequence or write_or_read dep), so use ppir_node_has_single_src_succ in addition to ppir_node_has_single_succ. We can't use ppir_node_has_single_src_succ alone, since node may have a virtual dependency in addition to source dependency, and we can't insert it either in this case. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13830>	2021-11-24 02:26:08 +00:00
Mike Blumenkrantz	c9a47c85da	gallium: rename PIPE_CAP_PREFER_BLIT_BASED_TEXTURE_TRANSFER this is now a bitfield enum for more functionality Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11984>	2021-11-18 07:58:29 -05:00
Erico Nunes	ee2e14b352	ci: temporarily disable lima CI The lima board farm will be unavailable for a few days, so disable it to avoid CI failures. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13595>	2021-11-17 11:40:19 +00:00
Vasily Khoruzhick	02e5f4fb10	lima: add more wrap modes Using 1 bit per wrap mode looked very suspicious and after some experiments it turns out it's 3-bit enum. Border color is also here, it sits right after depth field. For some reason it uses 16 bit per channel just like for clear color in RSW GL_CLAMP mode is broken for nearest filter just as on Midgard, so add the same workaround - use GL_CLAMP_TO_EDGE for nearest filter. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Vasily Khoruzhick	cbed4d784e	lima: handle 1D samplers It's just a matter of changing number of dimensions in texture descriptor. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Vasily Khoruzhick	fa86a2a94d	lima: add support for 3D textures It looks like MBS format used by blob doesn't distinguish sampler2D from sampler3D, so load texture instruction is the same for 2D and 3D textures. So all we need to RE is texture descriptor for 3D textures, but blob doesn't implement it, so we need to do some guesswork: - unknown_3_1 looks like a depth since it sits after height/width and always set to 1 - unknown_2_2 is exactly 3 bits and it follows wrap_t, so it must be wrap_r - missing part is texture type for 3D textures. By trial and error it seems to be 4. First bit is only set for cubemap, so it's likely a separate flag, and rest 2 bits look like number of tex dimensions akin to midgard and later (thanks, panfrost!) with 0 for 1D, 1 for 2D and 2 for 3D. Put it all together and we have working 3D textures on lima! Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Vasily Khoruzhick	764760314d	lima: add native txp support Currently lima uses generic TXP lowering that results in downgrading coords precision to FP16 since we have to do some calculations with coords instead of loading them directly from varying. Mali4x0 has native TXP support, however coords and projector have to come from a single source. Add NIR lowering pass that combines coords and projector into a single backend-specific source and use it instead of generic lowering. Unfortunately this change regresses one test, but it also fails in blob and disassembly is now identical. shader-db diff: total instructions in shared programs: 15623 -> 15603 (-0.13%) instructions in affected programs: 877 -> 857 (-2.28%) helped: 7 HURT: 0 helped stats (abs) min: 2 max: 8 x̄: 2.86 x̃: 2 helped stats (rel) min: 0.87% max: 10.53% x̄: 4.93% x̃: 1.85% 95% mean confidence interval for instructions value: -4.95 -0.76 95% mean confidence interval for instructions %-change: -9.31% -0.55% Instructions are helped. total loops in shared programs: 3 -> 3 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 136 -> 137 (0.74%) spills in affected programs: 0 -> 1 helped: 0 HURT: 1 total fills in shared programs: 598 -> 602 (0.67%) fills in affected programs: 0 -> 4 helped: 0 HURT: 1 Tested-by: Denis Pauk <pauk.denis@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13111>	2021-11-16 19:13:42 +00:00
Vasily Khoruzhick	15013958d0	lima: enable PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS Mali4x0 PP doesn't have a swizzle for load_input, so use POT-aligned varyings to avoid unnecessary movs for vec3 and precision downgrade in case if this vec3 is coordinates for a sampler shader-db: total instructions in shared programs: 15707 -> 15623 (-0.53%) instructions in affected programs: 3906 -> 3822 (-2.15%) helped: 47 HURT: 18 helped stats (abs) min: 1 max: 9 x̄: 3.09 x̃: 2 helped stats (rel) min: 1.49% max: 23.53% x̄: 8.20% x̃: 6.45% HURT stats (abs) min: 1 max: 7 x̄: 3.39 x̃: 3 HURT stats (rel) min: 0.78% max: 20.59% x̄: 10.45% x̃: 10.97% 95% mean confidence interval for instructions value: -2.18 -0.41 95% mean confidence interval for instructions %-change: -5.70% -0.38% Instructions are helped. total spills in shared programs: 146 -> 136 (-6.85%) spills in affected programs: 39 -> 29 (-25.64%) helped: 6 HURT: 0 total fills in shared programs: 617 -> 598 (-3.08%) fills in affected programs: 125 -> 106 (-15.20%) helped: 6 HURT: 0 HURT shaders are vertex shaders where we may need more instructions for non-packed vec3s. It's acceptable trade-off since we don't get precision downgrade if this varying is coordinates for a sampler. Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13151>	2021-11-15 22:52:55 +00:00
Andreas Baierl	ee41e1bbd2	lima: Fix drawing wide lines GLES2.0 spec allows parts of wide lines and points to be drawn even if their center is outside the viewport. Therefore 0x2000 in PLBU_CMD_PRIMITIVE_SETUP has to be set for points. This is already our default setting as it seems to have no negative effect when this bit is always set. Points work as expected but lines don't. It's hard to RE it, because the affected deqp tests also fail with the blob. To respect this behaviour for lines and solve another 2 tests, we need to do a workaround and temporarily extend the viewport by half of the line width. The scissor rectangle is still equal with the initial viewport. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12971>	2021-11-11 11:25:58 +00:00
Marek Olšák	cf9afc7b0c	gallium: add missing point and line CAPs The returned values are the same as the GL frontend. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13676>	2021-11-08 14:37:49 +00:00
Marek Olšák	b80dca86c3	gallium: rename PIPE_CAPF_MAX_POINT_WIDTH -> MAX_POINT_SIZE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13676>	2021-11-08 14:37:49 +00:00
Marek Olšák	7ce3f8e639	gallium/util: fix util_can_blit_via_copy_region with unbound render condition It returned false when a render condition was not bound, but it should have returned true. The bool stuff is random and incomplete, but that's life. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13602>	2021-11-04 15:06:09 +00:00
Emma Anholt	38dff02bfb	ci/deqp-runner: Rename the deqp-drivername-.txt files to drivername-.txt We have two testsuites with the same format for fails/flakes/skips files, and test names that are definitely unique. As I'm about to add a third testsuite (gtest for libva-utils), so let's have just one file each for fails/flakes/skips instead of one per type of testsuite. This starts the move with just the bulk rename of deqp. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13517>	2021-10-27 20:54:11 +00:00
Emma Anholt	9ddfd297e0	ci/deqp-runner: Simplify the --jobs argument setup. We can use the general "how parallel should we go on this runner?" env var and save a bunch of massaging env var names. Fixes how PIGLIT_PARALLEL looked like it was useful but actually wasn't passed through to HW runners. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Marcin Ślusarz	b8cafaa91d	lima: use nir_shader_instructions_pass in lima_nir_split_load_input Changes: - nir_metadata_preserve(..., nir_metadata_block_index \| nir_metadata_dominance) is called only when pass makes progress - nir_metadata_preserve(..., nir_metadata_all) is called when pass doesn't make progress Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13176>	2021-10-04 15:54:06 +00:00
Andreas Baierl	1e9f18008f	lima/parser: add shader disassembly to dump Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13138>	2021-10-04 08:37:50 +02:00
Vasily Khoruzhick	5db5ff58b7	lima: split_load_input: don't split unaligned vec2 Mali4x0 can't fetch unaligned vec2 (i.e. .yz), so don't split it. Fixes: `6dd0ad66de` ("lima/ppir: add NIR pass to split varying loads") Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13131>	2021-10-01 18:11:54 +00:00
Andreas Baierl	187f786108	lima: Fix glFrontFace handling Bit 12 of render->aux1 is GL_CCW/GL_CW. For GL_CCW (default of glFrontFace) we have to set that bit active. This is not what the blob does and what the original reverse engineering documentation says. The blob sets this value inverted and does some bogus negation of the fragment shaders gl_FrontFacing variable instead. Anyway, doing it this way does not cause regressions but fixes dEQP-GLES2.functional.shaders.builtin_variable.frontfacing and 4 piglit tests. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7690>	2021-09-27 09:24:32 +02:00
Emma Anholt	13384b9626	mesa: Prioritize checking for GLES2's uniform transpose error. The negative API tests ask to transpose a non-matrix uniform, and expect the transpose error rather than the non-matrix error. This may be a test bug about ambiguous results, but since every other driver is presumably doing this too, just follow along. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12924>	2021-09-21 23:06:42 +00:00
Emma Anholt	5a39938b00	mesa: Throw an error for compressed glGenerateMipmap on GLES2 contexts. This error is gone from GLES3. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12924>	2021-09-21 23:06:42 +00:00
Andreas Baierl	3c19ab4a7b	lima: Remove depth near/far workaround because this is fixed now. Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12804>	2021-09-21 08:54:53 +00:00
Andreas Baierl	d1798ad1b5	lima: Expose GL_EXT_clip_control Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12804>	2021-09-21 08:54:53 +00:00
Emma Anholt	aed4c0b5a9	nir: Drop the unused instr arg for src/dest copy functions. Now that we don't use ralloc, we don't need this arg to get at the right ralloc ctx. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11776>	2021-09-14 17:53:06 +00:00
Icecream95	1976f4980c	lima: Add a noop drm-shim Hard-code Mali450 with six cores for now, matching the hardware I have. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12737>	2021-09-10 21:34:36 +00:00
Icecream95	2777d4f69d	lima: Improve error messages for unsupported GP operations Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12738>	2021-09-08 04:19:56 +00:00
Icecream95	4ad4aa38fa	lima: Fix crashes for GPUs with more than four cores Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12738>	2021-09-08 04:19:56 +00:00
Icecream95	c1f40c762c	lima: Enable PIPE_CAP_VERTEX_COLOR_UNCLAMPED Fixes lighting being too bright in Neverball. Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12738>	2021-09-08 04:19:56 +00:00
Connor Abbott	77a852c1ba	lima/gpir: Rewrite register allocation for value registers The usual linear-scan register allocation algorithm can't handle preallocated registers, since we might be forced to choose a color for a non-preallocated variable that overlaps with a pre-allocated variable. But in such cases we can simply split the live range of the offending variable when we reach the beginning of the pre-allocated variable's live range. This is still optimal in the sense that it always finds a coloring whenever one is possible, but we may not insert the smallest possible number of moves. However, since it's actually the scheduler which splits live ranges afterwards, we can simply fold in the move while keeping its fake dependencies, and then everything still works! In other words, inserting a live range split for a value register during register allocation is pretty much free. This means that we can split register allocation in two. First globally allocate the cross-block registers accessed through load_reg and store_reg instructions, which is still done via graph coloring, and then run a linear scan algorithm over each block, treating the load_reg and store_reg nodes as referring to pre-allocated registers. This makes the existing RA more complicated, but it has two benefits: first, using round-robin with the linear scan allocator results in much fewer fake dependencies, resulting in around 15 less instructions in the glmark2 jellyfish shader and fixing a regression in instruction count since branching support went in. Second, it will simplify handling spilling. With just graph coloring for everything, every time we spill a node, we have to create new value registers which become new nodes in the graph and re-run RA. This is worsened by the fact that when writing a value to a temporary, we need to have an extra register available to load the write address with a load_const node. With the new scheme, we can ignore this entirely in the first part and then in the second part we can just reserve an extra register in sections where we know we have to spill. So no re-running RA many times, and we can get a good result quickly. The current implementation does linear scan backwards, so that we can insert the fake dependencies while allocating and avoid creating any move nodes at all when we have to split a live range. However, it turns out that this makes handling schedule_first nodes a bit more complicated, so it's not clear if that was worth it. Note: The commit was originally authored by Connor Abbott <cwabbott@gmail.com> and was cherry-picked from <mesa/mesa!2315>. Rebasing was necessary due to changes to BITSET_FOREACH_SET, see `4413537c` Because some deqp tests pass now, deqp-lima-fails.txt was also changed. The above changes are Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7682>	2021-09-01 08:30:57 +00:00
Connor Abbott	3d957b40cc	lima: Add a NIR load duplicating pass and use it with vertex shaders. Note: The commit was originally authored by Connor Abbott <cwabbott@gmail.com> and was cherry-picked from <mesa/mesa!2315>. Apart from some changes, which were necessary due to rebasing, the following changes have been added: clone_intrinsic() was changed to use nir_instr_clone() instead of doing it manually. Tests against `src->parent_instr->type != nir_instr_type_phi` have been inserted, otherwise we may run into a nir validation error. Intrinsic load_input and load_uniform are not duplicated, if their source type is nir_instr_type_load_const. The above changes are Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7682>	2021-09-01 08:30:57 +00:00
Marek Olšák	bb89cf4bf3	gallium: add take_ownership into set_sampler_views to skip reference counting Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12334>	2021-08-20 15:04:20 +00:00
Andreas Baierl	5df677e996	lima: CI: Enable GL_R8 and GL_RG8 texture formats This is fixed in deqp now. See https://github.com/KhronosGroup/VK-GL-CTS/pull/241 Since CI is using deqp version > vulkan-cts-1.2.6.0, this isn't an issue anymore. Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12409>	2021-08-17 14:49:51 +02:00
Erico Nunes	574bff9087	ci: enable CI for lima again Enable CI for lima again on meson-gxl-s805x-libretech-ac boards with Mali-450. These boards are managed by a LAVA instance and so follow the LAVA CI workflow in Mesa. The goal is to have coverage for deqp-gles2, as lima is a GLES2-only driver. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11789>	2021-08-17 11:22:59 +00:00
Roman Stratiienko	5ec6b6e9bb	lima: Implement lima_resource_get_param() callback Currently stride, offset, modifier is obtained by invoking lima_resource_get_handle() with WINSYS_HANDLE_TYPE_KMS. Before commit `47f000c170` this path was working. Obtained handle was simply ignored by DRI frontend and only requested data used. After commit `47f000c170` such requests started to fail when DRI is initialized using KMSRO and resource has no scanout data. When lima_resource_get_param() is implemented, it will be used in a first place to obtain resource data. Fixes: `47f000c170` ("lima: fail in get_handle(TYPE_KMS) without a scanout resource") Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Simon Ser <contact@emersion.fr> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12362>	2021-08-17 10:50:51 +00:00
Timothy Arceri	a9ed4538ab	nir: add indirect loop unrolling to compiler options This is where it should be rather than having to pass it into the optimisation pass every time. It also allows us to call the loop analysis pass without having to duplicate these options which we will do later in this series. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12064>	2021-08-03 10:54:50 +00:00
Simon Ser	47f000c170	lima: fail in get_handle(TYPE_KMS) without a scanout resource The previous logic was returning a handle valid for the render-only device if rsc->scanout was NULL. However the caller doesn't expect this: the caller will use the handle with the KMS device. Instead of returning a handle for the wrong device, fail if we don't have one. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12074>	2021-07-29 21:01:10 +02:00
Erico Nunes	e6cdb01c51	lima: avoid crash with negative viewport values The viewport value computations done in lima_set_viewport_states can result in a negative value for viewport. These could end up converted to unsigned values in lima_clip_scissor_to_viewport causing crashes from invalid scissor commands. Prevent this by limiting the minimum value to zero as is already done for the left and bottom values. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2938 Cc: mesa-stable Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12055>	2021-07-27 08:48:28 +00:00
Vasily Khoruzhick	4a3269dff6	lima: handle fp16 vertex formats `12128fb135` marked fp16 vertex formats supported, but they aren't actually handled by lima_pipe_format_to_attrib_type(). Fix it by handling it there. FP16 seems to be the only missing index which is 0x3. Fixes: `12128fb135` ("lima: add natively supported vertex buffer formats") Cc: 21.1 mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11872>	2021-07-15 01:06:52 +00:00
Paul Kocialkowski	eefd93c176	lima: Take offset in account when checking BO size BO resources imported from a handle may have an offset provided, which reduces the available size within the BO. Take this in account when checking that the size is sufficient in lima. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Paul Kocialkowski <paul.kocialkowski@bootlin.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11076>	2021-07-13 21:26:21 +00:00
Jason Ekstrand	d4b482d378	android: Drop the Android.mk build system Android.mk files haven't really been supported by Mesa devs for a long time. Most of us have been willing to update Makefile.sources if we remember and sometimes we try to blind code some Android.mk for a new generator. However, the reality is that it breaks regularly and ends up being maintained by the Android community. To address this problem another approach was implemented in !10183 utilizing the maintained meson build system. The old Android.mk files are no longer required. This commit was created with the following commands: git rm /Android.mk git rm /Android..mk git rm */Makefile.sources git rm CleanSpec.mk Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4487 Acked-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9728>	2021-07-08 14:44:02 -05:00
Jason Ekstrand	624e799cc3	nir: Drop nir_ssa_def::name and nir_register::name We say that they're for debug only but we don't really have a good policy around when to set them and when not to. In particular, nir_lower_system_values and nir_lower_vars_to_ssa which are the chief producers of SSA values which might reasonably have a name do not bother to set one. We have some names set from things like BLORP and RADV's meta shaders but AFAICT, they're setting a name more because it's there than because they actually care. Also, most things other than nir_clone and nir_serialize don't bother to try and preserve them. You can see in the diffstat of this commit exactly what passes attempt to preserve names. Notably missing from the list is opt_algebraic which is the single largest source of SSA def churn and it happily throws names away. These observations lead me to question whether or not names are actually useful at all or if they're just taking up space (8B per instruction) and wasting CPU cycles (to ralloc_strdup on the off chance we do have one). I don't think I can think of a single time in recent history where I've been debugging a shader issue and a SSA value name has been there and been useful. If anything, the few times they are there, they just throw me off because they mess up the indentation in nir_print. iris shader-db on my system gets runtime -2.07734% +/- 1.26933% (n=5) Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5439>	2021-07-08 17:34:41 +00:00
Emma Anholt	9cc1f08919	ci/deqp: Skip flush_finish on all CI jobs. They're too slow to run in CI even on non-tiled renderers, they don't block conformance (unless you crash), and provide unreliable warning results unless you isolate them from other activity on the system. This means that the following jobs now skip these tests: - deqp-iris-* - deqp-llvmpipe (you know, the one mentioned in the comment!) - deqp-virgl-gl - deqp-zink-lvp Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11333>	2021-06-14 12:09:19 -07:00
Emma Anholt	e8ca9b99cb	ci/deqp: Drop stress/perf skips lists. The mustpass doesn't have any tests matching these, so no need to skip. These tests only show up if you run without using a mustpass list. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11333>	2021-06-14 12:09:19 -07:00
Daniel Stone	d0e5203855	ci/lava: Use per-job rootfs overlay for environment Trying to get arbitrary strings suitably quoted for shell, embedded in a YAML file, processed by Python templating, is like seven bad ideas all embedded into one big can of bees. Reuse the same script we use for bare-metal to generate the environment, tar that up into a per-job overlay which is added to the inter-pipeline-reusable rootfs built by the container jobs and the intra-pipeline-reusable overlay built by the build jobs. @anholt wrote a chunk of this - replacing the $ENV_VARS GitLab CI variable with a Python loop across the POSIX job environment - in !11192, but this still had YAML quoting nightmares, and was more needless duplication between LAVA and bare-metal. The diff is large and annoying, but is mostly a sed job to get ENV_VARS="FOO=bar BAZ=quux" into FOO: bar\nBAZ: quux. Signed-off-by: Daniel Stone <daniels@collabora.com> Co-authored-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11309>	2021-06-11 12:13:00 +00:00

1 2 3 4 5 ...

493 commits