fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 18:18:06 +02:00

Author	SHA1	Message	Date
Rhys Perry	32d0bae8ec	aco: fix branch definition validation Like how they have to be register allocated differently, branch definitions at merge block predecessors need to be validated differently. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	bed5a31005	aco: add validate_instr_defs() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	d5349a99c2	aco/ra: fix register allocation of branch definitions fossil-db (Sienna Cichlid): Totals from 704 (0.52% of 134913) affected shaders: CodeSize: 7177288 -> 7182072 (+0.07%); split: -0.00%, +0.07% Instrs: 1371781 -> 1372977 (+0.09%); split: -0.00%, +0.09% Latency: 17993572 -> 18001344 (+0.04%); split: -0.00%, +0.04% InvThroughput: 4198996 -> 4199569 (+0.01%); split: -0.00%, +0.01% Copies: 122456 -> 123516 (+0.87%); split: -0.01%, +0.88% Branches: 43815 -> 43818 (+0.01%); split: -0.02%, +0.03% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	608d48b787	aco/ra: add get_reg_phi() helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	ceca5e68c4	aco: remove vcc hint from branch definitions This doesn't seem to have much benefit anymore. fossil-db (Sienna Cichlid): Totals from 198 (0.15% of 134913) affected shaders: CodeSize: 2610536 -> 2610872 (+0.01%); split: -0.01%, +0.02% Instrs: 479001 -> 479085 (+0.02%); split: -0.01%, +0.03% Latency: 7310684 -> 7300735 (-0.14%); split: -0.16%, +0.02% InvThroughput: 2439084 -> 2437446 (-0.07%); split: -0.07%, +0.00% SClause: 14760 -> 14722 (-0.26%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Pavel Ondračka	558f632967	r300: schedule TEX instructions before OUT instructions NIR-to-TGSI produces partial output writes contrary to the old paths that always wrote the full outputs. Therefore if there is now a partial output write ready to be scheduled and nothing else besides a tex is ready, we would schedule the output write first. This was not a problem before as usually at last some component of the full output write depended on the tex result. This is not optimal from the performance point of view and resulted in ~20% slowdown in the Unigine demos. The docs say: The first OUTPUT instruction will reserve space in the output register fifo. This space is limited, therefore issuing an OUTPUT earlier than necessary may cause threads to stall earlier than necessary. You should not set an ALU instruction as type OUTPUT unless it is actually writing to an output register, or it is the last instruction of the program. Fix it by explicitly prefering a TEX before OUT and restore the performance: 9.66 -> 12.12 fps (as compared to 11.83 with the old glsl-to-TGSI path) in Unigine Sanctuary. No change in Lightsmark or GLmark. This is also a win from the intructions point of view as we are usually able to schedule the partial output writes in a single pair at the end. total instructions in shared programs: 106009 -> 105891 (-0.11%) instructions in affected programs: 10153 -> 10035 (-1.16%) helped: 118 HURT: 0 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5840 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15165>	2022-03-03 20:05:32 +00:00
Pavel Ondračka	aff1a85c09	r300: remove some dead logic in tex pair scheduling The max_score == -1 condition is already before so this will never trigger. Its unclear what was the intention anyway. Now we emit either: - if we have accumulated enough tex intructions for a full block - if we have nothing else to emit - or if we can emit all remaining tex instructions already. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15165>	2022-03-03 20:05:32 +00:00
Igor Torrente	688b23885b	Venus: Add `vn_physical_device_{features, properties}` for better organization New extensions properties/feature are being put in the `vn_physical_device` which is not ideal from an organization point of view. Here the `vn_physical_device_{features,properties}` are two new struct to help the `vn_physical_device` organzation. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15170>	2022-03-03 19:43:52 +00:00
Ilia Mirkin	539fae796a	freedreno/a4xx: fix integer tg4 Something is slightly off in the integer values returned. It passes many tests without the fixup, but the dEQP-GLES31 tests complain. The blob ends up doing 3x gathers, and selects between them based on getinfo results. Since we already have a per-sampler key with some spare bits, just stick the bit-size info in there. And we can derive signedness from the associated type info. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	96211adf77	freedreno/a4xx: add swizzles to shader keys for tg4 workaround Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	68a2d25d0d	freedreno/a4xx: move tex_type to header This will be used in several places. Factor it out for common use. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	8ed07c0da9	nir: remove bogus logic to allow cube + offset to work This was done for an a4xx hack which is now removed. No API allows cube texturing to have offsets. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	37306ba3f1	freedreno/ir3: remove bogus tg4 -> tex lowering pass It can't be done. This just provides bad results. The blob had a comparable approach where they fixed up coordinates, but that also can't work with a separate texture definition with nearest filtering. By then, might as well provide a unswizzled variant instead, and using native functionality. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Alex Xu (Hello71)	80bf9c7b97	r300/compiler/tests: print regoff_t as size_t fixes compilation on musl Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13949>	2022-03-03 17:48:17 +00:00
Samuel Pitoiset	516aee64cc	radv,aco: do not lower nir_op_pack_{unorm,snorm}_2x16 v_cvt_pknorm_{u16,i16}_f32 can be emitted instead, it's supported on all generations. No fossils-db changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15215>	2022-03-03 14:54:12 +01:00
Michel Zou	f1f1b3d7f8	vulkan/wsi: drop unused wsi_create_win32_image fixes: `ed391d2a` Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15088>	2022-03-03 06:13:07 +00:00
Andrii Simiklit	ddf2778269	glsl: add member's location layout qualifier rules for `arrayed` in/out blocks From Section 4.4.1 (Input Layout Qualifiers) of the GLSL 4.50 spec: "For some blocks declared as arrays, the location can only be applied at the block level: When a block is declared as an array where additional locations are needed for each member for each block array element, it is a compile-time error to specify locations on the block members. That is, when locations would be under specified by applying them on block members, they are not allowed on block members. For arrayed interfaces (those generally having an extra level of arrayness due to interface expansion), the outer array is stripped before applying this rule" From Section 1.2.1 (Changes from Revision 6 of GLSL Version) of the GLSL 4.50 spec: "Private Bug 15678: Don’t allow location = on block members where the block needs an array of locations" From Section 4.4.1 (Input Layout Qualifiers) of the GLSL ES 3.20 spec "If an input is declared as an array of blocks, excluding per-vertex-arrays as required for tessellation, it is an error to declare a member of the block with a location qualifier" From Section 1.1.3 (Changes from GLSL ES 3.2 revision 3) of the GLSL ES 3.20 spec: "Arrayed blocks cannot have layout location qualifiers on members" Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11522>	2022-03-03 05:42:45 +00:00
Mike Blumenkrantz	0313110c92	zink: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15227>	2022-03-03 05:21:40 +00:00
Mike Blumenkrantz	712ce86bd1	zink: split primitives generated queries if xfb/gs states change if one of these states change then it affects which result needs to be used for that query, so split it up over multiple query ids to make sure the correct result is obtained fixes (lavapipe): GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_pause_resume GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_states Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15227>	2022-03-03 05:21:40 +00:00
Mike Blumenkrantz	0cb3ae949c	zink: split out query suspending into util function no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15227>	2022-03-03 05:21:40 +00:00
Mike Blumenkrantz	5aecec48ee	zink: update query states before starting renderpass during draw this gives some leeway for doing transfer ops without crashing the renderpass Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15227>	2022-03-03 05:21:40 +00:00
Ilia Mirkin	965ab44c50	nvc0: disable EXT_texture_sRGB_RG8 Looks like the green component doesn't get srgb-decoding, and no obvious way to force it. It works fine on nv50 though. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15211>	2022-03-03 04:37:12 +00:00
Ilia Mirkin	897a7fbbf1	mesa: enable GL_EXT_texture_sRGB_RG8 on desktop Looks like an extension number was assigned in late 2020. This makes it possible to hook up this format to teximage-colors without teaching it about ES. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15211>	2022-03-03 04:37:12 +00:00
Mike Blumenkrantz	af5f49f663	zink: remove loop from generated tcs this is already using per-vertex io, no need to add conditionals to verify Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15225>	2022-03-03 02:58:43 +00:00
Rob Clark	7e63fa2bb1	freedreno/registers: Add a couple regs we need for kernel Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15221>	2022-03-03 02:19:47 +00:00
Dave Airlie	34379a937f	gallivm/llvmpipe: add support for NIR to the linear/aos paths. When the AOS/linear code was added it only worked with TGSI which meant nothing in mesa upstream was really using it. This adds support to analyse NIR shaders, and adds aos support to the backend. AOS support is limited to mov,vec,fmul,tex sampling in order to accelerate mostly compositing operations. I've tested weston uses the fast path. gnome-shell can't use it yet as we can't optimise the depth test paths. Acked-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15140>	2022-03-03 01:39:39 +00:00
Dave Airlie	6efd489ac9	gallivm/nir: split load_const out into backend helper. This just makes adding aos support easier. Acked-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15140>	2022-03-03 01:39:39 +00:00
Dave Airlie	65c7ca617f	llvmpipe/linear: fix disk caching. Acked-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15140>	2022-03-03 01:39:39 +00:00
Mike Blumenkrantz	503362f008	zink: switch to u_foreach_bit for ntv image access decorations no functional changes Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15217>	2022-03-03 01:28:13 +00:00
Mike Blumenkrantz	fcdcfd9967	zink: emit Aliased decorations for any image that isn't explicitly marked restrict these might be aliased fixes: arb_shader_image_load_store-restrict fixes #6090 Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15217>	2022-03-03 01:28:13 +00:00
Mike Blumenkrantz	351378ae80	zink: remove a bunch of flakes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15219>	2022-03-03 01:14:20 +00:00
Dave Airlie	e1e9a44a69	lavapipe: always set read/write on ssbo/images. This fixes a regressions with overlap in llvmpipe, this is pessimistic we should write code to make it work properly. Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15219>	2022-03-03 01:14:20 +00:00
Alyssa Rosenzweig	b48236ea3e	pan/bi: Add arithmetic flag to RSHIFT ops Models ops like ARSHIFT_OR.i32 on Valhall without adding piles of new instructions to the IR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	0b0e74ae82	pan/bi: Extend LD_TILE with a register format Required for Valhall. NIR has the information anyway, pass it along. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	74107abfc6	pan/bi: Add BRANCHZI instruction Technically this is just JUMP on Valhall, but the semantic is an indirect branch based on comparing with zero. It can also be used as a conservative branch (like BRANCHC), but this isn't modeled. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	3dc2095b07	pan/bi: Model LD_BUFFER instructions We'll use these to read from UBOs on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	5796777889	pan/bi: Model offset for LOAD/STORE Needed to model the immediate offset on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	039bb4e68c	pan/bi: Model pos/vary segments in STORE instructions For Bifrost, we model load/store segments, for example for thread local storage. We need something similar on Valhall -- access modifiers. There are four access modifiers on Valhall, controlling memory subsystem optimizations for the access: none: Nothing may be assumed. Corresponds to "global". istream: Internally streaming within the GPU. Corresponds to "pos", as it's used for position stores. estream: Externally streaming outside the GPU. Corresponds to "vary", as it's used for varying stores. force: Force access in discarded threads. Corresponds to "tl", as it's required for correct behaviour of helper invocations that use the stack. If these access modifiers end up being useful outside these fixed purposes, we may need to rework this part of the IR. For now, this should suffice. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	aaa39f0e60	pan/bi: Model LEA_BUF_IMM in the IR Required for varying stores in malloced IDVS jobs on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	eba9ef4c25	pan/bi: Add LD_VAR_BUF_IMM.f16/f32 instructions For use on Valhall with memory-allocated IDVS jobs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	48a398bf5b	pan/bi: Generalize I->table for Valhall Can be reused for resource tables in a natural way. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	20891e75c2	pan/bi: Extend BLEND to take a register format Needed on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	3c817ed511	pan/bi: Model Valhall texture instructions These act like a TEXC+immediate. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	234d3efb9b	pan/va: Add memory access modifier to LOADs Might be required for correct spilling in some circumstances. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	79aa4af078	pan/va: Remap "store segment" to "memory access" For now, the difference does not matter. However it's better to model the actual hardware behaviour, rather than isomorphic driver behaviour, when we can do so. So fix the names. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	254a641290	pan/va: Fix LEA_BUF_IMM definition Technically the table is folded, too; the 0xD refers to table 61. But this instruction is more general than previously thought. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	7c798fbb9f	pan/va: Fix definitions of LD_VAR_BUF_IMM So close! However, LD_VAR_IMM is something else -- Bifrost-style varying interpolation, without a hardware buffer. For ES3, we'll need to support both. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	c62836661e	pan/va: Add TEX_GATHER instruction Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	65cb3af38a	pan/va: Add TEX_DUAL instruction Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	47b70ca584	pan/va: Add modifiers required for gathers Mostly isomorphic to Bifrost-style gathers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00

1 2 3 4 5 ...

139392 commits