fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 09:18:10 +02:00

Author	SHA1	Message	Date
Erik Faye-Lund	45e7e16222	pan: use imm-helpers Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23855>	2023-06-29 07:08:18 +00:00
Alyssa Rosenzweig	815efcdf7e	nir: Use nir_builder_create perl -p0e 's/nir_builder ([^;]);\snir_builder_init\(&\1, /nir_builder \1 = nir_builder_create(/g' -i $(git grep -l nir_builder_init) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23860>	2023-06-27 18:13:02 +00:00
Alyssa Rosenzweig	99a00e2247	treewide: Use nir_trim_vector more Via Coccinelle patches @@ expression a, b, c; @@ -nir_channels(b, a, (1 << c) - 1) +nir_trim_vector(b, a, c) @@ expression a, b, c; @@ -nir_channels(b, a, BITFIELD_MASK(c)) +nir_trim_vector(b, a, c) @@ expression a, b; @@ -nir_channels(b, a, 3) +nir_trim_vector(b, a, 2) @@ expression a, b; @@ -nir_channels(b, a, 7) +nir_trim_vector(b, a, 3) Plus a fixup for pointless trimming an immediate in RADV and radeonsi. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23352>	2023-06-06 18:52:25 +00:00
Erik Faye-Lund	28b1c5bca1	nir: use nir_i{ne,eq}_imm helpers We already have these, so let's use them more. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23393>	2023-06-05 13:40:07 +00:00
Alyssa Rosenzweig	2b2685f551	pan/lower_framebuffer: Use nir_replicate Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Italo Nicola <italonicola@collabora.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23259>	2023-05-30 16:24:21 -04:00
Alyssa Rosenzweig	7f6491b76d	nir: Combine if_uses with instruction uses Every nir_ssa_def is part of a chain of uses, implemented with doubly linked lists. That means each requires 2 * 64-bit = 16 bytes per def, which is memory intensive. Together they require 32 bytes per def. Not cool. To cut that memory use in half, we can combine the two linked lists into a single use list that contains both regular instruction uses and if-uses. To do this, we augment the nir_src with a boolean "is_if", and reimplement the abstract if-uses operations on top of that list. That boolean should fit into the padding already in nir_src so should not actually affect memory use, and in the future we sneak it into the bottom bit of a pointer. However, this creates a new inefficiency: now iterating over regular uses separate from if-uses is (nominally) more expensive. It turns out virtually every caller of nir_foreach_if_use(_safe) also calls nir_foreach_use(_safe) immediately before, so we rewrite most of the callers to instead call a new single `nir_foreach_use_including_if(_safe)` which predicates the logic based on `src->is_if`. This should mitigate the performance difference. There's a bit of churn, but this is largely a mechanical set of changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22343>	2023-04-07 23:48:03 +00:00
Karol Herbst	87aeea20ac	panfrost: move max_thread_count and take reg_count into account We'll need it to report proper thread counts for OpenCL. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19855>	2023-03-31 20:29:00 +00:00
Faith Ekstrand	e001995dc5	util,mesa,panfrost: Drop some author tags This is what git blame is for Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22120>	2023-03-26 00:16:25 +00:00
Alyssa Rosenzweig	f888994679	panfrost: Move panfrost_sysvals to GL driver This shouldn't be used by anything else at this point. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	3e64b13193	panfrost: Move sysvals to GL driver struct Only the GL driver produces/consumes these, they shouldn't be in the common shader_info. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	ffb9919c2f	panfrost: Lower sysvals in GL Drop the backend compiler sysval handling in favour of the pass in the GL driver, bringing us into compliance with Ekstrand's rule. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	2745daa05a	pan/lower_framebuffer: Lower MSAA blend shaders Do it explicitly in NIR rather than implicitly in the Midgard compiler. This avoids a nasty sideband input for the render target formats and sample count, for blend shaders on midgard only. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	ca2042f359	panfrost: Preprocess shaders in the driver This is a flag-day change to how we compile. We split preprocessing NIR into a separate step from compiling, giving the driver a chance to apply its own lowerings on the preprocessed NIR before the final optimization loop. During that time, the different producers of NIR (panfrost, panvk, blend shaders, blit shaders...) will be able to (differently) lower system values. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	bccd6d3880	pan/lower_framebuffer: Use nir_shader_instructions_pass Removes a lot of indentation, and improves metadata handling. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	8059eb1577	pan/lower_framebuffer: Only call for FS It doesn't make sense for shader stages other than fragment (and blend which is fragment-like), assert this. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	c333c0ea57	panfrost: Remove unused inputs.nr_cbufs Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:46 +00:00
Alyssa Rosenzweig	da0815fb9b	panfrost: Remove inputs->blend.rt This sideband input is now unused, as the information is available locally within the NIR as it should be. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00
Alyssa Rosenzweig	8db30010dc	pan/bi: Lower load_output to make sysval explicit See previous commits for justification. Later, we'll split up NIR processing in a few steps to give the caller a chance to lower the sysval, at which point the goofy inputs here will go away. v2: Only lower in fragment shaders. Likely harmless to run elsewhere but still wrong because the location enum is defined per-stage. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00
Alyssa Rosenzweig	5c9ffaad8f	pan/bi: Lower sample mask writes in NIR This uses the new NIR sysvals to avoid materializing magic sysvals in the driver, getting us closer to the Ekstrand Rule. v2: Only lower for fragment shaders. Lowering in vertex shaders should be a no-op, except that FRAG_RESULT_SAMPLE_MASK shadows a VARYING_SLOT for fog coords, causing v1 of this patch to regress fog. Caught by the G52 piglit job in CI. Thank you, Marge. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20906>	2023-03-23 23:53:45 +00:00
Alyssa Rosenzweig	63f30802eb	pan/lower_framebuffer: Operate on lowered I/O This turns the early pass into a late pass, which is important because it depends on the shader key and therefore should be called by the driver instead of the compiler preprocessing. It's also simpler this way. The shader key work is waiting for review in another merge request. In the mean time, this patch will let us run blend lowering early for blend shaders on Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20836>	2023-02-17 23:09:19 +00:00
Alyssa Rosenzweig	1b6607fa13	nir: Augment raw_output_pan with IO_SEMANTICS+BASE This is a form of lowered I/O, it needs I/O semantics so we can know the location to store to instead of passing via a sideband. Over in !20906, we will use the BASE to lower blend shader with multisampling in NIR instead of passing the number of samples and framebuffer format along a sideband to the Midgard compiler. That's not needed for this series (this patch was cherry-picked to avoid regressions in the lower_blend changes) but it's good to model the full form of the I/O lowered intrinsic here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20836>	2023-02-17 23:09:19 +00:00
Alyssa Rosenzweig	0afd691f29	panfrost: clang-format the tree This switches us over to Mesa's code style [1], normalizing us within the tree. The results aren't perfect, but they bring us a hell of a lot closer to the rest of the tree. Panfrost doesn't feel so foreign relative to Mesa with this, which I think (in retrospect after a bunch of years of being "different") is the right call. I skipped PanVK because that's paused right now. find panfrost/ -type f -name '.h' \| grep -v vulkan \| xargs clang-format -i; find panfrost/ -type f -name '.c' \| grep -v vulkan \| xargs clang-format -i; clang-format -i gallium/drivers/panfrost/.c gallium/drivers/panfrost/.h ; find panfrost/ -type f -name '*.cpp' \| grep -v vulkan \| xargs clang-format -i [1] https://docs.mesa3d.org/codingstyle.html Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20425>	2022-12-24 02:22:57 +00:00
Alyssa Rosenzweig	f6d73ea7b4	pan/lower_framebuffer: Remove unused pack Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20420>	2022-12-23 16:27:16 +00:00
Alyssa Rosenzweig	5f93feed61	panfrost: Don't merge workgroups with variable shared mem If nir->info.shared_size = 0 but grid->variable_shared_mem > 0, the shader uses shared memory but the compiler may not realize that. We need to disable workgroup merging even in this case. The alternate approach is to statically check for shared intrinsics in the compiler, but this is a bit easier all things considered. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18581>	2022-11-02 23:36:56 +00:00
Alyssa Rosenzweig	2316b80d77	panfrost: Don't use nir_variable to link varyings NIR deemphasizes nir_variable. We want to transition off it. Instead of walking the list of variables and playing games with the GLSL types to collect varying information, walk the list of instructions and use the I/O semantics to collect similar information. In addition to avoiding the reliance on nir_variable, this fixes handling of struct varyings under certain circumstances. Such programs are compiled by the GLES3.1 CTS but not used, so without this fix, the affected tests would regress when precompiling. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19363>	2022-11-02 16:52:11 +00:00
Alyssa Rosenzweig	d0281fc16a	pan/mdg: Use bifrost_nir_lower_store_component Move the pass from the Bifrost compiler to the Midgard/Bifrost common code directory, and take advantage of it on Midgard, where it fixes the same tests as it fixed originally on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19363>	2022-11-02 16:52:11 +00:00
Alyssa Rosenzweig	2a6338722e	panfrost: Don't use nir_variable in the compilers More future proof, simpler, and works with early I/O lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19456>	2022-11-02 04:22:06 +00:00
Alyssa Rosenzweig	1ff3b87ba2	panfrost: Enable rendering to 16-bit and 32-bit Bifrost onwards handle this in hardware, and the Midgard lowering isn't too terrible. Enable the format, otherwise desktop GL apps such as Hacknet try to render to the format and get an incomplete framebuffer. Cc stable because apparently we've been advertising this format unintentionally as a result of some other interaction? Unclear how Hacknet is hitting this, maybe it's an app bug. Shrug, it's not a big deal regardless. Additionally, we need to restrict texturing from 32-bit normalized due to a restriction added with the v7 pixel format fiasco. That means restricting rendering to 32-bit normalized on v7 onwards. Closes: #7251 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Tested-by: Dang Huynh <danct12@disroot.org> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19358>	2022-10-29 18:23:55 +00:00
Alyssa Rosenzweig	0955fe8fe2	panfrost: Use compute-based XFB on Midgard Now we're back to a single XFB implementation for all gens. Fixes: KHR-GLES31.core.draw_indirect.advanced-twoPasses-transformFeedback-arrays KHR-GLES31.core.draw_indirect.advanced-twoPasses-transformFeedback-elements Cc: mesa-stable Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19238>	2022-10-27 20:13:11 +00:00
Alyssa Rosenzweig	b261a18550	panfrost: Honour flush-to-zero controls on Valhall Fixes math_bruteforce.atan2 and contractions tests. For OpenCL, we want to flush fp32 and preserve fp16, applying to both inputs and outputs so F16_TO_F32 acts as preserve, which implements CL spec text: > Denormalized numbers for the half data type which may be generated when converting a float to a half using vstore_half and converting a half to a float using vload_half cannot be flushed to zero Note that our libclc builds flush denorms and rusticl does not advertise denorms so we're expected to flush to zero. rusticl correctly sets the desired float controls, we just have to match to the hardware requirements. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18656>	2022-09-19 17:22:58 +00:00
Alyssa Rosenzweig	e55b60d0bb	panfrost: Route shader-db to debug, not stderr This brings us in line with the rest of Mesa, fixing multithreaded shader-db reports and the Total CPU Time report at the end. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18351>	2022-09-01 14:50:24 +00:00
Alyssa Rosenzweig	d680560970	panfrost: Handle untyped_color_outputs on Bifrost For untyped_color_outputs, we need to ignore the type of the colour output in the shader and instead use the type from the format. We have all the information to do this at blend descriptor pack time, but not at shader compile time. This means we need a (somewhat expensive) fixup in this edge case to ingest NIR-to-TGSI. This will prevent a regression from the rest of the series. Although the register_format field is also present on Valhall blend descriptors, it is ignored so we don't need the fixup there. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17841>	2022-08-21 19:37:10 +00:00
Icecream95	379ae6d823	panfrost: Emit the correct number of attributes create_vertex_elements_state is sometimes called with a too large num_elements argument, for example with util_blitter, which causes a buffer overflow. There is no documentation to forbid this practice, so don't rely on so->num_elements being correct and instead use the vertex shader attribute count, which matches the value used to allocate the descriptors. Use attributes_read_count rather than attribute_count because the latter also includes images and PAN_VERTEX_ID/PAN_INSTANCE_ID. Fixes: `76de3e691c` ("panfrost: Merge attribute packing routines") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17447>	2022-07-23 00:56:10 +00:00
Alyssa Rosenzweig	fbe430fae9	panfrost: Move bifrost_lanes_per_warp to common Whereas the compiler needs to know the warp size for lowering divergent indirects, the driver needs to know it to report the subgroup size. Move the Bifrost-specific helper to common and add the trivial implementation for Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17265>	2022-07-08 01:14:55 +00:00
Alyssa Rosenzweig	ed5a5a9d6d	panfrost: Wire up transfrom feedback sysvals Wire the Gallium interface for transform feedback up to the system values that will be fed into our lowering code. This is based on our existing transform feedback implementation for Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15720>	2022-06-04 14:35:56 +00:00
Alyssa Rosenzweig	4e341e70d8	pan/bi: Handle transform feedback intrinsics Translate the intrinsics we introduced to lower away transform feedback into Panfrost system values which the GL driver can handle. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15720>	2022-06-04 14:35:56 +00:00
Alyssa Rosenzweig	67f5721349	panfrost: Set allow_rotating_primitives On Valhall, the driver should set this flag if the hardware may rotate primitives. This happens if: 1. The rasterization of lines does not matter, AND 2. The provoking vertex does not matter. The first condition we may satisfy by checking for LINES and the second by checking for flat shading. Otherwise, we should set this flag to allow optimizations. This may be more efficient for tiling. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16748>	2022-05-30 14:00:55 +00:00
Icecream95	a4323b0979	panfrost: Only write depth / stencil once if MRT is used We can't assume that RT0 will be written, so this has to be based on whether a combined store has already been emitted, not the location of the store. Emit a non-special combined_store intrinsic that only writes colour for the other RTs, as reordering stores breaks the Midgard compiler. Fixes: `d37e901e35` ("pan/mdg: Add new depth store lowering") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6527 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16685>	2022-05-24 16:13:33 +00:00
Icecream95	9f9ed959bd	nir: Add store_combined_output_pan BASE back It's meaningful for this intrinsic and so does not add noise to the lowering pass. (Although dual-source writes must be to RT 0, depth and stencil writes, which store_combined_output_pan is also used for, can still be done with MRT enabled.) Fixes: `5c168f09eb` ("nir: Eliminate store_combined_output_pan BASE") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16685>	2022-05-24 16:13:33 +00:00
Jason Ekstrand	f0a47d8602	bifrost,midgard: Allow providing a fixed sysval layout Vulkan doesn't need nearly as many system values and would like to bake its layout up-front instead of having it provided by the back-end compiler. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16276>	2022-05-12 10:53:16 +00:00
Jason Ekstrand	e07a296398	panfrost: Add some sanity checking for sysvals Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16276>	2022-05-12 10:53:15 +00:00
Jason Ekstrand	4e60f0655a	panfrost,panvk: Make fixed_sysval_ubo < 0 mean compiler-assigned In `3559efb9bf` ("panfrost: Allow passing an explicit UBO index for the sysval UBO"), an explicit UBO index was added and it was implicitly assumed that it would be > num_ubos. This was convenient because it meant 0, the default for designated initializers, implicitly meant compiler-assigned. However, we're about to move the sysval UBO to 0 which breaks this assumption. Also, we don't want the back-end compiler to even look at num_ubos since it's meaningless in Vulkan. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16276>	2022-05-12 10:53:15 +00:00
Jason Ekstrand	7aec8db161	midgard: Handle FB fetch from non-vec4 output variables. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16309>	2022-05-11 14:47:33 +00:00
Alyssa Rosenzweig	0fcddd4d2c	pan/bi: Rework varying linking on Valhall Valhall introduces hardware-allocated varyings. Instead of allocating varying descriptors on the CPU with a slot based interface, the driver just tells the hardware how many bytes to allocate per vertex and loads/stores with byte offsets. This is much nicer! However, this requires us to rework our linking code to account for separable shaders. With separable shaders, we can't rely on driver_location matching between stages, and unlike on Midgard, we can't resolve the differences with curated command stream descriptors. However, we can rely on slots matching. So we should "just" determine the byte offsets based on the slot, and then separable shaders work. For GLES, it really is that easy. For desktop GL, it's not -- desktop GL brings unpredictable extra varyings like COL1 and TEX2. Allocating space for all of these unconditionally would hamper performance. To cope, we key fragment shaders to the set of non-GLES varyings written by the linked vertex shader. Then we may define an efficient ABI, where only apps only pay for what they use. Fixes various tests in dEQP-GLES31.functional.separate_shader.random.* on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16310>	2022-05-04 13:07:59 +00:00
Icecream95	2864094f69	pan/bi: Use texture index instead of sampler for message preloading The VAR_TEX definition in ISA.xml only has a field for texture_index, so trying to read sampler_index will return zero; read from texture_index instead, and rename other fields for consistency. The texture and sampler indices must be equal for VAR_TEX to be used, so either name could be used for the field. Fixes the wrong textures being used in Thief. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6219 Fixes: `eb1479bda2` ("pan/bi: Support message preloading") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16255>	2022-05-02 12:50:44 +00:00
Jason Ekstrand	3f824e0e85	panvk: Eliminate unused vertex attributes We use nir_assign_io_var_locations() which compacts the varyings and eliminates any unused input slots. We need to do the same thing when processing pVertexAttributeDescriptions[] or else we'll end up with mismatches between the shader and the state setup code. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16183>	2022-04-27 14:18:25 +00:00
Alyssa Rosenzweig	ccdec68aee	pan/bi: Report whether workgroups can be merged This flag gates a Valhall hardware optimization for compute shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	f487c09045	pan/bi: Make psiz variants Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	b371e509da	panfrost: Add a table for images For the default Valhall ABI. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:44 +00:00
Alyssa Rosenzweig	7bda838c56	panfrost: Push twice as many uniforms The limit for Bifrost is twice as high as previously thought -- the limit is 64 slots of FAU, not 64 words. Each slot is 2 words. We can push twice as much, saving a considerable number of cycles in some cases. total instructions in shared programs: 2454260 -> 2431502 (-0.93%) instructions in affected programs: 845176 -> 822418 (-2.69%) helped: 3376 HURT: 304 helped stats (abs) min: 1.0 max: 60.0 x̄: 7.92 x̃: 6 helped stats (rel) min: 0.13% max: 45.45% x̄: 4.60% x̃: 4.11% HURT stats (abs) min: 1.0 max: 60.0 x̄: 13.06 x̃: 8 HURT stats (rel) min: 0.16% max: 35.09% x̄: 7.58% x̃: 6.52% 95% mean confidence interval for instructions value: -6.50 -5.87 95% mean confidence interval for instructions %-change: -3.75% -3.43% Instructions are helped. total tuples in shared programs: 1963383 -> 1951560 (-0.60%) tuples in affected programs: 638622 -> 626799 (-1.85%) helped: 2959 HURT: 573 helped stats (abs) min: 1.0 max: 54.0 x̄: 5.61 x̃: 4 helped stats (rel) min: 0.15% max: 28.57% x̄: 3.61% x̃: 3.12% HURT stats (abs) min: 1.0 max: 50.0 x̄: 8.35 x̃: 6 HURT stats (rel) min: 0.25% max: 27.34% x̄: 6.24% x̃: 4.92% 95% mean confidence interval for tuples value: -3.61 -3.08 95% mean confidence interval for tuples %-change: -2.18% -1.85% Tuples are helped. total clauses in shared programs: 387817 -> 365111 (-5.85%) clauses in affected programs: 135527 -> 112821 (-16.75%) helped: 3489 HURT: 25 helped stats (abs) min: 1.0 max: 43.0 x̄: 6.52 x̃: 5 helped stats (rel) min: 0.82% max: 58.33% x̄: 17.48% x̃: 15.87% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.56 x̃: 1 HURT stats (rel) min: 2.94% max: 11.11% x̄: 6.87% x̃: 6.67% 95% mean confidence interval for clauses value: -6.67 -6.26 95% mean confidence interval for clauses %-change: -17.65% -16.96% Clauses are helped. total cycles in shared programs: 201842.21 -> 168754.04 (-16.39%) cycles in affected programs: 84035.50 -> 50947.33 (-39.37%) helped: 3547 HURT: 136 helped stats (abs) min: 0.041665999999999315 max: 54.0 x̄: 9.33 x̃: 8 helped stats (rel) min: 0.17% max: 80.77% x̄: 36.10% x̃: 36.84% HURT stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.12 x̃: 0 HURT stats (rel) min: 0.18% max: 12.24% x̄: 1.18% x̃: 0.61% 95% mean confidence interval for cycles value: -9.26 -8.71 95% mean confidence interval for cycles %-change: -35.34% -34.11% Cycles are helped. total arith in shared programs: 74918.46 -> 75022.62 (0.14%) arith in affected programs: 22471.04 -> 22575.21 (0.46%) helped: 1571 HURT: 1492 helped stats (abs) min: 0.041665999999999315 max: 1.125 x̄: 0.17 x̃: 0 helped stats (rel) min: 0.17% max: 40.00% x̄: 2.50% x̃: 1.96% HURT stats (abs) min: 0.041665999999999315 max: 2.375 x̄: 0.25 x̃: 0 HURT stats (rel) min: 0.16% max: 100.00% x̄: 5.35% x̃: 2.37% 95% mean confidence interval for arith value: 0.02 0.05 95% mean confidence interval for arith %-change: 1.08% 1.56% Arith are HURT. total ldst in shared programs: 174812 -> 137889 (-21.12%) ldst in affected programs: 81319 -> 44396 (-45.41%) helped: 3722 HURT: 0 helped stats (abs) min: 1.0 max: 62.0 x̄: 9.92 x̃: 8 helped stats (rel) min: 1.82% max: 100.00% x̄: 47.18% x̃: 43.75% 95% mean confidence interval for ldst value: -10.20 -9.64 95% mean confidence interval for ldst %-change: -47.97% -46.39% Ldst are helped. total quadwords in shared programs: 1757124 -> 1714130 (-2.45%) quadwords in affected programs: 584065 -> 541071 (-7.36%) helped: 3474 HURT: 173 helped stats (abs) min: 1.0 max: 90.0 x̄: 12.66 x̃: 9 helped stats (rel) min: 0.26% max: 34.18% x̄: 8.78% x̃: 8.33% HURT stats (abs) min: 1.0 max: 26.0 x̄: 5.76 x̃: 4 HURT stats (rel) min: 0.45% max: 20.66% x̄: 4.48% x̃: 2.63% 95% mean confidence interval for quadwords value: -12.21 -11.37 95% mean confidence interval for quadwords %-change: -8.36% -7.95% Quadwords are helped. total threads in shared programs: 52898 -> 53142 (0.46%) threads in affected programs: 262 -> 506 (93.13%) helped: 250 HURT: 6 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.92 0.99 95% mean confidence interval for threads %-change: 93.69% 99.28% Threads are helped. total spills in shared programs: 161 -> 107 (-33.54%) spills in affected programs: 54 -> 0 helped: 27 HURT: 0 total fills in shared programs: 1386 -> 796 (-42.57%) fills in affected programs: 590 -> 0 helped: 27 HURT: 0 Fixes: `d4dccea0ba` ("panfrost: Add UBO push data structure") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15239>	2022-03-04 15:22:04 +00:00

1 2 3 4 5

206 commits