fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-04 22:49:13 +02:00

Author	SHA1	Message	Date
Timothy Arceri	f1ef6517e8	glsl: remove Par-linking from the standalone linker lima was the last user of this feature so lets remove it. This will allow us to drop more soon to be unused glsl ir code once full nir linker support lands. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29519>	2024-06-18 12:34:52 +00:00
Timur Kristóf	0bf10ad4ad	radv: Use number of TES inputs for TCS-TES linking. This is to match what ac_nir_lower_tess_io_to_mem also does. Doesn't address any known bug, but it's theoretically possible that TCS outputs_written and TES inputs_read mismatch, so let's be on the safe side here. Fixes: `be49b02f05` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29696>	2024-06-18 12:06:22 +00:00
Timur Kristóf	0355364743	ac/nir/tess: Fix per-patch output VRAM mapping. VARYING_SLOT_PATCH0 is greater than 64 so it is wrong to use it with BITFIELD64_BIT. Check for VARYING_SLOT_TESS_LEVEL_* properly when mapping output locations in VRAM. Fixes: `2cf7f282df` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11253 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29696>	2024-06-18 12:06:21 +00:00
Timur Kristóf	0f0ebd8512	ac/nir/tess: Fix per-patch output LDS mapping. VARYING_SLOT_PATCH0 is greater than 64 so it is wrong to use it with BITFIELD64_BIT. Check for VARYING_SLOT_TESS_LEVEL_* properly when mapping output locations in LDS. Fixes: `c61eb54806` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29696>	2024-06-18 12:06:21 +00:00
Timur Kristóf	348b8859dc	ac/nir/tess: Only write tess factors that the TES reads. Otherwise we would write to a memory location reserved for another per-patch output. Fixes: `2cf7f282df` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11324 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29696>	2024-06-18 12:06:21 +00:00
Zan Dobersek	9845e99960	tu: avoid memory polling in occlusion query endings using ZPASS_DONE On newer hardware where ZPASS_DONE events are used for sample count writes the memory polling in occlusion query endings can be wholly avoided. A WFI is still required, but the performance gain is still in the range of 10% on the trivial occlusionquery demo. Signed-off-by: Zan Dobersek <zdobersek@igalia.com> Tested-by: Mike Lothian <mike@fireburn.co.uk> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29403>	2024-06-18 11:39:57 +00:00
Zan Dobersek	5653c52151	tu: fix ZPASS_DONE interference between occlusion queries and autotuner On newer devices where ZPASS_DONE events have sample count writing abilities the firmware expects these events to come in begin-end pairs, essentially corresponding to a typical occlusion query usage. Since this event is also used in the autotuner we have to avoid event pairs to be emitted in an interleaved fashion. Additional renderpass state now tracks whether a given renderpass contains an occlusion query. If so, autotuner will emit miscellaneous ZPASS_DONE events in order to form its own begin-end pairs before and after the renderpass commands. Occlusion query behavior inside a renderpass doesn't change. But when used outside of a renderpass, possible autotuner usage requires to again emit ZPASS_DONE events that end up forming begin-end pairs of these events both at the start and the end of the query. Signed-off-by: Zan Dobersek <zdobersek@igalia.com> Fixes: `4e6a1f8852` ("tu/autotune: Use `CP_EVENT_WRITE7::ZPASS_DONE` on A7XX") Tested-by: Mike Lothian <mike@fireburn.co.uk> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29403>	2024-06-18 11:39:57 +00:00
Job Noorman	6bc7cd6108	ir3: only add live-in phis for top-level intervals while spilling When both an interval and some of its children would be live-in, we used to add phis for all of them. This could lead to cases where the pressure after spilling was higher than before. This happens, for example, when both a split and its parent are live-in. Before spilling, the split wouldn't add to the pressure because its parent had already been inserted. After spilling, since we created a phi for the split, the link with its parent would be lost and it would add to the pressure. Fix this by only adding phis for top-level intervals and adding splits after them. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	18cd803cef	ir3: refactor ir3_spill.c to use the ir3_cursor/ir3_builder API There were a few places that used an instruction pointer to decide where new instructions should be created. NULL was used to add them at the end of the block. While fixing a spilling bug, a new option was needed to add instructions at the beginning of the block. This will be much easier to implement using cursors. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	1972db36c6	ir3: add ir3_cursor/ir3_builder helpers Whenever instructions need to be created at specific locations, ir3 often passes around an instruction pointer. When set, new instructions are added before or after it (depending on the context). When NULL, new instructions are added at the end of the block. This whole scheme is confusing. This patch adds ir3_cursor and ir3_builder structs and the associated helper functions. The API mirrors the one from nir_cursor/nir_builder. This patch does not refactor existing code to use the new API. This will happen in future patches. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	dc04fd8e62	ir3: restore interval_offset after liveness recalculation in shared RA This value is usually set by ir3_merge_regs. Since we don't need to call this again after shared RA, we have to copy it manually to the new liveness struct. Fixes: `fa22b0901a` ("ir3/ra: Add specialized shared register RA/spilling") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	3f3c190649	ir3: move liveness recalculation inside ir3_ra_shared Similar to how ir3_spill does it. This will make it easier to optimize this in the future. E.g., we only need to recalculate liveness when any instruction were added. Fixes: `fa22b0901a` ("ir3/ra: Add specialized shared register RA/spilling") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	7a5b198a44	ir3: index instructions before fixing up merge sets after spilling ir3_force_merge (through merge_merge_sets) expects instructions to be indexed. However, the instructions created during spilling would not be automatically indexed at this point. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	018d0ab805	ir3: make indexing instructions optional in ir3_merge_regs While fixing up merge sets after spilling, we need to index before calling ir3_merge_regs so it would be a waste to index again. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	17b155fede	ir3: expose instruction indexing helper for merge sets We will need it to fix up merge sets after spilling. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	1bc3b819e6	ir3: don't remove collects early while spilling It might happen that a collect that cannot be coalesced with one of its sources while spilling can be coalesced with it afterwards. In this case, we might be able to remove it in remove_src_early during spilling but not afterwards (because it may have a child interval). If this happens, we could end up with a register pressure that is higher after spilling than before. Prevent this by never removing collects early while spilling. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	eaec57ab6b	ir3: don't remove intervals for non-killed tex prefetch sources Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	70e10babea	ir3: correctly set wrmask for reload.macro We used to set it MASK(elems) which would break when not all elements are contiguous (which could happen for tex instructions after dce). Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	37c929ce5d	ir3: set offset on splits created while spilling Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	af6f82b954	ir3: fix handling of early clobbers in calc_min_limit_pressure Early clobbers should always add to the register pressure since they cannot overlap with sources. handle_instr in ir3_spill.c handles this properly but calc_min_limit_pressure did not. Fixes: `2ff5826f09` ("ir3/ra: Add IR3_REG_EARLY_CLOBBER") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	023c7351f2	ir3: fix crash in try_evict_regs with src reg try_evict_regs might end up calling check_dst_overlap which only works for dst regs. Make sure this doesn't happen for src regs. Fixes: `34803d15ab` ("ir3/ra: Add proper support for multiple destinations") Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	c24aad5867	ir3: set current instruction before all validation asserts The first assert happened before setting the current instruction which caused the error message to refer to the previous instruction. Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	f57bee676f	ir3: debug print limit pressure and post-spill max pressure Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	eadabc2eab	ir3: print dst_offset of spill.macro Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	ac2a582fac	ir3: print intervals when dumping merge sets Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:23 +00:00
Job Noorman	0a0ac6a72f	ir3: print sharedness/halfness of merge set regs This helps debugging issues when different types of registers end up in the same merge set. Signed-off-bype Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29497>	2024-06-18 11:09:22 +00:00
Eric Engestrom	39f5bbf871	ci/vkd3d: drop redundant "vkd3d-proton execution: SUCCESS" The `+ exit 0` on the next line says the same thing, and we no longer use that string for grepping the status anyway. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	9a9204764a	ci/vkd3d: drop `quiet` wrapper It could be useful for complex printf with embedded subshells, but here it's just spammy, which is the opposite of what it claims to be for. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	b6633e5880	ci/vkd3d: drop the "clear results folder without deleting the folder" logic I might be wrong about this one, but I don't see what it could be for. We don't want to delete too much either, in case this script gets called in the same job as another test suite. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	eddbadb0b1	ci/vkd3d: put `then` on the same line as the `if` to match the rest of the code style Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	57485f9a3d	ci/vkd3d: print URL to the vkd3d-proton.log file to make it easier to access Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	662b0ad3cd	ci/vkd3d: rename vkd3d test log file to end in .txt That way GitLab allows reading it in the browser instead of forcing a download. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	6f6a13f5fc	ci/vkd3d: print a real error message when failing to get the list of failing tests Instead of just `++ trap_err 1`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	e266b6287e	ci/vkd3d: limit the vulkaninfo capture to the driverInfo line This made for huge error messages when printing `Found $(vulkaninfo), expected $MESA_VERSION`, with the actual information lost in the flood. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	285ba60166	ci/vkd3d: group version check lines together The comment clearly applies to the check, not the variable, but I'm guessing stuff got added in between, so let's move the code and its comment back together. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	bec7b417ca	ci/vkd3d: don't ignore errors There was not even a matching `set -e` to restore the state afterwards, so we just stopped caring about errors from that point forward. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:55 +00:00
Eric Engestrom	1c23b95aa4	ci/vkd3d: stop ignoring errors in a block where errors can't happen The `if` will catch the exit code of what it calls (that's the whole point). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:54 +00:00
Eric Engestrom	b50c8217ef	ci/vkd3d: fix error message printing Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:54 +00:00
Eric Engestrom	7a1f28c1eb	ci/vkd3d: drop override of job artifacts `.b2c-test` always exports all of `results/`, so having an override here to only export one of the files in that folder and only `on_success` is not very useful. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:54 +00:00
Eric Engestrom	4db58a04f9	ci/vkd3d: print a message when the expected failures file is missing Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:54 +00:00
Eric Engestrom	b1f82ce646	ci/vkd3d: deduplicate the diff between the expectation and the results We're seeing weird errors where the results file has disappeared, so let's start by combining the "is this right?" and "what's wrong?" logic into one. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29749>	2024-06-18 09:58:54 +00:00
Danylo Piliaiev	e602a7a392	freedreno/replay: Fix replaying without SET_IOVA Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29753>	2024-06-18 09:46:19 +00:00
Danylo Piliaiev	7c07c44d57	freedreno/rddecompiler: Make possible to use original shader Sometimes decompiled shader isn't easily compiled back into the same binary, e.g. when some part of bitset is not decoded. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29753>	2024-06-18 09:46:19 +00:00
Kenneth Graunke	9e750f00c3	intel/brw: Make opt_copy_propagation_defs clean up its own trash Copy propagation often eliminates all uses of an instruction. If we detect that we've done so, we can eliminate the instruction ourselves rather than leaving it hanging until the next DCE pass. This saves some CPU time as other passes don't see dead code. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28666>	2024-06-18 09:02:25 +00:00
Kenneth Graunke	2af84c2d49	intel/brw: Use the defs-based copy propagation along with the old one The new def-based pass works better in many cases, and should be less resource intensive. However, the limited visibility of the defs-based pass due to many values not being SSA yet makes it unable to fully replace the old pass. Try the new one, and if it can't make progress, then try the old one. That way, things will mostly be handled by the new pass, but everything that was being cleaned up still will be. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28666>	2024-06-18 09:02:25 +00:00
Kenneth Graunke	580e1c592d	intel/brw: Introduce a new SSA-based copy propagation pass (Quite a few of the restrictions here are ported from the old pass.) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28666>	2024-06-18 09:02:25 +00:00
Kenneth Graunke	9690bd369d	intel/brw: Delete old local common subexpression elimination pass We no longer use this older pass, so there's no need to keep it. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28666>	2024-06-18 09:02:25 +00:00
Kenneth Graunke	8f09c58ddc	intel/brw: Switch to the new defs-based global CSE pass While the limited visibility due to partial SSA is a downside to the new pass, it has a huge number of advantages that make it worth switching over even now. It's much more efficient, can eliminate redundant memory loads across blocks, and doesn't generate loads of unnecessary copies that other passes have to clean up. This means we also eliminate the infighting between the old CSE, coalescing, and copy propagation passes. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28666>	2024-06-18 09:02:25 +00:00
Kenneth Graunke	234c45c929	intel/brw: Write a new global CSE pass that works on defs This has a number of advantages compared to the pass I wrote years ago: - It can easily perform either Global CSE or block-local CSE, without needing to roll any dataflow analysis, thanks to SSA def analysis. This global CSE is able to detect and coalesce memory loads across blocks. Although it may increase spilling a little, the reduction in memory loads seems to more than compensate. - Because SSA guarantees that values are never written more than once, the new CSE pass can directly reuse an existing value. The old pass emitted copies at the point where it discovered a value because it had no idea whether it'd be mutated later. This led it to generate a ton of trash for copy propagation to clean up later, and also a nasty fragility where CSE, register coalescing, and copy propagation could all fight one another by generating and cleaning up copies, leading to infinite optimization loops unless we were really careful. Generating less trash improves our CPU efficiency. - It uses hash tables like nir_instr_set and nir_opt_cse, instead of linearly walking lists and comparing each element. This is much more CPU efficient. - It doesn't use liveness analysis, which is one of the most expensive analysis passes that we have. Def analysis is cheaper. In addition to CSE'ing SSA values, we continue to handle flag writes, as this is a huge source of CSE'able values. These remain block local. However, we can simply track the last flag write, rather than creating entire sets of instruction entries like the old pass. Much simpler. The only real downside to this pass is that, because the backend is currently only partially SSA, it has limited visibility and isn't able to see all values. However, the results appear to be good enough that the new pass can effectively replace the old pass in almost all cases. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28666>	2024-06-18 09:02:25 +00:00
Kenneth Graunke	2b30b3bbd4	intel/brw: Print defs in dump_instructions Like NIR, we print SSA defs as %1, %2, and so on. The number here is the VGRF number. VGRFs that don't correspond to a SSA def remain printed as vgrf1, vgrf2, and so on. This makes it much easier to see what values are SSA and which aren't. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28666>	2024-06-18 09:02:25 +00:00

1 2 3 4 5 ...

190839 commits