fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 11:38:06 +02:00

Author	SHA1	Message	Date
Iván Briano	8423998d69	hasvk: fix non matching image/view format attachment resolve Port of `5a7e58a430` ("anv: fix non matching image/view format attachment resolve") to hasvk. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31696>	2024-10-17 20:24:37 +00:00
Lionel Landwerlin	02294961ee	anv: stop using a binding table entry for gl_NumWorkgroups This will make things easier in situations where we don't want to use the binding table at all (indirect draws/dispatches). The mechanism is simple, upload a vec3 either through push constants (<= Gfx12.0) or through the inline parameter register (>= Gfx12.5). In the shader, do this : if vec.x == 0xffffffff: addr = pack64_2x32 vec.y, vec.z vec = load_global addr This works because we limit the maximum number of workgroup size to 0xffff in all dimension : maxComputeWorkGroupCount = { 65535, 65535, 65535 }, So we can use the large values to signal the need for indirect loading. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31508>	2024-10-17 19:35:59 +00:00
Lionel Landwerlin	97b17aa0b1	brw/nir: rework inline_data_intel to work with compute This intrinsic was initially dedicated to mesh/task shaders, but the mechanism it exposes also exists in the compute shaders on Gfx12.5+. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31508>	2024-10-17 19:35:59 +00:00
Lionel Landwerlin	1dc125338e	brw: fix mesh fence emission In SIMD32, the fence instruction is currently going to read grf0-3 leading to such assertions in the backend : ../src/intel/compiler/brw_fs_reg_allocate.cpp:206: void fs_visitor::calculate_payload_ranges(bool, unsigned int, int*) const: Assertion `j < payload_node_count' failed. The reason we haven't seen the problem yet is that there always enough payload register to accomodate this. But the following change is going to make the inline parameter register optional. Since SHADER_OPCODE_MEMORY_FENCE is emitted in the generator as SIMD1 NoMask (see brw_memory_fence), we can limit ourselves to SIMD1 exec_all() in the IR as well so that the IR accounts for grf0 as a source. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31508>	2024-10-17 19:35:59 +00:00
Lionel Landwerlin	b2c5ca0ade	brw: remove rebuild single element special case No shader-db difference on DG2. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31508>	2024-10-17 19:35:59 +00:00
Lionel Landwerlin	19eb601cfc	brw: avoid clashing nested loop indices Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31508>	2024-10-17 19:35:59 +00:00
Lionel Landwerlin	f5d123b977	brw: delay printf lowering Useful to insert debug traces a bit later in the lowering process (in particular after load/store vectorization). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31508>	2024-10-17 19:35:59 +00:00
Lionel Landwerlin	be3f62af15	brw: remove unused prototype Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31508>	2024-10-17 19:35:59 +00:00
Lionel Landwerlin	ea2bbe3271	anv: use stage mask to deduce cs/pb-stall requirements When flushing the render target cache for future operations, we need a stall at pixel scoreboard. We likely didn't see any issue until now because a change in render target added the pb-stall. When using a 2 compute shaders with the following pattern : vkCmdDispatch() vkCmdPipelineBarrier() ImageBarrier with (src\|dst)AccessMask=0 & identical layout vkCmdDispatch() we should ensure that the first dispatch is completed before executing the second one, otherwise they can race to on resource accesses. This fixes failures in some new CTS tests. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31676>	2024-10-17 11:55:33 +00:00
Georg Lehmann	cba575f4df	nir: always emit ddx intrinsics Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Georg Lehmann	6cb6bc7133	elk: remove alu fddx/fddy check Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Dylan Baker	bf3b1d8ea9	anv: when handling descriptor sets, layout cannot be NULL if dynamic_offets are used We have no cases where we intentionally pass a NULL layout when dynamic offsets, and doing so would cause a null dereference. Le't asd an assert for that. CID: 1620447 Fixes: `f39cd30f4f` ("anv: Track all the descriptor sets") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31638>	2024-10-16 19:54:07 +00:00
Dylan Baker	0714fbff8f	anv: assert that we don't read off the end color_att array Coverity notices that we've insured that index index is < MAX_RTS in one case, but that it must be greater in one case. Since `color_att_count` is a uint32_t, it can easily exceed MAX_RTS (8), and would thus create an out-of-bounds read situation. While the type system would allow this, the actually implementation shouldn't, so an assert should make Coverity happy and help us check our assumption. CID: 1620440 Fixes: `d2f7b6d5a7` ("anv: implement VK_KHR_dynamic_rendering_local_read") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31640>	2024-10-16 19:09:25 +00:00
Paulo Zanoni	4d60f905b3	anv/trtt: extract anv_trtt_first_bind_init_queue() The code that initializes each queue got big enough that the repetitive error handling is getting ugly and it could benefit from being on its own function. v2: Rebase, try to improve the comments. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	8280a6fee9	anv/trtt: convert anv_trtt_bind arrays to util_dynarray Since the L2 bug fix we've been overestimating l3l2_binds by a lot in most of the cases: almost every single call to anv_sparse_bind_trtt ends up using either 0 or 1 elements for l3l2_binds, with occasionally something using 512 or more. By switching to util_dynarray we can guarantee the best of every case: - l1_binds will remain a stack array for the vast majority of the calls - even more than before, since STACK_ARRAY was limited to 8 elements and now we do 32 - l1 will be properly dimensioned without the need for reallocs - l3l2_binds will be completely empty most of the times and only trigger allocations when necessary Here's the top 10 most common results of anv_sparse_bind_trtt() for a trace of Assassin's Creed: Valhalla. The first column is how many times we had that case while running the trace. After this patch, all these cases will proceed without any memory allocations. 168 trtt_binds: num_vm_binds:04 l3l2:0000 l1:0004 344 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0004 420 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0012 422 trtt_binds: num_vm_binds:04 l3l2:0000 l1:0008 479 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0024 560 trtt_binds: num_vm_binds:03 l3l2:0000 l1:0003 1005 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0002 1024 trtt_binds: num_vm_binds:02 l3l2:0000 l1:0004 2145 trtt_binds: num_vm_binds:02 l3l2:0000 l1:0002 3735 trtt_binds: num_vm_binds:01 l3l2:0000 l1:0001 Only 70 out of total 11340 calls to anv_sparse_bind_trtt() contained l3l2 elements. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	5839b9d083	anv/trtt: remove TODO comment regarding the reloc list We use 2MB page table BOs, as defined by ANV_TRTT_PAGE_TABLE_BO_SIZE. Each BO is enough to hold 512 pages, since each one has 4096 bytes. Each L1 page can fit 1024 entries of 64kb size, which means our 512 pages should be able to fit a little less than 32gb of sparse resource memory, since we also need some L2 pages and an L3 page. I don't see any real world application using more than a single BO. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	e047365d76	anv/trtt: delay batch garbage collection Move it past the (n_l3l2_binds == 0 && n_l1_binds == 0) check so we don't end up trying to do garbage collection more often than we submit batches. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	acb759c44f	anv/trtt: decrement trtt->timeline_val when submission fails Otherwise code such as anv_sparse_trtt_garbage_collect_batches() may end up stuck waiting forever on a timeline of a submission that failed. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	0802bbd486	anv/trtt: don't submit empty batches when there are no binds to do The application can submit bind operations where it simply resets state that is already in our page tables, so there's nothing to do. Before commit `7da5b1caef` ("anv: move trtt submissions over to the anv_async_submit") we would simply return and not submit any batches when this happened, but the commit reorganized things in a way where we started submitting empty batches instead. Fix this by simply jumping out when we detect this case. Because of this, rename the "error" labels to "out" as they can now happen on a happy case. It should be noted that an alternative to this implementation would be to move all the handling of 'submit' to after the n_lX_binds check, but this would put all the initialization inside the trtt->mutex, creating extra contention even when we have stuff to bind. Since the "there's nothing to bind" check is now rare (after we stopped doing NULL binds during resource creation), it is probably better to reduce lock contention in the common case at the expense of a little more CPU in the rare case. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	aea9ac47d2	anv/trtt: fix error handling when adding binds We're missing a check for 'result' in the middle of a loop and we have an unnecessary check for 'result' after the loop. Fixes: `7da5b1caef` ("anv: move trtt submissions over to the anv_async_submit") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	fd43c4a973	anv/trtt: unset trtt->l3_addr if initialization fails There is a lot that can go wrong during initialization after we assign trtt->l3_addr, and we use its value to check if trtt is initialized. If an initialization fails after l3_addr is already assigned, the next bind will attempt to use the leftover values from the failed initialization attempt and will likely cause all sorts of random errors. So when we fail, just set l3_addr back to 0, causing the next bind to attempt to initialize everything again. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	c832ed6bd2	anv/trtt: ensure all L3 entries are NULL-bound during init Since everything is always NULL-bound at init and we always bind things back to NULL in anv_free_sparse_bindings(), this means we don't need to do NULL bindings during anv_init_sparse_bindings(), saving us a bunch of time, espcially since we don't track L1 entries so we may end up submitting TR-TT batches just to write zeroes on top of zeroes. v2: Don't unnecessarily check for uses_relocs (Lionel). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	3845d0d393	anv/trtt: inline anv_trtt_init_queues_state() - There's only one caller. - The caller is rather small. - We want to introduce initialization code that's not exactly queue state and reuse the 'submit'. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	6af1d1ae7f	anv/trtt: extract anv_trtt_first_bind_init() In the next commits we're going to move this out of anv_sparse_bind_trtt() and we're also going to add more code to it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:31 +00:00
Paulo Zanoni	4c366ef67b	anv/trtt: set every entry to NULL when we create an L2 table When we create sparse resources the first thing we do is a NULL bind on them, as the Vulkan spec mandates certain behavior even for unbound sparse resources. We do this with the minimal effort possible: if we can get away with marking an L2 pointer as NULL in the L3 table, we just do it and return, instead of going all the way to creating L1 tables and marking all the final entries as NULL. The strategy we were using had a bug that could lead to previously created NULL entries not being marked as NULL anymore. Let's give an example: (before proceeding, keep in mind that a NULL entry in the L3 and L2 tables has bit 1 set, it does not have the value 0) - Create a 64mb buffer that uses an entire L1 table (needs to be properly aligned), which triggers a NULL bind. - Our algorithm will just set the L3 entry (pointing to the L2 table) as NULL. - Create a 64kb buffer that uses the same L2 table (but a different L1 table). - The NULL bind triggered won't do anything as the L2 table is already NULL. - Bind the first buffer to actual memory. This will end up creating the L2 table and the L1 table. The only entry we will set in the L2 table will be the one pointing to the L1 table. All the other values will be 0 (so they won't have neither the NULL or Invalid bits set: access to them will lead to page faults). - Try to use the second buffer, which is still unbound. It was relying on the fact that its L2 table pointer was NULL, but now it's not anymore, so the page walker will fetch the L1 entries in the L2 table and they will all be zero instead of having the NULL bit set. The fix is pretty simple: whenever we create a new L2 table, set every entry to NULL (except the one we're about to set to non-NULL). This preserves behavior for every other NULL resource relying on the L3 entry being set to NULL. We don't need to do this for the L1 table because its entries are different and instead of having bits to signal NULL entries we have a special TR-TT register that we can set that gets compared to check if an entry is NULL, and we conveniently program it to 0: see ANV_TRTT_L1_NULL_TILE_VAL. I am not aware of any real workloads that are triggering this behavior, I found this issue while investigating something else, running a custom sparse program in our pre-silicon environment, and it told us about the page faults. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30953>	2024-10-15 23:05:30 +00:00
Kenneth Graunke	4cb67cb07a	intel/brw: Use whole 512-bit registers in constant combining on Xe2 Xe2 increased the register size from 256-bits to 512-bits. So we can store 32 16-bit values in a register, rather than 16 values. Prior to this patch, we hadn't updated the pass, so the second half of each of our registers was unused. Backport-to: 24.2 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	d9e5022650	intel/brw: Delete more Gfx8 code from brw_fs_combine_constants These platforms are supported by elk, not brw. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	dea61b7399	intel/brw: Fix register and builder size in emit_barrier() for Xe2 We were manually allocating 1 REG_SIZE for the barrier payload, which is only half a register on Xe2. This should eventually get allocated to a whole register anyway, but it's awkward in the meantime. Also, we were zero-initializing the header using group(8, 0) which only initialized half the register. The rest of the fields are Reserved MBZ, so they're likely unused and unread anyway - but it's better to zero-initialize them so we don't get random undefined, miserable-to-debug behavior. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	7c9eb8b289	intel/brw: Make a ubld temporary in emit_barrier() Saves typing .exec_all() in a lot of places. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	a9d9488788	intel/brw: Delete Gfx7-8 code from emit_barrier() Those are supported by elk, not brw. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Kenneth Graunke	c747c1e1f4	intel/brw: Fix spill/fill count for load/store_scratch in SIMD32 Honestly, I don't know what I was thinking - we are emitting a single spill/fill message here, but were counting it as 2 spill/fills in SIMD32 shaders. So our eventual shader stat reporting would subtract the number of spills and fills from send_count, and get a negative number, wrapping around to just shy of UINT32_MAX. That's way too many sends. This is especially noticable on Xe2 which often uses SIMD32 shaders. Backport-to: 24.2 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31499>	2024-10-15 18:14:37 +00:00
Deborah Brouwer	0007077c11	ci: remove xfail program@build@include-directories Now that build-piglit.sh is no longer removing ‘include_test.h’ this test `program@build@include-directories` is passing which is causing jobs to fail due to this unexpected improvement. Remove this test from expected fails so that the jobs can pass. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31379>	2024-10-15 15:50:47 +01:00
Collabora's Gfx CI Team	68aa78a858	Uprev Piglit to 7ce69da1199d12ed0ddaa251ed489750523798fb `e9ab30aeae...7ce69da119` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31379>	2024-10-15 15:50:47 +01:00
Marek Olšák	65ace5649b	nir: reject unsupported component counts from all vectorize callbacks If you allow an unsupported component count in the callback for loads, nir_opt_load_store_vectorize will align num_components to the next supported vector size, essentially overfetching. This changes all callbacks to reject it. AMD will enable it in a later commit. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29398>	2024-10-15 05:50:24 +00:00
Marek Olšák	02923e237d	nir: add hole_size parameter into the vectorize callback It will be used to allow merging loads with a hole between them. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29398>	2024-10-15 05:50:24 +00:00
José Roberto de Souza	ab12e01af6	intel/perf: Add BMG OA support Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30982>	2024-10-14 23:54:04 +00:00
José Roberto de Souza	aa2f9b72b2	intel/perf: Sync oa-lnl.xml IGT have sync the oa-lnl.xml with latest OA definitions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30982>	2024-10-14 23:54:04 +00:00
Caio Oliveira	b9787fcc80	intel/brw: Move emit_scan/emit_scan_step near its usage Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	0ba1159b0a	intel/brw: Add SHADER_OPCODE_*_SCAN Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	9537b62759	intel/brw: Add SHADER_OPCODE_REDUCE Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	4361a08254	intel/brw: Reduce scope of has_source_and_destination_hazard This predicate at the moment is only relevant during register allocation, so move it there and the code can ignore virtual instructions that were already lowered previously. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	bf9456753d	intel/brw: Validate some instructions exists only up until some phases Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	affa7567c2	intel/brw: Add phases to backend The general idea is to be able to validate that certain instructions were lowered and certain restrictions were already handled. Passes can now assert their expectations, i.e. if a pass is mean to run after certain lowerings or not. The actual phases are a initial stab and as we re-organized the passes, we may remove/add phases. This commit just add some phase steps, later commits will make use of them. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	21f78454bf	intel/brw: Fix Gfx9 3-src validation to handle FIXED_GRF Note this validation path is not being used at the moment, but will in a later commit. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	3e8796b677	intel/brw: Print Non-SSA regs after NIR in debug output Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	2811cb2923	intel: Add statistic for Non SSA registers after NIR to BRW This is going to be useful while we convert the NIR to BRW to produce SSA definitions. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	6db7d1af16	intel/compiler: Rename shader_stats structs Add the `brw_` and `elk_` prefixes to the structs to avoid compilation failure building with LTO ("violates the C++ One Definition Rule") when the structs diverge. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30496>	2024-10-11 06:40:29 +00:00
Caio Oliveira	13d99979d2	intel/brw: Remove the remaining DO_SRC macro from EU validation Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31296>	2024-10-11 04:13:48 +00:00
Caio Oliveira	f1036da345	intel/brw: Add vstride/width/hstride to brw_hw_decoded_inst Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31296>	2024-10-11 04:13:48 +00:00
Caio Oliveira	2251748aad	intel/brw: Add dst/srcs register numbers to brw_hw_decoded_inst Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31296>	2024-10-11 04:13:48 +00:00

1 2 3 4 5 ...

12873 commits