fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-04 20:38:06 +02:00

Author	SHA1	Message	Date
Andrew Gazizov	bafc27583c	venus: Make sure that guest allocated blobs from hostmem are mappable For guest-based blob allocations from hostmem (Host visible memory), to make sure that virtio-gpu driver will send to the host the address (offset in the region) of the allocated blob using RESOURCE_MAP_BLOB command a flag VK_MEMORY_PROPERTY_HOST_VISIBLE_BIT must be set. Otherwise, if the upper layers didn't set it, host can't import memory and guest allocation from Host visible memory region makes no sense. Signed-off-by: Andrew D. Gazizov <andrew.gazizov@opensynergy.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26130>	2023-11-17 18:24:20 +00:00
Andrew Gazizov	8929889563	venus: Tighten the conditions for guest_vram device memory alloc In addition to the platform requirement (use_guest_vram), device memory allocations from dedicated heap (guest_vram) are necessary only when: 1. VK_MEMORY_PROPERTY_HOST_VISIBLE_BIT is set and it indicates that memory is host visible and assumed to be accessed by CPU (vkMapMemory). 2. One of external memory handle types is set, that indicates memory can be exported with external handle. In other cases it's not necessary to create virtgpu_bo object in the guest and enough just perform vkAllocateMemory on host side without memory import from dedicated heap. Reported-by: Yiwei Zhang <zzyiwei@chromium.org> Signed-off-by: Andrew D. Gazizov <andrew.gazizov@opensynergy.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26130>	2023-11-17 18:24:20 +00:00
Andrew Gazizov	816f66cdfd	venus: Use vk_object_id as blob_id for guest_vram device memory alloc blob_id == 0 does not refer to an existing VkDeviceMemory and implies a shmem allocation. So for guest_vram device memory allocations, 0 is not a valid blob id and must be greater than 0. Therefore, set vk_object_id as blob_id for guest_vram device memory allocations. Considering that vk_object_id made from valid pointer, it will be always greater than 0. Signed-off-by: Andrew D. Gazizov <andrew.gazizov@opensynergy.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26130>	2023-11-17 18:24:20 +00:00
Paulo Zanoni	c853e259ab	anv+zink/ci: document new sparse failures Now that we have sparse resources on Anv these tests are finally running, but they're failing. We'll eventually fix them, but let's not make Zink gatekeep the entirety of sparse resource on Anv. v2: KHR-GL46.sparse_buffer_tests.BufferStorageTest was initially reported as Crash by Mesa CI. On my second run in Mesa CI it gave me a Timeout. On my machine it passes but takes about 4 minutes to finish, so skip it entirely. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:29 +00:00
Paulo Zanoni	563678f310	anv/sparse: don't support YCBCR 2x1 compressed formats Regarding supporting these formats, the spec says: "A sparse image created using VK_IMAGE_CREATE_SPARSE_RESIDENCY_BIT supports all non-compressed color formats with power-of-two element size that non-sparse usage supports. Additional formats may also be supported and can be queried via vkGetPhysicalDeviceSparseImageFormatProperties. VK_IMAGE_TILING_LINEAR tiling is not supported." Regarding the formats themselves, the spec says: "VK_FORMAT_B8G8R8G8_422_UNORM specifies a four-component, 32-bit format containing a pair of G components, an R component, and a B component, collectively encoding a 2×1 rectangle of unsigned normalized RGB texel data. One G value is present at each i coordinate, with the B and R values shared across both G values and thus recorded at half the horizontal resolution of the image. This format has an 8-bit B component in byte 0, an 8-bit G component for the even i coordinate in byte 1, an 8-bit R component in byte 2, and an 8-bit G component for the odd i coordinate in byte 3. This format only supports images with a width that is a multiple of two. For the purposes of the constraints on copy extents, this format is treated as a compressed format with a 2×1 compressed texel block." Since these formats are to be considered compressed 2x1 blocks and we don't necessarily have to support non-compressed formats that non-sparse support, we can claim them as not supported with sparse. In addition to all of that, if you look at isl_gfx125_filter_tiling() you'll see that we don't even support Tile64 for these formats, so sparse residency (i.e., non-opaque image binds) doesn't really make sense for them yet. The Vulkan spec defines 4 other YCBCR "2x1 compressed" formats like the ones we have in this commit, but we don't support them even without sparse, so there's no reason to check them here. A recent change in VK-GL-CTS made tests that use these formats go from unsupported to failures: 7ecc7716a983 ("Do not use and check for STORAGE image support, when it is not used in the test") This commit "fixes" the following VK-GL-CTS failures (by making them return NotSupported): dEQP-VK.sparse_resources.image_block_shapes.2d.b8g8r8g8_422_unorm.samples_1 dEQP-VK.sparse_resources.image_block_shapes.2d.g8b8g8r8_422_unorm.samples_1 dEQP-VK.sparse_resources.image_block_shapes.2d_array.b8g8r8g8_422_unorm.samples_1 dEQP-VK.sparse_resources.image_block_shapes.2d_array.g8b8g8r8_422_unorm.samples_1 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:29 +00:00
Paulo Zanoni	a0559768db	anv: enable sparse by default on i915.ko On i915.ko we don't have the vm_bind ioctl, so sparse requires TR-TT. Unfortunately, on gfx < 20 TR-TT is not compatible with non-render queues, so we have to disable those when sparse is enabled. Notice that although we don't have TR-TT for non-render queues on gfx >= 20, vm_bind is the default, and it doesn't have this restriction. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:29 +00:00
Paulo Zanoni	fda5163f34	anv/trtt: properly handle the lifetime of TR-TT batch BOs We need to wait for the batches to complete before we return the BOs to the pool. We were previously doing this completely synchronously, which made the code unnecessarily wait. Now we have a timeline syncobj that signals completion of the previous BOs, so sometimes we check where we are in the timeline and then return the BOs that we know are unused. This, in addition to the previous patch that made us wait for the other syncobjs through the execbuf ioctl instead of through the CPU, makes TR-TT batches at least an order of magnitude faster. Still, I don't think we'll notice any changes in games's FPS as they don't bind sparse resources that often. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:29 +00:00
Paulo Zanoni	0f21836272	anv/trtt: add support for queue->sync to the TR-TT batches At this moment this patch won't buy us anything since we're already being completely synchronous, but the next patch is going to change this and so queue->sync will start making sense. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:29 +00:00
Paulo Zanoni	1534ee46b8	anv/trtt: add struct anv_trtt_batch_bo and pass it around For now it just wraps the bo and size, so there's really no value to having it. In the next commit we'll add more elements to the struct. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:29 +00:00
Paulo Zanoni	18bd00c024	anv/trtt: don't wait/signal syncobjs using the CPU anymore Pass them as part of the TR-TT batch. This is what a lot of the previous commits were building up to. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	f2206a0eb1	anv/xe: allow passing extra syncs to xe_exec_process_syncs() We're going to use this in two different patches. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	4b435d6983	anv/i915: extract setup_execbuf_fence_params() I'm about to add a 3rd caller for it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	d797d9233d	anv/sparse: process image binds before opaque image binds When sparse images are being used, applications normally use non-opaque binds and leave opaque binds just for the miptail part. Since miptails are always at the end of the array layers, processing the opaque binds after processing the non-opaque binds increases the chance that anv_sparse_submission_add() will join the miptail bind operation with the last non-opaque opreration, especially if the user is trying to bind the last few non-miptail levels and the miptail in the same vkQueueBindSparse opration. In the real world this case does happen, so we're able to save a bind operation every once in a while in Steam games. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	040063c156	anv/sparse: move waiting/signaling syncobjs to the backends Move waiting/signaling to the backends so we can fix each backend separately. As I write this patch the vm_bind backend is back to using synchronous vm_binds so we can't pass syncobjs to the synchronous vm_bind ioctl anymore. We'll need more discussions and possibly some rework before we go back to asynchronous vm_binds. This commit should allow us to fix the TR-TT backend in the next commit and leave vm_bind for later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	cbf09b4254	anv/trtt: use 'queue' from anv_sparse_submission in the backend Don't pass it as a parameter when it's also part of a struct. Have to touch 9 files just for that... Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	f6d28bec6d	anv/sparse: add 'queue' to anv_sparse_submission If we're going to move syncobj waiting/signaling down to the backend we're going to need a queue to signal as lost in case those operations fail. In some places of the stack we don't have a queue available, such as when we're creating or destroying resources. For those, for vm_bind cases we don't use the queue for anything so passing it as NULL is fine. For TR-TT we are already using device->trtt.queue. For TR-TT specifically this also means we're going to start using the actual queues from the call stack instead of trtt->queue, but that shouldn't make any difference since we only ever have one queue. Still, this is more technigally correct. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	576275907a	anv/sparse: pass anv_sparse_submission to the backend functions Our ultimate goal is to have the backend functions deal with the wait and signal syncobjs instead of waiting for them on the CPU inside anv_queue_submit_sparse_bind_locked(). For that, we'll need waits and signals parameters to be passed all the way to the backend functions that actually make the submission, and this is what this patch does, through struct anv_sparse_submission. This patch just deals with passing the parameters to the functions, nothing is using the new variables yet. There should be no functional changes here. The goal here is to make code review easier. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	6c7753ee0b	anv/sparse: join all submissions into a single anv_sparse_bind() call Currently, a single vkQueueBindSparse() call may lead to multiple bind calls in the backend (either a vm_bind ioctl or a command submission that updates the TR-TT page tables). These operations can be quite slow so it's better for us if we try to emit as few of them as possible. On top of that, this gives our "just extend the last operation's size if possible" code a little more chance to act and save us real time. Our ultimate goal here is to also pass submit->waits and submit->signals to the backend so we can avoid doing CPU waits, so having a single call to the backend helps simplify things a little too, and we just created the structure to carry these extra pointers forward. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	11e9a700f6	anv/sparse: drop anv_sparse_binding_data from dump_anv_vm_bind() Having it helped us printing the resource offset, which made debugging some situations easier. The problem is that we want to rework the code a little bit and we won't have a 'sparse' struct anymore to pass around. Since it's all debug code drop it for now so it doesn't get in the way of the rework. If we need it later we can find a way to add it back, or we find another way to print the value. Drive-by drop the DEBUG_SPARSE check that's already in the caller. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	b4fef9a745	anv/trtt: also join the L3/L2 writes into a single MI_STORE_DATA_IMM Same as the L1 case, but this one deals with 64bit entry addresses and pte addresses. Consecutive L3/L2 writes are much rarer than L1 writes since they require some pretty big buffers, but we can still those cases in the wild. I just don't think any change will be noticeable though. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	31f720fd6e	anv/trtt: join L1 writes into a single MI_STORE_DATA_IMM when possible If the addresses are sequential, we can emit only a single MI_STORE_DATA_IMM instruction. This is a very common case, it should save us some space: 4 bytes per extra_write. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	ccb30ae0cd	anv/sparse: fix limits.sparseAddressSpaceSize when using vm_bind When using vm_bind (not TR-TT), in practice sparse addresses will be allocated from the high_heap, so narrow down the available sparseAddressSpaceSize from the whole address space to the part we can actually allocate things from. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Paulo Zanoni	04bfe828db	anv/sparse: allow sparse resouces to use TR-TT as its backend TR-TT is a hardware feature supported by both i915.ko and xe.ko, which means we can now finally have Sparse Resources on i915.ko and we also have 2 options for xe.ko (and whatever is the best should be the default). In this patch we use batch commands to write the page tables and forever keep them in device memory. We maintain a mirror of both the L3 and and L2 tables because that helps us never having to read the tables that are in device memory. We still have some things to improve, but with this commit, workloads that didn't work at all due to the lack of sparse resources should at least run. This is still all disabled by default in i915.ko, you can turn it on by exporting ANV_SPARSE=1 before launching the applications. For xe.ko, switch the default with ANV_SPARSE_USE_TRTT=1. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25512>	2023-11-17 17:58:28 +00:00
Rhys Perry	a279767424	ac/gpu_info: update conformant_trunc_coord comment Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25911>	2023-11-17 15:15:28 +00:00
Rhys Perry	9e9eb87dd3	radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK This fixes diagonal SSAO artifacts in some games. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9253 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6395 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25911>	2023-11-17 15:15:28 +00:00
Rhys Perry	4d2a3b9573	radv: add radv_disable_trunc_coord option Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25911>	2023-11-17 15:15:27 +00:00
Eric Engestrom	1b18ca83fa	amd/ci: avoid re-running all the test jobs when changing the expectations for only one of them Previously, any change to any job would trigger every job. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26127>	2023-11-17 14:22:46 +00:00
Alyssa Rosenzweig	0f0f6c6227	nir/validate: Specialize if source validation Yet another bit of branchiness we should tame. 99% of the time, sources are not for if's, so we shouldn't need to do the extra checking to handle that 1%. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	88d0fd03d6	nir/validate: Split out validate_sized_src We don't check the sizes for ALU srcs, which is the hot path here, so split out that simplified version for ALU instructions to use, while deriving a sized version for other kinds of instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	bfcc323954	nir/validate: Inline validate_ssa_src There's no more nir_register. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	bfdb2862d6	nir/validate: Drop stale todo We have dominance validation elsewhere in the file. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	9f341cefbb	nir/validate: Don't check dimensions in validate_def Instead, check it at the call sites when actually required (basically just intrinsics), reducing the branching required when not (ALU validation, the hottest of hot paths for CI). IMHO this is more obvious too. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	d2b1e6bed0	nir/validate: Use unlikely for validate_assert No apparent performance difference, but documents the intention. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	e9d185dccf	nir/validate: Don't validate out-of-bounds channels Nothing should ever be reading them, they logically do not exist. So there's no point validating them, especially when the validation in question is so useless (just checking the bit width, without any semantic awareness). Yet now that we support vec16, this loop is quite hot even on scalar ISAs, and rather pointlessly so. Just remove it and bring the ALU src validation complexity to O(# of channels in source) instead of O(max # of channels in NIR). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	8d5a1e5a77	nir/validate: Don't spam nir_alu_instr_channels It doesn't inline and so is about 1% of M1 CTS time. Expand out the definition and simplify the logic. Honestly, I think this is clearer too. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	1786032029	nir/validate: Optimize ssa_srcs set Profiling showed that maintaining this ssa_srcs set consumes ~3% of CTS time with a debugoptimized build. Unfortunately, we really do benefit from getting this coverage in CI. So rather than remove the validation, let's optimize the data structure used so we can keep the coverage at a fraction of the cost. The expensive piece is the pointer set, which is backed by a relatively expensive hash table. It would be much cheaper to use an invasive set instead, with a single "present" bit. We don't want to bloat nir_src for this, however there's an easy solution: use a tagged pointer to steal a bit in the nir_src for the job. We untag everything at the end of validation (and this meta-invariant is asserted with an auxiliary counter), so while we mutate the IR while validating, the mutations do not escape nir_validate. We tag the parent pointer and not the def pointer, because it is dramatically less used and therefore has far fewer disrupted call sites. The M1 job is improved from 3:03 to 2:55 of deqp-runner reported time, which is excellent. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Alyssa Rosenzweig	6d73f62e42	nir: Simplify nir_alu_instr_channel_used definition Deduplicates the "get # of channels" logic which was the same between the helpers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26084>	2023-11-17 09:36:08 -04:00
Violet Purcell	667de678a0	gallium: Fix undefined symbols in version scripts Currently, multiple version scripts unconditionally use symbols from gallium drivers that may not be enabled, which causes linking to fail with --no-undefined-version (as is default in LLD 17), and can cause issues with LTO. This commit adds logic to generate version scripts based on the enabled gallium drivers, ensuring only defined symbols are used. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8003 Signed-off-by: Violet Purcell <vimproved@inventati.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25551>	2023-11-17 12:35:24 +00:00
Lionel Landwerlin	c76cb19b01	anv/blorp: move helper function about BTI changes to blorp Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26247>	2023-11-17 10:17:51 +00:00
Lionel Landwerlin	1b6fe35ec8	anv: get rid of the duplicate pipeline fields in command buffer state This can be error prone if you forget to update one. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26247>	2023-11-17 10:17:51 +00:00
Lionel Landwerlin	ec3f8c0d42	intel/blorp: move Wa_18019816803 out of blorp code The driver already need to track this WA for blorp. We can completely remove any blorp code dealing with this and instead have the flush required by the workaround be combined with potential other flushes the driver already has to insert before blorp operations. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26247>	2023-11-17 10:17:51 +00:00
Lionel Landwerlin	6e85fa8a80	intel/decoder: handle 3DPRIMITIVE_EXTENDED in accumulated prints Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26247>	2023-11-17 10:17:51 +00:00
Lionel Landwerlin	9689607f10	anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC ANV_CMD_DIRTY_PIPELINE also includes reprogramming of 3DSTATE_PUSH_CONSTANT_ALLOC_* instructions. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26247>	2023-11-17 10:17:51 +00:00
Lionel Landwerlin	1257d08bcb	anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `50f6903bd9` ("anv: add new low level emission & dirty state tracking") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26247>	2023-11-17 10:17:51 +00:00
Lionel Landwerlin	e517b1e095	anv: fix missing naming for dirty bit Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26247>	2023-11-17 10:17:51 +00:00
Lionel Landwerlin	1a1747712c	anv: fix source_hash propagation with libraries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26247>	2023-11-17 10:17:51 +00:00
Daniel Schürmann	f2bb7b185d	aco: delete instruction selection for boolean subgroup operations These are now lowered in NIR. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/218>	2023-11-17 09:45:40 +00:00
Daniel Schürmann	88afbbba11	nir: optimize open-coded quadVote* directly to new nir_quad intrinsics Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/218>	2023-11-17 09:45:40 +00:00
Connor Abbott	0d186d356c	amd: Enable boolean subgroup lowering Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/218>	2023-11-17 09:45:40 +00:00
Connor Abbott	1dab2c5bd2	nir/subgroups: Add option to lower Boolean subgroup reductions This will be useful for AMD, and probably Intel as well. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/218>	2023-11-17 09:45:40 +00:00

1 2 3 4 5 ...

180894 commits