fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-03-18 16:40:34 +01:00

Author	SHA1	Message	Date
Lionel Landwerlin	c434050a00	brw: add pre ray trace intrinsic moves Some intrinsics are implemented by reading memory location that could be rewritten by a further tracing calls. So we need to move those reads prior to tracing operations in the shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8979 Tested-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34214>	2025-05-06 13:34:53 +00:00
Lionel Landwerlin	37608c075f	anv: promote VK_EXT_robustness2 to VK_KHR_robustness2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34821>	2025-05-06 13:16:13 +00:00
Hyunjun Ko	86d21fd2cf	anv: Set tc/beta offset according to the flag from PPS. Consider the flag from PPS when setting tc/beta offset. This fixes some artifacts when decoding a hevc video, hevc_scaling_list4.mkv from Lynne. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34782>	2025-05-06 04:24:22 +00:00
José Roberto de Souza	a82b569649	anv: Reduce memory pool usage in MTL and ARL Those platforms requires aux map with 1MB alignment, for slab that means that any buffer needs to have size of multiple of 1MB what causes a lot of memory to be wasted causing it to run out of memory when running multiple GPU applications. Fixes: `ea18572ff2` ("anv: Add support for ANV_BO_ALLOC_AUX_CCS in anv_slab_bo") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34817>	2025-05-05 16:42:14 +00:00
Valentine Burley	0f5ab7af3d	anv/ci: Update expectations These Vulkan Video tests were fixed in `f7ff9b240d` ("anv: Do not support the tiling of DRM modifier if DECODE_DST") Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34792>	2025-05-05 13:28:40 +00:00
Valentine Burley	9ac2b73cf4	iris/ci: Update trace checksums These traces have been failing for a while now. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34792>	2025-05-05 13:28:40 +00:00
Iván Briano	cf9b0dd589	anv, hasvk: ignore QFOT if both src and dst queue families are equal Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34779>	2025-05-02 17:38:56 +00:00
José Roberto de Souza	3e5a735d01	intel/tools: Fix batch buffer decoder intel_decoder_init() initializes intel_batch_decode_ctx so later we can call decode functions but it depends on data stored in brw/elk_isa_info but that was being allocated in stack of intel_decoder_init() then when the decode functions were executed it was accessing garbage at the brw/elk_isa_info memory. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ec2d20a70d` ("intel/tools: Add helpers for decoder_init/disasm") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34776>	2025-05-01 13:27:44 +00:00
Lionel Landwerlin	63f633557f	intel: fix null render target setup logic Or current render target cache setting is to key on the binding table index, meaning the HW associates a number in the range [0, 7] to a RENDER_SURFACE_STATE description. If you want change the render target 0 between 2 draw calls, you need to insert a PIPE_CONTROL in between the 2 draw calls with pb-stall + rt-flush in order to flush an writes to a previous RENDER_SURFACE_STATE that has now becomed disassociated with the [0, 7] number. This PIPE_CONTROL taking care of the flush is dealt with in cmd_buffer_maybe_flush_rt_writes(). This function diffs the current BTI setup for render targets (first 0 to 7 BTIs) with what the next fragment shader wants. The issue here is we might have a render pass with 0 color attachments and yet in `98cdb9349a` we added one pointing to the render target 0, but in the emit_binding_table() when we finally program the BTI, we check the render pass color count and program a null surface state instead of an actual surface state. And this leads to hangs because the render target cache will end up with inconsistent state data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `98cdb9349a` ("anv: ensure null-rt bit in compiler isn't used when there is ds attachment") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12955 Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34603>	2025-05-01 11:25:18 +00:00
José Roberto de Souza	615d0c9669	anv: Remove ANV_BO_ALLOC_HOST_CACHED from ANV_BO_ALLOC_MAPPED assert() on anv_device_alloc_bo() ANV_BO_ALLOC_MAPPED are internal allocated bos that need mmap() but as internally we don't do any cflush() we need to make sure those are also ANV_BO_ALLOC_HOST_COHERENT. Checking for ANV_BO_ALLOC_HOST_CACHED could lead a cached+uncoherent bo being allocated internally with ANV_BO_ALLOC_MAPPED. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34778>	2025-05-01 02:44:03 +00:00
José Roberto de Souza	57bf646685	anv: Fix assert failure in discrete GPUs when allocating a LMEM+SMEM slab parent It was failing in the first assert of anv_device_alloc_bo() because it has ANV_BO_ALLOC_MAPPED but it don't have ANV_BO_ALLOC_HOST_COHERENT or ANV_BO_ALLOC_HOST_CACHED(this second one is wrong and fixed in the next patch). LMEM is always write-combine, even SMEM on discrete GPU is always write-back + coherent because the PCI bus protocol snooping at CPU caches and that behavior can't be disabled. So we can add this coherent flag without any side effects. The ANV_BO_ALLOC_MAPPED is needed for ANV_BO_SLAB_HEAP_LMEM_SMEM because to trigger SMEM+LMEM in anv_device_alloc_bo() we need ANV_BO_ALLOC_MAPPED or ANV_BO_ALLOC_LOCAL_MEM_CPU_VISIBLE but the second one is mostly used with small PCI bar discrete GPUs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `dabb012423` ("anv: Implement anv_slab_bo and enable memory pool") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34778>	2025-05-01 02:44:03 +00:00
Karmjit Mahil	9d01b318a3	anv,tu: Bypass RMV pcie_family_id check Since RMV 1.9 pcie_family_id is checked to verify whether a capture is supported. Signed-off-by: Karmjit Mahil <karmjit.mahil@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34763>	2025-04-30 16:12:11 +00:00
Lionel Landwerlin	f7bc22e0d7	anv: force fragment shader execution when occlusion queries are active Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34732>	2025-04-30 13:37:14 +00:00
Felix DeGrood	4f0aa96d26	anv: Do conservative oversubscription of pages to 2MB Round up allocations to nearest 2MB interval if this increases the allocation by no more than 1.33x. This reduces page count but at the cost of extra memory consumption. Optimization only applied to MTL(Xe KMD only)/LNL platforms, which are particularly impacted by page misses. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	2c05488be1	anv: Align size of bos larger than 1MB to 64k to enable 64k pages BOs larger than 1MB don't go memory pool due the size but applications tend to use a lot of VkMemory with size larger than 1MB so to reduce the number of pages and improve performance here I'm aligning the size of BOs larger than 1MB to 64kb, this allows 64kb pages to be used at least on Xe KMD. This bring substantial perfomance benefit in exchange of a small memory waste. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	dde91cf9cb	anv: Always grow fixed address pools by 2MB in platforms that there is a performance gain MTL and newer integrated platforms has a performance gain when using transparent huge pages, because of the fixed address requirement we can't use slab for this case but we can change the initial pool size to 2MB so all allocations get the transparent huge page optimization. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	7361b3287f	anv: Remove useless if block I can't think in any case where that would be false, so lets drop it. While at it, also making some variables const. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	6f7a32ec92	anv: Add support for batch buffers in anv_slab_bo in i915 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	39bb51ab27	anv: Add support for batch buffers in anv_slab_bo in Xe KMD Because of the ANV_BO_ALLOC_CAPTURE flag, batch buffers were not allowed to use memory pool. So to workaround that here adding a new anv_bo_slab_heap heap for cached+coherent+capture buffers with the main goal to get batch buffer to memory pool but other buffers will as well. For now that will only work in Xe KMD as i915 requires more changes to support it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	a0a600ca5f	anv: Skip anv_bo_pool if memory pool is enabled The whole purpose of anv_bo_pool is to reduce the number of gem_create/destroy calls in command buffers that is something with a short life span. But slab_bo/memory pool does the same with even other benefits like doing 2MB allocations to enable THP. So here skipping the meat of anv_bo_pool_free() to directly return the bo to slab_bo. This change is also necessary because the way anv_bo_pool stores freed buffers it requires that all bos has a unique gem handle, what not true of buffer allocated by anv_slab. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	0b561f691b	anv: Add support for ANV_BO_ALLOC_DYNAMIC_VISIBLE_POOL in anv_slab_bo This flag was not supported in anv_slab_bo because it is set together with ANV_BO_ALLOC_CAPTURE and more important it has a specific VMA range. We can support it by adding a custom heap and allocating all bos in the heap with all necessary flags, but because application can also allocate those with vkAllocateMemory() here the ANV_BO_ALLOC_CAPTURE is appended to the vkAllocateMemory() path for integrated gpu and anv_slab_bo check if all the alloc_flags matches, because application could choose to allocate it in a cached but not coherent memory type for example. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:40 +00:00
José Roberto de Souza	8fd4423d99	anv: Add support for ANV_BO_ALLOC_DESCRIPTOR_POOL in anv_slab_bo This flag was not supported in anv_slab_bo because it is set together with ANV_BO_ALLOC_CAPTURE and more important it has a specific VMA range. But we can easily support it by adding a custom heap with it and allocating all bos in the heap with all necessary flags. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	ea18572ff2	anv: Add support for ANV_BO_ALLOC_AUX_CCS in anv_slab_bo This changes allow us to support memory pool of bos with ANV_BO_ALLOC_AUX_CCS set. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	dabb012423	anv: Implement anv_slab_bo and enable memory pool This is implementing the functions in anv_slab_bo and actually enabling memory pool. This is heavily based on Iris memory pool implementation, the main difference is that the concept of heaps only exist in anv_slab_bo, we have function that takes the anv_bo_alloc_flags and decides what heap to place that bo. Some anv_bo_alloc_flags blocks memory pool, we can relax and remove some flags from this denied list later. This feature can be disabled in runtime by setting ANV_DISABLE_SLAB=true, this can help us to easily check if bugs are due to this feature or not. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	3bf6d42fda	anv: Add the base infrastructure to support memory pool Allocating larger buffers allows KMD/HW to enable optimizations that makes access to memory faster, also because of minimum alignment required in some cases we allocate 4k or 64k long buffers for usages that only needs a few bytes, wasting a lot of memory. Memory pool takes care of both of those things and here I'm adding the base infrastruture to implement this feature. The next patch will implement the functions in anv_slab_bo.c, spliting it in two to make review easier. The idea here is take the same approach as Iris and use pb_slab.h. In 99% of the places it will be transparent that anv_bo is actually a slab of a larger and real anv_bo, the remaning 1% of the places are handled here. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	5d8ec0ce5c	anv: Move VMA alignment requirements to its own function That will make easy to implement memory pool in the next patches as we need to calculate the VMA aligment without the KMD alignment requirement for memory pool. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	4e7ba17413	anv: Export anv_bo_is_small_heap() This function will be needs in two places in the next patches. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
José Roberto de Souza	e0a9ec34e7	intel: Add has_partial_mmap_offset to intel_device_info Commit 3fc79582a1db ("drm/i915: Increase I915_PARAM_MMAP_GTT_VERSION version to indicate support for partial mmaps") increased the I915_PARAM_MMAP_GTT_VERSION version, with that we can detect what kernel version has the partial mmap fix or not and limit the usage of this workaround. This time o mmap will be used in memory pool, so here adding this propertly to enable or not the feature. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
Lionel Landwerlin	374ef9228b	anv: add ability to mmap at offset Jose: Added support for placed address Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33558>	2025-04-30 12:56:39 +00:00
Lionel Landwerlin	1d46a663ae	anv: update Wa_22019225126 check Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34754>	2025-04-30 11:55:24 +00:00
Tapani Pälli	eeffb4e674	intel/dev: update mesa_defs.json from internal database Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34753>	2025-04-30 11:19:07 +00:00
Iago Toral Quiroga	103a16e4fa	frontend/dri: don't call set_damage_region with a null resource This can happen if texture allocation failed. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34668>	2025-04-30 07:05:44 +00:00
Iván Briano	29d7b90cfc	brw: make HALT instruction act as barrier in new CSE pass This brings back `c9e33e5cbf` ("intel/fs/cse: Make HALT instruction act as CSE barrier."), from the old CSE pass into the new one. Fixes new CTS test: dEQP-VK.subgroups.shader_quad_control.terminated_invocation Fixes: `9690bd369d` ("intel/brw: Delete old local common subexpression elimination pass") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34643>	2025-04-29 20:28:24 +00:00
Sagar Ghuge	821c1bfa7e	intel/compiler: Fix stackIDs on Xe2+ For Xe2+, from Bspec 64643, bit field "StackID": The maximum number of StackIDs can be 2^12- 1. Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34709>	2025-04-29 17:03:35 +00:00
Rohan Garg	b9fe5aad37	anv: enable VK_KHR_shader_bfloat16 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	07fa3b3785	intel: Add support for BFloat16 as cooperative matrix source Re-organize the configuration lists to make easier to include BFloat16 only for the Gfx125+ that support it, while keeping MTL supporting the "lowered" configurations from pre-Gfx125. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	d4381c0908	brw/cmat: Implement conversion from/to BFloat16 When converting BFloat16 from/to non-Float32 type, use the Float32 conversion as an intermediate step. Take the opportunity to separate the unary_op/convert code-paths. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	de88184ab6	brw/cmat: Support different src/dst packing factors in emit_packed_alu1 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	7fa7be970d	brw/cmat: Extract emit_packed_alu1() function Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	4b4500ad35	brw/cmat: Store more information about cmat slices Store the cmat_description and packing_factor so that various functions don't need to extract and recalculate them. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	a7ff177a88	brw: Consider bfloat16 in lower simd width pass Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	2c31516b3e	brw: Consider bfloat16 in lower regioning pass Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	5936768ce0	brw: Consider bfloat16 in copy propagation Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	129c074811	brw: Implement support for BFloat16 ALU opcodes Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Caio Oliveira	a38960e8f3	brw, nir: Use glsl_base_type instead of nir_alu_type for @dpas_intel This will allow including types that don't have a nir_alu_type equivalent, like bfloat16. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:37 +00:00
Rohan Garg	9e5d7eb88d	compiler/types: add a bfloat16 type Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:36 +00:00
Caio Oliveira	3e0418ba02	intel/executor: Fix bfloat example for converting F to packed BF In float pointing rules adding +0.0f preserves all values except for -0.0f, so what we want here is to add -0.0f. In the future we should add proper support for float immediates in the assembler. Fixes: `fafdd24285` ("intel/executor: Update bfloat example") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:36 +00:00
Eric Engestrom	4227982326	ci: rename misleading -postmerge stages to -nightly These stages are for the jobs that are skipped in merge pipelines, automatically run in nightly pipelines, and are available to run manually in other pipelines. None of these ever run in post-merge pipelines. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34590>	2025-04-29 05:49:00 +00:00
Valentine Burley	10ea0002a6	ci/intel: Convert to using the new container based rootfs Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34451>	2025-04-28 20:08:32 +00:00
Ian Romanick	c2ac7fa77b	brw/cmod: Allow integer CMP to ADD propagation only for Z and NZ No shader-db chnages on any Intel platform. v2: Add a note about integer types in the saturate handling path. fossil-db: All Intel platforms had similar results. (Lunar Lake shown) Totals: Instrs: 210743769 -> 210743727 (-0.00%) Cycle count: 30377699060 -> 30377700318 (+0.00%); split: -0.00%, +0.00% Totals from 36 (0.01% of 706776) affected shaders: Instrs: 17032 -> 16990 (-0.25%) Cycle count: 291716 -> 292974 (+0.43%); split: -0.01%, +0.44% Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34509>	2025-04-28 19:44:23 +00:00

13974 commits