fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-28 16:10:23 +01:00

Author	SHA1	Message	Date
Lionel Landwerlin	3362b8dcb5	brw: use a scalar builder for the load_payload on transpose loads I noticed SIMD32 shaders have that kind of pattern : mov(32) g94<1>D 0D { align1 WE_all }; send(1) g15UD g94UD nullUD 0x6210d500 0x02010000 ugm MsgDesc: ( load, a32, d32, V16, transpose, L1STATE_L3MOCS dst_len = 1, src0_len = 1, src1_len = 0 bti ) BTI 2 base_offset 16 { align1 WE_all 1N I@5 $1 }; Why use a 32 wide register for a SEND that is only going to read the first lane? We can stick a single physical register and reduce register pressure. DG2 fossils-db results : Totals: Instrs: 157417515 -> 157417796 (+0.00%); split: -0.00%, +0.00% Cycle count: 15362185116 -> 15363086774 (+0.01%); split: -0.05%, +0.05% Max live registers: 29059141 -> 29051166 (-0.03%) Max dispatch width: 5071256 -> 5075720 (+0.09%); split: +0.33%, -0.24% Totals from 82132 (14.43% of 569221) affected shaders: Instrs: 26564632 -> 26564913 (+0.00%); split: -0.00%, +0.00% Cycle count: 4630907475 -> 4631809133 (+0.02%); split: -0.16%, +0.18% Max live registers: 5425037 -> 5417062 (-0.15%) Max dispatch width: 128384 -> 132848 (+3.48%); split: +12.92%, -9.45% LNL fossils-db results : Totals: Instrs: 141870413 -> 141870745 (+0.00%); split: -0.00%, +0.00% Cycle count: 20176018818 -> 20191262632 (+0.08%); split: -0.07%, +0.14% Max live registers: 44858167 -> 44838370 (-0.04%) Totals from 51859 (10.55% of 491590) affected shaders: Instrs: 16834547 -> 16834879 (+0.00%); split: -0.00%, +0.00% Cycle count: 5761980106 -> 5777223920 (+0.26%); split: -0.24%, +0.50% Max live registers: 5893878 -> 5874081 (-0.34%) Perf A/B testing only reported a 0.5% improvement on DG2 on one trace, no changes on BMG. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36958>	2025-08-26 12:03:22 +00:00
Lionel Landwerlin	27c69acb6a	brw: remove uniform from opt_offsets Those are for push constants, no point in doing that because : - there is no HW constant offsets in push constants (payload delivery), it's just register offset calculation - if we have an dynamic value it's already using MOV_INDIRECT Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e103afe7be` ("brw: run the nir_opt_offsets pass and set the maximum offset size") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36958>	2025-08-26 12:03:22 +00:00
Sagar Ghuge	2cd564c1de	anv: Add missing L3 flushes We are reading out some of the parameters from IR data structure those have been written previously, on some platforms L3 is not coherent, so explicitly add those flushes. Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36952>	2025-08-25 17:36:08 +00:00
Sagar Ghuge	4473e21e2f	anv: Enable CS stall for ACCELERATION_STRUCTURE_COPY stage Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36952>	2025-08-25 17:36:08 +00:00
Sagar Ghuge	75d770b4f8	anv: Add missing ACCELERATION_STRUCTURE_READ in barrier handling Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36952>	2025-08-25 17:36:08 +00:00
Eric Engestrom	fa74e939bf	ci/piglit: automatically use LAVA proxy This avoids having to hardcode the proxy in the traces `download-url` or jobs setting `PIGLIT_REPLAY_EXTRA_ARGS` and accidentally overriding the default args when the author meant to append. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36955>	2025-08-25 14:52:38 +00:00
Konstantin Seurer	9df7b48d2f	nir: Use nir_def_as_* in more places Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36746>	2025-08-24 14:03:09 +00:00
Yiwei Zhang	dcffe932a0	anv: adopt common GetAndroidHardwareBufferPropertiesANDROID ANV currently carries a partial copy of the gralloc mapper's format resolving code, while the ground truth solely resides inside the gralloc. The local copy is delicate and unable to maintain compatibility with different gralloc implementations because AHB formats like Y8Cb8Cr8_420 and IMPLEMENTATION_DEFINED are flexible formats, and can be resolved to different underlying drm fourcc formats depending on the usage and media IPs. The common impl is more correct as it relies on the info from gralloc mapper side, and it only sets the minimal set of explicit formats to avoid hitting spec corner case of allocating out AHB with flexible formats (missing half of the media usage bits might end up allocating something different that potentially get resolved to a different VkFormat as well). Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36866>	2025-08-22 23:40:35 +00:00
Yiwei Zhang	a34eb09c89	anv: drop anv_ahb_format_for_vk_format The vk_image::ahb_format is for drivers that support more than the common explicit AHB formats. It is used on AHB image memory export allocation path, and more specifically vk_device_memory_create will use that AHB format to allocate the AHB out from gralloc. To be noted, export allocation path only deals with explicit format but not external format. So even with the obsolete HAL_PIXEL_FORMAT_NV12_Y_TILED_INTEL private format, we don't need such either as multi-planar formats are supposed to be reported as external format. Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36866>	2025-08-22 23:40:35 +00:00
Yiwei Zhang	ef885eb9ac	anv: adopt vk_android_get_ahb_image_properties The current impl misses the probe against gralloc mapper, which is the required handshake before advertising support. For simplicity, just adopt the common AHB helper. It does not rely on driver specific format mapping, since the query doesn't allow external format at all. Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36866>	2025-08-22 23:40:34 +00:00
Yiwei Zhang	3b19aa6261	anv: avoid setting image format twice for AHB image AHB images are created with the right VkFormat when external format isn't used. When external format does get used, the proper VkFormat has already being set in the common runtime. Upon AHB props query, we resolve external format to VkFormat and set to the externalFormat field to be used by the app. The app would than chain the exact external format when creating the AHB image if it wants to go down the external format code path instead of being explicit. So in the end, the format we resolve is the format we get. Thus no need to set it twice. Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36866>	2025-08-22 23:40:34 +00:00
Yiwei Zhang	b6427520d6	anv: drop obsolete anv_create_ahw_memory Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36866>	2025-08-22 23:40:33 +00:00
Lucas Fryzek	b927b52e24	hasvk: Remove special CROS_GRALLOC path from format logic Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36866>	2025-08-22 23:40:32 +00:00
Lucas Fryzek	a43fa85fab	anv: Remove special CROS_GRALLOC path from format logic Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36866>	2025-08-22 23:40:32 +00:00
Faith Ekstrand	59f85e678f	vulkan/wsi: Take a vk_queue in wsi_common_queue_present() The common entrypoint wrapper already depends on vk_queue, as do all the drivers that implement drv_QueuePresentKHR() so there's no point in passing through Vulkan API types anymore. The one functional change here is that ANV is no longer forcing the queue index to be zero, which I suspect was a mistake in the first place. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36827>	2025-08-22 23:05:02 +00:00
Faith Ekstrand	81325cf887	vulkan,anv,hasvk: Drop vk_queue_wait_before_present() This helper existed to ensure that drivers waited for semaphores to materialize before processing a QueuePresent(). However, most drivers never called this and they were kind-of fine. Now that we have explicit and dma-buf sync built into WSI, this wait happens as part GetSemaphoreFd when we fetch the sync file from the semaphore. It's also less racy to just rely on GetSemaphoreFd() because, even though we were stalling the submit thread prior to present, the present itself does one or more submits and those may go to the thread and potentially race with the window system. The GetSemaphoreFd(), however, happens at the right time to ensure we actually stall before handing off to the window system. Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36827>	2025-08-22 23:05:02 +00:00
Faith Ekstrand	650debdf40	anv: Stop picking our own blit queue This reverts commit `1f0fdcb619` ("anv: always pick graphics queue to execute prime blits on.") which was added to avoid prime blits on video queues. However, this was fixed properly in `d7938de8fe` ("vulkan/wsi: don't support present with queues where blit is unsupported") which made us stop advertising presentation on video queues entirely. We no longer need the code in ANV. Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36827>	2025-08-22 23:05:01 +00:00
Faith Ekstrand	e0c30b0fc2	anv,hasvk: Use vk_drm_syncobj_copy_payloads Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36827>	2025-08-22 23:05:00 +00:00
Collabora's Gfx CI Team	640e2eddea	Uprev ANGLE to 995c4c4d89ed6a5c28b210e9c0f83eb4f8b6e2f5 `6a04a50f98...995c4c4d89` - Skip tests failing on all drivers due to a CTS bug - Disable clang options not supported by the 'unbundled' toolchain Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36908>	2025-08-22 07:35:15 +00:00
Iván Briano	07057e270c	anv, hasvk: allow using a 3D image as a resolve target Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This is allowed by the specification, as the following VUIDs state: VUID-vkCmdResolveImage-srcImage-04446 If dstImage is of type VK_IMAGE_TYPE_3D, then for each element of pRegions, srcSubresource.layerCount must be 1 VUID-vkCmdResolveImage-srcImage-04447 If dstImage is of type VK_IMAGE_TYPE_3D, then for each element of pRegions, dstSubresource.baseArrayLayer must be 0 and dstSubresource.layerCount must be 1 New tests coming for it: dEQP-VK.pipeline..multisample.3d. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36895>	2025-08-21 20:53:42 +00:00
Caio Oliveira	74a4e7dd4b	brw: Fix folding case for MAD instruction with all immediates Fixes: `b605f76b2a` ("brw/algebraic: Constant fold multiplicands of MAD") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36867>	2025-08-21 17:19:18 +00:00
Caio Oliveira	eec64c865f	brw: Add disabled test for MAD constant folding Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36867>	2025-08-21 17:19:18 +00:00
Lionel Landwerlin	1bab95551a	anv: fix uninitialized return value We don't go through the loop when there are no queues. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `884df891d7` ("anv: allow device creation with no queue") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36910>	2025-08-21 16:07:56 +00:00
Calder Young	c7e48f79b7	brw,anv: Reduce UBO robustness size alignment to 16 bytes Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Instead of being encoded as a contiguous 64-bit mask of individual registers, the robustness information is now encoded as a vector of up to 4 bytes that represent the limits of each of the pushed UBO ranges in 16 byte units. Some buggy Direct3D workloads are known to depend on a robustness alignment as low as 16 bytes to work properly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36455>	2025-08-21 09:04:55 +00:00
Lionel Landwerlin	2281e88381	brw: make assign_curb_setup visible in optimizer debug Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36455>	2025-08-21 09:04:54 +00:00
Lionel Landwerlin	df37c7ca74	brw: fix analysis dirtying with pulled constants Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `5c17299084` ("brw: enable A64 pulling of push constants") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36455>	2025-08-21 09:04:53 +00:00
Yiwei Zhang	fc2c490975	anv: advertise present_id/wait behind ANV_USE_WSI_PLATFORM wsi_common_vk_instance_supports_present_wait returns true for all supported wsi platforms here, so we can unconditionally advertise them behind ANV_USE_WSI_PLATFORM like the other wsi extensions (also to not tangle with Android). v2: guard presentId2 and presentWait2 features as well Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (v1) Acked-by: Daniel Stone <daniels@collabora.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36835>	2025-08-21 07:53:15 +00:00
Yiwei Zhang	9669b1852b	hasvk: advertise present_id/wait behind ANV_USE_WSI_PLATFORM wsi_common_vk_instance_supports_present_wait returns true for all supported wsi platforms here, so we can unconditionally advertise them behind ANV_USE_WSI_PLATFORM like the other wsi extensions (also to not tangle with Android). Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36835>	2025-08-21 07:53:15 +00:00
Valentine Burley	7ea1da4af4	iris/ci: Add a new iris deqp job on Alder Lake Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36880>	2025-08-21 07:05:27 +00:00
Valentine Burley	e0220c6e71	anv/ci: Add a job replaying traces with ANGLE The new anv-adl-traces-restricted job runs 10 ANGLE traces on Alder Lake, using ANGLE's Vulkan backend. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36880>	2025-08-21 07:05:27 +00:00
Valentine Burley	1fce16d33f	anv/ci: Run full anv-adl-angle job pre-merge We have enough devices to run the full job without a fraction, which also allows deleting the nightly job. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36880>	2025-08-21 07:05:26 +00:00
Marek Olšák	c601308615	nir: convert nir_instr_worklist to init/fini semantics w/out allocation This removes the malloc overhead. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:49 +00:00
Marek Olšák	3aadae22ad	nir: make nir_block::predecessors & dom_frontier sets non-malloc'd We can just place the set structures inside nir_block. This reduces the number of ralloc calls by 6.7% when compiling Heaven shaders with radeonsi+ACO using a release build (i.e. not including nir_validate set allocations, which are also removed). Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36728>	2025-08-21 06:13:48 +00:00
Iván Briano	20f546d6c1	anv: fix capture/replay of sparse images with descriptor buffer Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We were not implementing vkGetImageOpaqueCaptureDescriptorDataEXT, relying on the common implementation that does nothing. That works well enough for regular images because the fixed address needed for capture/replay is handled by the memory allocation path, but for sparse images we initialize the sparse bindings at image creation time. Here we implement the function to retrieve the addresses of all the used bindings for the image, then use all of them at creation time. Also, set the correct alloc_flags for this to work. Fixes: `43b57ee8a5` ("anv: add capture/replay support for image with descriptor buffers") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35872>	2025-08-20 21:08:10 +00:00
Nataraj Deshpande	f67edacf8b	anv: add feature flags for linearly tiled ASTC images In case of emulated ASTC on supported platforms, currently returning 0 for linear tiled images causes vpGetPhysicalDeviceProfileSupport failure during AndroidBaselineProfile test. The patch handles it similar to linearly-tiled images that are used for transfers. Fixes android.graphics.cts.VulkanFeaturesTest#testAndroidBaselineProfile2021Support. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36798>	2025-08-20 15:28:50 +00:00
Lionel Landwerlin	fe38fb858c	brw: workaround broken indirect RT messages on Gfx11 Unfortunately we cannot use the indirect descriptor on Gfx11, it appears to just drop writes. Other platforms appear to be fine. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36883>	2025-08-20 15:01:50 +00:00
Lionel Landwerlin	a0844458b8	brw: enable opt_register_coalesce to work with multiple EOT blocks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36883>	2025-08-20 15:01:50 +00:00
Lionel Landwerlin	c4c7ff3f8f	brw: enable register allocation to deal with multiple EOTs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36883>	2025-08-20 15:01:50 +00:00
Lionel Landwerlin	ed471927e5	vulkan/runtime: use a pipeline flag for unaligned dispatches Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The problem with the current flag is that it seems to belong to VkShaderCreateFlagsEXT, not VkPipelineShaderStageCreateFlagBits. Also it is completely skipped by the vk_pipeline.c code. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7b634ebb63` ("vulkan/runtime: Add VK_SHADER_CREATE_UNALIGNED_DISPATCH_BIT_MESA flag") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36828>	2025-08-20 11:17:52 +00:00
Valentine Burley	6b88e2bd38	anv/ci: Update expectations from nightly jobs Document current failures and flakes from the nightly jobs, and add a skip for tests that are timing out. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36608>	2025-08-20 08:53:36 +00:00
Valentine Burley	e4fc3e4ee6	anv/ci: Lower concurrency for nightly jobs The nightly jobs can hit OOMs on JSL and ADL, so reduce the number of threads used by deqp-runner to avoid that. Signed-off-by: Valentine Burley <valentine.burley@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36608>	2025-08-20 08:53:36 +00:00
Caio Oliveira	4fda724fd4	brw: Avoid invalid access when compacting out-of-bounds JIP/UIP Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Usually JIP will be valid, but as part of other changes, it will be possible to have a shader that have multiple EOT messages and end with and ENDIF instruction. Its JIP will point after the program ends. This is fine but was tripping up the compaction code. Change compaction to not read its internal structures beyond the last instruction. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36822>	2025-08-20 00:54:41 +00:00
Caio Oliveira	148063670d	brw: If the instruction is already a SEND, no need to resize sources Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Kept an assert as a placeholder in case we had something odd going on that this code was protecting. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36817>	2025-08-19 13:54:43 +00:00
Caio Oliveira	cebac156c4	brw: Only access valid sources in lower_btd_logical_send() Only the SHADER_OPCODE_BTD_SPAWN_LOGICAL has sources, so only reach for them when handling that instruction. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36817>	2025-08-19 13:54:43 +00:00
Caio Oliveira	dc960936fc	brw: Move resize_sources() earlier when lowering FIND_LIVE_CHANNELS Move it before the new source is used. This currently works because all instructions have a minimum amount of sources allocated, but a later commit will change that. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36817>	2025-08-19 13:54:43 +00:00
Caio Oliveira	fe2e2fabcd	brw: Make sure copied instruction don't copy the list pointers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36817>	2025-08-19 13:54:43 +00:00
Caio Oliveira	5a34f676a5	brw: Define order for fixes in 3-src operand fix Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36817>	2025-08-19 13:54:43 +00:00
Olivia Lee	78d3b9cd0a	perfetto: allow specifying clock domain for cpu timestamps Everything is currently using CLOCK_BOOTTIME, which is perfetto's default, and matches the previous behavior. On some hardware, different clocks may be better synchronized with the gpu clock. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34390>	2025-08-19 09:50:36 +00:00
Sagar Ghuge	49b917baaf	intel/compiler: Fix ray geometry index Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We have only 24-bit wide geometry index, not the 28-bit wide. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36796>	2025-08-19 09:32:55 +00:00
Sagar Ghuge	7ca356d5db	intel/genxml: Drop all unused struct/fields Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36796>	2025-08-19 09:32:55 +00:00

... 13 14 15 16 17 ...

15202 commits