fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-22 04:10:40 +01:00

Author	SHA1	Message	Date
Juan A. Suarez Romero	62ba0d7bf8	vc4: don't use deprecated NIR_PASS_V macro Check more details at https://gitlab.freedesktop.org/mesa/mesa/-/issues/10409. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35127>	2025-05-30 14:24:24 +02:00
Juan A. Suarez Romero	b5706ef70a	vc4: return progress on custom nir lowering Report if the vc4 specific NIR lowering did any progress. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35127>	2025-05-30 14:24:24 +02:00
David Rosca	a9a54632af	frontends/va: Fix H264 top/bottom is reference flags All pics in the ReferenceFrames array should be references, so there is no need to require the SHORT_TERM_REFERENCE flag to actually treat them as references. This fixes decoding with apps that doesn't set this flag, eg. NoMachine remote desktop viewer (nxplayer). See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13229 Cc: mesa-stable Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35186>	2025-05-30 08:54:31 +00:00
David Rosca	8f4e251c98	radeonsi/vcn: Support disabling HEVC dependent slice segments With older FW this needs to be always enabled, but it can now be disabled when using the new separate header instructions for dependent_slice_segment_flag and slice_segment_address. Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35072>	2025-05-30 08:29:53 +00:00
David Rosca	09a1429a00	radeonsi/vcn: Remove carrizo workaround Carrizo has UVD so this can never be true. Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35072>	2025-05-30 08:29:53 +00:00
David Rosca	f17ea8e901	radeonsi/vcn: Get rid of not_referenced Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35072>	2025-05-30 08:29:52 +00:00
Jesse Natalie	47f4d3e701	d3d12: Handle a null threaded context When GALLIUM_THREAD=0, the threaded context doesn't get created and the following lines would fault. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35248>	2025-05-30 02:19:26 +00:00
Jesse Natalie	0c0f6c6df1	d3d12: Handle sampler view creation on B8G8R8X8 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35142>	2025-05-30 01:24:10 +00:00
Jesse Natalie	f5781553f6	winsys/d3d12: Support no-alpha formats through the DXGI swapchain path Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35142>	2025-05-30 01:24:10 +00:00
Eric R. Smith	548f652d10	panfrost, panvk: spread hierarchy mask bits out when max_levels < 8 We create hierarchy masks based on the number of levels available, creating a bitmask with `max_levels` bits set. Originally these bits all came together. Modify this to spread the bits out, which improves performance on chips like the G31 with only 2 levels of hierarchy. Reviewed-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34744>	2025-05-29 21:50:54 +00:00
Eric R. Smith	13b35a3c9c	panfrost, panvk: fix G31 use of SHADER_MODE_EARLY_ZS_ALWAYS PRE_POST_FRAME_SHADER_MODE_EARLY_ZS_ALWAYS was introduced in architecture version 7.2, not 7.0 as we assumed. Using it on G31 (a 7.0 device) caused some CTS failures. Cc: mesa-stable Reviewed-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Lars-Ivar Hesselberg Simonsen <lars-ivar.simonsen@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34744>	2025-05-29 21:50:52 +00:00
Faith Ekstrand	2e85076b1d	nak: Set cache ops on surface load/store ops Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35217>	2025-05-29 21:02:37 +00:00
Faith Ekstrand	ad98b76a14	nak: Set cache ops on global memory access on Kepler For now we leave shared and local alone on SM32 (there are no such controls on SM20). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35217>	2025-05-29 21:02:37 +00:00
Faith Ekstrand	a7760e4bd8	nak: Set MemOrder::Constant for CAN_REORDER image loads on all hardware On Turing and Volta, it will safely degrade suld.weak. On Maxwell and Pascal, it will degrade to suld.cta. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35217>	2025-05-29 21:02:37 +00:00
Faith Ekstrand	d3b9752ee6	nak/sm50: Use MemScope::CTA for constant image loads Using MemScope::System synchronizes with everything, which is exactly what we don't want for constant loads. This is currently a no-op because we aren't using MemScope::Constant pre-Ampere yet. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35217>	2025-05-29 21:02:37 +00:00
Faith Ekstrand	6ea0e91c99	nak: Handle suld.constant on Turing and Volta in legalization Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35217>	2025-05-29 21:02:37 +00:00
Lionel Landwerlin	f0e18c475b	intel: remove GRL/intel-clc Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35227>	2025-05-29 20:17:13 +00:00
Mike Blumenkrantz	44bff7eb05	zink: fix queue transition check in check_for_layout_update() this only applies if the resource has active binds, otherwise it triggers crashes Fixes: `18d206d67c` ("zink: Check queue families when binding image resources") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35234>	2025-05-29 15:25:04 -04:00
Mel Henning	6c68c2c3ba	nak/spill_values: Follow phis from src to dest ssa_state_out has the predecessor's SSAValue, so we need look for it in the phi_src map. Totals: CodeSize: 4545122720 -> 4534830176 (-0.23%); split: -0.23%, +0.00% Number of GPRs: 10963889 -> 10963693 (-0.00%); split: -0.00%, +0.00% SLM Size: 1855380 -> 1649308 (-11.11%); split: -11.11%, +0.01% Static cycle count: 1104322907 -> 1093035821 (-1.02%); split: -1.02%, +0.00% Spills to memory: 480689 -> 139107 (-71.06%) Fills from memory: 480689 -> 139107 (-71.06%) Spills to reg: 458804 -> 242139 (-47.22%); split: -47.23%, +0.01% Fills from reg: 303068 -> 222030 (-26.74%); split: -26.75%, +0.01% Max warps/SM: 7245516 -> 7245580 (+0.00%) Totals from 9899 (5.04% of 196502) affected shaders: CodeSize: 1056727952 -> 1046435408 (-0.97%); split: -0.98%, +0.00% Number of GPRs: 1666652 -> 1666456 (-0.01%); split: -0.01%, +0.00% SLM Size: 1107988 -> 901916 (-18.60%); split: -18.61%, +0.01% Static cycle count: 254942337 -> 243655251 (-4.43%); split: -4.43%, +0.01% Spills to memory: 480689 -> 139107 (-71.06%) Fills from memory: 480689 -> 139107 (-71.06%) Spills to reg: 367784 -> 151119 (-58.91%); split: -58.92%, +0.01% Fills from reg: 222209 -> 141171 (-36.47%); split: -36.49%, +0.02% Max warps/SM: 119188 -> 119252 (+0.05%) Fixes: `bcad2add47` ("nak: Add a spilling pass") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35143>	2025-05-29 18:30:40 +00:00
Mel Henning	0e5880ebe4	nvk: Call ensure_slm for nvk_cmd_dispatch_shader Internal shaders can also use slm, so we need to allocate it correctly. This fixes dEQP-VK.dgc.ext.compute.misc.max_pc_range_256_full_preprocess_with_execution_set with NAK_DEBUG=spill Fixes: `105bdf2e36` ("nvk: Add a helper for dispatching compute shaders") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35143>	2025-05-29 18:30:40 +00:00
Mike Blumenkrantz	d8d913c341	zink: also check for host-visible on staging uploads this has strange mechanics on lavapipe Fixes: `e63acdd2b7` ("zink: force cached mem for streaming uploads") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35239>	2025-05-29 13:18:23 -04:00
Faith Ekstrand	cfeda2d8ae	nvk: Disallow GetMemoryFdProperties with OPAQUE_FD_BIT Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35213>	2025-05-29 11:53:27 -04:00
Faith Ekstrand	601cf33c44	nvk: Only allow importing mappable dma-bufs to HOST_VISIBLE types Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35213>	2025-05-29 11:53:27 -04:00
Faith Ekstrand	77eba0980b	nvk: Don't assert memory placement on import Instead, we assert that the non-placement flags match, which is currently CAN_MAP and SHARED. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35213>	2025-05-29 11:53:27 -04:00
Faith Ekstrand	bf82c94751	nvk/nvkmd: Add a force_mem_to_gart() helper Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35213>	2025-05-29 11:53:27 -04:00
Mike Blumenkrantz	05f8b59c90	gallium: delete union pipe_surface_desc this is no longer used Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35168>	2025-05-29 13:07:02 +00:00
Mike Blumenkrantz	ca65f2cf1c	gallium: delete pipe_surface::writable no longer used Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35168>	2025-05-29 13:07:02 +00:00
Samuel Pitoiset	9692ef41a3	aco: implement bitfield_extract for 8-bit/16-bit Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35199>	2025-05-29 12:24:59 +00:00
Daniel Stone	80e19e7b1e	ci: Bump v6.14 kernel for updated Panthor scheduler Apply a newer version of the Panthor scheduling timeout fix from the list. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35230>	2025-05-29 12:05:06 +01:00
Karmjit Mahil	688d8217a5	tu,freedreno: Add pkt_field_{get,set} helper macro It's very common needing to extract or overwrite a certain field in an already packed register value, so add macros to do that instead of manually doing that each time. Signed-off-by: Karmjit Mahil <karmjit.mahil@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35088>	2025-05-29 10:54:28 +01:00
Danylo Piliaiev	398f14ca3d	freedreno: Use fast variants of {BC4/BC5}_SNORM formats Hardware has two types of BC4 and BC5: fast and not. The exact perf difference is not tested, but these distinct formats could be seen in the public docs: Qualcomm Adreno GPU > Spec Sheet -> Texture format https://docs.qualcomm.com/bundle/publicresource/topics/80-78185-2/spec_sheets.html?product=1601111740035277#panel-0-0-1 Found when scanning prop driver's cmdstream seeing unknown format. Passes: dEQP-VK.bc4 dEQP-VK.bc5 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33945>	2025-05-29 09:01:17 +00:00
Eric Engestrom	5a5b00cfca	ci: drop unneeded printing of `pass`/`fail` alongside the exit_code Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35214>	2025-05-29 07:29:25 +00:00
Samuel Pitoiset	fe2c93a788	ac/nir: enable 64-bit lowering for bitfield_extract Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35187>	2025-05-29 08:45:41 +02:00
Samuel Pitoiset	cecf6675be	nir/lower_int64: add bitfield_extract lowering This will be used by RADV for ACO/LLVM. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35187>	2025-05-29 08:45:40 +02:00
Olivia Lee	104ea2e4cf	panfrost: legalize afbc before zs and rt clears In panfrost_clear_depth_stencil and panfrost_clear_render_target, we start the blit context before binding the clear targets. If we don't legalize AFBC beforehand, we get a recursive blit crash. panfrost_clear does not need this because the resource should already be legalized in panfrost_batch_add_surface. Fixes the following piglit tests with pan_force_afbc_packing: - spec@arb_clear_texture@arb_clear_texture-base-formats - spec@arb_clear_texture@arb_clear_texture-simple - spec@arb_clear_texture@arb_clear_texture-sized-formats Fixes: `17a62ff993` ("panfrost: legalize afbc before blitting") Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34992>	2025-05-29 01:50:31 +00:00
Olivia Lee	bed54fa402	panfrost: fix assertion failure compiling image conversion shaders In `59a3e12039`, we changed the UBO->push optimization in panfrost to only push UBOs that are available in a CPU buffer. We require first_ubo_is_default_ubo, to ensure that UBO0 will be a user buffer. We weren't setting this flag for the image conversion shaders, so got an assertion failure compiling them. This can be triggered by the panvk_force_afbc_packing driconf option. The conversion shader info UBO isn't exactly a "default" UBO in the sense of being lowered from uniforms, but it is a user buffer, so setting the flag should be fine. Fixes: `59a3e12039` ("panfrost: do not push "true" UBOs") Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34992>	2025-05-29 01:50:31 +00:00
Yiwei Zhang	749265da0d	vulkan/wsi: split cmd record for img2buf blit and img2img blit There's no behavior change, but to prepare for the next img2buf blit improvement, except adding asserts to make clear of the existing blit code paths. v2: use switch with unreachable default per @gfxstrand has suggested Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35220>	2025-05-29 01:20:27 +00:00
Yiwei Zhang	2af2314fb2	vulkan/wsi: include missing barrier for transferring to blit dst image Fixes: `2975a7f453` ("vulkan/wsi: Add support for image -> image blits") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35220>	2025-05-29 01:20:27 +00:00
Pohsiang (John) Hsu	79bc373b1e	mediafoundation: move readme.md to docs folder Reviewed-by: Yubo Xie <yuboxie@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35219>	2025-05-29 01:03:34 +00:00
Pohsiang (John) Hsu	0107d94632	mediafoundation: add mechanism to disable async and h.264 unwrapped POC (commented out for now) according to gpu/version Reviewed-by: Yubo Xie <yuboxie@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35219>	2025-05-29 01:03:34 +00:00
Pohsiang (John) Hsu	061085708a	mediafoundation: on use LTR, synchronize the active ltr bitmap to the one passed in Reviewed-by: Yubo Xie <yuboxie@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35219>	2025-05-29 01:03:34 +00:00
Pohsiang (John) Hsu	22d84522ed	mediafoundation: get device vendor id, device id, and driver version Reviewed-by: Yubo Xie <yuboxie@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35219>	2025-05-29 01:03:34 +00:00
Pohsiang (John) Hsu	5ee854c4eb	mediafoundation: add ETW event for perf analysis Add perf ETW events using TraceLogging API, the following are adding: - MFT receives fence (FenceCompletion). - MFT has output MFSample (METransformHaveOutput). - MFT calls to pipe end_frame (PipeEndFrame) -- bracketed. - MFT calls to pipe flush (PipeFlush) -- bracketed. - MFT submits a frame to pipe (PipeSubmitFrame) -- bracketed from begine_frame to encode_bitstream/encode_bitstream_sliced - MFT processinput (ProcessInput) -- bracketed - MFT processoutput (ProcessOutput) -- bracketed The ETW provider(s) are: - H264Enc: 0000e264-0dc9-401d-b9b8-05e4eca4977e - H265Enc: 0000e265-0dc9-401d-b9b8-05e4eca4977e - AV1Enc: 0000eaa1-0dc9-401d-b9b8-05e4eca4977e Note that the provider is mostly the same as the WPPTrace provider for each codec, with the additional 'e' (e.g. 0000e264 vs 00000264) Reviewed-by: Yubo Xie <yuboxie@microsoft.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35219>	2025-05-29 01:03:34 +00:00
Lucas Stach	a8009e7c11	etnaviv: move TS allocation to resource allocation Allocate TS together with the tracked resource, which gets rid of the resource mutation on surface creation and the diversion between the internal and shared TS handling. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34488>	2025-05-29 00:48:07 +00:00
Lucas Stach	83ab7a8d58	etnaviv: add resource render compatible check Untangle the convoluted render compatible check from etna_render_handle_incompatible to make it easier to read and move it into a separate function so it can be reused from other callers. As this is intended to be called also at resource creation time, where we don't know the exact level of the resource that might be rendered to, the stride check for linear resources is made a bit more conservative by checking that the last level (the one with the smallest stride) still meets the render target stride alignment requirement. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34488>	2025-05-29 00:48:07 +00:00
Lucas Stach	4717022cb0	etnaviv: drop ts_offset from etna_surface TS is only allocated for single layer surfaces, so there is no need to cache a ts_offset taking into account the layer offset in etna_surface. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34488>	2025-05-29 00:48:07 +00:00
Lucas Stach	50940ce393	etnaviv: don't pretend to support TS for array or 3D textures etna_screen_resource_alloc_ts is only called for textures that have a single layer and slice, as we don't want to duplicate the driver side TS tracking information per layer or depth slice. Stop pretending to support allocating TS for such resources. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34488>	2025-05-29 00:48:06 +00:00
Nanley Chery	965d3ec7d4	intel/isl: Fix isl_surf_image_has_unique_tiles() Prevent the function from unnecessarily returning false by: * Comparing the image tile range with that of every LOD instead of only LOD0. * Using the correct comparison check for the exclusive tile end ranges. Fixes: `8dad01903a` ("intel: Add and use isl _surf_image_has_unique_tiles()") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35192>	2025-05-29 00:11:45 +00:00
Paulo Zanoni	ecc90e1bb3	intel/isl: don't clamp num_elements to (1 << 27) The BSpec page for Structure_RENDER_SURFACE_STATE says: "For typed buffer and structured buffer surfaces, the number of entries in the buffer ranges from 1 to 2^27. For raw buffer surfaces, the number of entries in the buffer is the number of bytes which can range from 1 to 2^30. After subtracting one from the number of entries, software must place the fields of the resulting 27-bit value into the Height, Width, and Depth fields as indicated, right-justified in each field. Unused upper bits must be set to zero." According to the vkd3d-proton developers, this is what is happening with the applications: "There's also the problematic case of games using typed descriptors but passing non-typed buffer descriptors, which is an extremely common app bug that works on all D3D12 drivers that we need to work around by creating typed views." Previously, we had an assert() to check for "num_elements > (1 << 27)", but that assert was preventing us from running games such as Marvel's Spider-Man Remastered and Assassin's Creed: Valhalla in Debug mode. So not only I removed the assert, but I also made the code clamp num_elements to the maximum of (1 << 27) based on my incorrect interpretation of the paragraph quoted above from BSpec. What I did not realize was that num_elements is being used just to calculate Structure_RENDER_SURFACE_STATE Height, Width and Depth, and our register bit fields on SKL and newer are big enough to fit any number of num_elements up to 2^32, not only 2^27. Clamping num_elements results in an incorrect value for S.Depth, which generates visual corruption in some games. On Marvel's Spider-Man Remastered, without this patch the texture of the asphalt in some streets (like the very first one you jump to when the game starts) gets rendered incorrectly. 
Testcase: vkd3d-proton/d3d12/test_large_texel_buffer_view Link: https://github.com/HansKristian-Work/vkd3d-proton/issues/2071 Link: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12827 Fixes: `f3c7e14f09` ("isl: don't assert(num_elements > (1ull << 27))") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35032>	2025-05-28 23:45:54 +00:00
Olivia Lee	97e54511a5	panvk: advertise VK_EXT_shader_subgroup_vote and VK_EXT_shader_subgroup_ballot These are already supported in the compiler. Signed-off-by: Olivia Lee <olivia.lee@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35218>	2025-05-28 23:20:09 +00:00
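
Commit 688d8217a5 above describes helpers for extracting or overwriting one field in an already packed register value. A minimal sketch of that idea in plain C follows. The names `struct field`, `field_mask`, `field_get`, and `field_set` are hypothetical and chosen for illustration only; they are not the `pkt_field_{get,set}` macros from the merge request, whose actual definitions are not shown in this log.

```c
#include <stdint.h>

/* Hypothetical field descriptor: lowest bit position and width in bits.
 * Assumes shift < 32 and 1 <= width <= 32. */
struct field {
    unsigned shift;
    unsigned width;
};

/* Build the in-place mask for a field, avoiding an undefined 32-bit shift. */
static inline uint32_t field_mask(struct field f)
{
    uint32_t ones = f.width >= 32 ? UINT32_MAX
                                  : ((UINT32_C(1) << f.width) - 1);
    return ones << f.shift;
}

/* Extract the field value, right-justified. */
static inline uint32_t field_get(uint32_t packed, struct field f)
{
    return (packed & field_mask(f)) >> f.shift;
}

/* Return a copy of `packed` with the field replaced by `value`.
 * Bits of `value` that do not fit in the field are discarded. */
static inline uint32_t field_set(uint32_t packed, struct field f, uint32_t value)
{
    uint32_t mask = field_mask(f);
    return (packed & ~mask) | ((value << f.shift) & mask);
}
```

For an 8-bit field starting at bit 4, `field_get(0x00000ab0, f)` yields `0xab`, and `field_set(0xffffffff, f, 0x12)` yields `0xfffff12f`, leaving every bit outside the field untouched.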

206146 commits