mirror of
https://gitlab.freedesktop.org/mesa/mesa.git
synced 2025-12-20 03:00:11 +01:00
That are from the gitlog, which we don't currently escape. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38454>
6068 lines
284 KiB
ReStructuredText
6068 lines
284 KiB
ReStructuredText
Mesa 25.3.0 Release Notes / 2025-11-14
|
||
======================================
|
||
|
||
Mesa 25.3.0 is a new development release. People who are concerned
|
||
with stability and reliability should stick with a previous release or
|
||
wait for Mesa 25.3.1.
|
||
|
||
Mesa 25.3.0 implements the OpenGL 4.6 API, but the version reported by
|
||
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
|
||
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
|
||
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
|
||
4.6 is **only** available if requested at context creation.
|
||
Compatibility contexts may report a lower version depending on each driver.
|
||
|
||
Mesa 25.3.0 implements the Vulkan 1.4 API, but the version reported by
|
||
the apiVersion property of the VkPhysicalDeviceProperties struct
|
||
depends on the particular driver being used.
|
||
|
||
SHA checksums
|
||
-------------
|
||
|
||
::
|
||
|
||
SHA256: 0fd54fea7dbbddb154df05ac752b18621f26d97e27863db3be951417c6abe8ae mesa-25.3.0.tar.xz
|
||
SHA512: 46df9e5e27f9a36cf893a68ad4a465fcc6efe1bcb46ad8d4b015699ad1a11e582b8d41f4157326556af603fe454b2ff34ecc17a0c742b5fd9ce5f0097106fec5 mesa-25.3.0.tar.xz
|
||
|
||
|
||
New features
|
||
------------
|
||
|
||
- EGL_EXT_create_context_robustness support on Panfrost V10+
|
||
- GL_ARB_robust_buffer_access_behavior, GL_KHR_robust_buffer_access_behavior and GL_KHR_robustness support on Panfrost
|
||
- VK_EXT_mutable_descriptor_type on panvk/v9+
|
||
- GL_KHR_robustness on v3d
|
||
- VK_ARM_shader_core_builtins on panvk
|
||
- VK_KHR_shader_untyped_pointers on anv
|
||
- cl_ext_immutable_memory_objects
|
||
- VK_KHR_video_encode_intra_refresh on radv
|
||
- VK_KHR_video_encode_quantization_map on radv
|
||
- GL_ATI_meminfo and GL_NVX_gpu_memory_info on r300
|
||
- VK_KHR_shader_untyped_pointers on anv and RADV
|
||
- VK_KHR_maintenance8 on NVK
|
||
- VK_KHR_maintenance9 on NVK
|
||
- cl_khr_semaphore on radeonsi and zink
|
||
- cl_khr_external_semaphore on radeonsi and zink
|
||
- cl_khr_external_semaphore_sync_fd on radeonsi and zink
|
||
- GL_NV_shader_atomic_int64 on radeonsi and Panfrost V9+
|
||
- VK_KHR_maintenance7 on panvk/v10+
|
||
- VK_KHR_maintenance8 on panvk/v10+
|
||
- VK_KHR_maintenance9 on panvk
|
||
- VK_AMD_buffer_marker on NVK
|
||
- VK_EXT_ycbcr_2plane_444_formats on radv
|
||
- Removed VDPAU frontend
|
||
- GL_NV_representative_fragment_test on zink
|
||
- VK_KHR_maintenance9 on HoneyKrisp
|
||
- sparseBinding on panvk/v10+
|
||
- sparseResidencyBuffer on panvk/v10+
|
||
- Vulkan 1.2 on pvr
|
||
- VK_KHR_create_renderpass2 on pvr
|
||
- VK_KHR_dedicated_allocation on pvr
|
||
- VK_KHR_depth_stencil_resolve on pvr
|
||
- VK_KHR_descriptor_update_template on pvr
|
||
- VK_KHR_imageless_framebuffer on pvr
|
||
- VK_KHR_line_rasterization on pvr
|
||
- VK_KHR_maintenance1 on pvr
|
||
- VK_KHR_maintenance2 on pvr
|
||
- VK_KHR_maintenance3 on pvr
|
||
- VK_KHR_multiview on pvr
|
||
- VK_KHR_robustness2 on pvr
|
||
- VK_KHR_separate_depth_stencil_layouts on pvr
|
||
- VK_KHR_shader_draw_parameters on pvr
|
||
- VK_KHR_shader_float_controls on pvr
|
||
- VK_KHR_shader_subgroup_extended_types on pvr
|
||
- VK_KHR_spirv_1_4 on pvr
|
||
- VK_KHR_shader_terminate_invocation on pvr
|
||
- VK_KHR_swapchain_mutable_format on pvr
|
||
- VK_KHR_vertex_attribute_divisor on pvr
|
||
- VK_EXT_border_color_swizzle on pvr
|
||
- VK_EXT_color_write_enable on pvr
|
||
- VK_EXT_custom_border_color on pvr
|
||
- VK_EXT_depth_clamp_zero_one on pvr
|
||
- VK_EXT_depth_clip_enable on pvr
|
||
- VK_EXT_extended_dynamic_state on pvr
|
||
- VK_EXT_extended_dynamic_state2 on pvr
|
||
- VK_EXT_extended_dynamic_state3 on pvr
|
||
- VK_EXT_image_2d_view_of_3d on pvr
|
||
- VK_EXT_line_rasterization on pvr
|
||
- VK_EXT_physical_device_drm on pvr
|
||
- VK_EXT_provoking_vertex on pvr
|
||
- VK_EXT_robustness2 on pvr
|
||
- VK_EXT_queue_family_foreign on pvr
|
||
- VK_EXT_separate_stencil_usage on pvr
|
||
- VK_EXT_shader_demote_to_helper_invocation on pvr
|
||
- VK_EXT_vertex_attribute_divisor on pvr
|
||
- imageCubeArray on pvr
|
||
- independentBlend on pvr
|
||
- sampleRateShading on pvr
|
||
- logicOp on pvr
|
||
- drawIndirectFirstInstance on pvr
|
||
- alphaToOne on pvr
|
||
- samplerAnisotropy on pvr
|
||
- shaderStorageImageExtendedFormats on pvr
|
||
- shaderStorageImageReadWithoutFormat on pvr
|
||
- shaderStorageImageWriteWithoutFormat on pvr
|
||
- shaderClipDistance on pvr
|
||
- shaderCullDistance on pvr
|
||
- VK_EXT_zero_initialize_device_memory on pvr
|
||
- VK_KHR_sampler_mirror_clamp_to_edge on pvr
|
||
- VK_KHR_shader_non_semantic_info on pvr
|
||
- VK_KHR_shader_relaxed_extended_instruction on pvr
|
||
- VK_EXT_shader_replicated_composites on pvr
|
||
- VK_KHR_device_group_creation on pvr
|
||
- VK_KHR_map_memory2 on pvr
|
||
- VK_EXT_map_memory_placed on pvr
|
||
- VK_KHR_device_group on pvr
|
||
- VK_KHR_buffer_device_address on pvr
|
||
- GL_EXT_mesh_shader on zink
|
||
- VK_KHR_wayland_surface on pvr
|
||
- VK_NVX_image_view_handle on NVK
|
||
|
||
|
||
Bug fixes
|
||
---------
|
||
|
||
- amdgpu: ring gfx_0.0.0 timeout, in vr when opening apps
|
||
- zink/radv: new cts fails on rdna3
|
||
- Penumbra: Overture OpenGL game has graphical glitch for ice
|
||
- mesa: regression caused by hash_table sizing
|
||
- RustiCL: fence fd leak on CL-GL interop
|
||
- Uniform variable not updated correctly with shared contexts
|
||
- [radv] Borderlands 4 triggers a consistent GPU page fault on RDNA2
|
||
- radv: RE4 Separate Ways DLC hangs RDNA2 GPU
|
||
- ACO: fix a hazard when the number of attributes loaded/consumed don't match with VS prologs
|
||
- ACO: loading 64-bit attributes can override the fetch index in VS prologs
|
||
- [RADV][bisected][regression] - Doom: The Dark Ages (3017860) - Square flickering artifacts around Hebeth
|
||
- nvk, nak: Broken icons in ENDLESS Legend 2 on a RTX 4080
|
||
- LLVMPipe's \`VkPhysicalDeviceAccelerationStructurePropertiesKHR::maxPrimitiveCount` is lower than Vulkan requires.
|
||
- asahi: DMABuf import of multi-plane YCbCr (NV12 from ISP) not renderer correctly
|
||
- brw: Gfx9 sampler messages violate r127 rule
|
||
- radv: No Man's Sky XESS page fault GPU reset
|
||
- r600/sfn: Assertion \`cir.alu_vec.empty()` failed
|
||
- radv: Hit assert when over maxFragmentDualSrcAttachments but vkCmdSetColorBlendEnableEXT is set to false
|
||
- [ANV][PTL][DG2] Flickering textures in Assassin's Creed Valhalla benchmark
|
||
- ADL, ANV: Wuthering Waves leads to gpu reset on Alder Lake iGPU
|
||
- RADV: ANGLE deqp regression
|
||
- [ANV][EXT_debug_utils] descriptor set object_name leak when not calling vkFreeDescriptorSets
|
||
- nvk: CTS failures in sample_locations_ext.verify_interpolation.samples_1
|
||
- [regression] [bisected] RuneLite GPU Experimental - GPU crash
|
||
- Missing definition of __builtin_ia32_clflush since "util/cache_ops: Add some cache flush helpers"
|
||
- LLVM instruction selection compilation error
|
||
- v3d: green screen when rpivid hevc decoder is used
|
||
- [radv] Stuttering with latest mesa git (21 sept) on radv/6900 XT
|
||
- BFN with UW sources gets munged by lower regioning
|
||
- zink: chromium flickers in youtube when fullscreening videos
|
||
- r600: Attribute stride updates may be skipped
|
||
- [ANV][TGL]: test_buffer_feedback_instructions_sm51 on vkd3d-proton crashes
|
||
- some video file are not shown in mpv when using vaapi hardware decoding on amd apu
|
||
- [ANV][PTL] Indiana Jones and the Great Circle - GPU Hang
|
||
- [ANV] [PTL] Hades 2 game freeze on start of gameplay
|
||
- [anv][ptl] GPU hang in Dying Light dx12
|
||
- radv: Only look at statically used descriptors.
|
||
- RADV: Consider always using the global bo list
|
||
- anv: Age of Wonders 4 corruption on a Arc b580
|
||
- nvk: Incorrect rendering in Baldur's Gate 3 shadows starting with e6dae6ef5fc134f9ed5dd93b1a462084bc3aadfd
|
||
- nvk commets cause problems with kepler
|
||
- anv: Assert in brew when descriptor indexing with modulo
|
||
- tu: VK_EXT_zero_initialize_device_memory
|
||
- ResourceTracker.cpp:40:10: fatal error: perfetto/tracing.h: No such file or directory
|
||
- A bunch of CTS tests are failing on Gfx12.0 trying to use the blitter with TILE_X
|
||
- radv: meta pipeline cache appears to be broken
|
||
- mesa:amd+compiler / aco_tests assembler.mubuf/gfx11 failure with llvm-21.1.2
|
||
- [ANV] Bunch of tests in dEQP-VK.pipeline.*.render_to_image.*3d.*2d_compatible failing on gen9/11
|
||
- elk: segfault in lower_txd_cb
|
||
- bisected: Regression in EXT_shader_framebuffer_fetch_non_coherent test after !37527
|
||
- VK_QUERY_RESULT_WAIT_BIT does not work for VK_QUERY_TYPE_VIDEO_ENCODE_FEEDBACK_KHR
|
||
- a618-traces often times out
|
||
- bisected build failure in clc_helpers.ccp with llvm 22
|
||
- anv: GL mesh tests crash/fail on zink with shader object
|
||
- 25.2.1 fails to build on risc-v with llvm 21
|
||
- RISC-V builds with llvmpipe against LLVM 21 fail due to API changes
|
||
- Confidential issue #14013
|
||
- implicit-function-declaration error when compiling mesa 25.2.0 devel
|
||
- vl_stubs.c:105:1: error: conflicting types for 'vl_mpg12_bs_decode'
|
||
- [ANV][LNL] - FINAL FANTASY XVI (2515020) - Title crashes to Desktop immediately following the splash card.
|
||
- Segfault in init_source at ../src/gallium/auxiliary/vl/vl_idct.c:597 when trying to play DVD on r600
|
||
- nvk: Failure in vkd3d-proton ibfe tests
|
||
- nvk, nak: NAK panic in Call of the Wild: The Angler on RTX 4080
|
||
- Simple External Semaphore test hangs in vk_sync_wait
|
||
- nir_builtin_builder.h:108:43: error: 'M_LOG2E' undeclared
|
||
- regression: windows: msys2 - undeclared M_PI and M_LOG2E probably since !37289 21b8e7604ba51f90682adeff650fc866c71c57f2
|
||
- dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp32.input_args.reflect_denorm_flush_to_zero regression on nvk
|
||
- mesa-25.2.3/src/gallium/drivers/radeonsi/radeon_uvd.c:658: array index used before check ?
|
||
- lp_test_arit.c:200:14: error: static declaration of ‘rsqrtf’ follows non-static declaration
|
||
- build failure with glibc 2.42
|
||
- [bisected] 44aaf884254 regressing FSR vulkan cts tests on PTL
|
||
- [bisected] f416a529 "egl: refine dma buf export to support multi plane" results in piglit crash
|
||
- Crash on game Elite Dangerous at 0% planetary generation, on Tigerlake+ Iris Xe and Arc GPUs.
|
||
- regression;bisected;amd: 0a266f0256025d271945adb3478fc2c1291d4c79 leads pgadmin4-qt to crashes
|
||
- segfault with mesa >= 24.1.0 on nvidia
|
||
- segfault through lavapipe
|
||
- Confidential issue #13807
|
||
- [bisected] 25b97a mesa/st: mark internal texture map calls as UNSYNCHRONIZED breaks r600
|
||
- Gallium: Segfault while trying to compile a shader with differing UBO contents in fragment and vertex stage - With reproduction case
|
||
- aco: generate wrong code when gl_DrawID is used by primitive indices in mesh shader
|
||
- Regression since mesa 25.2.0: applications waiting for dGPU to start
|
||
- ci: libX11 upgrade tracker
|
||
- anv: Regression in dEQP-VK.graphicsfuzz.cov-nested-loops-set-struct-data-verify-in-function
|
||
- brw: regression crash on dEQP-VK.graphicsfuzz.cov-dfdx-dfdy-after-nested-loops
|
||
- a618-traces often times out
|
||
- ci: crosvm dumping log spam from host gl when the job fails
|
||
- panfrost: assertion fail in pan_image_get_wsi_row_pitch
|
||
- virgl: guest memory leak with qemu + virtio-gpu-gl
|
||
- [ANV][LNL] - Horizon Forbidden West™ Complete Edition (2420110) - Orbicular artifacts near heads of machines (wildlife).
|
||
- iris: Assertion failures in piglit tests on all platforms
|
||
- [radv] [Regression) Shadow of the Tomb Raider - flickering/missing textures
|
||
- Minecraft 1.12.2 visual artifacts when running on zink/radv
|
||
- [RADV][VEGA 64][bisected] Cyberpunk 2077 - Massive performance regression due to https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37025/diffs?commit_id=d7f401c2bbadd192dbbcaaeede2805bad71f6193
|
||
- [PTL] hitting assert when starting Xorg
|
||
- GZDoom 4.11/Raze 1.7 exhibit very high memory usage during shader compilation under OpenGL
|
||
- anv: Assertion failure replaying q2rtx fossil
|
||
- [ANV] [REGRESSION] PCSX2/Midnight Club 3 crashes with VK_ERROR_DEVICE_LOST on Mesa 25.0.7
|
||
- Hollow Knight Silksong segfaults with zink on radv
|
||
- zink: crash in KHR-GL46.framebuffer_blit.scissor_blit
|
||
- Request: RADV support for VK_EXT_ycbcr_2plane_444_formats..
|
||
- mesa: state parameters duplicated
|
||
- ARB_vertex_program and ARB_fragment_program are broken
|
||
- d3d10umd fails to create basic pixel shader, outputs "unknown TGSI opcode: RET"
|
||
- turnip: FDM failures with forcebin
|
||
- a7xx_state_location
|
||
- [ANV] Assertion with VVL GPU-AV around robust UBO
|
||
- radv: avoid advertising unsupported global queue priorities for the client
|
||
- crocus: SIGSEGV crash at pbo compressed teximage
|
||
- nvk: test_conditional_rendering fails on vkd3d-proton
|
||
- Segfault in x11_xlib_display_is_thread_safe
|
||
- [ANV][BMG] Witcher 3 ray tracing freeze on a Arc b580
|
||
- anv: optimize utrace overhead from bo memset
|
||
- radv: watching vp9 encoded video with vulkan hwdecode result in artifacts
|
||
- ci: zink-venus-lavapipe errors
|
||
- [ANV][DG2][BMG] 3DMark Solar Bay Misrendering
|
||
- lavapipe defaults to memfd when multiple export types are requested
|
||
- anv: Simple vulkan compute shader causes Intel GPU hang due to excessive loop unrolling
|
||
- INTEL_DEBUG=spill_fs regression
|
||
- NIR validation failed after nir_lower_io in DOOM The Dark Ages
|
||
- radv: NIR validation failed after nir_shrink_vec_array_vars in ../src/amd/vulkan/radv_shader.c:171 (bisected)
|
||
- Wayland EGL missing pbuffer surface support
|
||
- missing sparse synchronization in zink
|
||
- ACO validation failed in DOOM: The Dark Ages - Undef operand to \`p_parallelcopy`
|
||
- ACO validation failure in DOOM: The Dark Ages Shader
|
||
- Dragon Age Veilguard / Ability Wheel Targeting Visual Distortion
|
||
- [AMD] glTexturePageCommitmentEXT triggers an error if level is higher than 0
|
||
- deqp-egl multithread link flakes
|
||
- Ratchet and Clank "[gfxhub] page fault" Mesa25.3
|
||
- [RADV] Support for VK_KHR_video_encode_quantization_map
|
||
- System Crashes when starting VR on rx 9070 (xt)
|
||
- [Build][32bit] Meson does not find libdisplay-info in 32-bit builds
|
||
- freedreno,decode: Lua environment picks up the A6XX register offset instead of A7XX
|
||
- Confidential issue #13351
|
||
- d3d10umd: Build regression on 25.2.0-rc1
|
||
- turnip: LRZ bug with TU_DEBUG=gmem,forcebin
|
||
- nvk/nak regression: memory_model.message_passing fails on KeplerA
|
||
- [ANV] dEQP-VK.api.copy_and_blit.copy_commands2.image_to_buffer_transfer_queue.2d_images.whole_r32g32b32_uint_linear and possibly others asserts on LNL
|
||
- nir: validation failed after nir_remove_dead_variables in 3DMark Solar Bay
|
||
- Build dependency on intel_wa.h missing in Intel vulkan driver
|
||
- Missing polygons/vertices in CS2 on BMG
|
||
- \`vn_ring`: use-after-release crash after \`vn_ring_destroy` on Virtio-Vulkan
|
||
- venus: vkmark --winsys headless segfault (regression)
|
||
- Vulkan headless WSI crashes when initializing swapchain on Asahi Linux running Apple M1 Max
|
||
- lavapipe: Crashes on simple Descriptor Buffer test
|
||
- make zink-radv-navi31-valve a pre-merge job
|
||
- [RADV] Graphical glitches in Ghost of Tsushima on Polaris
|
||
- radv: RT regressions
|
||
- macOS: use of undeclared identifier 'free_zombie_glx_drawable'
|
||
- macOS: Undefined symbols "_pipe_loader_drm_probe_fd", referenced from: _dri_get_drm_device_info in libdri.a[7](dri_util.c.o)
|
||
- Segfault when activating DPMS on i915 hardware
|
||
- RADV caps reported sparse address size at 4 GiB
|
||
- nvk Blackwell support
|
||
- hk: framerate limited/locked to 23 in RDR2 ingame menus (Vulkan)
|
||
- i915: multiple dEQP tests asserts at nir_opt_group_loads.c:75: get_load_resource: Assertion \`!"" "tex instr should have a resource"' failed.
|
||
- tu: VK_KHR_fragment_shading_rate broken when HelperInvocation is used
|
||
- radv: regression: commit a7291074c800 break lighting in Like a Dragon: Infinite Wealth
|
||
- [bisected regression] Latest mesa-git keeps /dev/dri/renderD129 always open with new applications even though they don't use it at all
|
||
- spec\@arb_shader_storage_buffer_object\@max-ssbo-size\@fs stack overflow since cb558b2b88c2
|
||
- anv: enable compression on ASTC LDR emulation surface
|
||
- High GPU usage when using Zink for eglgears_x11 (on X11)
|
||
- Segfault in X11 image acquire code with timeout=0
|
||
- v3dv: regression in vkAllocateMemory importing gbm bo
|
||
- Crash from iris_set_sampler_views in chromium/chrome with accelerated video decoding
|
||
- rusticl: aco: LLVM outperforms ACO in clpeak for \`short` benchmarks on hawaii
|
||
- rusticl: aco: Performance regression in clpeak for char benchmarks on hawaii
|
||
- nir: Deprecate NIR_PASS_V
|
||
- zink on tu assertion failing doing shader-db runs.
|
||
- Race condition with timeline semaphores
|
||
- nir_algebraic silently ignores operand conditions in some cases
|
||
- lavapipe: valgrind triggers errors with CTS unit tests when creating a vulkan device
|
||
- radv: gfx12 RGP captures don't support instruction timings for graphics pipelines
|
||
- xe2: DMA Buffer exported modifier is incorrect
|
||
- cleanup CI kernel patches
|
||
- radv: more glcts fails KHR-GL46.shading_language_420pack.initializer_list_initializer*
|
||
- radv: regression in KHR-GL46.gpu_shader5_gl.float_encoding
|
||
- !36097 breaks Xwayland (& others)
|
||
- GPU process crash via WebGPU shader - heap-buffer-overflow in Mesa build_interference_graph
|
||
- radeonsi: Broken VAAPI video color conversion
|
||
- Gallium HUD broken since !34054
|
||
|
||
|
||
Changes
|
||
-------
|
||
|
||
Aaron Ruby (13):
|
||
|
||
- gfxstream: Rename platform/linux to platform/drm
|
||
- gfxstream: init vk_queues in CreateDevice() based on queueCreateInfo
|
||
- gfxstream: Remove all "Yoda conditions" in gfxstream_vk_device.cpp
|
||
- gfxstream: Downgrade some debug prints to traces
|
||
- gfxstream: Remove duplicate/unnecessary frees in destroyDevice
|
||
- gfxstream: Modify deviceName, driverVersion, driverName, driverInfo ...
|
||
- gfxstream: Use the Mesa common tss_* TLS helper functions
|
||
- gfxstream: Remove on_vkGetDeviceQueue* impls entirely
|
||
- gfxstream: Pre-fetch the VkQueue objects from the host
|
||
- gfxstream: Add init+tracking for the host-equivalent queue_family information
|
||
- vulkan/wsi: No commandPool allocation required for WSI_SWAPCHAIN_NO_BLIT
|
||
- gfxstream: Prune all guest-side KHR entrypoints that are provided with VK_VERSION_1_1
|
||
- gfxstream: address-space graphics requires kParamResourceBlob and kParamHostVisible
|
||
|
||
Agate, Jesse (1):
|
||
|
||
- amd/vpelib: Use Ceil Division Macro
|
||
|
||
Ahmed Hesham (1):
|
||
|
||
- rusticl: Fix negative CTS device tests
|
||
|
||
Aitor Camacho (7):
|
||
|
||
- nir: Set cursor in lower_sampler_lod_bias
|
||
- meson: static link spirv-tools for darwin
|
||
- wsi/metal: Cleanup unused members in wsi_metal_swapchain
|
||
- wsi/metal: Fix wsi_metal_surface_get_formats2
|
||
- wsi/metal: Disable reference counting
|
||
- wsi/metal: Fix size query and present result
|
||
- wsi/metal: Backend addition for drivers built on top of Metal
|
||
|
||
Aksel Hjerpbakk (5):
|
||
|
||
- panvk: avoid cs jump block with no allocator
|
||
- panvk: implement cs_extract64 & cs_extract_tuple
|
||
- panvk: Use a single FBD for IR
|
||
- panvk: pool large TLS allocations
|
||
- panvk: clear big_bos on cmd pool reset with release bit
|
||
|
||
Alejandro Piñeiro (4):
|
||
|
||
- broadcom/compiler: update compact arrays comment
|
||
- docs: GL_ARB_compute_shader is not a ES extension
|
||
- v3d: use directly MESA_TRACE_SCOPE for additional context
|
||
- v3d: expose GL_KHR_shader_subgroup for v71+
|
||
|
||
Aleksi Sapon (11):
|
||
|
||
- meson: add missing x11 dependency on libloader_x11
|
||
- util: SWAP macro implementation for older MSVC versions
|
||
- wsi/metal: current extents might not be known until swapchain is created
|
||
- draw: fix missing line viewport transformation
|
||
- draw: don't set the clipped window coordinate to NaN in debug
|
||
- nir: Fix gnu-empty-initializer warning
|
||
- nir: Fix nir.h MSVC compilation for C++ source files
|
||
- wsi/metal: move VkFormat -> MTLPixelFormat conversion to wsi_common_metal_layer.m
|
||
- wsi/metal: add support for color spaces
|
||
- wsi/metal: fix cleanup on swapchain image creation failure
|
||
- vk: Fix MSVC warning C4189
|
||
|
||
Alessio Belle (4):
|
||
|
||
- pvr: Fix error value returned by pvr_rt_datas_init
|
||
- pvr: Replace check on Mlist size with assert
|
||
- pvr: Pass the PM/FW protect flag to the Mlist allocation
|
||
- pvr: add device info for BXM-4-64 (36.56.104.183)
|
||
|
||
Alexandros Frantzis (1):
|
||
|
||
- egl/wayland: Support pbuffer surfaces
|
||
|
||
Ali, Nawwar (1):
|
||
|
||
- amd/vpelib: add FL capabilitie and lut container size
|
||
|
||
Alyssa Rosenzweig (145):
|
||
|
||
- nir/opt_preamble: add sampler class
|
||
- nir: add bindless_sampler_agx intrinsic
|
||
- hk: dedupe hk_buffer_view_descriptor
|
||
- hk: push descriptor set addresses
|
||
- hk: embed texture desc in set
|
||
- hk: stop pushing image heap
|
||
- hk: stop reserving uniform for image heap
|
||
- hk: drop image heap
|
||
- asahi: drop image heap decode
|
||
- agx: report sampler state count
|
||
- hk: plumb sampler state counts
|
||
- hk,agx: promote bindless samplers
|
||
- hk: optimize desc set addr push
|
||
- hk: only pass sampler heap if needed
|
||
- nir: add nir_mov_scalar helper
|
||
- treewide: use nir_mov_scalar
|
||
- util: crib SWAP macro from freedreno
|
||
- nir: mark exact fmul in ldexp lowering
|
||
- nir: introduce "inexact associative" property
|
||
- nir: restrict associativity to binary operations
|
||
- nir: unmark 24b multiply as associative
|
||
- agx: fix dead phis
|
||
- agx: simplify block image store offset
|
||
- agx: optimize txl LOD
|
||
- agx: optimize imgwblk uniform
|
||
- agx: add immediate load ts/ss encodings
|
||
- agx: use immediate load ts/ss forms
|
||
- hk: use amul instead of imul
|
||
- hk: always lower bindless samplers
|
||
- hk: readvertise required bgra4 format
|
||
- nir: introduce ergonomic tex builder
|
||
- nir/lower_drawpixels: use tex builder
|
||
- nir/lower_bitmap: use more effective NIR
|
||
- vulkan/nir_convert_ycbcr: use more effective nir
|
||
- radv: remove redundant nir->info.internal = true
|
||
- tu: use more effective NIR in meta shaders
|
||
- freedreno: use tex builder
|
||
- asahi: use tex builders
|
||
- dzn: drop redundant internal = true writes
|
||
- nir: add vbo_stride_agx
|
||
- hk: support static vertex input state
|
||
- util: make SWAP safe for MSVC
|
||
- nir: add nir_alu_src_rewrite_scalar helper
|
||
- nir: add ALU reassocation pass
|
||
- agx: make sure denorm flushing really happens
|
||
- agx: run more opt passes
|
||
- agx: reassociate ALU
|
||
- vulkan: fix shader linking with common pipelines
|
||
- glsl,nir: factor out nir_opt_varyings_bulk
|
||
- nir: handle frag_coord_z/w intrinsics
|
||
- nir/opt_vectorize_io: allow i/o semantics w/o component
|
||
- nir/divergence_analysis: handle more AGX
|
||
- agx/nir_lower_gs: handle XFB corner
|
||
- hk: optimize varyings
|
||
- dzn: use common SWAP
|
||
- treewide: use SWAP macro
|
||
- nir/lower_system_values: simplify load_helper_invocation lowering
|
||
- nir: drop load_sample_id_no_per_sample
|
||
- nir: add nir_def_as_* helpers
|
||
- nir: add nir_def_block helper
|
||
- treewide: use nir_def_as_*
|
||
- treewide: simplify nir_def_rewrite_uses_after
|
||
- treewide: use nir_def_block
|
||
- asahi: clang-format
|
||
- clc: force exact! across libclc
|
||
- asahi: drop sink/move in GS code
|
||
- agx: try to rematerialize to improve occupancy
|
||
- asahi: use native colour masking
|
||
- hk: kill psiz writes via topology, not feature
|
||
- hk: only enable image view min LOD for dx12
|
||
- asahi: optimize pass type with depth-only passes
|
||
- asahi,hk: optimize no-op FS
|
||
- asahi: rename compressed 1 to just compressed
|
||
- agx: add foreach_reg_{src,dest}
|
||
- agx: track block divergence
|
||
- agx: fix reg cache printing
|
||
- agx: fix export instructions in the IR
|
||
- agx: fix simd reduce forcing no cache bit
|
||
- agx: fix cache bit packing
|
||
- agx: plumb is_alu query for reg cache opt
|
||
- agx: lower export even later
|
||
- agx: set register cache hints
|
||
- agx: handle 16-bit coordinates
|
||
- asahi: use 16-bit coordinates for bg program
|
||
- libagx: factor out query_report
|
||
- libagx: port reset query helper to libagx
|
||
- hk: use new reset query kernel
|
||
- people: add John Anthony
|
||
- nir: add nir_inline_sysval pass
|
||
- brw: replace lower_fs_msaa with nir_inline_sysval
|
||
- pan/bi: replace specialize_idvs with nir_inline_sysval
|
||
- lvp: replace lower_ray_tracing_stack_base with inline_sysval
|
||
- panfrost: don't use nir_lower_printf_buffer
|
||
- nir,agx: pull lower_printf_buffer into backend
|
||
- nir: gather info in opt_varyings_bulk
|
||
- nir: gather interpolation qualifiers
|
||
- nir/opt_varyings: link interpolation qualifiers
|
||
- asahi: use NIR gathered interpolation
|
||
- asahi: inline UVS indices
|
||
- asahi: enable virtgpu support
|
||
- panvk: rewrite pan_nir_lower_static_noperspective
|
||
- agx: gate scratch opt on internal shaders
|
||
- asahi: clang-format
|
||
- asahi: reduce ppp alignment
|
||
- hk: fix todo
|
||
- hk: clarify command pool types
|
||
- hk: fix pathological RAM use for tess emulation
|
||
- hk: drop unused
|
||
- hk: reduce storage desc
|
||
- nir/lower_subgroups: add lower_fp64 option
|
||
- nir: plumb ballot options
|
||
- glsl: lower fp64 subgroup ops
|
||
- agx: lower fmin/fmax scans
|
||
- asahi: implement KHR_shader_subgroup
|
||
- agx: drop bounds check optimize pass
|
||
- people: update Alyssa's email
|
||
- mailmap: add Alyssa's Intel e-mail address
|
||
- hk: assume largePoints always set
|
||
- asahi: fix drm-shim
|
||
- util: add util_bit_swap macro
|
||
- util: add boolean lookup table helpers
|
||
- util: add unit tests for util/lut.h
|
||
- agx: use util_lut2
|
||
- nir/lower_flatshade: clean up
|
||
- brw: drop unused brw_kernel code
|
||
- brw: drop indirection on compiler options
|
||
- brw: hoist shared options out of the stage loop
|
||
- brw: cleanup int64 option set
|
||
- anv,hasvk: do not use unify_interfaces
|
||
- brw: drop printf info plumbing
|
||
- intel: drop clamp_fragment_color handling
|
||
- intel: drop legacy flatshade handling
|
||
- util/shader_stats: allow "hidden" stats
|
||
- brw,anv: use XML-based stats
|
||
- util: add BITSET_CALLOC helper
|
||
- treewide: use BITSET_CALLOC
|
||
- brw/nir_lower_alpha_to_coverage: eliminate goto
|
||
- brw/nir_lower_fs_barycentrics: avoid nir_def_rewrite_uses_after
|
||
- brw/nir_lower_sample_index_in_coord: use helpers
|
||
- brw/nir_lower_shader_calls: use helpers
|
||
- brw/nir_lower_storage_image: use helper
|
||
- intel/nir_blockify_uniform_loads: use helpers
|
||
- treewide: don't check before free
|
||
- anv: use D3D-compatible texturing for Proton
|
||
- asahi,ail: fix multi-plane imports
|
||
|
||
Alyssa Ross (4):
|
||
|
||
- gfxstream: guest: don't use transitional LFS64 API
|
||
- docs: update GitLab option name
|
||
- meson.build: remove dead code
|
||
- meson.build: set with_clc for asahi tools
|
||
|
||
Anna Maniscalco (4):
|
||
|
||
- tu: Add support for realtime vk priority
|
||
- mailmap: Update my name
|
||
- freedreno/registers: add CP_ALWAYS_ON_CONTEXT
|
||
- freedreno/afuc: Add x1e fw-id
|
||
|
||
Ansari, Muhammad (1):
|
||
|
||
- amd/vpelib: VPE Events
|
||
|
||
Antonio Ospite (32):
|
||
|
||
- ci/android: update comment about ANDROID_CTS_MODULES
|
||
- ci/android: fix exit code from android-cts-runner.sh and android-deqp-runner.sh
|
||
- zink: fix assigning _Bool to _Bool*
|
||
- nir: fix returning _Bool instead of pointer
|
||
- crocus: fix returning _Bool instead of pointer
|
||
- zink: fix returning _Bool instead of pointer
|
||
- anv: fix returning _Bool instead of pointer
|
||
- nak: fix returning _Bool instead of pointer
|
||
- radv: fix returning _Bool instead of pointer
|
||
- dril: fix returning _Bool instead of pointer
|
||
- microsoft/compiler: fix returning _Bool instead of pointer
|
||
- asahi: fix returning _Bool instead of pointer
|
||
- etnaviv: fix returning _Bool instead of pointer
|
||
- lima: fix returning _Bool instead of pointer
|
||
- broadcom/compiler: prevent FALLTHROUGH error with C23
|
||
- glsl: rename state name to avoid conflicts with future changes
|
||
- build: stop calling unreachable() without arguments
|
||
- build: avoid redefining unreachable() which is standard in C23
|
||
- util: avoid calling UNREACHABLE(str) macro without arguments
|
||
- libcl: avoid calling UNREACHABLE(str) macro without arguments
|
||
- nak/nouveau: silence errors about never used methods
|
||
- compiler/rust: fix errors about hiding elided lifetime
|
||
- ci/android: add rust compiler to create-android-cross-file.sh
|
||
- ci/android: add comment about updating tags to create-android-cross-file.sh
|
||
- nvk: silence error when cross-building for Android
|
||
- subprojects: fix ignore exception for files under packagefiles/
|
||
- meson: handle dep_libdrm before the driver specific libdrm modules
|
||
- ci: bump DEBIAN_BUILD_TAG to include all the android/rust changes for nvk and panvk
|
||
- ci/android: enable cross-building nvk and panvk for Android
|
||
- radv: don't include amdgpu.h directly
|
||
- radv: fix building with libdrm as a submodule
|
||
- device-select: fix build errors on some stricter build configurations
|
||
|
||
Arkadiusz Hiler (1):
|
||
|
||
- wsi/display: Avoid connector reprobes in wsi_GetRandROutputDisplayEXT
|
||
|
||
Arseny Kapoulkine (1):
|
||
|
||
- ac/rgp: Warn when RGP capture can't be saved without libelf
|
||
|
||
Asahi Lina (1):
|
||
|
||
- asahi: Ensure shared BOs have a prime_fd
|
||
|
||
Ashish Chauhan (10):
|
||
|
||
- pvr: temporary spm tweaks
|
||
- pvr: Add support for gpu multicore MC1 configurations
|
||
- pvr: Implement WA BRN_72168
|
||
- pvr: Implement WA BRN_72463
|
||
- pvr: Enable PBE_FILTERABLE_F16
|
||
- pvr: Feature support TPU_PARALLEL_INSTANCES
|
||
- pvr: Enable PDS_DDMADT
|
||
- pvr: Enable shaderStorageImageExtendedFormats
|
||
- pvr: Drop broken driver environment variable check for BXS-4-64
|
||
- pvr: Drop '-experimental' suffix from the 'imagination' build option
|
||
|
||
Ashley Smith (4):
|
||
|
||
- mesa: Fix support for GL_EXT_shader_clock
|
||
- panfrost: Enable shader_atomic_int64 for gallium
|
||
- panfrost,mesa: Fix versions for EXT_shader_realtime_clock
|
||
- panfrost,mesa: Fix versions for EXT_shader_clock
|
||
|
||
Assadian, Navid (3):
|
||
|
||
- amd/vpelib: Exit when VPE not support in debug
|
||
- amd/vpelib: Add necessary pointer casting
|
||
- amd/vpelib: Add new colors to visual confirm
|
||
|
||
Autumn Ashton (4):
|
||
|
||
- radv: Implement VK_KHR_video_encode_quantization_map
|
||
- radv: Support VK_IMAGE_TILING_OPTIMAL for quantization maps
|
||
- radv: Allow MUTABLE_FORMAT and EXTENDED_USAGE for qp_map images
|
||
- nvk: Implement VK_NVX_image_view_handle
|
||
|
||
Bas Nieuwenhuizen (2):
|
||
|
||
- device-select: Fix error check.
|
||
- radv: use vk_drm_syncobj_copy_payloads
|
||
|
||
Benjamin Cheng (11):
|
||
|
||
- vulkan/video: Add vk_video_is_profile_supported()
|
||
- radv/video: Fix video profile reporting
|
||
- radv/video: Report extra image usages
|
||
- vulkan/query_pool: Store video encode feedback
|
||
- radv: Output requested encode query results only
|
||
- radv/video: Fill maxCodedExtent caps first
|
||
- radv/video_enc: Cleanup slice count assert
|
||
- radv/video: Override H265 SPS block size parameters
|
||
- radv/video: Override H265 SPS unaligned resolutions
|
||
- vulkan/video: NULL check codec-specific chain
|
||
- radv/video: Fix dummy DPB addresses
|
||
|
||
Benjamin Otte (1):
|
||
|
||
- device_select: Allow shortcut names for device types
|
||
|
||
Bo Hu (2):
|
||
|
||
- gfxstream: update codegen for event save and load
|
||
- gfxstream: [vulkan snapshot]: update code gen for vkUpdateDescriptorSet change
|
||
|
||
Bohan Yu (1):
|
||
|
||
- gallium: Fix LLVMpipe function parameter of Vector type call load mismatch
|
||
|
||
Boris Brezillon (53):
|
||
|
||
- panfrost: Add get_device_reset_status() to the CSF backend
|
||
- panfrost: Add a GPU fault injection mechanism
|
||
- panfrost: Log when an unusable group caused a context re-initialization
|
||
- util/format: Auto-generate the enum pipe_format definition
|
||
- util/format: Use more descriptive names for YUV formats
|
||
- util/format: Add subsampling info to our YUV-as-RGB format names
|
||
- util/format: Auto-generate a bunch of YUV helpers
|
||
- pan/mod: Add a pan_mod_get_handler() implementation when PAN_ARCH is defined
|
||
- pan/mod: Replace ::supports_format() by ::test_props()
|
||
- pan/image: Provide two helpers to check image viability
|
||
- panvk: Use pan_image_test_props() to do our modifier check
|
||
- panfrost: Don't check for MTK_TILED when walking the native modifiers list
|
||
- dri: Don't pretend we can lower NV15/NV20 when we can't
|
||
- panfrost: Use pan_image_test_modifier_with_format() to do our modifier check
|
||
- panvk: Remove leftovers from CPU-side min/max index calculation
|
||
- panvk: Fix disjoint image memory binding
|
||
- panvk: Fix panvk_image_can_use_afbc() for GetPhysicalDeviceImageFormatProperties2()
|
||
- panvk: Pass a correct aspect to panvk_plane_index()
|
||
- panvk/jm: Preload the FB even if we have no draws queued
|
||
- panvk/jm: Automatically open a batch in dispatch_precomp()
|
||
- panvk/jm: Add a JM barrier on clear AFBC jobs
|
||
- panfrost: Fix panfrost_batch_to_fb_info() for stencil-only attachments
|
||
- pan/mod: Allow testing if a modifier is optimal
|
||
- pan/format: Fix the mapping for Z32_FLOAT on v7+
|
||
- panfrost: Explicitly reject AFBC(Z32)
|
||
- pan/afbc: Add missing S8 and Z32 cases to pan_afbc_format()
|
||
- panvk: Hook-up optimal modifier selection
|
||
- util/format: Autogen type conversion helpers
|
||
- pan/afbc: Cache the pan_afbc_mode selection
|
||
- panfrost: Explictly filter out AFBC(SNORM)
|
||
- pan/desc: Upgrade writeback format to RAW32 on v9+ when AFBC(RAW24)
|
||
- pan/afbc: Allow AFBC on UINT/SINT/SNORM types on v9+
|
||
- panvk: Don't allow AFBC if the format format is mutable on v7-
|
||
- panvk: Make panvk_meta.h per-gen
|
||
- panvk: Consolidate image copy format selection
|
||
- panvk: Disallow AFBC(D24S8) if separateDepthStencilLayouts=true
|
||
- panvk: Make AFBC an opt-out
|
||
- util/format: Add a Z24_UNORM_PACKED format
|
||
- pan/lib: Hook-up Z24_UNORM_PACKED support
|
||
- panvk: Initialize panvk_image::plane_count early
|
||
- panvk: Pass an image to panvk_plane_count()
|
||
- panvk: Stop using panvk_image_can_use_afbc() in panvk_image_can_use_mod()
|
||
- panvk: Add planar Z24S8 support
|
||
- drm-uapi: Sync panfrost_drm.h
|
||
- pan/kmod: query and cache available context priorities from KMD
|
||
- panfrost: Support JM context creation and destruction
|
||
- panfrost: Support debugging JM context priorities with env vars
|
||
- panvk: Fix ordering in prepare_draw()
|
||
- panvk: Don't expose low/high priority queues on Bifrost
|
||
- vk/meta: Support DS <-> color copies
|
||
- panvk: Fix panvk_interleaved_copy() formatting
|
||
- panvk: Fix host copies on planar DS resources
|
||
- panvk: Only use Z24_UNORM_PACKED for AFBC images
|
||
|
||
Boyuan Zhang (5):
|
||
|
||
- pipe: add gaps_in_frame for h264
|
||
- frontends/va: get gaps_in_frame for h264 dec
|
||
- radeon/vcn: add gaps_in_frame flag to h264 sps
|
||
- ci/fluster: remove 3 pass cases resulted by gaps_in_frame
|
||
- radeonsi/vcn: adjust subsample size alignment
|
||
|
||
Brais Solla (2):
|
||
|
||
- r300: Added support for GL_ATI_meminfo and GL_NVX_gpu_memory_info
|
||
- r300: move r300_query_memory_info to r300_screen.c
|
||
|
||
Caio Oliveira (93):
|
||
|
||
- brw: Fix cmat conversion between bfloat16 and non-float32
|
||
- brw: Move insert/remove code to the block
|
||
- brw: Add more specific brw_builder helpers
|
||
- brw: Use a more specific builder helper in combine constants
|
||
- brw: Use a builder to track position in lower_simd
|
||
- brw: Make brw_builder() shader constructor use CFG if available
|
||
- intel/decoder/tests: Sort gentest.xml file
|
||
- intel/genxml: Add support for dword/bits in fields to gen_sort_tags.py script
|
||
- intel/genxml: Add support for dword/bits in fields to rest of the code
|
||
- intel/genxml: Convert field format from start/end to dword/bits
|
||
- intel/genxml: Remove support for start/end atttributes
|
||
- spirv: Load block descriptors as soon as we hit them
|
||
- spirv: Implement SPV_KHR_untyped_pointers
|
||
- brw: Use ralloc helpers for string handling in brw_eu_validate
|
||
- brw: Remove extra iteration on instructions from brw_opt_address_reg_load
|
||
- spirv: Update headers and metadata from latest Khronos commit
|
||
- vulkan: Update enum_to_str conversion to handle ARM enum names
|
||
- vulkan: Update headers/xml for 1.4.325
|
||
- anv: Advertise VK_KHR_shader_untyped_pointers
|
||
- brw: Define order for fixes in 3-src operand fix
|
||
- brw: Make sure copied instruction don't copy the list pointers
|
||
- brw: Move resize_sources() earlier when lowering FIND_LIVE_CHANNELS
|
||
- brw: Only access valid sources in lower_btd_logical_send()
|
||
- brw: If the instruction is already a SEND, no need to resize sources
|
||
- brw: Avoid invalid access when compacting out-of-bounds JIP/UIP
|
||
- brw: Add disabled test for MAD constant folding
|
||
- brw: Fix folding case for MAD instruction with all immediates
|
||
- brw: Fix checking sources of wrong instruction in opt_address_reg_load
|
||
- brw: Add brw_shader_params
|
||
- brw: Pass per_primitive_offset in brw_shader_params
|
||
- anv: Allocate prog_data->param array when making internal kernels
|
||
- intel/brw: Remove brw_shader::import_uniforms()
|
||
- intel/brw: Simplify tracking of dispatch_width_limit in brw_compile_fs
|
||
- intel/brw: Simplify variant tracking in brw_compile_fs
|
||
- intel/brw: Take shader in the brw_generator::generate_code() parameters
|
||
- brw: Run validation as soon as we have the CFG around
|
||
- brw: Fix printing of blocks in disassembly when BRW is available
|
||
- util: Avoid invalid access in ralloc_print_info()
|
||
- brw: Add \`FILE \*\` parameter to dump_assembly
|
||
- brw: Add and use more brw_validate.cpp macros
|
||
- brw: Use uint16_t for size_written
|
||
- brw: Centralize brw_inst allocation
|
||
- brw: Allocate brw_inst::src with ralloc
|
||
- brw: Remove builtin sources from brw_inst
|
||
- brw: Bundle the allocation of brw_inst and its sources
|
||
- brw: Let the builder fill the sources of brw_inst
|
||
- brw: Allow emit instruction with only number of sources
|
||
- brw: Pass brw_shader in fold_instruction
|
||
- brw: Add and use brw_transform_inst()
|
||
- brw: Add brw_builder::SEND() helper
|
||
- brw: Add brw_builder::URB_READ and URB_WRITE helpers
|
||
- brw: Remove the extra function call when lowering samplers
|
||
- brw: Add initial support for different instruction kinds
|
||
- brw: Add brw_send_inst
|
||
- brw: Add brw_tex_inst
|
||
- brw: Add brw_mem_inst
|
||
- brw: Add brw_dpas_inst
|
||
- brw: Add brw_load_payload_inst
|
||
- brw: Add brw_urb_inst
|
||
- brw: Add brw_fb_write_inst
|
||
- brw: Add a generic LOGICAL instruction kind
|
||
- brw: Allocate only brw_inst for BASE instructions
|
||
- brw: Repack brw_inst fields
|
||
- brw: Don't use individual rallocs for each instruction
|
||
- brw: Fix encoding of 3-src dst in Xe2+
|
||
- egl: Set atexit() handler during initialization
|
||
- egl: Don't maintain a list of AtExit functions
|
||
- intel/mda: Add code to produce mesa debug archives
|
||
- brw: Use debug archive file with INTEL_DEBUG=mda
|
||
- brw: Include some NIR states in the debug archive
|
||
- brw: Also include the final disassembly in the debug archive
|
||
- anv: Refactor anv_shader_compile result handling
|
||
- anv: Create archive file when using INTEL_DEBUG=mda
|
||
- iris: Create archive file when using INTEL_DEBUG=mda
|
||
- intel/mda: Add tool to inspect mesa debug archives
|
||
- intel/mda: Add search/searchall commands
|
||
- intel/mda: Add -U and -Y diff options
|
||
- intel/mda: Handle non-contiguous object versions in mda.tar files
|
||
- intel/mda: Add pager support
|
||
- intel/mda: Add MDA_OUTPUT_DIR and MDA_PREFIX environment variable support
|
||
- intel/mda: If MDA_PREFIX=timestamp use the actual timestamp as a prefix
|
||
- intel/mda: Allow more toplevel directory names inside mda.tar files
|
||
- intel/mda: Use archive filename as directory name instead of hardcoded "mda/"
|
||
- intel/mda: Add MDA_FILTER to select which archives to generate
|
||
- brw: Identify if/break/endif special case before emission
|
||
- intel/executor: Destroy syncobjs after using them
|
||
- intel/executor: Expose extra command line arguments to script
|
||
- intel/executor: Drop check_ver and check_verx10 functions
|
||
- intel/executor: Expose a devinfo table
|
||
- intel/executor: Add script directory to \`package.path`
|
||
- intel/executor: Add DPAS examples for HF/F, UB/UD and BF/F
|
||
- intel/executor: Add a matrix multiplication example
|
||
- brw: Add variable for opcode in the brw_set_* high-level helpers
|
||
|
||
Calder Young (13):
|
||
|
||
- nir/builder: Add helper for building uvec8 immediates
|
||
- brw,anv: Reduce UBO robustness size alignment to 16 bytes
|
||
- isl: Add support for creating layered surfaces for video encode/decode
|
||
- anv: Add support for creating layered surfaces for video encode/decode
|
||
- anv: Add support for using layered surfaces in H.264 and H.265 video coding
|
||
- anv: Add support for using layered surfaces in AV1 video decoding
|
||
- anv: Add support for using layered surfaces in VP9 video decoding
|
||
- anv: Report disjoint images as unsupported for video usage
|
||
- anv: Update video test expectations for layered_dpb
|
||
- anv: Advertise only OUTPUT_COINCIDE_BIT for AV1 video decoding
|
||
- anv: Add support for AV1 film grain sythesis on Xe2+
|
||
- anv: Fix tiling for AV1 IntraBC surface on Gfx125+
|
||
- isl: Fix noncoherent framebuffer fetch when base_level != 0
|
||
|
||
Caleb Callaway (6):
|
||
|
||
- spirv: Fix RT raygen hit attribute validation error
|
||
- compiler: use PATH_MAX for SPIR-V capture filename
|
||
- compiler: BLAKE3 ID for SPIR-V capture
|
||
- compiler: auto-stage file ext for SPIR-V capture
|
||
- compiler: SPIR-V shader replacement
|
||
- compiler: document SPIR-V capture + replace
|
||
|
||
Caterina Shablia (17):
|
||
|
||
- vulkan/runtime: add vk_image_subresource_slice_count
|
||
- panvk/csf: change get_cs_deps to be add_cs_deps
|
||
- panvk: add a meta command for transitioning image layout
|
||
- panvk: call cmd_transition_image_layout for each image memory barrier
|
||
- panvk: do not zero AFBC when an image is being bound
|
||
- panvk/csf: plop the stage and access masks into panvk_sync_scope
|
||
- panvk: adjust formatting in csf/panvk_queue.h
|
||
- pan/kmod,panvk: use uint64_t and not size_t for device sizes
|
||
- pan/kmod: introduce pan_kmod_vm::pgsize_bitmap
|
||
- panvk: introduce panvk_get_gpu_page_size
|
||
- pan/kmod,panvk: rewrite how alignment for an allocation is chosen
|
||
- panvk: add blackhole bo
|
||
- panvk: add PANVK_DEBUG=force_blackhole
|
||
- panvk: implement sparse resources
|
||
- panvk: add bind queue
|
||
- panvk: report support for sparse{Binding,ResidencyBuffer}
|
||
- docs/features: add sparse{Binding,ResidencyBuffer} on panvk/v10+
|
||
|
||
Chan, Roy (2):
|
||
|
||
- amd/vpelib: fix memory corruption
|
||
- amd/vpelib: check stream_count as well before accessing streams
|
||
|
||
Chang, Tomson (2):
|
||
|
||
- amd/vpelib: Add missing swizzle and dcc info
|
||
- amd/vpelib: Update register header and definitions macros
|
||
|
||
Charles Giessen (1):
|
||
|
||
- docs: Use correct ICD path in install.rst
|
||
|
||
Chia-I Wu (2):
|
||
|
||
- panvk: require gpu_can_query_timestamp for calibrated timestamps
|
||
- panvk: use common calibrated timestamp support
|
||
|
||
Christian Gmeiner (63):
|
||
|
||
- v3dv: Make use of hash table helpers
|
||
- freedreno/rddecompiler: Make use of hash table helpers
|
||
- etnaviv: Update headers from rnndb
|
||
- etnaviv: Handle 64-bit pixel formats in texture sampler TS setup
|
||
- etnaviv: Fix vertex format normalization for signed integer formats
|
||
- etnaviv: Fix negative LOD value encoding in texture descriptors
|
||
- etnaviv: Emulate rasterizer_discard
|
||
- etnaviv: hwdb: Add MSAA_FRAGMENT_OPERATION feature
|
||
- etnaviv: Only emit VIVS_PS_MSAA_CONFIG if GPU support it
|
||
- etnaviv: Update headers from rnndb
|
||
- etnaviv: Emit alpha-to-coverage dither
|
||
- etnaviv: Add support for alpha_to_coverage
|
||
- etnaviv: blt: Add r8_unorm format support
|
||
- etnaviv: blt: Add r8g8_unorm format support
|
||
- etnaviv: blt: Clear only requested color buffers
|
||
- etnaviv: rs: Clear only requested color buffers
|
||
- etnaviv: Optimize sampler view iteration with u_foreach_bit(..)
|
||
- etnaviv: blt: Extend translate_blt_format(..)
|
||
- etnaviv: blt: Add hardware based mipmap generation
|
||
- etnaviv: Enable texture_multisample for deqp testing
|
||
- etnaviv: isa: Add tg4 instruction
|
||
- etnaviv: nir: Add nir_texop_tg4 offset lowering
|
||
- etnaviv: Add support for ARB_texture_gather
|
||
- etnaviv: Do not update derived states during non-draw force flush
|
||
- etnaviv: re-format using clang-format
|
||
- etnaviv: Replace unsupported blit debug message with detailed dump and assertion
|
||
- r300: re-format using clang-format
|
||
- radv: re-format using clang-format
|
||
- nak: Move dataflow to compiler crate
|
||
- etnaviv: hwdb: Add S8 feature
|
||
- etnaviv: Update headers from rnndb
|
||
- etnaviv: rs: Support 8bpp for clears
|
||
- etnaviv: Support PIPE_FORMAT_S8_UINT stencil format
|
||
- imagination: Re-format using clang-format
|
||
- clang-format: Add src/imagination to .clang-format-include
|
||
- nir/opt_algebraic: optimize f2i32(fround_even(x)) to f2i32_rtne(x)
|
||
- etnaviv: blt: Enable scissored clear
|
||
- etnaviv: Update headers from rnndb
|
||
- etnaviv: hwdb: Add HWTFB cap
|
||
- etnaviv: Support hw based rasterizer_discard
|
||
- etnaviv: Pass context to acc sample provider supports(..) function
|
||
- etnaviv: Support PIPE_QUERY_PRIMITIVES_EMITTED
|
||
- etnaviv: Implement stream output target management
|
||
- etnaviv: Implement hardware based streamout support
|
||
- etnaviv: Fix util_blitter_save_so_targets(..) call
|
||
- docs/features: Mark GL_EXT_transform_feedback as done for etnaviv/HWTFB
|
||
- etnaviv: Update headers from rnndb
|
||
- etnaviv: Support ARB_stencil_texturing
|
||
- etnaviv: Expose faked xfb support when DEQP debug flag is enabled
|
||
- pvr, pco: Set has_f2i32_rtne to true
|
||
- etnaviv/ci: Add per-gpu GLES2 extension lists
|
||
- etnaviv: Allow 128-bit formats when DEQP debug flag is enabled
|
||
- etnaviv: Add 128bit emulated formats
|
||
- etnaviv: Add 128 bit format helper
|
||
- etnaviv: Add 128-bit format tilling
|
||
- etnaviv: Support 128 bit formats transfers
|
||
- etnaviv: 128 bit format needs to be CPU tiled
|
||
- etnaviv: Do not use TS for emulated 128 bit formats
|
||
- etnaviv: Implement 128-bit format emulation using dual 64-bit layout
|
||
- etnaviv: blt: Support 128 bit clear operations
|
||
- etnaviv: blt: Support 128 bit blit operations
|
||
- anv: Fix needs_temp_copy() incorrectly matching depth/stencil formats
|
||
- meson: require sysprof-capture-4 >= 4.49.0
|
||
|
||
Christian Meissl (1):
|
||
|
||
- panfrost: take reference from pool used for allocation
|
||
|
||
Christoph Neuhauser (3):
|
||
|
||
- egl: Fix DRI utility function compilation on macOS
|
||
- iris: Increase max_shader_buffer_size to max_buffer_size
|
||
- egl: Fix invalid device UUID returned by EGL_EXT_device_persistent_id
|
||
|
||
Christoph Pillmayer (25):
|
||
|
||
- panvk: hide utrace behind more generic interface
|
||
- panvk: Make panvk_utrace_record_ts wait mask configurable
|
||
- panvk: Make ts in panvk_instr_begin_work synchronous
|
||
- panvk: Make most end work instrumentation synchronous
|
||
- panvk: Support VK_DESCRIPTOR_TYPE_MUTABLE_EXT on v9+
|
||
- panvk: Support DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT
|
||
- panvk: Advertise VK_EXT_mutable_descriptor_type on v9+
|
||
- vk/sync: Pass dependencyFlags in vk_common_CmdPipelineBarrier
|
||
- panvk: Fix preserved metadata in lower_input_attachment_load
|
||
- panvk/utrace: Alloc utrace copy buf from userspace heap
|
||
- panvk/utrace: Remove dynamic alloc from utrace clone builder
|
||
- panvk/perfetto: Handle re-submittable command buffers
|
||
- panvk/perfetto: Drop zero duration events
|
||
- panvk: Add support for moving constants to the FAU
|
||
- pan/bi: Move some constants into FAU entries
|
||
- pan/va: Pull out constant swizzle handling
|
||
- pan/bi: Prioritize consts moved to the FAU
|
||
- nir/opt_algebraic: Convert a + b + a to b + 2a
|
||
- pan: Add gpu variant to compile inputs
|
||
- panfrost: Wire up gpu_variant to pan_compile_inputs
|
||
- panvk: Wire up gpu_variant to pan_compile_inputs
|
||
- pan/clc: Wire up gpu_variant to pan_compile_inputs
|
||
- pan: Lift pan_get_model into its own lib
|
||
- pan/bi: Normalize with pan_model.rates
|
||
- pan/va: Remove redundant MOVs from va_lower_split_64bit
|
||
|
||
Collabora's Gfx CI Team (11):
|
||
|
||
- Uprev ANGLE to 6a04a50f98cac71b25464d10289ce7a013841caf
|
||
- Uprev Piglit to 0980079dcfb5adbad873d88e00181268f55cb8ef
|
||
- Uprev Piglit to c3a3e29d59e0972650a6d30d20de930c87739c14
|
||
- Uprev ANGLE to 995c4c4d89ed6a5c28b210e9c0f83eb4f8b6e2f5
|
||
- Uprev Piglit to 28d1349844eacda869f0f82f551bcd4ac0c4edfe
|
||
- Uprev ANGLE to 1df3b59f8730b56b4770595d4d69f36d5283333f
|
||
- Uprev Piglit to 517270ccca11a795d2f29bd723c362eb6ef9ce8f
|
||
- Uprev Piglit to a70c33045c59310f972dbbdb33f322eb209971bc
|
||
- Uprev ANGLE to 538129c6b3c17dc864101c7a4af4b74b00706f82
|
||
- Uprev ANGLE to 8ed16003f27125f27cbb87578368e447043420d3
|
||
- Uprev Piglit to 4147e9d7aeb8ba26ffc25a90fc237588bcb3bb11
|
||
|
||
Connor Abbott (62):
|
||
|
||
- tu: Don't keep track of acceleration structure sizes
|
||
- freedreno: Add bin scaling registers
|
||
- freedreno: Document GRAS_SC_BIN_CNTL::FORCE_LRZ_DIS
|
||
- freedreno: Add HW bin scaling feature
|
||
- tu: Add documentation for VK_EXT_fragment_density_map
|
||
- tu: Use GRAS bin offset registers
|
||
- tu: Enable LRZ with FDM
|
||
- ir3: Simplify and rationalize shading rate LUT
|
||
- freedreno: Add common VRS helpers
|
||
- ir3: Use common shading rate lookup table
|
||
- tu, freedreno: Document GRAS shading rate LUT
|
||
- vulkan/queue: Fix VkTimelineSemaphoreSubmitInfo sanitization
|
||
- tu: Refactor BO deletion
|
||
- freedreno/drm: Import new UABI for VM_BIND
|
||
- tu: Align BO size to page size
|
||
- tu: Fix CmdBindTransformFeedbackBuffersEXT size handling
|
||
- tu/drm: Enable VM_BIND
|
||
- tu/knl: Add an API for sparse binding
|
||
- tu/drm: Add support for sparse binding
|
||
- tu/kgsl: Add support for sparse binding
|
||
- tu: Initial support for sparse binding
|
||
- tu: Support sparseResidencyAliased
|
||
- freedreno/ci: Add sparse-related a618 skips
|
||
- freedreno/ci: Skip dEQP-VK.memory.mapping.*.full.variable.*
|
||
- freedreno/ci: Update kernel with VM_BIND fixes
|
||
- freedreno/ci: Update a750 expectations
|
||
- zink: Make sparse always wait on pending gfx commands
|
||
- tu: Don't decrement implicit_sync_bo_count with VM_BIND
|
||
- freedreno/fdl: Expose fdl6_is_r8g8_layout() publicly
|
||
- freedreno/fdl: Refactor and expose bank swizzling logic
|
||
- freedreno/fdl: Handle cpp=32 and cpp=64 when getting macrotile size
|
||
- freedreno/fdl: Handle layout differences for r8g8 images
|
||
- freedreno/fdl: Add sparse layout support
|
||
- tu: Support sparse residency for images
|
||
- ir3: Assemble and disassemble rck modifier
|
||
- ir3: Implement sparse residency check
|
||
- tu: Expose shaderResourceResidency
|
||
- ir3: Assemble and disassemble .clp modifier
|
||
- ir3: Support min_lod tex source
|
||
- tu: Advertise shaderResourceMinLod
|
||
- freedreno/ci: Add a750 sparse skips
|
||
- tu: Lower ViewIndex to 0 when multiview is disabled
|
||
- freedreno: Add blit_wfi_quirk and use in turnip
|
||
- tu/drm: Split out iova allocation and BO allocation
|
||
- tu: Add support for a "lazy" sparse VMA
|
||
- tu: Make tu_image point to tu_device_memory instead of tu_bo
|
||
- tu: Implement transient attachments and lazily allocated memory
|
||
- freedreno: Don't program non-context reg with CRB
|
||
- tu: Fix 3d load and clear when FDM bin offsets are in use
|
||
- tu/fdm: Use better bounds for LRZ overallocation with FDM offset
|
||
- tu: Expose VK_EXT_dynamic_rendering_unused_attachments
|
||
- tu: Reset \*_BIN_FOVEAT when not using FDM
|
||
- freedreno: Don't stomp VSC registers
|
||
- tu: Pass tu_queue to kernel create/destroy functions
|
||
- tu/drm: Emulate combined gfx/sparse queues
|
||
- tu: Support sparse binds on the gfx queue
|
||
- tu: Fix RT count with remapped color attachments
|
||
- tu: Don't patch GMEM for input attachments never in GMEM
|
||
- tu: Fix 3d load path with D24S8 on a7xx
|
||
- tu: Also disable stencil load for attachments not in GMEM
|
||
- tu: Rename tu_render_pass_attachment::clear_views to used_views
|
||
- tu: Fix attachment stores with subpasses with partial views
|
||
|
||
Corentin Noël (8):
|
||
|
||
- virgl: Stop using deprecated util_framebuffer_init
|
||
- ci/piglit: Allow traces content-type to be binary/octet-stream
|
||
- docs/features: Add missing llvmpipe extensions
|
||
- docs/features: Add missing virgl extensions
|
||
- tgsi: Drop TGSI_SEMANTIC_TESS_DEFAULT_OUTER/INNER_LEVEL
|
||
- tgsi: Remove return type from tgsi_instruction_texture
|
||
- android: Only include libdrm_intel for i915 as iris do not depend on it
|
||
- virgl: Skip resource destruction only when there are actually needed references
|
||
|
||
Daivik Bhatia (7):
|
||
|
||
- v3d: remove unused functions from v3d_bufmgr.h
|
||
- v3d: use Texture Data Formats enum in Texture Shader State struct
|
||
- v3d: move format helpers to v3dx_format_table.h
|
||
- v3d: replace raw integers with enum types in helper functions
|
||
- broadcom/common: Optimize CSD super-group packing
|
||
- broadcom/common: Add subgroup support to CSD super-group packing
|
||
- broadcom/compiler: support arithmetic subgroup operations
|
||
|
||
Dallas Strouse (1):
|
||
|
||
- rusticl/device: skip loading devices in cfg(test)
|
||
|
||
Daniel Almeida (2):
|
||
|
||
- nouveau/headers: Import the video class headers from NVIDIA
|
||
- nouveau: Handle video decode in nv_push_print()
|
||
|
||
Daniel Schürmann (74):
|
||
|
||
- util/time: add os_time_nanosleep_until() function
|
||
- vulkan: implement VK_AMD_anti_lag as implicit vulkan layer
|
||
- aco/tests: Fix p_startpgm definitions to registers
|
||
- aco/ra: generalize register affinities
|
||
- aco/ra: collect register affinities for all precolored operands.
|
||
- aco/ra: don't optimize encodings on precolor affinity mismatch
|
||
- aco/ra: propagate precolor affinities through phis
|
||
- aco/ra: propagate precolor affinities through parallelcopies and tied definitions
|
||
- aco/scheduler: improve scheduling heuristic
|
||
- nir/opt_load_store_vectorize: only attempt to vectorize shared2 after exhausting other possibilities
|
||
- nir/opt_load_store_vectorize: don't vectorize large shared2_amd loads
|
||
- radv: only vectorize shared2 instructions during late optimizations
|
||
- aco/isel: allow for large 8-bit vectors in extract_8_16_bit_sgpr_element()
|
||
- ac/nir: use HW-requirements on alignment for vectorizing LDS
|
||
- ac/nir_lower_mem_access_bit_sizes: Split unsupported shared memory instructions
|
||
- aco/isel: rename emit_readfirstlane() -> emit_vector_as_uniform()
|
||
- aco/isel: refactor load_shared() by directly matching NIR intrinsics to ACO opcodes
|
||
- radv: unconditionally call ac_nir_lower_mem_access_bit_sizes()
|
||
- aco/isel: refactor store_shared() by directly matching NIR intrinsics to ACO opcodes
|
||
- aco/scheduler: check dependencies of entire clause upfront
|
||
- aco/scheduler: Stop downwards scheduling after encountering the first clause
|
||
- aco/scheduler: split downwards_move_clause() from downwards_move()
|
||
- aco/scheduler: remove DownwardsCursor::insert_demand_clause
|
||
- aco/scheduler: remove DownwardsCursor::clause_demand
|
||
- aco/scheduler: short-cut downwards_move_clause() when no movement is done
|
||
- aco/scheduler: ignore potential SMEM stalls when forming clauses
|
||
- aco/scheduler: move clauses as batch
|
||
- aco/scheduler: schedule VMEM store clauses during the regular forward pass
|
||
- aco/scheduler: small refactor of schedule_VMEM()
|
||
- aco/ra: don't clear lateKill operands in get_reg_create_vector()
|
||
- aco/ra: add vector_info::index to indicate the Operand's index into the vector
|
||
- aco/ra: don't set precolor affinities for already assigned temporaries
|
||
- aco/ra: consider precolor affinities in get_reg_vector()
|
||
- aco/ra: coalesce vector affinities with tied definitions
|
||
- radv/rt: use ACCESS_CAN_REORDER when loading SBT entries
|
||
- nir/algebraic: add pattern for (a << #b) * #c => a * (#c << #b)
|
||
- nir/load_store_vectorize: also parse offsets through u2u64 if additions don't wrap around
|
||
- nir/load_store_vectorize: hoist base addr instead of subtracting
|
||
- nir/opt_offsets: allow for unsigned wraps when folding load/store_shared2_amd offsets
|
||
- radv: allow for unsigned wraps for shared memory intrinsics in nir_opt_offsets
|
||
- radeonsi: allow for unsigned wraps for shared memory intrinsics in nir_opt_offsets
|
||
- aco/optimizer: remove DS offset optimization
|
||
- aco: remove excess offset handling for load/store_shared
|
||
- amd: don't allow unsigned wraps for shared memory offsets on GFX6
|
||
- nir/opt_offsets: call allow_offset_wrap() for try_fold_shared2()
|
||
- nir/load_store_vectorize: Fix parsing offsets through u2u64
|
||
- radv: delay lowering global access
|
||
- radv: delay lowering int64
|
||
- nir/divergence_analysis: check ACCESS_SMEM_AMD
|
||
- ac/nir_lower_global_access: require no_unsigned wrap when extracting from 32-bit additions
|
||
- ac/nir_lower_global_access: don't assume pack_64_2x32 is the same as u2u64
|
||
- radv: delay nir_opt_shrink_vectors
|
||
- radeonsi: delay nir_lower_global_access
|
||
- radv,radeonsi: call ac_nir_lower_global_access and nir_lower_int64 for gs copy shaders
|
||
- ac/nir: switch load_smem_amd to use load_global
|
||
- nir/divergence: don't assume that load_sample_positions_amd is always uniform
|
||
- radv: use load_global instead of load_global_amd for load_sample_positions_amd
|
||
- amd/lower_mem_access_bit_sizes: lower all SMEM instructions to supported sizes
|
||
- amd/lower_mem_access_bit_sizes: also use SMEM for subdword loads
|
||
- amd/common: merge radv_nir_opt_access_speculate() into ac_nir_flag_smem_for_loads()
|
||
- radv: delay ac_nir_lower_mem_access_bit_sizes
|
||
- ac/nir_flag_smem_for_loads: call divergence analysis internally
|
||
- radv/rt: fix LDS size calculation with LLVM for inlined stages
|
||
- radv: fix max_waves calculation for tesselation
|
||
- radv: use lds_alloc_granularity alignment for stats
|
||
- amd: change ac_shader_config::lds_size to bytes
|
||
- radv: calculate LDS allocation requirements independently from the compiler
|
||
- radeonsi: pass calculated LDS size to ACO
|
||
- amd: add and use utility functions for LDS size encoding
|
||
- amd/common: remove radeon_info::lds_alloc_granularity and radeon_info::lds_encode_granularity
|
||
- aco: remove DeviceInfo::lds_encoding_granule and DeviceInfo::lds_alloc_granule
|
||
- amd: keep ac_shader_config::lds_size unaligned
|
||
- amd: change radeon_info::lds_size_per_workgroup for GFX10+ to 64KB
|
||
- radv/null_device: set more options which affect compilation
|
||
|
||
Daniel Stone (2):
|
||
|
||
- ci/panfrost: Add wider EGL/multithread flakes
|
||
- ci/freedreno: Skip overly-slow trace
|
||
|
||
Danylo Piliaiev (30):
|
||
|
||
- tu: Use safe-const binning VS when safe-const full VS is used
|
||
- util/u_trace: Add scripts for perf analysis based on u_trace results
|
||
- tu: Fix nullptr dereference in cmd_buffer tracepoint
|
||
- util: Add function os_get_option_secure
|
||
- util/disk_cache: Use os independent functions instead of getenv
|
||
- util/disk_cache: Fallback to ftruncate if posix_fallocate not supported
|
||
- util/disk_cache: Allow disk cache on Android if explicitly enabled
|
||
- tu: Fix unaligned image_to_buffer on close to (1 << 14) width
|
||
- tu/a6xx: Fix unaligned buffer_to_image on close to (1 << 14) width
|
||
- ir3: Add EOLM and EOGM a7xx flags to NOP
|
||
- tu: Use approx square tiles when FDM is enabled
|
||
- freedreno/a750: Fix typo in recent magic regs change
|
||
- tu: Fix the lack of IB size sanitization in several cases in tu_cs
|
||
- tu/a7xx: Don't disable LRZ for empty FS when FDM is used
|
||
- tu: Reset rp_trace on tu_reset_cmd_buffer
|
||
- tu: Prevent dangling start_sysmem_clear_all tracepoint
|
||
- egl: Bring back util_cpu_trace_init
|
||
- tu: Reset BIN_FOVEAT regs for tiling with and without HW binning
|
||
- freedreno/decode: Fix preamble decoding
|
||
- tu/a7xx: Update reg stomping info to fix GPU crashes when stomping
|
||
- tu: Destroy all mutexes used for device
|
||
- tu/perfetto: Don't check sync_gpu_ts when emitting renderstage
|
||
- tu/perfetto: Track GPU timestamps per-device
|
||
- tu/perfetto: Make GPU clock sequence-scoped
|
||
- tu/perfetto: Init perfetto datasources once
|
||
- tu/perfetto: Use a separate track for VK_EXT_debug_utils labels
|
||
- tu: Prevent GPU hang with occlusion query + certain depth state
|
||
- tu: Synchronize access to copy_timestamp_cs_pool
|
||
- vulkan: Always fill DS state for EXT_dynamic_rendering_unused_attachments
|
||
- tu: Use cmd->rp_trace u_trace for draw calls
|
||
|
||
Dave Airlie (11):
|
||
|
||
- nak: disable imma 8x8x16 on Blackwell+
|
||
- nvk: add sm120 latencies via csv files.
|
||
- spirv: move cmat store barrier after the store.
|
||
- nouveau: Handle subchannels better in nv_push_print()
|
||
- nir: add coop mat flexible dimensions lowering.
|
||
- radv: add support for coopmat2 flexible dimensions
|
||
- radv: consolidate cooperative matrix array sizes enumeration
|
||
- nir: add nir_intrinsic_cmat_load_shared_nv
|
||
- gallivm: handle u8/u16 const loads properly on big-endian.
|
||
- nir/coopmat: fix non square load/store lowering for flexible dimensions
|
||
- c11/threads: fix build on c23
|
||
|
||
David Rosca (129):
|
||
|
||
- radeonsi/vcn: Correctly handle tile swizzle
|
||
- radv/video: Fix encode when using layered source image
|
||
- ac/surface: Add ac_modifier_supports_video
|
||
- radeonsi/video: Use ac_modifier_supports_video
|
||
- radv/video: Support DRM format modifier tiling
|
||
- radeonsi/uvd: Set H264 gaps_in_frame_num_value_allowed_flag
|
||
- radv/video: Don't allow DRM format modifier tiling on GFX < 9
|
||
- radv/ci: Add dEQP-VK.video.formats.* fails for navi10 and vega10
|
||
- radv/video: Add bit depth and profile check for AV1 encode
|
||
- radv/video: Add bit depth and profile check for VP9 decode
|
||
- radv/video: Set encodeInputPictureGranularity for AV1 encode
|
||
- radv/video: Add radv_video_is_profile_supported
|
||
- radv/video: Rework GetPhysicalDeviceVideoFormatPropertiesKHR
|
||
- radv/video: Remove 10 to 8bit dithering support
|
||
- radv: Reject linear modifier for video decode DPB
|
||
- radv/ci: Update navi10 and vega10 expected failures
|
||
- radv/video: Remove disabled slice header code for field encoding
|
||
- radv/video: Set H264 encode cabac_init_idc and Cb/Cr QP offsets
|
||
- radv/video: Always send the latency command
|
||
- radv/video: Send slice control, spec misc and deblocking params every frame
|
||
- radv/video: Add more encode session params overrides
|
||
- radv/video: Fix encode bitstream buffer offset and alignment
|
||
- radv/video: Fix setting H265 encode cu_qp_delta on VCN2
|
||
- radv/video: Fix session_init and rc_per_pic on VCN2
|
||
- radv/video: Disable rate control modes for H265 encode on VCN1
|
||
- radv/video: Use the new defines for H264 SPS info flags
|
||
- frontends/va: Add H264 encode more_rbsp_data PPS flag
|
||
- radeonsi/vcn: Use more_rbsp_data flag for H264 PPS encode
|
||
- radeonsi: Add missing DEBUG_NAMED_VALUE_END to radeonsi_shader_debug_options
|
||
- radeonsi/vcn: Always enable decode tier2 when supported
|
||
- vulkan/video: Fix h265 level values
|
||
- radeonsi: Move multimedia debug options to its own flags
|
||
- radeonsi: Add debug option to disable tiling for video
|
||
- radeonsi: Add debug options to disable video decode/encode tiers
|
||
- wsi/display: Report supported formats based on plane formats
|
||
- wsi/display: Add RGBA16, RGBA16F and A2RGB10(SRGB) formats
|
||
- radv: Add timeout to video encode query
|
||
- radv/video: Don't init vp9 probs table in message buffer
|
||
- radv/video: Simplify vp9 q params
|
||
- radv/video: Remove unused enum
|
||
- ac/vcn_dec: Add RDECODE_IT_SCALING_TABLE_SIZE
|
||
- radv/video: Use more common defines
|
||
- radv: Fix alignment for linear video decode dst images
|
||
- rusticl/ptr: Fix hidden lifetime warning
|
||
- ac/vcn_dec: Add av1_intrabc_workaround
|
||
- radeonsi/vcn: Enable AV1 decode workaround for gfx1153
|
||
- radv/video: Enable AV1 decode workaround for gfx1153
|
||
- vulkan/video: Add intra refresh support
|
||
- radv/video: Add support for VK_KHR_video_encode_intra_refresh
|
||
- auxiliary/vl: Map X6R10/X6R10X6G10 formats to R16/R16G16
|
||
- radeonsi: Map X6R10/X6R10X6G10 formats to R16/R16G16
|
||
- frontends/va: Cleanup CreateContext
|
||
- frontends/va: Refactor vlVaVidEngineBlit
|
||
- frontends/va: Change vlVaPostProcCompositor to take pipe_vpp_desc arg
|
||
- frontends/va: Remove EFC support
|
||
- frontends/va: Add support for decode/encode processing
|
||
- radeonsi/vcn: Support EFC with encode processing
|
||
- radeonsi/vcn: Support VPE with decode processing
|
||
- radeonsi: Remove now unused si_vid_is_target_buffer_supported
|
||
- pipe: Remove now unused is_video_target_buffer_supported
|
||
- subprojects: Remove libdisplay-info wrap file
|
||
- radeonsi/vcn: Disable H264 encode 8x8 transform when CABAC is disabled
|
||
- radv/video: Disable H264 encode 8x8 transform when CABAC is disabled
|
||
- radeonsi/vcn: Disable H264/5 constrained intra pred with rate control
|
||
- radeonsi/vcn: Fix compatibility with old FW for encode
|
||
- radeonsi/vcn: Fix HEVC encode cu_qp_delta with old FW
|
||
- radeonsi/vcn: Fix HEVC encode transform_skip with old FW
|
||
- ci: Add missing rust subprojects to meson/build.sh
|
||
- radeonsi/vcn: Correctly set chroma location with EFC
|
||
- radv: Use extra context for video encode queue with multiple VCN instances
|
||
- radv/video: Fix VP9 loop filter and segmentation params
|
||
- util/format: Add RGB lowering for single plane YUV formats
|
||
- ac/vcn: Add RADEON_VCN_IB_COMMON_OP_RESOLVEINPUTPARAMLAYOUT
|
||
- radv/video: Set rate control to default on reset
|
||
- radv/video: Support quantization map on VCN5
|
||
- util/format: Add VK_EXT_ycbcr_2plane_444_formats formats
|
||
- vulkan/format: Map VK_EXT_ycbcr_2plane_444_formats to pipe format
|
||
- radv: Enable VK_EXT_ycbcr_2plane_444_formats
|
||
- ci: Stop building VDPAU driver
|
||
- mesa: Remove NV_vdpau_interop
|
||
- Remove VDPAU
|
||
- gallium/vl: Remove now unused filters
|
||
- radeonsi/video: Remove support for interlaced buffers
|
||
- pipe: Remove PIPE_VIDEO_CAP_PREFERS/SUPPORTS_INTERLACED
|
||
- radeonsi/vcn: Fix calculating QP map region dimensions
|
||
- radeonsi/vcn: Get rid of PIPE_ALIGN_IN_BLOCK_SIZE
|
||
- radv/video: Always use OBU_FRAME in AV1 encode
|
||
- radeonsi/uvd: Swap order of comparison to avoid warning
|
||
- r600: Remove mpeg12 shader decoder support
|
||
- r300: Remove mpeg12 shader decoder support
|
||
- nouveau: Remove mpeg12 shader decoder support
|
||
- gallium/vl: Remove mpeg12 shader decoder
|
||
- gallium/vl: Fix building vl_stubs
|
||
- r600: Implement resource_get_param
|
||
- d3d12: Implement resource_get_param
|
||
- frontends/va: Use resource_get_param instead of resource_get_info
|
||
- pipe: Remove resource_get_info
|
||
- radv: Change radv_vcn_write_event to a write memory func
|
||
- radv/video: Check FW version before using WRITE_MEMORY
|
||
- radv/video: Fix waiting on encode feedback query
|
||
- radeonsi/vpe: Fix transfer function mapping to vpelib
|
||
- frontends/va: Fix parsing VP9 frame header
|
||
- frontends/va: Add VP9 use_prev_frame_mvs and segmentation_update_data flags
|
||
- radeonsi/vcn: Use VP9 use_prev_frame_mvs and segmentation_update_data
|
||
- ac/gfx10_format_table: Use new names for 422 subsampled formats
|
||
- gallium/vl: Add new function to get RGB YUV conversion matrix
|
||
- frontends/va: Set color properties when not using explicit color standard
|
||
- frontends/va: Use new RGB YUV conversion matrix
|
||
- gallium/vl: Remove vl_csc_get_matrix
|
||
- frontends/va: Always advertise explicit color standard support
|
||
- radeonsi/vcn: Stop using vpp colors standard
|
||
- radeonsi/vpe: Stop using vpp colors standard
|
||
- frontends/va: Stop using vpp colors standard
|
||
- vl,frontends/va: Implement YUV->YUV matrix coeff conversion
|
||
- vl,frontends/va: Implement gamma and primaries conversion
|
||
- gallium/vl: Remove luma key support
|
||
- gallium/vl: Remove vl_compositor_set_csc_matrix
|
||
- pipe: Remove PIPE_VIDEO_CAP_VPP_SUPPORT_HDR_INPUT/OUTPUT
|
||
- pipe: Remove pipe_video_vpp_color_standard_type
|
||
- radeonsi/vcn: Support BT2020 matrix with EFC
|
||
- ac/surface: Limit video modifiers to 64K_S also for VCN 2.2
|
||
- radv/video: Introduce two levels of write_memory support
|
||
- radv/video: Only use write_memory for encode feedback with full support
|
||
- radeonsi/vcn: Fix AV1 bidir compound encode with order_hint disabled
|
||
- radv/video: Don't require encode FW version >= interface version
|
||
- radv/video: Fix AV1 bidir compound encode with order_hint disabled
|
||
- vulkan/video: Avoid NULL pointers in session parameters
|
||
- radv/video: Correctly handle no feedback query for encode
|
||
- radv/video: Add NULL checks for picture parameters
|
||
|
||
Deborah Brouwer (1):
|
||
|
||
- android: fall back to SwiftShader’s LLVM
|
||
|
||
Derek Foreman (2):
|
||
|
||
- dril: Skip some pipe formats to avoid breaking X
|
||
- zink: Don't use VK_PRESENT_MODE_IMMEDIATE_KHR on wayland
|
||
|
||
Dhruv Mark Collins (1):
|
||
|
||
- tu/util: Allow setting all TU_DEBUG options from envvar and file
|
||
|
||
Dmitry Baryshkov (2):
|
||
|
||
- glx: provide glx.pc
|
||
- ci: drop google-freedreno remnants
|
||
|
||
Dmitry Osipenko (1):
|
||
|
||
- virtio/vdrm: Fix varying offsets of struct vdrm_device members
|
||
|
||
Dylan Baker (31):
|
||
|
||
- meson: set the \`legacy-x11` option as deprecated
|
||
- anv: avoid potential integer overflow in video address calculation
|
||
- intel/brw: Fix implementaiton of \|= operator for enum
|
||
- isl: prevent potential overflow before widen
|
||
- blorp: Fix potential read of uninitaized elk fields in debug paths
|
||
- anv: add assertion that tes and tcs data is non-null
|
||
- anv: remove dead code
|
||
- mailmap: Update for Dylan Baker
|
||
- calendar: Update release dates and change 25.3 to Dylan
|
||
- meson: use the wayland module
|
||
- anv: don't attempt to memcpy if allocation fails
|
||
- iris: Fix potential null deref in debug archiver
|
||
- VERSION: bump for 25.3.0-rc1
|
||
- .pick_status.json: Update to 3b2f7ed918a5ad78c1d3756e9823a1616c1f21d7
|
||
- .pick_status.json: Update to ad421cdf2e68a1ccef80cb810c012c8469579cb6
|
||
- .pick_status.json: Mark c20e2733bf8f9bb595f1bcc68ebb3d0686ef28e4 as denominated
|
||
- .pick_status.json: Update to 28fbc6addbda2ce3e264b41b6ad91a7a0d8eb788
|
||
- .pick_status.json: Update to e38491eb1850ab8b0082716b00f514f75e2a0e1a
|
||
- VERSION: bump for rc2
|
||
- .pick_status.json: Update to fd55e874ed09a04447ebd4dae25c98df2621ef7d
|
||
- .pick_status.json: Update to 45a762727cf8708392b6de38616909543c799923
|
||
- intel/compiler/brw: Add assert that we don't have a negative value
|
||
- .pick_status.json: Update to 32b646c5976f64152a004d4c83962ca14c46154f
|
||
- VERSION: bump for rc3
|
||
- .pick_status.json: Update to 33342848451ca06deb054fad94de3cea3a9efe63
|
||
- .pick_status.json: Update to e44a776f4751d665efc447d8fe8e6c01d25a60c5
|
||
- .pick_status.json: Update to 27d9e4ec2a13a957f416a234a93bf2f0c2c9c56c
|
||
- VERSION: bump for 25.3.0-rc4
|
||
- .pick_status.json: Update to 04a0d512fa68a48bc2a2632a0a4ff2c3ac10c6ca
|
||
- .pick_status.json: Update to 294e72e2b517bc744f909fbce9e154efa698dd10
|
||
- .pick_status.json: Update to 8f13905c5e38ac3921c4804b19fc0f50531b0317
|
||
|
||
Ella Stanforth (22):
|
||
|
||
- util/list: Fix next instruction removal usecase for non safe iterators
|
||
- util/list: Add iterator debug to more routines.
|
||
- util/tests: Add list iterator tests
|
||
- pvr: Use demote
|
||
- nir: assert when we do not have a sample count when not using intrinsic
|
||
- pco: Switch to common alpha_to_coverage intrinsic
|
||
- pco: Switch to common alpha to coverage lowering
|
||
- pco: Cleanup meson.build files
|
||
- pco: Switch back to util/list
|
||
- v3d: rename msaa resolve
|
||
- v3d: Always lower frag color
|
||
- v3d: Fallback to software blend support for formats that do not support blend.
|
||
- v3d/compiler: Add unpacking instructions for normalised 16bit formats.
|
||
- v3d/compiler: Lower load_output after logic operations
|
||
- nir: add v3d specific intrinsic normalised to float conversion
|
||
- v3d/compiler: implement normalised to float conversions
|
||
- v3d/compiler: Implement 16bit normalised render targets.
|
||
- v3d: Add support for 16bit normalised formats
|
||
- v3dv: Take format plane when packing hw clear color
|
||
- v3dv: Add normalisation flags to the format table
|
||
- v3dv: Add support for 16bit normalised formats
|
||
- pvr: implement buffer device address
|
||
|
||
Emma Anholt (49):
|
||
|
||
- wsi/display: Add some comments about what's going on in the code.
|
||
- wsi/display: Add error messages to some shouldn't-be-hit paths.
|
||
- wsi/display: Pull DRM format translation up a level.
|
||
- wsi/display: Do connector setup before swapchain init.
|
||
- ir3: Rename per_samp to sample_shading.
|
||
- tu: Rename per_samp to sample_shading to match ir3.
|
||
- freedreno: Drop min_samples handling code.
|
||
- tu: Implement sampleShadingEnable by flagging uses_sample_shading.
|
||
- nir: Move ST's force-persample-shading NIR pass to shared code.
|
||
- nir/lower_sample_shading: Set the sample qualifier on in vars.
|
||
- zink: Lower sample shading before we add_derefs().
|
||
- ci/radeonsi: Add a flake on mendocino that appeared yesterday.
|
||
- nir,agx: Move AGX's loop (generalized) to shared NIR code.
|
||
- tu: Use nir_opt_reassociate.
|
||
- ci/tu: Generalize the subgroupclustered pre-merge skips.
|
||
- ci/tu: Do more generalization of the tess flakes.
|
||
- i915: Avoid calling drm_intel_get_aperture_sizes().
|
||
- Revert "tu: Use nir_opt_reassociate."
|
||
- vk/runtime: Set GPU_MULTI_WAIT on the drm syncobj type.
|
||
- tu: Use the common syncobj sync type for the layered timelines.
|
||
- tu: Fix the comment about DRM_CAP_SYNCOBJ_TIMELINE support.
|
||
- ci/tu: Generalize the FDM flakes and link an issue.
|
||
- ci/tu: Drop highp.scalar xfail.
|
||
- ci/tu: generalize the multisample_resolve tess/gs flakes.
|
||
- tu: Disable LRZ writes after most stencil-write operations.
|
||
- vulkan/wsi: Add comments about the WSI's syncing, and KHR_display stuff.
|
||
- vulkan/wsi: Add a test for kernel 6.0 sync file import/export ioctls.
|
||
- wsi/drm: Do the dma_buf_semaphore setup at swapchain creation time.
|
||
- wsi/drm: Don't request implicit sync if we're doing implicit sync ourselves.
|
||
- tu: Move the BO implicit sync flag handling to a BO allocation flag.
|
||
- ir3: Don't try to use indirect access in the alias table.
|
||
- util/u_queue: Fix data race on num_threads during finish.
|
||
- ir3: Enable nir_opt_shrink_stores.
|
||
- ir3: Enable nir_opt_shrink_shrink_vec_array_vars.
|
||
- ir3: Use a bitset for the defs-seen table.
|
||
- ir3: Use a linear allocation context for ir3_registers.
|
||
- ir3: Use a linear allocation context for ir3_instructions.
|
||
- d3d10umd: Add missing dependency on u_formats codegen.
|
||
- treewide: Make exported DRM FDs read-write.
|
||
- ir3: Avoid O(n^2) behavior in rpt validation.
|
||
- nir: Add a shader bisect tool.
|
||
- radv: Restore marking WSI image's mem->buffer as uncached.
|
||
- radv: Allocate BOs as implicit sync even if the WSI is doing implicit sync.
|
||
- ir3: Move the big block of C support code out of the parser .y file.
|
||
- ir3/parser: Make sure relative accesses have a size set.
|
||
- ir3: Use bitset range operations.
|
||
- wsi: Fix the flagging of dma_buf_sync_file for the amdgpu workaround.
|
||
- nir/shrink_stores: Don't shrink stores to an invalid num_components.
|
||
- v3dv: Fix assertion failure for not-found primary_fd during enumeration.
|
||
|
||
Eric Engestrom (247):
|
||
|
||
- VERSION: bump to 25.3
|
||
- docs: reset new_features.txt
|
||
- docs/releasing: add missing "track remote staging branch" command in instructions
|
||
- docs: update calendar for 25.2.0-rc1
|
||
- docs: update calendar for 25.1.6
|
||
- docs: add release notes for 25.1.6
|
||
- docs: add sha sum for 25.1.6
|
||
- gfxstream: move variables into the #ifdef that uses them
|
||
- docs/linkcheck: drop cgit exception as nothing links to it anymore
|
||
- docs/linkcheck: ignore sourceforge subdomains as well
|
||
- docs/linkcheck: ignore vulkan.org failures as it also blocks non-browsers
|
||
- freedreno/ci: disable defunct baremetal jobs
|
||
- wsi/display: setup the connector earlier
|
||
- wsi/display: also select a plane when selecting a crtc
|
||
- ci: fix rustfmt job rules
|
||
- radv/ci: lower timeouts for newly added gfx1201 jobs
|
||
- radv/ci: lower timeouts for vkd3d jobs
|
||
- ci: fix rustfmt job rules (one more case)
|
||
- radv/ci: sort navi21 flakes
|
||
- broadcom/ci: sort rpi4 flakes
|
||
- zink+radv/ci: sort cezanne flakes
|
||
- radeonsi/ci: document recent flakes
|
||
- radv/ci: document recent flakes
|
||
- broadcom/ci: document recent flakes
|
||
- zink+radv/ci: document recent flakes
|
||
- lavapipe/ci: document recent flakes
|
||
- docs: update calendar for 25.2.0-rc2
|
||
- ci/lava: fix heredoc-in-yaml syntax
|
||
- wsi/display: pass the image's DRM modifiers to the kernel
|
||
- wsi/display: pass the plane's modifiers to the image
|
||
- docs: update calendar for 25.2.0-rc3
|
||
- docs: update calendar for 25.1.7
|
||
- docs: add release notes for 25.1.7
|
||
- docs: add sha sum for 25.1.7
|
||
- ci-tron: set pipefail to show the correct error message when failing to download the install tarball
|
||
- ci-tron: drop unnecessary \`HWCI_TEST_SCRIPT: deqp-runner.sh` re-defines
|
||
- ci-tron: cleanup redundancy in artifacts exclude variable
|
||
- ci-tron: set SCRIPTS_DIR where its path is defined
|
||
- radv/ci: deduplicate \`DEQP_SUITE: radv-valve` in ci-tron jobs
|
||
- radv/ci: deduplicate GPU_VERSION in ci-tron jobs
|
||
- turnip/ci: drop redundant GPU_VERSION
|
||
- broadcom/ci: drop redundant \`script:` already set by .broadcom-test
|
||
- broadcom/ci: drop redundant HWCI_TEST_SCRIPT already set by .broadcom-test
|
||
- anv/ci: drop already included skip list
|
||
- iris/ci: drop already included skip list
|
||
- nouveau/ci: drop already included \*-skips.tx
|
||
- llvmpipe/ci: set DRIVER_NAME to not have to manually add llvmpipe-skips.txt in asan job variant
|
||
- ci/deqp-runner: fix path to install folder
|
||
- ci/prepare-artifacts: move git version dump out of static file copy block
|
||
- ci/prepare-artifacts: drop redundant copy
|
||
- ci/prepare-artifacts: turn file copies into a loop
|
||
- meson: fix VkLayer_MESA_device_select in the devenv
|
||
- meson: include VkLayer_MESA_screenshot in the devenv
|
||
- meson: include VkLayer_MESA_vram_report_limit in the devenv
|
||
- meson: include VkLayer_MESA_anti_lag in the devenv
|
||
- radv/ci: add missing GPU_VERSION for navi10 in kws farm
|
||
- ci: fix PYTHONPATH variable
|
||
- turnip/ci: document new vkd3d crash
|
||
- ci/vkd3d: fix "unexpected results" check
|
||
- ci: uprev vkd3d to fix some nvk tests
|
||
- ci: cleanup weston invocations
|
||
- llvmpipe/ci: use weston's Xwayland instead of broken Xvfb
|
||
- llvmpipe/ci: document two regressions
|
||
- llvmpipe/ci: document flakes seen during stress-testing
|
||
- ci: dedupe weston setup
|
||
- ci: document image tag to bump for rust build changes
|
||
- docs/llvmpipe: fix links to defunct drdobbs.com website
|
||
- docs/linkcheck: ignore crates.io links as it also blocks non-browsers
|
||
- zink+nvk/ci: fix flakes
|
||
- ci: drop unnecessary rename of \*.log into \*.log.txt
|
||
- freedreno/ci: run a618-gl job on xwayland instead of xorg
|
||
- intel/ci: run iris-{apl,glk,amly}-egl jobs on xwayland instead of xorg
|
||
- ci: drop xorg + weston workaround now that no user is left
|
||
- zink+nvk/ci: sort ad106 fails
|
||
- zink+nvk/ci: give piglit tests a display to use
|
||
- ci-tron: keep \*.qpa in job artifacts
|
||
- ci-tron: move vkcts shader cache out of $CI_PROJECT_DIR
|
||
- ci-tron: move vkd3d shader cache out of $CI_PROJECT_DIR
|
||
- ci: mark igalia farm as offline
|
||
- broadcom/ci: skip two more slow CL tests
|
||
- radv/ci: mark all of dEQP-VK.ray_tracing_pipeline.pipeline_library.configurations.* as flaky
|
||
- radeonsi/ci: document recent flakes
|
||
- radv/ci: document recent flakes
|
||
- broadcom/ci: document recent flakes
|
||
- zink+radv/ci: document recent flakes
|
||
- lavapipe/ci: document recent flakes
|
||
- docs: update calendar for 25.2.0
|
||
- docs: add release notes for 25.2.0
|
||
- docs: add sha sum for 25.2.0
|
||
- docs: add 25.2.x release dates
|
||
- Revert "ci: mark igalia farm as offline"
|
||
- radeonsi/ci: document fixes test
|
||
- r300/ci: document fixes tests and one regression in c64c6a0c...bf8ebb6a
|
||
- turnip/ci: document regression in 0a12ff6f...8fe0a347
|
||
- broadcom/ci: fix another slow & flaky CL test on rpi4
|
||
- radeonsi/ci: document recent flakes
|
||
- radv/ci: document recent flakes
|
||
- zink+radv/ci: document recent flakes
|
||
- llvmpipe/ci: document fixed test
|
||
- llvmpipe/ci: document recent flakes
|
||
- lavapipe/ci: document recent flakes
|
||
- ci: track changes to new src/x11/ folder
|
||
- ci: uprev vkd3d
|
||
- ci/init-stage2: drop no-op "copy python path into python path"
|
||
- ci: move setting python path for structured_logger.py to where it's actually used
|
||
- docs: update calendar for 25.1.8
|
||
- docs: add release notes for 25.1.8
|
||
- docs: add sha sum for 25.1.8
|
||
- freedreno/ci: consistently use x11- prefix for deqp-egl-x11
|
||
- iris/ci: consistently use x11- prefix for deqp-egl-x11
|
||
- llvmpipe/ci: consistently use x11- prefix for deqp-egl-x11
|
||
- softpipe/ci: document fixed tests
|
||
- ci: set DRIVER_NAME in jobs that are implicitly inheriting skip lists
|
||
- ci/deqp-runner: drop implicit skips of \`GALLIUM_DRIVER` or \`VK_DRIVER`
|
||
- ci/deqp-runner: simplify handling the various \*-skips.txt files
|
||
- ci/deqp-runner: add support for all the prefixes for \*-flakes.txt files
|
||
- ci/deqp-runner: remove duplicate values to avoiding read the same file multiple times
|
||
- ci/deqp-runner: add support for all the prefixes for \*-fails.txt files
|
||
- lavapipe/ci: drop asan fails that are already tracked as normal fails
|
||
- softpipe/ci: drop asan fails that are already tracked as normal fails
|
||
- zink+radv/ci: set DRIVER_NAME=zink-radv to allow using common expectation files
|
||
- zink+radv/ci: deduplicate zink-radv-\*-skips.txt lists
|
||
- zink+radv/ci: deduplicate zink-radv-\*-fails.txt files
|
||
- zink+radv/ci: fix typo in skips comment
|
||
- zink+radv/ci: add common fails for the next commits
|
||
- zink+radv/ci: give polaris10 piglit tests a display to use
|
||
- zink+radv/ci: give navi10 piglit tests a display to use
|
||
- zink+radv/ci: give navi31 piglit tests a display to use
|
||
- zink+radv/ci: give vangogh piglit tests a display to use
|
||
- zink+radv/ci: give gfx1201 piglit tests a display to use
|
||
- panfrost/meson: drop invalid C++ arg
|
||
- zink+turnip/ci: document regression in b22806705c...cac3b4f404
|
||
- zink+turnip/ci: document fixed tests
|
||
- r300/ci: document flake
|
||
- etnaviv/ci: document some flakes
|
||
- turnip/ci: document a flake
|
||
- nvk/ci: document some flakes
|
||
- meson: add spirv-tools option to disable the optional dependency
|
||
- docs: stub pipe_format & pipe_video_chroma_format
|
||
- docs: update calendar for 25.2.1
|
||
- docs: add release notes for 25.2.1
|
||
- docs: add sha sum for 25.2.1
|
||
- meson: fixup b_sanitize checks
|
||
- ci-tron: drop meaningless timestamp in initial section message
|
||
- virgl/ci: drop invalid but overridden empty caching proxy
|
||
- vmware/ci: fix caching proxy url
|
||
- ci/piglit: automatically use LAVA proxy
|
||
- ci/piglit: automatically use baremetal proxy
|
||
- broadcom/ci: drop unnecessary variables redefinitions
|
||
- ci-tron: move s3_jwt token file to the project dir
|
||
- ci-tron: avoid uploading downloaded traces
|
||
- piglit/ci: configure ci-tron to download traces and upload renders
|
||
- broadcom/ci: add ci-tron variant of the piglit traces job
|
||
- docs/ci: drop redundant/dead fork rule
|
||
- docs/ci: drop unnecessary comment
|
||
- docs/ci: always build the docs
|
||
- docs: update calendar for 25.1.9
|
||
- docs: add release notes for 25.1.9
|
||
- docs: add sha sum for 25.1.9
|
||
- ci: document what scope the ci_run_n_monitor token needs
|
||
- zink+radv/ci: add traces job on vangogh
|
||
- zink+radv/ci: add traces job on gfx1201
|
||
- broadcom/ci: document recent flakes
|
||
- radeonsi/ci: document recent flakes
|
||
- radv/ci: document recent flakes
|
||
- zink+radv/ci: document recent flakes
|
||
- zink+lavapipe/ci: document recent flakes
|
||
- docs: update calendar for 25.2.2
|
||
- docs: add release notes for 25.2.2
|
||
- docs: add sha sum for 25.2.2
|
||
- bin/ci: let filter_dag() caller define job filter once (instead of 3 times)
|
||
- ci/gitlab_gql: keep track of job tags
|
||
- ci_run_n_monitor: add --job-tags filter
|
||
- radv/ci: deduplicate navi10 GPU_VERSION
|
||
- radv/ci: document whether ci-tron jobs runs on an APU or a dGPU
|
||
- etnaviv/ci: document fixed tests
|
||
- r300/ci: document fixed tests
|
||
- nvk/ci: document fixed tests
|
||
- zink+nvk/ci: document fixed tests
|
||
- zink+turnip/ci: document fixed tests
|
||
- venus/ci: document fixed tests
|
||
- zink+radv/ci: comment out the two checksums
|
||
- ci/update_traces_checksum: fix decoding of log lines
|
||
- ci/update_traces_checksum: fix regex detecting PIGLIT_REPLAY_DEVICE_NAME in job logs
|
||
- intel/perf: fix enum type for eu stall props
|
||
- zink+radv/ci: sort vangogh flakes
|
||
- zink+radv/ci: document recent flakes
|
||
- radv/ci: document recent flakes
|
||
- broadcom/ci: document recent flakes
|
||
- zink+lvp/ci: document recent flakes
|
||
- broadcom/ci: update test expectations
|
||
- etnaviv/ci: update test expectations
|
||
- turnip/ci: update test expectations
|
||
- zink+turnip/ci: update test expectations
|
||
- zink+nvk/ci: update test expectations
|
||
- doc/features.txt: add missing supported anv extensions
|
||
- doc/features.txt: add missing supported tu extensions
|
||
- doc/features.txt: add missing supported lvp extensions
|
||
- doc/features.txt: add missing supported v3dv extensions
|
||
- doc/features.txt: add missing supported nvk extensions
|
||
- docs/release-calendar: add 25.2.x dates, and 25.3 branchpoint and release candidates
|
||
- docs: update calendar for 25.2.3
|
||
- docs: add release notes for 25.2.3
|
||
- docs: add sha sum for 25.2.3
|
||
- doc/features.txt: add missing supported dzn extensions
|
||
- radv: make sure fp16 is enabled consistently on gfx8
|
||
- radv: add comment explaining why fp16 is disabled by default on gfx8
|
||
- meson: require glslang >= 12.2 for bvh preample
|
||
- meson: only require glslang >= 12.2 when anv/radv/turnip are built
|
||
- ci/fedora: manage rust version ourselves
|
||
- ci/alpine: install and manage rust version ourselves
|
||
- ci/rust: install components with the initial install command
|
||
- ci: use MSRV for build-for-tests jobs and recent version in build-only jobs and CI components
|
||
- ci/build-rust: strip rust libs and binaries
|
||
- zink+nvk/ci: fix test expectations
|
||
- zink/ci: drop gbm override now that debian has a usable xorg
|
||
- util/meson: make sure shader_stats.h is generated in time for anything that depends on mesautil
|
||
- egl/meson: generate wayland presentation-time header before it gets included
|
||
- panvk/meson: generate git_sha1.h before compiling panvk_vX_physical_device.c
|
||
- gfxstream/meson: generate git_sha1.h before compiling ResourceTracker.cpp
|
||
- intel/meson: generate spirv_info.h before compiling brw_spirv.c
|
||
- etnaviv/meson: generate enums.h before compiling assembler.c
|
||
- freedreno/meson: generate xml headers before compiling gmemtool
|
||
- i915/meson: generate intel_device_info_gen.h before compiling i915_drm_winsys.c
|
||
- meson: use vcs_tag() instead of custom script
|
||
- llvmpipe/ci: document fixed tests
|
||
- docs: update calendar for 25.2.4
|
||
- docs: add release notes for 25.2.4
|
||
- docs: add sha sum for 25.2.4
|
||
- iris/meson: generate git_sha1.h before compiling iris_program.c
|
||
- docs: finish converting the docs job into a meson build job
|
||
- ci/alpine: install the real \`ninja` package
|
||
- ci: check for missing meson dependencies
|
||
- Revert "meson: use vcs_tag() instead of custom script"
|
||
- ci-tron: bump job template commit to get cached job templates
|
||
- docs: update khronos wiki url
|
||
- nvk/ci: document some flakes
|
||
- nvk/ci: document fixed tests
|
||
- broadcom/ci: document fixed tests
|
||
- docs: update calendar for 25.2.5
|
||
- docs: add release notes for 25.2.5
|
||
- docs: add sha sum for 25.2.5
|
||
- asahi/virtio: fix memleak
|
||
- util/meson: don't build libmesa_util_clflushopt unless needed
|
||
- util/meson: don't build libmesa_util_clflush unless needed
|
||
- ci: track src/c11/ changes
|
||
- ci: track src/android_stub/ changes
|
||
|
||
Eric R. Smith (9):
|
||
|
||
- panvk: use minimum attachment size for frame buffer size
|
||
- panvk: fix a NULL pointer dereference in occlusion queries
|
||
- mesa: fix off by one in MSRTT handling
|
||
- panfrost: add some sanity checks for nr_samples
|
||
- panvk: revised occlusion query pointer fix
|
||
- panfrost: fix typo in register allocation
|
||
- panfrost: fix debug print of spilled registers
|
||
- panfrost: align spills to reduce TLS memory usage
|
||
- glcpp: prevent accidental token pasting
|
||
|
||
Erico Nunes (10):
|
||
|
||
- lima: fix array limit in texture mipmap descriptor
|
||
- lima: ppir: fix check for discard_block in optimization
|
||
- lima: ppir: fix store_output optimization for modifiers
|
||
- ci: lima farm maintenance
|
||
- Revert "ci: lima farm maintenance"
|
||
- kmsro: enable with zink
|
||
- pvr: add VK_EXT_physical_device_drm support
|
||
- v3dv: rename primary_fd to display_fd
|
||
- v3dv: use v3d primary node for VK_EXT_physical_device_drm
|
||
- pvr: enable KHR_wayland_surface
|
||
|
||
Erik Faye-Lund (89):
|
||
|
||
- panfrost: enable robust_buffer_access_behavior
|
||
- docs: document new panfrost extensions
|
||
- docs: add GL_KHR_robustness to panfrost
|
||
- r300/ci: update expected failures
|
||
- mesa/st: do not check single-sampled for max_samples
|
||
- Revert "lima: make fp16 render-targets opt-in with driconf"
|
||
- Revert "upanfrost: make 128-bit opt-in with driconf on v4"
|
||
- panfrost: add new skips
|
||
- panvk/ci: try to remove all previously slow tests
|
||
- pan/ci: remove non-existent flag from PAN_MESA_DEBUG
|
||
- docs/features: add missing panvk extension
|
||
- panvk: fix EXT_texture_compression_astc_support
|
||
- crocus: use os_get_total_physical_memory instead of open-coding
|
||
- iris: use os_get_total_physical_memory instead of open-coding
|
||
- panfrost: use os_get_page_size()
|
||
- winsys/radeon: use os_get_page_size and error-check
|
||
- winsys/radeon: use util_get_cpu_caps()-helper
|
||
- prefer _SC_PAGESIZE over _SC_PAGE_SIZE
|
||
- meson/util: properly detect sysconf
|
||
- nvk: drop some needless definitions and deps
|
||
- docs/features: sort drivers
|
||
- docs/panfrost: update exposed vulkan version
|
||
- pan/util: use nir_component_mask instead of BITFIELD_MASK
|
||
- pan: use translate_s_format for stencil
|
||
- pan/lib: do not duplicate enum mali_pixel_kill
|
||
- panvk: avoid implicit cast-warning on Clang
|
||
- pan/midgard: avoid implicit cast-warning on Clang
|
||
- pan/bi: plug leak
|
||
- pan/bi: bail from optimizing on oom
|
||
- pan/bi: use ralloc
|
||
- pan/midgard: r1w should be set
|
||
- pan/midgard: initialize last_next_tag to TAG_BREAK
|
||
- pan/decode: detect error on fseek
|
||
- pan/clc: handle seek-error
|
||
- pan/bi: use os_read_file-helper
|
||
- pan/midgard: fix check for negative texture offset
|
||
- pan/va: check branch_offset for overflow
|
||
- panvk: properly handle errors from utrace_context_init
|
||
- pan/lib: clamp format size to 4
|
||
- pan/lib: clean up tilebuffer size helpers
|
||
- panvk: enable KHR_maintenance7
|
||
- doc/features: update VK_KHR_maintenance8
|
||
- panvk: enable KHR_maintenance8
|
||
- panvk: respect VK_QUERY_POOL_CREATE_RESET_BIT_KHR-flag
|
||
- panvk: enable KHR_maintenance9
|
||
- panvk: fix up vk1.4 properties
|
||
- panvk: clean up feature-bits
|
||
- panvk: clean up limits and properties
|
||
- panvk: explicitly list unsupported features
|
||
- panvk: expose missed vulkan 1.4 properties
|
||
- zink: update profile schema
|
||
- zink: add missing gpl requirement
|
||
- zink: use polygonModePointSize instead of open-coding
|
||
- aux/pp: fixup sampler-view release
|
||
- pan/lib: set afbc mode based on plane-format, not view
|
||
- panfrost: add per-gpu GLES2 extension lists
|
||
- panvk: do not export needless symbols
|
||
- pvr: use vulkan_icd_link_args
|
||
- pvr: report vulkan 1.4 to the loader
|
||
- pvr: wire up version-overriding
|
||
- pvr: remove unused enum
|
||
- pvr: drop pointless PVR_FROM_HANDLE macro
|
||
- pvr: move event/sampler cast defs to correct header
|
||
- pvr: remove bogus forward-declaration
|
||
- pvr: include pvr_common.h instead of pvr_private.h
|
||
- pvr: use pvr_memlayout instead of uint32_t
|
||
- pvr: remove stale comment about pvr_pds_upload
|
||
- pvr: move pvr_pds_upload to pvr_common.h
|
||
- pvr: break out queue to separate header
|
||
- pvr: break out instance/device to separate header
|
||
- pvr: break out image to separate header
|
||
- pvr: break out buffer to separate header
|
||
- pvr: break out render-pass to separate headers
|
||
- pvr: break out cmd-buffer to separate header
|
||
- pvr: break out queries to separate header
|
||
- pvr: break out pipelines to separate header
|
||
- pvr: break out descriptor sets to separate header
|
||
- pvr: break out wsi to separate header
|
||
- pvr: break out macros to separate header
|
||
- pvr: avoid including pvr_private.h from headers
|
||
- pvr: kill off pvr_private.h
|
||
- pvr: include pvr_csb.h first in implementation
|
||
- pvr: kill rogue_hwdefs.h
|
||
- pvr: split out rogue hw-defs to separate folder
|
||
- v3dv: use ld_args_build_id
|
||
- docs/pvr: update conformance status
|
||
- docs/pvr: update vulkan version
|
||
- aux/pp: release correct sampler-views
|
||
- gallium/aux: unconditionally write buffer
|
||
|
||
Ernst Persson (3):
|
||
|
||
- meson: Raise minimum Python version to 3.9
|
||
- vulkan/util: Use str.removeprefix() from Python 3.9
|
||
- amd/vulkan: Use str.removesuffix() from Python 3.9
|
||
|
||
Fafa Kitten (1):
|
||
|
||
- meson: detect \`memfd_create()` and \`getrandom()` from headers, not system libraries
|
||
|
||
Faith Ekstrand (205):
|
||
|
||
- nak: Wire up the mma predicate on Hopper+
|
||
- nir/instr_set: Rework tex instr hash/compare
|
||
- nil: Add a ViewAccess enum and plumb it through from NVK
|
||
- nil: Use an extent in samples for MSAA storage images
|
||
- nir,nak: Add a nir_texop_sample_pos_nv and plumb it through
|
||
- nak/lower_tex: Don't use remap_sampler_dim() for images
|
||
- nak/lower_tex: Add texture query helpers
|
||
- nak/lower_tex: Handle NULL image queries pre-Volta
|
||
- nvk: Drop the pre-Volta texture query workaround
|
||
- nak: Lower MSAA image load/store/atomic/size
|
||
- nvk: Delete the old MSAA image workarounds and trust NIL and NAK
|
||
- nouveau/headers: Skip duplicate enumerants in rust enums and switches
|
||
- nouveau,nvk: Import the Blackwell and Hopper DMA class headers
|
||
- nvk: Move KHR_timeline_semaphore to the right spot in the list
|
||
- nvk: Bump the conformance version to 1.4.3
|
||
- nvk: Add an nvk_is_conformant() helper
|
||
- vulkan/meta: Supply image view usage in vk_meta_clear_*_image()
|
||
- loader: Ignore NOUVEAU_USE_ZINK on Hopper+
|
||
- vulkan: Rename a bunch of vk_sync_timeline helpers
|
||
- vulkan: Hold a reference to pending vk_sync_timeline_points
|
||
- nak/lower_tex: Re-order arguments to put can_speculate at the end
|
||
- vulkan/wsi/x11: Handle VK_NOT_READY in AcquireNextImage()
|
||
- spirv: Assert !ptr_as_array for blocks and acceleration structures
|
||
- spirv: Drop block_index/offset pointers
|
||
- spirv: Simplify pointer_to/from_ssa a bit
|
||
- spirv: Assert that vtn_pointer_to_deref() doesn't return NULL
|
||
- compiler/rust: Add a CFG::loop_depth() method
|
||
- nak: Take loops into account in static cycle estimates
|
||
- nvk: Blackwell is now Vulkan 1.4 conformant
|
||
- nvk: Handle empty pushes in nvk_queue_push()
|
||
- nouveau/class_parser: Strip unnecessary parens
|
||
- nouveau/headers: Import video encode/decode headers from NVIDIA
|
||
- nouveau/push: Map b0 classes to subchannel 4
|
||
- nouveau/winsys: Allow subchan_dealloc() on zeroed subchans
|
||
- nouveau/winsys: Refactor nouveau_ws_context_create()
|
||
- nvk: Advertise KHR_shader_untyped_pointers
|
||
- vulkan/video: Switch vk_video_session_parameters to create/destroy
|
||
- vulkan: Add handle casts for vk_video_session[_parameters]
|
||
- vulkan: Add common VideoSessionParametersKHR entrypoints
|
||
- anv: Delete anv_video_session_params
|
||
- radv: Delete radv_video_session_params
|
||
- vulkan: Add a vk_video_session_finish() helper
|
||
- nvk: Allow kepler in nvk_is_conformant()
|
||
- anv: Set the Shader capability when compiling the FP64 shader
|
||
- anv/i915: Require HAS_EXEC_ASYNC
|
||
- anv/i915: Require HAS_EXEC_CAPTURE
|
||
- anv/i915: Require HAS_EXEC_TIMELINE_FENCES
|
||
- intel/gem: Add an intel_gem_supports_dma_buf_sync_file() helper
|
||
- anv: Require Linux 6.0 for dma-buf sync file import/export
|
||
- anv/wsi: Stop requesting signal_*_with_memory
|
||
- anv: Dead code anv_bo_sync
|
||
- hasvk: Require HAS_EXEC_ASYNC
|
||
- hasvk: Require HAS_EXEC_CAPTURE
|
||
- hasvk: Require HAS_EXEC_TIMELINE_FENCES
|
||
- hasvk: Require Linux 6.0 for dma-buf sync file import/export
|
||
- hasvk/wsi: Stop requesting signal_*_with_memory
|
||
- hasvk: Dead code anv_bo_sync
|
||
- dozen: Drop dzn_create_sync_for_memory()
|
||
- vulkan/wsi: Drop signal_fence/semaphore_with_memory
|
||
- vulkan/wsi: Stop setting wsi_memory_signal_submit_info
|
||
- vulkan: Drop implicit sync support
|
||
- vulkan/wsi: Style nits
|
||
- vulkan/wsi: Sanitize the result of wsi_drm_check_dma_buf_sync_file_import_export()
|
||
- vulkan/wsi: Only test for dma-buf sync file support once
|
||
- subprojects: Stop calling add_languages() in paste-1-rs/meson.build
|
||
- meson: Add a rust_2024_lint_args helper
|
||
- meson: Disable unsafe_op_in_unsafe_fn in bindgen for now
|
||
- meson: Disable unsafe_attr_outside_unsafe for now
|
||
- nil/copy: Wrap all unsafe code in unsafe blocks
|
||
- nil/copy: Use saturating_sub() instead of doing it manually
|
||
- nil: Fix a couple of clippy lints
|
||
- nak: Use .as_ref().unwrap() instead ofv &*
|
||
- nak/hw_runner: Wrap all unsafe code in unsafe blocks
|
||
- nak: Use +use<> to avoid unnecessary lifetime captures
|
||
- nouveau: Use rust_2024_lint_args
|
||
- nouveau/class_parser: Stop shifting by zero
|
||
- nouveau/class_parser: Add a helper for address expression filtering
|
||
- nouveau/struct_parser: Stop generationg i * 1
|
||
- nouveau/bitview: Drop an unneeded lifetime
|
||
- compiler/rust: Use .as_ref().unwrap() instead of &*
|
||
- compiler/rust: Stop using NonNull in the NIR bindings
|
||
- meson: Add --wrap-unsafe-ops to bindgen
|
||
- compiler/rust: Add Rust 2024 lints
|
||
- compiler/rust/nir: Drop a bunch of explicit lifetimes
|
||
- compiler/rust: Don't use assert_eq!() with booleans
|
||
- compiler/rust: Add a bunch of clippy lints
|
||
- compiler/rust: Stop using try_into() for u8 -> usize
|
||
- compiler/rust/bitset: Don't use a vector for expected sets in tests
|
||
- compiler/rust/cfg: Use slices instead of &Vec
|
||
- vulkan/sync: Return early in vk_sync_timeline_wait() if wait_value == 0
|
||
- vulkan/drm_syncobj: Use SWAP() in vk_drm_syncobj_move()
|
||
- vulkan/sync: Make the can_wait_many() check faster
|
||
- vulkan/sync: Add vk_sync_signal/reset_many()
|
||
- vulkan/drm_syncobj: Implement signal/reset_many
|
||
- vulkan: Add a vk_sync_wait_unwrap() helper
|
||
- vulkan/queue: Move timeline point allocation to vk_queue_submit_final()
|
||
- vulkan: Add a vk_sync_signal_unwrap() helper
|
||
- vulkan: Add a vk_device_copy_semaphore_payloads() helper
|
||
- vulkan/drm_syncobj: Add a vk_drm_syncobj_copy_payloads helper
|
||
- anv,hasvk: Use vk_drm_syncobj_copy_payloads
|
||
- nvk: Use vk_drm_syncobj_copy_payloads
|
||
- panvk: Use vk_drm_syncobj_copy_payloads
|
||
- anv: Stop picking our own blit queue
|
||
- vulkan/wsi: Switch to vkQueueSubmit2()
|
||
- vulkan,anv,hasvk: Drop vk_queue_wait_before_present()
|
||
- vulkan/wsi: Take a vk_queue in wsi_common_queue_present()
|
||
- vulkan/wsi: Make get_blit_queue return a struct vk_queue *
|
||
- vulkan/wsi: Add a QueueSubmit2() wrapper
|
||
- vulkan/wsi: Gather per-swapchain results in an array in queue_present()
|
||
- vulkan/wsi: Handle throttling in a separate loop
|
||
- vulkan/wsi: Consolodate vkQueueSubmit2() calls across swapchains
|
||
- vulkan/wsi: Skip the vkQueueSubmit() entirely if we aren't blitting
|
||
- vulkan/wsi: Always use VK_PIPELINE_STAGE_2_TRANSFER_BIT for semaphore ops
|
||
- nak: Lower away ldcx when NAK_DEBUG=no_ugpr is set
|
||
- nvk/nvkmd: Stop setting WAIT_FOR_SUBMIT for sync
|
||
- nvk/nvkmd: Track all memory objects by default
|
||
- nvk,nvkmd: Move push dumping to NVKMD
|
||
- nouveau/push: Handle more recent versions of 6F
|
||
- nak: Add a nak_qmd_size_B() query
|
||
- nak/hw_runner: Allow for variable sized QMDs
|
||
- nvk: Allow for larger QMDs
|
||
- nak/qmd: QMD versions 4.0 and 5.0 are both 384B
|
||
- nouveau/headers: Add a MAX_BIT for structs
|
||
- nak: Assert that QMDs are big enough
|
||
- nak: NAK_MAX_QMD_SIZE_B should be 384
|
||
- nak: Increase Imma latencies on Blackwell by 4
|
||
- compiler/rust: Fix the DFS loop detection algorithm
|
||
- lavapipe: Always use dma-buf for external memory when we can
|
||
- vulkan/wsi: Move a couple of dma-buf sync checks
|
||
- vulkan/wsi: Don't dma-buf sync import/export on success
|
||
- nir: Add an option to make lower_phis_to_regs_block() less clever
|
||
- nak,nir: Use a simpler version of phis_to_regs_block in lower_cf
|
||
- nil: Delete some useless image alignment code
|
||
- turnip: Use vk_drm_syncobj_copy_payloads
|
||
- nouveau/push: Fix SET_OBJECT handling
|
||
- nvk: Use the image format for depth views
|
||
- vulkan/meta: Always set VK_IMAGE_VIEW_CREATE_DRIVER_INTERNAL_BIT_MESA
|
||
- vulkan: Handle VK_IMAGE_VIEW_CREATE_DRIVER_INTERNAL_BIT_MESA automatically
|
||
- nvk: Use VK_IMAGE_VIEW_CREATE_DRIVER_INTERNAL_BIT_MESA
|
||
- radv: Use VK_IMAGE_VIEW_CREATE_DRIVER_INTERNAL_BIT_MESA
|
||
- v3dv: Use VK_IMAGE_VIEW_CREATE_DRIVER_INTERNAL_BIT_MESA
|
||
- vulkan: Drop the driver_internal from vk_image_view_init/create()
|
||
- nvk: Stop adding Vulkan image usage flags
|
||
- nvk: Use Vulkan formats for SET_ZT_FORMAT instead of NIL
|
||
- mesa: Use mesa_log_if_debug() for no context errors
|
||
- util/log: Add a MESA_LOG_LEVEL environment variable
|
||
- vulkan/wsi/x11: Use mesa_logX() instead of fprintf()
|
||
- vulkan/queue: Move shared binary semaphores to temps
|
||
- spirv: Add support for OpBitcast in OpSpecConstantOp
|
||
- nvk: Actually reserve 1/2 for FALCON
|
||
- compiler/rust: Add a DepthFirstSearch trait
|
||
- compiler/rust/cfg: Use DepthFirstSearch for rev_post_order_sort()
|
||
- compiler/rust/cfg: Use DepthFirstSearch for calc_dominance()
|
||
- compiler/rust/cfg: Use DepthFirstSearch for find_back_edges()
|
||
- compiler/rust/cfg: Use DepthFirstSearch for finding reaches sets
|
||
- compiler/rust: Implement dfs() non-recursively
|
||
- nil: Add a GOB_TYPE_MODIFIER_INFOS table
|
||
- nil: Add GOBType::TegraColor
|
||
- util/cache_ops: Add some cache flush helpers
|
||
- util/cache_ops/x86: Call util_get_cpu_caps() less
|
||
- hasvk: Switch to util/cache_ops.h
|
||
- anv: Switch to util/cache_ops.h
|
||
- intel/sanitize-gpu: Use util_flush_inval_range()
|
||
- crocus: Use util_flush_inval_range()
|
||
- intel: Drop intel_mem.c/h
|
||
- turnip: Use the util cache helpers
|
||
- nouveau/winsys: Add a NOUVEAU_WS_BO_COHERENT flag
|
||
- nvk/nvkmd: Add an NVKMD_MEM_COHERENT flag
|
||
- nvk/nvkmd: Add map sync to/from GPU helpers
|
||
- nvk: Implement Flush/InvalidateMappedMemoryRanges()
|
||
- nvk: Flush pushbufs in EndCommandBuffer()
|
||
- nvk/nvkmd: Invalidate maps before dumping pushbufs
|
||
- nvk: Use a coherent map for the event heap
|
||
- nvk: Flush descriptor tables and heap maps on submit
|
||
- nvk/mem_stream: Flush maps in nvk_mem_stream_flush()
|
||
- nvk: Flush after zeroing memory
|
||
- nvk: Flush the zero page
|
||
- nvk: Flush/invalidate around host image copies
|
||
- nvk: Use _B suffixes in descriptor sets
|
||
- nvk: Use a pool offset instead of an address in nvk_descriptor_set
|
||
- nvk: Add an nvk_descriptor_writer
|
||
- nvk: Route more descriptor types through write_desc()
|
||
- nvk: Flush descriptor set maps
|
||
- nvk: Flush indirect execution set maps
|
||
- nvk/query: Rework offset helpers
|
||
- nvk/query: Pass an IS_TIMESTAMP flag explicitly to the CL kernel
|
||
- nvk/query: Add a vk_query_pool_report_count() helper
|
||
- nvk/query: Add an interleaved query layout
|
||
- nvk/query: Rework query waits
|
||
- nvk/query: Handle non-coherent query pool memory
|
||
- nvk: Expose cached and coherent as separate types on Tegra
|
||
- panvk: Fix integer dot product properties
|
||
- util: Don't advertise cache ops on x86 without SSE2
|
||
- util: Build util/cache_ops_x86.c with -msse2
|
||
- nvk: Include the chipset in the pipeline/binary cache UUID
|
||
- nvk: Disable sampleLocationsSampleCounts for 1x MSAA
|
||
- nvk: Emit inactive vertex attributes
|
||
- nvk: Look at the right pointer in GetDescriptorInfo for SSBOs
|
||
- nvk: Capture/replay buffer addresses for EDB capture/replay
|
||
- panvk/shader: [de]serialize desc_info.max_varying_loads
|
||
- panvk/shader: Use the right copy size for deserializing dynamic UBOs/SSBOs
|
||
- nvk: Don't re-initialize the descriptor writer if the set matches
|
||
- drm-uapi: Import the new NVIDIA modifiers
|
||
- nil: Add support for Blackwell 8 and 16-bit modifiers
|
||
- nir: Add a couple panfrost sysvals to divergence analysis
|
||
|
||
Francisco Jerez (16):
|
||
|
||
- intel/brw/xe3+: Handle SENDG in instruction scheduler.
|
||
- intel/brw: Fix behavior of scheduler around flag register writes.
|
||
- intel/brw/xe3+: Define BRW_SCHEDULE_PRE_LATENCY scheduling mode.
|
||
- util/ra: Allow driver to override class P value.
|
||
- intel/brw/xe3+: Override P value of GRF register classes to increase thread parallelism.
|
||
- intel/brw/xe3+: Model trade-off between parallelism and GRF use in performance analysis.
|
||
- intel/brw/xehp+: Adjust performance model weights of LSC atomic ops.
|
||
- intel/brw/xe3+: Adjust weights of discard control flow for non-EU-fused platforms.
|
||
- intel/brw/xe3+: Tweak render target write timings in performance modeling pass.
|
||
- intel/brw: Allow using performance analysis pass pre-register allocation.
|
||
- intel/brw: Make sure we don't use stale analysis after inst. order restore in brw_allocate_registers().
|
||
- intel/brw/xe3+: Select scheduler heuristic with best trade-off between register pressure and latency.
|
||
- intel/brw: Apply 7e1362e9c070ad037 to pre-xe3 codepath of brw_compile_fs().
|
||
- intel/brw/xe3+: Re-enable static analysis-based SIMD32 FS heuristic for the moment.
|
||
- intel/brw: Fix regression in brw_allocate_registers() compiling large shaders with throughput==0.
|
||
- intel/brw/gfx12.0+: Sync on all pending send messages after halt target.
|
||
|
||
Frank Binns (30):
|
||
|
||
- pvr: correctly return core count for pvrsrvkm
|
||
- pvr: update conformance version
|
||
- pvr: only share scratch buffers when they're the required size
|
||
- pvr: apply PBE stride alignment when setting up image physical extents
|
||
- pvr: implement VK_(EXT|KHR)_vertex_attribute_divisor
|
||
- pvr: advertise VK_EXT_queue_family_foreign
|
||
- pvr: implement VK_EXT_depth_clip_enable
|
||
- pvr: Implement VK_KHR_descriptor_update_template
|
||
- pvr: add support for VK_FORMAT_D32_SFLOAT_S8_UINT
|
||
- pvr: setup tpu_tag_cdm_ctrl when present (pvrsrvkm)
|
||
- pvr: support VK_FORMAT_R8G8_SSCALED for vertex attribs
|
||
- pvr: add some more pixel formats needed by Zink
|
||
- pvr: implement KHR_shader_float_controls
|
||
- pvr: disable gs_rta_support for BXS-4-64 to workaround some conformance failures
|
||
- pvr: enable KHR_create_renderpass2
|
||
- pvr: advertise KHR_shader_subgroup_extended_types
|
||
- pvr: advertise KHR_spirv_1_4
|
||
- pvr: setup Vulkan 1.1 & 1.2 features, properties, version
|
||
- docs: add pvr VK 1.0, extensions and optional features to new_features.txt
|
||
- pvr: advertise VK_EXT_zero_initialize_device_memory
|
||
- docs/features: claim vk 1.2 for pvr
|
||
- pvr: add device info for BXE-4-32 (36.50.54.182)
|
||
- pvr: add device info for GX6250 (4.45.2.58)
|
||
- pvr: add device info for G6110 (5.9.1.46)
|
||
- pvr: add device info for GX6650 (4.46.6.62)
|
||
- pvr: add device info for BXM-4-64 (36.52.104.182)
|
||
- pvr: add device info for BXE-2-32 (36.29.52.182)
|
||
- pvr: add device info for GE8300 (22.102.54.38)
|
||
- pvr: add device info for GE8300 (22.68.54.30)
|
||
- pvr: support VK_KHR_device_group
|
||
|
||
GKraats (1):
|
||
|
||
- crocus: fix SIGSEGV crash at pbo compressed teximage
|
||
|
||
Georg Lehmann (175):
|
||
|
||
- ac/nir/lower_mem_access_bit_sizes: make 8/16bit access 32bit if possible
|
||
- nir/lower_int64: lower 64bit bitfield_select
|
||
- aco/isel: don't create literal operands for SALU bitfield_select
|
||
- aco: supported 64bit or vectorized bitfield_select
|
||
- ac/nir: don't lower 8/16bit bitfield_select
|
||
- nir/opt_generate_bfi: create vector and non 32bit bitfield_select
|
||
- nir/opt_algebraic: create non 32bit bitfield_select
|
||
- radv: vectorize 8/16bit bitfield_select
|
||
- lavapipe: use NIR_PASS(_, ...) instead of NIR_PASS_V
|
||
- gallium/draw: use NIR_PASS(_, ...) instead of NIR_PASS_V
|
||
- gallivm: use NIR_PASS(_, ...) instead of NIR_PASS_V
|
||
- nir/schedule: return progress and fix metadata
|
||
- broadcom/compiler: use NIR_PASS for nir_schedule
|
||
- llvmpipe: use NIR_PASS(_, ...) for nir_lower_fragcolor
|
||
- svga: use NIR_PASS(_, ...) for gl_nir_lower_images
|
||
- nir/opt_remove_phis: skip unreachable phis
|
||
- pvr/rogue: return progress in rogue_nir_pfo
|
||
- pvr/rogue: replace NIR_PASS_V with NIR_PASS(_, ...)
|
||
- lima: rework lima_nir_duplicate_modifiers
|
||
- lima: rework lima_nir_duplicate_intrinsic
|
||
- lima: rework lima_nir_duplicate_load_consts
|
||
- lima: fix metadata in lima_nir_split_loads
|
||
- lima: replace NIR_PASS_V with NIR_PASS(_, ...)
|
||
- aco: optimize get_alu_src with constant source and size > 1
|
||
- nir: remove NIR_PASS_V
|
||
- aco/statistics: add latency to WMMA
|
||
- aco/statistics: update GFX12 WMMA cost
|
||
- aco: insert VALU s_delay_alu for WMMA
|
||
- aco/select_alu: avoid vector get_alu_src for instructions with scalar operands
|
||
- aco/isel: refactor shared vgpr usage
|
||
- aco/gfx10: optimize subgroupRotate(x, 32) and subgroupShuffleXor(x, 32)
|
||
- nir/search: support swizzles on expressions in replacement patterns
|
||
- radv/nir/lower_cmat: load gfx11 8bit ACC using the B layout to get aligned loads
|
||
- nir/opt_algebraic: remove 8bit roundtrip when vectorizing i2i16(unpack_4x8(a).zw)
|
||
- aco/print_asm: use real true16 instr on gfx11+
|
||
- aco/ra: convert bitwise instruction to gfx11+ 16bit on demand
|
||
- nir/opt_algebraic: optimize fsat(fmax(a, b)) where b is not positive
|
||
- nir/opt_algebraic: push fsat into bcsel with constant
|
||
- nir/opt_algebraic: use range analysis to detect no-op fmin/fmax
|
||
- nir/range_analysis: look through f2f
|
||
- nir/range_analysis: look through vec2
|
||
- nir/opt_algebraic: make fmin/fmax(a, #b) 16bit if only used by f2f16
|
||
- nir/opt_algebraic: remove fneg around fmin/fmax
|
||
- nir/opt_algebraic: create 16bit fmin/fmax if only used by pack_half_2x16_rtz_split
|
||
- nir/opt_algebraic: optimize pack_half_rtz of bcsel with constant
|
||
- nir/opt_algebraic: optimize pack_half_rtz of b2f
|
||
- nir/opt_tex_skip_helpers: don't skip helpers for terminate_if source
|
||
- nir/opt_tex_skip_helpers: never require helpers for stores/atomics
|
||
- nir: print skip_helpers for tex instrs
|
||
- nir: rename to nir_opt_load_skip_helpers and add options struct
|
||
- nir: add ACCESS_SKIP_HELPERS
|
||
- nir: add access for scratch loads
|
||
- nir/opt_load_skip_helpers: optionally handle intrinsics
|
||
- aco/insert_exec: remove p_jump_to_epilog from needs exact
|
||
- aco: add a post-RA pass to disable wqm
|
||
- aco/insert_exec: new way to handle instructions that need wqm disabled
|
||
- aco: use new disable_wqm for mubuf/mtbuf
|
||
- aco: use new disable_wqm for flatlike
|
||
- aco: use new disable_wqm for mimg
|
||
- aco/builder: support new disable_wqm
|
||
- aco: use new disable_wqm for exp
|
||
- aco: use new disable_wqm for p_dual_src_export_gfx11
|
||
- aco/insert_exec: remove per instruction wqm/exact exec handling
|
||
- aco: use a smaller wqm section for strict_wqm sampling
|
||
- aco: don't restrict vmem load scheduling by inserting p_end_wqm early
|
||
- aco: disable wqm for tex loads when not needed
|
||
- aco: disable wqm for sampled buffer loads when not needed
|
||
- aco/disable_wqm: optimize local mask creation
|
||
- amd: replace ACCESS_TYPE_SMEM with ACCESS_SMEM_AMD
|
||
- amd: stop using custom gl_access_qualifier for access type
|
||
- amd/ci: update checksums for restricted traces
|
||
- nir/uub: guard against division by 0
|
||
- aco/isel: fix vectorized i2i16 with 8bit vec8 source
|
||
- nir/uub: fix exclusive scans
|
||
- nir/uub: decrease default max subgroup size to 128
|
||
- nir/uub: handle more reduction ops
|
||
- nir/uub: handle bit_count
|
||
- nir/shrink_vec_array_vars: allow nir_var_mem_shared
|
||
- radv: shrink shared arrays
|
||
- nir/shrink_vec_array_vars: use range analysis for non constant indices
|
||
- aco: fix ra validation for flat/global/scratch/ds load sbyte_d16
|
||
- aco/optimizer: don't apply packed clamp to v_fma_mix
|
||
- aco/optimizer: don't create undef copies from p_create_vector
|
||
- nir: constant fold txd with 0 ddx/ddy to txl
|
||
- nir/shrink_vec_array_vars: update constant initializer after shrinking
|
||
- nir/shrink_vec_array_vars: detect zero init shared memory using constant initializer
|
||
- radv/nir/lower_cmat: split up larger nested switches
|
||
- radv: reorder cmat properties according to performance
|
||
- ac/nir: do not assume mesh cull flag is 1bit
|
||
- nir/lower_io: fix boolean output stores
|
||
- nir/peephole_select: allows more lowered io
|
||
- nir/opt_algebraic: optimize some post peephole select patterns
|
||
- radv: set ACCESS_CAN_SPECULATE for smem buffer loads with known good descriptors
|
||
- aco/isel: add init_disable_wqm helper
|
||
- aco: implement skip_helpers for image loads
|
||
- aco: implement skip_helpers for load_ssbo/ubo/constant
|
||
- aco: implement skip_helpers for load_scratch
|
||
- aco: implement skip_helpers for load_global_amd
|
||
- aco: never end wqm early for vmem
|
||
- nir: make inverse_ballot 1bit only
|
||
- nir/builder: add nir_inverse_ballot_imm
|
||
- nir: make ballot_bitfield_extract 1bit only
|
||
- spirv: handle ballot bit_extract separately
|
||
- nir: make ballot find_lsb/msb/bit_count 32bit only
|
||
- spirv: ensure ballot find_lsb/find_msb/bit_count have 32bit result
|
||
- nir/lower_subgroups: don't use get_max_subgroup_size for lowering boolean rotates
|
||
- nir/lower_subgroups: change filter to intrinsic callback
|
||
- nir/lower_subgroups: recursively lower ballot scans
|
||
- mesa: clamp fog scale to -FLT_MAX instead of FLT_MIN
|
||
- intel/ci: update restricted trace checksums
|
||
- radv/nir/lower_cmat: add shuffle_xor_imm helper
|
||
- radv/nir/lower_cmat: clean up gfx12 transpose
|
||
- radv/nir/lower_cmat: clean up GFX11 ACC->B convert
|
||
- nir/lower_subgroup: optimize reduce/scans with unknown subgroup size
|
||
- mesa/st: make double subgroup lowering more precise
|
||
- nir: remove subgroup size related nir_shader_compiler_options members
|
||
- nir/lower_subgroups: remove lower_fp64 option
|
||
- nir: remove unused shader_info param in nir_create_shader
|
||
- nir: define new subgroup size info
|
||
- vulkan: set nir subgroup size shader info
|
||
- mesa,glsl,spirv: set new subgroup size info
|
||
- intel: switch to new subgroup size info
|
||
- radeonsi: switch to new subgroup size info
|
||
- rusticl: switch to new subgroup size info
|
||
- microsoft: switch to new subgroup size info
|
||
- shader_info: remove gl_subgroup_size enum
|
||
- radv: add varying subgroup size to shader stage key
|
||
- ac/llvm: remove unused ballot size
|
||
- radv: remove unused ballot_bit_size from shader info
|
||
- ac/nir: set subgroup size for gs copy shader
|
||
- radv: determine subgroup/wave size early
|
||
- radv: remove uses_rt from radv_shader_info
|
||
- nir: remove has_ddx_intrinsics option
|
||
- aco/isel: fix output args init stack buffer overflow
|
||
- nir/uub: remove vertex input handling
|
||
- nir/uub: use shader_info subgroup size
|
||
- nir/uub: remove max_workgroup_size from config
|
||
- nir: remove unsigned upper bound config
|
||
- radv: allow application required fragment shader subgroup size
|
||
- radv: use rt wave size in fragment shaders with ray queries
|
||
- radv,aco: don't end monolithic ray tracing with unconditional terminate
|
||
- aco: remove existing dealloc_vgprs use
|
||
- aco: dealloc vgprs if there is a pending non scratch store and no pending export
|
||
- aco: don't insert s_sendmsg dealloc_vgprs with little vgprs allocated
|
||
- util: add util_round_down_npot
|
||
- aco: use maximum RT vgpr_limit that doesn't reduce wave count
|
||
- aco/lower_branches: update branch hints after changing jump targets
|
||
- radv: call nir_opt_undef late too
|
||
- nir/opt_undef: prefer 0 over NaN for pack_half_2x16_rtz_split
|
||
- aco/optimizer: fix incorrect operand order assumption for neg(mul) opt
|
||
- aco/insert_waitcnt: don't merge waitcnts for LDS clauses
|
||
- nir: add atomic isub
|
||
- ac/llvm: support nir_atomic_op_isub
|
||
- aco/isel: support nir_op_atomic_isub
|
||
- nir: optimize atomic isub if supported
|
||
- aco: fix global_atomic_swap offset overflow check
|
||
- nir: fix nir_get_io_offset_src for global_atomic_swap_amd
|
||
- aco/gfx10+: only work around split execution of uniform LDS in WGP mode
|
||
- nir/opt_uniform_atomics: optimize xchg with uniform address and data
|
||
- nir/opt_intrinsics: don't pass nir options around
|
||
- nir/opt_intrinsics: optimize atomics to atomic load/store
|
||
- ac/nir: enable nir atomic load/store opts
|
||
- aco/tests: allow even more literals
|
||
- aco/optimizer: add a new dce helper
|
||
- aco/optimizer: add alu_opt_info helpers
|
||
- aco/optimizer: use new helpers to apply literals
|
||
- aco/optimizer: use new helpers to propagate constants/neg/abs
|
||
- aco/optimizer: rework packed fneg opt
|
||
- aco/optimizer: apply sgprs/extract with new helpers
|
||
- aco/optimizer: delete apply_extract
|
||
- aco/optimizer: remove can_apply_extract
|
||
- aco/optimizer: apply f2f16 conversion with the new helpers
|
||
- aco/optimizer: unify constant labels
|
||
- radv: do not report wave32 in gl_SubgroupSize for Doom Dark Ages
|
||
- aco/gfx10_3: work around NSA hazard
|
||
|
||
Gert Wollny (95):
|
||
|
||
- r600/sfn: lower bany/ball \*(n)equal in nir
|
||
- r600/sfn: lower ineg in nir
|
||
- r600/sfn: remove some dead code
|
||
- r600/sfn: remove obsolete index and address register handling
|
||
- r600/sfn: remove code used for vectorized ALU ops
|
||
- r60/sfn: Update .clang-format
|
||
- r600/sfn: Move RA helper class declaration into implementation file
|
||
- r600/sfn: lower b2f64 in nir
|
||
- r600/sfn: Allow f2f64 to use vec2
|
||
- r600/sfn: remove first call to r600_split_64bit_alu_and_phi
|
||
- r600/sfn: lower u2f64 and i2f64 in nir
|
||
- r600/sfn: check number of fsat64 source uses properly
|
||
- r600/sfn: rename free_slots and improve updating it
|
||
- r600/sfn: Simplify test code when scheduling a vec instr into trans
|
||
- r600/sfn: unify and fix naming of group readport reserver
|
||
- r600/sfn: reuse readport for already loaded registers
|
||
- r600/sfn: Fix update readports method
|
||
- r600/sfn: update readports before trying to schedule group instrutions
|
||
- r600: Update GPR count when adding a GDS instruction
|
||
- r600/sfn: allow skipping RA for shader ID ranges
|
||
- r600/sfn: factor out adding an input in GS
|
||
- r600/sfn: Handle indirect access to GS input arrays
|
||
- r00/sfn: Fix copy propagation into buffer load address
|
||
- r600/sfn: resolve constant indices into local arrays better
|
||
- r600/sfn: Lower all GS indirect input loads after lowering IO
|
||
- r600/sfn: cleanup GS shader emission
|
||
- r600/sfn: When splitting an ALU CF update possible start of next CF
|
||
- r600/sfn: Fix AR use tracking off-by-one error
|
||
- r600/sfn: remove extra slot of AR use
|
||
- r600/sfn: remove early emmission of ALU last op
|
||
- r600/sfn: Take allowed dest mask into account in copy-prop
|
||
- r600/sfn: Only map ssa index to register index if pinning is not free
|
||
- r600/sfn: Fix test when allocating registers more freely
|
||
- r600/sfn: Take slot count into account when pinning registers
|
||
- r600/sfn: Fix the mods when splitting ALU op
|
||
- r600/sfn: replace hard-coded multislot dot handling
|
||
- r600/sfn: Handle more ops in desk mask evaluation
|
||
- r600/sfn: op1v_flt64_to_flt32 as multi-slot instruction
|
||
- r600/sfn: give more liberty to the channel selection in simple two-slot ops
|
||
- r600/sfn: Emit thread position as two-slot op
|
||
- r600/sfn: pass group into AluInstr::split instead of creating it
|
||
- R600/sfn: split one-dest multi-slot ops late when scheduling
|
||
- r600/sfn: stop early when looking for ALU vec ready ops
|
||
- r600/sfn: remove some useless boolean parameters
|
||
- r600/sfn: add an unreachable if the creation of a fp64 group fails
|
||
- r600/sfn: rework testing readport config for more than one source
|
||
- r600/sfn: factor out common code for readport validation
|
||
- r600/sfn: preloading sources for fp64 ops with common code path
|
||
- r600/sfn/tests: Update source pinning when loading from string
|
||
- r600/sfn: Pin registers to channel only after scheduling
|
||
- r600/sfn: try all possible configurations when splitting multi-slot instructions
|
||
- r600: remove hack to force a new CF if TEX grad is set
|
||
- r600/sfn: Increase limit for lowering local arrays to scratch
|
||
- r600/sfn: remove superfluous semicolon
|
||
- egl,glx,X11: Handle case when PlatformDisplay is EGL_DEFAULT_DISPLAY
|
||
- r600/sfn: make pin_dest_to_chan a virtual function
|
||
- r600/sfn: Simplify scheduling
|
||
- r600/sfn: preselect fetch by using TC and VC in scheduler
|
||
- r600/sfn: Prepare scheduler to handle WaitAck instructions
|
||
- r600/sfn: Emit and schedule WaitACK as a separate instruction
|
||
- r600/sfn: Add more CF instruction types
|
||
- r600/sfn: Add a CF block start member and handle it in the tests
|
||
- r600/sfn: chain group barrier and predicate instructions
|
||
- r600/sfn: Add method to query whether an ALU group sets the predicate
|
||
- r600/sfn: Add method to emit ALU_PUSH_BEFORE in assembler
|
||
- r600/sfn: Drop test for address register in assembler IF predicate
|
||
- r600/sfn: Add method to query whether ALU block will need ALU_EXTENDED
|
||
- r600/sfn: extract handling of ALU_PUSH_BEFORE in assembler code
|
||
- r600/sfn: make sure that kill and update pred are not in the same group
|
||
- r600/sfn: handle the IF predicate in the scheduler
|
||
- r600/sfn: start scheduling memory writes earlier
|
||
- r600/sfn: Don't fall through if a WaitACK was scheduled
|
||
- r600/sfn: fix op2_pred_sete_64 opcode
|
||
- r600/sfn: Pass chan and dest_clamp to alu op if no dest register is given
|
||
- r600/sfn: Add handling of channels for dest-less ALU ops
|
||
- r600/sfn: don't use dummy regs in alu ops when no dest register is needed
|
||
- r600/sfn: optimize comparison results
|
||
- r600/sfn: emit 64 bit predicates like normal ALU ops
|
||
- r600/sfn: relax restrictions when optimizing predicate evaluation with a register
|
||
- r600/sfh: Handle 64 bit comparisons in predicate optimization
|
||
- r600/sfn: Optimize pred(not X != 0) to pred(X == 0)
|
||
- r600/sfn: Filter lowering of b2f32(comp(x,y)) for 64 bit sources
|
||
- r600/sfn: Propagate pred and exec update flags when splitting ops
|
||
- r600/sfn: Add omod to AluInstr and assembler
|
||
- r600/sfn: Wire up some omod optimizations
|
||
- nir+r600: add option to avoid contracting fabs into ffma
|
||
- r600/sfn: replace hand coded comparison opts with opt_algebraic
|
||
- r600/sfn: clear PIPE_MAP_UNSYNCRONIZED for partial DS texture writes
|
||
- r600: Fix comparison of strides array when emitting vertex buffers
|
||
- r600/sfn: extract function to update group after instr insert
|
||
- r600/sfn: move some common code into try_readport
|
||
- r600/sfn: Track whether a ALU group has a exec flag update
|
||
- r600/sfn: make sure kill and update_exec don't happen in one group
|
||
- r600/sfn: AR loads are not dependend on the future and other code blocks
|
||
- r600/sfn: Don't start a new ALU-CF if LDS pipeline loads are pending
|
||
|
||
Guilherme Gallo (12):
|
||
|
||
- ci/bare-metal: Fix exit code variable
|
||
- ci/panfrost: Disable DUTs under maintenance
|
||
- Revert "ci/panfrost: Disable DUTs under maintenance"
|
||
- ci: Fix for GitLab 18.2.2 upgrade
|
||
- ci: Disable vmware farm
|
||
- ci/radeonsi: Document a new flake
|
||
- ci/baremetal: Use find_s3_project_artifact on baremetal_build.sh
|
||
- ci/android: Use find_s3_project_artifact in build script
|
||
- ci/android: Use curl-with-retry in build scripts
|
||
- ci/baremetal: Use curl-with-retry in build scripts
|
||
- ci/zink: Document bypassed failures
|
||
- ci: Bump image tags to force recreation of s3 artifacts
|
||
|
||
Gurchetan Singh (13):
|
||
|
||
- gfxstream: null-check in vulkan-mapper
|
||
- gfxstream: vulkan-mapper: special case Nvidia
|
||
- gfxstream: correct Android API level check
|
||
- mesa: define peripheral support for src/util/rust
|
||
- util: rust: make stubs simpler
|
||
- gfxstream: ANDROID --> VK_USE_PLATFORM_ANDROID_KHR
|
||
- vulkan: #if DETECT_OS_ANDROID --> #if defined(VK_USE_PLATFORM_ANDROID_KHR)
|
||
- util: rust: fix some warnings
|
||
- mesa3d: util: rust: add proper stubs
|
||
- util: rust: spelling and whitespace fixes
|
||
- gfxstream: determine page size based on guest properties too
|
||
- virtio: virtgpu_kumquat: clippy fixes
|
||
- gfxstream: delete magma-over-gfxstream
|
||
|
||
Hans-Kristian Arntzen (10):
|
||
|
||
- anti-lag: Only consider timestamps from queues which have presented.
|
||
- anti-lag: Submit timestamps early in a frame.
|
||
- ac/nir: Avoid 0/0 when computing texel buffer size on Polaris.
|
||
- nvk: Return 0 for opaque memory capture replay.
|
||
- nvk: Avoid passing garbage data in descriptor buffers for UBOs.
|
||
- anti-lag: Fix stype for submit2 semaphores.
|
||
- anti-lag: Don't force enable every supported feature on device creation.
|
||
- radv/sqtt: Ensure that present fence gets signalled.
|
||
- anti-lag: Do not enable layer by default.
|
||
- radv: Actually fail custom border color sampler creation.
|
||
|
||
Hsieh, Mike (3):
|
||
|
||
- amd/vpelib: add format, colorspace check function
|
||
- amd/vpelib: bug fix: remove unnecessary free
|
||
- amd/vpelib: add max/min input output capability
|
||
|
||
Hyunjun Ko (18):
|
||
|
||
- vulkan/video: fix to write a h264 slice header for CAVLC mode
|
||
- vulkan/video: fix to set ref_pic_list_modification_flag_l1 correctly
|
||
- anv/video: Fix to set high profile to PPS if high profile provided
|
||
- anv/video: implement GetPhysicalDeviceVideoEncodeQualityLevelPropertiesKHR
|
||
- vulkan/video: align with spec correctly for h265 slice header.
|
||
- anv/video: fix to set some attributes for HCP_PIC_STATE.
|
||
- anv/genxml: the type of POC delta changes correctly
|
||
- anv/video: set short term ref list1 even if P frames provided
|
||
- anv/video: don't set the MVDL1Zero for encoding
|
||
- anv/video: create Motion Vector buffers for encoding too
|
||
- anv/video: add VK_VIDEO_ENCODE_H265_CTB_SIZE_32_BIT_KHR for minimum ctb sizes
|
||
- vulkan/video: fix h265 decoding with LT enabled.
|
||
- vulkan/video: fix h265 encoding with LT enabled.
|
||
- vulkan/video: fix misuse of CLAMP in h265 slice parsing.
|
||
- anv/video: fix to set slice block size correctly for h265 decoding.
|
||
- anv/video: Make the query result for video profiles and formats more precisely.
|
||
- anv/video: remove support for VK_IMAGE_TILING_DRM_FORMAT_MODIFIER_EXT
|
||
- anv/ci: added video tests failures on tgl/jsl
|
||
|
||
Iago Toral Quiroga (2):
|
||
|
||
- nir/serialize: make alu src deserialization consistent for unused swizzles
|
||
- panfrost: fix swapped stats for varing and position shaders
|
||
|
||
Ian Romanick (40):
|
||
|
||
- brw/reg_allocate: Don't access out of bounds in non-debug builds
|
||
- brw: Split virtual GRFs again at the end of optimizations
|
||
- nir/print: Don't segfault checking has_debug_info
|
||
- brw: Add and use brw_reg_is_arf to test for a specific ARF
|
||
- brw: Implement Wa_22012725308 for flags via SWSB too
|
||
- brw: Allow additional flags registers on Xe2+
|
||
- brw: Do cmod prop again after brw_lower_subgroup_ops
|
||
- brw: Don't emit redundant flags initialization for subgroup op lowering
|
||
- brw: Strategically place flags initialization to help cmod prop
|
||
- brw: Use nir_opt_sink and more nir_opt_move
|
||
- elk: Use nir_opt_sink and more nir_opt_move
|
||
- iris: Limit max_shader_buffer_size to INT32_MAX
|
||
- brw: Increase the size of some structure fields in combine_constants
|
||
- elk: Increase the size of some structure fields in combine_constants
|
||
- brw/nir: nir_intrinsic_load_reloc_const_intel may not be scalar [v3]
|
||
- elk: Set lower_txd_data to devinfo
|
||
- nir: Add saturating float to integer conversion opcodes
|
||
- brw: Enable saturating float to integer conversion opcodes
|
||
- elk: Enable saturating float to integer conversion opcodes
|
||
- nir/algebraic: Elide range clamping of f2u sources
|
||
- nir/algebraic: Remove useless ftrunc inside f2i/f2u
|
||
- nir/algebraic: Don't introduce undefined behavior in f2u conversion
|
||
- nir/algebraic: Optimize f2u of negative value to zero
|
||
- nir/algebraic: Prefer bfi over bitfield_select for bitfield_insert
|
||
- nir/range_analysis: Handle bfi and bitfield_select in get_alu_uub
|
||
- brw/disasm: Fix BFN disassembly of src1 and src2
|
||
- brw/disasm: Pretty print the BFN equation as an annotation
|
||
- brw: Basic validation for BFN
|
||
- brw: BFN does not support source modifiers
|
||
- brw: Constant propagation and constant combining support for BFN
|
||
- brw/builder: Add BFN
|
||
- brw/cmod: Enable limited cmod propagation for BFN
|
||
- brw: Use BFN to implement nir_opt_bitfield_select
|
||
- nir/algebraic: Optimize bfi with odd-valued mask to bitfield_select
|
||
- brw: elk: Fix name of function in comment
|
||
- brw: Mark src3 of BFN as is_control_source
|
||
- brw: Don't do non-obvious things with BFN parameter ordering
|
||
- brw: Apply Gfx9 vgrf127 workaround in more cases
|
||
- elk: Apply vgrf127 workaround in more cases
|
||
- brw: Correctly generate conditional modifier for BFN
|
||
|
||
Icenowy Zheng (4):
|
||
|
||
- pvr: fix for GCC
|
||
- pvr: implement samplerAnisotropy
|
||
- gallivm: orcjit: put object cache under the protect of lookup_mutex
|
||
- gallivm: orcjit: remember Context in addition to ThreadSafeContext
|
||
|
||
Igor Naigovzin (1):
|
||
|
||
- zink: fix clamping gl_Layer output to 0 when framebuffer is not layered
|
||
|
||
Iliyan Dinev (3):
|
||
|
||
- pvr: fix pvr_CmdResetQueryPool barriers
|
||
- pvr: add support for VK_FORMAT_X8_D24_UNORM_PACK32
|
||
- pvr: re-emit ppp state update when ds depth bits are set
|
||
|
||
Iván Briano (15):
|
||
|
||
- intel: Re-disable ray tracing on 32 bits
|
||
- anv: check for pending_db_mode when dirtying descriptor mode
|
||
- anv: dirty descriptor state on CmdSetDescriptorBufferOffets
|
||
- anv: fix capture/replay of sparse images with descriptor buffer
|
||
- anv, hasvk: allow using a 3D image as a resolve target
|
||
- anv: pass only isl_format to helper functions
|
||
- anv: drop EXT from host_image_copy stuff
|
||
- anv: handle multiple aspects in vkCopyImageToImage
|
||
- anv: drop height_pitch parameter from anv_copy_image_memory
|
||
- anv: intermediate RGB <-> RGBX copy for HIC
|
||
- anv: fix FS output <-> attachment map building
|
||
- anv: use the color_map if present for calculating color_mask
|
||
- anv: handle compiling of mesh shader separately from task shader
|
||
- brw/mesh: drop brw_tue_map::per_task_data_start_dw
|
||
- anv: report maint5::earlyFragment*SampleCounting correctly
|
||
|
||
James Fitzpatrick (2):
|
||
|
||
- pvr: update WClamp value to 1.0e-13f
|
||
- pvr: add support for (EXT|KHR)_line_rasterization
|
||
|
||
Janne Grunau (1):
|
||
|
||
- hk: Report the correct plane count in VkDrmFormatModifierProperties2?EXT
|
||
|
||
Jarred Davies (3):
|
||
|
||
- pvr: Disable PBE resolve on cores without gs_rta_support
|
||
- pvr: Reduce number of stencil dependency barriers needed
|
||
- pvr: Mark barrier load subcmd as not empty
|
||
|
||
Jason Macnak (4):
|
||
|
||
- gfxstream: Add gfxstream TLS connection manager reset
|
||
- gfxstream: add a vkTraceAsyncGOOGLE
|
||
- gfxstream: hide vkTraceAsyncGOOGLE behind new capset flag
|
||
- gfxstream: Address some Werror errors from ag/35389434
|
||
|
||
Jeffrey Zhuang (1):
|
||
|
||
- zink: remove ALWAYS_INLINE from zink_batch_usage_unflushed_wait
|
||
|
||
Jeongik Cha (1):
|
||
|
||
- gfxstream: Generate goldfish dispatch code for AHB extension
|
||
|
||
Jesse Natalie (19):
|
||
|
||
- gallium/aux: nir_lower_pstipple_fs progress and metadata
|
||
- microsoft/compiler: Use NIR_PASS instead of NIR_PASS_V
|
||
- microsoft/clc: Use NIR_PASS instead of NIR_PASS_V
|
||
- dozen: Use NIR_PASS instead of NIR_PASS_V
|
||
- d3d12: Use NIR_PASS instead of NIR_PASS_V
|
||
- winsys/d3d12: Use DComp swapchains to support transparency
|
||
- nir: Add missing #include for c99_alloca.h
|
||
- util: Disable inline asm for arm64 for MSVC
|
||
- d3d12: Stop using util_framebuffer_init
|
||
- d3d12: Support more logic op formats
|
||
- d3d12: Move logicop emulation resource from surface to resource
|
||
- d3d12: Move logicop descriptor initialization to after all blits
|
||
- d3d12: Flush command queue when destroying or resizing
|
||
- wgl: Always revalidate framebuffer when front is requested
|
||
- d3d12: Only use DComp swapchains when alpha is present in the framebuffer
|
||
- wgl: Fix zink depth buffers
|
||
- dlist: Flush the context during EndList if it's part of a share group and uploaded during recording
|
||
- microsoft/compiler: Use lower_mem_access_bit_sizes for scratch/shared
|
||
- microsoft/compiler: Respect write masks when lowering unaligned loads and stores
|
||
|
||
Jianxun Zhang (7):
|
||
|
||
- anv: No compression on host memory allocation (xe2)
|
||
- anv: Fix PAT entry in importing (xe2)
|
||
- iris: Disable compression on sharing without modifier
|
||
- iris: Ensure type of bo's heap is consistent with modifier
|
||
- iris: Assert no disabling aux in first query (xe2)
|
||
- isl: Reuse Xe2 modifers on newer platforms
|
||
- iris: Enable Xe2 modifiers on all newer platforms
|
||
|
||
Job Noorman (75):
|
||
|
||
- ir3/cp: disable cat3 hw bug workaround on a6xx+
|
||
- freedreno: remove ir3_cmdline
|
||
- ir3/legalize: add asserts to prevent OOB array access
|
||
- ir3/postsched/legalize: ignore prefetch sam dummy src
|
||
- ir3: use dummy dst for descriptor prefetches
|
||
- ir3/shared_ra: don't reuse src of different halfness
|
||
- tu: add constlen shader stat
|
||
- ir3/a750: don't allocate const space for primitive_param/map
|
||
- ir3: treat consts_ubo as normal UBO
|
||
- tu: remove consts_ubo upload code
|
||
- freedreno/a7xx: disable consts_ubo upload
|
||
- tu: disable VK_EXT_post_depth_coverage
|
||
- tu: enable fragmentShadingRateWithShaderSampleMask
|
||
- ir3/legalize: prevent infinite loop when inserting (ss)nop
|
||
- ir3/ra: fix file start wraparound
|
||
- ir3: add pointer from ir3_shader_variant to ir3_shader
|
||
- ir3: add shader bisect debug tool
|
||
- v3d/drm-shim: add support for multisync
|
||
- nir/opt_uniform_subgroup: use ballot_bit_count
|
||
- ir3: allow 2 const srcs in scalar cat2
|
||
- ir3: align alias sequences to work around hardware bug
|
||
- ir3: don't add array stores to block keeps
|
||
- ir3: allow shared srcs for ldc
|
||
- ir3: use isam for txf with LOD 0
|
||
- ir3/array_to_ssa: fix updating/removing phis
|
||
- ir3/array_to_ssa: remove trivial all-undef phis
|
||
- ir3: allow shared srcs for ldc.k
|
||
- ir3: use ir3_get_predicate for demote/kill
|
||
- ir3: use shared srcs for demote/kill condition
|
||
- ir3/legalize: don't special-case early-preamble a1 reads
|
||
- ir3: make backend aware of scalar predicates
|
||
- ir3/isa: add encoding for scalar predicates
|
||
- ir3/opt_predicates: move some helpers up
|
||
- ir3: enable scalar predicates
|
||
- tu: pass SSBO/UBO min alignment to SPIR-V frontend
|
||
- nir: add nir_src_is_deref helper
|
||
- nir: add offset_shift intrinsic index
|
||
- nir: add some helpers for dealing with offset_shift
|
||
- nir,ir3: add offset_shift index to SSBO access intrinsics
|
||
- nir/lower_atomics: add support for offset_shift
|
||
- nir/lower_io_to_scalar: add support for offset_shift
|
||
- nir/lower_wrmasks: don't adjust BASE
|
||
- nir/lower_wrmasks: add support for offset_shift
|
||
- nir/opt_shrink_vectors: add support for offset_shift
|
||
- nir/lower_mem_access_bit_sizes: add partial support for offset_shift
|
||
- nir/opt_load_store_vectorize: allow per-instruction offset scaling
|
||
- nir/opt_load_store_vectorize: add support for offset_shift
|
||
- nir/opt_load_store_vectorize: fix wrap check for scaled offsets
|
||
- nir/lower_explicit_io: make offset calculation reusable
|
||
- nir/lower_explicit_io: add helper to build address
|
||
- nir/lower_explicit_io: use nir_io_offset to pass around addresses
|
||
- nir/lower_explicit_io: add alignment parameters to address builder
|
||
- nir/lower_explicit_io: add support for offset_shift
|
||
- ir3: use offset_shift for SSBO intrinsics
|
||
- ir3: don't vectorize nir_op_sdot_4x8_iadd[_sat]
|
||
- ir3: emit descriptor prefetch in block dominated by its sources
|
||
- freedreno/drm-shim: disable VM_BIND
|
||
- ir3: use shared masks for cov when scalar ALU is supported
|
||
- freedreno/computerator: fix cs builder conversion errors
|
||
- nir/opt_offsets: rename max_offset_data to cb_data
|
||
- nir/opt_offsets: add callback to set need_nuw per intrinsic
|
||
- ir3/cf: don't swap signedness of (sat) instructions
|
||
- ir3: use nir_lower_bit_size for 8-bit bit_count
|
||
- bin/rb: update Alyssa's email address in test case
|
||
- ir3/spill: initialize base reg as late as possible
|
||
- ir3/ra: make main shader reg select independent of preamble
|
||
- ir3: don't create merge sets for subreg moves
|
||
- ir3/parser: don't use instr as ralloc context
|
||
- freedreno/computerator: disable disk cache
|
||
- nir: add nir_shr builder
|
||
- nir/lower_alu: use Knuth's Algorithm M for [iu]mul_high
|
||
- nir,ir3: rename umul_low to umul_16x16
|
||
- nir: mark fneg distribution through fadd/ffma as nsz
|
||
- ir3/ra: fix assert during file start reset
|
||
- spirv: don't set in_bounds for structs
|
||
|
||
John Anthony (4):
|
||
|
||
- nir,agx: unvendor core_id_agx
|
||
- nir,spirv: Add support for SPV_ARM_core_builtins
|
||
- pan/va: Add support for SPV_ARM_core_builtins
|
||
- panvk: Enable VK_ARM_shader_core_builtins
|
||
|
||
Jonathan Marek (1):
|
||
|
||
- wsi/display: use atomic mode setting
|
||
|
||
Jordan Justen (6):
|
||
|
||
- intel/dev: Add WCL platform enum
|
||
- intel/dev/mesa_defs.json: Add WCL WA entries
|
||
- intel/dev: Add WCL device info
|
||
- intel/dev: Add WCL PCI IDs
|
||
- intel/dev: Add BMG 0xe209 PCI ID
|
||
- anv: Use image view base-layer in can_fast_clear_color_att()
|
||
|
||
Jose Maria Casanova Crespo (13):
|
||
|
||
- v3dv: Move V3D_TFU_READAHEAD_SIZE to src/broadcom/common
|
||
- v3d: Add V3D_TFU_READAHEAD padding for allocated resources
|
||
- v3dv: limit V3D_TFU_READAHEAD to buffers/images with USAGE_TRANSFER_SRC flag
|
||
- v3d: glMemoryBarriers only flush jobs with tmu_dirty_rcl
|
||
- v3d: Mark DIRTY_ZSA if disable_ez is changed from FS.
|
||
- v3d: Reduce CLE submission of CLIP_WINDOW packets
|
||
- v3d: Add V3D_TFU_READAHEAD padding for renderonly resources
|
||
- vc4/simulator: pass and return sim_file on vc4_simulator init/destroy
|
||
- vc4/simulator: avoid free simulator memory on destroy
|
||
- v3dv: Fix stencil clear values for only stencil clears
|
||
- v3d: Don't enable Early-z with discards when stencil updates are enabled
|
||
- v3d: use helpers util_writes_depth/stencil
|
||
- v3d: mark FRAG_RESULT_COLOR as output_written on SAND blits FS
|
||
|
||
Josh Simmons (2):
|
||
|
||
- util: Fix \`BITSET_EXTRACT` out-of-bounds read
|
||
- radv: Fix crash in sqtt due to uninitalized value
|
||
|
||
Joshua Ashton (5):
|
||
|
||
- wsi/common: Track VkColorSpaceKHR with wsi swapchain
|
||
- wsi/display: Implement VK_EXT_hdr_metadata on KHR_display swapchain
|
||
- wsi/display: Clean up DRM hdr/color state on swapchain destruction
|
||
- build: Add dependency on libdisplay-info
|
||
- wsi/display: Expose HDR10 colorspace based on EDID
|
||
|
||
Joshua Simmons (1):
|
||
|
||
- vtn: Fix OpCopyLogical destination type
|
||
|
||
José Roberto de Souza (23):
|
||
|
||
- intel/brw: Nuke unused brw_message_desc_header_present()
|
||
- intel/brw: Add comment to reg_unit()
|
||
- intel/brw: Remove duplicated implementation of brw_imm_uq/brw_imm_u64()
|
||
- gallium/llvmpipe/test: Rename rsqrtf() to _rsqrtf()
|
||
- intel/decode: Add support to new version of Xe KMD devcoredump with canonical addresses
|
||
- intel/brw: Use ASR over SHR for SHADER_OPCODE_ISUB_SAT
|
||
- intel/brw: Move brw_s0() to brw_reg.h
|
||
- anv/allocator: Move definition of ANV_FREE_LIST_EMPTY to anv_allocator
|
||
- anv/allocator: Drop uncessary function
|
||
- anv/allocator: Change some parameters and variables from 32bit to 64bits
|
||
- anv/allocator: Don't call anv_block_pool_map() with an offset that includes start_offset
|
||
- anv/allocator: Subtract start_offset in chunk_offset
|
||
- anv: Add comment to anv_state->offset
|
||
- anv: Define bt_block only in the block that uses it in anv_cmd_buffer_alloc_binding_table()
|
||
- anv: Replace duplicated code set shader relocs by a function
|
||
- anv: Drop shader relocs from anv_shader_bin_create()
|
||
- anv: Simply anv_shader_set_relocs() parameters
|
||
- anv: Rename anv_shader_bin to anv_shader_internal
|
||
- intel/brw: Share mode code in lower_lsc_varying_pull_constant_logical_send()
|
||
- intel/brw: Add comment to first_non_payload_grf
|
||
- intel/brw: Fix LSC fence scope and flush type
|
||
- intel/brw: Call lower_hdc_memory_fence_and_interlock() with brw_send_inst
|
||
- intel/brw: Store and set sfid in memory fences
|
||
|
||
Juan A. Suarez Romero (20):
|
||
|
||
- broadcom/ci: disable baremetal jobs for ci-tron
|
||
- v3d/ci: unlock rusticl citron jobs
|
||
- broadcom: remove obvious comment
|
||
- drm-uapi: update v3d_drm.h for reset counters
|
||
- broadcom: check for GPU reset counters support
|
||
- broadcom/simulator: add support for GPU reset counters
|
||
- v3d: implement get device reset status
|
||
- v3d: handle QUNIFORM_GET_UBO_SIZE
|
||
- v3d: implement robust buffer access
|
||
- broadcom/ci: disable baremetal rusticl jobs for ci-tron
|
||
- meson: check for no_sanitize function attributes
|
||
- util: add DECLARE_LINEAR_ZALLOC with no sanitize
|
||
- glsl: disable UBSan vptr check for ir_instruction
|
||
- broadcom/ci: comment some of the failures
|
||
- broadcom/ci: unlock CI-Tron jobs for arm32
|
||
- v3d/ci: update expected results
|
||
- ci: uprev VKCTS to 1.4.3.3
|
||
- glsl: use array element type to validate assignment
|
||
- vc4/ci: disable asan job
|
||
- v3d/v3dv/ci: switch to asan rpi5
|
||
|
||
Julia Zhang (2):
|
||
|
||
- virgl: Small fix of converting format
|
||
- pps: init driver in OnSetup
|
||
|
||
Julian Orth (2):
|
||
|
||
- ci: build and install native libwayland
|
||
- kms-swrast: export dmabufs with DRM_RDWR
|
||
|
||
Juston Li (3):
|
||
|
||
- anv/android: refactor anb resolve to fix align assertion
|
||
- anv: fix uninitialized mutex lock in anv_slab_bo_deinit()
|
||
- android/gralloc0: add CROS_GRALLOC_DRM_GET_BUFFER_COLOR_INFO
|
||
|
||
K900 (1):
|
||
|
||
- gfxstream: fix build on 32-bit
|
||
|
||
Karmjit Mahil (10):
|
||
|
||
- freedreno/registers: Fix SP_READ_SEL_LOCATION
|
||
- pvr: fix spm-related renderpass hwr
|
||
- pvr: Remove shareds_dest_offset from load_op
|
||
- pvr: Move renderpass load op setup into a separate function
|
||
- nir: Add more matches for \`fmulz`
|
||
- nir, ir3: Add \`lower_fmulz_with_abs_min` backend option
|
||
- freedreno/registers: Fix typo
|
||
- tu: Add VK_EXT_zero_initialize_device_memory
|
||
- ci,crnm: Fix f-string print error
|
||
- freedreno/decode: Add 2d_to_json lua script
|
||
|
||
Karol Herbst (125):
|
||
|
||
- vtn/opencl: set exact on all ffmas and mads
|
||
- zink: disallow intensity buffer images
|
||
- zink: disable shader images for intensity formats
|
||
- rusticl/mem: set swizzle for intensity images
|
||
- rusticl/mesa: add return status to PipeFence::wait
|
||
- rusticl/queue: offload waiting on fences to another thread
|
||
- rusticl/mem: relax flags validation for clGetSupportedImageFormats
|
||
- rusticl/queue: do not return event status errors on flush/finish
|
||
- rusticl/kernel: fix clippy lint needless-question-mark
|
||
- zink: properly unbind sampler views with imported 2D resource
|
||
- rusticl/mesa: use pipe_sampler_view_reference
|
||
- rusticl/queue: clear shader images when destroying queues
|
||
- rusticl/queue: pass a mut reference to QueueContext around
|
||
- rusticl/queue: commit lifetime crimes
|
||
- rusticl/queue: remove RefCell<QueueKernelState>
|
||
- rusticl/kernel: stop clearing sampler views on kernel launches
|
||
- rusticl/queue: cache samplers
|
||
- rusticl/kernel: unbind trailing shader images
|
||
- nak: fix wrong argument order in calls to build_txq_size
|
||
- nak: optimize load_subgroup_id
|
||
- nv50: fully migrate away from util_framebuffer_init
|
||
- nak: use MemScope::CTA for shared memory scoped SCOPE_WORKGROUP barriers
|
||
- nak: copy late_algebraic iadd3 rules without the constant restriction
|
||
- rusticl: fix impl_trait_overcaptures lint errors
|
||
- rusticl: fix unsafe_attr_outside_unsafe lint errors
|
||
- rusticl: add lints relevant for edition 2024 migration
|
||
- rusticl: use pipe_sampler_view_release
|
||
- rusticl/mesa: wire up fence_server
|
||
- rusticl/gl: store the mesa_glinterop_export_in
|
||
- st/interup: flushing objects is a no-op when no context is bound
|
||
- rusticl/gl: only flush objects on import if we get a valid fd
|
||
- rusticl/gl: flush and wait on gl objects inside clEnqueueAcquireGLObjects
|
||
- vulkan: use p_atomic_read on vk_descriptor_set_layout::ref_cnt
|
||
- zink: fix data race in descriptor_util_pool_key_get
|
||
- rusticl: silence warnings in generated sources
|
||
- rusticl: silence new warnings from rustc versions above our rustc target
|
||
- anv: do not map from_host_ptr bos in image_bind_address
|
||
- zink: set zink_bo is_user_ptr on creation
|
||
- anv/i915: print bo->map when dumping exec buffers bos
|
||
- nak: set max_gpr to multiple of 8s
|
||
- nak: add more helpers for predicates
|
||
- nak: relayout opt_uniform_instrs
|
||
- nak: support bra.u with a upred source on Ampere and newer
|
||
- rusticl/mesa: add ResourceType::Immutable
|
||
- rusticl/kernel: create shader constants as immutable
|
||
- rusticl/mem: split out mem_flags validation for creation operations
|
||
- rusticl/mem: turn bool argument into enum in validate_mem_flags
|
||
- rusticl: implement cl_ext_immutable_memory_objects
|
||
- rusticl: fix a bunch of warnings
|
||
- rusticl/util: add read_and_advance methods for pointers
|
||
- rusticl/util: use read_and_advance in Properties
|
||
- rusticl/util: drop uneccesary Arc in event_list_from_cl
|
||
- rusticl/icd: qualify CLResult inside impl_cl_type_trait_base macro
|
||
- rusticl/icd: sort extension functions by extension name
|
||
- rusticl: handle failures when importing fences
|
||
- rusticl/mesa: port PipeFence to use ThreadSafeCPtr
|
||
- rusticl: specify FD type when importing fences
|
||
- nak: run nir_opt_move nir_move_load_ubo
|
||
- nak: run nir_opt_move nir_move_comparisons
|
||
- rusticl: add SPDX tags
|
||
- aux/trace: move fence_server calls outside the locked area
|
||
- nak: rework scale argument of compute_mat and rename it
|
||
- nak: protect static cycle counting against overflows
|
||
- nak: use logarithmic scaling in estimate_block_weight
|
||
- nak: extract nir_intrinsic_cmat_load lowering into a function
|
||
- nak/hw_runner: support shared memory
|
||
- nak/hw_runner: add ldsm tests
|
||
- nak: use ldsm
|
||
- rusticl/mesa: rename PipeResource to PipeResourceOwned
|
||
- rusticl/mesa: add borrow/to_owned semantics to our pipe_resource wrapper
|
||
- rusticl/kernel: reduce CPU overhead of set_global_binding
|
||
- rusticl/kernel: move add_pointer into KernelExecBuilder
|
||
- rusticl/kernel: move add_global into KernelExecBuilder
|
||
- rusticl/kernel: move add_sysval into KernelExecBuilder
|
||
- rusticl/kernel: add KernelExecBuilder::add_values
|
||
- rusticl/kernel: add KernelExecBuilder::add_zero_padding
|
||
- rusticl/kernel: add KernelExecBuilder::get_resources_and_globals
|
||
- rusticl/kernel: move workgroup id offset handling into KernelExecBuilder
|
||
- rusticl/kernel: add KernelExecBuilder::input
|
||
- rusticl/kernel: allocate the full input buffer at creation time
|
||
- rusticl/kernel: rework KernelExecBuilder::get_resources_and_globals to reduce allocations
|
||
- rusticl/device: add DeviceCaps::has_create_fence_fd and use it
|
||
- docs/gallium: Clarify ordering requiremenets on fence_server_signal and fence_server_sync
|
||
- rusticl/event: fix create_and_queue for deps in error states
|
||
- rusticl/util: add MultiValProperties
|
||
- gallium/noop: add fence_server_signal
|
||
- gallium: add pipe_screen::semaphore_create
|
||
- rusticl/mesa: wire up semaphores
|
||
- zink: factor out fence creation function
|
||
- zink: implement pipe_screen::semaphore_create
|
||
- radeonsi: implement pipe_screen::semaphore_create
|
||
- rusticl: add stubs for semaphores and external_memory
|
||
- rusticl: implement cl_khr_semaphore
|
||
- rusticl: implement cl_khr_external_semaphore
|
||
- util: move typed_memcpy into macros.h
|
||
- nvk: prepare for higher shared memory sizes
|
||
- nouveau/winsys: add shared memory size tables
|
||
- nak/qmd: base shared mem size allocation on hardware limits
|
||
- nvk: use hardware limits for maxComputeSharedMemorySize
|
||
- nak/qmd: properly set target shared mem size
|
||
- rusticl: drop unneeded dependency to generated sources
|
||
- rusticl: drop global allow statements
|
||
- rusticl: specify allowed lints for tests in lib.rs
|
||
- rusticl: add a bunch of trivial tests
|
||
- rusticl/mem: fix Image::read for 1Darray images
|
||
- rusticl/mesa: fix NULL pointer access in set_constant_buffer_stream
|
||
- ac/llvm: fix get_global_address for global atomics
|
||
- rusticl: reference resource in sampler and image view wrappers
|
||
- ci: document what version to specify in RUST_VERSION
|
||
- rusticl/util: make ThreadSafeCPtr Copy, Clone and transparent
|
||
- rusticl/mesa: add PipeScreen::pipe
|
||
- rusticl/mesa: rework Context creation
|
||
- rusticl/mesa: make PipeScreen transparent
|
||
- rusticl/mesa: make PipeScreen refcounted
|
||
- libagx: fix heap argument type in libagx_draw_robust_index
|
||
- clc: Fix createDiagnostics for LLVM-22
|
||
- nak: extract cmat load/store element offset calculation
|
||
- nak: ensure deref has a ptr_stride in cmat load/store lowering
|
||
- nak: fix MMA latencies on Ampere
|
||
- st/interop: fix fence leak
|
||
- rusticl/queue: fix error code for invalid queue properties part 1
|
||
- rusticl/queue: fix error code for invalid queue properties part 2
|
||
- rusticl/queue: fix error code for invalid sampler kernel arg
|
||
- rusticl/kernel: take no kernel_info reference inside the launch closure
|
||
- rusticl/spirv: preserve signed zeroes by default
|
||
|
||
Kenneth Graunke (45):
|
||
|
||
- brw: Refactor copy propagation checks for EOT send restrictions
|
||
- brw: Fix units in copy propagation EOT restriction size calculation
|
||
- brw: Update copy propagation into EOT sends handling for Xe2 units
|
||
- crocus: Drop 16X MSAA code remnants
|
||
- crocus: Fix a comment about supporting 16x MSAA
|
||
- intel: Disable 16x MSAA support on Xe3
|
||
- brw: Use BAD_FILE instead of ARF null for second send payload
|
||
- brw: Assert that EOT is always SHADER_OPCODE_SEND on pre-Xe3
|
||
- brw: Stop checking inst->is_send_from_grf() for g127 register hack
|
||
- brw: Stop using is_send_from_grf() in CSE pass
|
||
- brw: Drop inst->mlen check from is_send()
|
||
- brw: Rename is_send_from_grf to is_send, replace other is_send() helper
|
||
- brw: Properly resolve non-sendable sources in a few logical opcodes
|
||
- brw: Enumerate SHADER_OPCODE_SEND sources and standardize how many
|
||
- brw: Drop INTERPOLATE_AT_* opcodes from is_send()
|
||
- brw: Drop interlock and memory fence logical opcodes from is_send()
|
||
- brw: Drop uniform pull constant load virtual opcode from is_send()
|
||
- brw: Drop INTERPOLATE_AT_* opcodes from is_payload()
|
||
- brw: Drop interlock and memory fence logical opcodes from is_payload()
|
||
- brw: Validate that send payloads can't be imms or have source mods
|
||
- brw: Remove brw_inst::no_dd_check/no_dd_clear
|
||
- nir: Add load_simd_width_intel to divergence analysis
|
||
- intel/nir: Make ffma peephole optimization preserve fp_fast_math flags
|
||
- brw: Move "SSA form" printing to after divergence analysis is run
|
||
- brw: Lower certain subgroup size modes in brw_preprocess_nir
|
||
- brw: Split brw_postprocess_nir() into two pieces
|
||
- brw: Do most of NIR postprocessing before cloning for SIMD variants
|
||
- brw: Add a quick NIR-based register pressure estimate pass
|
||
- brw: Skip compilation of larger SIMDs when pressure is too high
|
||
- iris/ci: Update trace checksums
|
||
- brw: Only skip SIMD widths based on pressure if an smaller one compiled
|
||
- elk: Delete ELK_SHADER_RELOC_DESCRIPTORS_ADDR_HIGH
|
||
- brw: Rename brw_shader_reloc to intel_shader_reloc
|
||
- intel: Move intel_shader_reloc to common code and drop elk_shader_reloc
|
||
- brw: Drop ir_expression_operation_h from build system
|
||
- brw: Rename brw_nir_trig build target to brw_nir_workarounds
|
||
- intel: Make a libintel_compiler_nir internal static library
|
||
- intel: Re-unify brw_prim.h and elk_prim.h
|
||
- brw: Drop compiler/ from brw includes
|
||
- brw: Move into a new src/intel/compiler/brw subdirectory
|
||
- brw: Stop using type_size_dvec4 for fragment shader outputs
|
||
- brw: Replace type_size_xvec4 with glsl_count_attribute_slots
|
||
- brw: Refactor clip/cull distance mask setting into a helper
|
||
- brw: Use BITFIELD_{MASK,RANGE} in clip/cull distance mask handling code
|
||
- brw: Fix mesh shader asserts in clip/cull distance setting
|
||
|
||
Konstantin Seurer (63):
|
||
|
||
- radv: Optimize ray tracing position fetch
|
||
- radv: Disable pointer flags and the GFX12 WA for emulated RT
|
||
- radv: Implement watertightness for emulated RT
|
||
- radv/rt: Optimize emulated ray-triangle tests
|
||
- radv/rt: Use inv_dir for software ray-triangle tests
|
||
- radv/rt: Implement null acceleration structure in shader code
|
||
- radv/rra: Only write used BLAS
|
||
- radv/rra: Increase rra_validation_context::location
|
||
- radv/rra/gfx12: Handle box nodes without children
|
||
- radv/rra/gfx12: Add validation
|
||
- gallivm: Silence a warning
|
||
- gallium/util: Fix an assert in util_resource_copy_region
|
||
- lavapipe: Adjust imageGranularity for block formats
|
||
- lavapipe/ci: Add context to some vkd3d-proton test fails
|
||
- lavapipe: Set image_array for input attachment loads
|
||
- gallivm: Implement txs with divergent explicit lod
|
||
- gallivm: Implement arrayed non-arrayed descriptor compatibility
|
||
- util: Fix sparse tile size when dimensions=1
|
||
- lavapipe/rt: Fix watertightness for real this time
|
||
- lavapipe/rt: Set push_constant_size
|
||
- lavapipe/rt: Do not use vk_acceleration_structure::size
|
||
- radv: Add and use RADV_OFFSET_UNUSED
|
||
- radv: Only write leaf node offsets when required
|
||
- radv/bvh: Fix flush in bit_writer_skip_to
|
||
- radv/bvh: Use a fixed indices midpoint on GFX12
|
||
- radv: Initialize base IDs when doing a BVH update with src!=dst
|
||
- radv/bvh: Update leaf nodes before refitting
|
||
- radv/bvh: Specialize the update shader for geometryCount==1
|
||
- vulkan/cmd_queue: Do not free if driver_free_cb is provided
|
||
- vulkan/cmd_queue: Improve struct free code indentation
|
||
- vulkan/cmd_queue: Recursively free struct members
|
||
- vulkan/cmd_queue: Clean up generating copies
|
||
- vulkan/cmd_queue: Reorder memcpy in get_struct_copy
|
||
- radv: Use vk_acceleration_struct_vtx_format_supported
|
||
- lavapipe: Use vk_acceleration_struct_vtx_format_supported
|
||
- radv/rra/gfx12: Handle compressed primitive nodes
|
||
- radv: Emit compressed primitive nodes on GFX12
|
||
- vulkan: Add MESA_VK_SHADER_STAGE_ALL
|
||
- lavapipe: Mask invalid shader stage flags
|
||
- radv: Rename radv_printf files to radv_debug_nir
|
||
- radv: Add RADV_DEBUG=validatevas for address validation in nir
|
||
- radv: Store parent node IDs inside nodes on GFX12
|
||
- radv/bvh: Copy parent_id during updates on GFX12
|
||
- nir: Use nir_def_as_* in more places
|
||
- nir: Use nir_def_block in more places
|
||
- radv/bvh: Do not write pointer flag related data on GFX103
|
||
- vulkan: Use a struct for debug markers
|
||
- vulkan: Add more detail to encode debug markers
|
||
- radv: Use vk_barrier_compute_w_to_compute_r more
|
||
- radv,vulkan: Avoid a useless barrier in radv_update_bind_pipeline
|
||
- nir/opt_ray_queries: Cleanup and return if functions is not singular
|
||
- vulkan/bvh: Enable glsl extensions in meson
|
||
- vulkan/cmd_queue: Remove unused variable
|
||
- vulkan/cmd_queue: Handle internal structs
|
||
- vulkan/cmd_queue: Handle struct arrays with pNext
|
||
- Revert "lavapipe/ci: Disable stack-use-after-return detection for ASan"
|
||
- vulkan/vk_cmd_queue: Clone VkSampleLocationsInfoEXT extending VkRenderingInfo
|
||
- aco: Fixup out_launch_size_y in the RT prolog for 1D dispatch
|
||
- lavapipe: Bump maxPrimitiveCount
|
||
- lavapipe: Zero image null descriptors
|
||
- lavapipe: Bump MAX_DESCRIPTOR_UNIFORM_BLOCK_SIZE
|
||
- gallivm/nir/soa: Use the sign of src1 for imod
|
||
- llvmpipe: Always recompute 1/w
|
||
|
||
Kovac, Krunoslav (2):
|
||
|
||
- amd/vpelib: Fix Possible dereferencing null
|
||
- amd/vpelib: Minor Refactor
|
||
|
||
Lars-Ivar Hesselberg Simonsen (20):
|
||
|
||
- u_trace: Indirect capture fixes
|
||
- panvk: Fix instrumentation on v12+
|
||
- panvk: Fix IUB decode
|
||
- panvk/utrace: Pass async_op instead of mask
|
||
- panvk/utrace: Make indirect capture wait optional
|
||
- panvk/utrace: Add support for storing registers
|
||
- panvk/utrace: Add sync32/64_wait support
|
||
- panvk/utrace: Add sync32/64_add support
|
||
- panvk/utrace: Add flush_cache support
|
||
- panvk: Add utrace tracepoints in queue_submit
|
||
- vulkan: Stop combining subpass dependencies
|
||
- vulkan: Find first_subpass when creating renderpass
|
||
- vulkan: Add transition_view_mask calculation
|
||
- vulkan: Optimize implicit begin_subpass barrier
|
||
- vulkan: Optimize implicit end_subpass barrier
|
||
- panvk/ci: Add uncovered CTS issue to flakes
|
||
- radv/ci: Add uncovered CTS issue to gfx1201 fails
|
||
- panvk: Fix IUB decode
|
||
- pan/format: Fix mapping for I16F
|
||
- pan/format: Disable PAN_BIND_STORAGE_IMAGE for RGBA4/BGRA4
|
||
|
||
Leder, Brendan Steve (Brendan) (1):
|
||
|
||
- amd/vpelib: General cleanup / optimization tasks
|
||
|
||
Lewis Cooper (2):
|
||
|
||
- pvr: Implement VK_KHR_maintenance3
|
||
- pvr: Implement VK_KHR_dedicated_allocation
|
||
|
||
LingMan (7):
|
||
|
||
- ci/rust: Drop date from Rust release channel selection
|
||
- docs/rusticl: Update documented version requirements for meson and bindgen
|
||
- mesa: Bump required Rust version to 1.82
|
||
- rusticl: Use \`is_aligned` from std
|
||
- rusticl: Drop include paths for \`size_of`, \`size_of_val`, and \`align_of`
|
||
- rusticl: Use std::mem::offset_of!()
|
||
- nak: Drop include paths for \`size_of` and \`size_of_val`
|
||
|
||
Lionel Landwerlin (148):
|
||
|
||
- anv: reuse runtime descriptor set layout base object
|
||
- anv: remove unused helper arguments
|
||
- brw: fix NIR metadata invalidation with closest-hit shaders
|
||
- brw: fixup source depth enabling with coarse pixel shading
|
||
- brw: fixup coarse_z computation
|
||
- brw: consider LOAD_PAYLOAD fully defined
|
||
- brw: always ensure coarse pixel is disabled on Gfx9
|
||
- anv: fix wsi image aliasing
|
||
- compiler: add gl_shader_stage_is_graphics
|
||
- brw: make more passes printable through NIR_DEBUG
|
||
- anv: move over to common descriptor set & pipeline layouts
|
||
- anv: expose helper function outside of anv_pipeline.c
|
||
- anv: rename vertex input emission helper
|
||
- anv: reuse runtime flags field for descriptor set layout
|
||
- anv: make anv_pipeline_sets_layout looks more like vk_pipeline_layout
|
||
- anv: stop using anv_pipeline_sets_layout
|
||
- anv: extract embedded samplers from pipeline_cache
|
||
- anv: break ANV_CMD_DIRTY_PIPELINE into each stage
|
||
- anv: avoid storing L3 config on the pipeline
|
||
- intel: move deref_block_size to intel_urb_config
|
||
- intel: reuse intel_urb_config for mesh
|
||
- anv: store layout_type on the bind_map for convenience
|
||
- anv: move URB programming to dynamic emission path
|
||
- anv: avoid looking at the pipeline to flush push descriptors
|
||
- anv: constify some helpers
|
||
- anv: store gfx/compute bound shaders on command buffer state
|
||
- meson: remove intel-clc options
|
||
- brw: implement ACCESS_COHERENT on Gfx12.5+
|
||
- anv: fix source hash utrace prints
|
||
- anv/brw: store min_sample_shading on wm_prog_data
|
||
- anv/brw: move sample_shading_enable to wm_prog_data
|
||
- anv: move primitive_replication emission to dynamic path
|
||
- anv: move 3DSTATE_SF dynamic emission path
|
||
- anv: simplify SBE emission
|
||
- anv: move SBE emission to dynamic path
|
||
- anv: move 3DSTATE_CLIP emission to dynamic path
|
||
- anv: move 3DSTATE_VFG emission to dynamic path
|
||
- anv: move 3DSTATE_TE::TessellationDistributionMode to dynamic path
|
||
- anv: pass active stages to push descriptor flushing
|
||
- anv: remove pipeline_stage unused field
|
||
- anv: use a local variable for batch
|
||
- anv: actually use the COMPUTE_WALKER_BODY prepacked field
|
||
- anv: rework gfx state emission (again)
|
||
- anv: subclass vk_pipeline
|
||
- brw: compute consistent clip/cull distance masks with VUE
|
||
- anv: Do not consider task as prerasterization
|
||
- anv: fix missing meson dep
|
||
- vulkan/runtime: add a few more shader properties
|
||
- vulkan/runtime: add ray tracing pipeline support
|
||
- brw: reorder reloc enums to leave embedded samplers at the end
|
||
- anv: stop using descriptor layouts for descriptor buffers push sizes
|
||
- brw: move URB channel mask shifting to the lowering pass
|
||
- anv: fix R64* vertex buffer format support
|
||
- vulkan/runtime: use a pipeline flag for unaligned dispatches
|
||
- brw: enable register allocation to deal with multiple EOTs
|
||
- brw: enable opt_register_coalesce to work with multiple EOT blocks
|
||
- brw: workaround broken indirect RT messages on Gfx11
|
||
- brw: fix analysis dirtying with pulled constants
|
||
- brw: make assign_curb_setup visible in optimizer debug
|
||
- anv: fix uninitialized return value
|
||
- brw: remove uniform from opt_offsets
|
||
- brw: use a scalar builder for the load_payload on transpose loads
|
||
- brw: fix INTEL_DEBUG=spill_fs
|
||
- brw: fix broadcast opcode
|
||
- anv: move input coverage mask setup to runtime flush
|
||
- anv: temporary disable KHR_maintenance8
|
||
- Revert "anv: enable non uniform texture offset lowering"
|
||
- Revert "brw: move texture offset packing to NIR"
|
||
- intel: update code owners
|
||
- anv: fix pipeline barriers with pre-rasterization stages
|
||
- anv/utrace: avoid memseting timestamp buffers by using tracepoint flags
|
||
- anv: fix partial queries
|
||
- nir: add a new intrinsic for load dynamic tessellation config
|
||
- brw: add ability to compute VUE map for separate tcs/tes
|
||
- anv/brw/iris: move VS VUE computation to backend
|
||
- brw: add support for separate tessellation shader compilation
|
||
- anv: prep work for separate tessellation shaders
|
||
- compiler: add stage_is_graphics() helper
|
||
- anv: add infrastructure for common vk_pipeline
|
||
- anv: move internal RT shaders around
|
||
- anv: add runtime shader statistic support
|
||
- anv: add shader instruction emission
|
||
- anv: store a few default instructions
|
||
- anv: switch over to runtime pipelines
|
||
- anv: remove unused gfx/compute pipeline code
|
||
- anv: expose VK_EXT_shader_object
|
||
- anv: add an undocumented HW workaround for Gfx12.5
|
||
- anv: fixup robust_ubo_range mask
|
||
- vulkan: remove incorrect assert
|
||
- anv: remove divergence requirement
|
||
- brw: don't use brw_null_reg() for unused SEND sources
|
||
- anv: run nir_opt_acquire_release_barriers
|
||
- brw: remove unused RT write code
|
||
- brw: improve eot_reg computation in register allocate
|
||
- anv: fixup 3DSTATE_COARSE_PIXEL emission
|
||
- anv: avoid unnecessary 3DSTATE_PS_EXTRA emissions
|
||
- brw: lower non coherent FS load_output in NIR
|
||
- brw/blorp: lower MCS fetching in NIR
|
||
- brw: lower shader opcode into tex_instr
|
||
- brw: simplify texture surface/sampler handle sources
|
||
- brw: fix split_sends with txf combining
|
||
- brw: layout patch in VUE in position independent way
|
||
- anv: fix streamout config comparison
|
||
- anv: fix crash in ESO tests
|
||
- brw: fix type conversion in tex operation params
|
||
- nir/lower_tex: add an callback to lower txd ops
|
||
- brw: use the new lower_txd_cb
|
||
- elk: remove txd bindless sampler lowering
|
||
- elk: use the new lower_txd_cb
|
||
- nir/lower_tex: remove unused options
|
||
- brw: fix render target indexing in FS output reads
|
||
- vulkan/render_pass: fixup renderpasses barriers for 2D views of 3D images
|
||
- nir: add pass to propagate image format to intrinsics
|
||
- anv: run image/intrinsic update pass
|
||
- iris: run image/intrinsic update pass
|
||
- brw: avoid looking at variables to get image formats
|
||
- u_trace: use os_get_option instead of getenv
|
||
- intel/ds: lump all the draw under the same toggle
|
||
- intel/ds: disable draw/blorp tracepoints by default on android
|
||
- brw: prevent LOAD_REG modifications on MOV_INDIRECT/BROADCAST
|
||
- anv: fix companion usage for emulated image
|
||
- nir/divergence: add a new mode to cover fused threads on Intel HW
|
||
- nir/lower_io: add get_io_index_src_number support for image intrinsics
|
||
- compiler: add an access flag for intel EU fusion
|
||
- brw: serialize messages on Gfx12.x if required
|
||
- brw: add serialize send stats
|
||
- anv: fix query copy with shaders
|
||
- intel/ci: remove old comments
|
||
- brw: fix invalid sparse bitfield offset computation
|
||
- Revert "wsi: Implements scaling controls for DRI3 presentation."
|
||
- anv: fix image-to-image copies of TileW images
|
||
- brw: constant fold u2u16 conversion on MCS messages
|
||
- brw: only consider cross lane access on non scalar VGRFs
|
||
- brw: fix ballot() type operations in shaders with HALT instructions
|
||
- nir/divergence: fix handling of intel uniform block load
|
||
- anv: rename structure holding 3DSTATE_WM_DEPTH_STENCIL state
|
||
- brw: handle GLSL/GLSL tessellation parameters
|
||
- nir/lower_io: add missing levels intrinsics to get_io_index_src_number
|
||
- anv/brw: fix output tcs vertices
|
||
- anv: destroy sets when destroying pool
|
||
- vulkan/render_pass: Add a missing sType
|
||
- u_trace: reserve chunk space before emitting copies
|
||
- anv: avoid null pointer access in utrace copies on CCS
|
||
- brw: avoid invalid URB messages
|
||
- anv: avoid invalid timestamp generation due to skipped commands
|
||
- vulkan/runtime: simplify robustness state hashing
|
||
- anv/blorp/iris: rework Wa_14025112257
|
||
- anv: disable software detiling on Xe2+ for image atomics 64bits
|
||
|
||
Lorenzo Rossi (3):
|
||
|
||
- nak: Fix pre-volta iadd3 panic during compilation
|
||
- nak/kepler: Refine instruction scheduling
|
||
- nvk: Fix QMD buffer length on upload
|
||
|
||
Luc Ma (1):
|
||
|
||
- dri: use XCB_PRESENT_EVENT_* enum instead of macros for consistency
|
||
|
||
Lucas Fryzek (14):
|
||
|
||
- lp: Don't allocate sampler functions if count is 0
|
||
- anv: Enable compression on astc emulation plane
|
||
- vulkan/util: update pd feature codegen to use platform guards
|
||
- anv: Remove special CROS_GRALLOC path from format logic
|
||
- hasvk: Remove special CROS_GRALLOC path from format logic
|
||
- anv: Update viewport/scissor state when count changes
|
||
- vulkan/runtime: Error if ahb has more than one layer
|
||
- anv: Assert that we only import ahb image with one layer
|
||
- anv: Enable R10X6 & R10X6G10X6 unorm formats
|
||
- anv: Modify anv feature (dis)enable code to match other drivers
|
||
- vulkan/android: Add rp_attachment_has_external_format helper
|
||
- vulkan/runtime: Add logic to set external format resolve mode
|
||
- anv: Add external format resolve operation using blorp
|
||
- anv: Enable VK_ANDROID_external_format_resolve
|
||
|
||
Lucas Stach (6):
|
||
|
||
- etnaviv: Update headers from rnndb
|
||
- etnaviv: stop touching code steering bits while updating uniforms
|
||
- etnaviv: update code steering bit when writing shader instructions
|
||
- etnaviv: don't emit start/end PC states when unified instmem is present
|
||
- etnaviv: use new shader range registers when icache is present
|
||
- etnaviv: fix YUV tiler blits
|
||
|
||
Ludvig Lindau (1):
|
||
|
||
- panfrost: Make instrs_equal check res table/index
|
||
|
||
Luigi Santivetti (22):
|
||
|
||
- pvr: rename pvr tex format description variables for clarity
|
||
- pvr: rename pvr_{create,generate} to appear at the end
|
||
- pvr: split out missing output register write handling into separate function
|
||
- pvr: determine rt layers based on rta support
|
||
- pvr: fix logic for setting vdm instance count present
|
||
- pvr: don't csb emit multi-layer clear attachments without rta support
|
||
- pvr: reset the pds info map entries pointer to avoid double free
|
||
- pvr: align texture stride for spm as the PBE requires
|
||
- pvr: take zonlyrender into account when setting up ZLS control
|
||
- pvr: add support for VK_KHR_maintenance1
|
||
- pvr: add support for VK_KHR_maintenance2
|
||
- pvr: unify the creation of load_op objects and shaders
|
||
- pvr: rename job field holding pds PR background objects
|
||
- pvr: rename {init,setup} command buffer helpers
|
||
- pvr: drop unused argument from pvr_load_op_shader_generate()
|
||
- pvr: add support for U16U16U16 texture state format
|
||
- pvr: restrict signed A2-10 bits per component formats to vertex only
|
||
- Revert "pvr: treat VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT as not supported"
|
||
- pvr: add initial driver support for VK_KHR_multiview
|
||
- pvr: improve unemitted resolve attachments readability
|
||
- pvr: restrict the scope of copy_{buffer,image}_to_{image,buffer}
|
||
- pvr: propagate image samples when doing a blit from DS surface
|
||
|
||
Marek Olšák (168):
|
||
|
||
- gallium: make pipe_screen::finalize_nir return void
|
||
- gallium: replace get_compiler_options with pipe_screen::nir_options
|
||
- st/mesa: don't expect pipe_screen::nir_options to be NULL for supported shaders
|
||
- mesa: use pipe_screen::nir_options instead of NirOptions
|
||
- glsl: use pipe_screen::nir_options instead of NirOptions
|
||
- ac/surface/gfx12: add addr_from_coord for sparse MSAA textures
|
||
- ac/surface/gfx12: select 64K tiling for sparse MSAA textures
|
||
- radeonsi/gfx12: enable sparse textures
|
||
- ac/nir: don't vectorize to 96-bit and 128-bit LDS loads (it's slower)
|
||
- ac/nir: mark all input loads as reorderable and speculatable (for LICM)
|
||
- ac/llvm: rewrite global & shared stores to share code
|
||
- ac/llvm: rewrite global & shared loads to share code
|
||
- ac/llvm: always use opaque pointers
|
||
- ac/llvm: fix readlane with vectors
|
||
- radeonsi: disallow the compute copy for Z/S
|
||
- radeonsi: add a workaround for gfx10.3-11 corruption with R9G9B9E5_FLOAT
|
||
- radeonsi: recompute FS output IO bases to prevent an LLVM crash
|
||
- radeonsi: get si_shader_info::input::usage_mask from NIR
|
||
- radeonsi: flatten struct si_vs_tcs_input_info
|
||
- radv,radeonsi: mark VS input loads and poly stipple load speculatable
|
||
- radv: don't sink VS input loads and move them to the top
|
||
- nir: add nir_instr_can_speculate helper (for LICM)
|
||
- nir: add nir_tex_instr::can_speculate
|
||
- nir: add access to load_smem_amd (for ACCESS_CAN_SPECULATE)
|
||
- nir/divergence_analysis: simplify nir_vertex_divergence_analysis
|
||
- nir/opt_move_to_top: check can_reorder & can_speculate
|
||
- nir: silence a warning in nir_opt_shrink_vectors
|
||
- nir: handle store_buffer_amd in nir_intrinsic_writes_external_memory
|
||
- radeonsi/ci: import piglit & cts build scripts
|
||
- radeonsi/ci: don't build GLES CTS separately
|
||
- radeonsi/ci: update gfx12 and other failures
|
||
- nir/group_loads: handle more loads
|
||
- nir/group_loads: allow moving loads across instructions without defs
|
||
- nir/group_loads: split is_barrier into is_barrier + is_terminate
|
||
- nir/group_loads: group any reorderable intrinsics regardless of barriers
|
||
- nir/group_loads: invert the return value of can_move to reflect its true meaning
|
||
- nir/group_loads: remove mostly duplicated function is_memory_load
|
||
- nir/group_loads: make is_grouped_load use get_load_resource
|
||
- nir/group_loads: use nir_instr_next/prev
|
||
- nir/group_loads: store our custom instr->index in an array
|
||
- nir/group_loads: don't use pass_flags to store the indirection level
|
||
- nir/group_loads: rename to nir_opt_group_loads
|
||
- nir: mark inverse_ballot & is_subgroup_invocation_lt_amd as CAN_REORDER
|
||
- nir: change how can_mov_out_of_loop is set for intrinsics in nir_can_move_instr
|
||
- nir: handle can_reorder robustly in nir_can_move_instr
|
||
- nir: renumber nir_move_options
|
||
- nir: split nir_move_load_frag_coord from nir_move_load_input
|
||
- nir: handle load_input_vertex in nir_can_move_instr
|
||
- nir: add more nir_move_options
|
||
- nir: add nir_move_only_convergent/divergent
|
||
- glsl: fork exec_node/list -> ir_exec_node/list as private GLSL IR utility
|
||
- intel: fork exec_node/list -> brw_exec_node/list as a private Intel utility
|
||
- nir: move list.h outside the glsl directory
|
||
- nir: remove C++ stuff from list.h
|
||
- nir: remove unused stuff from list.h
|
||
- glsl: remove unused stuff from ir_list.h
|
||
- glsl: remove unused symbol_table_entry::get_interface
|
||
- glsl: remove reparent_ir
|
||
- nir/opt_group_loads: support tex instructions without resource srcs for i915
|
||
- glsl/tests: fix memory leaks
|
||
- ralloc/linalloc: allow adding custom code to LINEAR_ALLOC new operator
|
||
- glsl: add support for linear_ctx into ir_instruction
|
||
- glsl: switch ir_instruction to linear_ctx to eliminate malloc overhead
|
||
- glsl: switch ir_variable_refcount to linear_ctx
|
||
- mesa: switch symbol_table to linear_ctx
|
||
- dri: fail creating DRI images that exceed hw limits
|
||
- nir: don't allocate nir_constant::elements if there are none
|
||
- nir: add nir_variable_{set,append,steal}_name{f}() to modify nir_variable names
|
||
- nir: eliminate most ralloc/malloc for nir_variable names
|
||
- nir/clone: don't call ralloc_strdup with a NULL pointer for intrinsic names
|
||
- nir: don't use variables as ralloc parents, use the shader instead
|
||
- nir: add nir_variable_create_zeroed helper
|
||
- nir: use gc_ctx for nir_variable to reduce ralloc/malloc overhead
|
||
- meson: reinstate LLVM requirement for r300 and enforce it for i915 too
|
||
- meson: remove unused -DLLVM_AVAILABLE
|
||
- mesa: move src/mapi to src/mesa/glapi
|
||
- docs,ci: update mapi relocation
|
||
- mesa: remove inc_mapi
|
||
- mesa: stop using inc_mesa in most places that have nothing to do with GL
|
||
- glsl: use pipe caps in opt_shader
|
||
- glsl: replace LowerBuiltinVariablesXfb with pipe caps
|
||
- glsl: replace LowerPrecisionFP16/Int16 with pipe caps
|
||
- glsl: replace LowerPrecisionDerivatives with pipe caps
|
||
- glsl: replace LowerPrecisionFloat16Uniforms with pipe caps
|
||
- glsl: replace LowerPrecision16BitLoadDst with pipe caps
|
||
- glsl: replace LowerPrecisionConstants with pipe caps
|
||
- st/mesa: replace EmitNoIndirect* with pipe caps
|
||
- glsl: move PositionAlwaysInvariant/Precise options to gl_constants
|
||
- glsl: remove gl_shader_compiler_options
|
||
- ac/nir/meta: allow compute blits with R5G6B5 & R5G5B5A1 formats on GFX9+
|
||
- radeonsi/gfx12: print swizzle modes for AMD_TEST=imagecopy
|
||
- ac/nir: clarify the behavior of ac_nir_lower_ngg_options::can_cull
|
||
- ac/llvm: inline ac_array_in_const*_addr_space
|
||
- ac/nir: inline ac_get_ptr_arg
|
||
- ac/nir: remove unused ac_get_ptr_arg & ac_arg_type_to_pointee_type
|
||
- ac: simplify AC_ARG_CONST_*PTR enums
|
||
- ac/llvm: make ac_get_arg non-inline
|
||
- radeonsi: bitcast shader args to float in LLVM IR manually
|
||
- ac/llvm: make AC_ARG_FLOAT equal to AC_ARG_INT
|
||
- ac: merge AC_ARG_INT & AC_ARG_FLOAT into single AC_ARG_VALUE
|
||
- egl,glx: allow OpenGL with old libx11, but disable glthread if it's unsafe
|
||
- util/set: improve support for usage without "set" structure allocation
|
||
- radv,zink,st/mesa: use _mesa_set_fini instead of ralloc_free
|
||
- util/set: start with 16 entries to reduce reallocations when growing the set
|
||
- util/set: don't allocate the smallest table, declare it in the struct
|
||
- util/set: set _mesa_set_init return type to void
|
||
- util/set: add _mesa_set_copy, a cloning helper without allocation
|
||
- util/hash_table: start with 16 entries to reduce reallocations
|
||
- util/hash_table: improve support for usage without "hash_table" allocation
|
||
- util/hash_table: don't allocate the smallest table, declare it in the struct
|
||
- util/hash_table: set _mesa_hash_table_init return type to void
|
||
- util/hash_table: don't allocate hash_table_u64::table, declare it statically
|
||
- util/hash_table: add _mesa_hash_table_copy, a cloning helper without allocation
|
||
- nir/dominance: don't allocate 0-sized dom_children
|
||
- nir/dominance: eliminate ralloc overhead for allocating dom_children
|
||
- nir: make nir_block::predecessors & dom_frontier sets non-malloc'd
|
||
- nir/lower_vars_to_ssa: don't ralloc sets
|
||
- nir/instr_set: don't ralloc the set
|
||
- nir/remove_dead_variables: don't ralloc the set
|
||
- nir/opt_vectorize: don't ralloc the set
|
||
- nir/gather_info: don't ralloc the set
|
||
- nir/search: don't ralloc the hash table
|
||
- nir/opt_copy_prop_vars: don't allocate vars_written::derefs hash table
|
||
- nir/opt_copy_prop_vars: don't allocate vars_written_map hash table
|
||
- nir/opt_copy_prop_vars: don't allocate copies::ht hash table
|
||
- nir/lower_vars_to_ssa: don't ralloc the hash table
|
||
- nir/opt_find_array_copies: don't allocate the hash tables
|
||
- nir/split_vars: don't allocate the hash tables
|
||
- nir/serialize: don't allocate the hash tables
|
||
- nir/opt_load_store_vectorize: don't allocate 0-sized offset_defs
|
||
- nir: convert nir_instr_worklist to init/fini semantics w/out allocation
|
||
- nir/opt_dead_write_vars: don't use ralloc context, share dynarray among blocks
|
||
- nir/gather_info: don't allocate the ralloc context
|
||
- glsl/opt_function_inlining: don't ralloc the hash table
|
||
- glsl/ir_constant_expression: don't ralloc the hash table
|
||
- glsl/ir_variable_refcount: don't ralloc the hash table
|
||
- glsl_to_nir: don't allocate 0-sized num_params & subroutine_types
|
||
- glsl_to_nir: don't allocate 0-sized arrays for Uniform/ShaderStorageBlocks
|
||
- nir/opt_call: handle load_global(_amd) with SPECULATE as rematerializable
|
||
- nir/opt_sink: handle load_global_amd
|
||
- nir/opt_move_to_top: handle load_global_amd with ACCESS_SMEM_AMD
|
||
- aco: check that global addresses are 64bit, apply_nuw_to_ssa to global_amd/smem
|
||
- ac/llvm: fix handling COHERENT and VOLATILE flags for global access
|
||
- ac/llvm: port load_smem_amd behavior to load_global_amd
|
||
- aco,radeonsi: expand 32-bit shader arg pointers to 64 bits for ACO
|
||
- ac/nir: switch nir_load_smem_amd uses to ac_nir_load_smem wrapper
|
||
- radv: fix load_smem alignment
|
||
- radeonsi: always set TC_L2 for CP DMA on GFX12
|
||
- radeonsi: inline si_upload_const_buffer
|
||
- radeonsi: if rebinding the same constbuf, don't update refcount with atomics
|
||
- radeonsi: remove recursion from si_set_constant_buffer
|
||
- radeonsi: don't ref and unref an index buffer uploaded from a user buffer
|
||
- radeonsi: switch VBO descriptor uploads from u_upload_alloc_ref to u_upload_alloc
|
||
- radeonsi/ci: primitive_counter failures are no longer reproducible on gfx12
|
||
- radeonsi: compute blake3 hashes of internal shaders if they are not set
|
||
- gallium/u_threaded: remove refcounting for draw indirect buffers
|
||
- gallium/u_threaded: remove refcounting for dispatch compute indirect buffers
|
||
- gallium/u_threaded: remove refcounting for clear_buffer
|
||
- gallium/u_threaded: remove refcounting for draw mesh indirect buffers
|
||
- gallium/u_threaded: remove refcounting for get_query_result_resource
|
||
- gallium/u_threaded: remove refcounting for buffer_unmap
|
||
- gallium/u_threaded: remove refcounting for buffer_subdata
|
||
- nir: remove load_smem_amd
|
||
- r300: fix DXTC blits
|
||
- winsys/radeon: fix completely broken tessellation for gfx6-7
|
||
- zink: fix mesh and task shader pipeline statistics
|
||
- Revert ABI breakage "amd: Add user queue HQD count to hw_ip info"
|
||
- gallium/noop: don't unref buffers passed to set_vertex_buffers to fix crashes
|
||
|
||
Marek Vasut (4):
|
||
|
||
- etnaviv: hwdb: update gc_feature_database from ST
|
||
- etnaviv: Turn ETNA_CORE\_ into ETNA_FEATURE_CORE\_
|
||
- pvr: fix features pointer on GX6650 (4.46.6.62)
|
||
- pvr: fix device info for GX6250 (4.45.2.58)
|
||
|
||
Mario Kleiner (6):
|
||
|
||
- asahi: Fix lseek failure error handling in agx_bo_import().
|
||
- asahi: Set PIPE_BIND_SCANOUT in agx_resource_from_handle().
|
||
- wsi/display: Accept 0 nits for HDR light level properties for "undefined"
|
||
- wsi/display: Initially set default HDR metadata from EDID for HDR modes
|
||
- wsi/display: Allow atomic modeset for change of Colorspace or HDR poperties
|
||
- wsi/wayland: Zero min_luminance, max_luminance HDR light levels are valid.
|
||
|
||
Mark Collins (1):
|
||
|
||
- freedreno/drm: Only initialize memory data source when Perfetto is active
|
||
|
||
Martin Krastev (1):
|
||
|
||
- Revert "ci: Disable vmware farm"
|
||
|
||
Martin Roukala (né Peres) (24):
|
||
|
||
- radv/ci: add post-merge jobs for gfx1201
|
||
- zink/ci: add post-merge jobs for gfx1201
|
||
- zink/ci: update the nvk expectations
|
||
- nvk/ci: document a new fail and flakes
|
||
- radv/ci: document new flakes
|
||
- freedreno/ci: document new flakes
|
||
- radv/ci: disable hang detection in navi31-vkcts
|
||
- ci: disable the valve-kws farm
|
||
- Revert "ci: disable the valve-kws farm"
|
||
- ci/ci-tron: uprev the job submission template
|
||
- freedreno/ci: uprev the kernel for the a750
|
||
- nvk/ci: document some vk3d fails
|
||
- ci-tron: uprev b2c to v0.9.17
|
||
- radv/ci: switch to default kernel to b2c's default kernel
|
||
- nvk/ci: switch to default kernel to b2c's default kernel
|
||
- zink/ci: raise the job timeout from 5 to 8 minutes
|
||
- turnip/ci: document more flakes
|
||
- zink/ci: document more flakes in the a750 job
|
||
- turnip/ci: switch vkcts testing to the KWS farm
|
||
- ci,crnm: remove unsupported arguments by console.print
|
||
- ci,crnm: remove unused imports
|
||
- turnip/ci: enable a750_vk in marge pipelines
|
||
- turnip/ci: squeeze a750-vk into 4 jobs
|
||
- zink/ci: run the a750 job in pre-merge
|
||
|
||
Mary Guillemard (85):
|
||
|
||
- panvk: Fix nullDescriptor for dynamic descriptors
|
||
- panvk: Wire robustness2 buffer info down to pan/bi
|
||
- panvk: Exposes robustBufferAccess2 on v11+
|
||
- pan/genxml: Add missing parenthesis on pan_cast_and_pack macros
|
||
- pan/genxml: Make resource table optional on RUN_COMPUTE{_INDIRECT}
|
||
- panvk: Add basic infrastructure for shader variants
|
||
- pan/bi: Fuse FCMP/ICMP on Valhall
|
||
- pan/bi: Properly handle SWZ.v4i8 lowering on v11+
|
||
- panvk: Always use varying_count in emit_varying_attrs
|
||
- panvk: track oq write jobs in JM
|
||
- panvk: Directly use index buffer tracked value in JM
|
||
- libcl: Add stdatomic.h
|
||
- panfrost: Allow to pass job dependencies in grid for precomp JM
|
||
- libpan: Add draw indexed and indirect helper for Bifrost
|
||
- panvk: Prepare draw_emit_attrib_buf and draw_emit_attrib for indirect
|
||
- panvk: Move JM draw preparation logic to prepare_draw
|
||
- panvk: Prepare panvk_draw_prepare_varyings for JM indirect
|
||
- panvk: Prepare tiler and vertex dcd for JM indirect
|
||
- panvk: Implement indirect draw for Bifrost on JM
|
||
- panvk: Use indirect path for indexed draw on JM
|
||
- panvk: Make indexed draw use indirect indexed draw
|
||
- panvk: Parallelize min max index search on JM
|
||
- panvk: Call nir_opt_access
|
||
- pan/bi: Switch to nir_lower_alu_width
|
||
- pan/bi: Vectorize UBOs load/store
|
||
- pan/bi: Handle needless conversions in nir_lower_bool_to_bitsize
|
||
- pan/bi: Revamp bi_optimize_nir
|
||
- pan/bi: Move pan_lower_sample_pos to next block
|
||
- pan/bi: Stop exposing bifrost_nir_lower_load_output
|
||
- panvk: Remove unused color_output_var function in fb_preload
|
||
- panvk: Lower sampler and texture index in case of offset
|
||
- panfrost: Split compilers preprocess_nir
|
||
- panfrost: Move nir_lower_io outside of postprocess
|
||
- panfrost: Split texture lowering passes
|
||
- pan/bi: Split bi_optimize_nir and run bi_optimize_loop_nir in preprocess
|
||
- pan/bi: remove dead variables in preprocess
|
||
- pan/bi: Run opt_sink and opt_move in preprocess
|
||
- nouveau/headers: Properly parse DMA classes for Turing and Ampere A
|
||
- nouveau/headers: Mark SET_POINT_SIZE as using float
|
||
- nouveau/headers: Handle Ampere A GPFIFO in dumper
|
||
- nouveau/headers: Add missing M2MF parsing and set it for subchan 2
|
||
- nouveau/headers: Fix nv_push rust push_inline_data implementation
|
||
- nouveau/headers: Add raw INC methods in nv_push rust impl
|
||
- nvk: Force GART for command buffers
|
||
- nvk: Use MEM_LOCAL for nvk_cmd_mem_create
|
||
- nak: add Ldsm
|
||
- hk: Return 0 for opaque memory capture replay
|
||
- pan/bi: Ensure to merge adjacent ifs after bifrost_nir_lower_shader_output
|
||
- pan/bi: Reintroduce bi_fuse_small_int_to_f32 on v11+
|
||
- pan/bi: Make va_optimize_forward run until there is no progress
|
||
- pan/bi: Propagate MKVEC.v2i8 and V2X8_TO_V2X16 for replicate swizzle
|
||
- panvk: Do not clamp blend constants in command buffer
|
||
- panvk: Enable SNORM rendering
|
||
- panvk/ci: Update waivered tests
|
||
- pan/decode: Fix SYNC_SET32 double dots
|
||
- panvk: Fix wrong type for sb_mask in CmdSetEvent2
|
||
- panvk: Take VK_DEPENDENCY_ASYMMETRIC_EVENT_BIT_KHR into account
|
||
- docs/features: Mark VK_KHR_maintenance9 as done for ANV
|
||
- hk: Move query pool creation/destruction
|
||
- hk: Add support for VK_QUERY_POOL_CREATE_RESET_BIT_KHR
|
||
- hk: Rework queue creation logic
|
||
- hk: Advertise VK_KHR_maintenance9
|
||
- nir/print: Fix load_converted_output_pan and load_readonly_output_pan
|
||
- panvk: Follow nir_lower_io for subpass lowering
|
||
- panvk: Properly set shader binary properties
|
||
- nouveau/headers: Autogenerate push method dumpers
|
||
- nouveau/headers: Handle all compute classes in vk_push_print
|
||
- nouveau/headers: Handle all DMA classes in vk_push_print
|
||
- nouveau/headers: Handle all 3D classes in vk_push_print
|
||
- nouveau/headers: Handle more gpfifo classes in vk_push_print
|
||
- nouveau/headers: Include class headers instead of redefining class ids
|
||
- nouveau/headers: Add Blackwell support to nv_push_dump
|
||
- nouveau/headers: Properly set subchannel 3 to 2D engine in vk_push_print
|
||
- nouveau/headers: Import Blackwell host class headers
|
||
- nouveau/headers: Handle unbound sub channels in vk_push_print
|
||
- panvk, vk/meta: Move D/S sanitizing to panvk
|
||
- asahi: Add base expectation on VKCTS main
|
||
- nouveau/headers: Define fake devices in a table for nv_push_dump
|
||
- nouveau/headers: Add missing Kepler, Maxwell and Pascal defs to nv_push_dump
|
||
- nouveau/headers: Properly reformat nv_push_dump
|
||
- hk: Fix maxVariableDescriptorCount with inline uniform block
|
||
- hk: Disable 1x in sampleLocationsSampleCounts
|
||
- hk: Remove unused allocation in queue_submit
|
||
- hk: Make width and height per block in HIC
|
||
- hk: Allocate the temp tile buffer in copy_image_to_image_cpu
|
||
|
||
Matt Coster (6):
|
||
|
||
- pvr: Fill in missing {u,s}norm equivalents for tex formats
|
||
- pvr: Add missing format adjustment for e5b9g9r9
|
||
- pvr: Add macros to iterate all supported tex formats
|
||
- pvr: Cleanup compressed border colour support
|
||
- pvr: Use 2D texstate for buffer views to allow for >8k sizes
|
||
- pvr: Add support for custom border colors
|
||
|
||
Matt Turner (4):
|
||
|
||
- meson: Allow controlling perfetto fallback
|
||
- meson: Allow configuring with Android-internal perfetto
|
||
- brw/algebraic: Protect SHUFFLE from OOB indices
|
||
- elk/algebraic: Protect SHUFFLE from OOB indices
|
||
|
||
Mauro Rossi (4):
|
||
|
||
- intel/mda: Fix gnu-empty-initializer warning
|
||
- amd: require LLVM when amd-use-llvm is enabled
|
||
- android: fix building rules for i915, r300
|
||
- util: Fix gnu-empty-initializer error
|
||
|
||
Max R (2):
|
||
|
||
- d3d10umd: De-bufferize OutputMerger
|
||
- d3d10umd: Flush on present
|
||
|
||
Maíra Canal (3):
|
||
|
||
- vulkan: create a wrapper struct for vk_sync_timeline
|
||
- vulkan: don't destroy vk_sync_timeline if a point is still pending
|
||
- broadcom/ci: remove synchronization-related flakes and skips
|
||
|
||
Mel Henning (68):
|
||
|
||
- nouveau/headers: Update g_nv_name_released.h
|
||
- nak/mark_lcssa_invariants: Invalidate divergence
|
||
- loader: Don't load nouveau GL on nvidia kmd
|
||
- meson,nvk: Require rustc-hash 2.0 or later
|
||
- nvk: Call cmd_buffer_begin_* based on queue flags
|
||
- nvk: Factor out nvk_queue_engines_from_queue_flags
|
||
- nvk: Check subchannels are valid in nv_push
|
||
- nvk: Disable non-graphics timestamp queries
|
||
- zink: Fix a few profile errors
|
||
- zink: Convert profile tabs to spaces
|
||
- zink: Add zink_check_requirements
|
||
- loader: Don't fall back to nouveau GL without zink
|
||
- nvk: Split out NVC0_FIFO_SUBC_FROM_PKHDR helper
|
||
- nvK: Add nvk_cmd_buffer_last_subchannel
|
||
- nvk: Reduce subc switches in cmd_invalidate_deps
|
||
- nvk/copy: Split out nvk_remap_insert_aspect
|
||
- nvk/copy: Split out nvk_remap_extract_aspect
|
||
- nvk/copy: Split out nvk_remap_copy_aspect
|
||
- nvk/copy: Implement CopyImage2 between R and D/S
|
||
- nvk: Expose VK_KHR_maintenance8
|
||
- nvk: Clear cond_render_gart_* in reset_cmd_buffer
|
||
- nak/hw_runner: Make a few more items public
|
||
- nak: Add a test to check how RENDER_ENABLE works
|
||
- nvk/cmd_pool: NVK_DEBUG=trash_mem for alloc_mem
|
||
- nvk: Clear second SET_RENDER_ENABLE operand
|
||
- nvk: Remove gart from the name of cond_render_mem
|
||
- nvk: Move cond rendering memory out of gart
|
||
- nvk: Reuse the same cond render temp in a cmd_buf
|
||
- nvk: Don't re-initialize cond rendering operand B
|
||
- nvk: Only copy 32-bits for cond render operand A
|
||
- nir: Don't require nir_metadata_control_flow
|
||
- nir/phi_builder: Adjust valid_metadata assert
|
||
- util: Add range_minimum_query
|
||
- nir: Add a faster lowest common ancestor algorithm
|
||
- treewide: Spell indices correctly
|
||
- nak: Remove Option<> from SSARef::file() return
|
||
- nak: impl HasRegFile for SSARef and &[SSAValue]
|
||
- nak/assign_regs: Make src_ssa_ref return a slice
|
||
- nak: Make BindlessSSA store [SSAValue; 2]
|
||
- compiler/rust: impl AsSlice for Box
|
||
- nak: Special case Box in derive_from_variants
|
||
- nak: impl SM*Op for Op
|
||
- nak: Place most Op structs in Box<>
|
||
- nak: Don't copy-prop adds that flush to zero
|
||
- nak: Fix divergence test for redux availability
|
||
- util/macros: Add ATTRIBUTE_COLD
|
||
- nouveau/headers: Mark vk_push_print as cold
|
||
- nouveau/headers: Split out "cases" in template
|
||
- nouveau/headers: Deduplicate push dump impls
|
||
- nouveau/headers: Use previous method for default
|
||
- nak: Add OpSgxt
|
||
- nak: Implement bitfield_extract with OpSgxt
|
||
- nvk: Only run one INVALIDATE_SHADER_CACHES
|
||
- nvk: Combine BARRIER_{COMPUTE,RENDER}_WFI
|
||
- nvk: Fix execution deps in pipeline barriers
|
||
- nvk/cmd_buffer: Remove redundant tests for access
|
||
- vulkan: Drop vk_pipeline_stage_flags2_has_*_shader
|
||
- nvk: INVALIDATE_SHADER_CACHES on most recent subc
|
||
- nvk: WFI on the most recent subc
|
||
- nvk/cmd_copy: Use PIPELINED for user transfers
|
||
- nvk/cmd_copy: Pipeline user copy_rect operations
|
||
- nvk: Reduce subc switches with events
|
||
- nvk: Call INVALIDATE_RASTER_CACHE for shading rate
|
||
- nvk: FLUSH_PENDING_WRITES in gr semaphore release
|
||
- nvk: Fix maxVariableDescriptorCount with iub
|
||
- nvk: Really fix maxVariableDescriptorCount w/ iub
|
||
- nvk: VK_DEPENDENCY_ASYMMETRIC_EVENT_BIT_KHR
|
||
- nak/opt_lop: Don't handle modifiers in dedup_srcs
|
||
|
||
Michal Krol (3):
|
||
|
||
- gallium: Do not flush subnormals during tessellation.
|
||
- lavapipe: Bump maxTransformFeedbackBufferDataStride to 2048.
|
||
- llvmpipe: Add support for 8x MSAA.
|
||
|
||
Michel Dänzer (2):
|
||
|
||
- egl/dri: Name struct dri2_egl_buffer
|
||
- egl/gbm: Destroy excess BOs
|
||
|
||
Mike Blumenkrantz (217):
|
||
|
||
- gallium/hud: set the framebuffer texture when drawing
|
||
- ci: bump VVL to 1.4.322ish
|
||
- zink: fix valid contents check for adding new bind
|
||
- lavapipe: call nir_lower_int64
|
||
- lavapipe: maintenance9
|
||
- lavapipe: VK_KHR_unified_image_layouts
|
||
- zink: use maint9 implicit query resets when available
|
||
- zink: flag dmabuf exports on usage set, not synchronization
|
||
- zink: simplify sampler bufferview change for non-db path
|
||
- egl/x11: don't leak device_name when choosing zink
|
||
- zink: account for generated tcs when pruning programs
|
||
- zink: remove extra gfx prog unref during separable replacement
|
||
- anv: fix format compatibility check typo
|
||
- ci: add venus-lavapipe flake
|
||
- ci: disable xwm decorations in weston
|
||
- zink: create a dummy image for shaderdb runs
|
||
- zink: drop primitiveTopologyPatchListRestart from profile
|
||
- zink: just check multiview availability to advertise extensions
|
||
- crocus: silence perf_debug -Waddress warnings
|
||
- iris: silence perf_debug -Waddress warnings
|
||
- vulkan: silence typed_memcpy -Waddress warnings
|
||
- zink: skip all glx piglit tests on anv-adl
|
||
- zink: verify that no generated tcs is ever in zink_context::gfx_stages
|
||
- kopper: fix initial swapinterval setting
|
||
- zink: also add access stage sync when rebinding buffers
|
||
- zink: check for multi-context image/buffer rebinds during dispatch
|
||
- zink: fix tc buffer replacement rebind condition
|
||
- zink: trigger multi-context buffer invalidate on internal buffer invalidate
|
||
- mesa/fbobject: tweak attachment validation
|
||
- crocus: stop using util_framebuffer_init
|
||
- i915: stop using util_framebuffer_init
|
||
- zink: add cezanne skip for a device loss flake
|
||
- mesa: fix and advertise GL_EXT_sRGB
|
||
- zink: zero dynamic rendering resolve views on rp end
|
||
- tc: also inline depth resolves
|
||
- zink: add ZINK_DEBUG=rploads to mimic tiler behavior
|
||
- zink: fix assert for unsynchronized non-GENERAL image barriers
|
||
- tc: don't clobber CSO info when renderpass has ended
|
||
- zink: don't access ctx in submit_queue
|
||
- zink: stop always syncing threaded flushes
|
||
- perfetto: unify init
|
||
- mesa: make _mesa_bufferobj_release_buffer static
|
||
- mesa: add a ctx param to _mesa_bufferobj_release_buffer
|
||
- mesa/st: check for tc on context create
|
||
- util/tc: don't print END_BATCH in debug
|
||
- tc: break out buffer list busy check
|
||
- tc: add a function to check the internal buffer lists
|
||
- freedreno: stop using util_set_vertex_buffers
|
||
- r300: stop using util_set_vertex_buffers
|
||
- r600: stop using util_set_vertex_buffers
|
||
- zink: destroy u_uploaders earlier in context destroy
|
||
- gallium: set prefer_real_buffer_in_constbuf0 for all drivers using tc
|
||
- gallium: always upload cbuf0 when cap is set
|
||
- mesa/st: rework thread scheduler handling + add dispatch tracking
|
||
- tc: remove user cbuf uploads
|
||
- zink: optimize a GENERAL layout case in pre-draw/dispatch barriers
|
||
- zink: fix image sync deferral
|
||
- zink: remove UNSYNCHRONIZED map flag during unmap flush for non-subdata calls
|
||
- zink: improve deferred buffer barrier heuristics
|
||
- glthread: mark internal bufferobjs for the ctx they belong to
|
||
- st/program: stop calling st_finalize_nir() unnecessarily for variants
|
||
- kopper: don't sync glthread from swapbuffers
|
||
- glx/egl/kopper: explicitly pass __DRI2_FLUSH_CONTEXT when appropriate
|
||
- glx/kopper: don't call glFlush from swapbuffers
|
||
- zink: sprinkle in a bunch of MESA_TRACE_FUNC
|
||
- zink: inline zink_resource_access_is_write()
|
||
- zink: ALWAYS_INLINE resource inlines
|
||
- zink: break out unflushed batch waiting into separate function/mechanism
|
||
- zink: pass ctx to sparse bind functions
|
||
- zink: when sparse unbinding, always wait on main timeline semaphore
|
||
- zink: trigger fb unbind barrier on resolve images too
|
||
- zink: fix sizing on resolve resource array
|
||
- zink: update resized swapchain depth buffer layout while blitting
|
||
- zink: unify/fix clear flushing
|
||
- zink: fixes for flushing clears
|
||
- zink: also set msrtss stencil
|
||
- zink: always flush clears when doing single-aspect blit to avoid data loss
|
||
- zink: enable single-aspected blitting of mixed z/s formats
|
||
- zink: fix some weird indentation in update_binds_for_samplerviews()
|
||
- zink: flag resources for layout eval in update_binds_for_samplerviews()
|
||
- zink: unset validate_all_dirty_states
|
||
- zink: set can_bind_const_buffer_as_vertex
|
||
- radv: ALWAYS_INLINE radv_upload_graphics_shader_descriptors and relateds
|
||
- zink: add a util function for appending a batch state
|
||
- zink: split out batch state finding
|
||
- zink: null out zink_batch_state::next when reusing a batch state
|
||
- zink: defer batch state resets more competently
|
||
- zink: check ctx batch states first when finding a usable one
|
||
- zink: stop using atomics to check fence submit/complete
|
||
- zink: stop trying to oom prune batch states
|
||
- zink: rename zink_batch_state::unref_resources -> unref_resource_objs
|
||
- zink: move buffer hashlist clear to normal batch state reset
|
||
- zink: stop deferring resource object unrefs
|
||
- zink: once there are many outstanding submits, check for timeline updates
|
||
- zink: zero db offset on batch reset
|
||
- zink: don't init non-db batch stuff in db mode
|
||
- zink: reset batch descriptor states again before use on recycle
|
||
- zink: don't increase db scale when resizing a db up to the current scale
|
||
- zink: add some cml flakes
|
||
- mesa: tag a couple framebuffer commands for MESA_VERBOSE=api
|
||
- mesa: add MESA_DEBUG=fallback_tex
|
||
- kopper: unwrap screen before checking cpu flag
|
||
- tc: don't unset resolve resource in set_framebuffer_state
|
||
- mesa/varray: inline a bunch of functions
|
||
- zink: reeneable OVR_multiview2
|
||
- mesa: add task/mesh to _mesa_shader_stage_to_subroutine_prefix()
|
||
- aux/trace: dump more mesh draw info
|
||
- zink: remove rebar requirement for descriptor buffer support
|
||
- zink: add another flag to determine whether linked program compile is done
|
||
- zink: toggle ctx->has_swapchain when flushing clears
|
||
- zink: flag pipeline_changed when updating shader modules
|
||
- zink: clamp subgroup op return types to required int/uint types
|
||
- zink: fix edgeflags check on program creation
|
||
- zink: correctly handle batch_id==0 in check_last_finished()
|
||
- zink: only set compute module info on dispatch (after compile fence)
|
||
- zink: set current compute prog after comparing against current compute prog
|
||
- zink: do bindless init when binding a bindless shader, not on create
|
||
- zink: just reference compute progs to batch on delete
|
||
- zink: ensure transient surface is created when doing msaa expand
|
||
- gallium: add pipe_context::resource_release to eliminate buffer refcounting
|
||
- zink: eliminate buffer refcounting to improve performance
|
||
- zink: flag vertex element state for rebind after vstate draws
|
||
- zink: don't init batch descriptors for copy contexts
|
||
- zink: simplify state iterating in find_completed_batch_state()
|
||
- zink: make find_completed_batch_state() only return state for COPY_ONLY ctx
|
||
- zink: update gfx pipeline less frequently
|
||
- zink: use implicit offsets for function temp variables in ntv
|
||
- zink: more vvl exceptions
|
||
- cso: unbind vertex buffers when unbinding context
|
||
- tc: eliminate refcounting for set_shader_buffers
|
||
- ci: bump vvl to another random version
|
||
- zink: store last index buffer
|
||
- zink: always use vkCmdBindVertexBuffers2
|
||
- zink: simplify index type access to normal array
|
||
- zink: move draw state flag resets into their blocks
|
||
- zink: add some pre-checks before calling query update/suspend/resume
|
||
- zink: add another tu flake
|
||
- mesa: support GL_NV_representative_fragment test
|
||
- zink: support NV_representative_fragment_test
|
||
- zink: add a fastpath for nooping vertex and draw buffer barriers
|
||
- zink: ALWAYS_INLINE zink_set_vertex_buffers_internal
|
||
- zink: split update_res_bind_count
|
||
- zink: use velems buffer count in blitter instead of gfx mask
|
||
- zink: move zink_bind_vertex_elements_state() to zink_context.c
|
||
- zink: move vbo unbind to bind_vertex_state
|
||
- zink: rescope some zink_set_vertex_buffers_internal variables
|
||
- zink: use memcpy for vbo bind
|
||
- zink: delete some function decls that no longer exist
|
||
- zink: only remove buffer deferred sync on release
|
||
- zink: eliminate even more calls to sync functions
|
||
- util/vbuf: stop nooping set_vertex_buffers calls
|
||
- Revert "util/vbuf: stop nooping set_vertex_buffers calls"
|
||
- zink: mark dirty_gfx_stages using util function
|
||
- zink: delete weird prog->pipelines sizing
|
||
- zink: make zink_descriptor_util_push_layouts_get() static
|
||
- zink: unify ntv code for storing shared/scratch memory
|
||
- zink: unify ntv code for loading shared/scratch memory
|
||
- zink: add enum zink_pipeline_idx to distinguish between types of pipelines
|
||
- zink: break out setting draw-time dynamic state into separate function
|
||
- zink: some minor tweaks to descriptor template code
|
||
- zink: use a better array loop sizing for gfx descriptor program init
|
||
- zink: stop unsetting zink_gfx_pipeline::modules on shader unbind
|
||
- zink: don't use screen ralloc context for screen::pipeline_libs
|
||
- zink: imagelessFramebuffer is no longer required/used
|
||
- tc: don't sync on internal UNSYNCHRONIZED texture_map calls
|
||
- mesa/st: add a flags param to st_texture_create()
|
||
- mesa/st: mark internal texture map calls as UNSYNCHRONIZED
|
||
- mesa/st: mark internal buffer map call as UNSYNCHRONIZED
|
||
- zink: make zink-anv-adl jobs use descriptor buffer
|
||
- zink: hook up VK_EXT_mesh_shader
|
||
- zink: implement compiler-side handling for mesh shaders
|
||
- zink: split out descriptor invalidation to be more explicit
|
||
- zink: use pipeline_idx for descriptor invalidation
|
||
- zink: implement mesh shaders
|
||
- zink: wait on queues during screen destroy
|
||
- zink: account for kopper dt not having a swapchain when pruning batch usage
|
||
- zink: prune active queries in reset_batch_state_ctx()
|
||
- zink: call post_submit directly from submit_queue
|
||
- zink: check for zink_batch_state::ctx before using during descriptor state reset
|
||
- zink: null out zink_batch_state::ctx when adding to the screen list
|
||
- zink: reset batch states on destroy
|
||
- zink: flag gfx pipeline_changed if switching from a shader object draw
|
||
- zink: flag mesh pipeline_changed if switching from a shader object draw
|
||
- zink: only try update descriptors on draw/dispatch when necessary
|
||
- zink: fix descriptor array indexing for mesh pipeline
|
||
- zink: set OutputPoints for mesh point output
|
||
- zink: various cleanups for mesh+multiview
|
||
- zink: stop creating GPL inputs for mesh
|
||
- zink: disable single-aspected blits for now
|
||
- tu: don't deref end info in tu_CmdEndRendering2EXT
|
||
- zink: add ZINK_DEBUG=nogeneral to disable unified image layouts
|
||
- mesa: don't assert when finding a renderbuffer miplevel fails
|
||
- zink: fix u_blitting when clears are pending
|
||
- hud: delete buffer refcounting
|
||
- zink: convert task_payload offset to array index in prepass
|
||
- vulkan: update spec to 1.4.328
|
||
- lavapipe: move copy_depth_box to lvp_image.c
|
||
- lavapipe: handle aspected depth/stencil memory->image HIC transfers
|
||
- lavapipe: VK_KHR_copy_memory_indirect
|
||
- mesa: delete task and mesh programs on context destroy
|
||
- zink: fix disabling multiview mesh with shader objects
|
||
- zink: various fixes for custom sample locations
|
||
- zink: stop using vk lazy allocations / transient attachments
|
||
- zink: strip dmabuf bind flags when creating transient image
|
||
- zink: always add mutable to transient surface creation when needed
|
||
- zink: only add mutable bind for transient surfaces when necessary
|
||
- zink: disable msrtss handling when blitting
|
||
- glsl: fix gl_ViewID_OVR type to uint
|
||
- mesa: copy NumSamples in reuse_framebuffer_texture_attachment
|
||
- zink: enable GL_EXT_mesh_shader
|
||
- zink: enable srgb-mutable for dmabufs when possible
|
||
- zink: defer swapchain updates for interval changes if acquired image is active
|
||
- zink: consistently set/unset msrtss in begin_rendering
|
||
- zink: disable primitiveFragmentShadingRateMeshShader feature
|
||
- zink: collapse gfx pipeline fetching and binding conditionals
|
||
- zink: collapse mesh pipeline fetching and binding conditionals
|
||
- zink: don't destroy old push layout when enabling fbfetch descriptor
|
||
|
||
Mohamed Ahmed (12):
|
||
|
||
- nvk: Dynamically allocate queues
|
||
- nak: Fix 64-bit bit_count, ufind_msb, ifind_msb, find_lsb
|
||
- nak: Enable lowering for bitfield manipulation at <32bit sizes
|
||
- nvk: Ensure we have nvkmd before shader upload
|
||
- nvk: Ensure we have nvkmd before sampler descriptor upload
|
||
- nvk: Skip creating a nvkmd device if we don't have to
|
||
- nvk: Add support for VK_QUERY_POOL_CREATE_RESET_BIT_KHR
|
||
- nvk: Advertise VK_KHR_maintenance9
|
||
- nil: Add missing compressible PTE kinds
|
||
- nouveau/headers: Add AMPERE_B compute subchannel definition
|
||
- nouveau/mme: Add unit tests for sharing between compute and 3D scratch registers
|
||
- nvk: Use the compute MME for compute dispatch
|
||
|
||
Myrrh Periwinkle (1):
|
||
|
||
- gallium: Properly handle non-contiguous used sampler view indexes
|
||
|
||
Nagulendran, Iswara (3):
|
||
|
||
- amd/vpelib: Fix Issues with Background Color insertions
|
||
- amd/vpelib: Fix cost profiling support
|
||
- amd/vpelib: Handle Destination Rect with zero dimensions
|
||
|
||
Nanley Chery (18):
|
||
|
||
- anv: Disable CCS if image bound to wrong heap on Xe2+
|
||
- anv: Disable fast-clears on linear surfaces
|
||
- iris: Disable fast-clears on linear surfaces
|
||
- iris: Add PIPE_BIND_SCANOUT when exporting textures
|
||
- iris: Fix image reallocation for sharing
|
||
- intel/isl: Only set CMF on renderable views on Xe2+
|
||
- intel: Enable CCS_E on linear surfaces on Xe2+
|
||
- iris: Drop iris_resource_image_is_pat_compressible
|
||
- anv,hasvk: Take trace submission ID out of lock
|
||
- anv: Rework locking for sparse binding with TR-TT
|
||
- intel/isl: Define initial state of non-zeroed CCS on gfx9-11
|
||
- anv: Query ISL for the aux-state of undefined layouts
|
||
- intel: Delete the has_illegal_ccs_values bool
|
||
- intel/isl: Update the initial HiZ state for Xe2+
|
||
- intel/isl: Update the aux-state of zeroed HiZ
|
||
- iris: Don't zero the CCS in an already zeroed BO
|
||
- iris: Initialize HiZ to the CLEAR state on BDW-ICL
|
||
- iris: Drop iris_resource_level_has_hiz()
|
||
|
||
Natalie Vock (18):
|
||
|
||
- radv/winsys: Support vm_always_valid in the NULL winsys
|
||
- radv: Only expose indirect raytracing on gfx7+
|
||
- aco: Add RegisterDemand::operator!=
|
||
- aco: Add function call attributes
|
||
- aco: Add ABI and Pseudo CALL format
|
||
- aco: Add call-related program/block properties
|
||
- aco: Add call info
|
||
- aco/lower_to_hw_instr: Lower calls
|
||
- aco/live_var_analysis: Handle calls
|
||
- aco/sched: Handle calls
|
||
- aco/validate: Validate call instructions
|
||
- aco/vn: Don't combine expressions across calls
|
||
- aco/opt: Work around GCC compiler issue
|
||
- aco/scheduler: Bail early on unreorderable instructions
|
||
- vulkan/bvh: Mark instances with NAN AABBs as inactive
|
||
- radv/bvh: Encode empty AS bounds as NaN
|
||
- nir/lower_shader_calls: Repair SSA after wrap_instrs
|
||
- radv: Fix PSO history with RT pipelines
|
||
|
||
Nataraj Deshpande (1):
|
||
|
||
- anv: add feature flags for linearly tiled ASTC images
|
||
|
||
Okenczyc, Andrzej (1):
|
||
|
||
- amd/vpelib: Move predication size calculation to bufs_req
|
||
|
||
Olivia Lee (16):
|
||
|
||
- panvk: stop CPU mapping all index buffers on JM
|
||
- perfetto: allow specifying clock domain for cpu timestamps
|
||
- panvk/perfetto: improve clock synchronization using CLOCK_MONOTONIC_RAW
|
||
- editorconfig: move OpenCL configuration to root
|
||
- vulkan: move internal vulkan pseudo-extensions to a common file
|
||
- vulkan/util: add vk_topology_to_mesa helper function
|
||
- hk: replace vk_conv_topology with vk_topology_to_mesa from vulkan/util
|
||
- lavapipe: replace vk_conv_topology with vk_topology_to_mesa from vulkan/util
|
||
- v3dv: replace vk_to_mesa_prim with vk_topology_to_mesa from vulkan/util
|
||
- panvk: pass correct variant shader/compile inputs to panvk_lower_nir
|
||
- pan/va: fix bi_is_imm_desc_handle early return
|
||
- panvk: fix FS driver set layout when LD_VAR_BUF is disabled
|
||
- vtn_bindgen2: use anonymous namespace to avoid name collisions
|
||
- util/macros: coerce likely/unlikely to bool even without __builtin_expect
|
||
- panfrost: fix cl_local_size for precompiled shaders
|
||
- hk: fix data race when initializing poly_heap
|
||
|
||
Paolo Bonzini (2):
|
||
|
||
- meson: rename Rust subprojects to NAME-SEMVER-rs
|
||
- docs: document naming convention for Rust subprojects
|
||
|
||
Patrick Lerda (23):
|
||
|
||
- dri: fix image_loader_extensions array
|
||
- dri: complete the support for ARGB4444
|
||
- r600: refactor r600_is_buffer_format_supported() for the next update
|
||
- r600: fix remaining pbo issues
|
||
- r600: fix arb_shader_image_load_store incomplete
|
||
- r600: refactor step 1 - r600_texture cast is replaced by a function
|
||
- r600: refactor step 2 - r600_resource cast is replaced by a function
|
||
- r600: refactor step 3 - split r600_framebuffer
|
||
- r600: refactor step 4 - clean up r600_surface width0 and height0 elements
|
||
- r600: refactor step 5 - evergreen clean up an incompatible mechanism
|
||
- r600: refactor step 6 - pre-evergreen clean up
|
||
- r600: refactor step 7 - split r600_surface
|
||
- r600: refactor step 8 - pre-evergreen operations
|
||
- r600: refactor step 9 - remove util_framebuffer_init
|
||
- r600: refactor step 10 - drop create_surface
|
||
- r600: refactor step 11 - change r600_aligned_buffer_create() return type
|
||
- r600: fix evergreen gds atomic_counter_comp_swap
|
||
- r600: fix r600_resource_copy_region behavior for some formats
|
||
- r600: update multi_draw_indirect_params drm version requirement
|
||
- r600: fix emit_ssbo_atomic_op when ssbo_image_offset is non-zero
|
||
- r600: fix r600_draw_rectangle refcnt imbalance
|
||
- r600: update nplanes support
|
||
- r600: limit pre-evergreen predicate ready size
|
||
|
||
Paul Gofman (1):
|
||
|
||
- driconf: add a workaround for Investigation Stories : gunsound
|
||
|
||
Paulo Zanoni (32):
|
||
|
||
- brw: remove unnecessary <vector> inclusions
|
||
- brw: store 'volatile' GLSL/SPIR-V access in MEMORY_LOGICAL_FLAGS
|
||
- brw: consider 'volatile' memory access when doing CSE
|
||
- brw: mark 'volatile' sends as uncached on LSC messages
|
||
- brw: adjust comment pasted from a commit message
|
||
- brw: remove unnecessary casts to unsigned after calling LSC_CACHE()
|
||
- brw: null-tile sends don't need to skip L3 on Xe2 and newer
|
||
- anv/sparse: don't claim Xe2's non-standard MSAA shapes as unsupported
|
||
- anv/sparse: declare sparse MSAA block shapes as standard before Xe2
|
||
- anv/sparse: allow multiple sample bits in anv_sparse_image_check_support
|
||
- anv/sparse: don't support depth/stencil with sparse
|
||
- anv/sparse: we can support R64 and other atomics emulated formats
|
||
- anv/sparse: call sparse_image_check_support from get_image_format_properties
|
||
- zink: new expected failures for sparse depth buffers
|
||
- intel: rework the way sparse forces CCS/MCS/HIZ to be disabled
|
||
- isl: allow sparse with CCS on Xe2 and newer
|
||
- isl: allow sparse with STC_CCS on DG2
|
||
- iris: fix indentation during command submission
|
||
- iris/xe: move error checking to inside the devinfo->no_hw case
|
||
- iris: devinfo->no_hw is unlikely
|
||
- anv/i915: bring info->no_hw handling to anv_gem_execbuffer()
|
||
- anv/xe: extract xe_exec_ioctl()
|
||
- anv/xe: rework set_lost handling in xe_exec_ioctl()
|
||
- anv/i915: rework set_lost handling in anv_gem_execbuffer()
|
||
- anv/xe: set the queue as lost instead of the device on execbuf failure
|
||
- anv: we never set I915_EXEC_FENCE_OUT
|
||
- intel/i915: add i915_gem_execbuf_ioctl()
|
||
- intel/i915: sleep a little bit between retries of the execbuf ioctl
|
||
- intel/i915: give up the execbuf ioctl after ~16s of ENOMEMs
|
||
- intel/i915: warn the user about repeated execbuf ENOMEM after ~2s
|
||
- intel/xe: unify behavior with i915.ko regarding ENOMEM on DRM_IOCTL_XE_EXEC
|
||
- intel: unify parameters for the exec ioctl retries
|
||
|
||
Pavel Asyutchenko (1):
|
||
|
||
- radv: report full sparse address space size
|
||
|
||
Pavel Ondračka (5):
|
||
|
||
- r300/ci: check gles2 extensions
|
||
- r300/ci: add one recent flake
|
||
- r300/ci: add RS740 piglit and dEQP testing
|
||
- r300/ci: remove emulated swtcl testing
|
||
- i915/ci: update CI expectations
|
||
|
||
Peter Quayle (2):
|
||
|
||
- pvr: various multiview fixes
|
||
- pvr: add view index support for vertex shaders
|
||
|
||
Philipp Zabel (1):
|
||
|
||
- rusticl: Fix hidden lifetime warnings
|
||
|
||
Pierre-Eric Pelloux-Prayer (31):
|
||
|
||
- bufferobj: init the return value for GetParam functions
|
||
- radeonsi/tests: enable vk interop testing
|
||
- radeonsi: fix refcount with memobj
|
||
- radeonsi/gfx12: dont use HTILE for imported textures
|
||
- nir/lower_io: make sure range is not 0
|
||
- mesa/st: always use base_serialized_nir for draw
|
||
- nir/opt_varyings: fix build with PRINT_RELOCATE_SLOT
|
||
- mesa/st: check buf before dereferencing it
|
||
- radeonsi/tests: update rasterpos results
|
||
- radeonsi: sync harder on finish
|
||
- radeonsi/sqtt: retry a frame capture after reiszing the buffer
|
||
- radeonsi/sqtt: update the shader after scratch config
|
||
- mesa: clear TransformFeedback.NumVarying on error
|
||
- mesa: add u_overflow.h
|
||
- util, vulkan: use u_overflow.h
|
||
- nir/opcodes: use u_overflow to fix incorrect checks
|
||
- nir/opcodes: remove invalid comment
|
||
- glthread, tc: Fix buffer release with glthread and tc
|
||
- st: add early to st_prune_releasebufs
|
||
- tc: prevent flush of incomplete batches
|
||
- tc: add debug code for tc_set_vertex_elements_for_call_pending
|
||
- util: mimic KCMP_FILE via epoll when KCMP is missing
|
||
- util: use F_DUPFD_QUERY on Linux
|
||
- radeonsi/tests: use black to fix style issues
|
||
- radeonsi/tests: allow to test radv
|
||
- radeonsi/tests: add gfx11_5 to the list
|
||
- radeonsi/tests: rename --no-xxx arguments
|
||
- radeonsi/tests: rename glcts_path -> vk_gl_cts_path
|
||
- radeonsi/tests: add an argument to specify a folder with the must pass files
|
||
- radeonsi/tests: add a flag to specify a folder with the cts binaries
|
||
- radeonsi: propagate shader updates for merged shaders
|
||
|
||
Pohsiang (John) Hsu (11):
|
||
|
||
- mediafoundation: change frame preanalysis rc from ifdef to runtime control
|
||
- d3d12: Fix mediafoundation build
|
||
- mediafoundation: fix deadlock when user call shutdown and endGetEvent concurrently
|
||
- gallium/pipebuffer: fix multithread issue on pb_slab_manager_create_buffer
|
||
- mediafoundation: periodic clang-format, no code changes
|
||
- mediafoundation: update doc to remove gallium-vdpau from build setup
|
||
- mediafoundation: return adjusted LTR frame (need to remove one for short term)
|
||
- mediafoundation: create sample allocator for SW input sample on demand to save video memory
|
||
- mediafoundation: periodic clang format - no code changes
|
||
- mediafoundation: remove extra ';'
|
||
- mediafoundation: update version to 1.07
|
||
|
||
Qiang Yu (103):
|
||
|
||
- all: rename PIPE_SHADER_VERTEX to MESA_SHADER_VERTEX
|
||
- all: rename PIPE_SHADER_TESS_CTRL to MESA_SHADER_TESS_CTRL
|
||
- all: rename PIPE_SHADER_TESS_EVAL to MESA_SHADER_TESS_EVAL
|
||
- all: rename PIPE_SHADER_GEOMETRY to MESA_SHADER_GEOMETRY
|
||
- all: rename PIPE_SHADER_FRAGMENT to MESA_SHADER_FRAGMENT
|
||
- all: rename PIPE_SHADER_COMPUTE to MESA_SHADER_COMPUTE
|
||
- all: rename PIPE_SHADER_TASK to MESA_SHADER_TASK
|
||
- all: rename PIPE_SHADER_MESH to MESA_SHADER_MESH
|
||
- all: rename PIPE_SHADER_TYPES to MESA_SHADER_STAGES
|
||
- all: rename PIPE_SHADER_MESH_TYPES to MESA_SHADER_MESH_STAGES
|
||
- glsl: remove miss declaration of struct gl_shader_stage
|
||
- all: rename gl_shader_stage to mesa_shader_stage
|
||
- all: rename pipe_shader_type to mesa_shader_stage
|
||
- mesa,gallium: remove pipe_shader_type_from_mesa
|
||
- all: rename gl_shader_stage_is_compute to mesa_shader_stage_is_compute
|
||
- all: rename gl_shader_stage_is_mesh to mesa_shader_stage_is_mesh
|
||
- compiler: remove gl_shader_stage_is_graphics
|
||
- all: rename gl_shader_stage_uses_workgroup to mesa_shader_stage_uses_workgroup
|
||
- compiler: rename gl_shader_stage_is_callable to mesa_shader_stage_is_callable
|
||
- all: rename gl_shader_stage_is_rt to mesa_shader_stage_is_rt
|
||
- all: rename gl_shader_stage_can_set_fragment_shading_rate
|
||
- all: rename gl_shader_stage_name to mesa_shader_stage_name
|
||
- compiler,gallium: remove PIPE_SHADER_* and adjust some macro usage
|
||
- gallium: add mesh shader caps
|
||
- mesa,gallium: remove tgsi_processor_to_shader_stage
|
||
- mesa/st: use shader_caps.max_instructions to check shader present
|
||
- compiler: adjust comments for mesa_shader_stage
|
||
- radeonsi: do not init nir_options for mesh shader
|
||
- gallium/dd: enlarge shader string for mesh shader
|
||
- mesa: enlarge the shader resourse limits for mesh shader
|
||
- mesa: init program constants for mesh shader
|
||
- glsl,gallium,mesa: replace MESA_SHADER_STAGES with MESA_SHADER_MESH_STAGES
|
||
- mesa: set a more accurate value for combined limits
|
||
- mesa: count mesh shader when init limits
|
||
- mesa: add mesh shader extension state
|
||
- nir/opt_varying: remove assert for mesh shader crash
|
||
- nir: lower io support task and mesh shader
|
||
- nir: compute io base for fragment shader inputs which maybe per primitive
|
||
- Update OpenGL headers for GL_EXT_mesh_shader
|
||
- mesa,mapi: add EXT_mesh_shader extension
|
||
- mesa: implement EXT_mesh_shader glGet* values
|
||
- mesa: implement EXT_mesh_shader glGetProgrameiv values
|
||
- mesa: implement EXT_mesh_shader glGetActive* values
|
||
- mesa,glsl: add mesh shader subrotine handling
|
||
- mesa: implement mesh shader queries
|
||
- mesa: support mesh shader when glCreateShader
|
||
- mesa: remove mtype.h include from st_atom.h
|
||
- mesa: fix glTexPageCommitmentARB and glTexturePageCommitmentEXT level check
|
||
- mesa: use bitset for driver states tracker
|
||
- gallium: cso context support mesh shader
|
||
- mesa: add mesh shader states
|
||
- mesa: handle mesh shader in state management
|
||
- mesa: implement mesh shader draw calls
|
||
- mesa,gallium: handle mesh shader create and delete
|
||
- gallium: threaded context support mesh shader
|
||
- gallium/u_blitter: save mesh shader
|
||
- gallium/ddebug: support mesh shader
|
||
- mesa: allow NULL for vertex shader when mesh pipeline
|
||
- gallium/trace: dump mesh shader queries
|
||
- mesa/st: convert mesh shader to gl stages
|
||
- mesa: not fail the assert when detach mesh shader
|
||
- mesa: program pipeline support mesh shader
|
||
- gallium/noop: add mesh shader callbacks
|
||
- panfrost: fix image plane array copy
|
||
- panfrost: fix lowered multi plane resource offset/stride param get
|
||
- ac/surface: refine supported modifier list for multi block size
|
||
- ac/surface: add radeonsi exported modifiers to supported list
|
||
- ac/surface: add ac_compute_surface_modifier
|
||
- gallium: add PIPE_RESOURCE_PARAM_DISJOINT_PLANES
|
||
- egl: refine dma buf export to support multi plane
|
||
- radeonsi: really support eglExportDMABUFImageQueryMESA
|
||
- mesa: fix draw mesh shader indirect buffer size check
|
||
- radeonsi: fix use aco/llvm debug options
|
||
- radeonsi: hide real modifier export behind AMD_DEBUG
|
||
- glsl: prepare parse state for mesh shader
|
||
- glsl: handle taskPayloadSharedEXT variables
|
||
- glsl: handle PerPrimitiveEXT qualifier
|
||
- glsl: allow shared variables in task and mesh shader
|
||
- glsl: handle mesh shader primitive type layout qualifier
|
||
- glsl: handle max_vertices/primitives for mesh shader
|
||
- glsl: handle work group in layout for mesh shader
|
||
- glsl: add input builtin variables for mesh shader
|
||
- glsl: add mesh shader builtin outputs
|
||
- glsl: assign mesh shader output variable array size
|
||
- glsl: handle mesh shader output block
|
||
- glsl: add mesh shader builtin functions
|
||
- glsl: nir_build_program_resource_list support mesh shader
|
||
- glsl: gl_nir_link_glsl handle mesh shader
|
||
- glsl: validate MS/FS interstage in/out block
|
||
- glsl: handle per primitive varying when link
|
||
- glsl: validate MS/FS interstage in/out variable type
|
||
- glsl: disable mesh shader output remove when separate shader
|
||
- glsl: pack vertex pipeline varying linkage into a function
|
||
- glsl: pack varying limit check code into functions
|
||
- glsl: add mesh pipeline varying linkage
|
||
- glsl: handle mesh shader when optimize varying
|
||
- glsl: handle explicit location for mesh shader
|
||
- glsl: lower shared and task playload for mesh shader
|
||
- glsl: no xfb buffer qualifier for mesh shader
|
||
- glsl: flat qualifier is not needed for per primitive IO
|
||
- glsl: translate mesa stage for mesh shader
|
||
- glsl: allow barrier builtin functions for mesh shader
|
||
- gallium: fix eglExportDMABUFImageQueryMESA crash for r600
|
||
|
||
Quentin Schulz (3):
|
||
|
||
- nvk: remove unused relative_dir variable
|
||
- meson: replace global_source_root/global_build_root with project_*
|
||
- meson: fix libcl assert() reproducibility
|
||
|
||
Renato Pereyra (1):
|
||
|
||
- anv: Enable anv_emulate_read_without_format for Android 15+
|
||
|
||
Rhys Perry (107):
|
||
|
||
- aco/lower_phis: add bld_before_logical_end helper
|
||
- nir/divergence: ignore boolean phis for ignore_undef_if_phi_srcs
|
||
- aco: optimize s_and(s_cselect, exec)
|
||
- aco: stop labeling first def of and(uniform_bool/uniform_bitwise, exec)
|
||
- aco: don't both flip s_cselect and label uniform_bool
|
||
- aco/opt: add some comments
|
||
- aco: optimize uniform s_not
|
||
- aco/isel: optimize uniform vote
|
||
- nir/cf: have nir_remove_after_cf_node remove phis at the start too
|
||
- nir/search: check variable requirements even if it's already seen
|
||
- nir/uub: fix 8/16-bit overflow
|
||
- nir/opt_access: support RT/callable shaders
|
||
- nir/load_store_vectorize: check for interfering shared2 before vectorizing
|
||
- nir/load_store_vectorize: set is_store for shared append/consume
|
||
- nir/load_store_vectorize: always set num_components correctly
|
||
- glsl_to_nir,vtn: insert barriers around begin/end invocation interlock
|
||
- ac/nir/lower_ps: remove barrier for end_invocation_interlock
|
||
- aco/gfx12: fix printing of temporal hints
|
||
- aco: align scratch size after isel
|
||
- aco: fix possible scratch offset overflow
|
||
- vtn: fix placement of barriers for MakeAvailable/MakeVisible
|
||
- nir: don't move accesses across make visible/available barriers
|
||
- vtn: remove acquire/release around make visible/available barriers
|
||
- nir/lower_memory_model: remove empty lowered barriers
|
||
- aco/ra: set late-kill for operands of temporary p_create_vector
|
||
- nir: add global_amd to nir_get_io_offset_src/nir_get_io_index_src
|
||
- nir/opt_load_skip_helpers: move divergence check earlier
|
||
- nir/opt_load_skip_helpers: always require helpers for handles
|
||
- nir/search: add nir_search_state
|
||
- nir/search: don't clear empty hash tables
|
||
- nir/search: reorder match_value to check constants first
|
||
- nir: add nir_def_num_lsb_zero
|
||
- nir/algebraic: improve is_unsigned_multiple_of_4 and use it more
|
||
- nir/algebraic: allow non-const for iand(iadd()) -> iadd(iand())
|
||
- nir/load_store_vectorize: use nir_def_num_lsb_zero in check_for_robustness
|
||
- nir/load_store_vectorize: use nir_def_num_lsb_zero in calc_alignment
|
||
- device-select: clang-format
|
||
- device-select: move get_default_device to it's own file
|
||
- device-select: simplify adding/removing instances
|
||
- device-select: do all getenv during instance creation
|
||
- device-select: use debug_get_bool_option for FORCE_DEFAULT_DEVICE
|
||
- device-select: refactor device_select_get_default
|
||
- nir/divergence: make smem load_global_amd uniform
|
||
- drm-shim: use atomics for inited
|
||
- drm-shim: fix with asan
|
||
- aco: fix signed integer overflow
|
||
- radv: fix shift overflow in radv_pipeline_init_dynamic_state
|
||
- vtn: use vtn_has_decoration more
|
||
- nir/load_store_vectorize: refactor offset parsing
|
||
- nir/load_store_vectorize: refactor entry key creation
|
||
- nir/load_store_vectorize: call nir_def_num_lsb_zero less
|
||
- nir/load_store_vectorize: optimize accesses with u2u64(ishl.nuw(iadd))
|
||
- nir/opt_offsets: report progress if NUW is set
|
||
- nir/opt_offsets: fix progress determination with offsets that add to zero
|
||
- nir/opt_offsets: improve shared2 optimization
|
||
- nir/load_store_vectorize: remove offset check in try_vectorize_shared2
|
||
- aco: reduce cost of using values defined in predecessors
|
||
- aco: add is_atomic_or_control_instr helper
|
||
- aco: don't move release barriers after interlock end
|
||
- aco: don't move acquire barriers before interlock begin
|
||
- aco: refactor waitcnt pass to use barrier_info
|
||
- aco: add a separate barrier_info for release/acquire barriers
|
||
- aco: delay barrier waitcnt until they are needed
|
||
- aco: remove waitcnt code for SMEM stores
|
||
- aco: remove waitcnt code for POPS
|
||
- aco: update waitcnt events for exports
|
||
- aco: use a separate event for sendmsg_rtn
|
||
- aco: fix workgroup-scope barrier between vmem and lds
|
||
- aco/gfx10: skip waitcnts or use vm_vsrc(0) for workgroup vmem barriers
|
||
- aco/gfx10: skip waitcnts or use vm_vsrc(0) for workgroup lds barriers
|
||
- aco/tests: add barrier-to-waitcnt tests
|
||
- aco: avoid wraparound for smem global loads with both offsets
|
||
- aco: avoid unaligned offsets when selecting load_global_amd
|
||
- zink/ntv: fix coherent image load/store
|
||
- vtn: skip make-available/visible for shared
|
||
- zink/ntv: use MakePointerAvailable/Visible for shared load/store
|
||
- nir/lower_atomics_to_ssbo: set ACCESS_COHERENT for loads
|
||
- nir/lower_atomics: set ACCESS_COHERENT
|
||
- aco: workaround load tearing for load_shared2_amd
|
||
- aco: fix SGPR 8-bit nir_op_vec with mixed constant and non-constant
|
||
- ac/nir: fix progress reporting in ac_nir_lower_tex
|
||
- nir: fix progress reporting in nir_io_add_const_offset_to_base
|
||
- radv: fix progress reporting in lower_rt_derefs
|
||
- nir/opt_if: fix progress reporting with multiple function impls
|
||
- nir/opt_if: rewrite progress reporting and metadata invalidation
|
||
- nir: fix NIR_DEBUG=extended_validation
|
||
- nir: add NIR_DEBUG=progress_validation
|
||
- rusticl: support NIR_DEBUG=invalidate_metadata/extended_validation
|
||
- rusticl: support NIR_DEBUG=progress_validation
|
||
- aco: remove buffer_load_lds instructions
|
||
- nir: add ACCESS_ATOMIC
|
||
- vtn: set ACCESS_ATOMIC
|
||
- zink/ntv: use ACCESS_ATOMIC
|
||
- nir,vtn: add shader_info::assume_no_data_races
|
||
- nir: assume non-atomic loads don't tear
|
||
- aco: only workaround load tearing for atomic loads
|
||
- aco: set atomic semantic for atomic load/store
|
||
- aco: remove barrier acquire/release workaround
|
||
- aco: use MTBUF for 64-bit atomic load/store
|
||
- radv: move nir_opt_algebraic loop for NGG culling earlier
|
||
- radv: only call radv_should_use_wgp_mode() once
|
||
- radv: use CU mode when LDS is used
|
||
- radv: allow WGP mode with task/mesh
|
||
- amd/lower_mem_access_bit_sizes: don't create subdword UBO loads with LLVM
|
||
- amd/lower_mem_access_bit_sizes: improve subdword/unaligned SMEM lowering
|
||
- amd/lower_mem_access_bit_sizes: be more careful with 8/16-bit scratch load
|
||
- amd/lower_mem_access_bit_sizes: fix shared access when bytes<bit_size/8
|
||
|
||
Ritesh Raj Sarraf (8):
|
||
|
||
- ci: Update to Linux 6.16 for gfx-ci
|
||
- ci: Add test streaming-texture-leak to all-skips
|
||
- panfrost/ci: Document new/known flakes
|
||
- radv/ci: Update flakes uncovered during testing
|
||
- venus/ci: Add test to skips list
|
||
- ci: Uprev kernel to pick up virtio fix and use it for venus
|
||
- ci: Update to latest revision from ci-deb-repo
|
||
- ci/android: Use prebuilt cuttlefish packages from ci-deb-repo
|
||
|
||
Rob Clark (99):
|
||
|
||
- freedreno/decode: Add enum value decoding
|
||
- freedreno/crashdec: Sanitize index-regs section names
|
||
- freedreno/crashdec: Add option to export a snapshot
|
||
- freedreno/registers: Fix DBGC_CFG_DBGBUS_SEL_D definition
|
||
- util: Optimize MESA_TRACE_FUNC()
|
||
- freedreno: Remove obsolete comment
|
||
- freedreno: Add debug option to crash on context fault
|
||
- freedreno/drm: Rename emit_bo
|
||
- freedreno/drm: Split out sp_attach_ring()
|
||
- freedreno/drm: Split emit/attach obj/nonobj
|
||
- freedreno/drm: Expose attach_ring()
|
||
- freedreno/registers: Fix array reg names
|
||
- freedreno/registers: Add some reg64 to pm4
|
||
- freedreno/registers: Rename NCRB
|
||
- freedreno/registers: Fix SP_CS_PVT_MEM_BASE type
|
||
- freedreno/registers: Fix RB_A2D_DEST_FLAG_BUFFER_PITCH
|
||
- freedreno: Drop unused emit_const_asserts() param
|
||
- freedreno/a6xx: Drop orval/shift
|
||
- freedreno/a6xx: Separate out blit setup
|
||
- freedreno/a6xx: Bake iova's into descriptors
|
||
- freedreno/a6xx: Make samp descriptor an array
|
||
- freedreno/a6xx: Make tex/samp descriptors a normal BO
|
||
- freedreno/a6xx: Make __assert_eq() a nop if !NDEBUG
|
||
- freedreno/a6xx: Add new cmdstream builders
|
||
- freedreno/a6xx: Remove compute state user_consts_cmdstream_size
|
||
- freedreno/a6xx: Rename compute state to cp
|
||
- ir3: Don't count images in num_samp
|
||
- freedreno/a6xx: Convert gallium to new cs builders
|
||
- freedreno/computerator: Split out context restore
|
||
- freedreno/computerator: Convert ubo state to BO
|
||
- freedreno/computerator: Convert to new cs builders
|
||
- freedreno/a6xx: Remove old reg builder macros
|
||
- freedreno/drm-shim: Fix nr_rings/priorities
|
||
- drm/drm-shim: Add a702
|
||
- freedreno/layout: Don't align explicit layouts to page size
|
||
- freedreno/decode: Add missing varset check
|
||
- drirc: Work around ANGLE brokeness
|
||
- freedreno/registers: pm4 cleanup
|
||
- freedreno/decode: Fix ufixed/fixed decode of bitset fields
|
||
- freedreno/decode: lua support for <stripe>
|
||
- freedreno/drm-shim: Handle GET/SET_METADATA
|
||
- freedreno/registers: Add a way to disable deprecated warnings
|
||
- freedreno/registers: Generate variant builder always
|
||
- freedreno/a6xx: Convert to variant reg packers
|
||
- freedreno/computerator: Convert to variant reg packers
|
||
- freedreno/registers: Fix variant ranges
|
||
- freedreno/registers: Add implicit reg32 for empty arrays
|
||
- freedreno/registers: De-open-code some offsets
|
||
- freedreno/registers: Cleanup the bin_cntl's
|
||
- freedreno/registers: Move descriptor related enums
|
||
- freedreno/registers: Prep for upcoming things
|
||
- freedreno/registers: Make TPL1_BICUBIC_WEIGHTS_TABLE an array
|
||
- freedreno: Name a few events
|
||
- freedreno/a6xx: Drop VPC table magic
|
||
- freedreno/a6xx: Require write support for images
|
||
- freedreno/a6xx: Disallow impossible image swizzles
|
||
- freedreno/a6xx: Mark tex and samp descriptors for dumping
|
||
- freedreno/a6xx: Format table fixes
|
||
- nir/lower-amul: Fix crash with unused SSBO
|
||
- nir/lower-amul: Comment fix
|
||
- freedreno/registers: Add A7XX_CX_DBGC
|
||
- freedreno/registers: Re-enable validation for gen_header.py
|
||
- freedreno/registers: Remove license/etc from generated headers
|
||
- freedreno/registers: remove python 3.9 dependency for compiling msm
|
||
- freedreno/registers: Generate _HI/LO builders for reg64
|
||
- freedreno/registers: Update GMU register xml
|
||
- freedreno/a6xx: Fallback to original blit in the snorm_copy path
|
||
- freedreno/blitter: Don't ignore blit swizzle
|
||
- freedreno/a6xx: Add missing format
|
||
- freedreno/a6xx: Fix snorm rounding
|
||
- freedreno/devices: Update chicken bits
|
||
- freedreno/decode: Add test to check for conflicting regs
|
||
- freedreno/registers: Remove conflicting RBBM regs
|
||
- freedreno/registers: Fix x_CONTEXT_SWITCH_GFX_PREEMPTION_SAFE_MODE
|
||
- freedreno/decode: checkreg handling for bitsize/stride
|
||
- freedreno/decode/scripts: Add license comments
|
||
- freedreno/fdl: Set pitch for buffers
|
||
- freedreno/a6xx: Drop arbitrary import restrictions
|
||
- freedreno: Handle buffer import
|
||
- freedreno: Always use aux-ctx for export blits
|
||
- freedreno: Allow TC async fences to have an fd
|
||
- freedreno: Disable explicit sync heuristic for Xwayland
|
||
- freedreno/a6xx: Move reg to static-non-context
|
||
- freedreno/decode/crashdec: Limit snapshot BO size
|
||
- freedreno/afuc: Add missing varset check
|
||
- freedreno/registers: More register prep
|
||
- freedreno/registers: Rename some unknowns
|
||
- freedreno/registers: x_ADDR_MODE_CNTL is a6xx and earlier
|
||
- freedreno/registers: Fix a couple reg names
|
||
- freedreno/registers: Extract out bitset for roq_avail
|
||
- freedreno/decode: Add gen8 support
|
||
- freedreno/decode: Move enum lookup out of snapshot
|
||
- freedreno/registers: Common-ize PIPE definitions
|
||
- freedreno/registers: Add gen8 regs
|
||
- freedreno/registers: Add gen8 descriptor layout
|
||
- freedreno/registers: pm4 updates for gen8
|
||
- freedreno/a6xx: Slight re-org of sampler descriptor building
|
||
- freedreno/layout: Convert fd6_view to c++
|
||
- freedreno/layout: gen8 descriptor support
|
||
|
||
Rob Hughes (1):
|
||
|
||
- llvmpipe: Work around WSL 1 missing support for memfd_create()
|
||
|
||
Robert Mader (8):
|
||
|
||
- anv: Enable G8_B8_R8_3PLANE_422 and G8_B8_R8_3PLANE_444 formats
|
||
- gallium: Set and count all extra samplers
|
||
- mesa: Add support for NV61, NV24 and NV42 pixel formats
|
||
- panfrost: Add lowerings for the NV61, NV24 and NV42 pixel formats
|
||
- nir: Fixup 10/12 bit SW decoder YCbCr formats
|
||
- sw_winsys: Add winsys_handle to displaytarget_create_mapped
|
||
- kms-dri-sw: Implement create_mapped()
|
||
- kms-dri-sw: Report linear modifiers in get_handle()
|
||
|
||
Rohan Garg (1):
|
||
|
||
- intel/compiler: use the WA framework when emitting WA 14014595444
|
||
|
||
Rohit Athavale (6):
|
||
|
||
- mediafoundation: Add guids for the newly added Input Delta QP & Absolute QP APIs
|
||
- mediafoundation: Add IsSupported() & GetValue() for CODECAPI_AVEncVideoInputDeltaQPBlockSettings
|
||
- d3d12: Make delta QP min and max to be bit-depth dependent for HEVC
|
||
- pipe: Add pipe_enc_qpmap_input_info to contain GPU & CPU QP Maps
|
||
- d3d12: Update d3d12 back to use pipe_enc_qpmap_input_info
|
||
- mediafoundation: Lock QP Map Buffer when in use, unlock after
|
||
|
||
Roland Scheidegger (13):
|
||
|
||
- llvmpipe: minor cleanup
|
||
- llvmpipe: Fix array mismatch when accessing shader images
|
||
- llvmpipe: Fix attribute interpolation setup when rendering lines with msaa
|
||
- llvmpipe: Fix wrong pixel shader invocation count with discard
|
||
- llvmpipe: Fix wrong GS invocation count when using instanced GS
|
||
- llvmpipe: add bitcasts around fptrunc/fpext operations
|
||
- docs: fix up old comment about fake msaa for llvmpipe
|
||
- lavapipe: don't leak the temporary msaa resource
|
||
- llvmpipe: fix incorrect scissor planes
|
||
- lavapipe: expose support for msaa 8x
|
||
- gallium,mesa/st: reverse logic for y flip for programmable sample locations
|
||
- llvmpipe: implement GL_ARB_sample_locations
|
||
- lavapipe: implement VK_EXT_sample_locations
|
||
|
||
Romaric Jodin (11):
|
||
|
||
- pan/bi: use only 1 MKVEC.v2i8 to generate v4i8 when possible
|
||
- pan/va: improve lowering of SWZ_V4I8
|
||
- pan/bi: add pass to simplify control flow
|
||
- pan/bi: schedule simple iterators to avoid extra move
|
||
- panfrost/perfetto: Use Android-internal perfetto
|
||
- meson: remove '--outdir' argument in script
|
||
- meson: add vk_enum_defines.h to idep_vulkan_util_headers
|
||
- meson: add depend_files for gl_enums.py
|
||
- meson: update xml files list in mesa/glapi
|
||
- meson: sort xml files in mesa/glapi
|
||
- glapi: static_data: do not use __file__ to get gl symbols file
|
||
|
||
Ruijing Dong (2):
|
||
|
||
- radeonsi/vcn: vcn5 av1 decoding context buffer fix
|
||
- radeonsi/vcn: Correct a typo condition for jpeg decoding
|
||
|
||
Ryan Houdek (1):
|
||
|
||
- freedreno/fdl: Fix typo in tiled_to_linear_2cpp
|
||
|
||
Sagar Ghuge (24):
|
||
|
||
- intel/genxml: Update CS_CHICKEN1 register field
|
||
- anv: Use thread group preemption granularity
|
||
- vulkan/radix_sort: Fix subgroup invocation id
|
||
- anv: Use vk_get_bvh_build_pipeline_spv helper
|
||
- vulkan/runtime: Add VK_SHADER_CREATE_UNALIGNED_DISPATCH_BIT_MESA flag
|
||
- anv: Mask off excessive invocations
|
||
- intel/genxml: Drop all unused struct/fields
|
||
- intel/compiler: Fix ray geometry index
|
||
- anv: Add missing ACCELERATION_STRUCTURE_READ in barrier handling
|
||
- anv: Enable CS stall for ACCELERATION_STRUCTURE_COPY stage
|
||
- anv: Add missing L3 flushes
|
||
- anv: Apply pipe flushes for outstanding PC bits
|
||
- anv: Emit state cache invalidation after every compute dispatch
|
||
- blorp: Emit state cache invalidation after every compute dispatch
|
||
- iris: Emit state cache invalidation after every compute dispatch
|
||
- isl: Respect driconf option for EnableSamplerRoutetoLSC
|
||
- Revert "intel: Always set Cube Face Enables for all surfaces."
|
||
- anv: Call brw_nir_lower_rt_intrinsics_pre_trace lowering pass
|
||
- brw/rt: Move nir_build_vec3_mat_mult_col_major helper to header
|
||
- brw/rt: fix ray_object_(direction|origin) for closest-hit shaders
|
||
- vulkan/runtime: Fix typo in stack size calculation
|
||
- anv: Use correct engine class for companion RCS
|
||
- anv: Drop unwanted untyped flush for AS query
|
||
- intel/common: Consider 0 threads while setting TG
|
||
|
||
Samuel Pitoiset (352):
|
||
|
||
- Revert "ci: Disable Valve keywords farm"
|
||
- radv: adjust conservative rasterization configuration on GFX12
|
||
- radv: use vk_optimize_depth_stencil_state() for optimal settings
|
||
- radv: add RADV_DEBUG=novideo to disable all video extensions
|
||
- radv: fix SQTT shaders relocation on GFX12
|
||
- radv: simplify emitting SQTT shaders relocation for GFX6-GFX11.5
|
||
- radv: fix reporting instance/vertex_count for direct draws with RGP on GFX12
|
||
- radv: reject 1D block-compresed formats with mips on GFX6
|
||
- zink/ci: update list of expected failures for NAVI31
|
||
- zink/ci: remove old gfx1200 lists
|
||
- radv/ci: fix list of expected failures for VEGA10/NAVI10
|
||
- radv: fix a memleak with GS copy shader NIR
|
||
- radv: emit PGM_HI_PS in the gfx preamble on GFX12
|
||
- radv: remove dead ES emit code on GFX12
|
||
- radv: invalidate compute/rt descriptors at pipeline bind time
|
||
- radv: stop passing compute shader to radv_dispatch()
|
||
- radv: rework graphics shaders/vbos prefetch sligthly
|
||
- radv: handle compute/rt prefetch like graphics
|
||
- radv: add radv_{before,after}_dispatch() functions
|
||
- radv: replace DGC before/after dispatch helpers with the new ones
|
||
- radv: fix fbfetch output with compresed FMASK on <= GFX9
|
||
- vulkan: fix missing presentId2/presentWait2 enable features
|
||
- docs: add missing VK_KHR_present_id/2 to features.txt
|
||
- ci: uprev VKCTS main to 9dd9a72b28218f1ca12777d9b73c2a85c5c60231
|
||
- ac/gpu_info,radv: use the maximum virtual address from the kernel
|
||
- radv: invalidate compute/rt descriptors at dispatch time
|
||
- zink/ci: skip spec\@arb_fragment_program\@fog-modes on RADV
|
||
- radv/ci: fix GPU hang detection regex with recent kernels
|
||
- zink/ci: reduce timeout of zink-radv-navi31-valve
|
||
- zink/ci: make zink-radv-navi31-valve a pre-merge job
|
||
- radv: precompute the mask for enabled color writes
|
||
- radv: precompute the mask for color write attachments
|
||
- radv: precompute color blend equations
|
||
- radv: track more CB related context registers on < GFX12
|
||
- radv: regroup CB related states emission together
|
||
- radv: tidy up radv_device_init_perf_counters()
|
||
- radv: introduce radv_cmd_stream
|
||
- radv: switch to radv_cmd_stream everywhere
|
||
- radv: move buffered registers for GFX12 to radv_cmd_stream
|
||
- radv: move context_roll_without_scissor_emitted to radv_cmd_stream
|
||
- radv: move tracked registers to radv_cmd_stream
|
||
- radv/ci: uprev kernel to 6.15.9
|
||
- radv: cleanup some redundant cmd_buffer->cs occurrences
|
||
- radv: remove cs parameter for all opt context emit helpers
|
||
- radv: remove cs parameter for gfx12 push SH reg helpers
|
||
- radv: implement RB+ depth-only rendering for better perf
|
||
- radv: fix destroying CS with RADV_PERFTEST=dmashaders
|
||
- ac,radv,radeonsi: fix programming PA_SU_PRIM_FILTER_CNTL on GFX12
|
||
- radv/amdgpu: fix creation with different but unused RADV_PERFTEST flags
|
||
- ac/descriptors: add a function to create a descriptor for HiZ surfaces
|
||
- radv: allocate image metadata to implement a workaround for HiZ on GFX12
|
||
- radv: add a function to create an image view for HiZ surfaces
|
||
- radv/meta: add a pass to clear HiZ surfaces
|
||
- radv: initialize HiZ metadata during image layout transitions
|
||
- radv/meta: update HiZ metadata after depth/stencil image clears
|
||
- radv: validate dynamic states earlier
|
||
- radv: implement an alternative workaround for HiZ on GFX12
|
||
- radv: fix reserving space for emitting push constants with DGC IES
|
||
- radv: remove redundant push constant size alignment for DGC
|
||
- radv: pass the IES struct when computing the DGC sequence size
|
||
- radv: pre-compute more information when updating DGC IES
|
||
- radv: optimize the preprocess buffer size for DGC IES compute
|
||
- radv: use radv_write_sampler_descriptor() for combined image/sampler
|
||
- radv: do not hardcode the combined image/sampler offset in the db path
|
||
- radv: only write 32 bytes for combined image/sampler on GFX11+
|
||
- radv: reduce the combined image/sampler desc size on GFX11+
|
||
- radv: remove useless inline push constant emission with DGC IES
|
||
- radv: stop using the pipeline layout for inlined push constants with DGC
|
||
- radv: split uploading push constants with DGC in two parts
|
||
- radv: stop using the pipeline layout for uploading push constants with DGC
|
||
- radv: tidy up radv_flush_descriptors()
|
||
- radv: slightly optimize indirect descriptor sets upload size
|
||
- radv: invalidating push constants for compute<->rt during dispatches
|
||
- radv: do not emit inlined SGPRs twice for merged shaders
|
||
- radv: use radv_shader_need_indirect_descriptor_sets() more
|
||
- radv: determine if push constants need to be uploaded earlier
|
||
- radv: rework emitting push constants for less CPU overhead
|
||
- radv: add a function that uploads push constants
|
||
- radv: remove unused forwarded declarations of pipeline layout
|
||
- radv: determine the push constant size from the shader itself
|
||
- radv: add a function to get push constant layout info for DGC
|
||
- radv: gather push constant size from shaders for DGC
|
||
- radv: stop using the pipeline layout completely for DGC
|
||
- radv: fix color attachment remapping with fast-GPL/ESO
|
||
- radv: merge two similar loops in lookup_ps_epilog()
|
||
- Revert "radv/ci: disable hang detection in navi31-vkcts"
|
||
- zink/ci: skip one piglit subset that randomly hangs on RADV
|
||
- zink/ci: update list of flakes for NAVI31/VANGOGH/CEZANNE
|
||
- amd/drm-shim: add navi33
|
||
- radv: emit relocation for task shaders at the same place as other stages
|
||
- radv: rework the helper to emit buffered regs on GFX12
|
||
- radv: emit compute pipeline with buffered SH regs on GFX12
|
||
- radv: emit descriptor pointers with buffered SH regs on GFX12
|
||
- radv: emit inlined push constants with buffered SH regs on GFX12
|
||
- radv/ci: update expected list of failures/flakes on GFX1201
|
||
- radv/ci: use 3 parallel jobs for radv-gfx1201-vkcts
|
||
- radv/ci: reduce the timeout for radv-gfx1201-vkcts
|
||
- radv/ci: make radv-gfx1201-vkcts a pre-merge job
|
||
- radv/ci: document a very recent ACO regression on GFX12
|
||
- zink/ci: make zink-radv-gfx1201-valve a pre-merge job
|
||
- zink/ci: update list of flakes for GFX1201
|
||
- radv: get the depth clamp mode earlier when emitting viewports
|
||
- radv: emit depth clamp enable as part of the viewport state
|
||
- radv: add a new dirty bit for the viewport state
|
||
- radv: precompute the depth clamp mode
|
||
- radv: precompute the depth clip enable
|
||
- radv: dirty some states from graphics pipeline earlier
|
||
- radv: do not emit few RADV_CMD_DIRTY_xxx based on dynamic states
|
||
- radv: only re-emit needed states when PS inner coverage changes
|
||
- radv: add a new dirty bit for the binning state
|
||
- radv: optimize re-emitting the occlusion query state on GFX12
|
||
- radv: validate dynamic states for the occlusion query state earlier
|
||
- radv: validate dynamic states for the db shader control state earlier
|
||
- radv: add a new dirty bit for the ngg culling state
|
||
- radv: add a new dirty bit for the FSR state
|
||
- radv: add a new dirty bit for the rast samples state
|
||
- radv: rename RADV_CMD_DIRTY_TESS_STATE to RADV_CMD_DIRTY_TCS_TES_STATE
|
||
- radv: add a new dirty bit for the depth bias state
|
||
- radv: dirty the depth stencil state when rendering begins
|
||
- radv: dirty the cb render state when rendering begins
|
||
- radv: dirty more states when rendering begins
|
||
- radv: add a new dirty bit for the VS prolog state
|
||
- radv: add a new dirty bit for the blend constants state
|
||
- radv: add a new dirty bit for the sample locations state
|
||
- radv: add a new dirty bit for the scissor state
|
||
- radv: make radv_cmd_state::dirty a 64-bit field
|
||
- radv: add missing L2 invalidate cache flush for non-coherent images
|
||
- radv: add a new dirty bit for the tess domain origin state
|
||
- radv: add a new dirty bit for the patch control points state
|
||
- radv: add a new dirty bit for the VGT prim state
|
||
- radv: remove radv_cmd_buffer_flush_dynamic_state()
|
||
- radv: remove dead code when setting dynamic primitive topology
|
||
- radv: dirty the rast sample states for VRS att/OOO rast
|
||
- radv: dirty RADV_CMD_DIRTY_xx states when binding sample shading state
|
||
- radv: dirty the rast samples state when VRS is forced to 1x1
|
||
- radv: rename rast_prim to vgt_outprim_type everywhere
|
||
- radv: stop abusing dirty_dynamic when binding a NULL fragment shader
|
||
- radv: clear RADV_CMD_DIRTY_xxx bits outside of the caller in most cases
|
||
- radv: fix hashing graphics pipeline when no stages are compiled
|
||
- radv: run nir_lower_memcpy after spirv->nir
|
||
- radv: run nir_opt_memcpy before nir_opt_copy_prop_vars
|
||
- radv/nir/lower_cmat: handle untyped pointers for load/store
|
||
- radv: advertise VK_KHR_shader_untyped_pointers
|
||
- radv: clear RADV_CMD_DIRTY_xxx bits outside of the caller in more cases
|
||
- radv: handle fbfetch output after binding graphics shaders
|
||
- radv: clear descriptors state dirty bit outside of the caller
|
||
- radv: add a new state for forced VRS rates
|
||
- radv: check if SQTT is enabled before calling radv_describe_draw()
|
||
- radv: check flush_bits before calling radv_emit_cache_flush() in the draw path
|
||
- radv: add radv_cmd_set_line_width()
|
||
- radv: add radv_cmd_set_tessellation_domain_origin()
|
||
- radv: add radv_cmd_set_patch_control_points()
|
||
- radv: add radv_cmd_set_depth_clamp_range()
|
||
- radv: add radv_cmd_set_depth_clip_negative_one_to_one()
|
||
- radv: add radv_cmd_set_primitive_restart_enable()
|
||
- radv: add radv_cmd_set_depth_bias()
|
||
- radv: add radv_cmd_set_line_stipple()
|
||
- radv: add radv_cmd_set_cull_mode()
|
||
- radv: add radv_cmd_set_front_face()
|
||
- radv: add radv_cmd_set_depth_bias_enable()
|
||
- radv: add radv_cmd_set_rasterizer_discard_enable()
|
||
- radv: add radv_cmd_set_polygon_mode()
|
||
- radv: add radv_cmd_set_line_stipple_enable()
|
||
- radv: add radv_cmd_set_depth_clip_enable()
|
||
- radv: add radv_cmd_set_conservative_rasterization_mode()
|
||
- radv: add radv_cmd_set_provoking_vertex_mode()
|
||
- radv: add radv_cmd_set_depth_clamp_enable()
|
||
- radv: add radv_cmd_set_line_rasterization_mode()
|
||
- radv: add radv_cmd_set_alpha_to_coverage_enable()
|
||
- radv: add radv_cmd_set_alpha_to_one_enable()
|
||
- radv: add radv_cmd_set_sample_mask()
|
||
- radv: add radv_cmd_set_rasterization_samples()
|
||
- radv: add radv_cmd_set_sample_locations_enable()
|
||
- radv: add radv_cmd_set_depth_bounds()
|
||
- radv: add radv_cmd_set_stencil_compare_mask()
|
||
- radv: add radv_cmd_set_stencil_write_mask()
|
||
- radv: add radv_cmd_set_stencil_reference()
|
||
- radv: add radv_cmd_set_logic_op()
|
||
- radv: add radv_cmd_set_color_write_enable()
|
||
- radv: add radv_cmd_set_color_write_mask()
|
||
- radv: add radv_cmd_set_logic_op_enable()
|
||
- radv: add radv_cmd_set_fragment_shading_rate()
|
||
- radv: add radv_cmd_set_attachment_feedback_loop_enable()
|
||
- radv: add radv_cmd_set_primitive_topology()
|
||
- radv: add radv_cmd_set_blend_constants()
|
||
- radv: add radv_cmd_set_discard_rectangle_mode()
|
||
- radv: add radv_cmd_set_discard_rectangle_enable()
|
||
- radv: add radv_cmd_set_depth_test_enable()
|
||
- radv: add radv_cmd_set_depth_write_enable()
|
||
- radv: add radv_cmd_set_depth_compare_op()
|
||
- radv: add radv_cmd_set_depth_bounds_test_enable()
|
||
- radv: add radv_cmd_set_stencil_test_enable()
|
||
- radv: add radv_cmd_set_stencil_op()
|
||
- radv: add radv_cmd_set_discard_rectangle()
|
||
- radv: make use of RADV_DYNAMIC_{VIEWPORT,SCISSOR}_WITH_COUNT
|
||
- radv: add radv_cmd_set_viewport_with_count()
|
||
- radv: add radv_cmd_set_scissor_with_count()
|
||
- radv: add radv_cmd_set_scissor()
|
||
- radv: add radv_cmd_set_viewport()
|
||
- radv: make radv_ps_epilog_state::color_blend_enable a 8-bit field
|
||
- radv: pre-compute color blend enable
|
||
- radv: add radv_cmd_set_color_blend_enable()
|
||
- radv: add radv_cmd_set_rendering_attachment_locations()
|
||
- radv: add radv_cmd_set_rendering_input_attachment_indices()
|
||
- radv: add radv_cmd_set_sample_locations()
|
||
- radv: add radv_cmd_set_color_blend_equation()
|
||
- radv: only update vertex stride if pStrides is non-NULL when binding VBO
|
||
- radv: use the dynamic state to store vertex binding strides
|
||
- radv: bind the vertex binding strides like a normal dynamic state
|
||
- radv: move radv_vertex_input_state to radv_pipeline_graphics.h
|
||
- radv: move VBO misaligned/unaligned info to radv_vertex_input_state
|
||
- radv: remove unused parameter to radv_pipeline_init_dynamic_state()
|
||
- radv: use the dynamic state to store vertex input state
|
||
- radv: replace an assertion with a check when emitting VS prolog
|
||
- radv: bind the vertex input state like a normal dynamic state
|
||
- radv: fix setting VBO misaligned mask in graphics pipelines
|
||
- radv: allow to select a different HiZ workaround on GFX12
|
||
- radv: add RADV_GFX12_HIZ_WA to select the HiZ wa behavior on GFX12
|
||
- radv: rename NGG culling user SGPRs
|
||
- radv: split RADV_CMD_DIRTY_NGGC_STATE in two states
|
||
- radv: clear dynamic states earlier
|
||
- radv: use radv_get_vgt_outprim_type() to disable NGGC for points/lines
|
||
- radv: use radv_get_vgt_outprim_type() for the NGG SGPRs state
|
||
- radv: add an early return to radv_flush_vertex_descriptors()
|
||
- radv: emit BREAK_BATCH when the PS changes also for ESO
|
||
- radv: cleanup configuring AUTO_RESET_CNTL
|
||
- radv: dirty the raster state when setting the primitive topology
|
||
- radv: pre-compute tessellation num patches/lds size earlier
|
||
- radv: do not trigger PATCH_CONTROL_POINTS_STATE on GFX12
|
||
- radv: rename DIRTY_PATCH_CONTROL_POINTS_STATE to DIRTY_LS_HS_CONFIG
|
||
- radv: remove unnecessary ternary expressions in radv_emit_depth_stencil_state()
|
||
- radv: translate stencil op earlier
|
||
- radv: fix compiler warnings when uploading cmdbuf data might fail
|
||
- radv: remove unused radv_pipeline::user_data_0
|
||
- radv: remove set but unused has_nggc in radv_cmd_state
|
||
- radv: remove set but unused radv_graphics_pipeline fields
|
||
- radv: remove unnecessary radv_graphics_pipeline::is_ngg
|
||
- radv: disable VK_EXT_image_compression_control on GFX12
|
||
- radv/rt: only use one user SGPR for the traversal shader addr
|
||
- radv/rt: fix a potential issue with RADV_PERFTEST=dmashaders
|
||
- radv/ci: remove RADV_DEBUG=novideo for radv-gfx1201-vkcts
|
||
- radv: mark RADV_DEBUG=nodynamicbounds as deprecated
|
||
- radv: mark RADV_DEBUG=invariantgeom as deprecated
|
||
- radv: mark RADV_DEBUG=splitfma as deprecated
|
||
- radv: mark RADV_DEBUG=nongg_gs as deprecated
|
||
- radv: move drirc options to a separate struct
|
||
- radv: move features related drirc to radv_drirc::features
|
||
- radv: move performance related drirc to radv_drirc::performance
|
||
- radv: move debug related drirc to radv_drirc::debug
|
||
- radv: move misc related drirc to radv_drirc::misc
|
||
- radv: fix vk_error in radv_update_preambles()
|
||
- radv/amdgpu: add a function to query permitted context priorities
|
||
- radv: only expose permitted global queue priorities
|
||
- radv: rework the optimal packet order for "normal" draws
|
||
- radv: rework the optimal packet order for task/mesh draws
|
||
- radv: rework the optimal packet order for dispatches
|
||
- radv: rename radv_flush_occlusion_query_state()
|
||
- radv: simplify sample shading state tracking
|
||
- radv: determine which shader is the last VGT shader using next stage
|
||
- radv: trigger VS related states in radv_bind_pre_rast_shader()
|
||
- radv/meta: use radv_CmdDispatchBase() directly for ASTC decode
|
||
- radv: add small helper to dispatch RT
|
||
- radv: remove unnecessary NULL check when creating PS epilogs
|
||
- radv: add a function to bind a PS epilog
|
||
- radv: add a new dirty bit for compiling/binding a PS epilog
|
||
- radv: add a new dirty bit for emitting a PS epilog
|
||
- radv: rename RADV_CMD_DIRTY_FS_STATE to RADV_CMD_DIRTY_PS_STATE
|
||
- radv: exclude dynamic vertex input stride for the late scissor workaround
|
||
- radv/amdgpu: return OOM device when BO mapping fails
|
||
- radv/amdgpu: add more helpers for managing virtual BOs
|
||
- radv: add RADV_DEBUG=bo_history
|
||
- Revert "radv: handle fbfetch output after binding graphics shaders"
|
||
- radv: emit more push shader registers on GFX12
|
||
- radv: report an message when RADV_GFX12_HIZ_WA value is invalid
|
||
- radv: replace RADV_GFX12_HIZ_WA by a drirc option
|
||
- radv: switch to the full HiZ workaround by default on GFX12
|
||
- radv: disable radv_disable_hiz_his_gfx12 for Mafia Definition Edition
|
||
- radv: set radv_gfx12_hiz_wa=partial for some games to mitigate performance loss
|
||
- zink/ci: mark one test as crash/flake for turnip a618
|
||
- radv: get NIR options after initializing the physical device cache key
|
||
- radv: fix capture/replay with sampler border color
|
||
- spirv: add missing non-uniform access for SSBO atomics
|
||
- radv/meta: fix saving push constants for depth/stensil resolves on compute
|
||
- radv/meta: rework depth/stencil resolves using compute
|
||
- radv/meta: rework depth/stencil resolves using graphics
|
||
- radv/meta: remove useless VK_ACCESS_2_SHADER_WRITE_BIT for subpass resolves
|
||
- radv/meta: simplify barriers for resolves
|
||
- radv/meta: simplify calling depth/stencil resolve helpers
|
||
- radv/meta: remove useless assertion when choosing resolve method
|
||
- radv: pre-compute the number of rasterization samples
|
||
- radv: pre-compute the line rasterization mode
|
||
- radv: pre-compute vgt_outprim_type
|
||
- radv: remove redundant RADV_DYNAMIC_PRIMITIVE_TOPOLOGY
|
||
- radv: remove redundant RADV_DYNAMIC_LINE_RASTERIZATION_MODE
|
||
- radv: remove redundant RADV_DYNAMIC_POLYGON_MODE
|
||
- radv: remove redundant RADV_DYNAMIC_RASTERIZATION_SAMPLES
|
||
- radv: set DRLR mapping info from inheritance info when present
|
||
- radv: add a helper whether shader fp16 is enabled
|
||
- radv/ci: document recent unexpected failures on TAHITI
|
||
- Revert "radv/ci: document recent unexpected failures on TAHITI"
|
||
- radv: only expose AMD_device_coherent_memory if actually supported
|
||
- radv: reserve more CS space when executing DGC calls
|
||
- radv/ci: update expected list of failures for VEGA10/NAVI10
|
||
- radv: lower ycbcr tex instructions earlier
|
||
- radv: lower embedded/immutable samplers earlier
|
||
- radv: fix expected disk cache size for meta shaders
|
||
- nir: adjust nir_tex_instr_need_sampler() for AMD FMASK instructions
|
||
- radv: remove useless radeon_cmdbuf forwarded declaration
|
||
- ac/sqtt: use void pointers for start/stop CS
|
||
- ac/cmdbuf: introduce ac_cmdbuf
|
||
- radeonsi: replace radeon_cmdbuf_chunk by ac_cmdbuf
|
||
- radv: replace radeon_cmdbuf by ac_cmdbuf completely
|
||
- radv,radeonsi: use new ac_cmdbuf macros
|
||
- radv: do not initialize HiZ on transfer queue on RDNA4
|
||
- radv: use force_indirect_desc_sets when creating RT prologs
|
||
- radv: rename indirect_descriptor_sets to indirect_descriptors
|
||
- radv: rename shader arg descriptor_sets to descriptors
|
||
- radv: make radv_descriptor_get_va() a static function
|
||
- radv: rename radv_mark_descriptor_sets_dirty()
|
||
- ac/surface: fix host image copies with 96-bits formats
|
||
- ac/surface: fix host image copies with stencil-only
|
||
- radv: allow VK_FORMAT_S8_UINT with host image copy
|
||
- vulkan/runtime: fix memleak when creating ETC pipelines
|
||
- radv/rt: fix memory leak in lower_rt_instructions_monolithic()
|
||
- radv: fix shaders memleak when importing pipeline binaries with GPL
|
||
- radv/meta: pass image formats to radv_meta_resolve_{hardware,fragment}_image()
|
||
- radv/meta: re-use radv_meta_resolve_{fragment,hardware}_image() for subpass resolves
|
||
- radv/meta: pass iview formats for subpass resolves
|
||
- radv/meta: remove radv_cmd_buffer_resolve_rendering_{hw,cs,fs}
|
||
- radv: enable the global BO list by default
|
||
- radv: only return identicalMemoryLayout for linear images
|
||
- radv: always return optimalDeviceAccess=TRUE for block-compressed formats
|
||
- radv: declare a new user SGPR for dynamic descriptors
|
||
- radv: upload and emit dynamic descriptors separately from push constants
|
||
- radv: allow to inline all push constants even with dynamic descriptors
|
||
- radv: use COPY_DATA_DST_MEM when writing timestamps
|
||
- amd,radv: add ac_emit_cond_exec()
|
||
- amd,radv: add ac_emit_write_data_imm()
|
||
- amd,radv,radeonsi: add ac_emit_cp_wait_mem()
|
||
- amd,radv,radeonsi: add ac_emit_cp_acquire_mem_pws()
|
||
- amd,radv,radeonsi: add ac_emit_cp_release_mem_pws()
|
||
- radv: use ac_emit_cp_{acquire,release}_mem_pws() when syncing GE rings
|
||
- amd,radv,radeonsi: add ac_emit_cp_copy_data()
|
||
- amd,radv,radeonsi: add ac_emit_cp_pfp_sync_me()
|
||
- ci: uprev VKCTS main to db48c34bebaf3359453e44ab151a2ff9f9c58eb2
|
||
- radv/ci: bump timeout for radv-gfx1201-vkcts to 5 minutes more
|
||
- radv: dirty dynamic descriptors when required
|
||
- radv: ignore dual-source blending when blending isn't enabled for MRT0
|
||
- radv: add a workaround for illegal depth/stencil descriptors with No Man's Sky
|
||
- aco: fix reserving VGPRs for 64-bit attributes in VS prologs
|
||
- radv,aco: wait for all VMEM loads when the prolog loads large 64-bit attributes
|
||
- radv: add vk_wsi_disable_unordered_submits and enable for GTK
|
||
|
||
Serdar Kocdemir (2):
|
||
|
||
- gfxstream: fix warnings about unused parameters
|
||
- gfxstream: Enable VK_MVK_macos_surface for host dispatch
|
||
|
||
Sergi Blanch Torne (19):
|
||
|
||
- ci: fix gc2000 fails duplication
|
||
- ci,crnm: migrate colorama to rich
|
||
- Revert "ci: Temporarily hardcode S3 artifact path"
|
||
- Revert "ci: Fix for GitLab 18.2.2 upgrade"
|
||
- ci: disable Collabora's farm due to maintenance
|
||
- ci: fix requirements file
|
||
- Revert "ci: disable Collabora's farm due to maintenance"
|
||
- ci,marge_queue: encapsulate monitor loop
|
||
- ci,marge_queue: enhance script interruption
|
||
- ci,marge_queue: objects to represent the queue
|
||
- ci,marge_queue: refactor the get queue method
|
||
- ci,marge_queue: protect form transient errors
|
||
- ci,marge_queue: encapsulate GitLab module queries
|
||
- ci,marge_queue: queue element formatting
|
||
- docs,marge_queue: document the tool usage
|
||
- ci,marge_queue: handle GitLab auth exception
|
||
- ci,marge_queue: use rich module
|
||
- ci,marge_queue: introduce testing
|
||
- ci: Add missing aiohttp Python dependecy
|
||
|
||
Sergi Blanch-Torne (3):
|
||
|
||
- ci: disable Collabora's farm due to maintenance
|
||
- Revert "ci: disable Collabora's farm due to maintenance"
|
||
- ci: disable Collabora's farm due to maintenance
|
||
|
||
Sergii Ushakov (1):
|
||
|
||
- android: moving HMI symbol to separate file
|
||
|
||
Sergio Lopez (1):
|
||
|
||
- hk: fix instance reference in vk_free
|
||
|
||
Seán de Búrca (14):
|
||
|
||
- rusticl: move debug logging to the end of the build step
|
||
- rusticl: disentangle \`ProgramBuild` state from kernel compilation
|
||
- rusticl: clarify naming of program-related structs and fields
|
||
- rusticl: release borrow on device build before linking
|
||
- rusticl: consolidate linking code
|
||
- rusticl: add abstraction for \`util_queue`
|
||
- rusticl: introduce intermediate header object
|
||
- rusticl: restructure program build to prepare for parallelization
|
||
- rusticl: execute program builds as jobs on a worker thread
|
||
- rusticl: adjust naming and assert usage for clarity
|
||
- rusticl/kernel: delay calculation of CSO info until kernel creation
|
||
- nak: remove boxing of instructions
|
||
- rusticl/kernel: add Kernel::mut_ref_from_raw()
|
||
- rusticl/kernel: remove mutexes from kernel structure
|
||
|
||
Sid Pranjale (1):
|
||
|
||
- docs: mark VK_KHR_depth_clamp_zero_one as done for NVK
|
||
|
||
Sil Vilerino (16):
|
||
|
||
- mediafoundation: Fix recon pic two pass VPBlit target
|
||
- mediafoundation: Do GPU-GPU encoder sync for two-pass input vpblit
|
||
- d3d12: Fix two pass flag setting and rate control dirty flag check
|
||
- d3d12: Fix double video encode resource barrier for DPB/recon pic resources
|
||
- d3d12: Implement d3d12_context_queue_priority_manager
|
||
- mediafoundation: Implement d3d12_context_queue_priority_manager and related ICodecAPI
|
||
- mediafoundation: Check driver caps for intra-refresh CodecAPI advertisement
|
||
- d3d12: Check slice support for PIPE_VIDEO_CAP_ENC_INTRA_REFRESH support
|
||
- d3d12: Fix leak d3d12_context::priority_manager_lock
|
||
- mediafoundation: Fix leak mft_context_queue_priority_manager::m_lock
|
||
- ci: Bump DirectX-Headers and Agility SDK dependencies to 1.618.1
|
||
- pipe: Add video encode spatial adaptive quantization interface
|
||
- d3d12: Implement video encode spatial adaptive quantization interface
|
||
- d3d12: Remove Agility v717 guards for features now available in v618
|
||
- mediafoundation: Remove Agility v717 guards for features now available in v618
|
||
- mediafoundation: Implement video encode spatial adaptive quantization interface
|
||
|
||
Silvio Vilerino (8):
|
||
|
||
- d3d12: Fix typo in cast when reading pipe_h265_enc_picture_desc::gpu_stats_psnr
|
||
- mediafoundation: Use lower size estimations for compressed output bitstream sizes
|
||
- d3d12: Use lower size estimations for compressed output bitstream sizes
|
||
- d3d12: Allow frontends to set_video_encoder_max_async_queue_depth() to manage encoder memory overhead
|
||
- d3d12: Fix video encoder async depth fence wait off by one bug
|
||
- mediafoundation: Use d3d12 extension set_video_encoder_max_async_queue_depth to save memory in low latency (no async/in flight frames)
|
||
- d3d12: Video encode - Check driver caps to determine which output stats are supported
|
||
- mediafoundation: mftransform async slices parsing, avoid heap allocation inside loop
|
||
|
||
Simon McVittie (2):
|
||
|
||
- vulkan: Consistently form driver library names as prefix + name + suffix
|
||
- vulkan: Compute path to write into JSON manifests once, use it everywhere
|
||
|
||
Simon Perretta (251):
|
||
|
||
- wsi/display: make HDR_OUTPUT_METADATA, Colorspace properties optional
|
||
- nir/nir_lower_calls_to_builtins: trivially handle IA64 mangled functions
|
||
- pvr: start moving over to using the vulkan runtime vertex input state
|
||
- pco: handle replicated components when translating nir alu srcs
|
||
- pvr: default varyings interpolation to smooth when not set
|
||
- pco: amend index register mapping
|
||
- pco: enable all expected types for vertex i/o
|
||
- pvr: amend incorrect format assertions
|
||
- pvr: support getting device info from public name
|
||
- pco: pygen: support passing custom refs to enc_ops
|
||
- pco, pygen: support more comparison ops and types
|
||
- pco: support shift ops
|
||
- pco, pygen: support integer add/mul/mad ops
|
||
- pco, pygen: support gradient/derivative ops
|
||
- pco: commonize and improve iteration helpers
|
||
- pco: support re-indexing loops and ifs
|
||
- pco: amend cf printing indentation
|
||
- pco: pygen: amend op mod print strings
|
||
- pco: fix idx reg print colors and sq brackets
|
||
- pco: control-flow epilogue/interlogue/prologue boilerplate
|
||
- pco: switch to glsl/list, add control flow boilerplate
|
||
- pco: skip over empty blocks when iterating instructions
|
||
- pco, pygen: differentiate between int and float ref mods
|
||
- pco: add virtual register support
|
||
- pco: primitive bool support
|
||
- pco: pygen: propagate selected source for ops with multiple source selections
|
||
- pco: pygen: support applying modifiers to OpRefs
|
||
- pco: pygen: add control-flow and branch ops
|
||
- pvr, pco: initial ssbo and atomics support
|
||
- pco, pygen: support test predicate setting
|
||
- pco: initial control-flow support
|
||
- pco, pygen: expose enhanced logical ops with optional mask
|
||
- pco: add support for various selection, complex, trig ops
|
||
- pco: add support for more bitwise and bitfield ops
|
||
- pvr, pco: add base compute support
|
||
- pco: experimental regalloc changes
|
||
- pvr: pack image/texture array size unconditionally
|
||
- pvr: preliminary support for combined image samplers
|
||
- pco: add uadd64_32 op
|
||
- pco: add basic pass to shrink vecs with unused components
|
||
- pco: initial texture/sampler compiler support
|
||
- pvr: initial texture/sampler driver support
|
||
- pco: add support for using index(ed) registers
|
||
- pco, pvr: push constants support
|
||
- pco: basic arrayed image/sampler descriptor support
|
||
- pvr: storage image descriptor support
|
||
- pco: add boilerplate code for legalizing pseudo-ops
|
||
- pco: add helpers for phase iteration, print more igrp offset info
|
||
- pvr, pco: add support for buffer size intrinsic
|
||
- pco: rework nir processing and passes
|
||
- pvr, pco: usc program (pre-)generation boilerplate
|
||
- pco: add support for loops and ifs using predicated execution
|
||
- pco: update virtual register support for bools and nir reg translation
|
||
- pco: support integer abs/neg
|
||
- pvr: temporarily tweak support required for query programs
|
||
- pco, pygen: add mutex op
|
||
- pco: add intrinsic for loading instance num in slot
|
||
- pvr, pco: improve indexed reg support, add shared memory support
|
||
- pvr, pco: temporarily add supporting code for VK_KHR_zero_initialize_workgroup_memory
|
||
- pco: add initial support for shared atomics
|
||
- pco: experimentally propagate olchk mod for fwd prop opt
|
||
- pco: temporarily prevent shared mem (coeffs) and vregs from being copy proped
|
||
- pco: basic support for undefs
|
||
- pvr, pco: initial support for blend constants
|
||
- pco: suppress uses_sample_shading changes from nir_lower_blend
|
||
- pvr: enable logicOp feature
|
||
- pvr, pco: point sampler support
|
||
- pco: initial image support
|
||
- pvr, pco: per frag/vertex input/output rework
|
||
- pco: skip lowering fs outputs that aren't present
|
||
- pco: add support for sscaled8* formats
|
||
- pvr: add descriptor copy support
|
||
- pco: lower {insert,extract}_[ui]{8,16} to bitfield ops
|
||
- pvr, pco: temporarily add legacy tq shader gen code
|
||
- pco: initial image write support
|
||
- pvr: initial texel buffer support
|
||
- pvr, pco: basic depth feedback/discard/terminate support
|
||
- pvr, pco: add input attachment sampler and initial support
|
||
- pvr: use mrt_resource output size for fs outputs and input attachments
|
||
- pvr: skip setting up unused fragment shader outputs
|
||
- pvr, pco: temporarily add legacy loadop shader gen code
|
||
- pvr: check for unused attachments
|
||
- pco, pvr: account for early frag testing
|
||
- pvr: sampler and sampled image descriptor support
|
||
- pco, pvr: sample mask out support
|
||
- pco: support combined depth/discard isp feedback
|
||
- pvr, pco: initial texture gather support with gather sampler
|
||
- pco: fully switch over to common smp emission code
|
||
- pco: basic image array support
|
||
- pco: branching fence support, simple ditr insertion logic
|
||
- pvr, pco: simple end-of-tile/render nir shader gen
|
||
- pvr, pco: switch to new nop shader
|
||
- pvr: drop legacy rogue compiler
|
||
- pco: support dce for vregs
|
||
- pco: further commonize iteration instruction emission
|
||
- pco: support indirect function temp refs
|
||
- pvr: initial sample rate shading support
|
||
- pco: add pass to split shader in/out struct/array vars across more slots
|
||
- pco: enable shrink vec opt
|
||
- pco: support shader i/o arrays of structs
|
||
- pco: temporarily treat already overridden refs as comps during regalloc
|
||
- pvr: remove vertex position output assertion
|
||
- pco: force image/texture array coordinate f2i32 conversions to be rtne
|
||
- pco: add pass to expand out vecs only used by comps
|
||
- pvr, pco: add support for gl_FrontFacing
|
||
- pvr: dynamically handle shademodel for flat shaded varyings
|
||
- pvr, pco: z-replicate support
|
||
- pvr, pco: image size query support
|
||
- pvr, pco: improved image write (with format) support, handle 111110
|
||
- pco: support render target/layer id intrinsic
|
||
- pco: add render target awareness to input attachments
|
||
- pco: temporarily make vecs interfere with their components during regalloc
|
||
- pco: restrict regalloc debug printing
|
||
- pco: add helpers for finding non-empty blocks, apply
|
||
- pco: skip comp-only opt on collated vecs
|
||
- pvr, pco: clip/cull distance support
|
||
- pco: temporarily prevent vectorization of vertex outputs
|
||
- pvr, pco: add support for robust buffer access
|
||
- pvr: texture swizzle depth/stencil fix
|
||
- pco: experimentally pre-propagate vectors during regalloc
|
||
- pco: remap buffer samplers to be 2d
|
||
- pco: basic image/texture cube support
|
||
- pco: add remaining texture buffer support
|
||
- pvr, pco: dynamic buffer and immutable sampler support
|
||
- pco: handle vector ra via parallel copy
|
||
- pvr: temporarily dword align \*all* descriptors
|
||
- pco: temporarily aggressively prevent isp feedback reordering by opt passes
|
||
- pvr, pco: fragment shader metadata boilerplate code
|
||
- pvr, pco: additional multisample support
|
||
- pvr, pco: tile buffer support
|
||
- pco: experimentally transfer olchk to ops with refs requiring it
|
||
- pvr, pco: add dummy stores for tilebuffer-only loadops
|
||
- pvr: dynamic depth bias support
|
||
- pco: remove modifiers from instructions with variable src/dests
|
||
- pvr, pco: alpha to coverage support
|
||
- pco: full shared atomics support
|
||
- pco: improve image write using pck.prog
|
||
- pvr: fix multi-type varying allocations
|
||
- pco: fix split-type vertex attrib allocations/nir vars
|
||
- pco: lower vertex attrib vars first
|
||
- pco: add lower_io_array_vars_to_elements_no_indirects to preprocessing
|
||
- pco: legalize between movs1/mbyp without emitting additional ops
|
||
- pco: temporarily switch to basic lowering for [iu]mulextended
|
||
- pco: add ops needed to support fquantize2f16
|
||
- pco: support accessing shareds/coeffs >= 256
|
||
- pco: lower nir phi undefs to zero
|
||
- pco: handle offset calculation for empty blocks
|
||
- pco: support break/continue in loop body/outside if/else
|
||
- pvr: handle num workgroups in indirect compute
|
||
- pco: uncoalesce vecs that can't be propagated
|
||
- pvr, pco: handle stencil input attachments
|
||
- pvr, pco: full support for tile buffer eot handling
|
||
- pco: temporarily don't propagate pixout accesses in opt
|
||
- nir, asahi: commonize interleave_agx
|
||
- pco: image atomics support
|
||
- pco: scalarize push constant accesses
|
||
- pco: add write memory check before processing nir
|
||
- pco: add early nir opt pass
|
||
- pvr: select SPM EOT state words from render index
|
||
- pco: rematerialize load consts to reduce register pressure
|
||
- pco: amend early frag test/depthf logic for isp feedback
|
||
- pco: support skipping overlap check emission, enable for eot shader
|
||
- pvr: fix valgrind warnings for 64-bit unaligned access
|
||
- pco: ensure srcs/dests interfere for instructions with repeat > 1
|
||
- pvr: spilling enablement - allow empty uploads
|
||
- pco: spilling enablement - track barrier usage
|
||
- pvr, pco: experimental temp spilling
|
||
- pco: temporary spilling workarounds
|
||
- pvr, pco: temporary initial scratch memory support
|
||
- pvr, pco: implement VK_EXT_image_2d_view_of_3d
|
||
- pvr, pco: add VK_EXT_image_2d_view_of_3d sampled image support
|
||
- pvr: add support for VK_EXT_provoking_vertex
|
||
- pvr, pco: implement VK_EXT_depth_clamp_zero_one
|
||
- pvr, pco: implement alphaToOne feature
|
||
- pvr, pco: implement VK_EXT_color_write_enable
|
||
- pvr, pco: basic write without format support
|
||
- pco: support 1010102 snorm, [us]scaled formats
|
||
- pco: replace {un,}packing alu ops with intrinsics
|
||
- pvr: add a2b10g10r10 formats
|
||
- pvr: enable VK_EXT_extended_dynamic_state
|
||
- pco: handle remaining loadop depth formats
|
||
- pvr: width-based tq depth format selection
|
||
- pco: lower nir_b2b* ops
|
||
- pco: use nir_cf_{extract,reinsert} instead of inlining compute instance check
|
||
- pco: fix missing csbgen dependency
|
||
- pvr: fix missing types in x86 builds
|
||
- pco/opt: disable back-propagation of indexed registers
|
||
- pco/ra: properly handle non-dced instrs with unused defs
|
||
- vulkan: setup max_subgroup_size for drivers without varying/max/min size support
|
||
- nir: print loop unroll info if present
|
||
- pco: store additional metadata for precompiled shaders
|
||
- pvr, pco: enable pre-generated header string functions to work with clc
|
||
- pvr/csbgen: use stdint macro for unsigned 64-bit constants
|
||
- pco/usclib: switch to common defs
|
||
- pco: move uses_usclib flag into shader data
|
||
- pvr, pco: switch to clc state update shader
|
||
- pvr, pco: switch to clc nop shader
|
||
- pco/usclib: add some preprocessor helper macros
|
||
- pvr, pco: switch to clc vertex passthrough shaders
|
||
- pvr, pco: switch to clc query shaders
|
||
- pvr, pco: switch to usc generated clear attachment shaders
|
||
- pvr, pco: switch to usc generated zero-init workgroup memory shaders
|
||
- pvr: switch to usc generated spm load shaders
|
||
- pco/usclib: disable predicate control-flow in generated shaders
|
||
- pvr, pco: switch to clc load/store sr and idfwdf shaders
|
||
- pco: switch to using csbgen and clc helpers for tex/smp state {un,}packing
|
||
- pvr: merge legacy uscgen code into pvr_usc
|
||
- pvr/wsi: don't advertise supports_modifiers
|
||
- docs/pvr: drop GX6250 from the active development hardware list
|
||
- vulkan/runtime: only set shader subgroup info if non-zero
|
||
- pco: add usclib build dependency on generated files
|
||
- mesa/st, nir: commonize unlower_io_to_vars pass
|
||
- pvr, pco: implement prerequisites for sampleRateShading
|
||
- pco: use interpolated input intrinsics for shader io
|
||
- pco: use nir_unlower_io_to_vars
|
||
- pvr, pco: track and implement workaround for brn74056
|
||
- pvr: add debug for missing sysvals
|
||
- pvr: enable sampleRateShading feature
|
||
- pvr, pco: allow fs sample rate to be dynamically set
|
||
- pco: discard invalid instances depending on the sample & valid masks
|
||
- pvr: enable independentBlend feature
|
||
- pvr: enable VK_FORMAT_D32_SFLOAT_S8_UINT
|
||
- pvr, pco: add multiview compiler support, advertise extension
|
||
- pco: treat all load_consts as 32-bit
|
||
- pvr, pco: support imageCubeArray feature
|
||
- pco: fully support Vulkan 1.2 image atomics
|
||
- pvr, pco: add minimal support required for Vulkan 1.2 subgroups
|
||
- pco: set lower_device_index_to_zero
|
||
- pvr: add support for VK_KHR_shader_draw_parameters, drawIndirectFirstInstance
|
||
- pvr, pco: add remaining support for eds2 & 3
|
||
- nir/lower_alpha: extend to support dynamic a2c
|
||
- pvr, pco: add primitive support for VK_KHR_robustness2.nullDescriptor
|
||
- pvr, pco: add primitive support for terminate,demote_to_helper}_invocation
|
||
- nir/unlower_io_to_vars: keep io bases intact when keeping intrinsics
|
||
- pco: apply rounding mode to relevant conversion ops
|
||
- pco: tidy and commonize conversion ops
|
||
- pco: improve early and late algebraic pass ordering
|
||
- pvr: amend tile buffer size calculation for eot
|
||
- pvr: amend num temps calculation when wg_size is not provided
|
||
- pco: ensure a variable exists for the multiview index
|
||
- docs/pvr: update hardware list
|
||
- pvr: advertise VK_KHR_sampler_mirror_clamp_to_edge
|
||
- pvr: advertise VK_KHR_shader_non_semantic_info
|
||
- pvr: advertise VK_KHR_shader_relaxed_extended_instruction
|
||
- pvr: advertise VK_EXT_shader_replicated_composites
|
||
- pvr: advertise VK_KHR_device_group_creation
|
||
- pvr: support VK_KHR_map_memory2
|
||
- pvr: support VK_EXT_map_memory_placed
|
||
- pvr: support VK_EXT_map_memory_placed.memoryUnmapReserve
|
||
- pco: add support for global memory
|
||
- pco/ra: abort if spilling fails
|
||
|
||
SoroushIMG (5):
|
||
|
||
- pvr: fix transfer fast clear color for srgb formats
|
||
- pvr: remove unnecessary asserts
|
||
- pvr: fix color values and crash for soft bg load ops
|
||
- pvr: add more helper format function for tq pbe formats
|
||
- pvr: set nn coords in sampler state for tq shaders when needed
|
||
|
||
Surafel Assefa (1):
|
||
|
||
- wsi: Implements scaling controls for DRI3 presentation.
|
||
|
||
Sushma Venkatesh Reddy (6):
|
||
|
||
- intel/compiler: apply sqrt workaround for Horizon Forbidden West shader
|
||
- intel/compiler: generalize workaround script name for broader applicability
|
||
- intel/compiler: Initial bits for SRND instruction
|
||
- brw: Add assembler support for SRND
|
||
- intel/compiler: Validation for SRND instructions
|
||
- intel/executor: Add examples for srnd
|
||
|
||
Sviatoslav Peleshko (3):
|
||
|
||
- anv: Always disable Color Blending for unused Render Targets
|
||
- mesa,driconf: Add WA to initialize vertex program outputs to vec4(0,0,0,1)
|
||
- driconf: Add vertex_program_default_out option for Penumbra: Overture
|
||
|
||
Tapani Pälli (17):
|
||
|
||
- isl/blorp: handle failing 96bpp linear blit case
|
||
- compiler/types: handle BFLOAT16 when decoding blob
|
||
- iris: remove stage_from_pipe and pipe_from_stage helpers
|
||
- intel/genxml: update CACHE_MODE_0 register for gfx200
|
||
- intel/dev: provide a helper to detect bmg g31 device
|
||
- iris/anv: toggle on CACHE_MODE_0::MsaaFastClearEnabled on BMG G31
|
||
- anv: change some image qualifiers as coherent for Last Of Us
|
||
- egl: allocate device info lazily only when queried
|
||
- anv: remove assert, group can have 0 shaders in it
|
||
- iris: setup bits for ARB_texture_filter_minmax with gfx9+
|
||
- blorp: add missing pipecontrol after 3DSTATE_WM_HZ_OP for Xe2+
|
||
- intel/blorp: add restriction for gfx12
|
||
- iris: add a check if blorp can support blitter copy
|
||
- anv: add cs stall for any pipe control on compute
|
||
- anv/blorp: add missing cs stall on compute pipe control
|
||
- anv: bring back some lost game drirc workarounds for subgroups
|
||
- anv: fix issues found with indirect data stride
|
||
|
||
Taras Pisetskyi (1):
|
||
|
||
- drirc/anv: force_vk_vendor=-1 for Wuthering Waves
|
||
|
||
TellowKrinkle (2):
|
||
|
||
- hk: Enable caching on memory marked with HOST_CACHED_BIT
|
||
- hk: Add non-cached memory type
|
||
|
||
Thibault Payet (1):
|
||
|
||
- venus: Use SYS_thr_self on FreeBSD instead of SYS_gettid
|
||
|
||
Thomas H.P. Andersen (4):
|
||
|
||
- anti-lag: pass a proper dataSize
|
||
- zink: do not overwrite existing error for miptail on uncommit
|
||
- nvk: implement VK_AMD_buffer_marker
|
||
- nvk: allow host image copy on non host visible heaps
|
||
|
||
Tim Van Patten (2):
|
||
|
||
- intel/ds: Skip expensive timestamp query until necessary
|
||
- intel: Convert getenv() to os_get_option()
|
||
|
||
Timothy Arceri (33):
|
||
|
||
- util: add workaround for Interstellar Rift
|
||
- glsl: move mark_array_elements_referenced() with ubo code
|
||
- glsl: add mark_array_elements_referenced() fast path
|
||
- glsl: rename setup_uniform_remap_tables()
|
||
- util: remove recursion from bitset helpers
|
||
- st/glsl: encapsulate more in st_nir_state_variable_create()
|
||
- st/glsl: fix packed uniform handling in st_nir_lower_fog()
|
||
- st/glsl: fix nir_lower_position_invariant()
|
||
- nir: move nir_lower_drawpixels() to the state tracker
|
||
- st/glsl: set driver locations in nir_lower_drawpixels()
|
||
- nir: move nir_lower_alpha_test() to the st
|
||
- st/glsl: set driver location in nir_lower_alpha_test()
|
||
- nir: move nir_lower_point_size_mov() to st
|
||
- st/glsl: set driver location in nir_lower_point_size_mov()
|
||
- st/glsl: set driver loc after lowering clipplane
|
||
- st/glsl_to_nir: dont add duplicate state tokens
|
||
- util: add range remap util
|
||
- glsl: make use of u_range_remap for uniform remapping
|
||
- glsl: remove now unused NumUniformRemapTable
|
||
- nir: fix uniform cloning helper again
|
||
- util: add shortcut for range remap inserts
|
||
- util: rewrite remap util to avoid looping list
|
||
- Revert "ci/freedreno: Skip overly-slow trace"
|
||
- Reapply "ci/freedreno: Skip overly-slow trace"
|
||
- util/range_remap: dont overwrite entry if ptr is NULL
|
||
- glsl/util: update util_range_remap to use range_remap struct
|
||
- util/range_remap: split list node from range entry
|
||
- util/range_remap: use child memory context for list
|
||
- util/range_remap: add util_range_switch_to_sorted_array() helper
|
||
- util/range_remap: switch to using sorted array
|
||
- Revert "Reapply "ci/freedreno: Skip overly-slow trace""
|
||
- mesa: skip redundant uniform update optimisation if unsafe
|
||
- glsl: assign block indices in the order they appear
|
||
|
||
Timur Kristóf (41):
|
||
|
||
- radv/amdgpu: Fix crash with RADV_DEBUG=noibs
|
||
- radv/amdgpu: Use correct NOP packets when unchaining a CS
|
||
- radv/amdgpu: Don't use IB2 on GFX6 (for now)
|
||
- radv: Don't set SWITCH_ON_EOI without tessellation
|
||
- radv: Don't use EVENT_WRITE_EOS on GFX7
|
||
- radv: Clean up use of RELEASE_MEM on GFX7 MEC
|
||
- radv: Don't use V_370_PFP or V_028A90_PS_DONE on compute queues
|
||
- radeonsi: Flush L2 for render condition when CP can't use L2
|
||
- radeonsi: Fix some comments to also include GFX11.5
|
||
- radv: Add comment to document CP DMA prefetch
|
||
- radv: Flush L2 before CP DMA copy/fill when CP DMA doesn't use L2
|
||
- docs: Add more details about the contribution process
|
||
- spirv: Always mark FS layer and viewport index inpus as flat
|
||
- ac/nir/ngg: Remove dead code for 64-bit mesh shader variables
|
||
- ac/nir/ngg: Fix scalarized mesh primitive indices
|
||
- radv/amdgpu: Rename use_ib to chain_ib
|
||
- radv: Rename RADV_DEBUG=noibs to noibchaining
|
||
- radv/amdgpu: Don't assert chaining match when copying secondary IB
|
||
- radv/amdgpu: Add a helper function to emit NOP packets
|
||
- radv/amdgpu: Emit a single 4 dword NOP in chainable CS buffers
|
||
- radv/amdgpu: Small cleanup of counting submitted IBs
|
||
- ac/gpu_info: Add can_chain_ib2 field to ac_gpu_info
|
||
- radv/amdgpu: Support IB2 without chaining, enable on GFX6
|
||
- radv/amdgpu: Allow IB2 when primary CS isn't chained
|
||
- radv: Pass correct queue family to radv_cs_emit_write_event_eop
|
||
- radv: Pass correct queue family in radv_emit_cache_flush
|
||
- radv: Call transfer copy functions from API functions, not helpers
|
||
- radv: Clarify image and image/buffer copy helper functions
|
||
- radv: Add amd_ip_type to radv_cmd_stream
|
||
- radv: Remove qf argument from radv_cs_emit_write_event_eop
|
||
- radv: Remove qf argument from radv_cp_wait_mem
|
||
- radv: Remove qf argument from radv_cs_emit_cache_flush
|
||
- radv: Remove qf argument from radv_cs_write_data (and _head)
|
||
- radv: Remove unneeded forward declaration of qf from dgc header
|
||
- radv: Remove qf from radv_spm/sqtt/perfcounter where applicable
|
||
- radeonsi: Don't use compute queue with regalloc hang bug
|
||
- radv: Disable compute queues when the regalloc bug is present
|
||
- radv: Mitigate GPU hang on Hawaii in Dota 2 and RotTR
|
||
- radv: Document SWITCH_ON_EOP and WD_SWITCH_ON_EOP
|
||
- ac/nir/ngg_mesh: Lower num_subgroups to constant
|
||
- ac/nir/ngg: Fix scratch space for NGG GS streamout
|
||
|
||
Tomeu Vizoso (29):
|
||
|
||
- teflon: Reformat with clang-format
|
||
- pipe-loader: Implement loading of /dev/accel devices
|
||
- teflon/tests: Increase tolerance
|
||
- teflon: Query drivers on what operations they support
|
||
- etnaviv/ml: Implement ml_operation_supported() callback
|
||
- rocket: Initial commit of a driver for Rockchip's NPU
|
||
- pipe-loader: Load the rocket accel driver
|
||
- teflon: Link to the rocket driver
|
||
- teflon: Add support for Reshape operations
|
||
- etnaviv/ml: Add support for no-op Reshape operations
|
||
- teflon: Add support for non-fused Relu operations
|
||
- etnaviv/ml: Add support for non-fused ReLU
|
||
- teflon: Add support for Absolute
|
||
- etnaviv/ml: Add support for Absolute
|
||
- teflon: Add support for Logistic
|
||
- etnaviv/ml: Add support for Logistic
|
||
- teflon: Add support for Subtract
|
||
- etnaviv/ml: Add support for Subtract
|
||
- teflon: Add support for Transpose
|
||
- etnaviv/ml: Support Transpose operation
|
||
- etnaviv/ml: Remove some skips that pass now
|
||
- teflon/tests: Remove dependency on xtensor
|
||
- teflon/tests: Replace YOLOX model with that from TI
|
||
- teflon: Add support for the MaxPool operation
|
||
- teflon: Add support for the StridedSlice operation
|
||
- teflon: Add support for the ResizeNearestNeighbor operation
|
||
- ethos: Initial commit of a driver for the Arm Ethos-U65 NPU.
|
||
- pipe-loader: Load the ethos accel driver
|
||
- teflon: Link to the ethos driver
|
||
|
||
Torge Matthies (2):
|
||
|
||
- wsi/display: Factor drmModeObjectProperties retrieval out of find_properties.
|
||
- wsi/display: Fix vkGetRandROutputDisplayEXT when connector is not leased yet.
|
||
|
||
Trigger Huang (2):
|
||
|
||
- virtio/vdrm: add ENABLE_DRM_AMDGPU for c_args
|
||
- radeonsi: Fix u_log_ctx for aux_context recreation
|
||
|
||
Utku Iseri (1):
|
||
|
||
- panvk: override can_present_on_device
|
||
|
||
Val Packett (1):
|
||
|
||
- radv: detect platform:virtio-mmio devices for virtgpu native context
|
||
|
||
Valentine Burley (101):
|
||
|
||
- ci/lava: Use UART for non-Chromebooks
|
||
- freedreno/ci: Increase concurrency for a618 jobs
|
||
- turnip/ci: Increase coverage of a618-vk, reduce parallelism
|
||
- freedreno/ci: Re-enable a618-gl job
|
||
- zink/ci: Run full zink-tu-a618 job pre-merge
|
||
- freedreno,zink+tu/ci: Document Piglit bug
|
||
- ci: Disable Valve keywords farm
|
||
- ci: Always save the artifacts for performance traces
|
||
- ci/angle: Update gn arg to avoid warning message
|
||
- lavapipe/ci: Add Android Hardware Buffer test set
|
||
- freedreno/ci: Update a6xx kernel to msm-next
|
||
- freedreno/ci: Remove a630 jobs
|
||
- freedreno/ci: Streamline using common a6xx-skips
|
||
- zink/ci: Only enable VVL for deqp on RADV
|
||
- zink/ci: Fix enabling VVL for RADV jobs
|
||
- zink/ci: Enable more VVL on ANV
|
||
- radeonsi/ci: Convert Fluster job to deqp-runner suite
|
||
- radeonsi/ci: Remove Fluster flakes, document failures
|
||
- ci/lava: Only keep structured_logger in lava-trigger container
|
||
- ci/lava: Use init-stage1 from Mesa build instead of inlining it
|
||
- vulkan/wsi/wayland: Enable 4444 formats
|
||
- zink/ci: Add pre-merge EGL coverage on ANV
|
||
- zink/ci: Drop duplicate full ANV deqp-runner suites
|
||
- ci/lava: Add x86_64 ASan job templates
|
||
- ci: Build more drivers in debian-x86_64-asan
|
||
- radv/ci: Use same deqp-runner suite for all RADV jobs
|
||
- radv/ci: Add an ASan RADV job on Cezanne
|
||
- intel/ci: Fix acer-chromebox-cxi4-puff concurrency
|
||
- zink/ci: Add an ASan job on CML
|
||
- radeonsi/ci: Increase Fluster job concurrency
|
||
- ci: Drop obsolete EGL skips
|
||
- zink/ci: Use Weston's Xwayland instead of Xvfb
|
||
- softpipe/ci: Use Weston's Xwayland instead of Xvfb
|
||
- virgl/ci: Use Weston's Xwayland instead of Xvfb
|
||
- ci: Remove xvfb from test-base container
|
||
- freedreno/ci: Move a660-gl-cl job to nightly
|
||
- zink/ci: Skip flaky tests on CML due to HW deficiency
|
||
- zink/ci: Document flakes on ANV
|
||
- zink/ci: Add a prefix for X11 dEQP-EGL on ANV
|
||
- zink/ci: Document more flakes on ANV
|
||
- ci: Separate build and test container tags
|
||
- zink/ci: Run full zink-lavapipe job pre-merge
|
||
- zink/ci: Add EGL coverage on lavapipe
|
||
- zink/ci: Document recent flakes on TGL
|
||
- ci/fluster: Uprev Fluster
|
||
- ci/lava: Make Fluster vectors an optional overlay
|
||
- ci: Temporarily hardcode S3 artifact path
|
||
- anv/ci: Lower concurrency for nightly jobs
|
||
- anv/ci: Update expectations from nightly jobs
|
||
- zink/ci: Switch to quick_gl profile for nightly ANV jobs
|
||
- zink/ci: Update expectations from nightly jobs
|
||
- anv/ci: Run full anv-adl-angle job pre-merge
|
||
- anv/ci: Add a job replaying traces with ANGLE
|
||
- iris/ci: Add a new iris deqp job on Alder Lake
|
||
- zink/ci: Add EGL coverage on Turnip
|
||
- zink/ci: Document recent flakes on a618 with Turnip
|
||
- radeonsi/ci: Fix radeonsi-vangogh-glcts job definition
|
||
- freedreno/ci: Add missing caching proxy for traces
|
||
- tu: Advertise VK_EXT_shader_atomic_float
|
||
- ci/crosvm: Retry all curl errors when downloading kernel
|
||
- zink/ci: Disable zink-anv-cml-asan
|
||
- tu: Enable robustBufferAccessUpdateAfterBind
|
||
- zink/ci: Enable VVL for Turnip on a618
|
||
- zink/ci: Document recent a618 EGL flakes
|
||
- zink/ci: Add a new Minecraft restricted trace
|
||
- ci/crosvm: Add log sections for crosvm
|
||
- zink/ci: Disable ASan leak detection and re-enable zink-anv-cml-asan
|
||
- llvmpipe: Initialize src array in generate_fs_twiddle
|
||
- r300/compiler: Silence array-bounds warning
|
||
- imgui: Mark imgui dependencies as system includes
|
||
- imgui: Silence build warnings for imgui
|
||
- util: Update BLAKE3 from 1.5.1 to 1.8.2
|
||
- util: Disable Werror for BLAKE3
|
||
- meson: Relax -Wmaybe-uninitialized errors
|
||
- lavapipe/ci: Disable stack-use-after-return detection for ASan
|
||
- ci/gfxreconstruct: Bump version for compatibility with Debian 13
|
||
- ci/skqp: Add missing include to fix compilation errors on Debian 13
|
||
- ci/vkd3d: Disable Werror for vkd3d-proton
|
||
- ci/mold: Bump version for compatibility with Debian 13
|
||
- ci/lava: Update \`fire` for compatibility with Debian 13
|
||
- ci/va: Bump va-tools version for compatibility with Debian 13
|
||
- ci: Bump ci-kdl version for compatibility with Debian 13
|
||
- ci: Update to Debian 13 (trixie)
|
||
- ci/android: Use aapt from Debian packages again
|
||
- ci: Uprev ci-templates to pull in new helpers
|
||
- zink/ci: Document flakes on Cezanne
|
||
- zink/ci: Re-enable ASan leak detection and drop VVL filter on CML
|
||
- ci/lava: Use lava-job-submitter from gfx-ci repo
|
||
- ci: Remove lava-job-submitter, LAVA containers, and tests
|
||
- ci/android: Upload arm64 Mesa driver builds
|
||
- ci: Rename ANDROID_GPU_MODE to CUTTLEFISH_GPU_MODE
|
||
- ci/android: Make Vulkan driver replacement conditional
|
||
- ci: Disable broken MR check in sanity job
|
||
- ci/lava: Make fastboot commands customizable
|
||
- freedreno/ci: Update kernel to pull in updated dtb
|
||
- freedreno/ci: Update expectations for a306 and a530
|
||
- freedreno/ci: Move a306 and a530 jobs to LAVA
|
||
- freedreno/ci: Remove baremetal job templates
|
||
- docs: Update LAVA caching setup
|
||
- tu: Fix indexing with variable descriptor count
|
||
- tu: Fix maxVariableDescriptorCount with inline uniform blocks
|
||
|
||
Vasily Khoruzhick (1):
|
||
|
||
- lima: ppir: index SSA nodes the same way as we index registers
|
||
|
||
Vignesh Raman (7):
|
||
|
||
- ci/lava: default CI_JOB_TIMEOUT to 3600 if unset
|
||
- ci/lava: add main() function to fix entry point
|
||
- ci/lava: make rootfs shell prompt configurable
|
||
- ci/lava: Move lava_job_submitter tests to lava folder
|
||
- ci/lava: bump ALPINE_X86_64_LAVA_TRIGGER_TAG
|
||
- ci/init-stage1: avoid duplicate mounts
|
||
- ci/container: add comment to bump image tag
|
||
|
||
Vinson Lee (2):
|
||
|
||
- panfrost: Remove duplicate variable ret
|
||
- gfxstream: Fix build error
|
||
|
||
Vitaliy Triang3l Kuzmin (6):
|
||
|
||
- .gitignore: Add KDevelop \*.kdev4
|
||
- radv,ac: GFX10 depth/stencil HTILE mipmap bug info variable
|
||
- radv,ac: Split has_tc_compat_zrange_bug into Z and ZS, document it
|
||
- radeonsi: Disable TC-compatible HTILE when bug workarounds conflict
|
||
- radeonsi: Use radeon_info bug flags in TILE_STENCIL_DISABLE setup
|
||
- ac: Enable HTILE TC Z clear value bug workaround on GFX1013
|
||
|
||
Vlad Schiller (6):
|
||
|
||
- pvr: Enable VK_FORMAT_FEATURE_2_TRANSFER_SRC_BIT flag
|
||
- pvr: Enable VK_FORMAT_FEATURE_2_TRANSFER_DST_BIT flag
|
||
- pvr: implement dynamically set vertex buffer strides
|
||
- pvr: Enable KHR_swapchain_mutable_format
|
||
- pvr: Implement VK_KHR_imageless_framebuffer
|
||
- pvr: Implement EXT_separate_stencil_usage
|
||
|
||
Wenfeng Gao (2):
|
||
|
||
- mediafoundation: support CODECAPI_AVEncVideoSatdMapBlockSize and MFSampleExtension_VideoEncodeSatdMap for SATD map.
|
||
- mediafoundation: look into using texture pool for metadata retrieval, e.g SATD, Bitsused map, etc.
|
||
|
||
X512 (1):
|
||
|
||
- NVK: report \`VK_KHR_unified_image_layouts` extenstion support
|
||
|
||
Xaver Hugl (2):
|
||
|
||
- vulkan/wsi: require extended target volume support for scRGB
|
||
- vulkan/wsi: remove support for VK_COLOR_SPACE_EXTENDED_SRGB_NONLINEAR_EXT
|
||
|
||
Yinjie Yao (3):
|
||
|
||
- radeonsi/vcn: Enable preencode on VCN5.0
|
||
- ac,radeonsi/vcn: Use correct swizzle_mode for vcn4
|
||
- ac/parse_ib: Update vcn ib parser to include missing commands
|
||
|
||
Yiwei Zhang (152):
|
||
|
||
- doc: fix section and android instruction linking for install page
|
||
- venus/virtgpu: drop mappable if blob size is smaller than requested
|
||
- venus: drop force_unmappable hack
|
||
- venus: refactor ahb import interface to take whole alloc info
|
||
- venus/virtgpu: use size zero to request mapping the entire blob mem
|
||
- venus: requests whole blob mem size for non-dedicated import
|
||
- venus/ci: udpate expectations from venus-lavapipe-full runs
|
||
- vulkan/android: add vk_android_get_ahb_image_properties
|
||
- vulkan/android: add vk_android_get_ahb_buffer_properties
|
||
- venus: adopt vk_android_get_ahb_buffer_properties
|
||
- venus/wsi: move wsi image format info validation to vn_wsi
|
||
- venus: adopt vk_android_get_ahb_image_properties
|
||
- venus: clean up post vk_android_get_ahb_image_properties adoption
|
||
- turnip: adopt vk_android_get_ahb_image_properties
|
||
- turnip: amend AHB buffer support
|
||
- vulkan/android: make vk_ahb_probe_format private to android runtime
|
||
- v3dv: adopt vk_android_get_ahb_image_properties
|
||
- v3dv: amend AHB buffer support
|
||
- lvp: hook up AHB image and buffer properties queries
|
||
- vulkan/android: improve AHB image format check logging
|
||
- lavapipe: allow AHB export allocation
|
||
- lavapipe: implement GetMemoryAndroidHardwareBufferANDROID
|
||
- lavapipe: do not close import fd on error and amend an error code
|
||
- lavapipe: properly handle AHB release
|
||
- lavapipe: populate AHB memory mapping
|
||
- lavapipe: do not short-circuit AHB export alloc (non-import)
|
||
- lavapipe: amend missing object finish on mem alloc failure
|
||
- lavapipe: adopt common vk_device_memory
|
||
- lavapipe: do not early return for mem alloc size being zero
|
||
- lavapipe: use common vk_device_memory::ahardware_buffer
|
||
- lavapipe: drop redundant memory type index tracking
|
||
- lavapipe: use common host ptr info
|
||
- lavapipe: use common export and import info tracked
|
||
- lavapipe: use common tracked size and override if needed
|
||
- u_gralloc/mapper4: properly expose ChromaSiting types based on api level
|
||
- lavapipe: ensure to use zero memoryOffset for wsi image alias binding
|
||
- lavapipe: improve image memory binding
|
||
- lavapipe: fix a leak on a lvp_image_create exit path
|
||
- lavapipe: fix maint4 vkGetDeviceBufferMemoryRequirements
|
||
- lavapipe: fix maint4 vkGetDeviceImageMemoryRequirements
|
||
- venus: add code owners
|
||
- vulkan/android: improve memoryTypeBits reporting in AHB props query
|
||
- venus: adopt vk_common_GetAndroidHardwareBufferPropertiesANDROID
|
||
- venus: rework AHB memory import
|
||
- venus: drop cached ahb buffer memory types
|
||
- venus: drop is_wsi tracking and some asserts
|
||
- venus: set wsi alias binding memoryOffset to zero
|
||
- nvk: clean up existing nvk_android frontend
|
||
- nak: do not hide drm header on Android
|
||
- nvk: clean up direct u_gralloc dep
|
||
- Revert "android: moving HMI symbol to separate file"
|
||
- venus/android: clean up leftovers from common AHB helpers adoption
|
||
- docs/android: add docs for preparing offline compilers
|
||
- docs/android: fix meson setup for Android cross-compilation
|
||
- docs/android: update cross file and add nvk instructions
|
||
- docs/android: drop pkg-config workaround from cross-file
|
||
- util/perf: amend missing atrace_init
|
||
- venus: drop vn_trace_init
|
||
- vulkan/wsi/headless: allow explicit modifiers
|
||
- vulkan/wsi/headless: drop redundant chain struct members
|
||
- venus: fix a race condition in ring shmem reuse
|
||
- vulkan/wsi/headless: acquire the most likely idle image
|
||
- vulkan/wsi/headless: drop the wsi_create_null_image_mem override
|
||
- vulkan/wsi/headless: clean up headless wsi device and headers
|
||
- vulkan/util: add missing vulkan header
|
||
- vulkan/util: no need to hide ANB property itself behind Android
|
||
- vulkan/util: update common properties code gen to use platform guard
|
||
- venus: stop consuming wsi_memory_signal_submit_info
|
||
- venus: layer vkQueueSubmit2 over vkQueueSubmit w/o sync2
|
||
- meson/android: drop redundant libdisplay-info dep
|
||
- venus: use VK_USE_PLATFORM_ANDROID_KHR when applicable
|
||
- venus: hide swapchainMaintenance1 behind wsi guard
|
||
- venus: expose KHR_present_id(2)/wait(2) support
|
||
- hasvk: advertise present_id/wait behind ANV_USE_WSI_PLATFORM
|
||
- anv: advertise present_id/wait behind ANV_USE_WSI_PLATFORM
|
||
- nvk: advertise present_id/wait and the 2 version
|
||
- panvk: no need to set DRI_CONF_VK_KHR_PRESENT_WAIT
|
||
- turnip: advertise present_id/wait behind TU_USE_WSI_PLATFORM
|
||
- radv: advertise present_id/wait behind RADV_USE_WSI_PLATFORM
|
||
- hk: no need to set DRI_CONF_VK_KHR_PRESENT_WAIT
|
||
- vulkan/wsi: drop obsolete wsi_common_vk_instance_supports_present_wait
|
||
- driconf: drop obsolete DRI_CONF_VK_KHR_PRESENT_WAIT
|
||
- venus: misc sync2 emulation fixes
|
||
- panvk: stub out Android ANB and AHB image handling
|
||
- panvk: resolve ANB (pre spec v8)
|
||
- panvk: implement deferred image creation
|
||
- panvk: ensure wsi memory is bound at offset 0
|
||
- panvk: add panvk_android_get_wsi_memory for AHB spec v8+
|
||
- panvk: add shared image support and advertise VK_ANDROID_native_buffer
|
||
- panvk: implement AHB image deferred init and memory alloc
|
||
- panvk: support VK_ANDROID_external_memory_android_hardware_buffer
|
||
- vulkan/android: amend a missing case for IMPLEMENTATION_DEFINED AHB
|
||
- anv: drop obsolete anv_create_ahw_memory
|
||
- anv: avoid setting image format twice for AHB image
|
||
- anv: adopt vk_android_get_ahb_image_properties
|
||
- anv: drop anv_ahb_format_for_vk_format
|
||
- anv: adopt common GetAndroidHardwareBufferPropertiesANDROID
|
||
- vulkan/android: support AHARDWAREBUFFER_FORMAT_YCbCr_P010 format mapping
|
||
- vulkan/android: refactor to retrieve AHB format properties once
|
||
- vulkan/android: support AHB query for VK_ANDROID_external_format_resolve
|
||
- panvk: drop an obsolete assert of explicit mod plane count
|
||
- docs/android: default to use -Dandroid-libbacktrace=disabled
|
||
- meson/android: amend the condition for libbacktrace
|
||
- nvk: refactor nvk_CreateImage error path
|
||
- vulkan/android: add an early return when there's no wait semaphores
|
||
- vulkan/android: switch to vkQueueSubmit2
|
||
- vulkan/runtime: silence a -Wsometimes-uninitialized warning
|
||
- vulkan/android: skip queue submit with copy_sync_payloads
|
||
- vulkan/android: improve stage masks for semaphore ops
|
||
- mailmap: add Yiwei Zhang
|
||
- v3dv: use stack image for v3dv_GetDeviceImageSubresourceLayout
|
||
- vulkan: handle wsi private data properly
|
||
- anv: fix broken utrace
|
||
- radv: bind aliased wsi image at memory offset zero
|
||
- nvk: bind aliased wsi image at memory offset zero
|
||
- tu: drop redundant Android headers
|
||
- tu: simplify AHB image view format resolving for external format
|
||
- vulkan/util: drop unused vk_select_android_external_format
|
||
- tu: bind aliased wsi image at memory offset zero
|
||
- tu: properly implement VkBindMemoryStatus from maint6
|
||
- panvk: fix broken clock sync after using CLOCK_MONOTONIC_RAW
|
||
- intel/ds: VulkanApiEvent doesn't rely on interning data
|
||
- intel/ds: simplify clock sync emit
|
||
- intel/ds: minor code clean up
|
||
- intel/ds: update GPU clock to be sequence-scoped when applicable
|
||
- panvk: fix blackhole bo error path to use MODE_IMMEDIATE for unmap
|
||
- panvk: fix image/buffer destroy to use MODE_IMMEDIATE for unmap
|
||
- vulkan/util: drop workaround for ANB struct
|
||
- panvk: use os_get_option instead of getenv
|
||
- pan/genxml: improve pandecode_dump_file_open logging
|
||
- pan/genxml: fall back to stderr when unable to create CS dump file
|
||
- pan/genxml: use process name to distinguish CS dumps
|
||
- panvk: add PANVK_DEBUG(category) to simplify debug control
|
||
- panvk: adopt PANVK_DEBUG(category)
|
||
- ci/panfrost: udpate panfrost-g610-fails to reflect latest stats
|
||
- panvk: fix to clear FPK with incompatible blend modes
|
||
- calendar: fix 25.3 branch names
|
||
- panvk: use mesa_logi for startup info logs
|
||
- panvk: log device and driver info for startup
|
||
- panvk: allow panvk_pool_alloc_mem to use full slab_size
|
||
- panvk: improve big_bo_pool bo utilization
|
||
- panvk: drop panvk_pool_upload helper
|
||
- panvk: improve error propagation in panvk_pool_upload_aligned
|
||
- panvk: fix to advance vs driver_set properly
|
||
- panvk: fix to advance vs res_table properly
|
||
- panvk: fix sample shading of internal blend shader for MSAA
|
||
- llvmpipe: zero is also a valid fd
|
||
- llvmpipe: fix udmabuf mmap error check
|
||
- llvmpipe: add a missing alloc error handling in fd import
|
||
- llvmpipe: misc fixes for sparse binding
|
||
- glcpp/meson: fix libglcpp generated header dependency
|
||
- panvk: fix mem alloc size for VkBuffer backed by imported blob AHB
|
||
|
||
Yonggang Luo (82):
|
||
|
||
- radv: Move the amdgpu.h defines for Win32 to ac_linux_drm.h
|
||
- addrlib: __debugbreak only present on Windows and from intrin.h
|
||
- util: Refactoring util_dl_get_path_from_proc out of clc/clc_helpers.cpp
|
||
- util: Add namespace over float16_t in half_float.h
|
||
- util: Upgrade xxhash.h to v0.8.3
|
||
- renderdoc: Upgrade to v1.5
|
||
- util: Remove usage of WIN32 macro for DETECT_OS_WINDOWS
|
||
- broadcom: gl_shader_stage_to_broadcom => mesa_shader_stage_to_broadcom
|
||
- gallium: Remove unused TRACE_FLAG_USER_BUFFER
|
||
- gallium/mesa: Change type of tgsi_shader_info::processor st_init_limits::sh to mesa_shader_stage
|
||
- microsoft/clc: {} for struct initialize to avoid warning
|
||
- microsoft/clc: Improve clc_compiler_test.cpp to use defined expect value
|
||
- microsoft/compiler: Fixes dxcapi.h compiling warning with mingw64-clang
|
||
- util: Remove dbghelp.h that already comes with winsdk and mingw for fix warning with mingw
|
||
- virgl: Fixes warning: cast to smaller integer type 'unsigned long' from 'void \*' [-Wvoid-pointer-to-int-cast]
|
||
- virgl: Fixes differs in parameter lists
|
||
- ci/windows: Enable virgl for MSVC
|
||
- aco: Fixes warning note: ambiguity is between a regular call to this operator and a call with the argument order reversed
|
||
- lavapipe: Revise HAVE_LIBDRM to guard on drm only variables
|
||
- util: Update DETECT_ARCH_X86_64 to exclude _M_ARM64EC
|
||
- util: Add DETECT_ARCH_ARM64EC for defined(_M_ARM64EC) equivalent
|
||
- util: Now DETECT_ARCH_X86_64 can be safely used in rounding.h
|
||
- d3d10umd: Fixes building with mingw/gcc and windows sdk/ddk 10.0.26100.0
|
||
- va: Remove unused variable pscreen
|
||
- va: Use { 0 } initialize struct
|
||
- amdcommon: Use { 0 } initialize struct for .c files
|
||
- radv: Fixes warning implicit conversion from enum type
|
||
- radv: Fixes warning C5287: operands are different enum types 'VkShaderStageFlagBits' and '<unnamed-enum-RADV_GRAPHICS_STAGE_BITS>'; use an explicit cast
|
||
- radv: Fixes warning C5287: operands are different enum types 'rgp_sqtt_marker_event_type' and 'rgp_sqtt_marker_general_api_type';
|
||
- mesa: Remove unused assyntax.h and update related files
|
||
- ci: remove non-existent files in ci watch list
|
||
- meson: Remove redundant TODO:
|
||
- util: Add DETECT_ARCH_SPARC64 for sparc
|
||
- mesa: Remove usage of USE_*ASM in mesa/main/debug.c
|
||
- util: Remove usage of USE_**_ASM macros
|
||
- vc4: Remove the usage of USE_ARM_ASM
|
||
- mesa: refactor the glapi/tls includes into a single, reused header
|
||
- mesa: Remove duplicated deceleration of _mesa_glapi_tls_Dispatch _mesa_glapi_tls_Context
|
||
- meson: Remove unused with_asm_arch and USE_*_ASM macros
|
||
- microsoft/clc: Fixes gcc 14 compile warning about sign-compare
|
||
- microsoft/clc: Fixes gcc 14 compile warning about narrowing conversion
|
||
- d3d12: Fixes warning: enumeration value 'PIPE_FORMAT_NONE' not handled in switch
|
||
- d3d12: Fixes warning: comparison of integer expressions of different signedness
|
||
- d3d12: Fixes warnings: format '%x' expects argument of type 'unsigned int', but argument 2 has type 'HRESULT'
|
||
- d3d12: Fixes warning: format '%d' expects argument of type 'int', but argument 3 has type 'LONG'
|
||
- meson: Use build_always_stale instead of build_always
|
||
- util/format: u_format_gen.h are using UTIL_ARCH_LITTLE_ENDIAN, include util/u_endian.h for it
|
||
- util: Always generate u_format_gen.h as docs need it
|
||
- Revert "glsl: Work around MSVC arm64 optimizer bug"
|
||
- Revert "nir: Temporarily disable optimizations for MSVC ARM64"
|
||
- docs: Update requirement for MSVC
|
||
- util: Remove the __declspec(dllexport) on win32 for PUBLIC export macro
|
||
- util: Implement p_atomic_read for C++ properly.
|
||
- d3d10umd: Fixes gcc warning: enumeration value 'D3D11_SB_OPERAND_TYPE_FUNCTION_BODY' not handled in switch [-Wswitch]
|
||
- dzn: -DVK_USE_PLATFORM_WIN32_KHR is already comes from idep_vulkan_wsi_defines that depends by idep_vulkan_wsi
|
||
- tgsi: Fixes ntt_should_vectorize_io parameters
|
||
- tgsi/nir: Handling TGSI_OPCODE_RET in tgsi_to_nir
|
||
- clang-format: Update the .clang-format files to conformance clang-format json-schema
|
||
- clang-format: Move ForEachMacros into src/.clang-format for freedreno
|
||
- meson: mingw do not need _USE_MATH_DEFINES, only MSVC need it
|
||
- meson: Remove unused predefined macros for windows msvc/gcc
|
||
- meson: Remove redundant '/wd4996' option for MSVC
|
||
- meson: For windows, the with_ld_version_script won't take effect
|
||
- aco: Fixes warning: function get_branch_target/to_clrx_device_name defined but not used
|
||
- glsl: Fixes warning: deprecated directive: ‘%pure-parser’, ‘%error-verbose’
|
||
- meson: Remove non-unused inc_d3d9
|
||
- util: Fixes gcc warning: declaration of 'strndup' shadows a built-in function [-Wshadow]
|
||
- meson: Getting symbols-check.py works for mingw
|
||
- etnaviv: The relative path to build dir is not always valid, fix it
|
||
- lavapipe: fixes warning C5286: implicit conversion from enum 'type1' to 'type2'; use an explicit cast to silence this warning
|
||
- ci/window: Fixes LLVM error Lexer.cpp(1578): error C2065: 'C11AllowedIDCharRanges': undeclared identifier
|
||
- ci/windows: Strip misleading release/15.x
|
||
- ci/windows: Building gallium-d3d10umd with MSVC
|
||
- ci/windows: Improve ci scripts
|
||
- ci/windows: Rename to mesa_deps_packages.ps1
|
||
- ci/windows: Now building the deps with MSVC 2019
|
||
- ci/windows: Use winget to install packages and install Microsoft.WindowsWDK.10.0.26100
|
||
- ci/windows: Bump llvm and SPIRV-LLVM-Translator version tag
|
||
- ci/windows: Bump image tag for enable d3d10umd building
|
||
- ci/windows: Update documents to use winget
|
||
- meson: Update comment to be clear
|
||
- meson/util: Define _GNU_SOURCE for mingw
|
||
|
||
Yurii Kolesnykov (2):
|
||
|
||
- Guard double include of libdrm.h by defining LIBDRM_H
|
||
- Guard call to free_zombie_glx_drawable with condition from its definition
|
||
|
||
Zach Battleman (1):
|
||
|
||
- brw: Initial bits of BFN support
|
||
|
||
Zan Dobersek (7):
|
||
|
||
- tu: disable LRZ writes also for alpha-to-coverage, FS sample coverage output
|
||
- tu: prevent tu_bo unmapping during destruction while being dumped
|
||
- tu/drm: avoid has_set_iova-specific util_vma_heap freeing in tu_bo_init
|
||
- tu/drm: msm backend shouldn't use util_vma_heap in the !has_set_iova codepaths
|
||
- tu/drm: msm's has_set_iova codepath should avoid freeing zombified tu_sparse_vma
|
||
- tu: limit query pool types logged into RMV
|
||
- fd: allow limiting RD dumps to specific frames and submits
|
||
|
||
Zhao, Jiali (2):
|
||
|
||
- amd/vpelib: Extend TMZ value to 8 bit
|
||
- amd/vpelib: Create Function to Check for Blending Feature
|
||
|
||
Zhou Qiankang (2):
|
||
|
||
- anv: Use os_get_page_size for mmap offset alignment to work with page size other than 4K
|
||
- meson: use pointer size for 64-bit detection instead of architecture names
|
||
|
||
abdelhadi (2):
|
||
|
||
- aco, radv: remove line duplicate
|
||
- aco: fix debug info offset
|
||
|
||
bbhtt (1):
|
||
|
||
- meson: Clearly print error when distutils or packaging is missing
|
||
|
||
fossdd (1):
|
||
|
||
- bin/symbols-check: add __(de)register_frame_info_bases to platform symbols
|
||
|
||
jglrxavpok (1):
|
||
|
||
- radv: Avoid calls to strlen when parsing umr output to speed up hang progressing
|
||
|
||
leonperianu (2):
|
||
|
||
- pvr: Advertise KHR_separate_depth_stencil_layouts
|
||
- pvr: add support for VK_KHR_depth_stencil_resolve
|
||
|
||
llyyr (2):
|
||
|
||
- radv: don't set HOST_IMAGE_TRANSFER_BIT if host_image_copy not enabled
|
||
- vulkan: Update enum_to_str conversion to handle AMDX enum names
|
||
|
||
nihui (2):
|
||
|
||
- aco: gfx940 has no mad f32 instruction
|
||
- aco: set program->dev.fused_mad_mix=true for GFX940
|
||
|
||
no92 (1):
|
||
|
||
- gallivm: support LLVM 21
|
||
|
||
norablackcat (2):
|
||
|
||
- rusticl: fix unit tests
|
||
- rusticl: add Test targets
|
||
|
||
sarbes (4):
|
||
|
||
- lima: move RSW packing/unpacking to genxml
|
||
- lima: clean up unused PP struct
|
||
- lima: implement logicops
|
||
- lima: wire up anisotropic filtering
|
||
|
||
sergiuferentz (1):
|
||
|
||
- gfxstream: VirtGpuDevice can be null for Goldfish.
|
||
|
||
serguei (1):
|
||
|
||
- Revert "ci: disable Collabora's farm due to maintenance"
|
||
|
||
sjfricke (1):
|
||
|
||
- nir: Fix gnu-empty-initializer warning
|
||
|
||
stefan11111 (1):
|
||
|
||
- glx: Fix segfault when Nvidia PRIME render offload is enabled, but not used
|
||
|
||
swscm, z1 (1):
|
||
|
||
- amd/vpelib: Ensures type-safe comparison for callback assignment
|