fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 11:18:11 +02:00

Author	SHA1	Message	Date
Jesse Natalie	973c5bd047	dzn: Don't resolve for RESOLVE_MODE_NONE Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:52 +00:00
Jesse Natalie	dd7cfd5255	dzn: Add a debug flag for forcing off native view instancing Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:52 +00:00
Jesse Natalie	a85e8058cb	dzn: Support non-static samplers for meta Some hardware that doesn't support true static samplers emulates them by copying all static samplers into a reserved portion of every descriptor heap. To support Vulkan's required 4000 live sampler limit in bindless mode, D3D is now able to create descriptor heaps which do not have a reserved portion. Any descriptor heaps above the MaxSamplerDescriptorHeapSizeWithStaticSamplers limit will not have that reserved portion and cannot be used with static samplers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	c286c01136	dzn: Add barrier to copy source for DispatchIndirect copies Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	581a23c0cc	dzn: Add missing handling of VK_PIPELINE_STAGE_2_DRAW_INDIRECT_BIT Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	60aad6ef07	spirv2dxil: Lower the Vulkan memory model and coherent loads/stores Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	003d2da2dc	microsoft/compiler: Add a pass for promoting ACCESS_COHERENT on loads/stores DXIL doesn't have instruction-level coherency. We have 3 options: 1. Promote the instruction to an atomic instruction. We can only do this for 32-bit or 64-bit ops. 2. If using bindless, declare the local resource declaration as globally-coherent. 3. If not using bindless, add globally-coherent to the global resource declaration. This pass does all 3 of these, stopping at the intrinsic level for supported types of atomics, otherwise assigning to the global resource declaration, which will be unused if we're doing bindless, where instead we'll get it from the instruction. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	b74cd405d3	microsoft/compiler: Respect ACCESS_COHERENT in UAV variable data DXIL has a globally-coherent field for UAVs. When emitting UAV metadata based on a resource variable, respect the relevant bit in the var data. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5628 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Ian Romanick	118e0bdc1f	intel/rt: Don't directly generate umul_32x16 The optimization pass will (eventually) turn the imul into a umul_32x16. In many cases, the multiply will be converted to something else. I also tried cloning a bunch of existing imul algebraic patterns for [iu]mul_32x16. This produced the same result, but it was a lot more churn. All of the shaders affected were ray tracing shaders in Q2RTX. This is the only ray tracing workload in my fossil-db. DG2 Totals: Instrs: 191995626 -> 191995079 (-0.00%); split: -0.00%, +0.00% Cycles: 14003803561 -> 14003798040 (-0.00%); split: -0.00%, +0.00% Spill count: 108320 -> 108288 (-0.03%) Fill count: 200695 -> 200663 (-0.02%) Scratch Memory Size: 8755200 -> 8754176 (-0.01%) Totals from 7 (0.00% of 652118) affected shaders: Instrs: 14998 -> 14451 (-3.65%); split: -3.94%, +0.29% Cycles: 137222 -> 131701 (-4.02%); split: -4.10%, +0.07% Spill count: 32 -> 0 (-inf%) Fill count: 32 -> 0 (-inf%) Scratch Memory Size: 19456 -> 18432 (-5.26%) Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27161>	2024-02-02 00:02:05 +00:00
Timothy Arceri	bc0178af57	glsl: don't tree graft globals As per this optimisation's description: "Takes assignments to variables that are dereferenced only once and pastes the RHS expression into where the variables dereferenced." However, the optimisation is run at compile time, before multiple shaders from the same stage could have been pasted together. So this optimisation can incorrectly assume a global is only referenced once, since it cannot see the other pieces of the shader stage until link time. Here we skip the optimisation if the variable is a global. We could change it to only run at link time; however, this optimisation is only run at link time if we are being forced to use GLSL IR to inline a function that glsl to nir cannot handle, and this will also be removed in a future patchset. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10482 Fixes: `d75a36a9ee` ("glsl: remove do_copy_propagation_elements() optimisation pass") Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27351>	2024-02-01 23:15:24 +00:00
Gert Wollny	dd267ab434	zink: move zink_resource_copies_reset out of exportable_lock The function takes care of synchronization by itself, so no need to also protect the call by ctx->batch.state->exportable_lock. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	01e64bbf36	zink/sync: remove duplicate assignments in UNSYNCHRONIZED case Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	ef548bf040	zink: extract update_unordered_access_and_get_cmdbuf Use template specialization to handle the static control flow based on template parameters during code generation instead of relying on the optimizer to remove the unused code path. v2: - Fix function opening brace location (zmike) - remove accidentally added dead code Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	ceca832662	zink: extract emit_memory_barrier::for_buffer from zink_resource_buffer_barrier Use template specialization to handle the static control flow based on template parameters during code generation instead of relying on the optimizer to remove the unused code path. v2: - move function opening braces to new line and fix indentation (zmike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	8c1ddcace9	zink: extract emit_memory_barrier from zink_resource_image_barrier Replace the generic true/false by an enum to make the intent clearer. Factor out the emission of the barrier, and use template specialization to pick the type of barrier that is to be emitted, because with template specialization the control flow is avoided altogether, whereas with the static code flow it is up to the optimizer to remove the unused bits - which may not happen in debug builds. v2: Fix function start braces (zmike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	2cac3adf31	zink: remove duplicate check and assignment in zink_resource_image_needs_barrier zink_resource_image_barrier already checks and sets the pipeline and the flags. v2: make zink_resource_image_needs_barrier private (zmike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	de354a48b9	zink: extract check_unordered_exec from zink_get_cmdbuf Avoid some code duplication and interleaving of resource checks v2: Use ALWAYS_INLINE (zmike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Yiwei Zhang	53d9debcf4	util: refactor to use DETECT_OS_ANDROID except leaving u_endian.h behind to use __ANDROID__ directly to be consistent with the rest in that file, which deserves a different refactor Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	569437221d	gallium: refactor to use DETECT_OS_ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	8762b2fca1	egl: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	5a37340689	turnip: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	5df083eff7	radv: refactor to use DETECT_OS_ANDROID instead of ANDROID Include a tiny refactor in radv_physical_device_get_supported_extensions to avoid a false formatting issue. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	4fd4a6109d	anv: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	a678b7434a	hasvk: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	f245339120	venus: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	f06d7f6942	v3dv: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	2dd95bc4b7	vulkan/runtime: refactor to use DETECT_OS_ANDROID instead of ANDROID Also update vk_android_native_buffer.h to use __ANDROID__ directly. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	f7d35be362	vulkan/util: drop redundant code gen from vk_extensions_gen.py driver is now always 'vk'. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:48 +00:00
Yiwei Zhang	1e80a426c2	anv: extend implicit fencing support for case requiring implicit write This change extends the coverage to ANV being the producer while consumer is hardware encoder backed by iHD. So we'd apply implicit write to bos backing render target images, which is mostly aligned with i915_batch_submit tracking of the bos being written to. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27398>	2024-02-01 18:53:28 +00:00
Yiwei Zhang	be3af5acf6	anv: optimize the implicit fencing support of external memory Previously we apply implicit sync to all external memory, which is a bit redundant since we only need it for the dedicated image scenario (media image imported into Vulkan). This change optimizes just like that while also excluding wsi which has its own way of synchronizing with the compositor. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27398>	2024-02-01 18:53:28 +00:00
Yiwei Zhang	55ac9a08b5	anv: refactor wsi_memory_allocate_info handling Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27398>	2024-02-01 18:53:28 +00:00
Roland Scheidegger	e04eed2827	auxiliary/draw: fix streamout overflow calculation If the stride is larger than the component with the largest offset plus the size of that component, it is still considered an overflow if there's not enough space in the buffer to fit the whole stride-sized thing, even when there would be enough space to actually write all components. This is actually much simpler too, since we don't need to verify the individual components at all (stride is guaranteed to be larger or equal to the component with the largest offset plus the size of that component). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27368>	2024-02-01 17:10:40 +00:00
Eric Engestrom	eb4effeead	v3d-rpi4-gl: reduce the parallelism from 10 to 8 We are slightly over-subscribed right now, with 21 merge jobs (10 vk + 10 gl + 1 traces) for 20 RPi4, so let's split the tests between slightly fewer RPi4 and make each split job run for slightly longer, because it also means that all the jobs start immediately, reducing the overall delay in merging any MR that triggers this job. The typical run time for this job was around 8 min with a 10-split; with an 8-split it is now 9 min, which is still within the 10 min target and well below the 15 min limit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27391>	2024-02-01 13:45:59 +00:00
Lionel Landwerlin	03490ec019	vulkan/runtime: rework VK_KHR_dynamic_rendering_local_read state tracking I missed a bunch of things like input tracking. Also take the opportunity to rename things. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `fe19405c46` ("vulkan/runtime: handle new dynamic states for attachment remapping") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27380>	2024-02-01 13:20:21 +00:00
Lionel Landwerlin	d7f5a815e3	vulkan/multialloc: bump max number to 16 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27380>	2024-02-01 13:20:21 +00:00
Erik Faye-Lund	4de62731f4	mesa/main: add support for EXT_texture_storage It's sometimes really, really useful if GL_BGRA8 can be used as a sized internal format, and the combination of EXT_texture_storage and EXT_texture_format_BGRA8888 allows this (only when using texture-storage, which is good enough in some cases). Until now, we've only implemented ARB_texture_storage, and not the EXT version. So let's implement the EXT version as well, so we get the benefit of the interaction here. This pulls in a lot of other similar interactions as well, which also seems useful. ...because the ARB version is created from the EXT version, let's move the EXT function definitions to the EXT extension. These should probably have been suffixed with ARB in the ARB-version, but things seem to have just ended up kinda confused. Oh well. Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27222>	2024-02-01 12:30:58 +00:00
Alejandro Piñeiro	16f6f50ce4	v3dv: expose VK_EXT_depth_clip_enable We already had the logic implemented, but it was never really tested (there was a comment about that). So the advantage of this is that we now test that code (in fact, there was a small typo in that code). There aren't many CTS tests for this feature, but we get tests like this working: dEQP-VK.clipping.clip_volume.depth_clip.* Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10527 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27386>	2024-02-01 11:33:38 +00:00
Luc Ma	03371887d5	gallium/u_blitter: Fix a few uninitialized fb_state An uninitialized pipe_framebuffer_state may cause issues if someone uses its members, such as `fb_state.layers`. Signed-off-by: Luc Ma <luc@sietium.com> Reported-by: Mark Zhou <mark@sietium.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27332>	2024-02-01 10:59:53 +00:00
Pierre-Eric Pelloux-Prayer	6f47e87a60	egl/drm: flush before calling get_back_bo Similar to what was done for Wayland in `58f90fd03f`: the glthread unmarshal thread needs to be idle to avoid concurrent calls to get_back_bo. Also the existing code flushed after setting dri2_surf->back to NULL so a new back buffer was always allocated by the glthread flush: |---------------> dri2_drm_swap_buffers | get_back_bo (back=0x55eb93c6c488) > # First get_back_bo call | get_back_bo (back=0x55eb93c6c488 age: 0)< | # dri2_surf->back = NULL |-----> FLUSH | get_back_bo (back=nil) > # Another get_back_bo call | get_back_bo (back=0x55eb93c6c4c8 age: 3)< |-----< FLUSH |---------------< dri2_drm_swap_buffers Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10437 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27143>	2024-02-01 09:26:33 +00:00
Pierre-Eric Pelloux-Prayer	e4f7754977	radeonsi: try to disable dcc if compute_blit is the only option COMPUTE contexts have no blitter, so there is no fallback if si_can_use_compute_blit fails. One solution would be to disable DCC globally when a COMPUTE context is created, but I'm not 100% sure it's a good idea. Until then this commit can fix a number of cases and will also prevent crashing if si_compute_blit fails. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10296 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27295>	2024-02-01 09:28:31 +01:00
Friedrich Vock	f66055a6a6	radv/rt: Write inactive node data in ALWAYS_ACTIVE workaround Fixes: `a9831caa` ("radv/rt: Add workaround to make leaves always active") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27340>	2024-02-01 05:42:59 +00:00
Faith Ekstrand	60071f94e5	nvk: Use the upload queue for shader uploads Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00
Faith Ekstrand	aea4c9a913	nvk: Add an upload queue to nvk_device Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00
Faith Ekstrand	2074e28a0d	nvk: Add an upload queue Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00
Faith Ekstrand	e6f137e9ed	nvk: Only map heaps that explicitly request maps Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00
Faith Ekstrand	e162c2e78e	nvk: Use VM_BIND for contiguous heaps instead of copying This gets rid of our (fairly sketchy) heap resizing via stall-and-copy and replaces it with VM_BIND. We couldn't do this on the old nouveau API but now that we can assume VM_BIND, it makes everything simpler. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00
Faith Ekstrand	f0fad6ed17	nvk/queue: Only initialize the necessary engines Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00
Faith Ekstrand	ced7c5193e	nvk/queue: Rework context state init The queue now owns the nv_push and just invokes the per-engine functions to fill it with context state init data. This also splits out 3D and compute into separate helpers and pulls M2MF off into its own thing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00
Faith Ekstrand	b02f83e5c6	nvk: Add an array of queue families to nvk_physical_device Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00
Faith Ekstrand	86e79cd744	nvk: Move the nouveau_ws_context to nvk_queue Otherwise, different queues aren't actually going to run independently. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27205>	2024-02-01 03:51:08 +00:00

170458 commits