fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-29 06:20:37 +02:00

Author	SHA1	Message	Date
Ian Romanick	c8ba2bc2f0	nir: Pack texture LOD and array index to a single 32-bit value v2: Fix clamped_ai calculation in nir_lower_tex.c. Add nir_tex_src_combined_lod_and_array_index_intel to print_tex_instr. Suggested by Sagar. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27305>	2024-02-02 02:39:10 +00:00
Ian Romanick	78e7f7b377	intel/compiler/xe2: Use new sample_*_mlod messages Note: a future commit will expand the sampler message type to the 6 bits used on Xe2. v2 (Francisco Jerez): Rebase on `07b9bfacc7` ("intel/compiler: Move logical-send lowering to a separate file"). v3: Drop XE2_SAMPLER_MESSAGE_SAMPLE_BIAS_MLOD as it does not actually exist. This resulted in some bigger changes in brw_disasm.c. Noticed by Sagar. v4: Now that XE2_SAMPLER_MESSAGE_SAMPLE_MLODc conflicts with GFX7_SAMPLER_MESSAGE_SAMPLE_GATHER4_PO_C, the determination of min_lod_is_first must include devinfo->ver or previous platforms will break. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27305>	2024-02-02 02:39:09 +00:00
Sagar Ghuge	8690a6b546	intel/compiler/xe2: Handle 6-bit message type for Gfx20+ Message types are expanded to 6-bit encoding now. 5 bits are still the same field from the Sampler Message Descriptor. The most significant bit is now bit 31 of the Sampler Message Descriptor. The messages that have '1 in bit 6 are only to support programmable offsets and those would require message header. If a sampler type shows only 5 bits encoding, it is implied bit 6 equal to 0 and there is no requirement for header. v2 (idr): Trivial formatting changes. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27305>	2024-02-02 02:39:09 +00:00
Ian Romanick	a9ed9cf88b	intel/fs: Move opcode modification before the switch that emits srcs This small refactor simplifies a later commit that will optionally emit some opcodes before the switch (as is already done with the shadow comparitor). v2 (Francisco Jerez): Rebase on `07b9bfacc7` ("intel/compiler: Move logical-send lowering to a separate file"). v3 (Jordan): SHADER_OPCODE_TXL => SHADER_OPCODE_TXL_LZ (was SHADER_OPCODE_TXF_LZ). Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27305>	2024-02-02 02:39:09 +00:00
Ian Romanick	7441af803f	intel/compiler/xe2: Update get_sampler_lowered_simd_width The Bspec also says, "The table below describes the SIMD modes which are supported. SIMD32 and SIMD64 are used for media-type operations only." Perhaps this commit should just add if (devinfo->ver >= 20) return 16; instead. v2: Use reg_unit in get_sampler_lowered_simd_width. Suggested by Sagar. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27305>	2024-02-02 02:39:09 +00:00
Mike Blumenkrantz	24a7f6cd16	zink: add a tu flake Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27422>	2024-02-02 02:23:02 +00:00
Dave Airlie	59fb425e1c	vulkan: update registry/includes to 1.3.277 Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27421>	2024-02-02 01:46:24 +00:00
Jesse Natalie	559f31e202	dzn: Use blits for all non-averaging resolves Trying to do min/max resolves on depth/stencil is failing for me on hardware, just simplify things and always use a manual resolve for modes that aren't average. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:52 +00:00
Jesse Natalie	70fa127c97	dzn: Use correct format for depth/stencil resolves Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:52 +00:00
Jesse Natalie	973c5bd047	dzn: Don't resolve for RESOLVE_MODE_NONE Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:52 +00:00
Jesse Natalie	dd7cfd5255	dzn: Add a debug flag for forcing off native view instancing Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:52 +00:00
Jesse Natalie	a85e8058cb	dzn: Support non-static samplers for meta Some hardware that doesn't support true static samplers, emulates it by copying all static samplers into a reserved portion of every descriptor heap. To support Vulkan's required 4000 live sampler limit in bindless mode, D3D is now able to create descriptor heaps which do not have a reserved portion. Any descriptor heaps above the MaxSamplerDescriptorHeapSizeWithStaticSamplers limit will not have that reserved portion and cannot be used with static samplers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	c286c01136	dzn: Add barrier to copy source for DispatchIndirect copies Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	581a23c0cc	dzn: Add missing handling of VK_PIPELINE_STAGE_2_DRAW_INDIRECT_BIT Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	60aad6ef07	spirv2dxil: Lower the Vulkan memory model and coherent loads/stores Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	003d2da2dc	microsoft/compiler: Add a pass for promoting ACCESS_COHERENT on loads/stores DXIL doesn't have instruction-level coherency. We have 3 options: 1. Promote the instruction to an atomic instruction. We can only do this for 32-bit or 64-bit ops. 2. If using bindless, declare the local resource declaration as globally-coherent. 3. If not using bindless, add globally-coherent to the global resource declaration. This pass does all 3 of these, stopping at the intrinsic level for supported types of atomics, otherwise assigning to the global resource declaration, which will be unused if we're doing bindless, where instead we'll get it from the instruction. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Jesse Natalie	b74cd405d3	microsoft/compiler: Respect ACCESS_COHERENT in UAV variable data DXIL has a globally-coherent field for UAVs. When emitting UAV metadata based on a resource variable, respect the relevant bit in the var data. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5628 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27348>	2024-02-02 01:19:51 +00:00
Ian Romanick	118e0bdc1f	intel/rt: Don't directly generate umul_32x16 The optimization pass will (eventually) turn the imul into a umul_32x16. In many cases, the multiply will be converted to something else. I also tried cloning a bunch of existing imul algebraic patterns for [iu]mul_32x16. This produced the same result, but it was a lot more churn. All of the shaders affected were ray tracing shaders in Q2RTX. This is the only ray tracing workload in my fossil-db. DG2 Totals: Instrs: 191995626 -> 191995079 (-0.00%); split: -0.00%, +0.00% Cycles: 14003803561 -> 14003798040 (-0.00%); split: -0.00%, +0.00% Spill count: 108320 -> 108288 (-0.03%) Fill count: 200695 -> 200663 (-0.02%) Scratch Memory Size: 8755200 -> 8754176 (-0.01%) Totals from 7 (0.00% of 652118) affected shaders: Instrs: 14998 -> 14451 (-3.65%); split: -3.94%, +0.29% Cycles: 137222 -> 131701 (-4.02%); split: -4.10%, +0.07% Spill count: 32 -> 0 (-inf%) Fill count: 32 -> 0 (-inf%) Scratch Memory Size: 19456 -> 18432 (-5.26%) Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27161>	2024-02-02 00:02:05 +00:00
Timothy Arceri	bc0178af57	glsl: don't tree graft globals As per this optimisations description: "Takes assignments to variables that are dereferenced only once and pastes the RHS expression into where the variables dereferenced." However the optimisation is run at compile time before multiple shaders from the same stage could have been pasted together. So this optimisation can incorrectly assume a global is only referenced once since it cannot see the other pieces of the shader stage until link time. Here we skip the optimisation if the variable is a global. We could change it to only run at link time however this optimisation is only run at link time if we are being forced to use GLSL IR to inline a function that glsl to nir cannot handle and this will also be removed in a future patchset. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10482 Fixes: `d75a36a9ee` ("glsl: remove do_copy_propagation_elements() optimisation pass") Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27351>	2024-02-01 23:15:24 +00:00
Eric Engestrom	98197e15cc	ci: explain purpose of the word after the date in image tags Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27379>	2024-02-01 22:10:09 +00:00
Eric Engestrom	b6d70eb099	ci: reduce maximum image tags length from 30 to 20 To keep a margin in case we need to add something more in the future. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27379>	2024-02-01 22:10:09 +00:00
Eric Engestrom	b6fceeaa9f	ci: enforce maximum image tag length Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27379>	2024-02-01 22:10:09 +00:00
Eric Engestrom	73dcdc50d2	ci: drop dash in image tags dates I put dashes in the dates when I first introduced the image tags; it made sense to improve date readability as we had only a handful of these and they barely combined. Nowadays we combine a lot of these tags to form the docker image tags, and we often run out of space. Let's remove these dashes, making dates slightly harder to read, and instead allow these two extra characters to be used in the unique/descriptive part of the tag. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27379>	2024-02-01 22:10:09 +00:00
Gert Wollny	dd267ab434	zink: move zink_resource_copies_reset out of exportable_lock The function takes care of synchronization by itself, so no need to also protect the call by ctx->batch.state->exportable_lock. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	01e64bbf36	zink/sync: remove duplicate assignments in UNSYNCHRONIZED case Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	ef548bf040	zink: extract update_unordered_access_and_get_cmdbuf Use template specialization to handle the static control flow based on template parameters during code generation instead of relying on the optimizer to remove the unused code path. v2: - Fix function opening brace location (zmike) - remove accidently added dead code Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	ceca832662	zink: extract emit_memory_barrier::for_buffer from zink_resource_buffer_barrier Use template specialization to handle the static control flow based on template parameters during code generation instead of relying on the optimizer to remove the unused code path. v2: - move function opening braces to new line and fix indetion (zmike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	8c1ddcace9	zink: extract emit_memory_barrier from zink_resource_image_barrier Replace the generic true/false by an enum to make the intent clearer. Factor out the emission of the barrier, and use template specialization to pick the type of barrier that is to be emitted, because with template specialization the control flow is avoided altogether, whereas with the static code flow it is up to the optimizer to remove the unused bits - which may not happen in debug builds. v2: Fix function start braces (zmike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	2cac3adf31	zink: remove duplicate check and assignment in zink_resource_image_needs_barrier zink_resource_image_barrier already checks and sets the pipeline and the flags. v2: make zink_resource_image_needs_barrier private (zmike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Gert Wollny	de354a48b9	zink: extract check_unordered_exec from zink_get_cmdbuf Avoid some code duplication and interleaving of resource checks v2: Use ALWAYS_INLINE (zmike) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27327>	2024-02-01 21:22:25 +01:00
Yiwei Zhang	558aca10b4	meson: drop -DANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	53d9debcf4	util: refactor to use DETECT_OS_ANDROID except leaving u_endian.h behind to use __ANDROID__ directly to be consistent with the rest in that file, which deserves a different refactor Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	569437221d	gallium: refactor to use DETECT_OS_ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	8762b2fca1	egl: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	5a37340689	turnip: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	5df083eff7	radv: refactor to use DETECT_OS_ANDROID instead of ANDROID Include a tiny refactor in radv_physical_device_get_supported_extensions to avoid a false formatting issue. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	4fd4a6109d	anv: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	a678b7434a	hasvk: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	f245339120	venus: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	f06d7f6942	v3dv: refactor to use DETECT_OS_ANDROID instead of ANDROID Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	2dd95bc4b7	vulkan/runtime: refactor to use DETECT_OS_ANDROID instead of ANDROID Also update vk_android_native_buffer.h to use __ANDROID__ directly. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:49 +00:00
Yiwei Zhang	f7d35be362	vulkan/util: drop redundant code gen from vk_extensions_gen.py driver is now always 'vk'. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27374>	2024-02-01 19:29:48 +00:00
Yiwei Zhang	1e80a426c2	anv: extend implicit fencing support for case requiring implicit write This change extends the coverage to ANV being the producer while consumer is hardware encoder backed by iHD. So we'd apply implicit write to bos backing render target images, which is mostly aligned with i915_batch_submit tracking of the bos being writtern to. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27398>	2024-02-01 18:53:28 +00:00
Yiwei Zhang	be3af5acf6	anv: optimize the implicit fencing support of external memory Previously we apply implicit sync to all external memory, which is a bit redundant since we only need it for the dedicated image scenario (media image imported into Vulkan). This change optimizes just like that while also excluding wsi which has its own way of synchronizing with the compositor. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27398>	2024-02-01 18:53:28 +00:00
Yiwei Zhang	55ac9a08b5	anv: refactor wsi_memory_allocate_info handling Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27398>	2024-02-01 18:53:28 +00:00
Eric Engestrom	f8078e278c	docs/calendar: add 24.1 branchpoint and release schedule Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27403>	2024-02-01 18:02:46 +00:00
Roland Scheidegger	e04eed2827	auxiliary/draw: fix streamout overflow calculation If the stride is larger than the component with the largest offset plus the size of that component, it is still considered an overflow if there's not enough space in the buffer to fit the whole stride-sized thing, even when there would be enough space to actually write all components. This is actually much simpler too, since we don't need to verify the individual components at all (stride is guaranteed to be larger or equal to the component with the largest offset plus the size of that component). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27368>	2024-02-01 17:10:40 +00:00
Eric Engestrom	eb4effeead	v3d-rpi4-gl: reduce the parallelism from 10 to 8 We are slightly over-subscribed right now, with 21 merge jobs (10 vk + 10 gl + 1 traces) for 20 RPi4, so let's split the tests between slightly fewer RPi4 and make each split job run for slightly longer, because it also means that all the jobs start immediately, reducing the overall delay in merging any MR that triggers this job. The typical run time for this job was around 8 min with a 10-split; with an 8-split it is now 9 min, which is still within the 10 min target and well below the 15min limit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27391>	2024-02-01 13:45:59 +00:00
Lionel Landwerlin	03490ec019	vulkan/runtime: rework VK_KHR_dynamic_rendering_local_read state tracking I missed a bunch of things like input tracking. Also take the opportunity to rename things. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `fe19405c46` ("vulkan/runtime: handle new dynamic states for attachment remapping") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27380>	2024-02-01 13:20:21 +00:00
Lionel Landwerlin	d7f5a815e3	vulkan/multialloc: bump max number to 16 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27380>	2024-02-01 13:20:21 +00:00

1 2 3 4 5 ...

184110 commits