When lower_simd_width() encounters an instruction that needs a larger
SIMD width (for example, SHADER_OPCODE_TXS_LOGICAL in Gfx4 needs at
least SIMD16), the builder needs to be at least as wide as max_width,
otherwise the group() setup will assert.
It turns out this did not assert before "by accident": the code was
relying on the default fs_visitor builder, which had a dispatch width of
64, a bogus placeholder value that was not expected to be used.
However, when we changed the code to remove that builder (and the bogus
value), we created a new builder in the pass using the shader's
dispatch_width, which works fine except in the case where we want to
"lower" the SIMD above the shader dispatch width. The fix is to also
consider the already calculated max_width when creating the builder.
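As a rough illustration of the idea (the names below are placeholders,
not the actual fs_builder API), the builder width is effectively chosen
as the maximum of the two values so that splitting into max_width groups
can never exceed it:

    #include <assert.h>

    /* Hypothetical sketch: the real code constructs an fs_builder, but the
     * underlying requirement is only that its width covers max_width.
     */
    static unsigned
    builder_width(unsigned dispatch_width, unsigned max_width)
    {
       /* e.g. SIMD8 dispatch, but the instruction requires SIMD16 */
       unsigned width = dispatch_width > max_width ? dispatch_width : max_width;
       assert(width >= max_width);   /* what the group() setup checks */
       return width;
    }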
Fixes: 5b8ec015f2 ("intel/compiler: Don't use fs_visitor::bld in remaining places")
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10338
Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27782>
We first generate the logical opcodes, and these days fully lower them
to SHADER_OPCODE_SEND. In the past, we lowered to a non-logical variant
and handled that in the generator. More recently, we were just using the
non-logical opcodes as an awkward intermediate step during the
lowering... which isn't really necessary at all.
This patch eliminates them by using the original logical opcodes.
Reviewed-by: Caio Oliveira <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27908>
We're not really doing any fancy texturing here, just emitting a TEX
instruction that writes multiple destination registers. I plan to
remove the non-logical TEX instruction in the next commit, so we swap
these over to use the logical version instead. It should work just as
well for the purposes of the test.
Reviewed-by: Caio Oliveira <caio.oliveira@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27908>
Unaligned stores are undefined behavior according to the C rules, hence
address sanitizers may complain. Even though this worked fine in
practice (it's almost impossible here for the compiler to do something
"wrong" even if it assumes the store is aligned, given such stores work
just fine on x86), we should follow the rules.
The widely accepted solution for this nowadays (it may be somewhat
surprising that there is no way to express an unaligned assignment
explicitly in C) is to just use memcpy(). The compiler should figure
out, at least with optimizations enabled, that it's just a trivial store
and optimize it back to a single CPU instruction, while still satisfying
asan. (I've verified that even in debug builds the memcpy() is actually
optimized away anyway; I suspect some compiler flag somewhere forces
this behavior.)
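A minimal self-contained example of the pattern (the function name is
made up, not the code being changed):

    #include <stdint.h>
    #include <string.h>

    /* Store a 32-bit value to a possibly misaligned address. A plain
     * "*(uint32_t *)dst = value" is undefined behavior when dst is not
     * suitably aligned; memcpy() is well-defined and compilers turn it
     * back into a single store when they can prove it is trivial.
     */
    static inline void
    store_u32_unaligned(void *dst, uint32_t value)
    {
       memcpy(dst, &value, sizeof(value));
    }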
Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/10208
Reviewed-by: Jose Fonseca <jose.fonseca@broadcom.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27896>
Some people seem to have systems with more than 8 GPUs installed at
once. 256 is the maximum number of devices returned by libdrm, so using
this seems like a good choice for now.
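For illustration, the usual libdrm enumeration pattern with a fixed
upper bound looks roughly like this (the constant and function names are
made up, not the code being changed):

    #include <xf86drm.h>

    #define MAX_DRM_DEVICES 256   /* upper bound on what libdrm returns */

    static int
    count_gpus(void)
    {
       drmDevicePtr devices[MAX_DRM_DEVICES];
       int count = drmGetDevices2(0, devices, MAX_DRM_DEVICES);

       if (count < 0)
          return count;   /* enumeration failed */

       drmFreeDevices(devices, count);
       return count;
    }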
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27901>
Pick the compressed-no-clear aux state instead of aux-invalid state to
reduce ambiguates in iris. Hopefully this will help debug the issues
seen around MCS-enabling on ACM.
Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27881>
Xe KMD also checks that the cpu_caching set during BO creation matches
the caching of the PAT index set in the VM unbind.
This went unnoticed until now by luck and the lack of testing on MTL.
So here we always set the PAT index for all VM operations that have a
BO associated.
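Roughly, at the uAPI level this means filling pat_index for every bind
op that references a BO, including unbinds. The sketch below uses the Xe
uAPI field names as the author understands them and is an assumption,
not the actual driver code:

    #include <string.h>
    #include "drm-uapi/xe_drm.h"   /* assumed include path for the Xe uAPI */

    /* Hypothetical helper: the point is only that pat_index is set
     * unconditionally whenever a BO is involved, since the KMD validates
     * it against the BO's cpu_caching.
     */
    static void
    fill_bind_op(struct drm_xe_vm_bind_op *op, uint32_t bo_handle,
                 uint64_t addr, uint64_t range, uint16_t pat_index,
                 uint32_t bind_op)
    {
       memset(op, 0, sizeof(*op));
       op->obj = bo_handle;
       op->addr = addr;
       op->range = range;
       op->op = bind_op;            /* e.g. DRM_XE_VM_BIND_OP_UNMAP_ALL */
       op->pat_index = pat_index;   /* always set, even for unbinds */
    }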
v2: (Jose)
- Move pat_index a little bit up
- Copy commit message from iris patch
Fixes: 19439624d9 ("anv: Use DRM_XE_VM_BIND_OP_UNMAP_ALL to unbind whole bos")
Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com>
Reviewed-by: José Roberto de Souza <jose.souza@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27893>
Xe KMD also checks that the cpu_caching set during BO creation matches
the caching of the PAT index set in the VM unbind.
This went unnoticed until now by luck and the lack of testing on MTL.
So here we always set the PAT index for all VM operations that have a
BO associated.
Fixes: eb18a92ef9 ("iris: Fill PAT fields in Xe KMD gem_create and vm_bind uAPIs")
Signed-off-by: José Roberto de Souza <jose.souza@intel.com>
Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27893>
Thanks to Ken for suggesting this URB refactoring change and pointing
out that the LSC can operate at byte offset granularity.
This should fix the geometry shader test cases where we have more than
32 vertices, since previously we were failing to write the correct
control data bits because of an incorrect write mask.
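A small standalone sketch of the indexing involved (the layout
assumption of control data bits packed into DWords and all names are
illustrative, not the actual compiler code):

    #include <stdint.h>

    /* For vertex 'vert' with 'bits_per_vertex' control data bits, find the
     * DWord holding its bits and the byte offset an LSC write can target
     * directly. With more than 32 vertices the bits spill past DWord 0, so
     * the DWord index (and thus the write mask/offset) must be computed
     * per vertex rather than assumed to be 0.
     */
    static inline void
    control_data_location(uint32_t vert, uint32_t bits_per_vertex,
                          uint32_t *dword_index, uint32_t *bit_in_dword,
                          uint32_t *byte_offset)
    {
       uint32_t bit = vert * bits_per_vertex;
       *dword_index = bit / 32;
       *bit_in_dword = bit % 32;
       *byte_offset = *dword_index * 4;   /* LSC addresses bytes directly */
    }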
Shader-db results for Xe2:
total instructions in shared programs: 153475 -> 153437 (-0.02%)
instructions in affected programs: 1374 -> 1336 (-2.77%)
helped: 11
HURT: 0
helped stats (abs) min: 3 max: 5 x̄: 3.45 x̃: 3
helped stats (rel) min: 1.67% max: 4.92% x̄: 3.23% x̃: 2.70%
95% mean confidence interval for instructions value: -3.92 -2.99
95% mean confidence interval for instructions %-change: -4.10% -2.36%
Instructions are helped.
total loops in shared programs: 140 -> 140 (0.00%)
loops in affected programs: 0 -> 0
helped: 0
HURT: 0
total cycles in shared programs: 16002649 -> 16002329 (<.01%)
cycles in affected programs: 9174 -> 8854 (-3.49%)
helped: 11
HURT: 0
helped stats (abs) min: 22 max: 38 x̄: 29.09 x̃: 32
helped stats (rel) min: 2.62% max: 5.54% x̄: 3.78% x̃: 3.85%
95% mean confidence interval for cycles value: -33.56 -24.62
95% mean confidence interval for cycles %-change: -4.48% -3.08%
Cycles are helped.
total spills in shared programs: 52 -> 52 (0.00%)
spills in affected programs: 0 -> 0
helped: 0
HURT: 0
total fills in shared programs: 94 -> 94 (0.00%)
fills in affected programs: 0 -> 0
helped: 0
HURT: 0
total sends in shared programs: 4240 -> 4240 (0.00%)
sends in affected programs: 0 -> 0
helped: 0
HURT: 0
LOST: 0
GAINED: 0
Rework: (Sagar)
- Adjust offset/indirect offset calculation.
- Add shader-db results
- Always calculate dword index
- Drop changes for indirect writes
Signed-off-by: Rohan Garg <rohan.garg@intel.com>
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27602>
PKT3_SET_PREDICATION is GFX only, even on GFX6.
This fixes recent
dEQP-VK.conditional_rendering.dispatch.*_compute_queue.
Cc: mesa-stable
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27746>
Fold and/or into braa/brao when profitable. Only do this when the and/or
is not used by any non-branch instructions, as folding would otherwise
increase the total instruction count.
Add an algebraic nir pass that applies the inverse of DeMorgan's laws to
try to bring the and/or directly in front of branches. Again, only do
this when the original inot is only used for branches. This should
always decrease instruction count since the extra inots can be folded
into the branch.
Fold inot into branches by using the inv1/inv2 cat0 fields.
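The nir rewrite relies on nothing more than the DeMorgan identities; a
trivial standalone check of the directions used (purely illustrative,
not the actual nir_algebraic rules):

    #include <assert.h>

    int
    main(void)
    {
       /* Rewriting !a && !b as !(a || b) (and !a || !b as !(a && b)) moves
        * the and/or directly in front of the branch, where the remaining
        * inot can be folded away via the inv1/inv2 fields.
        */
       for (int a = 0; a <= 1; a++) {
          for (int b = 0; b <= 1; b++) {
             assert((!a && !b) == !(a || b));
             assert((!a || !b) == !(a && b));
          }
       }
       return 0;
    }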
Signed-off-by: Job Noorman <jnoorman@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>
Instead of creating a cmps.s.ne for every use of a predicate, create
just one and insert it after the instruction whose def is tested. This
reduces the number of compares or, when they are folded into bitwise
operations, the number of those operations.
It also decreases register pressure on GPRs by increasing pressure on
predicate registers. This should be preferred in general since at worst,
the predicate register will be spilled to a GPR again.
Signed-off-by: Job Noorman <jnoorman@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>
On a6xx+, bitwise operations can directly write to predicate registers.
The result will be 1 iff the result of the non-predicate operation would
be non-zero.
When generating instructions that need a predicate source, ir3 will
insert a cmps.s.ne 0 instruction to guarantee a predicate can be
produced. This patch keeps that in place and adds a pass that tries to
optimize useless comparisons away.
Concretely:
- Look through chains of multiple cmps.s.ne instructions and remove all
but the first.
- If the source of the cmps.s.ne can write directly to predicates,
remove the cmps.s.ne.
In both cases, no instructions are actually removed but clones are made,
and we rely on DCE to remove anything that became unused. Note that it's
fine to always make a clone: even in the case where the original
instruction is also used for non-predicate sources (so it won't be
DCE'd), we replaced a cmps.s.ne with another instruction, so this pass
should never increase instruction count.
Note that this pass replaces the double-comparison folding that was
performed by ir3_cp before.
Signed-off-by: Job Noorman <jnoorman@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>
In principle, validating the result of predicate RA works exactly the
same as for normal RA. There is one slight issue: spilling is
implemented by cloning the instruction that produced the original def,
which might cause different defs to legitimately reach the same source.
For example:
bool b = ...;
if (...) {
    // b gets spilled and reloaded
} else {
    // b is not spilled
}
use b;
The use of b might see different reaching defs. To solve this, RA will
store a pointer to the original def in the instruction's data field.
Validation then uses the original def.
Signed-off-by: Job Noorman <jnoorman@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>
This would already be caught by another check but would produce a
message that was difficult to interpret. Better to check for it
explicitly.
Signed-off-by: Job Noorman <jnoorman@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>
Up to now, ir3 only supported one predicate register (p0.x). However,
since a6xx, four predicate registers are available. This patch adds a
register allocator for predicate registers that allows all of them to be
used. The RA also works for older generations with only one register.
The use of p0.x was hard-coded in many places in ir3. This has been
replaced by a new flag, IR3_REG_PREDICATE, to indicate that an SSA value
should be allocated to a predicate register.
The RA uses the standard liveness analysis available in ir3. Using this,
registers are allocated in a single pass over all blocks. For each block
we keep track of currently live defs in the registers. Predicate
destinations allocate a new register and sources take the register from
their def.
The live defs of a block are initialized with the intersection of the
live-out defs of their predecessors: if all predecessors have the same
live-out def in the same register, it is used as live-in. However, we
only do this for defs that are actually live-in according to the
liveness analysis.
This doesn't work for loops: since predecessors from back edges are
processed after their successors, we don't know their live-out state
yet. We solve this by ignoring such predecessors while calculating the
live-in state. When such a predecessor is later processed, we fix up its
live-out state to match what its successor expects by reloading defs if
necessary.
Spilling is implemented by reloading, or rematerializing, the
instruction that produced the def. Whenever we need a new register while
none are available, we simply free one. If the freed def is later needed
again, we clone the original instruction in front of the new use. We
keep track of the original def the reload is cloned from so that
subsequent uses can reuse the reload.
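A toy model of the register book-keeping (all types and names below are
made up; the real pass works on ir3 instructions and records the
original def so reloads can be shared):

    #define NUM_PRED_REGS 4

    struct toy_def {
       int reg;   /* allocated predicate register, or -1 if spilled */
    };

    /* Which def currently occupies each predicate register. */
    static struct toy_def *live[NUM_PRED_REGS];

    /* Give 'def' a register, evicting an arbitrary live def when the file
     * is full. In the real pass an evicted def that is used again later is
     * rematerialized by cloning its producing instruction in front of the
     * new use.
     */
    static int
    allocate_pred_reg(struct toy_def *def)
    {
       for (int i = 0; i < NUM_PRED_REGS; i++) {
          if (!live[i]) {
             live[i] = def;
             return def->reg = i;
          }
       }

       live[0]->reg = -1;   /* "spill": simply forget the current occupant */
       live[0] = def;
       return def->reg = 0;
    }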
Signed-off-by: Job Noorman <jnoorman@igalia.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27411>