fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 23:48:18 +02:00

Author	SHA1	Message	Date
Paulo Zanoni	60f75a013e	hasvk: don't leave undefined values in exec->syncobj_values This is the Hasvk version of Anv's: `ad6a036a68` ("anv: don't leave undefined values in exec->syncobj_values") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20800> (cherry picked from commit `80196aaa5b`)	2023-01-26 15:40:35 +00:00
Francisco Jerez	8f0b387d94	intel/fs: Fix src and dst types of LOAD_PAYLOAD ACP entries during copy propagation. The ACP entries created by copy propagation to track the implied copies of LOAD_PAYLOAD instructions don't model the behavior of LOAD_PAYLOAD correctly, since (as of `41868bb682`) header moves are implicitly retyped to UD and the destination of non-header copies implicitly uses the same type as the corresponding source, even though the ACP entries created for such copies could incorrectly represent a type conversion, which can lead to mis-optimization of the program. According to Marcin, this fixes the func.mesh.ext.workgroup_id.task.q0 crucible test. Fixes: `41868bb682` ("i965/fs: Rework the fs_visitor LOAD_PAYLOAD instruction") Reported-by: Marcin Ślusarz <marcin.slusarz@intel.com> Tested-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18980> (cherry picked from commit `7b5e933629`)	2023-01-26 15:40:35 +00:00
Marcin Ślusarz	e79f4e0b12	intel/compiler/mesh: handle const data in task & mesh programs Started showing up when nir_opt_large_constants call was moved in `88756cee8d`. Fixes dEQP-VK.mesh_shader.ext.smoke.monolithic.fullscreen_gradient* Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `88756cee8d` ("intel/compiler: Run nir_opt_large_constants before scalarizing consts") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20876> (cherry picked from commit `536a2acfc2`)	2023-01-26 15:40:34 +00:00
Lionel Landwerlin	1c243f4f8b	intel/fs: avoid cmod optimization on instruction with different write_mask I've been running into failures with tests like : dEQP-VK.robustness.robustness2.bind.notemplate.rgba32i.unroll.nonvolatile.uniform_buffer_dynamic.no_fmt_qual.len_4.samples_1.1d.frag With the load_global_const_block_intel NIR intrinsic, you can load a vec8/vec16 with a predicate. The predicate is correctly uniformized to feed into the SEND instruction's flag register. The problem is that a series of optimization first remove the find_live_channel and then changes the broadcast into a simple MOV instruction, on the assumption that the first channel is always active if there is not control flow. This is correct. But after that the cmod optimzation will remove this instruction : mov.nz.f0.0(16) null:D, vgrf16+0.0<0>:D NoMask because it seems to be equivalent to : cmp.g.f0.0(16) vgrf16:D, vgrf12:D, 63d In this case vgrf16 is the predicate to the load block SEND instruction. Since the execution mask is different between both, some of the channels of the SEND instruction end up not being loaded or loaded with the wrong predication and we end up with incorrect UBO data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20852> (cherry picked from commit `a50d2fdb46`)	2023-01-26 15:40:34 +00:00
Tapani Pälli	74d02d26df	hasvk: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable Field must be disabled if any render targets have integer format. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20671> (cherry picked from commit `58dd9d5134`)	2023-01-26 15:40:33 +00:00
Tapani Pälli	5d473b4282	anv: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable Field must be disabled if any render targets have integer format, additionally for Gfx12+ field must be disabled when num multisamples > 1 or forced multisample count > 1. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20671> (cherry picked from commit `9b37ef40f8`)	2023-01-26 15:40:33 +00:00
Kenneth Graunke	89e679803b	intel/compiler: Drop redundant 32-bit expansion for shared float atomics We already expanded data to 32-bit a few lines earlier, so this is just redundantly doing it a second time. Fixes: `43169dbbe5` ("intel/compiler: Support 16 bit float ops") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20604> (cherry picked from commit `f7b29d7924`)	2023-01-26 15:40:32 +00:00
Francisco Jerez	7fe1d202d5	intel/fs/gfx12: Ensure that prior reads have executed before barrier with acquire semantics. This avoids a violation of the Vulkan memory model that was leading to intermittent failures of at least 8k test-cases of the Vulkan CTS (within the group dEQP-VK.memory_model.) on TGL and DG2 platforms. In theory the issue may be reproducible on earlier platforms like IVB and ICL, but the SYNC.ALLWR instruction is not available on those platforms so a different (likely costlier) fix will be needed. The issue occurs within the sequence we emit for a NIR memory barrier with acquire semantics requiring the synchronization of multiple caches, e.g. in pseudocode for a barrier involving the TGM and UGM caches on DG2: x <- load.ugm // Atomic read sequenced-before the barrier y <- fence.ugm z <- fence.tgm wait(y, z) w <- load.tgm // Read sequenced-after the barrier In the example we must provide the guarantee that the memory load for x is completed before the one for w, however this ordering can be reversed with the intervention of a concurrent thread, since the UGM fence will block on the prior UGM load and potentially take a long time, while the TGM fence may complete and invalidate the TGM cache immediately, so a concurrent thread could pollute the TGM cache with stale contents for the w location before* the UGM load has completed, leading to an inversion of the expected memory ordering. v2: Apply the workaround regardless of whether the NIR barrier intrinsic specifies multiple storage classes or a single one, since an acquire barrier is required to order subsequent requests relative to previous atomic requests of unknown storage class not necessarily specified by the memory scope information of the intrinsic. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20690> (cherry picked from commit `4a2e7306dd`)	2023-01-26 15:40:31 +00:00
Paulo Zanoni	9098d83fb3	anv: check the return value of anv_execbuf_add_bo_bitset() Because anv_execbuf_add_bo_bitset() calls anv_execbuf_add_bo(), which can fail if its memory allocations fail. I have seen dEQP tests exercising memory allocation failures during anv_execbuf_add_bo(), but I don't think the path coming from add_bo_biset() was specifically exercised. Anyway, add the error check just in case. v2: Rebase. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703> (cherry picked from commit `3d37950fd9`)	2023-01-26 15:40:31 +00:00
Paulo Zanoni	f2aaa18997	anv: don't leave undefined values in exec->syncobj_values In anv_execbuf_add_syncobj(), we try to not create or use exec->syncobj_values if we don't need to. But when we figure we're going to need it (i.e., when timeline_value is not zero), then we create exec->syncobj_values with vk_zalloc, which means every previous value is set to zero, as it should be. This is all correct. The problem starts when we add a 16th element. In this case we double exec->syncobj_array_length and realloc the buffer by using vk_alloc and copying the old array to the new one. After that, we write the timeline_value to the array only if it's not zero, and that's the problem: since we just used vkalloc and memcpy, we don't have any guarantees that the new array will be zero after the 16th element, and if timeline_value is zero we write nothing to that position. Once we start using exec->syncobj_values we have to commit to using it, so the "if (timeline_value)" check near the end of the function has to be changed to "if (exec->syncobj_values)", so we actually set elements after the 16th to zero when they need to be zero. Another approach to fix this would be to memset the new elements once we double syncobj_array_length. In practice, I couldn't find any application or deqp test that used more than 3 elements in exec->syncobj_array_length, and we need more than 16 elements in order to be able to reproduce the bug, so I'm not aware of any real-world bug that goes away with this patch. This issue was found while reading code. If we craft a little Vulkan program that submits a ton of timeline and binary semaphores on vkQueueSubmit, then waits for them, we get the following error without this patch: MESA: error: ../../src/intel/vulkan/anv_batch_chain.c:1910: execbuf2 failed: Invalid argument (VK_ERROR_DEVICE_LOST) v2: Rebase. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703> (cherry picked from commit `ad6a036a68`)	2023-01-26 15:40:31 +00:00
Lionel Landwerlin	d8cf98bcb8	anv: use the null surface with unused push descriptor binding table entries Some binding table entries have been identify as unused in the shaders by the push constant analysis pass. We can just put the null entry in there. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b49b18f0b7` ("anv: reduce BT emissions & surface state writes with push descriptors") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20555> (cherry picked from commit `2d627f28c8`)	2023-01-11 17:44:22 +00:00
Lionel Landwerlin	7d2915088d	anv: return properly typed value for no ubo promoted Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ff91c5ca42` ("anv: add analysis for push descriptor uses and store it in shader cache") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20555> (cherry picked from commit `bbfca4eb92`)	2023-01-11 17:44:22 +00:00
Lionel Landwerlin	b95e5a8cc2	anv: check that push range actually match binding considered We can't just check the load_ubo range is contained in the push entry, we also need to check that the push entry set/binding matches the load_ubo set/binding. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ff91c5ca42` ("anv: add analysis for push descriptor uses and store it in shader cache") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20555> (cherry picked from commit `e2b0086b78`)	2023-01-11 17:44:22 +00:00
Lionel Landwerlin	605feb0281	anv: don't nullify entries We'll use those to fill the push constant addresses, so we can't have them turned to null. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ff91c5ca42` ("anv: add analysis for push descriptor uses and store it in shader cache") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20555> (cherry picked from commit `48bb3df951`)	2023-01-11 17:44:22 +00:00
Felix DeGrood	22f8331320	hasvk: Emit CS stall on INTEL_MEASURE timestamp For INTEL_MEASURE, ensure all prior instructions completed before timestamp taken. Continue to support no CS flush case for Perfetto. CS stall was dropped from pipecontrol when adding u_trace support. Fixes: `cc5843a573` ("anv: implement u_trace support") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20502> (cherry picked from commit `c1c81137d9`)	2023-01-11 17:44:20 +00:00
Felix DeGrood	6893f2b5b4	anv: Emit CS stall on INTEL_MEASURE timestamp For INTEL_MEASURE, ensure all prior instructions completed before timestamp taken. Continue to support no CS flush case for Perfetto. CS stall was dropped from pipecontrol when adding u_trace support. Fixes: `cc5843a573` ("anv: implement u_trace support") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20502> (cherry picked from commit `7f6beb8537`)	2023-01-11 17:44:20 +00:00
Väinö Mäkelä	44bb614c37	intel: Fix a hang caused by invalid dispatch enables on gfx6/7 Because commit `b9403b1c47` moved dispatch enable handling away from the compiler, brw_fs_get_dispatch_enables must be used to ensure valid dispatch enable values. v2: Fix gfx6 build and use brw_fs_get_dispatch_enables for gfx6 in crocus Fixes: `b9403b1c47` ("intel: factor out dispatch PS enabling logic") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20267> (cherry picked from commit `4c986c58b3`)	2023-01-01 17:07:04 +00:00
Lionel Landwerlin	1010e2ca89	anv: handle null push descriptors in deferred optimization Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b49b18f0` ("anv: reduce BT emissions & surface state writes with push descriptors") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20410> (cherry picked from commit `739a08ad23`)	2022-12-29 19:25:30 +00:00
Rohan Garg	fa4a0b7f63	anv: Ensure we clear ANV_PIPE_PSS_STALL_SYNC_BIT on flush Add the PSS stall bit to ANV_PIPE_STALL_BITS so that it get's cleared on flush. Fixes: `f3c62973` ("anv,iris: PSS Stall Sync around color fast clears") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20317> (cherry picked from commit `ad9c0e8cd9`)	2022-12-29 19:25:30 +00:00
Lionel Landwerlin	9c1899e93f	anv: fixup another dirty issue with gpu_memcpy Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20335> (cherry picked from commit `b21cd1ee1b`)	2022-12-29 19:25:29 +00:00
Lionel Landwerlin	a60641d132	anv: disable Wa_1806565034 when robustImageAccess is enabled Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5711 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7859 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20280> (cherry picked from commit `a921486e2a`)	2022-12-14 20:56:54 +00:00
Lionel Landwerlin	fcb34f031c	intel/fs: make Wa_1806565034 conditional to non robust access Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20280> (cherry picked from commit `94bb4a13fa`)	2022-12-14 20:56:54 +00:00
Lionel Landwerlin	b5820a84a2	isl: make Wa_1806565034 conditional to non robust access Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20280> (cherry picked from commit `89a550a37b`)	2022-12-14 20:56:54 +00:00
Peng Huang	3e3def9620	intel: Fix crashes for importing drm buffer image_aspect_to_binding() converts aspect to index by subrracting VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT_EXT, however these enum values are bitfields, not consecutive numbers, so comparing and subtracting them won't work. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20269> (cherry picked from commit `7642f3b99c`)	2022-12-14 20:47:02 +00:00
Lionel Landwerlin	0c7a3133ac	anv: fixup descriptor copies I did not properly understood that we cannot access the views written to the descriptor sets because they might have been destroyed after the write operation and the copy operation is allowed to copy what is invalid data. The shader just can't access it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `03e1e19246` ("anv: Refactor descriptor copy") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20222> (cherry picked from commit `a0991c7c79`)	2022-12-14 20:47:02 +00:00
Iván Briano	dc889d95bc	hasvk: pipelineStageCreationFeedbackCount is allowed to be 0 Fixes: `6601e5d6fc` ("anv: implement VK_EXT_pipeline_creation_feedback") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20216> (cherry picked from commit `68b546ec3d`)	2022-12-14 20:47:02 +00:00
Lionel Landwerlin	bf1d05b8e4	intel/nir/rt: fixup primitive id There is a delta index value in the hit structure, we forgot to add it to the base value. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `0465714790` ("intel/nir/rt: add more helpers for ray queries") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7565 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19346> (cherry picked from commit `6106396825`)	2022-12-14 20:47:02 +00:00
Lionel Landwerlin	68fece9af5	Revert "anv: compile anv_acceleration_structure.c" This reverts commit `74d0be27ae`. Also remove anv_acceleration_structure.c, it was meant to be removed earlier. There was probably a rebase issue somewhere. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20248> (cherry picked from commit `d608706875`)	2022-12-14 20:47:01 +00:00
Tapani Pälli	e17740493f	anv: emit sample mask state independent of fragment stage Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7861 Fixes: `9f6af43743` ("anv: dynamic multisample sample mask") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20221> (cherry picked from commit `68ef0d8448`)	2022-12-14 20:47:01 +00:00
Tapani Pälli	5b6718728b	intel/fs: implement Wa_14017989577 The first instruction of any kernel should have non-zero emask. This restriction needs to be obeyed to avoid GPU hangs. Patch adds a function to insert dummy mov as first instruction to make sure this requirement is fulfilled. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20194> (cherry picked from commit `bc4b7de0d0`)	2022-12-14 20:47:01 +00:00
Kenneth Graunke	d936394cf4	intel/compiler: Set NoMask on cr0 access for float controls mode This is trying to clear a bit in the control register. However, it's executing with whatever channel mask happens to be active. Typically this is the one at the start of the program, so at least some channels will be active. Typically the first channel will be active due to packed dispatch, but that's not always guaranteed. Without NoMask, the float controls writes may randomly not happen. Recent GPUs also seem to have a hang issue when the first instruction in the shader doesn't have any active channels. Having an instruction with NoMask at the start of the program works around the issue. See HSD bug 14017989577. In our case, the float controls preamble was breaking that restriction every time, causing us to run into this problem frequently. Thanks to Tapani Pälli for finding this hang issue, and Francisco Jerez and Lionel Landwerlin for helping pinpoint this issue during review of a workaround patch in !20194. Fixes GPU hangs in Elder Scrolls Online, Witcher 3, and likely more. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7639 Fixes: `9da56ffc52` ("i965/fs: add emit_shader_float_controls_execution_mode() and aux functions") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20214> (cherry picked from commit `bafbe7c23a`)	2022-12-14 20:47:01 +00:00
Otavio Pontes	d4d1f52284	anv/hasvk: Clamping Scissor Rect values in a valid range On cmd_buffer_emit_scissor(), if VkViewport height or width are set to a value lower than 1.0, y_max or x_max can be attributed negative values, causing an overflow. That leads to ScissorRectangleYMax or ScissorRectangleXMax to be set to values on an unsupported range. Clamping x_max and y_max in the valid range solves the problem. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7471 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20200> (cherry picked from commit `2e775b8bdb`)	2022-12-14 20:47:01 +00:00
Lionel Landwerlin	7b5ba2d363	intel: add missing restriction on fragment simd dispatch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7755 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20169> (cherry picked from commit `d4cd33630a`)	2022-12-14 20:47:01 +00:00
Lionel Landwerlin	e2fc0b33cd	intel: factor out dispatch PS enabling logic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20169> (cherry picked from commit `b9403b1c47`)	2022-12-14 20:47:01 +00:00
Sviatoslav Peleshko	d43425f7e0	anv: Defer flushing PIPE_CONTROL bits forbidden in CCS while in GPGPU mode Fixes: `313aeee8` ("anv: Use pending pipe control mechanism in flush_pipeline_select() ") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7816 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20124> (cherry picked from commit `77ecf9149c`)	2022-12-14 20:47:00 +00:00
Lionel Landwerlin	a17409115a	anv: correctly predicate ray tracing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7479fe6ae0` ("anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011> (cherry picked from commit `af3f7948d1`)	2022-12-14 20:47:00 +00:00
Lionel Landwerlin	b81a29146b	isl: don't report I915_FORMAT_MOD_Y_TILED_CCS on Gfx8 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852> (cherry picked from commit `0626b68c88`)	2022-12-14 20:47:00 +00:00
Marcin Ślusarz	2615b5a354	intel/compiler: user payload starts after TUE header & its padding All data written by the user are offset by TUE header size. Without this patch we copy the correct amount of user data, but both "from" and "to" offsets are wrong. Fixes: `37e78803d7` ("intel/compiler: use nir_lower_task_shader pass") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409> (cherry picked from commit `db0e6f9a07`)	2022-12-14 20:47:00 +00:00
Marcin Ślusarz	20ba98ab2f	intel/compiler: adjust [store\|load]_task_payload.base too Base also needs to be converted from bytes to words. Fixes: `c36ae42e4c` ("intel/compiler: Use nir_var_mem_task_payload") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409> (cherry picked from commit `7aaafaa8ae`)	2022-12-14 20:47:00 +00:00
Martin Roukala (né Peres)	ac18e931fa	Revert "glx: Fix drawable refcounting for naked Windows" This reverts commit `768238fdc0` which is not only leading to memory leaks, but also reportedly breaks KDE pretty badly. Fixes: #7674, #7435 Acked-by: Michel Dänzer <mdaenzer@redhat.com> Acked-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19972> (cherry picked from commit `0cee008fee`)	2022-11-30 21:12:43 +00:00
Lionel Landwerlin	a4eeeb8f78	anv: generate correct addresses for state pool offsets Fixes a number of CTS patterns on DG2 : - dEQP-VK.dynamic_rendering.primary_cmd_buff.random* - dEQP-VK.draw.secondary_cmd - dEQP-VK.dynamic_rendering.secondary_cmd - dEQP-VK.geometry.secondary_cmd_buffer - dEQP-VK.multiview.secondary_cmd* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9c1c1888d9` ("intel/fs: put scratch surface in the surface state heap") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19946> (cherry picked from commit `9bb055ff5d`)	2022-11-23 19:12:00 +00:00
Lionel Landwerlin	532521adbc	blorp: support negative offsets in addresses Similar to anv_address Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9c1c1888d9` ("intel/fs: put scratch surface in the surface state heap") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19946> (cherry picked from commit `20e8e1eb06`)	2022-11-23 19:12:00 +00:00
Lionel Landwerlin	ac303c5d5b	intel/fs: improve Wa_22013689345 workaround The initial implementation is a pretty big hammer. Implement the HW recommendation to minimize cases in which we need a fence. This improves by 10FPS on some of the Sascha Willems RT demos. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `6031ad4bf6` ("intel/fs: Add Wa_22013689345") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19322> (cherry picked from commit `945637514e`)	2022-11-23 19:12:00 +00:00
Lionel Landwerlin	b92f135377	anv: fixup context initialization on DG2 Fixing a typo :( Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `507a86e131` ("anv: ensure CPS is initialized when KHR_fragment_shading_rate is disabled") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19922> (cherry picked from commit `f7d6c6e1ed`)	2022-11-23 19:12:00 +00:00
Lionel Landwerlin	77a9b631db	anv: ensure CPS is initialized when KHR_fragment_shading_rate is disabled We need to set CPS_MODE_NONE when no per coarse pixel dispatch. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `231651fd89` ("anv: implement VK_KHR_fragment_shading_rate") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19867> (cherry picked from commit `507a86e131`)	2022-11-23 19:11:59 +00:00
Lionel Landwerlin	46517e0b65	anv: fix 3d state initialization We missed a couple of restriction leading to inconsistent 3d pipeline state. It is mostly noticeable when doing a multiple sample dispatch as the verify first 3d operation. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7531 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19867> (cherry picked from commit `62f12c2dad`)	2022-11-23 19:11:59 +00:00
Lionel Landwerlin	d567ac1dc8	intel/fs: put scratch surface in the surface state heap In `4ceaed7839` we made scratch surface state allocations part of the internal heap (mapped to STATE_BASE_ADDRESS::SurfaceStateBaseAddress) so that it doesn't uses slots in the application's expected 1M descriptors (especially with vkd3d-proton). But all our compiler code relies on BSS (STATE_BASE_ADDRESS::BindlessSurfaceStateBaseAddress). The additional issue is that there is only 26bits of surface offset available in CS instruction (CFE_STATE, 3DSTATE_VS, etc...) for scratch surfaces. So we need the drivers to put the scratch surfaces in the first chunk of STATE_BASE_ADDRESS::SurfaceStateBaseAddress (hence all the driver changes). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4ceaed7839` ("anv: split internal surface states from descriptors") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7687 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19727> (cherry picked from commit `9c1c1888d9`)	2022-11-23 19:11:59 +00:00
Lionel Landwerlin	d6b2c77fac	intel/perf: fix B/C counters accumulation in non query mode When we're not using queries, all the counters from the MI_REPORT_PERF_COUNT are available. This is the case when using perfetto with the global pps datasource that capture global counter values. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8750f43a90` ("intel/perf: add performance query layout using MI_SRM") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18893> (cherry picked from commit `61fef1ed72`)	2022-11-23 19:11:58 +00:00
Lionel Landwerlin	84ada12002	intel/perf: allocate cleared counter infos This array of structure needs to be initialized to 0 as it contains a bitset we don't explicitly clear. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `3144bc1d33` ("intel/perf: move query_mask and location out of gen_perf_query_counter") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18893> (cherry picked from commit `e754bf6be4`)	2022-11-23 19:11:58 +00:00
Lionel Landwerlin	9476566032	anv: get rid of ilog2_round_up __builtin_clz(value - 1) is undefined for with value=1 (because __builtin_clz(0) is undefined). Because we set rt_pipeline->stack_size = 1 when a ray tracing pipeline doesn't need any stack allocation to differentiate from a dynamic size (rt_pipeline->stack_size = 0) we can run into this undefinied behavior issue. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f68d64dac0` ("anv: Add support for vkCmdSetRayTracingPipelineStackSizeKHR") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19781> (cherry picked from commit `440da44a84`)	2022-11-23 19:11:58 +00:00

1 2 3 4 5 ...

8649 commits