fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-23 10:48:08 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	9c1899e93f	anv: fixup another dirty issue with gpu_memcpy Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20335> (cherry picked from commit `b21cd1ee1b`)	2022-12-29 19:25:29 +00:00
Lionel Landwerlin	a60641d132	anv: disable Wa_1806565034 when robustImageAccess is enabled Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5711 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7859 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20280> (cherry picked from commit `a921486e2a`)	2022-12-14 20:56:54 +00:00
Lionel Landwerlin	fcb34f031c	intel/fs: make Wa_1806565034 conditional to non robust access Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20280> (cherry picked from commit `94bb4a13fa`)	2022-12-14 20:56:54 +00:00
Lionel Landwerlin	b5820a84a2	isl: make Wa_1806565034 conditional to non robust access Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20280> (cherry picked from commit `89a550a37b`)	2022-12-14 20:56:54 +00:00
Peng Huang	3e3def9620	intel: Fix crashes for importing drm buffer image_aspect_to_binding() converts aspect to index by subrracting VK_IMAGE_ASPECT_MEMORY_PLANE_0_BIT_EXT, however these enum values are bitfields, not consecutive numbers, so comparing and subtracting them won't work. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20269> (cherry picked from commit `7642f3b99c`)	2022-12-14 20:47:02 +00:00
Lionel Landwerlin	0c7a3133ac	anv: fixup descriptor copies I did not properly understood that we cannot access the views written to the descriptor sets because they might have been destroyed after the write operation and the copy operation is allowed to copy what is invalid data. The shader just can't access it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `03e1e19246` ("anv: Refactor descriptor copy") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20222> (cherry picked from commit `a0991c7c79`)	2022-12-14 20:47:02 +00:00
Iván Briano	dc889d95bc	hasvk: pipelineStageCreationFeedbackCount is allowed to be 0 Fixes: `6601e5d6fc` ("anv: implement VK_EXT_pipeline_creation_feedback") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20216> (cherry picked from commit `68b546ec3d`)	2022-12-14 20:47:02 +00:00
Lionel Landwerlin	bf1d05b8e4	intel/nir/rt: fixup primitive id There is a delta index value in the hit structure, we forgot to add it to the base value. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `0465714790` ("intel/nir/rt: add more helpers for ray queries") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7565 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19346> (cherry picked from commit `6106396825`)	2022-12-14 20:47:02 +00:00
Lionel Landwerlin	68fece9af5	Revert "anv: compile anv_acceleration_structure.c" This reverts commit `74d0be27ae`. Also remove anv_acceleration_structure.c, it was meant to be removed earlier. There was probably a rebase issue somewhere. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20248> (cherry picked from commit `d608706875`)	2022-12-14 20:47:01 +00:00
Tapani Pälli	e17740493f	anv: emit sample mask state independent of fragment stage Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7861 Fixes: `9f6af43743` ("anv: dynamic multisample sample mask") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20221> (cherry picked from commit `68ef0d8448`)	2022-12-14 20:47:01 +00:00
Tapani Pälli	5b6718728b	intel/fs: implement Wa_14017989577 The first instruction of any kernel should have non-zero emask. This restriction needs to be obeyed to avoid GPU hangs. Patch adds a function to insert dummy mov as first instruction to make sure this requirement is fulfilled. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20194> (cherry picked from commit `bc4b7de0d0`)	2022-12-14 20:47:01 +00:00
Kenneth Graunke	d936394cf4	intel/compiler: Set NoMask on cr0 access for float controls mode This is trying to clear a bit in the control register. However, it's executing with whatever channel mask happens to be active. Typically this is the one at the start of the program, so at least some channels will be active. Typically the first channel will be active due to packed dispatch, but that's not always guaranteed. Without NoMask, the float controls writes may randomly not happen. Recent GPUs also seem to have a hang issue when the first instruction in the shader doesn't have any active channels. Having an instruction with NoMask at the start of the program works around the issue. See HSD bug 14017989577. In our case, the float controls preamble was breaking that restriction every time, causing us to run into this problem frequently. Thanks to Tapani Pälli for finding this hang issue, and Francisco Jerez and Lionel Landwerlin for helping pinpoint this issue during review of a workaround patch in !20194. Fixes GPU hangs in Elder Scrolls Online, Witcher 3, and likely more. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7639 Fixes: `9da56ffc52` ("i965/fs: add emit_shader_float_controls_execution_mode() and aux functions") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20214> (cherry picked from commit `bafbe7c23a`)	2022-12-14 20:47:01 +00:00
Otavio Pontes	d4d1f52284	anv/hasvk: Clamping Scissor Rect values in a valid range On cmd_buffer_emit_scissor(), if VkViewport height or width are set to a value lower than 1.0, y_max or x_max can be attributed negative values, causing an overflow. That leads to ScissorRectangleYMax or ScissorRectangleXMax to be set to values on an unsupported range. Clamping x_max and y_max in the valid range solves the problem. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7471 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20200> (cherry picked from commit `2e775b8bdb`)	2022-12-14 20:47:01 +00:00
Lionel Landwerlin	7b5ba2d363	intel: add missing restriction on fragment simd dispatch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7755 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20169> (cherry picked from commit `d4cd33630a`)	2022-12-14 20:47:01 +00:00
Lionel Landwerlin	e2fc0b33cd	intel: factor out dispatch PS enabling logic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20169> (cherry picked from commit `b9403b1c47`)	2022-12-14 20:47:01 +00:00
Sviatoslav Peleshko	d43425f7e0	anv: Defer flushing PIPE_CONTROL bits forbidden in CCS while in GPGPU mode Fixes: `313aeee8` ("anv: Use pending pipe control mechanism in flush_pipeline_select() ") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7816 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20124> (cherry picked from commit `77ecf9149c`)	2022-12-14 20:47:00 +00:00
Lionel Landwerlin	a17409115a	anv: correctly predicate ray tracing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `7479fe6ae0` ("anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20011> (cherry picked from commit `af3f7948d1`)	2022-12-14 20:47:00 +00:00
Lionel Landwerlin	b81a29146b	isl: don't report I915_FORMAT_MOD_Y_TILED_CCS on Gfx8 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19852> (cherry picked from commit `0626b68c88`)	2022-12-14 20:47:00 +00:00
Marcin Ślusarz	2615b5a354	intel/compiler: user payload starts after TUE header & its padding All data written by the user are offset by TUE header size. Without this patch we copy the correct amount of user data, but both "from" and "to" offsets are wrong. Fixes: `37e78803d7` ("intel/compiler: use nir_lower_task_shader pass") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409> (cherry picked from commit `db0e6f9a07`)	2022-12-14 20:47:00 +00:00
Marcin Ślusarz	20ba98ab2f	intel/compiler: adjust [store\|load]_task_payload.base too Base also needs to be converted from bytes to words. Fixes: `c36ae42e4c` ("intel/compiler: Use nir_var_mem_task_payload") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409> (cherry picked from commit `7aaafaa8ae`)	2022-12-14 20:47:00 +00:00
Martin Roukala (né Peres)	ac18e931fa	Revert "glx: Fix drawable refcounting for naked Windows" This reverts commit `768238fdc0` which is not only leading to memory leaks, but also reportedly breaks KDE pretty badly. Fixes: #7674, #7435 Acked-by: Michel Dänzer <mdaenzer@redhat.com> Acked-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19972> (cherry picked from commit `0cee008fee`)	2022-11-30 21:12:43 +00:00
Lionel Landwerlin	a4eeeb8f78	anv: generate correct addresses for state pool offsets Fixes a number of CTS patterns on DG2 : - dEQP-VK.dynamic_rendering.primary_cmd_buff.random* - dEQP-VK.draw.secondary_cmd - dEQP-VK.dynamic_rendering.secondary_cmd - dEQP-VK.geometry.secondary_cmd_buffer - dEQP-VK.multiview.secondary_cmd* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9c1c1888d9` ("intel/fs: put scratch surface in the surface state heap") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19946> (cherry picked from commit `9bb055ff5d`)	2022-11-23 19:12:00 +00:00
Lionel Landwerlin	532521adbc	blorp: support negative offsets in addresses Similar to anv_address Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9c1c1888d9` ("intel/fs: put scratch surface in the surface state heap") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19946> (cherry picked from commit `20e8e1eb06`)	2022-11-23 19:12:00 +00:00
Lionel Landwerlin	ac303c5d5b	intel/fs: improve Wa_22013689345 workaround The initial implementation is a pretty big hammer. Implement the HW recommendation to minimize cases in which we need a fence. This improves by 10FPS on some of the Sascha Willems RT demos. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `6031ad4bf6` ("intel/fs: Add Wa_22013689345") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19322> (cherry picked from commit `945637514e`)	2022-11-23 19:12:00 +00:00
Lionel Landwerlin	b92f135377	anv: fixup context initialization on DG2 Fixing a typo :( Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `507a86e131` ("anv: ensure CPS is initialized when KHR_fragment_shading_rate is disabled") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19922> (cherry picked from commit `f7d6c6e1ed`)	2022-11-23 19:12:00 +00:00
Lionel Landwerlin	77a9b631db	anv: ensure CPS is initialized when KHR_fragment_shading_rate is disabled We need to set CPS_MODE_NONE when no per coarse pixel dispatch. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `231651fd89` ("anv: implement VK_KHR_fragment_shading_rate") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19867> (cherry picked from commit `507a86e131`)	2022-11-23 19:11:59 +00:00
Lionel Landwerlin	46517e0b65	anv: fix 3d state initialization We missed a couple of restriction leading to inconsistent 3d pipeline state. It is mostly noticeable when doing a multiple sample dispatch as the verify first 3d operation. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7531 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19867> (cherry picked from commit `62f12c2dad`)	2022-11-23 19:11:59 +00:00
Lionel Landwerlin	d567ac1dc8	intel/fs: put scratch surface in the surface state heap In `4ceaed7839` we made scratch surface state allocations part of the internal heap (mapped to STATE_BASE_ADDRESS::SurfaceStateBaseAddress) so that it doesn't uses slots in the application's expected 1M descriptors (especially with vkd3d-proton). But all our compiler code relies on BSS (STATE_BASE_ADDRESS::BindlessSurfaceStateBaseAddress). The additional issue is that there is only 26bits of surface offset available in CS instruction (CFE_STATE, 3DSTATE_VS, etc...) for scratch surfaces. So we need the drivers to put the scratch surfaces in the first chunk of STATE_BASE_ADDRESS::SurfaceStateBaseAddress (hence all the driver changes). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4ceaed7839` ("anv: split internal surface states from descriptors") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7687 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19727> (cherry picked from commit `9c1c1888d9`)	2022-11-23 19:11:59 +00:00
Lionel Landwerlin	d6b2c77fac	intel/perf: fix B/C counters accumulation in non query mode When we're not using queries, all the counters from the MI_REPORT_PERF_COUNT are available. This is the case when using perfetto with the global pps datasource that capture global counter values. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8750f43a90` ("intel/perf: add performance query layout using MI_SRM") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18893> (cherry picked from commit `61fef1ed72`)	2022-11-23 19:11:58 +00:00
Lionel Landwerlin	84ada12002	intel/perf: allocate cleared counter infos This array of structure needs to be initialized to 0 as it contains a bitset we don't explicitly clear. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `3144bc1d33` ("intel/perf: move query_mask and location out of gen_perf_query_counter") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18893> (cherry picked from commit `e754bf6be4`)	2022-11-23 19:11:58 +00:00
Lionel Landwerlin	9476566032	anv: get rid of ilog2_round_up __builtin_clz(value - 1) is undefined for with value=1 (because __builtin_clz(0) is undefined). Because we set rt_pipeline->stack_size = 1 when a ray tracing pipeline doesn't need any stack allocation to differentiate from a dynamic size (rt_pipeline->stack_size = 0) we can run into this undefinied behavior issue. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `f68d64dac0` ("anv: Add support for vkCmdSetRayTracingPipelineStackSizeKHR") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19781> (cherry picked from commit `440da44a84`)	2022-11-23 19:11:58 +00:00
Benjamin Tissoires	76dc28e3ff	CI: convert to use the new S3 server instead of the legacy minio We don't need to login anymore, but we can't use plain minio commands now. `ci-fairy` got a helper as `s3cp` to keep an almost identical API. Solved Conflicts: .gitlab-ci/common/init-stage2.sh .gitlab-ci/container/lava_build.sh .gitlab-ci/prepare-artifacts.sh src/amd/ci/traces-amd.yml src/freedreno/ci/traces-freedreno.yml src/gallium/frontends/lavapipe/ci/traces-lavapipe.yml Signed-off-by: Benjamin Tissoires <benjamin.tissoires@gmail.com> (cherry picked from commit `67cee534a8`) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19734>	2022-11-17 14:05:04 +00:00
Caio Oliveira	be102fede4	intel/compiler: Fix missing tie-breaker in brw_nir_analyze_ubo_ranges() ordering code Per Ken suggestion, use ascending order for the start offset. Fixes: `6d28c6e52c` ("i965: Select ranges of UBO data to be uploaded as push constants.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19731> (cherry picked from commit `494e2edb90`)	2022-11-17 14:05:03 +00:00
Caio Oliveira	6e4a46e2a8	intel/compiler: Fix dynarray usage in intel_clc The code builds up the dynamic array of objects (spirv_objs) and collect pointers to each of them into another dynamic array (spirv_ptr_objs). If the growth of the first array cause a reallocation, it is possible that the previous pointers end up invalid. Fixes: `77e929a527` ("intel/clc: allow multiple CL files to be compiled together") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19730> (cherry picked from commit `9fd1d47aa0`)	2022-11-17 14:05:03 +00:00
Lionel Landwerlin	97a017ed25	anv: bump pool bucket max allocation size Age of Empire IV generates a shader of ~2.3Mb on DG2 which is above the limit we currently have. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19669> (cherry picked from commit `ae76bba34a`)	2022-11-17 14:05:03 +00:00
Tapani Pälli	fc57b9ac44	anv: setup stage bitmask for Wa_22011440098 Fixes: `40b66a4499` ("anv, iris: Add Wa_22011440098 for DG2") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19636> (cherry picked from commit `ecd4517560`)	2022-11-17 14:05:03 +00:00
Lionel Landwerlin	d7ca6ccee2	anv: split internal surface states from descriptors On Intel HW we use the same mechanism for internal operations surfaces as well as application surfaces (VkDescriptor). This change splits the surface pool in 2, one part dedicated to internal allocations, the other to application VkDescriptors. To do so, the STATE_BASE_ADDRESS::SurfaceStateBaseAddress points to a 4Gb area, with the following layout : - 1Gb of binding table pool - 2Gb of internal surface states - 1Gb of bindless surface states That way any entry from the binding table can refer to both internal & bindless surface states but none of the driver allocations interfere with the allocation of the application. Based off a change from Sviatoslav Peleshko. v2: Allocate image view null surface state from bindless heap (Sviatoslav) Removed debug stuff (Sviatoslav) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7110 Cc: mesa-stable Tested-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19275> (cherry picked from commit `4ceaed7839`)	2022-11-17 14:05:03 +00:00
Lionel Landwerlin	91dfc02570	anv: fixup invalid enum for nir environment Also switching away from PIPE_ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8c4c4c3ee1` ("anv: Add softtp64 workaround") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19638> (cherry picked from commit `68fd9d2829`)	2022-11-17 14:05:02 +00:00
Jason Ekstrand	7701bf1228	intel: Don't cross DWORD boundaries with byte scratch load/store The back-end swizzles dwords so that our indirect scratch messages match the memory layout of spill/fill messages for better cache coherency. The swizzle happens at a DWORD granularity. If a read or write crosses a DWORD boundary, the first bit will get correctly swizzled but whatever piece lands in the next dword will not because the scatter instructions assume sequential addresses for all bytes. For DWORD writes, this is handled naturally as part of scalarizing. For smaller writes, we need to be sure that a single write never escapes a dword. Fixes: `fd04f858b0` ("intel/nir: Don't try to emit vector load_scratch instructions") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7364 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19580> (cherry picked from commit `25c180b509`)	2022-11-09 21:22:06 +00:00
Jason Ekstrand	d580ab8898	intel/lower_mem_access_bit_sizes: Compute alignments automatically Because dup_mem_intrinsic() retains the SSA offset from the original intrinsic and only modifies it by adding a constant, we can compute the alignment based on the original alignment and the constant offset. This is both easier and more accurate. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19580> (cherry picked from commit `85685cf932`)	2022-11-09 21:22:06 +00:00
Lionel Landwerlin	e54150d6e1	anv: fix missing VkPhysicalDeviceExtendedDynamicState3PropertiesEXT handling Fixes: `13c422e1b2` ("anv: toggle on EXT_extended_dynamic_state3") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19573> (cherry picked from commit `97b3dd34c1`)	2022-11-09 21:22:06 +00:00
Ian Romanick	87e7794d7b	intel/fs: Fix constant propagation into 32x16 integer multiplication Don't copy propagate the constant in situations like mov(8) g8<1>D 0x7fffffffD mul(8) g16<1>D g8<8,8,1>D g15<16,8,2>W On platforms that only have a 32x16 multiplier, this will result in lowering the multiply to mul(8) g15<1>D g14<8,8,1>D 0xffffUW mul(8) g16<1>D g14<8,8,1>D 0x7fffUW add(8) g15.1<2>UW g15.1<16,8,2>UW g16<16,8,2>UW On Gfx8 and Gfx9, which have the full 32x32 multiplier, it results in mul(8) g16<1>D g15<16,8,2>W 0x7fffffffD Volume 2a of the Skylake PRM says: When multiplying a DW and any lower precision integer, the DW operand must on src0. See also https://gitlab.freedesktop.org/mesa/crucible/-/merge_requests/104. Previous to INTEL_shader_integer_functions2 (in Vulkan or OpenGL), I don't think it would be possible to create a situation where this could occur. I discovered this via some optimizations that can determine that the non-constant source must be able to fit in 16-bits. The case listed above came from piglit's "ext_transform_feedback-order arrays points" with those optimizations in place. No shader-db or fossil-db changes on any Intel platform. Fixes: `de6c0f8487` ("intel/fs: Implement support for NIR opcodes for INTEL_shader_integer_functions2") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17718> (cherry picked from commit `db20412168`)	2022-11-09 21:22:06 +00:00
Mauro Rossi	15aae04df5	hasvk: fix android build and reported API version anv_device.c for vulkan.intel_hasvk requires changes to be compiled and behave correctly for android target Fixes the following building error: FAILED: src/intel/vulkan_hasvk/libanv_hasvk_common.a.p/anv_device.c.o ... ../src/intel/vulkan_hasvk/anv_device.c:143:19: error: use of undeclared identifier 'ANV_API_VERSION_1_3' *pApiVersion = ANV_API_VERSION_1_3; ^ ../src/intel/vulkan_hasvk/anv_device.c:1822:44: error: use of undeclared identifier 'ANV_API_VERSION_1_3' .apiVersion = pdevice->use_softpin ? ANV_API_VERSION_1_3 : ANV_API_VERSION_1_2, ^ ../src/intel/vulkan_hasvk/anv_device.c:1822:66: error: use of undeclared identifier 'ANV_API_VERSION_1_2' .apiVersion = pdevice->use_softpin ? ANV_API_VERSION_1_3 : ANV_API_VERSION_1_2, ^ 3 errors generated. Cc: "22.3" mesa-stable Fixes: `00eefdc` ("hasvk: stop advertising Vk 1.3 on non-softpin") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19452> (cherry picked from commit `814b822fe0`)	2022-11-09 21:22:05 +00:00
Lionel Landwerlin	32a7d9b892	anv: Reduce RHWO optimization (Wa_1508744258) Implement Wa_1508744258: Disable RHWO by setting 0x7010[14] by default except during resolve pass. Disable the RCC RHWO optimization at all times except when resolving single sampled color surfaces. v2: Move stalling to genX(cmd_buffer_apply_pipe_flushes) for clarity (Mark) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19450> (cherry picked from commit `ba0336ab3f`)	2022-11-09 21:22:05 +00:00
Marcin Ślusarz	dcaaeb56ef	anv: program 3DSTATE_MESH_DISTRIB with the recommended values It improves performance of vk_meshlet_cadscene on A770. Fixes: `f083df8710` ("anv: update task/mesh distribution with the recommended values") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19412>	2022-11-02 08:56:53 +00:00
Marcin Ślusarz	d1d2dee970	anv: set 3DSTATE_[MESH\|TASK]_CONTROL.MaximumNumberofThreadGroups Documentation is worded in a confusing way, which may be understood that we don't have to set this field to get good results. MESH part of this commit improves performance of vk_meshlet_cadscene by a factor of 2 on A380. Fixes: `ef04caea9b` ("anv: Implement Mesh Shading pipeline") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19412>	2022-11-02 08:56:53 +00:00
Marcin Ślusarz	11612d81b7	intel/genxml: fix width of 3DSTATE_TASK_CONTROL.MaximumNumberofThreadGroups Fixes: `3567d47f3e` ("intel/genxml: Inline the BODY structs into the instructions") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19412>	2022-11-02 08:56:53 +00:00
Illia Abernikhin	aa4ac5ff8b	utils: Merge util/debug.* into util/u_debug.* and remove util/debug.* Rename env_var_as_unsigned() -> debug_get_num_option(), because duplicate Rename env_var_as_bool() -> debug_get_bool_option(), because duplicate Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7177 Signed-off-by: Illia Abernikhin <illia.abernikhin@globallogic.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19336>	2022-11-02 07:25:39 +00:00
Kenneth Graunke	fde99747e9	nir: Drop infer_non_readable option for nir_opt_access() Everybody sets it to true now, and the only reason for the option to exist was to work around a bug that's now been fixed. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19162>	2022-11-02 03:42:04 +00:00
Kenneth Graunke	88756cee8d	intel/compiler: Run nir_opt_large_constants before scalarizing consts nir_opt_large_constants balks at seeing a store_deref of a variable where the source is a vecN operation of multiple load_consts, and thinks that isn't a constant, so it should not bother promoting it. Unfortunately, we were running nir_lower_load_const_to_scalar before nir_opt_large_constants, so this prevented a ton of constant promotion. This commit /used to help/ some shaders in shader-db. Presumably since !16770 landed, those shaders were already helped. Currently ther are no shader-db changes on any Intel platform. Fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141998227 -> 141421756 (-0.4%) Instructions helped: 12515 Instructions hurt: 237 SENDs in all programs: 7437925 -> 7468033 (+0.4%) SENDs hurt: 12806 Cycles in all programs: 9161655753 -> 9132869800 (-0.3%) Cycles helped: 10163 Cycles hurt: 2637 Spills in all programs: 19977 -> 18678 (-6.5%) Spills helped: 384 Spills hurt: 40 Fills in all programs: 32863 -> 31396 (-4.5%) Fills helped: 385 Fills hurt: 42 Lost: 1 Lots of Shadow of the Tomb Raider fragment shaders and Batman Arkham Origins vertex shaders were hurt for SENDs in this commit. A couple Aztec Ruins compute shaders and Spaceship shaders (multiple stages) were also hurt. All of the shaders hurt for spills or fills were Spaceship compute shaders. Nearly all of the shaders helped were Shadow of the Tomb Raider fragmenet shaders. One Spaceship shader was reall, REALLY helped: Spills helped fossils/fossil-db/Spaceship.run.9f90a2a226fcc57f.1.foz/0b507d3abe2e3c28/compute: 321 -> 13 (-96.0%) Fills helped fossils/fossil-db/Spaceship.run.9f90a2a226fcc57f.1.foz/0b507d3abe2e3c28/compute: 279 -> 21 (-92.5%) Overall this seems like an improvement, but we may want to actually run these few benchmarks before landing. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16539>	2022-11-01 14:55:21 -07:00

1 2 3 4 5 ...

8630 commits