fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 09:08:07 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	25cb4805f5	intel/devinfo: initialize pci_device_id with from_pci_id() Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21949>	2023-03-23 08:08:49 +00:00
Lionel Landwerlin	19c9391a2c	intel/devinfo: dedicated entries for XeHP Also fixing the max URB entries for VS stage. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Chuansheng Liu <chuansheng.liu@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21949>	2023-03-23 08:08:49 +00:00
Lionel Landwerlin	de5ee891f0	intel/dev: use generated WA helpers for Wa_22012575642 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21949>	2023-03-23 08:08:49 +00:00
Lionel Landwerlin	9b1660c727	intel/devinfo: printout URB entries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21949>	2023-03-23 08:08:49 +00:00
Lionel Landwerlin	a42a5bf87e	intel/devinfo: add an option to pick platform to print Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21949>	2023-03-23 08:08:49 +00:00
Tapani Pälli	6538c5bcd4	intel/fs: restore message layout changes for cube array This reverts commit `bc04e2daca` that handled the change as a WA while this is about a new feature, change done in message layout. Patch also changes the original comment to not refer to Wa but bspec page. Fixes: `bc04e2daca` ("intel/fs: use generated helpers for Wa_1209978020 / Wa_18012201914") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22068>	2023-03-22 20:18:11 +00:00
Oleksii Bozhenko	3d2d4728aa	Move combining clip and cull optimization before linking As far gl_nir_link_glsl fills xfb data we should do it after lowering clip and cull in order to get correct locations. Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7152 Signed-off-by: Oleksii Bozhenko <oleksii.bozhenko@globallogic.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21625>	2023-03-22 19:01:40 +00:00
Rohan Garg	5e8866a35a	anv,hasvk: cleanup unused enum Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22070>	2023-03-22 15:00:16 +00:00
Jason Ekstrand	87efb9c3b3	intel/isl: Support Yf/Ys/Tile-64 in isl_surf_get_image_offset_sa All that's really needed here is to handle the array offsetting by using an Z or array offset instead of the Y offset. This patch originally changed get_image_offset_sa_gfx9_1d(), but since we only use linear with the 1d case, it was dropped. Rework: * Jordan: Include ISL_TILING_64 as well * Jordan: Drop change to get_image_offset_sa_gfx9_1d as recommended by Nanley Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21113>	2023-03-22 08:32:52 +00:00
Alyssa Rosenzweig	e80f209df9	blorp,anv,hasvk: Use umod_imm Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22010>	2023-03-22 06:18:18 +00:00
Emma Anholt	dfec80aed1	ci/hasvk: Update some xfails from the 8-sample fast clear disable. Fixes: `e509afacf3` ("hasvk: Disable non-zero fast clears for 8xMSAA images") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22039>	2023-03-21 23:46:13 +00:00
Emma Anholt	f2c356a095	ci/iris: Update more manual job xfails from the Wayland build change. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22039>	2023-03-21 23:46:13 +00:00
Tapani Pälli	c34916f841	anv: implement occlusion query related Wa_14017076903 Fixes artifacts on some games that relied on occlusion query results when no PS or depth buffers are bound. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21721>	2023-03-21 12:56:51 +00:00
Lionel Landwerlin	957186102f	anv: report shader max dispatch width in pipeline props Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22014>	2023-03-21 11:53:04 +00:00
Lionel Landwerlin	2acc2f18ea	intel/compiler: report max dispatch width statistic Most tools looking at shader stats assume that there is only a single resulting binary shader out of a single input. On Intel HW this is not always the case. So having a statistic on each variant that reports the maximum dispatch width helps showing improvement on a single shader in terms of how large we manage to compile it. For shaders that can be compiled in multiple SIMD width (like fragment shaders), this will report the maximum dispatch width in the statistics of each variants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22014>	2023-03-21 11:53:04 +00:00
Faith Ekstrand	92ea49edcb	anv: Implement VK_KHR_map_memory2 Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22031>	2023-03-20 23:11:09 +00:00
Faith Ekstrand	f4a5b2d59e	anv: Limit memory maps to the client-allocated size No need to expose extra padding or CCS data to the client map. Now that we have the data, we can also make the BindBufferMemory asserts a bit more accurate. Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22031>	2023-03-20 23:11:09 +00:00
José Roberto de Souza	491887c9f2	intel: Add TODO about removal of 2Mb alignment in i915 Xe kmd don't suffer this yet because it still lacks MTL support. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21972>	2023-03-20 17:18:04 +00:00
José Roberto de Souza	96302900aa	anv: Apply memory alignment requirements in Xe kmd Without alignment vm bind will fail and during gem buffer creation size also need to be aligned otherwise the range in vm bind can be bigger than allocated size for smem. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21972>	2023-03-20 17:18:04 +00:00
José Roberto de Souza	7dc8474c3b	intel: Set mem_alignment in Xe kmd Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21972>	2023-03-20 17:18:04 +00:00
José Roberto de Souza	bfc1782ad6	anv: Use intel_device_info memory alignment It was also necessary to initialize mem_alignment in the tests otherwise vma allocation would fail with stubs. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21972>	2023-03-20 17:18:04 +00:00
José Roberto de Souza	2ab3d5f436	intel: Move memory aligment information to intel_device_info This same information is also used in ANV, so intel_device_info is a better place to have it. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21972>	2023-03-20 17:18:03 +00:00
David Heidelberg	3823d4696a	ci/intel: add dEQP-EGL.functional.wide_color.window_fp16_default_colorspace flake Occasionally flake since Wayland got enabled. Acked-by: Guilherme Gallo <guilherme.gallo@collabora.com> Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21990>	2023-03-18 14:20:44 +01:00
Iván Briano	4dd81b4e2f	intel/fs: handle interpolation modes for at_sample and at_offset too Fixes dEQP-VK.draw..linear_interpolation. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19647>	2023-03-18 10:18:15 +00:00
Kenneth Graunke	b6878d456f	st/mesa, iris: Add optional CPU-based ASTC void extent denorm flushing Intel Gen9 GPUs have hardware ASTC support, but have a bug where they don't handle denormalized values in void extent blocks correctly. This isn't that hard to work around - on upload, we can detect such blocks, and flush any denorms to zero. Because we're altering the data behind the application's back, and applications can theoretically ask to download the original unaltered image data, we unfortunately need to maintain shadow copies of the data. To make sure that we don't accidentally skip the void-extent flushing via any fast-upload paths, and support download correctly, we plug this into the st/mesa compressed texture format fallback paths, which store a CPU copy of the original image data, and upload altered data. This is unfortunately common code for what's likely to be a single driver's issue (on a single generation), but it beats replicating an entire framework we already have inside the driver. Fixes dEQP-GLES3.functional.texture.compressed.astc.void_extent_ldr.* using iris on Intel Gen9 GPUs. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4167 Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21943>	2023-03-17 21:30:48 +00:00
Michel Dänzer	86c6634897	intel/vk/grl: Do not use no_override_init_args for C++ It's only valid for C code. Avoids cc1plus: error: command-line option '-Wno-override-init' is valid for C/ObjC but not for C++ [-Werror] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21781>	2023-03-17 16:08:33 +00:00
Francisco Jerez	76b4255cd8	intel/fs: Fix register coalesce in presence of force_writemask_all copy source writes. This fixes the behavior of register coalesce in cases where the source of a copy is written elsewhere in the program by a force_writemask_all instruction, which could cause the overwrite to be executed for an inactive channel under non-uniform control flow, causing can_coalesce_vars() to give incorrect results. This has been reported in cases like: > while (true) { > x = imageSize(img); > if (non_uniform_condition()) { > y = x; > break; > } > } > use(y); Currently the register coalesce pass would coalesce x and y in the example above, which is invalid since in the example above imageSize() is implemented as a force_writemask_all SEND message, whose result is broadcast to all channels, so when a given channel executes 'y = x' and breaks out of the loop, another divergent channel can execute a subsequent iteration of the loop overwriting 'x' with a different value, hence coalescing y and x into the same register changes the behavior of the program. Note that this is a regression introduced by commit `a4b36cd3dd`. In order to avoid the problem without reverting that patch, we prevent register coalesce if there is an overwrite of the source with force_writemask_all behavior inconsistent with the copy and this occurs anywhere in the intersection of the live ranges of source and destination, even if it occurs lexically before the copy, since it might be physically executed after the copy under divergent loop control flow. Fixes: `a4b36cd3dd` ("intel/fs: Coalesce when the src live range is contained in the dst") Reported-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21351>	2023-03-17 03:05:24 -07:00
Francisco Jerez	d4015bcb38	intel/fs: Fix copy propagation dataflow analysis in presence of force_writemask_all ACP overwrites. This fixes the behavior of copy propagation in cases where either the source or destination of an ACP is overwritten elsewhere in the program by a force_writemask_all instruction, which could cause the overwrite to be executed for an inactive channel under non-uniform control flow, causing the current per-channel dataflow propagation to give incorrect results. This has been reported in cases like: > while (true) { > x = imageSize(img); > if (non_uniform_condition()) { > y = x; > break; > } > } > use(y); Currently the copy propagation pass would propagate copy 'y = x' into 'use(y)', which is invalid since in the example above imageSize() is implemented as a force_writemask_all SEND message, whose result is broadcast to all channels, so when a given channel executes 'y = x' and breaks out of the loop, another divergent channel can execute a subsequent iteration of the loop overwriting 'x' with a different value, hence replacing 'y' with 'x' at 'use(y)' changes the behavior of the program. This patch extends the global dataflow analysis algorithm to determine whether there is any control flow path from a given copy to an overwrite of its source or destination which has force_writemask_all behavior inconsistent with the copy, and in such case prevents copy propagation for that ACP entry at any point of the program which can be reached from the overwrite, even if the copy is statically re-executed along all such control flow paths (as in the example above), since the execution of the overwrite for a given channel i may corrupt other channels j!=i inactive for the subsequently re-executed copy. Note that a simpler solution has been attempted which fully shuts down copy propagation if such a force_writemask_all ACP overwrite is present /anywhere/ in the program regardless of its location in the control flow graph, however that led to large shader-db regressions in some programs from shader-db (like a CS from Car Chase which would emit 53% more instructions). With this solution the only handful of shaders that suffer instruction count regressions seem to be getting misoptimized right now (e.g. some compute shaders from Deus Ex Mankind). This solution doesn't seem to affect the run-time of shader-db significantly, it's less than 1% higher with the fix applied. Reported-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21351>	2023-03-17 03:05:20 -07:00
Francisco Jerez	1c1be23497	intel/fs: Track force_writemask_all behavior of copy propagation ACP entries. force_writemask_all determines whether all channels of the copy are actually valid, and may be required to be set for it to be propagated safely in cases where the destination of the copy is used by another force_writemask_all instruction, or when the copy occurs in a divergent control flow block different from its use. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21351>	2023-03-17 03:05:18 -07:00
Kenneth Graunke	14f9f98dcb	i965/vec4: Implement uclz in the vec4 backend Commit `28311f9d02` moved ufind_msb lowering to NIR and started emitting uclz. Unfortunately, the vec4 backend never actually implemented uclz. It's trivial to do. Now it does. Fixes: `28311f9d02` ("nir: intel/compiler: Move ufind_msb lowering to NIR") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21974>	2023-03-17 09:01:18 +00:00
Kenneth Graunke	e7ea2aa46c	intel/fs: Make bld.F16TO32 actually emit F16TO32 not F32TO16 Ahem, "add builder helpers that work on Gfx7"...now might actually work. Too much copy and paste... Fixes: `966995d911` ("intel/fs: Add builder helpers for F32TO16/F16TO32 that work on Gfx7.x") Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21974>	2023-03-17 09:01:18 +00:00
Kenneth Graunke	84197bc0a4	intel/vec4: Retype texture/sampler indexes to UD generate_tex() asserts that sampler_index.type == UD, but commit `83fd7a5ed1` removed the uint temporary, which caused us to see D at some points. Really, either should be fine, but let's just put the UD retype back. This fixes a ton of things in crocus. Fixes: `83fd7a5ed1` ("intel: Use nir_lower_tex_options::lower_index_to_offset") Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21974>	2023-03-17 09:01:18 +00:00
Anuj Phogat	b4b43aa912	anv: implement TES distribution mode WA 22012785325 Set TEDMODE_RR_STRICT when TEEnable is set. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21899>	2023-03-16 14:42:53 +00:00
Constantine Shablya	d53aba56db	anv: use vk_get_physical_device_features Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21754>	2023-03-16 08:23:29 +00:00
Mark Janes	a2e5e7daa0	intel: use generated helpers for Wa_1409433168/Wa_16011107343 HSD 1306463417 is a hardware defect. The originating software workaround for the issue is Wa_1409433168. Convert all references to the software workaround number, and use generated helpers instead of GFX comparisons. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21914>	2023-03-15 23:31:08 +00:00
José Roberto de Souza	eec5ddd0ed	anv: Handle external objects allocation in Xe External(imported or exported) objects needs to have vm_id set to 0. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21885>	2023-03-15 18:17:11 +00:00
José Roberto de Souza	b2d82c25fb	anv: Properly alloc buffers that will be promoted to framebuffer in Xe KMD Xe KMD does a special caching handling for buffers that will be scanout to display, so that is why it needs a flag set during allocation. Checking if VK_STRUCTURE_TYPE_WSI_MEMORY_ALLOCATE_INFO_MESA is available in AllocateMemory() and marking the buffer as scanout. All WSI code paths but one sets VK_STRUCTURE_TYPE_WSI_MEMORY_ALLOCATE_INFO_MESA. The only one that doesn't requires that WSI is initialize with wsi_device_options.sw_device = true to be executed, what is not the case for ANV. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21885>	2023-03-15 18:17:11 +00:00
José Roberto de Souza	a311c031f6	anv: Implement Xe version of anv_physical_device_get_parameters() Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21885>	2023-03-15 18:17:11 +00:00
Rohan Garg	becc1c5615	anv: break out of the loop when the first color attachment is found Fixes: `2bd304bc` ("anv: Skip the RT flush when doing depth-only rendering") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21903>	2023-03-15 10:52:50 +00:00
Emma Anholt	a74d2ef17d	ci/iris: Add skips for slow tests on APL. These get reported as flakes for timing out before passing when the shader cache is hot. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21879>	2023-03-15 08:15:37 +00:00
Mohamed Ahmed	5ada09412f	anv: remove GetBufferMemoryRequirements2() Signed-off-by: Mohamed Ahmed <mohamedahmedegypt2001@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21898>	2023-03-15 00:30:35 +00:00
Dave Airlie	4e0d4aab48	anv: fix image height for field pictures. Fixes: `98c58a16ef` ("anv: add initial video decode support for h264.) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21807>	2023-03-14 13:34:53 +00:00
Lionel Landwerlin	56474fae93	intel/fs: fix subgroup invocation read bounds checking nir->info.subgroup_size can be set to an enum : SUBGROUP_SIZE_VARYING = 0 SUBGROUP_SIZE_UNIFORM = 1 SUBGROUP_SIZE_API_CONSTANT = 2 SUBGROUP_SIZE_FULL_SUBGROUPS = 3 So compute the API subgroup size value and compare it to the dispatch size to determine whether we need some bound checking. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9ac192d79d` ("intel/fs: bound subgroup invocation read to dispatch size") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21856>	2023-03-14 12:15:48 +00:00
Lionel Landwerlin	bf59cfcee1	intel/fs: prevent large vector ops generated by peephole_ffma Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21782>	2023-03-14 10:38:50 +00:00
Lionel Landwerlin	bc08f43991	intel/fs: add MOV source count validation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21782>	2023-03-14 10:38:50 +00:00
Lionel Landwerlin	ed3c2f73db	intel/fs: fixup sources number from opt_algebraic Fixes issues with register_coalesce : fossilize-replay: brw_fs_register_coalesce.cpp:297: bool fs_visitor::register_coalesce(): Assertion `mov[i]->sources == 1' failed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21782>	2023-03-14 10:38:50 +00:00
Lionel Landwerlin	18bdc71459	intel/fs: fix nir_opt_peephole_ffma max vec assumption There can be larger vec than vec4. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21782>	2023-03-14 10:38:50 +00:00
Lionel Landwerlin	efde1917c9	intel/fs: don't SEND messages as partial writes For instance, to load uniform data with the LSC we usually rely on tranpose messages which have to execute in SIMD1. Those end up being considered as partial writes so within loops their life span spread to the whole loop, increasing register pressure. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21867>	2023-03-14 10:10:32 +00:00
Lionel Landwerlin	adcdc38f3b	anv: more formats for acceleration structure vertices Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21821>	2023-03-14 09:34:27 +00:00
Dave Airlie	cb24faf1a6	anv/video: disable picture id reampping. This isn't needed at the hw level with vulkan Fixes: `98c58a16ef` ("anv: add initial video decode support for h264.") Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21433>	2023-03-14 07:32:00 +00:00

1 2 3 4 5 ...

9273 commits