fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 02:30:12 +01:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	9da8dc47f9	agx/dce: Use the helper Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Alyssa Rosenzweig	0d7b8bfce5	agx: Don't lower load_local_invocation_index We have an SR for it, which can save a bit of math. This came up while working on the spiller. total instructions in shared programs: 1778396 -> 1778376 (<.01%) instructions in affected programs: 3036 -> 3016 (-0.66%) helped: 10 HURT: 3 Instructions are helped. total bytes in shared programs: 12185182 -> 12185018 (<.01%) bytes in affected programs: 38640 -> 38476 (-0.42%) helped: 18 HURT: 2 Bytes are helped. total halfregs in shared programs: 531218 -> 531174 (<.01%) halfregs in affected programs: 471 -> 427 (-9.34%) helped: 6 HURT: 0 Halfregs are helped. total threads in shared programs: 18909056 -> 18909184 (<.01%) threads in affected programs: 1280 -> 1408 (10.00%) helped: 2 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Janne Grunau	3f8894b0f7	asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs Discovered while running dEQP-EGL under address sanitizer. Fixes: `f3877f56ba` ("asahi,agx: Rewrite varying linking") Signed-off-by: Janne Grunau <j@jannau.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Asahi Lina	eafd35e458	asahi: Force linear for SHARED buffers with no/implicit modifier Consumers might not pass through the modifier information in this case. Fixes XWayland/mutter using dma-buf v4 feedback (though the fact they try to use implicit modifiers is likely a bug on their end, and will decrease performance). Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Alyssa Rosenzweig	3e5d2f0c1b	asahi,agx: Respect no16 even for I/O Don't call lower_mediump_io for no16. This is helpful for debugging and soon driconf-shaming apps with broken precision qualifiers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Asahi Lina	721aa39ad9	asahi: Impose limits on resource shadowing Apps can have pathological use cases where huge resources are shadowed repeatedly. An app that alternately writes to a resource and then uses it to draw can create an unbounded amount of shadow BOs. To fix this, introduce both a maximum resource size for shadowing, and a maximum cumulative size that resource may be shadowed before we start flushing readers. The flush path then clears the counter, as does the happy path where there are no readers left after flushing writers. Fixes massive memory bloating in Firefox and probably others. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Asahi Lina	9d668f87d3	asahi: Print info about shadowed resources If resource and perf debugging are both enabled, this prints resource info for shadowed resources. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Asahi Lina	ccbd125468	asahi: Always use resource size, not BO size BOs can be oversized, as they can come from the BO cache. Make sure to always use the resource layout size, not the BO size, when we need this for some reason. This fixes BO shadowing creating overlarge BOs, and also the attachment size for submissions (probably doesn't matter, but it's more correct now). Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Asahi Lina	f8f4f466f7	asahi: Fix race in BO stats accounting These counters are accessed without locking, so they need to be atomic. Should be cosmetic only. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Asahi Lina	9762c55589	asahi: Do not overallocate BOs by more than 2x This is not likely to be useful, and might take over a correctly-sized BO that is going to be reused later. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Asahi Lina	175e02baed	asahi: Add a noshadow debug flag This lets us trivially test whether resource shadowing helps or hurts any given workload. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Alyssa Rosenzweig	5f3d784c6c	agx: Handle 8-bit vecs These should "just" work, promoting the 8-bit channels to 16-bit registers internally, allowing us to use our 8-bit stores with 8-bit data vectors packed in 16-bit registers. All other non-conversion ALU gets lowered by the previous patch, this is just needed for simple things like nir_op_vec of lowered math passed to a vectorized store. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Alyssa Rosenzweig	c3b86bcbbc	agx: Lower 8-bit ALU No hardware support for it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Alyssa Rosenzweig	aeac45c188	asahi: Move a bunch of helpers to common These have no real Vulkan or Gallium dependence and are (as such) useful for both VK and GL without any real change in level of abstraction. Do the code motion. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
Alyssa Rosenzweig	e5f76821f1	asahi: Stub num_dies We'll use it in the upstreamable driver portion soon. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24635>	2023-08-11 20:31:27 +00:00
George Ouzounoudis	41d094c2cc	nvk: Support dynamic state for enabling sample locations When switching dynamically we should also push the corresponding sample locations, the default when disabled or the custom ones when enabled. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24592>	2023-08-11 19:27:24 +00:00
George Ouzounoudis	2de545c68f	nvk: Fix support for VK_EXT_sample_locations Fixes some crashes on sample locations pipeline tests. The implementation was already there but the device properties were missing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24592>	2023-08-11 19:27:24 +00:00
Italo Nicola	2dc883eb37	gallium/st: lower NV21 to R8_B8G8 instead of G8_B8R8 When NV21 lowering with hardware sampling and shader CSC was added, the incorrect PIPE_FORMAT_G8_B8R8_UNORM was used. That format is supposed to represent vulkan NV12 instead. This commit introduces PIPE_FORMAT_R8_B8G8_UNORM, which correctly describes the gallium mapping for YUV CSC, with R as Y, instead of G as Y. Fixes: `26e3be513d` ("gallium/st: add support for PIPE_FORMAT_NV21 and PIPE_FORMAT_G8_B8R8_420") Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24266>	2023-08-11 18:43:38 +00:00
Italo Nicola	4eb0a98e5a	pan/bi: add support for I420 and YV12 sampling These formats can be directly sampled, and they have a lower stride alignment requirement. Signed-off-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24266>	2023-08-11 18:43:38 +00:00
Italo Nicola	b890a5ff61	gallium/st: add non-CSC lowering of YV12 as PIPE_FORMAT_R8_B8_G8_420 YV12 is the same as DRM_FORMAT_YVU420. We lower it to PIPE_FORMAT_R8_B8_G8_420, which is equivalent to PIPE_FORMAT_R8_G8_B8_420 with U/V planes swapped. This is used for hardware that can sample from YUV but need CSC in shader. Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24266>	2023-08-11 18:43:38 +00:00
Italo Nicola	60ebef430a	gallium/st: add non-CSC lowering of I420 as PIPE_FORMAT_R8_G8_B8_420 This new format is similar to PIPE_FORMAT_G8_B8_R8_420, but with R as Y, G as U and B as V. The need for two diferent formats here is because gallium maps the YUV channels differently from vulkan. Some hardware, e.g. Mali GPUs, can sample from I420 but need CSC in shader, this patch implements that. Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24266>	2023-08-11 18:43:38 +00:00
David Rosca	06495f11da	radeonsi/vcn: Update rate control when framerate changes with HEVC Similar to H264/AV1, check for framerate changes and update rate control also with HEVC. Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24475>	2023-08-11 17:47:43 +00:00
Georg Lehmann	c4f356faf4	aco: always use rtne for fquantize2f16 The SPIR-V spec says: If Value is positive with a magnitude too large to represent as a 16-bit floating-point value, the result is positive infinity. If Value is negative with a magnitude too large to represent as a 16-bit floating-point value, the result is negative infinity. This is only the case for rtne v_cvt_f16_f32 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24617>	2023-08-11 12:37:23 +00:00
Alyssa Rosenzweig	144546f434	agx: Lower flat shading in NIR We get this as part of the lowering we added for interpolateAtOffset. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24498>	2023-08-11 09:50:12 +00:00
Alyssa Rosenzweig	ff0e25d293	agx: Add interpolateAtOffset lowering pass Add a lowering pass that lowers interpolation to math on the coefficient registers. This handles interpolateAtOffset, as well as flat shading as an easy special case. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24498>	2023-08-11 09:50:11 +00:00
Alyssa Rosenzweig	48029548f3	agx: Forcibly vectorize pointcoord coeffs This avoids regressions from scalarizing pointcoord loads. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24498>	2023-08-11 09:50:11 +00:00
Alyssa Rosenzweig	52b8d31548	agx: Set lower_fisnormal We're going to generate this in our interpolation lower. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24498>	2023-08-11 09:50:11 +00:00
Alyssa Rosenzweig	5577aebfb2	agx: Allow more varying slots Don't overflow. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24498>	2023-08-11 09:50:11 +00:00
Alyssa Rosenzweig	22f694c008	agx: Implement nir_intrinsic_load_coefficients_agx Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24498>	2023-08-11 09:50:11 +00:00
Alyssa Rosenzweig	10cdc0ad9f	nir: Add load_coefficients_agx intrinsic For lowering interpolation. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24498>	2023-08-11 09:50:11 +00:00
Mike Blumenkrantz	e9a5da2f4b	nir: add a filter cb to lower_io_to_scalar this is useful for drivers that want to do selective scalarization of io Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24565>	2023-08-11 09:02:53 +00:00
Mike Blumenkrantz	550f3dc437	nir/lower_io: add a new doubles-only 64bit lowering option this allows lowering only 64bit float operations for drivers that support 64bit integers Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24565>	2023-08-11 09:02:53 +00:00
Vitaliy Triang3l Kuzmin	933e6e4751	r600/asm: Make sure MOVA and SET_CF_IDX are in the same clause Acked-by: Gert Wollny <gert.wollny@collabora.com> Signed-off-by: Vitaliy Triang3l Kuzmin <triang3l@yandex.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24545>	2023-08-11 08:04:05 +00:00
Vitaliy Triang3l Kuzmin	99c8d15c67	r600/asm: Fix AR force_add_cf setting if a clause is not open Acked-by: Gert Wollny <gert.wollny@collabora.com> Signed-off-by: Vitaliy Triang3l Kuzmin <triang3l@yandex.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24545>	2023-08-11 08:04:05 +00:00
Samuel Pitoiset	b34c027cb0	radv: use the number of VS outputs for computing the tessellation info When TCS isn't linked with VS, the vertex stride should be computed from vertex outputs. This is only for shader object and shouldn't change anything right now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24540>	2023-08-11 07:43:58 +00:00
Samuel Pitoiset	8a97302f57	radv: add support for loading the LSHS vertex stride from a SGPR With shader object, if VS and TCS aren't linked together, the LSHS vertex stride should be computed from the vertex outputs. Otherwise, if an output is unused, the stride is wrong in TCS. This is currently for GFX8 only because for merged shaders this won't be needed but shader object on GFX9+ isn't yet a thing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24540>	2023-08-11 07:43:58 +00:00
Tapani Pälli	0cb88ddca2	iris: implement required PSS sync for Wa_18019816803 According to WA description, we need to track DS write state and emit a PSS_STALL_SYNC whenever that state changes. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18411>	2023-08-11 07:15:49 +00:00
Tapani Pälli	92941ee84b	anv: implement required PSS sync for Wa_18019816803 According to WA description, we need to track DS write state and emit a PSS_STALL_SYNC whenever that state changes. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18411>	2023-08-11 07:15:48 +00:00
Tapani Pälli	419531c5d9	intel/blorp: add a new flag to communicate PSS sync need This is required for Wa_18019816803 when blorp emit DS state. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18411>	2023-08-11 07:15:48 +00:00
Yogesh Mohan Marimuthu	973e6f3be0	gallium: remove start_slot parameter from pipe_context::set_vertex_buffers This patch removes start_slot from set_vertex_buffers() as suggested in https://gitlab.freedesktop.org/mesa/mesa/-/issues/8142 compilation testing: all gallium drivers, nine frontend compilation has been tested. d3d10umd compilation has not been tested driver, frontend testing: only llvmpipe and radeonsi driver was tested running game only the nine frontend changes are complex. All other changes are easy. nine front end was using start slot and also using multi context. nine frontend code changes: In update_vertex_elements() and update_vertex_buffers(), the vertex buffers or streams are ordered removing the holes. In update_vertex_elements() the vertex_buffer_index is updated for pipe driver to match the ordered list. v2: remove start_slot usage code from Marek (Marek Olšák) v3: nine stream number holes mask code from Axel (Axel Davy) Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> (except nine, which is Ab) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22436>	2023-08-11 06:37:22 +00:00
Dave Airlie	e0da62c0e9	nvk: NOUVEAU_WS_BO_LOCAL is a trap. This flag isn't a flag, don't be & at it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24625>	2023-08-11 06:20:01 +00:00
Faith Ekstrand	9f767db126	nv50/ir: Rework conversions for texture array indices Currently, negative array texture indices get saturated to 0 which, while technically in-bounds, isn't what we want for Vulkan with image robustness or robustness2. Vulkan requires that a negative index on a texelFetch() count as out-of-bounds but a negative index on any other texture operation gets clamped to 0. (See the spec section entitled "(u,v,w,a) to (i,j,k,l,n) Transformation And Array Layer Selection"). Instead of using CVT for TXF, we now take U32 MAX with 0xffff. Because it's unsigned, this ensures that negative array indices clamp to 0xffff and will be considered out-of-bounds by the hardware (there are a maximum of 2048 array indices in an image descriptor). For everything other than TXF, we keep using an F32->U16 conversion but add a saturate. This ensures that negative array indices clamp to 0 as per the Vulkan spec. Very large indices will clamp to 0xffff which the hardware will clamp to the maximum array index. This fixes 324 tests in the dEQP-VK.robustness.* group, all those for 1D and 2D array textures Acked-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24593>	2023-08-11 06:02:23 +00:00
Mike Blumenkrantz	585f0e8b48	nir: minor fixes for io_to_scalar Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24613>	2023-08-11 05:14:00 +00:00
Mike Blumenkrantz	0a12cedec9	zink: add a special separate shader i/o mode for legacy variables ARB shaders have no rules restricting i/o interfaces since it's assumed that they'll match by name. given that mesa marks these all as separate shaders, a separate path is needed to ensure these variables correctly match up their i/o even when it's mismatched cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24608>	2023-08-11 04:44:46 +00:00
Mike Blumenkrantz	b24911e5db	zink: pre-convert mode in fixup_io_locations no functional changes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24608>	2023-08-11 04:44:46 +00:00
Faith Ekstrand	52c57667ed	nvk: Use common physical device properties Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24575>	2023-08-11 02:53:47 +00:00
Konstantin Seurer	c06f70ca18	radv: Use common physical device properties Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24575>	2023-08-11 02:53:47 +00:00
Konstantin Seurer	eaee792ea5	vulkan: Add a generated vk_properties struct Generates a physical device properties table to avoid dealing with pNext chains in the driver. Based on vk_features. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24575>	2023-08-11 02:53:47 +00:00
Eric Engestrom	0ab0e5d803	ci/a530: document piglit flake https://gitlab.freedesktop.org/mesa/mesa/-/jobs/47086976 Signed-off-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24622>	2023-08-11 01:19:27 +00:00
Derek Foreman	5ba5bcf2b6	vulkan/wsi: Allow binding presentation_timing when software rendering The presentation timing extension is used for doing WaitForPresent properly, but we accidentally bind it after an early return intended to stop us from binding dmabuf when software rendering. Remove the early return. cc: mesa-stable Signed-off-by: Derek Foreman <derek.foreman@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24588>	2023-08-11 00:35:37 +00:00

... 17 18 19 20 21 ...

177262 commits