fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 15:58:05 +02:00

Author	SHA1	Message	Date
Connor Abbott	fec372dfa5	tu: Implement FDM viewport patching We scale the actual rendering by patching the viewport state. This is helped by a HW bit to make the viewport index equal to the view index, so that we can have a different scaling per-view. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:26 +00:00
Connor Abbott	17c732f531	ir3: Record whether a shader writes gl_ViewportIndex This will be needed by turnip. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:26 +00:00
Connor Abbott	05f96dd00f	tu: Add core FDM patchpoint infrastructure FDM is implemented pretty much entirely inside the driver, by patching various structures for each bin. This adds the core infrastructure to sample the density map, compute the scaled bin sizes we will use, create patchpoints, and apply them at the start of each bin before executing the IB2. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:26 +00:00
Connor Abbott	ebb8e104a5	tu/cs: Add support for CS patching In order to patch the command stream on the gpu, we need two features: 1. The ability to use a read-write BO instead of a read-only one, when patching might be performed. 2. The ability to get the iova of the current position after reserving some number of dwords, even with externally-allocated command streams. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:26 +00:00
Connor Abbott	2aa3dc3bd0	tu: Implement sampling the fragment density map Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:26 +00:00
Connor Abbott	64daede1c3	tu: Parse fragment density map attachment info Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:26 +00:00
Connor Abbott	ab75e0a126	freedreno/a6xx: Document per-view viewport in GRAS_SU_CNTL Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	768dcc7a27	tu: Make dynamic viewport and scissor count more accurate Because we delay emitting them until we know the pipeline, we can track the actual count instead of taking the max. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	0f33d0392a	tu: Merge RB_DEPTH_CNTL and RB_STENCIL_CONTROL drawstates We're again running out of draw states, and this matches what gallium does. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	7673fcf206	tu: Precompute maximum views across all subpasses We'll need this to know how many viewports to create. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	2668ba0ecd	tu: Use dirty bit for scissor state This will make patching it on-demand easier. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	f3ffd963f5	tu: Add 3D GMEM load path This is similar to old gens which couldn't support loading from GMEM automatically. It will be needed for loads with a fragment density map, because we need to scale the image when loading to GMEM. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	a294a6cfe6	freedreno/fdl: Expose view offset Will be used by CPU sampling. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	31a9ac7f4e	freedreno/fdl: Don't pre-shift image view pitch We'll need the unshifted pitch for doing CPU reads. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	012e8f5c61	tu: Don't pre-shift depth and stencil pitch Different uses in various registers and the texture descriptor have different shifts, and we already had a few ugly workarounds to handle this. Remove the foot-gun by specifying it in bytes and letting users handle the shift themselves using the correct macro. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
Connor Abbott	f6902bf425	tu: Don't override depth for GMEM Otherwise accesses to non-0 views of input attachments may be considered out-of-bounds and return 0. This should've been removed when enabling multiview for GMEM, not sure how it was missed. Fixes: `def56b531c` ("tu: Support GMEM with layered rendering and multiview") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20304>	2023-05-08 19:59:25 +00:00
M Henning	cabbbbf0af	nouveau/nir: Set isSigned on all atomic_imax/imin Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22889>	2023-05-08 18:57:14 +00:00
Mike Blumenkrantz	00627b4f8d	aux/draw: add guardband clipping for lines to comply with ES2+ line clipping rules, guardband clipping should be used so that the rasterizer will clip lines without using clip planes fixes (llvmpipe): dEQP-GLES.functional.clipping.line.wide_line_clip_viewport_center dEQP-GLES.functional.clipping.line.wide_line_clip_viewport_corner Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17284>	2023-05-08 16:55:50 +00:00
Erik Faye-Lund	5fa9436617	aux/draw: check for lines when setting clipping-mode Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17284>	2023-05-08 16:55:50 +00:00
Mike Blumenkrantz	43802ea3b5	aux/draw: guard_band_points_xy -> guard_band_points_lines_xy just a rename, no functional changes Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17284>	2023-05-08 16:55:50 +00:00
Mike Blumenkrantz	ea98df2a65	gallium: pipe_rasterizer_state::point_tri_clip -> point_line_tri_clip this is just a rename, no functional changes Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17284>	2023-05-08 16:55:49 +00:00
Yiwei Zhang	04b3369921	ci: uprev virglrenderer to drop venus release patches Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22896>	2023-05-08 14:58:51 +00:00
Vitaliy Triang3l Kuzmin	4ed2616ac3	radv: Fix vk_instance_init vk_error instance use-after-free Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Vitaliy Triang3l Kuzmin <triang3l@yandex.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22887>	2023-05-08 14:09:49 +00:00
Vitaliy Triang3l Kuzmin	bb91bc9fd2	lavapipe: Fix vk_instance_init vk_error instance use-after-free Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Signed-off-by: Vitaliy Triang3l Kuzmin <triang3l@yandex.ru> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22887>	2023-05-08 14:09:49 +00:00
Gert Wollny	dbc4c088fc	r600/sfn: Fix iterator use Reported by Coverity 1529462 Fixes: `e57643cf54` r600/sfn: Add handling for R600 indirect access alias Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22903>	2023-05-08 13:45:40 +00:00
Marek Olšák	d90fc82569	radeonsi: do AMD_DEBUG=nodisplaydcc differently to also remove modifiers Only modifiers with DCC retiling are removed for now. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22771>	2023-05-08 13:08:01 +00:00
Marek Olšák	8c8b5a8fbd	radeon: add radeon_info parameter into radeon_winsys::surface_init to allow radeonsi to change radeon_info. The next commit will rely on it. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22771>	2023-05-08 13:08:01 +00:00
Marek Olšák	ae6b928495	ac/gpu_info: disable display DCC on Raphael and Mendocino to improve power usage Below is the summary from the power validation.. "it looks like the only workload where I see savings from DCC is PLT and it is only about 65mW which is just run to run variation. For Idle I am seeing ~280mW increase in power, ~200mW increase for power_VideoCall, and ~80mW increase for VP" Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22771>	2023-05-08 13:08:00 +00:00
Marek Olšák	e4c8ac5aae	ac/surface: don't expose modifiers with DCC retiling if radeon_info forbids it Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22771>	2023-05-08 13:08:00 +00:00
Samuel Pitoiset	ce64300676	radv: remove ac_surf_info from radv_image Introduce a helper to convert vk_image info to ac_surf_info instead. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22816>	2023-05-08 09:17:12 +00:00
Samuel Pitoiset	9e846ab1dc	radv: use vk_image::extent instead of radv_image::info::{width,height,depth} Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22816>	2023-05-08 09:17:12 +00:00
Samuel Pitoiset	cb721d5de5	radv: use vk_image::samples instead of radv_image::info::samples Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22816>	2023-05-08 09:17:12 +00:00
Samuel Pitoiset	d37b020428	radv: use vk_image::samples instead of radv_image::info::storage_samples Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22816>	2023-05-08 09:17:12 +00:00
Samuel Pitoiset	8e62bb0dfe	radv: use vk_image::array_layers instead of radv_image::info::array_size Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22816>	2023-05-08 09:17:12 +00:00
Samuel Pitoiset	b7b9657a70	radv: use vk_image::mip_levels instead of radv_image::info::levels Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22816>	2023-05-08 09:17:12 +00:00
Samuel Pitoiset	87d31cadad	radv: disable RB+ blend optimizations on GFX11 when a2c is enabled Closes: #8222 Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21313>	2023-05-08 07:22:21 +00:00
Christopher Snowhill	a6d4139e59	Corrects log print to produce hexadecimal base output Matching the original %016lx, and the "0x" prefix which is still in the format string. Fixes: `53b77a8102` ("anv: remove 48bit address space checks") Signed-off-by: Christopher Snowhill <kode54@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22882>	2023-05-07 21:33:18 +00:00
Lionel Landwerlin	fb13360546	intel/fs: reduce register usage for relocated constants Commit `bb8e31b7ed` ("anv: avoid hardcoding instruction VA constant in shaders") had a slight negative impact on shaders (Red Dead Redemption 2 in particular). Dropping a few shaders from SIMD32 to SIMD16. With this change, it brings back all the dropped SIMD32 shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22872>	2023-05-07 19:38:04 +00:00
Asahi Lina	1aaf4bf40a	asahi: Fix batch writer_syncobj cleanup When an ACTIVE batch takes over the active writer role from a SUBMITTED batch, the written BO has the syncobj from the latter even though the writer is the former. This is correct and an intended state, but it means that then we can't gate the syncobj cleanup in agx_batch_cleanup on being the active writer, since the SUBMITTED batch won't be. Fixes: asahi/mesa#18 Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:37 -04:00
Asahi Lina	3f55eff0e5	asahi: Assert that freed BOs have no pending writers This is just a sanity check, I haven't actually hit this case but if we ever do something is very broken (e.g. BO refcounting bug). Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:37 -04:00
Alyssa Rosenzweig	d7d098679b	asahi: Fix depth load/store flags If depth_writemask is set, we need to write depth regardless of whether we run the depth test, to write out the fixed-function fragment depth. This will matter when we start honouring these flags. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:37 -04:00
Asahi Lina	d49e8f4d76	asahi: Clear batch->resolve on agx_batch_init This has been broken forever, but it was only noticed with the ZS load/store optimizations in the subsequent commits. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:37 -04:00
Alyssa Rosenzweig	d72e1418ce	asahi: Implement transform feedback This code was originally based on the Panfrost implementation, but has been improved in a number of ways. 1. Transform feedback programs are dispatched generically with Gallium calls, rather than emitting something hardware-specific. This is cleaner and portable to future GPUs. 2. Transform feedback with indexed draws is now fixed, by lowering to an index buffer pull. 3. Transform feedback with buffer overflows is now fixed, by correctly bounds checking in transform feedback programs. 4. Transform feedback with strips/fans/loops are fixed, by correctly tessellating to the underlying primitives as required by OpenGL. 5. Transform feedback with QUADS is fixed, by tessellating to triangles as required by OpenGL. That said, the code is still not in its final form. 1. It still does not support indirect draws. This will require a substantial overhaul to do tracking on the GPU instead of the CPU. Currently we force unroll indirect draws (slow but kosher in GL, treif in Vulkan). This isn't hard to solve but I'm not going to duplicate the code until the algorithms are otherwise complete because it's a lot easier to hack on the CPU versions than the GPU versions. 2. It still does not support primitive restart. This has especially nasty interactions with transform feedback. Again we force unroll to non-primitive restart forms, again slow but kosher in GL but treif in Vulkan. This is a lot harder to deal with. I sketched out something really nasty in my notebook (hinging on efficient GPU prefix sums) but I'm not in a hurry to type this out. 3. There will be interactions with geometry and tessellation shaders and I don't think I can get the core code here future-proofed without actually bringing up the new shader stages. As such, this is a hard fork of the panfrost code for now, I'm not trying to share the code (although it would clear out almost all of panfrost's transform feedback related piglit failures). Passes dEQP-GLES3.functional.transform_feedback.* and most of the relevant piglits. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:37 -04:00
Alyssa Rosenzweig	25646c7772	asahi: Bump MAX_PUSH_RANGES to the worst-case This shortcuts all headaches about how big this should be. It does increase memory usage a bit if there are lots of shader variants compiled, but this should be tolerable, and can be optimized later if so required. Thanks to the previous commit, the disk cache size should be unaffected. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:37 -04:00
Alyssa Rosenzweig	c2f366ce64	asahi: Shrink disk cache size of push ranges Only store the push ranges we actually need, not all of them. This should save some disk space, while insulating us to MAX_PUSH_RANGES changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:37 -04:00
Alyssa Rosenzweig	e79e743674	agx: Lower I/O to scalar later This lets us preserve vectorized stores for transform feedback shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	a561a6c468	agx: Validate that collect sources are the same size RA asserts this, but by then if you've messed it up, the failure is inscrutable. Let's check it in the validator for more pleasant debugging. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	9337f6a865	agx: Rework z/s emit We were being sloppy with the sizes before. It mostly worked out, but there were some corner cases where we would end up with mixed sized collects and that won't end well for us. Let's rework the logic to make all the sizes explicit in NIR -- 32-bit for depth and 16-bit stencil -- and then do the needed promotions to make it happen in the AGX IR side. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	f4f9269b66	agx: Ensure load_frag_coord has the right sizes In case .x isn't read, it'll be null which has the wrong size and will fail the validation added later in this series. We fix this by padding with sized undefs (something that exists of defined size but undefined value) rather than nothingness (of undefined size). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00
Alyssa Rosenzweig	7f71e1bc2d	agx/lower_address: Match multiplies, not only shifts Sometimes a shader might index with a non-power-of-two stride. For example, if it's indexing into an array of structures where the structure size is not a power of two, we'll get a multiply with a constant as opposed to a shift. We want to handle these cases, too. To do so, we generalize our pattern matching to look for any kind of multiply (with our new helper), rather than hardcoding logic for ishl. This eliminates right-shifts in a pile of compute shaders, which makes me happy from a "I read lots of shader assembly when debugging" perspective. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22891>	2023-05-07 09:10:36 -04:00

1 2 3 4 5 ...

170836 commits