fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 11:48:06 +02:00

Author	SHA1	Message	Date
Asahi Lina	56d5db247a	asahi: decode: Refactor to always copy GPU mem to local buffers We want to plug this library into the hypervisor, but there we don't have all GPU memory already mapped in our address space. Refactor the GPU mem read function to always allocate local buffers and copy in the data there. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Asahi Lina	2c2858c2af	asahi: wrap: Handle freeing shmems Needed for some Metal demos that end up creating multiple queues. This is still definitely broken/not fully correct, but it at least gets things working for those. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Asahi Lina	0dc819f284	asahi: Add extra CDM header block for G14X Looks like we finally found our first properly divergent codepath. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Asahi Lina	69e91527d3	asahi: decode: Add a params argument to pass through Sooner or later we were going to need divergent codepaths in decode, and it looks like now is the time. Add a `params` typedef and pass it through all the decoder callbacks. This is an alias for drm_asahi_params_global, but use a typedef so we can change that later without changing dozens of instances. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	de1174791d	agx: Fix bogus assert Dolphin uses all the uniforms. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	80e103d718	agx: Reduce un/packs with mem access lowering Often not needed and makes the NIR harder to read. shader-db is noise. total instructions in shared programs: 1752712 -> 1752688 (<.01%) instructions in affected programs: 8338 -> 8314 (-0.29%) helped: 21 HURT: 8 Inconclusive result (%-change mean confidence interval includes 0). total bytes in shared programs: 11943572 -> 11943434 (<.01%) bytes in affected programs: 56716 -> 56578 (-0.24%) helped: 21 HURT: 8 Inconclusive result (%-change mean confidence interval includes 0). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	afa38c7d4f	agx: Vectorize 16-bit parallel copies If we have two 16-bit copies to/from adjacent 16-bit registers, we can instead use a single 32-bit copy from the 32-bit register pair. Since 32-bit integer arithmetic is (almost) as efficient as 16-bit on AGX, this (almost) doubles performance of affected parallel copies. total instructions in shared programs: 1788606 -> 1788301 (-0.02%) instructions in affected programs: 17057 -> 16752 (-1.79%) helped: 150 HURT: 0 Instructions are helped. total bytes in shared programs: 12196492 -> 12194662 (-0.02%) bytes in affected programs: 122894 -> 121064 (-1.49%) helped: 150 HURT: 0 Bytes are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	42a4c09b72	agx: Try to allocate phi sources with loop phis total instructions in shared programs: 1788666 -> 1788606 (<.01%) instructions in affected programs: 7953 -> 7893 (-0.75%) helped: 29 HURT: 0 Instructions are helped. total bytes in shared programs: 12196852 -> 12196492 (<.01%) bytes in affected programs: 53908 -> 53548 (-0.67%) helped: 29 HURT: 0 Bytes are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	d0caa08c26	agx: Try to allocate phi sources with phis Not meaningfully using more registers since this is just about how we assign registers after fixing the maximum # of registers used (note that thread count is unaffected). total instructions in shared programs: 1790901 -> 1788666 (-0.12%) instructions in affected programs: 230680 -> 228445 (-0.97%) helped: 681 HURT: 2 Instructions are helped. total bytes in shared programs: 12210266 -> 12196852 (-0.11%) bytes in affected programs: 1634100 -> 1620686 (-0.82%) helped: 682 HURT: 2 Bytes are helped. total halfregs in shared programs: 532130 -> 532218 (0.02%) halfregs in affected programs: 848 -> 936 (10.38%) helped: 3 HURT: 13 Halfregs are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	73da872a66	agx: Try to allocate phis compatibly with sources All shaders affected for thread count are in pubg... by chance the allocation before used fewer registers than the calculated register demand (I guess because we're conservative with our vector handling) and so got lucky and got higher thread count. That shader is also helped massively for instructions. The halfreg change doesn't matter -- we're not actually increasing register demand, we're just being more choosy about our registers. total instructions in shared programs: 1799738 -> 1790901 (-0.49%) instructions in affected programs: 306081 -> 297244 (-2.89%) helped: 889 HURT: 14 Instructions are helped. total bytes in shared programs: 12263290 -> 12210266 (-0.43%) bytes in affected programs: 2150966 -> 2097942 (-2.47%) helped: 889 HURT: 14 Bytes are helped. total halfregs in shared programs: 531981 -> 532130 (0.03%) halfregs in affected programs: 1925 -> 2074 (7.74%) helped: 0 HURT: 26 Halfregs are HURT. total threads in shared programs: 18885184 -> 18884224 (<.01%) threads in affected programs: 13440 -> 12480 (-7.14%) helped: 0 HURT: 15 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	6cc8d7b52a	agx: Add try_coalesce_with helper Common logic the next few patches will use to try to assign something to the same register as something else. "If it's already been assigned a register and that register is free now, use it, otherwise bail." Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	42fbbd2a73	asahi: Forbid 2D Linear with images There's no known use case, so forbidding this reduces the combinatorics required in the texture atomic lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	4e53da7265	asahi: Don't restrict sampler views We now emulate an infinitely large binding table with bindless, so the sky is the limit for this CAP. Note we still have the limit for samplers, so this probably doesn't do anything for OpenGL. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	6788194c39	asahi: Make clear the non-sRGBness of EOT images For sRGB render targets, we encode sRGB when writing pixels into the tilebuffer (in the fragment shader), not when writing out the image. When we actually write out the tilebuffer to the image, we don't use the PBE's sRGB conversion, we just bind it as a UNORM 8 image and blit the pre-transformed pixels. We're about to add real sRGB support for the PBE, so make this linearization explicit. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	8db9eeaeec	asahi: Upload image descriptors Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:28 +00:00
Alyssa Rosenzweig	689d47fe7c	asahi: Upload at most the max texture state registers The rest are bindless now. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	274d0d1c82	asahi: Add texture/image indexing lowering pass Both textures and images share a unified indexing scheme in AGX. When binding tables are used, they can be mapped to texture state registers. Otherwise, there is bindless access available. It would be nice to map OpenGL's binding table based textures and images to AGX texture state registers 1:1. The problem is that OpenGL allows more combined textures and images than we necessarily have texture state registers. So, we use as many texture state registers as we can, and then we fallback on an internal bindless scheme mapping an extended binding table. Add and use a lowering pass to map all of the API-level texture/image indices to either texture state registers or bindless handles as required. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	1ad4a35a6c	asahi: Add agx_batch_track_image helper Adapted from Panfrost. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	868d85bd83	asahi: Reallocate to set the writeable image flag ...If needed, for array images. But avoid doing so for non-array images. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	92cd946028	asahi: Mark writeable images as such ail needs this information to select the appropriate layout. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	16f081bf2a	ail: Page-align layers for writable images This appears to be necessary for PBE writes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	f716da596b	asahi,agx: Set coherency bit for clustered targets We need to set a particular bit on atomics for them to be coherent across clusters. Fixes atomics on G13X. Setting this bit on the single-cluster G13G, on the other hand, wedges the GPU. So best be careful ;-) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Janne Grunau	f66fc18886	asahi: toggle more barrier bits after transform feedback Fixes KHR-GLES31.core.draw_indirect.advanced-twoPass-transformFeedback-arrays and KHR-GLES31.core.draw_indirect.advanced-twoPass-transformFeedback-elements on M1 Ultra (G13D). Let's assume that same bits are required on M1 Pro and Max. Signed-off-by: Janne Grunau <j@jannau.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	58d43ca03c	asahi: Identify background/EOT counts Similar to the counts for VDM/PDM/CDM. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	198c51d664	asahi: Serialize NIR in memory Deserializing isn't expected to be much more expensive than cloning, and the serialized NIR is significantly smaller. So store the serialized instead of the deserialized, and deserialize on the fly. This reduces a lot of noise in valgrind due to random crap alloc'd against the NIR shader by lowering passes that now get properly freed. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	8df0a86cc0	asahi: Extract shader_initialize helper To fill out an agx_uncompiled_shader struct, since the logic was duplicated between graphics and compute. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Asahi Lina	0e08923a7b	asahi: Add nomsaa debug flag This forces off MSAA, which together with smalltile mode helps test more combinations. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Asahi Lina	e9b2f02c2f	asahi: Add smalltile debug option This lets us force small tiles when they otherwise would not be necessary, which is useful for decoupling tile size and the logic that depends on it from things like MSAA and MRT which can trigger small tiles. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Asahi Lina	35715db30d	asahi: Add synctvb debug flag This requests synchronous TVB growth (instead of split renders). Mostly for testing at this point. Only works with newer kernels and the kernel will complain on dmesg for now. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	d33375bb05	asahi: Refactor PBE upload routine In general, PBE descriptors map pipe_image_views for the hardware. That we use a writeable shader image internally for render targets is an implementation-detail of the end-of-tile program. So, refactor the PBE upload routine to take a pipe_image_view (not a pipe_surface), and translate the pipe_surface into an internal pipe_image_view for end-of-tile programs. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	85c829d64f	asahi: Remove unused #define Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	f10d51541d	asahi: Use nir_builder_at more Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Alyssa Rosenzweig	c20c9f06d3	asahi: Augment fake drm_asahi_params_global Stub out a bit more UAPI so we can build with the additions in this patch series. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24258>	2023-07-20 15:33:27 +00:00
Sergi Blanch Torne	f7d0586524	Integrate ci-kdl in the building process and launch process. Modify the build process for the images to include the build to have ci-kdl available in the Mesa jobs. Modify also the init-stage2 to launch in the background the process that will collect data and store a json file with the relative changes on the recorded data. Signed-off-by: Sergi Blanch Torne <sergi.blanch.torne@collabora.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24177>	2023-07-20 12:04:41 +00:00
Sergi Blanch Torne	8a1c95caab	Introduce ci-kdl builder and launcher. A tool to collect relative changes in some registers of sysfs can be used in the Mesa jobs to record information while the tests are being executed. Signed-off-by: Sergi Blanch Torne <sergi.blanch.torne@collabora.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24177>	2023-07-20 12:04:41 +00:00
Vignesh Raman	95c9d3db32	ci: add Vignesh Raman into restricted traces access list Signed-off-by: Vignesh Raman <vignesh.raman@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24247>	2023-07-20 11:08:10 +00:00
Eric Engestrom	85a8f03211	ci: delete install.tar after extracting it to avoid re-uploading it Leaving it means it gets re-uploaded when sync'ing the artifacts back from the DUT to GitLab. Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24196>	2023-07-20 10:34:03 +00:00
Pavel Ondračka	c9a0e91d4c	r300: fix cycles calculation There might be more texture semaphores per begin tex block, just do the cycles calculation on the first one. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24250>	2023-07-20 10:19:24 +00:00
Lionel Landwerlin	2007d67054	ci/a530: switch a few tests to flakes to unblock CI Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24248>	2023-07-20 09:46:21 +00:00
Felix DeGrood	d04be9770b	intel/compiler: use shader source hash in shader dump code Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Felix DeGrood	6ac8a9a030	intel: use shader source hash in INTEL_MEASURE Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Felix DeGrood	49182271e3	mesa: propagate shader source sha1 from gl_shader to nir_shader Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Felix DeGrood	96f344e5a6	iris: save shader source sha1 in ish Save lowest dword of shader source sha1 in pipeline object for use later as hash for uniquely identifying shader in debug outputs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Felix DeGrood	124973c635	anv: Add Source hash field to VkPipelineExecutableStatisticKHR Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Felix DeGrood	b145d05381	anv: save a shader source uint32_t hash in gfx/compute pipelines Save lowest dword of shader source sha1 in pipeline object for use later as hash for uniquely identifying shader in debug outputs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Lionel Landwerlin	3384f029be	intel/compiler: rework input parameters Use a struct for various common parameters rather than per stage structure or arguments to stage specific entrypoints. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23942>	2023-07-20 09:08:08 +00:00
Konstantin Seurer	df3f2c89f5	radv/meta_buffer: Rename size_minus16 to max_offset It's just better. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24213>	2023-07-20 07:43:16 +00:00
Konstantin Seurer	c49bd75fa7	radv/meta_buffer: Stop setting RADV_META_SAVE_DESCRIPTORS Everything is done via push constants. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24213>	2023-07-20 07:43:16 +00:00
Konstantin Seurer	839d6f9fa2	radv: Stop using the misleading round_up_u* functions The functions had the same behavior as DIV_ROUND_UP but their names do not mention a division. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24210>	2023-07-20 06:51:30 +00:00
Pavel Ondračka	34a12a2727	r300: cycles estimate for shader-db To account for: - macro MAD in vs - NOPs needed before presubtract - texture scheduling and a proper texture semaphore usage The docs don't mention any other references to extra cycles, so otherwise we assume 1 instruction = 1 cycle. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7573 Reviewed-by: Filip Gawin <filip.gawin@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24152>	2023-07-20 06:37:10 +00:00
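
Note on commit 839d6f9fa2 above: its message says the round_up_u* functions behaved like DIV_ROUND_UP although their names do not mention a division. The following is a minimal sketch of that distinction, not code taken from the commit. The helper names div_round_up, round_up_to_multiple and check_rounding_helpers are hypothetical and defined locally for illustration. The sketch assumes a positive divisor and inputs small enough that n + d - 1 does not overflow.

```c
#include <assert.h>
#include <stdint.h>

/* Ceiling division: how many d-sized chunks are needed to cover n.
 * Hypothetical local helper; assumes d > 0 and no overflow in n + d - 1. */
static inline uint32_t div_round_up(uint32_t n, uint32_t d)
{
   return (n + d - 1) / d;
}

/* What a bare "round up" name could be read as instead: the smallest
 * multiple of d that is >= n. Hypothetical local helper. */
static inline uint32_t round_up_to_multiple(uint32_t n, uint32_t d)
{
   return div_round_up(n, d) * d;
}

/* Small self-check showing that the two readings give different results. */
void check_rounding_helpers(void)
{
   assert(div_round_up(17, 16) == 2);           /* two 16-wide chunks cover 17 */
   assert(round_up_to_multiple(17, 16) == 32);  /* next multiple of 16 */
   assert(div_round_up(32, 16) == 2);           /* exact multiples are unchanged */
   assert(round_up_to_multiple(32, 16) == 32);
   assert(div_round_up(0, 16) == 0);
}
```

For n = 17 and d = 16, the ceiling division yields 2 while rounding up to a multiple yields 32. A name that mentions only "round up" leaves a reader unsure which of the two is meant, which is the ambiguity the commit message describes.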


174500 commits