fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-08 04:48:08 +02:00

Author	SHA1	Message	Date
James Park	7758664788	radv: Only close local_fd when valid Necessary when drm_device is bypassed. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7119>	2020-10-13 22:56:31 +00:00
James Park	4ca6faa933	util: Hide timespec_passed on Windows Windows doesn't have clockid_t. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7119>	2020-10-13 22:56:31 +00:00
James Park	1026e2ac0f	radv: Increased const usage Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7119>	2020-10-13 22:56:31 +00:00
James Park	1b551857f9	amd/addrlib: Fix warning list for msvc Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7119>	2020-10-13 22:56:31 +00:00
Jason Ekstrand	5abac85177	intel/fs: Rework scratch handling on Gen9+ The current scratch mechanism uses an MRF hack where we reserve a few GRF registers to treat like the MRF and we collect the data into that MRF region before doing a scratch write. We also use that region for the header for scratch reads. This commit changes things and gets rid of the MRF hack. Instead, we reserve a single register (which RA is free to pick) for the scratch header and uses split sends for scratch writes to avoid having to do the copy. This should provide RA with more freedom in the presence of spilling as well as avoid some unnecessary data moves. In future, the new GEN9_SCRATCH_HEADER opcode gives us a place where we can do our own per-thread scratch base address calculations rather than depending on the scratch base address that gets pushed into g0. Having an opcode for this lets us do it once at the top of the shader rather than repeating it at every read/write. One other noticeable difference is the use of SHADER_OPCODE_SEND. We can get away with this thanks to the fact that we're now using a set to track which instructions are generated by spills and don't rely on the opcodes to find spill/fill instructions. This allows us to avoid adding more virtual opcodes and let the normal code paths handle things like scoreboard dependencies between header setup and the SEND. It also means that post-RA scheduling may be able to space out the header setup MOV and the SEND for better latency hiding. Shader-db results on Skylake: total spills in shared programs: 12137 -> 10604 (-12.63%) spills in affected programs: 6685 -> 5152 (-22.93%) helped: 274 HURT: 2 total fills in shared programs: 13065 -> 11515 (-11.86%) fills in affected programs: 9007 -> 7457 (-17.21%) helped: 275 HURT: 1 Shader-db results on Ice Lake: total spills in shared programs: 12482 -> 10953 (-12.25%) spills in affected programs: 6586 -> 5057 (-23.22%) helped: 275 HURT: 0 total fills in shared programs: 12819 -> 11234 (-12.36%) fills in affected programs: 7867 -> 6282 (-20.15%) helped: 274 HURT: 0 Shader-db results on Tigerlake: total spills in shared programs: 11689 -> 10233 (-12.46%) spills in affected programs: 4740 -> 3284 (-30.72%) helped: 259 HURT: 0 total fills in shared programs: 10840 -> 9443 (-12.89%) fills in affected programs: 6244 -> 4847 (-22.37%) helped: 259 HURT: 0 Fossil-db results on Ice Lake: Spills in all programs: 245249 -> 201633 (-17.8%) Fills in all programs: 366066 -> 314368 (-14.1%) More practically, this seems to give about a 0.5-1% perf boost in Witcher 3 (DXVK) and Shadow of the Tomb Raider (Vulkan native). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	e557af9781	intel/fs/ra: Use a set to track added spill/fill instructions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	f650c4c0c6	intel/fs/ra: Sanity-check our IP counts Starting with `e99081e76d`, we don't re-construct liveness information every time we spill a register. Instead, we're very careful to track which instructions are spill instructions and not contribute those to the IP count so that we can continue to use the old liveness information even though instructions have been added. This commit adds an assert that sanity-checks that we count the same number of instructions as our liveness information is based on. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	d80d0a6ced	intel/fs/ra: Store the last non-spill VGRF node Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	2af6528c33	intel/fs/ra: Refactor handling of Gen7 scratch reads The attempt at de-duplication with the gen7_read Boolean wasn't actually saving us anything. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	74a1843ca0	intel/fs/ra: Increment spill_offset as part of the emit_spill loop This makes it consistent with our handling of src.offset and with our handling of spill_offset in emit_unspill. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	06ebf23283	intel/fs: Add a SCRATCH_HEADER opcode This opcode is responsible for setting up the buffer base address and per-thread scratch space fields of a scratch message header. For the most part, it's a copy of g0 but some messages need us to zero out g0.2 and the bottom bits of g0.5. This may actually fix a bug when nir_load/store_scratch is used. The docs say that the DWORD scattered messages respect the per-thread scratch size specified in gN.3[3:0] in the message header but we've been leaving it zero. This may mean that we've been ignoring any scratch reads/writes from a load/store_scratch intrinsic above the 1KB mark. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	24b64c8408	intel/fs: Copy the PTSS from g0 for scratch reads/writes In theory, this fixes a bug where we were dropping the PTSS bound on the floor. The hardware docs claim that the A32 DWORD and BYTE scattered read/write messages do a PTSS bounds check. However, in practice, it seems that the hardware ignores the bounds check so this doesn't actually matter. I verified this with the following couple of piglit tests: https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/399 In practice, this prevents the next commit from making a subtle behavioral change. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	f103012fad	intel/batch_decoder: Don't clame vec4 vs/gs/tcs shaders on Gen11+ Because we hard-coded the default to vec4, any platform where it doesn't have a "Dispatch Mode" field gets vec4 by default. This includes Gen11+ where vec4 is no longer a thing. Change the default so it works on newer hardware. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Alejandro Piñeiro	e1efc534e6	v3dv/device: Support loader interface version 3. Port of `1e41d7f7b0`: "anv: Support loader interface version 3 (patch v2)" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	866f9493f4	v3dv: fix buffer copies to compressed images on the blit path Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	82b9789f17	v3dv: drop a couple of obsolete comments We only expose a coherent memory heap, so invalidation and flushing are always no-ops for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	ec78ee0342	v3dv: limit blit framebuffer dimensions to max coordinates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	43063d8de8	v3dv: generate proper UUIDs for device and driver Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	c7814bcde3	v3dv: fix blit path for copies from 3D compressed images The aliasing we were using was not always correct. Particularly, for 3D images, the simulator would complain about image strides not being large enough in some cases. This patch fixes this by aliasing both src and dst images and carefully choosing the alias dimensions taking into account the format chosen for the copy and the ratio of block sizes between both images. Playing a bit with the image dimensions used by the relevant CTS tests we confirmed this works well for all tile layouts (lineartile, ublinear1/2 and UIF). This fixes all CTS tests involving 3D image copies from compressed formats without needing to force UIF layout for all compressed images (which would actually not work for all image sizes either). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	24e46f70b9	v3dv: fixes for barriers in secondary command buffers This patch addresses various issues, mostly from secondary command buffers that recorded pipeline barriers that are not consumed in the secondary itself, so they need to be applied to jobs that come right after the execution of the secondary in a primary command buffer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	376b009104	v3dv: implement workaround for GFXH-1918 Loading depth with odd width/height might cause incorrect loading of the early-Z buffer. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	1e81bb05ae	v3dv: implement workaround for GFXH-1461 If a subpass clears one aspect of Depth/Stencil but loads the other the clear might get lost. Fix this by emitting the clear as a draw call instead of relying on the TLB clear. Fixes: dEQP-VK.renderpass.suballocation.attachment.3.307 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	6fb45a04ab	v3dv: flag tmu_dirty_rcl in primaries when linking secondaries that have it set Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	30534fcb29	v3dv: only advertise one memory type Our current implemenation is always coherent. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	49549f43c7	v3dv: always program a reasonable internal depth type for copies/clears This doesn't seem to fix anything, but it is the right thing to do. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	1ed8252514	v3dv/pipeline_cache: extend pipeline cache envvar So far V3DV_ENABLE_DEFAULT_PIPELINE_CACHE allowed to configure pipeline cache to avoid any caching using a pipeline cache. With this change we can be more detailed. Then envvar is not anymore a boolean. Allowed values: * "off": no pipeline cache at all. PipelineCache objects behaves as no-op objects. * "no-default-cache": user PipelineCache caches nir/variants, but we don't provide a default cache in case the user doesn't provide a PipelineCache object, neither for internal pipelines. * "full" (default): we provide a default PipelineCache, used when the user doesn't provide one when creating a Pipeline, and for internal Pipelines. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	93e3b001c6	v3dv/pipeline_cache: set a max size for the pipeline cache We don't want to let the default pipeline cache grow without limit. We choose a maximum number of entries that should work for all real world applications. CTS will exceed that limit, but that is okay, as it will prevent us from running out of memory. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	7eb8eb10f6	v3d/compiler: allow to batch spills Some shaders that need to spill hundreds of registers can take very long times to compile as each allocation attempt spills a single register and restarts the allocation process. We can significantly cut down these times if we allow the compiler to spill in batches, which should be possible if we are spilling uniforms, which is in fact the kind of spills that we do first because they have lower cost than TMU spills. Doing this could cause us to slightly over spill in some cases (depending on the chosen batch size) leading to slightly worse performance, so we only enable this behavior after we have started to spill over a certain threshold, at which point we assume that performance won't be good and we want to favor compilation speed instead. v2: - Keep it simple and just try to spill a fixed amount of registers in a batch instead of trying to compute this dynamically based on accumulated spills and current register pressure. (Eric). v3: - Check if the node is valid before doing anything with it. - Drop the environment variable to select batch size and just fix it to 20. With this we can take this CTS test from 35 minutes down to about 3 minutes: dEQP-VK.ssbo.layout.random.all_shared_buffer.5 Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	f7af9eb211	v3dv: free noop job if needed when finishing the queue Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	cec3d5e766	v3dv: clean-up after obtaining an XCB connection Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	deb0dce1ee	v3dv: don't leak dumb BO handles allocated for swapchain images Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	e7ea918025	v3dv/meta_copy: fix TFU blitting when using 3D images We had some code on blit_tfu to hande 3D images but it was wrong. For example, it executed a copy on the 3D image no matter the depth component copy needed. This was not detected until vk-gl-cts 1.2.4 introduced more 1D and 3D blitting tests. Also add checks for rely on blit_shader if needed like when mirroring on the depth component. Fixes the following tests: dEQP-VK.api.copy_and_blit.core.blit_image.simple_tests.mirror_z_3d.nearest dEQP-VK.api.copy_and_blit.core.blit_image.simple_tests.whole_3d.nearest dEQP-VK.api.copy_and_blit.dedicated_allocation.blit_image.simple_tests.mirror_z_3d.nearest dEQP-VK.api.copy_and_blit.dedicated_allocation.blit_image.simple_tests.whole_3d.nearest Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	db737a082a	v3dv: honor VkPipelineDepthStencilStateCreateInfo::depthWriteEnable Fixes: dEQP-VK.renderpass.suballocation.subpass_dependencies.separate_channels.d24_unorm_s8_uint Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	966b14adf7	v3dv: fix sampling from stencil aspect of a combined depth/stencil image When sampling the stencil aspect we want to reinterpret the D24S8 format as RGBA8 and read stencil values from the R component. Fixes: dEQP-VK.renderpass.suballocation.formats.d24_unorm_s8_uint.input.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	abca6c97a0	v3dv/formats: properly return unsupported for 1D compressed textures Gets tests like the following one properly skipped: dEQP-VK.api.copy_and_blit.core.image_to_image.all_formats.color.1d.etc2_r8g8b8a8_unorm_block.etc2_r8g8b8a8_unorm_block.optimal_general Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	0df0449b65	v3dv: signal semaphore/fence if needed after acquiring a swapchain image Fixes: dEQP-VK.wsi..swapchain.acquire.too_many dEQP-VK.wsi..swapchain.acquire.too_many_timeout Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	e14cdd6eba	v3dv: do not expose VK_IMAGE_USAGE_SAMPLED_BIT for swapchains The display pipeline on the Rpi4 requires that images are linear and the 3D pipeline cannot sample from linear images. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	e6fb805f9a	v3dv: fix size computed by vkGetImageSubresourceLayout for 3D images Fixes: dEQP-VK.image.subresource_layout.3d.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	83336adaca	v3dv: fix offset computed by vkGetImageSubresourceLayout for array images Fixes: dEQP-VK.image.subresource_layout.2d_array.* Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	4cc7d6a85f	v3dv: expose DRM modifiers based on supported features So far we have only been exposing linear for WSI formats and UIF on everythig else, but we should instead expose linear or UIF based on whether the underlying format supports any features for the given layout. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	fdcc36aad4	v3dv: handle VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_IMAGE_DRM_FORMAT_MODIFIER_INFO When negotiating DRM modifiers, applications may use this to validate the features that are supported with a particular modifier. The WSI code in Mesa relies on this to validate its modifiers. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	287af8045f	v3dv/meta_copy: handle mirroring z component bliting 3D images By basing the tex_coord on the max layer, instead of min (similarly to what we do for mirroring x/y) Avoid all crashes, and get to Pass most of the following tests: dEQP-VK.api.copy_and_blit.core.blit_image.simple_tests.mirror_z_3d.* The only one failing is this one: dEQP-VK.api.copy_and_blit.core.blit_image.simple_tests.mirror_z_3d.nearest but looks that the core cause would be different, as there are other 3d nearests tests failing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	9518b677f1	v3dv: fix color clear pipeline destruction for 32-bit architectures Command buffer object destruction callbacks take 64-bit object handles, but we defined the color clear pipeline callback to take a 32-bit argument. Should fix recent crash regressions with some CTS tests on Rpi4. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	4acf5985a4	v3dv: hook up robust buffer access Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	23c727dd67	v3d/compiler: add a lowering pass for robust buffer access Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	4401dde0e9	broadcom/compiler: rename QUNIFORM_GET_BUFFER_SIZE to QUNIFORM_GET_SSBO_SIZE Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	6f55abcb8b	v3dv: handle QUNIFORM_GET_UBO_SIZE Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	d93d903a37	v3d/compiler: implement nir_intrinsic_get_ubo_size Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	f4c754bcd1	nir: add a nir_get_ubo_size intrinsic This is the same as nir_get_buffer_size but geared towards UBOs instead of SSBOs. The new intrinsic is useful in Vulkan backends that need to add bound checks on buffer accesses to honor the robust buffer access feature. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	4823313587	v3dV: move meta init/finish to meta implementation files Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00

... 2 3 4 5 6 ...

130088 commits