fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-28 01:18:15 +02:00

Author	SHA1	Message	Date
Marek Olšák	bfc37e7c63	amd: unify and tune the attribute ring size for gfx11 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:02 +00:00
Marek Olšák	e25f08baf2	radeonsi: never set INTERPOLATE_COMP_Z based on PAL Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:02 +00:00
Marek Olšák	d087b3ec3c	radeonsi: determine alpha_to_coverage robustly in si_update_framebuffer_blend_rasterizer Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	f2923168ba	radeonsi: merge si_ps_key_update_framebuffer_blend & .._update_blend_rasterizer Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	a29218b557	radeonsi/gfx11: always set MSAA_NUM_SAMPLES=0 for DCC_DECOMPRESS hw requirement Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	8532cb8e7e	radeonsi: deduplicate VS/TES/GS update code Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	711c4bddb2	radeonsi/gfx11: use new packet EVENT_WRITE_ZPASS Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	4664b22f65	radeonsi/gfx11: move the PIXEL_PIPE_STAT_CONTROL event into the GFX preambles Both the normal and shadowing preamble should do this. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	383269238d	radeonsi/gfx11: fix blend->cb_target_mask dependency for shader keys Shader keys only use cb_target_enabled_4bit. This may cause shaders to be updated less often, but otherwise no change in behavior. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	d5ff270e0b	radeonsi/gfx11: adjust ACCUM_* fields for tessellation based on PAL Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	0b4b309fc6	radeonsi/gfx11: add a comment why we use PRIM_GRP_SIZE <= 252 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	d21850f753	radeonsi/gfx11: remove the INST_PREF_SIZE workaround The hw does the right thing automatically. (i.e. enables or disables the feature) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	f6c30af00c	radeonsi: implement RB+ depth-only rendering for better perf The explanation is in the last change of this commit. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	2fc03e479b	amd: improve RB+ blending precision Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	b6f6465264	amd: update SX_BLEND_OPT_EPSILON.MRT0_EPSILON enum definitions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	84d59cdb59	amd: split GFX1103 into GFX1103_R1 and GFX1103_R2 Fixes: `caa09f66ae` - amd: add chip identification for gfx1100-1103 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	66d11391f7	radeonsi/gfx11: unset SAMPLE_MASK_TRACKER_WATERMARK to fix hangs Same as PAL. Fixes: `529eb739fc` - radeonsi/gfx11: add CB deltas Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	8556b3db71	radeonsi: fix RB+ blending with sRGB formats The epsilon for 8bpc is for the linear colorspace. There is no epsilon for sRGB. Fixes: `17021efc74` - radeonsi: adjust RB+ blend optimization settings Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	dacb111607	radeonsi/ci: add gfx1100 results There are also a lot of flakes. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Marek Olšák	6445d2eca9	radeonsi/ci: update gfx10.3 results Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21041>	2023-02-03 00:18:01 +00:00
Lucas Stach	175732bb51	etnaviv: fix double scanout import of multiplanar resources etna_resource_from_handle() is called for each plane of a multiplanar resource, so there is no point in looping over all planes to do the renderonly scanout import. In fact that will cause us to lose track of the scanout imports from later planes when the earlier planes are redoing the import, overwriting the pointer to the allocated renderonly_scanout struct. Drop the loop and just do the import for the current plane. Fixes: `826f95778a` ("etnaviv: always try to create KMS side handles for imported resources") Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20993>	2023-02-02 19:08:29 +00:00
Emma Anholt	8839baee57	ci: Drop the itoral-gl-terrain demo from traces. There's an app bug in the CSM rendering that causes undefined results. Fixes: #8212 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21055>	2023-02-02 18:42:45 +00:00
Mike Blumenkrantz	5a40190f04	Revert "zink: fix zink_mem_type_idx_from_bits()" This reverts commit f7796997964bb462bcbfa6b9faca5dcf04b64e1b. I was doing too much F2F and not enough thinking with this one Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21076>	2023-02-02 16:27:38 +00:00
Rose Hudson	0d4e375a58	asahi: wire up shader disk cache support Note: I (Alyssa) have squashed in some minor changes squashed in pre merge. The rest is Rose's work :-) Closes: #8091 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20835>	2023-02-02 16:12:33 +00:00
Mike Blumenkrantz	68e914a4ca	zink: rework descriptor buffer templating to use offsets compute programs can be reused across contexts, which means storing any pointers directly like this is going to lead to desync and crash instead, make this like regular descriptor templates and calculate the offset from the current context to ensure that everything works as it should fixes #8201 Fixes: `7ab5c5d36d` ("zink: use EXT_descriptor_buffer with ZINK_DESCRIPTORS=db") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21020>	2023-02-02 12:59:15 +00:00
Asahi Lina	ed6edc07e4	asahi: Split off macOS support into its own file All the ifdef __APPLE__ is getting really silly. Let's split off the macOS UAPI abstraction into its own file, so we can have parallel implementations. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21058>	2023-02-02 11:45:52 +00:00
Alyssa Rosenzweig	ea285aea8d	asahi: Use non-UAPI specific BO create flags So we're not tied to the macOS or Linux UAPIs and are not translating awkwardly from one to the other when creating BOs. They're not quite equivalent -- macOS doesn't include writeback information in this flag field, and Linux doesn't have a executable flag. (Maybe we should add one, though? Then we can enforce W^X.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21058>	2023-02-02 11:45:52 +00:00
Martin Roukala (né Peres)	9e2365708b	zink/ci: allow running manual jobs again on RADV Fixes: `f6c06ef2f6` ("ci: Add manual rules variations to disable.") Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21036>	2023-02-02 12:18:33 +02:00
Mike Blumenkrantz	d23b3a1394	zink: fix zink_mem_type_idx_from_bits() at some point this used to work, but it no longer does what it's supposed to do, which is return a memtype from a heap+flags Fixes: `d702a503ad` ("zink: support multiple heaps per memory type") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21025>	2023-02-02 05:04:17 +00:00
Mike Blumenkrantz	ff5a761232	zink: only set VkPipelineColorBlendStateCreateInfo::attachmentCount without full ds3 this should be ignored by drivers/layers, but it isn't, and the crashing is immense Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21025>	2023-02-02 05:04:17 +00:00
Mike Blumenkrantz	fd0562693d	lavapipe: try harder to reuse pipeline layouts during merge the original code was quite conservative and always created a new layout, but many times this is unnecessary, and the original layout can just be refcounted since it doesn't need to be merged Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21051>	2023-02-02 04:49:42 +00:00
Mike Blumenkrantz	a1a859328b	lavapipe: delete lvp_pipeline::mem_ctx this is no longer used Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21051>	2023-02-02 04:49:42 +00:00
Mike Blumenkrantz	59af3b4ad4	lavapipe: delete unused pipelines immediately deferring these can cause memory ballooning and oom Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21051>	2023-02-02 04:49:42 +00:00
Mike Blumenkrantz	408606af02	lavapipe: create gfx gallium csos at pipeline bind this should minimize pipeline creation time and make fast-linking "fast" Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21051>	2023-02-02 04:49:42 +00:00
Mike Blumenkrantz	6f0303ba76	lavapipe: break out (and slightly refactor) gallium shader cso creation there's also now a(n unused) flag to indicate that the csos have been created Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21051>	2023-02-02 04:49:42 +00:00
Mike Blumenkrantz	4031098b85	lavapipe: refcount nir shaders instead of cloning this is just about ownership, not modification, so refcounting saves time Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21051>	2023-02-02 04:49:42 +00:00
Mike Blumenkrantz	3770eaab73	lavapipe: add refcounting for shader nir Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21051>	2023-02-02 04:49:42 +00:00
Mike Blumenkrantz	453f49ce6d	lavapipe: move noop fs creation to device this avoids creating a separate noop fs for every pipeline Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21051>	2023-02-02 04:49:42 +00:00
Chia-I Wu	dc7f6c5324	freedreno: support UBWC scanout On sway+xwayland, both explicit and implicit modifiers are advertised. While dri3proto says nothing about it, zwp_linux_dmabuf_v1 says A compositor that sends valid modifiers and DRM_FORMAT_MOD_INVALID for a given format supports both explicit modifiers and implicit modifiers. "glmark2 -b build:model=bunny --fullscreen" goes from 468 to 598fps on a618 @ 2160x1440. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20892>	2023-02-02 04:33:25 +00:00
Chia-I Wu	1cf28bd049	freedreno: add has_implicit_modifier helper Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20892>	2023-02-02 04:33:25 +00:00
Kenneth Graunke	a0e7e7ff41	iris: Perform load_constant address math in 32-bit rather than 64-bit We lower NIR's load_constant to load_global_constant, which uses A64 bindless messages. As such, we do the following math to produce the address for each load: base_lo@32 <- BRW_SHADER_RELOC_CONST_DATA_ADDR_LOW base_hi@32 <- BRW_SHADER_RELOC_CONST_DATA_ADDR_HIGH base@64 <- pack_64_2x32_split(base_lo, base_hi) addr@64 <- iadd(base@64, u2u64(offset@32)) On platforms that emulate 64-bit math, we have to emit additional code for the 64-bit iadd to handle the possibility of a carry happening and affecting the top bits. However, NIR constant data is always uploaded adjacent to the shader assembly, in the same buffer. These buffers are required to live in a 4GB region of memory starting at Instruction State Base Address. We always place the base address at a 4GB address. So the constant data always lives in a buffer entirely contained within a 4GB region, which means any offsets from the start of the buffer cannot possibly affect the high bits. So instead, we can simply do a 32-bit addition between the low bits of the base and the offset, then pack that with the unchanged high bits. On iris, IRIS_MEMZONE_SHADER is at [0, 4GB) so the high bits are always zero. We don't even need to patch that portion of the address and can simply use u2u64 to promote the 32-bit add result to a 64-bit value where the top bits are 0. shader-db on Icelake indicates that this: - Helps instructions: -1.13% in 135 affected programs - Helps spills/fills: -4.08% / -4.18% in 4 affected programs - Gains us 1 SIMD16 compute shader instead of SIMD8 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20999>	2023-02-02 02:45:04 +00:00
Sil Vilerino	37652da616	d3d12: Honor suggested driver profile/level for H264/HEVC encode Fixes some H264 <-> HEVC transcode cases where the wrong level/profile was assigned to the output bitstream Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21043>	2023-02-01 19:17:21 +00:00
José Roberto de Souza	8092bc2158	intel/ds: Fix crash when allocating more intel_ds_queues than u_vector was initialized u_vector_add() don't keep the returned pointers valid. After the initial size allocated in u_vector_init() is reached it will allocate a bigger buffer and copy data from older buffer to the new one and free the old buffer, making all the previous pointers returned by u_vector_add() invalid and crashing the application when trying to access it. This is reproduced when running dEQP-VK.synchronization.signal_order.timeline_semaphore.* in DG2 SKUs that has 4 CCS engines, INTEL_COMPUTE_CLASS=1 is set and of course perfetto build is enabled. To fix this issue here I'm moving the storage/allocation of struct intel_ds_queue to struct anv_queue/iris_batch and using struct list_head to maintain a chain of intel_ds_queue of the intel_ds_device. This allows us to append or remove queues dynamically in future if necessary. Fixes: `e760c5b37b` ("anv: add perfetto source") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20977>	2023-02-01 18:31:29 +00:00
Rob Clark	e29001d0e7	freedreno/a6xx: Remove excess CS flushing Also requires fixing where we emit barriers, and flushing pending barriers at the end of the batch. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20975>	2023-02-01 17:28:41 +00:00
Rob Clark	9b22bdc956	freedreno/a6xx: Also FLUSH_CACHE on image barrier For the same reason we need to on an UPDATE_BUFFER barrier. Fixes KHR-GLES31.core.compute_shader.pipeline-post-fs once the hard-coded cache-flush is removed from launch_grid path. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20975>	2023-02-01 17:28:41 +00:00
Rob Clark	23e65c6084	freedreno/a6xx: Make shader state independent of grid info Eventually we want to move this into a state group, so we can pre-bake the cmdstream and re-emit it via CP_SET_DRAW_STATE when it is dirty. But in order to do that it needs to not depend on grid info. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20975>	2023-02-01 17:28:41 +00:00
Rob Clark	1faf7133d4	freedreno: Don't open-code setting dirty CS state There is actually no issue with setting FD_DIRTY_PROG, since all state is marked dirty when we switch from compute to 3d. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20975>	2023-02-01 17:28:41 +00:00
Rob Clark	5a37cd8569	freedreno/a6xx: Don't double-write SP_CS_OBJ_START Also SP_CS_INSTRLEN. This is already done in fd6_emit_shader(). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20975>	2023-02-01 17:28:41 +00:00
Rob Clark	a063caa46a	freedreno: Skip flush_resource with explicit sync Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20975>	2023-02-01 17:28:41 +00:00
Rob Clark	2503e22717	freedreno: nondraw-batch Allow multiple compute grids to be combined into a single non-draw batch. This will allow us to optimize state emit and remove excess flushing between compute jobs. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20975>	2023-02-01 17:28:41 +00:00
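
The address-math argument in commit a0e7e7ff41 ("iris: Perform load_constant address math in 32-bit rather than 64-bit") can be illustrated in plain C. This is only a sketch of the arithmetic, not Mesa code: the real change operates on NIR instructions, and the function names below (addr_64bit, addr_32bit, addr_32bit_zero_hi, check_equivalence) are hypothetical. It assumes, as the commit message states, that the buffer lies entirely within one 4GB-aligned region, so adding the offset to the low 32 bits never carries.

```c
#include <assert.h>
#include <stdint.h>

/* Reference form from the commit message:
 *   base@64 <- pack_64_2x32_split(base_lo, base_hi)
 *   addr@64 <- iadd(base@64, u2u64(offset@32))
 */
static uint64_t addr_64bit(uint32_t base_lo, uint32_t base_hi, uint32_t offset)
{
   uint64_t base = ((uint64_t)base_hi << 32) | base_lo;
   return base + (uint64_t)offset;
}

/* Cheaper form: add in 32 bits, then pack with the unchanged high bits.
 * Only equivalent when base_lo + offset does not wrap past 2^32. */
static uint64_t addr_32bit(uint32_t base_lo, uint32_t base_hi, uint32_t offset)
{
   uint32_t lo = (uint32_t)(base_lo + offset);
   return ((uint64_t)base_hi << 32) | lo;
}

/* Case described for iris: the shader memzone is [0, 4GB), so the high
 * bits are zero and a plain zero-extension (u2u64) is enough. */
static uint64_t addr_32bit_zero_hi(uint32_t base_lo, uint32_t offset)
{
   return (uint64_t)(uint32_t)(base_lo + offset);
}

/* Hypothetical self-check: both forms agree whenever the precondition
 * (no carry out of the low 32 bits) holds. */
static void check_equivalence(uint32_t base_lo, uint32_t base_hi, uint32_t offset)
{
   /* Precondition: the buffer does not cross a 4GB boundary. */
   assert((uint64_t)base_lo + offset <= UINT32_MAX);
   assert(addr_32bit(base_lo, base_hi, offset) ==
          addr_64bit(base_lo, base_hi, offset));
}
```

If the precondition is violated (for example base_lo = 0xFFFFFFF0 with offset = 0x20), the two forms diverge because the 32-bit add drops the carry. That is why the optimization depends on the placement guarantee described in the commit message.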


58341 commits