fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-03-06 03:30:30 +01:00

Author	SHA1	Message	Date
Samuel Pitoiset	176befe439	radv/meta: fix potential memleak when creating DCC retile pipelines If the driver needs to create two different DCC retile pipelines which is based on the image swizzle, it will just overwrite the existing layouts. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	8e53114de3	radv/meta: fix potential race condition when creating DCC retile pipelines Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	0906b64724	radv/meta: rework creating copy expand pipelines Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	67d720d115	radv/meta: create fmask copy layouts regardless on-demand Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	ee1bc2e821	radv/meta: rework creating FMASK expand pipelines Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	f1ec223cd2	radv/meta: create fmask expand layouts regardless on-demand To be consistent with other meta operations. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	4bef832b10	radv/meta: rework creating blit pipelines Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	ad82a338b3	radv/meta: cleanup meta_emit_blit() Passing the image and the image view is useless. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	89dc316720	radv/meta: move the locking around creating blit pipelines Only the on-demand path needs to be locked. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:33 +00:00
Samuel Pitoiset	e8683cae01	radv/meta: cleanup radv_device_init_meta_blit_{color,depth,stencil]() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	20be729636	radv/meta: create the fill/copy pipelines on-demand Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	c57987afc7	radv/meta: separate creating the fill/copy pipelines Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	5f2cbc3ab9	radv/meta: cleanup creating the compute depth decompress pipeline Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	b3d9afe44d	radv/meta: create the compute depth decompress pipeline on-demand It was always compiled when the device is created. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	f3e7c7e19f	radv/meta: rework creating the gfx depth decompress pipeline Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	b54dc6a29a	radv/meta: remove unused parameter to radv_get_depth_pipeline() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	38567300d4	radv/meta: move locking around the gfx depth decompress pipeline Locking is only needed when the pipeline is created on-demand, so move that to the caller. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	7c94ed0394	radv/meta: avoid potential NULL deref with the gfx depth decompress pipeline If the pipeline failed to be created. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	9c1fa23adf	radv/meta: remove the depth resummarize operation This has never been used and if we need at some point, we can just re-introduce it. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	76bf65b613	radv/meta: rework creating the VRS copy HTILE pipeline Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Samuel Pitoiset	f5c743e9e9	radv/meta: fix potential race condition when creating the copy VRS pipeline This could lead to a race condtion if two command buffers are recorded at the same time because it's accessing the device meta state. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30233>	2024-07-17 21:55:32 +00:00
Mike Blumenkrantz	f24742e8dc	winsys/radeon: revert recent changes This reverts commit `f673e2bf68`. This reverts commit `216ff9591b`. This reverts commit `ec2451fcb3`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30228>	2024-07-17 18:48:11 +00:00
Vinson Lee	cc9503206e	panvk: Fix assert Fix defect reported by Coverity Scan. Assign instead of compare (PW.ASSIGN_WHERE_COMPARE_MEANT) assign_where_compare_meant: use of "=" where "==" may have been intended Fixes: `0e74b6eda9` ("panvk: Add support for layered rendering") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30181>	2024-07-17 18:27:50 +00:00
José Roberto de Souza	0500e35165	intel/dev: Drop writeback_incoherent from Xe2 Xe2 platforms are only supported by Xe KMD that do not support CPU WB + 0 way coherent. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	6d77dfa75d	intel/dev: Use GPU WB PAT for Xe2 writecombining So for this entry we want the CPU mapping to be WC but GPU caches can be WB. This way GPU don't need to snoop to CPU caches and at the end of workloads L3 cache is flushed, so CPU access is coherent after get the signal that workload was finished. With this the transient(XD) L3 flushes will only affect displayable buffers. Ref: Bspec 71582 (r59285) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	48da8eab55	intel/dev: Add comment documenting the PAT entries Like said in the past patch, coherency is not needed and there was a miss understating about caching used by CPU and GPU. With this new comment it much better explained. Ref: Bspec 45101 (r51017) Ref: Bspec 71582 (r59285) Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	7295e09b53	intel/dev: Drop coherency from intel_device_info_pat_entry It is not used in run-time so we can drop from the struct. It might have value as PAT entries documentation but that will be done in the next patch. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	fa1129540a	intel/dev: Add documentation about intel_device_info_pat_entry::mmap My initial understating was that L3_CACHE_POLICY would be the CPU caching mode but that has nothing to do with CPU caching, it is the GPU caching mode. Due this miss understating we were using a not optimal PAT index that will be fixed in the next patches, so to avoid such issues in future adding comments to intel_device_info_pat_entry struct. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	4173e0f910	intel/dev: Drop DG1 PAT entries It inherents that table from TGL. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
José Roberto de Souza	178950bf9b	anv: Fix return of PAT index for compressed bos for discrete GPUs Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29950>	2024-07-17 17:41:32 +00:00
Samuel Pitoiset	619bcd3b5c	radv: allow to capture with RGP on GFX11_5 It works fine. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30225>	2024-07-17 16:25:19 +00:00
Samuel Pitoiset	e2882ea3e2	ac/rgp: assume GFX11_5 use the same SQTT/RGP versions as GFX11 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30225>	2024-07-17 16:25:19 +00:00
Samuel Pitoiset	664a31bcd1	radv: disable SPM trace on GFX11_5 SPM needs performance counters and they aren't exposed yet. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30225>	2024-07-17 16:25:19 +00:00
Samuel Pitoiset	057c4e3786	radv: expose BufferFloat32AtomicMinMax on GFX11_5 This is supported like GFX11. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30225>	2024-07-17 16:25:19 +00:00
Samuel Pitoiset	d2ae0c9ef8	radv: fix programming DB_RENDER_CONTROL for NULL depth/stencil on GFX11_5 It should be programmed like GFX11. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30225>	2024-07-17 16:25:19 +00:00
Samuel Pitoiset	1c5779250b	radv: do not expose ImageFloat32AtomicMinMax on GFX11_5 These opcodes aren't supported on GFX11-11.5. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30225>	2024-07-17 16:25:19 +00:00
Georg Lehmann	eff0998064	radv: use radv_nir_opt_tid_function to create inverse_ballot Foz-DB Navi21: Totals from 542 (0.68% of 79395) affected shaders: Instrs: 617316 -> 616259 (-0.17%); split: -0.19%, +0.02% CodeSize: 3347852 -> 3320040 (-0.83%); split: -0.85%, +0.02% VGPRs: 21864 -> 21824 (-0.18%); split: -0.29%, +0.11% SpillSGPRs: 207 -> 199 (-3.86%) Latency: 4900847 -> 4895665 (-0.11%); split: -0.11%, +0.01% InvThroughput: 860278 -> 857272 (-0.35%); split: -0.35%, +0.00% SClause: 21251 -> 21169 (-0.39%); split: -0.40%, +0.01% Copies: 57759 -> 58881 (+1.94%); split: -0.06%, +2.00% Branches: 20854 -> 20365 (-2.34%); split: -2.36%, +0.01% PreSGPRs: 20785 -> 20774 (-0.05%) PreVGPRs: 17309 -> 17212 (-0.56%) VALU: 379885 -> 378180 (-0.45%); split: -0.45%, +0.00% SALU: 87522 -> 88664 (+1.30%); split: -0.02%, +1.32% Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24650>	2024-07-17 15:04:38 +00:00
Georg Lehmann	39de178656	radv: use radv_nir_opt_tid_function for shuffles The main motivation were open coded clustered inclusive scans and clustered broadcasts in the gdeflate decompression shader used by DirectStorage. Foz-DB Navi21 (only the_last_of_us_part1 is affected): Totals from 8 (0.01% of 79395) affected shaders: Instrs: 6230 -> 5438 (-12.71%) CodeSize: 33376 -> 29148 (-12.67%) Latency: 77017 -> 72917 (-5.32%) InvThroughput: 10190 -> 9280 (-8.93%) Copies: 566 -> 569 (+0.53%) PreSGPRs: 528 -> 524 (-0.76%) PreVGPRs: 232 -> 230 (-0.86%) VALU: 2889 -> 2616 (-9.45%) SALU: 1748 -> 1491 (-14.70%); split: -14.82%, +0.11% Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24650>	2024-07-17 15:04:38 +00:00
Georg Lehmann	ca88783318	radv/nir: add a pass to optimize shuffle/booleans dependent only on tid/consts This pass uses constant folding to determine which invocation is read by shuffle for each invocation. Then, it detects patterns in the result and uses more a specialized intrinsic if possible. For booleans it creates inverse_ballot. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24650>	2024-07-17 15:04:38 +00:00
Georg Lehmann	2d3f536174	aco,nir: add dpp16_shift_amd intrinsic Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24650>	2024-07-17 15:04:38 +00:00
Faith Ekstrand	1f430b1111	nak/nir: Make interpolate_at_sample more efficient Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Faith Ekstrand	24d5acf052	nak/nir: Use prmt for barycentric offset lowering Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Faith Ekstrand	fffbd3ff2b	nak/nir: Use prmt in texture lowering Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Faith Ekstrand	bbccbd8d50	nir,nak: Add a nir_op_prmt_nv We have this in hardware since forever and it's really useful. May as well add it to NIR so we can use it in various lowerings. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Faith Ekstrand	3619ec9630	nak: Don't print the destination of OpIpa twice While we're here, also implement Display for InterpFreq and InterpLoc and simplify printing a bit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Faith Ekstrand	ef88597ebb	nak/copy_prop: Ignore the top 16 bits of OpPrmt::sel Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Faith Ekstrand	f949c00170	nak/copy_prop: Propagate OpSel with a selector of SrcRef::Zero Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Faith Ekstrand	cc33cafcac	nak/nir: Use an indirect load for sample locations A single ldc is probably more efficient than a 64-bit load and the pile of math we were generating before. The only reason for the old method was that it let us avoid indirect cbuf loads because we didn't support them for a while. Now that we can support all cbuf loads, we can just do an indirect 1B load and call it good. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30218>	2024-07-17 13:38:24 +00:00
Mike Blumenkrantz	f673e2bf68	winsys/radeon: switch to rendernode when card node doesn't work initializing the winsys from a /dev/dri/cardX node (as discovered by gbm) doesn't work, as the kernel abi expects a render node thus, the winsys needs to open the card's rendernode and use that everywhere except when importing buffers, where it has to explicitly export from the card node and import to the rendernode Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30224>	2024-07-17 13:20:48 +00:00
Mike Blumenkrantz	216ff9591b	winsys/radeon: wrap fd access with util function no functional changes Acked-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30224>	2024-07-17 13:20:48 +00:00

1 2 3 4 5 ...

177969 commits