fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 20:08:06 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	20222cd956	anv: Use the new nir_opt_acquire_release_barriers pass Improves performance of Phasmophobia with the "Eye Adaptation" video setting enabled on Arc B570 by about 9.5%. fossil-db results on Battlemage: Totals: Instrs: 148797922 -> 148797865 (-0.00%) Send messages: 7066341 -> 7066317 (-0.00%) Cycle count: 21459978352 -> 21459975048 (-0.00%) Totals from 8 (0.00% of 574410) affected shaders: Instrs: 4633 -> 4576 (-1.23%) Send messages: 479 -> 455 (-5.01%) Cycle count: 611886 -> 608582 (-0.54%) Observed to cut 15% of sends in a Phasmophobia shader, 8.3% in a Far Cry New Dawn shader, 7% in a Borderlands 3 DX11 shader, and 3.4-3.7% of sends in a few Witcher 3 and Dark Souls 3 shaders. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33504>	2025-05-16 00:29:13 +00:00
José Roberto de Souza	cb6f96a1e8	anv: Remove a '#if GFX_VER >= 30' block inside of a else of '#if GFX_VERx10 >= 125' Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Removing deadcode. Reviewed-by: Lucas Fryzek <lfryzek@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34988>	2025-05-15 15:25:12 +00:00
José Roberto de Souza	37b42ef648	anv: Drop '#if GFX_VERx10 >= 125' inside of '#if GFX_VERx10 >= 125' This is just redundant. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34988>	2025-05-15 15:25:12 +00:00
José Roberto de Souza	3cd972a2d3	anv: Enable preemption due 3DPRIMITIVE in GFX 12 The issues preventing it to be enabled were fixed so now we can enable it but we need also to enable workaround 16013994831 back again. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34988>	2025-05-15 15:25:12 +00:00
José Roberto de Souza	2432d6677e	anv: Implement missing part of Wa_1604061319 Description of this workaround are not clear but looking at Iris implementation we need to emit all 3DSTATE_PUSH_CONSTANT_ALLOC_XS if any 3DSTATE_PUSH_CONSTANT_ALLOC_XS is emitted. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34988>	2025-05-15 15:25:12 +00:00
Hyunjun Ko	7ddf51dc99	anv: Fix to set CDEF filter flag correctly. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This fixes to play av1_intel_broken2.ivf. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34866>	2025-05-15 01:02:05 +00:00
Hyunjun Ko	2e256a3cee	anv: Allocate MV buffers enough for AV1 decoding. As other video memories for AV1 are already allocated for the maximum sizes, now it does the same for MV buffers too. This fixes a bunch of artifacts of AV1 playing. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34866>	2025-05-15 01:02:05 +00:00
Hyunjun Ko	f4d480f808	anv: Always allocate cdf tables when independent profiles provided Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34866>	2025-05-15 01:02:05 +00:00
Nanley Chery	4502254cd2	anv: Drop the slow clear heuristic Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This no longer provides a performance improvement. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	67d60f4325	intel/blorp: Simplify get_fast_clear_rect() for gfx12.5 Refactor the scale factors to highlight the 16-tile width requirement on Tile4. The fast-clear simulator code associated with HSD 1407682962 also contains a 16-tile requirement for Tile4 surfaces (for the pitch). Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	312952048b	intel/blorp: Redescribe gfx12.5 surfaces for CCS fast clears According to HSD 1407682962 and the associated simulator code, fast-clear performance can be affected by: image alignment, tiling, dimensionality, and row pitch. Redescribe surfaces in order avoid fast-clearing at a slower rate. Also, benchmarking the main patch in the performance CI (hw=A750) shows that some traces are helped significantly: * TotalWarWarhammer3 +5.58% (n=2) * Factorio +3.75% (n=1) * TerminatorResistance +3.3% (n=2) * Borderlands3 +3.23% (n=2) We could additionally increase the alignment requirements of surfaces in order to deterministically increase fast-clear performance. That's left out of this patch in order to avoid any functional pitfalls that can arise with increased memory consumption. As a result, performance will continue to be affected by how ISL/drivers/apps configure main surface memory alignments (directly or indirectly). Thanks to Lionel Landwerlin for pointing me to the relevant simulator code. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11168 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11418 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	169e22f962	intel/blorp: Drop clear color assignment prior to Xe2 This hasn't been used since the responsibility of clear color updates moved to the drivers. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	e353244553	intel/blorp: Disable repclear for gfx12 fast-clear Docs indicate that this shouldn't be used. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	8dad01903a	intel: Add and use isl_surf_image_has_unique_tiles() Returns whether or not a subresource range maps to a tile-aligned memory range which doesn't overlap other subresources. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:04 +00:00
Nanley Chery	fcdae4d4c0	intel: Add and use isl_surf_from_mem() Unify code which creates surfaces from buffers. The behavior is slightly changed to use array layers to enable arrayed buffer clears (as needed). Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:04 +00:00
Mauro Rossi	04a643d877	intel/compiler: use ffsll instead of ffsl in brw_vue_map.c `18bbcf9a` triggered the following building error in Android, simple fix is to use ffsll() as it was done before `18bbcf9a` to process uint64_t generics argument. Fixes the following building error: FAILED: src/intel/compiler/libintel_compiler.a.p/brw_vue_map.c.o ... ../src/intel/compiler/brw_vue_map.c:120:37: error: implicit declaration of function 'ffsl' is invalid in C99 [-Werror,-Wimplicit-function-declaratio n] const int first_generic_output = ffsl(generics) - 1; ^ ../src/intel/compiler/brw_vue_map.c:120:37: note: did you mean 'ffs'? /home/utente/r-x86_kernel/bionic/libc/include/strings.h:72:5: note: 'ffs' declared here int ffs(int __i) __INTRODUCED_IN_X86(18); ^ 1 error generated. Fixes: `18bbcf9a` ("intel: introduce new VUE layout for separate compiled shader with mesh") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34915>	2025-05-11 00:50:21 +02:00
Ian Romanick	338273dedd	brw/reg_allocate: Optimize spill offset calculation using integer MAD Gfx12.5 and later allow the use of two 16-bit immediate values in integer MAD. Gfx11 and Gfx12 allow a single immediate for integer MAD, but that is not helpful where. v2: brw_reg_alloc::build_lane_offsets is only used on Gfx12.5+, so the check around using integer MAD is unnecessary. No shader-db or fossil-db changes on any pre-Gfx12.5 platforms. shader-db: Lunar Lake, Meteor Lake, and DG2 had similar results. (Lunar Lake shown) total instructions in shared programs: 17119962 -> 17118441 (<.01%) instructions in affected programs: 65398 -> 63877 (-2.33%) helped: 32 / HURT: 0 total cycles in shared programs: 895433316 -> 895425578 (<.01%) cycles in affected programs: 13437376 -> 13429638 (-0.06%) helped: 30 / HURT: 2 fossil-db: Lunar Lake, Meteor Lake, and DG2 had similar results. (Lunar Lake shown) Totals: Instrs: 210052706 -> 209550074 (-0.24%) Cycle count: 31486266412 -> 31436238696 (-0.16%); split: -0.16%, +0.00% Totals from 7081 (1.00% of 707082) affected shaders: Instrs: 16864614 -> 16361982 (-2.98%) Cycle count: 6323185782 -> 6273158066 (-0.79%); split: -0.79%, +0.00% Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34886>	2025-05-09 21:31:09 +00:00
Ian Romanick	3db8dbfdc3	brw/reg_allocate: Optimize spill offset calculation using more SIMD8 Re-associate the calculation. The current calcuation is ((lane + zero_or_8) << 2) + offset The first addition is SIMD8, and the shift and second addition are SIMD16. By switching to ((lane << 2) + offset) + zero_or_32 All operations are SIMD8. The SHL operates directly on the UW 0x76543210UV value, and that eliminates the MOV to expand the UW to UD. v2: Switch to alternate method. Update for SIMD32 on Xe2. No shader-db or fossil-db changes on any pre-Gfx12.5 platforms. shader-db: Lunar Lake, Meteor Lake, and DG2 had similar results. (Lunar Lake shown) total instructions in shared programs: 17121519 -> 17119962 (<.01%) instructions in affected programs: 73208 -> 71651 (-2.13%) helped: 36 HURT: 0 helped stats (abs) min: 1 max: 129 x̄: 43.25 x̃: 56 helped stats (rel) min: 0.05% max: 4.92% x̄: 2.50% x̃: 2.79% 95% mean confidence interval for instructions value: -56.02 -30.48 95% mean confidence interval for instructions %-change: -3.24% -1.75% Instructions are helped. total cycles in shared programs: 895450146 -> 895433316 (<.01%) cycles in affected programs: 13709400 -> 13692570 (-0.12%) helped: 31 HURT: 2 helped stats (abs) min: 26 max: 1654 x̄: 543.10 x̃: 672 helped stats (rel) min: <.01% max: 3.43% x̄: 0.43% x̃: 0.51% HURT stats (abs) min: 2 max: 4 x̄: 3.00 x̃: 3 HURT stats (rel) min: <.01% max: <.01% x̄: <.01% x̃: <.01% 95% mean confidence interval for cycles value: -652.42 -367.58 95% mean confidence interval for cycles %-change: -0.61% -0.19% Cycles are helped. fossil-db: Lunar Lake, Meteor Lake, and DG2 had similar results. (Lunar Lake shown) Totals: Instrs: 210566294 -> 210052706 (-0.24%) Cycle count: 31582309052 -> 31486266412 (-0.30%); split: -0.30%, +0.00% Totals from 7091 (1.00% of 707082) affected shaders: Instrs: 17408115 -> 16894527 (-2.95%) Cycle count: 6443785290 -> 6347742650 (-1.49%); split: -1.49%, +0.00% Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34886>	2025-05-09 21:31:09 +00:00
Iván Briano	99405647a4	anv: vkCmdTraceRays* are not covered by conditional rendering Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The spec says: Certain rendering commands can be executed conditionally based on a value in buffer memory. These rendering commands are limited to drawing commands, dispatching commands, and clearing attachments with vkCmdClearAttachments within a conditional rendering block which is defined by commands vkCmdBeginConditionalRenderingEXT and vkCmdEndConditionalRenderingEXT. Other rendering commands remain unaffected by conditional rendering. It would seem that vkCmdTraceRays* are not covered by that. Fixes new tests dEQP-VK.conditional_rendering.conditional_ignore.trace_rays* Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34864>	2025-05-08 21:08:06 +00:00
Lionel Landwerlin	5c7c1eceb5	anv/brw: handle pipeline libraries with mesh I always thought there was a massive issue with pipeline libraries & mesh shaders. Indeed recent CTS tests have exposed a number of issues. Some values delivered to the fragment shader are coming from different places depending on whether the preceding shader is Mesh or not. For example PrimitiveID is delivered in the per-primitive block in Mesh pipelines whereas for other pipelines it's coming as a VUE slot (which is per-vertex). Those are 2 different locations in the payload. We have to find a layout for fragment shaders that is compatible with everything. Leaving gaps here and there in the thread payload. Fixes the following test pattern : dEQP-VK.mesh_shader.ext.smoke.fast_lib.shared_* Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:35 +00:00
Lionel Landwerlin	18bbcf9a63	intel: introduce new VUE layout for separate compiled shader with mesh Mesh shaders have per vertex block in URB pretty much identical to the VUE format. Let's just reuse that concept to do all of our layout in the payload attribute registers. This will ensure that we have consistent VUE layout between Mesh & non-Mesh pipelines. We need a new way of laying out the VUE though as we have to accomodate a HW constraint of maximum (per-primitive + per-vertex) of 32 varying. This means we cannot have 2 locations in the payload for things like PrimitiveID which can come from either the per-primitive or the per-vertex block. The new layout places the PrimitiveID at the end of the per-vertex attributes and shrinks the delivery dynamically if the mesh stage is active. The shader is compiled with a MOV_INDIRECT to read the PrimitiveID from the right location in the attributes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:35 +00:00
Lionel Landwerlin	2d396f6085	intel: prepare VUE layout for more than 2 layouts Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:35 +00:00
Lionel Landwerlin	95efdca00b	brw: add documentation pointers to FS attribute layout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:35 +00:00
Lionel Landwerlin	9d342081e7	brw/nir: add intrinsics to read attribute payload register indirectly Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:35 +00:00
Lionel Landwerlin	ef17fbf8e5	anv/brw: use separate_shader to deduced MUE compaction Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:35 +00:00
Lionel Landwerlin	6230f3029f	brw: fix brw_nir_move_interpolation_to_top In a case like this : block_0: %5 = ... %6 = ... block_1: %7 = load_interpolated_input %5, %6 The current logic would move load_interpolated_input to block_0 before %5 but not move %5 & %6 which are sources of that instruction. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	5ff1b31c3f	brw: document some brw_wm_prog_data fields Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	2f654ddd03	brw: use VARYING_BIT_* macros more Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	75b2d000fc	anv: tidy up (CLIP\|SBE)_MESH emission Moving it to is related functions. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	62d2e323ba	anv/brw: shrink FS varying payload We're currently allocating payload spots for 3 fields already delivered somewhere else in the payload. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	c467444670	brw/nir: use a new intrinsic for fs_msaa_flag Avoid NIR code doing offset computations. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	dd1ef73aae	brw: use newer NIR constructs nir_shader_intrinsics_pass() & NIR_PASS() Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	b64f237dc4	brw: move helper to brw_nir.c Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	cbbe7ff66e	brw: add new helper to print out FS URB setup Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	b8a80c88cb	brw: improve VUE printout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	4f10a1f618	anv: switch to brw helpers to figure out if a fragment is dynamic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	cb461fa287	anv: switch to use the tcs_prog_data for dynamic input vertices Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	7f500cc6e4	brw: store input_vertices on tcs_prog_data Will allow the driver to know if the vertices count is dynamic. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	a9ee498347	brw: add helpers to check if a fragment shader execution is dynamic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	4717382f84	anv: lower input vertices for TCS unconditionally Take the opportunity to reuse the backend pass. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Lionel Landwerlin	119ef792c5	anv: remove tbimr workaround check Already handled by having a special range created Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Emma Anholt	7c130e5dcf	intel/ds: Fix formatting of stage index. draws had been bumped to stage #10, so they ended up appearing nested between frame (#1) and cmdbuf (#2), instead of nested under them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22350>	2025-05-08 01:21:25 +00:00
Emma Anholt	4cc66123ec	anv/ds: Forward VkDebugUtilsObjectNameInfoEXT to perfetto. This gets us names on zink/wsi command buffers in perfetto, but may also be useful some day for getting app names onto framebuffers and non-dynamic renderpasses. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22350>	2025-05-08 01:21:25 +00:00
Emma Anholt	55d788f434	anv/ds: Associate the VkCommandBuffer some anv-only renderstage events. This means the perfetto UI will have a non-zero/NULL handle/name in the UI on these renderstages. Unfortunately, intel/ds is outside of vulkan so unless we pull in anv headers, we can't just pass in the anv_cmd_buffer. This also means it would be much more painful to pass the cmd buffer to the rest of the events, so they'll still have unset command buffers. Still, being able to see the name of the command buffer in at least one of the events should be useful once that's glued together. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22350>	2025-05-08 01:21:25 +00:00
Emma Anholt	546a100f26	intel/ds: Move "have we already sent initial state?" into the helper. I'm going to have to send initial state from another function too, shortly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22350>	2025-05-08 01:21:25 +00:00
Emma Anholt	dd81420ef1	perfetto: Create a common MesaRenderpassIncrementalState. ... and explain what its role is. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22350>	2025-05-08 01:21:25 +00:00
Sagar Ghuge	bb61a78911	anv: Fix untyped data port cache pipe control dump output Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fixes: `845ab3d627` Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34855>	2025-05-07 19:29:50 +00:00
Lionel Landwerlin	c434050a00	brw: add pre ray trace intrinsic moves Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Some intrinsics are implemented by reading memory location that could be rewritten by a further tracing calls. So we need to move those reads prior to tracing operations in the shaders. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8979 Tested-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34214>	2025-05-06 13:34:53 +00:00
Lionel Landwerlin	37608c075f	anv: promote VK_EXT_robustness2 to VK_KHR_robustness2 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34821>	2025-05-06 13:16:13 +00:00
Hyunjun Ko	86d21fd2cf	anv: Set tc/beta offset according to the flag from PPS. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Consider the flag from PPS when setting tc/beta offset. This fixes some artifacts when decoding a hevc video, hevc_scaling_list4.mkv from Lynne. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34782>	2025-05-06 04:24:22 +00:00

1 2 3 4 5 ...

14021 commits