fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 02:48:07 +02:00

Author	SHA1	Message	Date
José Roberto de Souza	81a5512565	intel/blorp: Remove duplicated calls in blorp_exec_compute() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We can have only one of those calls before the 'if GFX_VERx10 >= 125' block. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39362>	2026-01-19 15:09:29 +00:00
Paulo Zanoni	b52b1a08bf	intel/blorp: add blorp_shaders.cl This gives us the infrastructure that allows us to slowly migrate pieces of blorp shaders from NIR to OpenCL, which, IMHO, are much easier to read. We can't fully migrate everything due to all the conditional building we do with these shaders, but I'm sure we'll find opportunities to replace some NIR with OpenCL eventually. The conversion of blorp_check_in_bounds() serves as the first example. I also plan to have the shaders from the new indirect copy extension be OpenCL shaders (mixed with some NIR as well), so having this patch merged now will reduce the diff for the extension later. Thanks to Alyssa Rosenzweig for her help here. v2: - Use SPDX (Alyssa). - Use nir_trim_vector() (Alyssa). - Adjust CL variable declaration (Alyssa). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	f047f0b1be	intel/blorp: unionize blorp_params->wm_inputs We have two distinct code paths sharing blorp_params->wm_inputs for different purposes: the code from blorp_blit.c and the code from blorp_clear.c. While blorp_blit.c uses most of the parameters (all except clear_color), blorp_clear.c only uses clear_color and bounds_rect. Split the parameters in two structs: one for blits and the other for clears. This not only helps save some space in the shader inputs, but it also organizes things so it's more clear which parameters are used by what. In addition, my plan is to later add struct blorp_wm_inputs_indirect, which won't share anything that the others use, and would otherwise grow the struct even more. This change would reduce the size of struct blorp_wm_inputs from 96 to 80, but we have to add padding due to the assertion that compares it to cs_prog_data->push.cross_thread.size. Still good, though. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	a8dd4382bf	intel/blorp: generate the fast_clear_surf shaders later Because blorp_params_get_clear_kernel() calls blorp_params_get_clear_kernel_cs(), which reads params->num_samples, which we have not properly set yet at this point. I am also planning to have the functions that create the shader to rely on params.op, which we have not set yet either. I found this by inspection (when writing another patch), I'm not sure if this fixes something relevant, but it may be relevant to ver >= 30 multi-sampled cases. Fixes: `de0c547448` ("blorp: Handle 2D MSAA array image copies on compute shader") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	e360afdb8a	intel/blorp: blorp_blit_vars_init() doesn't need 'key' Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	39a78f764a	blorp: reorganize struct blorp_params When I first looked at this struct, my tiny little brain felt overwhelmed. - Add some white spaces in order to group the parameters into "logical" groups so it's easier to reason about everything. - Change the parameter order just a little bit - without breaking the logical groups - so the struct size decreases by 1.7% to 1864 bytes. - Add a comment explaining what the void * pointers point to. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	c98f5e9994	blorp: replace magic '2' with BLORP_NUM_BT_ENTRIES If we ever add more entries, things won't explode. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	814cfa909d	blorp: fix argument indentation I'm sorry, but I have OCD and the rest of the file is nicely aligned. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Lionel Landwerlin	faa857a061	intel: rework push constant handling Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details nr_params & params array are gone. brw_ubo_range is not stored on the prog_data structure anymore (Anv already stored a copy of that with its own additional information) The backend now only deals with load_push_data_intel. load_uniform & load_push_constant have to be lowered by the driver. Pre Gfx12.5 platforms have to provide a subgroup_id_param to specify where the subgroup_id value is located in the push constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:52 +00:00
Lionel Landwerlin	f4a0e05970	anv/brw/iris: get rid of param array on prog_data Drivers can do all the lowering to push constants to find the only value useful in that array (subgroup_id). Then drivers call into brw_cs_fill_push_const_info() to get the cross/per thread constant layout computed in the prog_data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:51 +00:00
Sagar Ghuge	de0c547448	blorp: Handle 2D MSAA array image copies on compute shader We are passing number of layers as inline parameter register, so figure out z_pos and write to 2D MSAA array images in compute shader. We already get component X, Y and sample index, all we needed was the number of layers. Ken: - Use load/store var instead of derefs Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33905>	2025-12-17 05:34:02 +00:00
Sagar Ghuge	080d28a03e	blorp: Set persample_msaa_dispatch for render shader Only 3D shader gets dispatched per sample not the compute shader. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33905>	2025-12-17 05:34:02 +00:00
Yonggang Luo	ecb0ccf603	treewide: Replace calling to function ALIGN with align This is done by grep ALIGN( to align( docs,*.xml,blake3 is excluded Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38365>	2025-11-12 21:58:40 +00:00
Lionel Landwerlin	c478b6355a	anv/blorp/iris: rework Wa_14025112257 Drivers already have to track this workaround, so remove the logic from Blorp and let the driver manage this. Also in Anv don't accumulate this workaround, emit it directly in place right after COMPUTE_WALKER. Accumulating can be problematic when you want to dispatch concurrent compute shaders that do not need any cache flush interaction (typical example with the internal simple_shader framework). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `3e0ad0176b` ("anv: Emit state cache invalidation after every compute dispatch") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38306>	2025-11-10 08:57:06 +00:00
Alyssa Rosenzweig	5f53e6edc0	intel: use util_is_aligned more Coccinelle + filtering hunks manually. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Mary Guillemard <mary.guillemard@collabora.com> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38169>	2025-10-31 15:03:58 +00:00
Kenneth Graunke	73cbb35442	brw: Move into a new src/intel/compiler/brw subdirectory Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This keeps the directory structure a bit more organized: - brw specific code - elk specific code - common NIR passes that could be used in both places It also means that you can now 'git grep' in the brw directory without finding a bunch of elk code, or having to "grep thing b*". Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37755>	2025-10-09 07:01:47 +00:00
Tapani Pälli	cb822a323f	anv/blorp: add missing cs stall on compute pipe control Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37715>	2025-10-08 04:49:27 +00:00
Tapani Pälli	e2697d717f	intel/blorp: add restriction for gfx12 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37731>	2025-10-08 04:26:46 +00:00
Tapani Pälli	c8f47d7681	blorp: add missing pipecontrol after 3DSTATE_WM_HZ_OP for Xe2+ Backport-to: 25.2 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37547>	2025-09-26 10:07:18 +00:00
Lionel Landwerlin	bddfbe7fb1	brw/blorp: lower MCS fetching in NIR One advantage here of moving a bunch of stuff to NIR is that we can now have consistent payload types straight from the NIR conversion to BRW. This massively simplifies the BRW lowering code and avoids type errors that are quite common to make in the backend. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37527>	2025-09-23 15:37:40 +00:00
Alyssa Rosenzweig	0d7083d5bc	brw: drop indirection on compiler options I see no point, we allocate for every shader stage anyway. This is a bit simpler. I'm not a fan of the brw_compiler singleton at all but torching that is not on today's agenda. Flattening it a little bit very much is. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37447>	2025-09-18 14:14:08 +00:00
Georg Lehmann	79d02047b8	intel: switch to new subgroup size info Reviewed-by: Iván Briano <ivan.briano@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37258>	2025-09-12 21:05:17 +00:00
Dylan Baker	ecfce9f9ad	blorp: Fix potential read of uninitaized elk fields in debug paths The intel_vue_map is only partially initialized before being used. All used fields are initialized, but in debug paths the unitialzed fields will also be read. To fix this initialize the struct to 0. In the brw path this struct is part of the prog_data, and is rzalloc'd. CID: 1665308 Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37261>	2025-09-10 17:51:34 +00:00
Sagar Ghuge	ebbc358db5	blorp: Emit state cache invalidation after every compute dispatch Implement HSD 16028171704/14025112257: LSC state cache livelock:- Once state cache entries are full, subsequent walker dispatches with two threads per thread group maybe gets stuck infinitely because of state cache live lock. One thread continuously stuck in loop doing UGM fence + evict and UGM read is waiting on UGM read to have certain value. while other thread supposed to update the value that first thread is waiting for. But since entries are full in state cache, there is second thread never make progress. Closes: #12352 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37128>	2025-09-04 00:14:48 +00:00
Tapani Pälli	42088cd602	isl/blorp: handle failing 96bpp linear blit case Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Fix the aux usage assert in blorp for 96bpp linear blit and provide CMF values for RGB formats supported by isl_format_rgb_to_rgba. CC: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/13670 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36709>	2025-08-13 16:09:12 +00:00
Qiang Yu	196569b1a4	all: rename gl_shader_stage to mesa_shader_stage It's not only for GL, change to a generic name. Use command: find . -type f -not -path '/.git/' -exec sed -i 's/\bgl_shader_stage\b/mesa_shader_stage/g' {} + Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:40 +08:00
Lionel Landwerlin	be16985c82	intel: move deref_block_size to intel_urb_config Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36512>	2025-08-01 11:35:05 +00:00
Antonio Ospite	ddf2aa3a4d	build: avoid redefining unreachable() which is standard in C23 In the C23 standard unreachable() is now a predefined function-like macro in <stddef.h> See https://android.googlesource.com/platform/bionic/+/HEAD/docs/c23.md#is-now-a-predefined-function_like-macro-in And this causes build errors when building for C23: ----------------------------------------------------------------------- In file included from ../src/util/log.h:30, from ../src/util/log.c:30: ../src/util/macros.h:123:9: warning: "unreachable" redefined 123 \| #define unreachable(str) \ \| ^~~~~~~~~~~ In file included from ../src/util/macros.h:31: /usr/lib/gcc/x86_64-linux-gnu/14/include/stddef.h:456:9: note: this is the location of the previous definition 456 \| #define unreachable() (__builtin_unreachable ()) \| ^~~~~~~~~~~ ----------------------------------------------------------------------- So don't redefine it with the same name, but use the name UNREACHABLE() to also signify it's a macro. Using a different name also makes sense because the behavior of the macro was extending the one of __builtin_unreachable() anyway, and it also had a different signature, accepting one argument, compared to the standard unreachable() with no arguments. This change improves the chances of building mesa with the C23 standard, which for instance is the default in recent AOSP versions. All the instances of the macro, including the definition, were updated with the following command line: git grep -l '[^_]unreachable(' -- "src/**" \| sort \| uniq \| \ while read file; \ do \ sed -e 's/$[^_]$unreachable(/\1UNREACHABLE(/g' -i "$file"; \ done && \ sed -e 's/#undef unreachable/#undef UNREACHABLE/g' -i src/intel/isl/isl_aux_info.c Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36437>	2025-07-31 17:49:42 +00:00
Nanley Chery	4de638ae1e	intel: Enable CCS_E on linear surfaces on Xe2+ Allow CCS for non-display linear surfaces in isl_surf_supports_ccs(). We're going to rely more on the helper to determine CCS-enabling for Xe2 on iris. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32120>	2025-07-21 18:36:31 +00:00
jhananit	debd903a00	intel: Update all NIR_PASS_V to NIR_PASS Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35889>	2025-07-14 19:25:52 +00:00
Sagar Ghuge	5f1f67358c	blorp: Set TG size based on number of threads Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35904>	2025-07-10 22:08:36 +00:00
José Roberto de Souza	aea519cbc2	intel/blorp: Program DispatchWalkOrder and ThreadGroupBatchSize with optimized values for regular computer walkers It was only added to indirect compute walkers while HSD don't say anything about this optimization be specific to indirect compute walkers. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36058>	2025-07-10 20:54:30 +00:00
José Roberto de Souza	b37747ce68	blorp: Emit STATE_COMPUTE_MODE before COMPUTE_WALKER Cc: stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35563>	2025-06-23 18:57:25 +00:00
Nanley Chery	42ef23ecd1	intel/blorp: Don't redescribe some Tile64 clears We don't support redescribing Tile64 and 3D due to interleaved depth planes. Fixes: `312952048b` ("intel/blorp: Redescribe gfx12.5 surfaces for CCS fast clears") Tested-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35619>	2025-06-20 13:39:20 +00:00
Nanley Chery	69d91ae975	intel/blorp: Use get_copy_format_for_bpb more for gfx12.5 Use get_copy_format_for_bpb() instead of get_ccs_compatible_uint_format() when performing blorp_copy(). This matches the code path taken on gfx20 and increases the testing of cases which would impact gfx12.0 in isl_get_sampler_clear_field_offset(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35329>	2025-06-09 17:40:20 +00:00
Nanley Chery	27a5d84632	intel/isl: Fix isl_get_sampler_clear_field_offset() Through testing, I've found that the sampler will fetch the clear color pixel from the converted clear color field in more cases. So, stop reporting the raw dword offset for them: * On gfx12.5, for 32-bpc color images. * On gfx11-12.0, for 64-bpp color images. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35329>	2025-06-09 17:40:20 +00:00
Rohan Garg	db8b07f88d	anv: use the float qualifier to denote the right value Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34824>	2025-06-05 20:26:54 +02:00
Dylan Baker	2a3cf70db8	blorp: cast uint32_t -> int64_t to avoid potential overflow In practice, I don't think it's actually going to overflow, but it could in theory, which coverity is pointing out. CID: 1647010 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35114>	2025-05-22 21:14:26 +00:00
Nanley Chery	67d60f4325	intel/blorp: Simplify get_fast_clear_rect() for gfx12.5 Refactor the scale factors to highlight the 16-tile width requirement on Tile4. The fast-clear simulator code associated with HSD 1407682962 also contains a 16-tile requirement for Tile4 surfaces (for the pitch). Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	312952048b	intel/blorp: Redescribe gfx12.5 surfaces for CCS fast clears According to HSD 1407682962 and the associated simulator code, fast-clear performance can be affected by: image alignment, tiling, dimensionality, and row pitch. Redescribe surfaces in order avoid fast-clearing at a slower rate. Also, benchmarking the main patch in the performance CI (hw=A750) shows that some traces are helped significantly: * TotalWarWarhammer3 +5.58% (n=2) * Factorio +3.75% (n=1) * TerminatorResistance +3.3% (n=2) * Borderlands3 +3.23% (n=2) We could additionally increase the alignment requirements of surfaces in order to deterministically increase fast-clear performance. That's left out of this patch in order to avoid any functional pitfalls that can arise with increased memory consumption. As a result, performance will continue to be affected by how ISL/drivers/apps configure main surface memory alignments (directly or indirectly). Thanks to Lionel Landwerlin for pointing me to the relevant simulator code. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11168 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11418 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	169e22f962	intel/blorp: Drop clear color assignment prior to Xe2 This hasn't been used since the responsibility of clear color updates moved to the drivers. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	e353244553	intel/blorp: Disable repclear for gfx12 fast-clear Docs indicate that this shouldn't be used. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:05 +00:00
Nanley Chery	fcdae4d4c0	intel: Add and use isl_surf_from_mem() Unify code which creates surfaces from buffers. The behavior is slightly changed to use array layers to enable arrayed buffer clears (as needed). Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33776>	2025-05-13 15:13:04 +00:00
Lionel Landwerlin	2d396f6085	intel: prepare VUE layout for more than 2 layouts Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:35 +00:00
José Roberto de Souza	fcb6dfb29c	intel: Fix the MOCS values in XY_BLOCK_COPY_BLT for Xe2+ One more instruction were the MOCS value was splited into two registes. Cc: mesa-stable Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34592>	2025-04-22 20:42:25 +00:00
José Roberto de Souza	161c412a82	intel: Fix the MOCS values in XY_FAST_COLOR_BLT for Xe2+ Xe2 changed the MOCS field in few instructions, those now have a field for the MOCS index and other the encryption enable bit but ISL returns the combination of both aka MEMORY_OBJECT_CONTROL_STATE. To minimize changes I have added 2 macros to extract the values from the value returned by isl. From all the instructions changed Mesa only make use of two, so the other instruction will be handled in the next patch. Cc: mesa-stable Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34592>	2025-04-22 20:42:25 +00:00
José Roberto de Souza	a96e280dfe	intel: Program XY_FAST_COLOR_BLT::Destination Mocs for gfx12 Copy engine is not used in gfx12 platforms on ANV but that is possible in Iris. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34560>	2025-04-17 18:11:44 +00:00
Rohan Garg	ceba312ebd	anv,blorp,isl: handle compressed CPS surfaces through the depth stencil hw Compressed CPS surfaces operations such as copies and clears need to be handled through the depth stencil hw to ensure that the aux data is handled correctly. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20741>	2025-03-28 04:38:09 +00:00
Lionel Landwerlin	e18431273a	blorp: relax depth/stencil<->color copy restriction Currently blorp assumes that copies of depth/stencil is restricted to/from depth/stencil formats. We want to allow color<->depth copies. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31983>	2025-03-25 08:01:15 +00:00
Lionel Landwerlin	fe2f173413	blorp: assert that shaders don't spill Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Nanley Chery <nanley.g.chery@intel.com> Acked-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31983>	2025-03-25 08:01:14 +00:00

1 2 3 4 5 ...

665 commits