fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 09:18:10 +02:00

Author	SHA1	Message	Date
Marek Olšák	f583f6e717	nir: use nir_build_frag_coord everywhere nir_build_frag_coord generates the correct sysval loads based on NIR options. nir_load_frag_coord shouldn't be used directly because drivers don't have to support it. v2: RADV can't use it because nir->options isn't set, so use load_pixel_coord. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41227>	2026-05-03 13:03:01 +00:00
Calder Young	64b5823d33	blorp: Work around sampler overfetch for buffer copies First, the surface dimensions are used to determine the range of valid pages that the data in the buffer overlaps, then rows are removed from the surface until it does not overfetch into any neighboring pages. If any rows were removed, an extra BTI is set up with a texel buffer that views the contents of all the rows that were removed, and the shader is compiled with a branch to sample the last rows through the texel buffer instead of the main surface. Using the texel buffer allows it to access the last rows without dealing with overfetch or weird alignment hacks, and restricting texel buffer usage to just the part of the surface that can't be accessed safely ensures that we don't significantly impact performance for any buffer to image copy that is unlucky enough to be close to a page boundry. Co-authored-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40149>	2026-05-01 19:51:41 +00:00
Calder Young	3cd9b14c80	isl: Optimize the sampler cache to overlap as few 64B cachelines as possible Since we now have a ISL_SURF_USAGE_NO_OVERFETCH_PADDING_BIT flag to turn extra padding calculations on and off, we can align the row pitch of linear surfaces that are accessed through the sampler to minimize the number of L3 cachelines that each sampler cacheline overlaps for added efficiency. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40149>	2026-05-01 19:51:41 +00:00
Calder Young	8d13628f7f	isl: Add additional alignment/padding requirements to prevent overfetch Bspec 58779 describes various cases where additional padding is required on the bottom and right sides of a sampling engine surface to avoid page faults. Since we don't want to mess up the other drivers that also use ISL, there's now a requires_padding boolean in isl_dev that can be used to enable/disable the extra padding calculations per device and driver. The extra padding can also be disabled per-surface by adding the usage flag ISL_SURF_USAGE_NO_OVERFETCH_PADDING_BIT, like when a specific row pitch is needed. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40149>	2026-05-01 19:51:41 +00:00
Paulo Zanoni	e65b5fc066	intel/blorp: remove always-true #if This check for ">= 125" is already inside a check for ">= 125". Also, let's take this opportunity to comment the #else and #endif of the relevant check to make the code easier to follow. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40937>	2026-04-14 18:26:09 +00:00
Nanley Chery	b50bb53630	intel/blorp: Fix width scaling for YCBCR copies Fixes: `eb8883f3ef` ("intel/blorp: Redescribe surfaces for copies") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/15267 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40930>	2026-04-13 20:03:41 +00:00
Marek Olšák	102d41799b	Rename more sha and sha1 names to blake3 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:28 +00:00
Marek Olšák	53c64973e8	Inline _mesa_sha1_compute/format, remove the other unused ones _mesa_sha1_format has a few remaining uses, so it's moved to build_id.c, which is its last user. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:27 +00:00
Marek Olšák	0da88d237a	Inline SHA1_DIGEST_STRING_LENGTH Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:27 +00:00
Marek Olšák	110632f702	Inline SHA1_DIGEST_LENGTH Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40383>	2026-03-23 07:03:27 +00:00
Michael Cheng	2eebe7b884	intel/blorp: use dedicated clear ops in clear paths Select dedicated blorp ops for clear requests instead of reusing generic depth/color labels. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40414>	2026-03-17 21:10:40 +00:00
Michael Cheng	061ed05c7a	intel/blorp: Remove unused blorp_gfx8_hiz_clear_attachments Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40414>	2026-03-17 21:10:40 +00:00
Michael Cheng	b901ff322a	intel/blorp: add explicit clear op enums for stencil and linear paths Add dedicated BLORP op enums so clear paths can be represented precisely. This is enum-only groundwork; behavior and trace output are wired in follow-up commits. Signed-off-by: Michael Cheng <michael.cheng@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40414>	2026-03-17 21:10:40 +00:00
Nanley Chery	eb8883f3ef	intel/blorp: Redescribe surfaces for copies Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details When copying data between two surfaces, independently increase the size of each surface's format (bits-per-pixel) as alignment constraints allow. Adjust the other surface parameters and blorp_copy() parameters accordingly. This fixes copies between the 16bpp YCRCB formats and 32bpp formats: dEQP-VK.ycbcr.single_plane_copy.linear.linear.r8g8b8a8_to_g8b8g8r8_422 This new test failure was reported by Iván Briano. More generally, this increases the efficiency of our copies. As shown in the configuration pages of the PRMs, our sampler is able to fetch texels at a fixed rate of texels / clock regardless of the texel size (presumably our rendering hardware has similar behavior). By using the largest texel size possible, we can transfer more bits / clock. Improves the performance of a number of traces in the performance CI for BMG: * TotalWarWarhammer3 +2.24% * Payday3 +1.87% * BaldursGate3 +1.34% * Control +1.25% * TotalWarPharaoh +1.22% Four additional traces are helped between +0.44% and +0.96%. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39974>	2026-03-11 00:36:19 +00:00
Nanley Chery	73796c7245	intel/blorp: Add blorp_surf_convert_to_single_level_tile() Convert a Tile64/Yf/Ys surface to a single level or a single miptail. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39974>	2026-03-11 00:36:19 +00:00
Nanley Chery	9351dbfb25	intel/blorp: Use stencil hardware less for CPB copies Don't use it without ISL_AUX_USAGE_STC_CCS. With a future patch, this will allow blorp_copy() calls to increase the size of the surface format for CPB. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39974>	2026-03-11 00:36:19 +00:00
Nanley Chery	20bf27f2a8	intel/blorp: Make blorp_copy() format queries aux-dependent blorp_copy() will soon start changing the format in a way which drivers cannot rely on to do things like manage the texture cache (see iris). Narrow down the scope of blorp_copy_get_formats() and blorp_copy_get_color_format() such that the returned value can only be trusted if compression would be enabled on each image. By doing this (and adapting iris to reflect this), we'll get the required flushes on the platforms which need WaSamplerCacheFlushBetweenRedescribedSurfaceReads: * On the platforms which need the workaround for all formats, blorp_copy() will stick with the queried format on compressed surfaces. * On the platforms which need the workaround when switching from ASTC and non-ASTC formats, blorp_copy() may actually change the queried format on compressed surfaces. This is not a problem, because surfaces which may be read with ASTC formats are not compressible. Prevents gfx9 from failing tests under: * KHR-GL46.copy_image.functional_src_target_texture_2d_array_src_format_r3_g3_b2* * KHR-GL46.copy_image.functional_src_target_texture_2d_array_src_format_rgb5* * KHR-GL46.copy_image.functional_src_target_texture_2d_array_src_format_rgba2* * KHR-GL46.copy_image.functional_src_target_texture_2d_array_src_format_rgba4* Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39974>	2026-03-11 00:36:18 +00:00
Nanley Chery	d993e0dc47	intel/blorp: Add blorp_surf::has_replicated_pixel This allows blorp_copy() to widen a surface format width in some cases. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39974>	2026-03-11 00:36:17 +00:00
Nanley Chery	a77f79f21e	intel/blorp: Lower bit-casting code in blorp_copy() We're going to add code between calling blorp_surf_convert_to_uncompressed() and bit-casting determination. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39974>	2026-03-11 00:36:17 +00:00
Nanley Chery	e0859f5ca1	intel/isl: Use a fixed alignment for single slices We're going to start changing the surface format during blorp_copy(). Changing the surface format could lead to incorrect image alignment parameters, so return a fixed halign and valign for images with a single subresource. That's all that will be needed for the upcoming blorp_copy() changes. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39974>	2026-03-11 00:36:17 +00:00
Nanley Chery	27d515772e	intel/isl: Replace mc_format with aux_format We're going to be changing the surface format of images but need to maintain a consistent render compression format to properly encode/decode. Generalize and use the field that was previously specific to ISL_AUX_USAGE_MC. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39974>	2026-03-11 00:36:15 +00:00
Sagar Ghuge	9a37209fb4	intel/blorp: drop unused BLORP_BATCH_COMPUTE_ENGINE flag Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39317>	2026-03-06 20:42:05 +00:00
Lionel Landwerlin	38cc622d8b	blorp: switch to new load_indirect_address_intel intrinsic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40174>	2026-03-06 06:34:43 +00:00
Lionel Landwerlin	5283cbe07c	blorp: add mda support Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40174>	2026-03-06 06:34:42 +00:00
Lionel Landwerlin	7f19814414	brw/nir: handle inline_data_intel more like push_data_intel It's pretty much the same mechanism, except it's a different register location. With this change we gain indirect loading support. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39405>	2026-02-25 10:44:09 +00:00
José Roberto de Souza	91c5744e25	intel/brw: Use computed push constants size in brw_assign_urb_setup() It was already computed in brw_shader::assign_curb_setup() so we can use it in brw_assign_urb_setup(). There was a mismatch between assign_curb_setup() and brw_assign_urb_setup() when push_sizes were not multiple of REG_SIZE, the first one was aligning every push_sizes before sum it, while brw_assign_urb_setup() was only aligning the sum of all push_size. By luck the only places that did not had a push_size aligned to REG_SIZE only had one push_size, so this was not an issue. So here also fixing this mismatch and adding an assert to caught any future mismatch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39817>	2026-02-19 16:53:03 +00:00
Kenneth Graunke	c5859b2d40	intel: Rename wm_prog_key to fs_prog_key Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This is the shader key for the fragment shader. Nobody even knows what the windowizer/masker unit is or does anymore. Even on Gen4-6, "fs" is still clearer. This makes the codebase easier to read. This is only about 15 years overdue. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39748>	2026-02-06 20:52:01 -08:00
Kenneth Graunke	56e638be81	intel: Rename wm_prog_data to fs_prog_data This is the program data for the fragment shader. Nobody even knows what the windowizer/masker unit is or does anymore. Even on Gen4-6, "fs" is still clearer. This makes the codebase easier to read. This is only about 15 years overdue. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39748>	2026-02-06 20:51:59 -08:00
Kenneth Graunke	beb4b78fe7	intel: Rename intel_msaa_flags to intel_fs_config This started out as dynamic configuration for MSAA related state, but has since expanded to cover many dynamic fragment shader options. We rename it to intel_fs_config, similar to intel_tess_config, to better indicate its purpose. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39748>	2026-02-06 20:51:43 -08:00
Nanley Chery	efb5ab1e4b	intel/blorp: Fix the redescribed fast-clear qpitch Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Assign a new QPitch when fast-clearing the unaligned top rows on a redescribed surface. Fixes the following piglit test on gfx12.5: $ test_folder=generated_tests/spec/EXT_shader_framebuffer_fetch/execution/gles3/ $ ./bin/shader_runner_gles3 $test_folder/single-slice-2darray.shader_test -auto -fbo Reported-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `3e331e4fe9` ("intel/blorp: Optimize non-zero-layer fast-clears") Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39722>	2026-02-06 19:09:12 +00:00
Nanley Chery	23a3c8c972	intel: Disable CCS_E support for YCRCB on gfx12 The table in Bspec 47715 lists these formats as "Not Supported" in the "Lossless Compression Support" column. Reviewed-by: Iván Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39628>	2026-02-04 02:23:48 +00:00
Nanley Chery	4512d81559	intel/blorp: Bump pitch when clearing unaligned bottom rows Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This might be faster if the layer starts at a 64KB offset. No performance benefits found in the performance CI. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:55 +00:00
Nanley Chery	3e331e4fe9	intel/blorp: Optimize non-zero-layer fast-clears Allow surface redescription when fast-clearing a layer > 0. This affects at least five traces in the performance CI, but the CI doesn't report any performance benefit from this. We already had code to handle unaligned rows at the bottom of an image. Now that this handles the misalignment at the top of the image range, we gain some symmetry. Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:55 +00:00
Nanley Chery	ba63883692	intel/blorp: Avoid unused surface redescription calc Suggested-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37660>	2026-01-27 18:46:54 +00:00
Tapani Pälli	bb84773c81	blorp: fix asserts hit with msaa blorp blits on xe3 Tested on PTL, fixes various copy_and_blit tests that utilize compute after `ab9d3528dc` that exposed this to them. Fixes: `ab9d3528dc` ("anv: fix queue check in anv_blorp_execute_on_companion on xe3") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39548>	2026-01-27 15:28:55 +00:00
Francisco Jerez	cc66f5ff1d	intel/blorp: Add support for partial resolves of HiZ-CCS surfaces. v2: Define additional enum BLORP_OP_HIZ_PARTIAL_RESOLVE to track partial resolves (Nanley). v3: Add comment regarding fall back to full resolve on Gfx12.0 (Nanley). Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31139>	2026-01-27 08:52:17 +00:00
Nanley Chery	f208ac9f4b	intel: Enable CCS support for Yf and Ys Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Enable CCS with Ys on all systems, and with Yf on gfx9-11. Unfortunately, Yf + CCS isn't supported on gfx12. Tests fail and systems hang in the CI with this enabled. The simulator also complains about this combination on tests such as: dEQP-VK.api.image_clearing.core.clear_color_attachment.multiple_layers.r4g4b4a4_unorm_pack16 dEQP-VK.api.image_clearing.core.clear_color_attachment.single_layer.r4g4b4a4_unorm_pack16_200x180_sample_count_2 The simulator doesn't complain about this combination on depth/stencil surfaces, but actual hardware still has issues with this. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11057 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38063>	2026-01-26 21:09:05 +00:00
Nanley Chery	6fc0e5c0aa	blorp: Fix Tile64 clear redescription assertion Prevent assert failures in a future commit where Tile64 will be selected more often. Fixes: `42ef23ecd1` ("intel/blorp: Don't redescribe some Tile64 clears") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38063>	2026-01-26 21:09:03 +00:00
José Roberto de Souza	81a5512565	intel/blorp: Remove duplicated calls in blorp_exec_compute() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We can have only one of those calls before the 'if GFX_VERx10 >= 125' block. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39362>	2026-01-19 15:09:29 +00:00
Paulo Zanoni	b52b1a08bf	intel/blorp: add blorp_shaders.cl This gives us the infrastructure that allows us to slowly migrate pieces of blorp shaders from NIR to OpenCL, which, IMHO, are much easier to read. We can't fully migrate everything due to all the conditional building we do with these shaders, but I'm sure we'll find opportunities to replace some NIR with OpenCL eventually. The conversion of blorp_check_in_bounds() serves as the first example. I also plan to have the shaders from the new indirect copy extension be OpenCL shaders (mixed with some NIR as well), so having this patch merged now will reduce the diff for the extension later. Thanks to Alyssa Rosenzweig for her help here. v2: - Use SPDX (Alyssa). - Use nir_trim_vector() (Alyssa). - Adjust CL variable declaration (Alyssa). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	f047f0b1be	intel/blorp: unionize blorp_params->wm_inputs We have two distinct code paths sharing blorp_params->wm_inputs for different purposes: the code from blorp_blit.c and the code from blorp_clear.c. While blorp_blit.c uses most of the parameters (all except clear_color), blorp_clear.c only uses clear_color and bounds_rect. Split the parameters in two structs: one for blits and the other for clears. This not only helps save some space in the shader inputs, but it also organizes things so it's more clear which parameters are used by what. In addition, my plan is to later add struct blorp_wm_inputs_indirect, which won't share anything that the others use, and would otherwise grow the struct even more. This change would reduce the size of struct blorp_wm_inputs from 96 to 80, but we have to add padding due to the assertion that compares it to cs_prog_data->push.cross_thread.size. Still good, though. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	a8dd4382bf	intel/blorp: generate the fast_clear_surf shaders later Because blorp_params_get_clear_kernel() calls blorp_params_get_clear_kernel_cs(), which reads params->num_samples, which we have not properly set yet at this point. I am also planning to have the functions that create the shader to rely on params.op, which we have not set yet either. I found this by inspection (when writing another patch), I'm not sure if this fixes something relevant, but it may be relevant to ver >= 30 multi-sampled cases. Fixes: `de0c547448` ("blorp: Handle 2D MSAA array image copies on compute shader") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	e360afdb8a	intel/blorp: blorp_blit_vars_init() doesn't need 'key' Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	39a78f764a	blorp: reorganize struct blorp_params When I first looked at this struct, my tiny little brain felt overwhelmed. - Add some white spaces in order to group the parameters into "logical" groups so it's easier to reason about everything. - Change the parameter order just a little bit - without breaking the logical groups - so the struct size decreases by 1.7% to 1864 bytes. - Add a comment explaining what the void * pointers point to. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	c98f5e9994	blorp: replace magic '2' with BLORP_NUM_BT_ENTRIES If we ever add more entries, things won't explode. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	814cfa909d	blorp: fix argument indentation I'm sorry, but I have OCD and the rest of the file is nicely aligned. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Lionel Landwerlin	faa857a061	intel: rework push constant handling Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details nr_params & params array are gone. brw_ubo_range is not stored on the prog_data structure anymore (Anv already stored a copy of that with its own additional information) The backend now only deals with load_push_data_intel. load_uniform & load_push_constant have to be lowered by the driver. Pre Gfx12.5 platforms have to provide a subgroup_id_param to specify where the subgroup_id value is located in the push constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:52 +00:00
Lionel Landwerlin	f4a0e05970	anv/brw/iris: get rid of param array on prog_data Drivers can do all the lowering to push constants to find the only value useful in that array (subgroup_id). Then drivers call into brw_cs_fill_push_const_info() to get the cross/per thread constant layout computed in the prog_data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:51 +00:00
Sagar Ghuge	de0c547448	blorp: Handle 2D MSAA array image copies on compute shader We are passing number of layers as inline parameter register, so figure out z_pos and write to 2D MSAA array images in compute shader. We already get component X, Y and sample index, all we needed was the number of layers. Ken: - Use load/store var instead of derefs Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33905>	2025-12-17 05:34:02 +00:00
Sagar Ghuge	080d28a03e	blorp: Set persample_msaa_dispatch for render shader Only 3D shader gets dispatched per sample not the compute shader. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/33905>	2025-12-17 05:34:02 +00:00

1 2 3 4 5 ...

703 commits