fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 15:58:06 +02:00

Author	SHA1	Message	Date
Calder Young	895ff7fe92	Revert "anv,brw: Allow multiple ray queries without spilling to a shadow stack" Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This optimization doesn't work when the ray query index isn't uniform across the subgroup, which is something the spec allows. While there are some smart ways to fix this and still avoid unnecessary spilling, its not worth investing the time until we find a realtime raytracing workload that actually needs to use multiple live ray queries for something. Fixes: `1f1de7eb` ("anv,brw: Allow multiple ray queries without spilling to a shadow stack") Acked-by: Sagar Ghuge <sagar.ghuge@intel.com> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39445>	2026-01-23 21:33:55 +00:00
Tapani Pälli	f66ff97d58	drirc/anv: implement steps to disable RHWO for Wa_14024015672 Disable RHWO by default for singlesample draws and for MSAA draws if a drirc key is set (avoid perf hit if not needed). Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39404>	2026-01-23 11:10:07 +00:00
Tapani Pälli	055a89cffb	intel/genxml: bring some missing fields to gen125.xml Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39404>	2026-01-23 11:10:07 +00:00
Tapani Pälli	840e6e855b	anv: add handling for Wa_14026600921 This is the Xe3 version of the earlier workaround. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39404>	2026-01-23 11:10:07 +00:00
Tapani Pälli	c75309c8f1	intel/dev: update mesa_defs.json from workaround database Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39404>	2026-01-23 11:10:07 +00:00
Sagar Ghuge	6aa3b70382	anv: Mark RootNodeOffset at 256B always This commit change the BVH layout a little so that we can load the BVH offset as constant rather than reading from memory. We have to force the instance leaves pointer at the end which gets used in copy.comp shader. Totals: Instrs: 54798 -> 54728 (-0.13%) Send messages: 3854 -> 3847 (-0.18%) Cycle count: 1915106 -> 1913954 (-0.06%); split: -0.07%, +0.01% Non SSA regs after NIR: 18594 -> 18575 (-0.10%) Totals from 7 (7.37% of 95) affected shaders: Instrs: 5532 -> 5462 (-1.27%) Send messages: 367 -> 360 (-1.91%) Cycle count: 132592 -> 131440 (-0.87%); split: -1.01%, +0.14% Non SSA regs after NIR: 1989 -> 1970 (-0.96%) PERCENTAGE DELTAS Shaders Instrs Send messages Cycle count Non SSA regs after NIR q2rtx-rt-pipeline 95 -0.13% -0.18% -0.06% -0.10% -------------------------------------------------------------------------------------- All affected 7 -1.27% -1.91% -0.87% -0.96% -------------------------------------------------------------------------------------- Total 95 -0.13% -0.18% -0.06% -0.10% Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39106>	2026-01-22 23:20:04 +00:00
Michel Dänzer	a74ffd6900	Pass the destination buffer size minus one to strncpy Copying the last byte was pointless, since the next line overwrites it, and resulted in a compiler warning: ../src/intel/common/intel_measure.c: In function 'intel_measure_init': ../src/intel/common/intel_measure.c:68:7: warning: 'strncpy' specified bound 1024 equals destination size [-Wstringop-truncation] 68 \| strncpy(env_copy, env, 1024); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~ This allows dropping -Wno-error=stringop-truncation from the debian-x86_64-asan & debian-{arm64,x86_64}-ubsan CI jobs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39429>	2026-01-22 15:44:09 +01:00
Caio Oliveira	c8375c0f71	brw/scoreboard: Support local implicit out-of-order dependencies In software scoreboard (Gfx12+) use information from previous instructions to trim out-of-order dependencies. For example, in send g1, g2 ($1) mov g3, g1 ($1.dst) // Depends on g1 (destination of $1) mov g4, g2 ($1.src) // Depends on g2 (source of $1) mov g5, g1 ($1.dst) // Depends on g1 (destination of $1) only the first `mov` needs to be annotated, because the execution will stall until that dependency is fulfilled, which in this case means the `send` is done and `g1` was already written. Note that while `$x.dst` implies `$x.src`, the reverse is not true, so if the first `mov` did not exist, both second and third `mov` in the example would have to keep their annotations. This patch add resolution of implicit out-of-order dependencies that are visible inside a block. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3526>	2026-01-21 22:29:28 +00:00
Caio Oliveira	ba317e14a0	brw: Provide ~ and &= operators for tgl_sbid_mode Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3526>	2026-01-21 22:29:28 +00:00
Caio Oliveira	2ebacbc78d	brw/scoreboard: Add tests showing implicit unordered dependencies in SWSB Mark tests as disabled for now. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3526>	2026-01-21 22:29:28 +00:00
Caio Oliveira	423916152e	brw/scoreboard: Use std::vector when applicable There's agreement now these are helpful and widely supported. We can always fallback to a custom vector class later if necessary. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3526>	2026-01-21 22:29:27 +00:00
Lionel Landwerlin	79aff6e274	brw: use fp64 to compute coarse_z For some reason we cannot get the precision needed from the HW at fp32. LNL internal fossildb changes : Totals from 7226 (0.76% of 947978) affected shaders: Instrs: 5512598 -> 5586086 (+1.33%); split: -0.00%, +1.33% Cycle count: 153836056 -> 155079472 (+0.81%); split: -0.77%, +1.58% Spill count: 2025 -> 2021 (-0.20%); split: -0.35%, +0.15% Fill count: 3139 -> 3112 (-0.86%); split: -1.12%, +0.25% Max live registers: 1034601 -> 1034632 (+0.00%); split: -0.00%, +0.00% Max dispatch width: 207296 -> 207264 (-0.02%); split: +0.02%, -0.03% Non SSA regs after NIR: 1147942 -> 1109326 (-3.36%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/12726 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	a19e949824	brw: move coarse_z computation to NIR So that we can print it easily with debug printfs Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:52 +00:00
Lionel Landwerlin	89a53f048a	brw: make coarse pixel bit available to NIR lowering Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:51 +00:00
Lionel Landwerlin	e3fd1b0ac0	brw: populate wm_prog_data earlier So that we can put the coarse_pixel_dispatch value available to NIR lowering. LNL internal fossildb changes: Totals from 40 (0.01% of 490838) affected shaders: Instrs: 33321 -> 33311 (-0.03%); split: -0.04%, +0.01% Cycle count: 780136 -> 779936 (-0.03%); split: -0.03%, +0.00% Max live registers: 5292 -> 5298 (+0.11%) Non SSA regs after NIR: 26638 -> 26464 (-0.65%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:51 +00:00
Lionel Landwerlin	6a7ff83874	brw: set nir_shader_compiler_options::has_pixel_coord Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38996>	2026-01-21 16:00:50 +00:00
Arzaq Naufail Khan	dc702671d9	anv: eliminate dead code Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39400>	2026-01-21 01:21:55 +00:00
Sagar Ghuge	8e85607130	anv/rt: Drop atomic operations on opacity flags Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Each node has their own opacity bits, so we don't need to track these opacity flags at header level. This commit also fixes the instance flag. Instance flag is 8bit wide, but we were always using 4 lower bits. Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39053>	2026-01-20 22:20:28 +00:00
Sagar Ghuge	61691034ac	anv/rt: Don't always set disableOpacityCull bit Setting this bit always might hurt performance. It might forces traversal to treat all leafs always valid. Cc: mesa-stable Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39053>	2026-01-20 22:20:28 +00:00
Lionel Landwerlin	3d2a696763	brw: treat inline parameters like UNIFORM Makes a bunch of copy propagation and other passes work much better. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:53 +00:00
Lionel Landwerlin	1d1866a84b	brw: apply same workaround to spawn than trace opcode Working around BRW's limitations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:52 +00:00
Lionel Landwerlin	0e9453291c	brw: improve push constant loading using base offsets Xe2+ only Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:52 +00:00
Lionel Landwerlin	c1ef494b08	brw: add missing base offset decoding Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39382>	2026-01-20 21:25:52 +00:00
Lionel Landwerlin	a7d7492f10	anv: enable debug printfs on internal shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39399>	2026-01-20 12:19:41 +00:00
Lionel Landwerlin	61b35c9d2b	anv: remove all kinds of useless info for internal shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39399>	2026-01-20 12:19:41 +00:00
José Roberto de Souza	48b43157f8	intel/perf: Add Gfx 12.5 mdap_metrics struct and set it Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Gfx 12.5 struct has only one major difference with gfx9, that is OaCntr lenght, while on gfx 9 it is 36 uint64_t long on gfx 12.5 it is 38 uint64_t long. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:16 +00:00
José Roberto de Souza	a097a3d214	intel/perf: Change mdapi switch cases from ver to verx We are missing handling for gfx12.5 so to add it we will need a switch case over verx. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:16 +00:00
José Roberto de Souza	2d75b3b873	intel/perf: Extend Xe2 mdap_metrics to Xe3 Looking at the reference code, there is no new struct for Xe3 so it should use the same struct as Xe2. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	8e318e3246	intel/perf: Add Xe2 mdap_metrics struct and set it Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	0675a0da55	intel/perf: Nuke intel_perf_load_configuration() and related code With no more users of intel_perf_load_configuration() it can be removed with other i915 functions around it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	132bcbee74	anv/hasvk: Add intel_perf_get_configuration_id() and replace intel_perf_load_configuration() usage We have no usage of the information returned by intel_perf_load_configuration(). It is only used to add a copy of the configuration so we have the metric id but we could instead get the metric id from sysfs, that is added by mdapi. Xe KMD don't have a uAPI to query the metrics configuration, so using sysfs also fixes the integration of mdapi with Xe KMD. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	5b39137ba0	anv/hasvk: Nuke register_config from anv_performance_configuration_intel There is no usage for register_config outside of anv_AcquirePerformanceConfigurationINTEL(), so we don't need to store it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
Georg Lehmann	0165175d4a	ci: update trace checksums Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:29 +00:00
Georg Lehmann	050507ab81	brw: make sure nir_opt_algebraic_late was called after late brw_nir_optimize Not only is it questionable for code quality to not call nir_opt_algebraic_late after nir_opt_algebraic, it also breaks correctness for late lowerings. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:28 +00:00
José Roberto de Souza	81a5512565	intel/blorp: Remove duplicated calls in blorp_exec_compute() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We can have only one of those calls before the 'if GFX_VERx10 >= 125' block. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39362>	2026-01-19 15:09:29 +00:00
Tapani Pälli	ab9d3528dc	anv: fix queue check in anv_blorp_execute_on_companion on xe3 Fixes: dEQP-VK.api.copy_and_blit.dedicated_allocation.resolve_image.whole_copy_before_resolving_transfer.2_bit Otherwise we attempt to use blorp and hit various asserts later in: - blorp_copy_supports_blitter - blorp_xy_block_copy_blt Fixes: `61287b00f3` ("anv: Stop using RCS companion for MSAA copy/clear on Xe3+") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39346>	2026-01-18 17:19:05 +00:00
Piotr Masłowski	4ef73da70e	hasvk: promote VK_EXT_robustness2 to VK_KHR_robustness2 Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39340>	2026-01-16 22:39:10 +00:00
Alyssa Rosenzweig	a11aa3fc4e	brw: combine peephole select calls Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39361>	2026-01-16 21:24:15 +00:00
Tapani Pälli	fcbe987e10	anv: fix setting emitted_flush_bits Fixes a crash with: dEQP-VK.api.external.semaphore.opaque_fd.signal_export_import_wait_temporary when driver calls genX(CmdSetEvent2) -> emit_apply_pipe_flushes with having NULL in emitted_flush_bits. Fixes: `8834ef8bcd` ("anv: use flushing PIPE_CONTROL for event signaling") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39343>	2026-01-16 13:19:06 +00:00
Georg Lehmann	b9908bb165	hasvk: create a new intrinsic for push constant to uniform load lowering Just setting the intrinsic is bad practice and breaks when constant indices no longer match. Fixes: `a6330ed4d0` ("nir: add ACCESS to load_uniforms") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14639 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39331>	2026-01-16 13:02:15 +00:00
Calder Young	d69daf28d0	anv,brw: Add helper to get stack ids per dss for ray queries Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38778>	2026-01-16 09:21:50 +00:00
Calder Young	1f1de7ebd6	anv,brw: Allow multiple ray queries without spilling to a shadow stack Allows a shader to have multiple ray queries without spilling them to a shadow stack. Instead, the driver provides the shader with an array of multiple RTDispatchGlobals structs to give each query its own dedicated stack. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38778>	2026-01-16 09:21:50 +00:00
Caio Oliveira	b542ac4ca0	brw: Fix and properly use increment_a64_address() Since the move to MEMORY__LOGICAL the result value was being ignored, so change to use that. Since the conversion to use new registers, some issues were introduced: - Even with `has_64bit_int` ADD with 64-bit immediate value is not supported; - `dst_high` was not being filled if there was no overflow; - Only `dst_low` returned. Found when writing some new code involving large block loads. Fixes: `b79e85a93f` ("brw: always use new registers for load address increments") Fixes: `b55f77161d` ("intel/brw: Switch to emitting MEMORY__LOGICAL opcodes") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39282>	2026-01-15 19:47:23 +00:00
Dylan Baker	1055004693	anv: initialize anv_address to ensure that the protection field is set It is unconditionally used, but is uninitialized. CID: 1675079 Fixes: `b1e74a1bb1` ("anv: shrink image opaque data") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39245>	2026-01-15 10:19:12 +00:00
Dylan Baker	bc1ccebb0e	anv: Use { 0 } to initialize struct The previous approach does ensure that all entries are zero'd, but that may not be clear to the reader (i.e., me). Using `{ 0 }` is clearer. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39245>	2026-01-15 10:19:12 +00:00
Paulo Zanoni	b52b1a08bf	intel/blorp: add blorp_shaders.cl This gives us the infrastructure that allows us to slowly migrate pieces of blorp shaders from NIR to OpenCL, which, IMHO, are much easier to read. We can't fully migrate everything due to all the conditional building we do with these shaders, but I'm sure we'll find opportunities to replace some NIR with OpenCL eventually. The conversion of blorp_check_in_bounds() serves as the first example. I also plan to have the shaders from the new indirect copy extension be OpenCL shaders (mixed with some NIR as well), so having this patch merged now will reduce the diff for the extension later. Thanks to Alyssa Rosenzweig for her help here. v2: - Use SPDX (Alyssa). - Use nir_trim_vector() (Alyssa). - Adjust CL variable declaration (Alyssa). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	f047f0b1be	intel/blorp: unionize blorp_params->wm_inputs We have two distinct code paths sharing blorp_params->wm_inputs for different purposes: the code from blorp_blit.c and the code from blorp_clear.c. While blorp_blit.c uses most of the parameters (all except clear_color), blorp_clear.c only uses clear_color and bounds_rect. Split the parameters in two structs: one for blits and the other for clears. This not only helps save some space in the shader inputs, but it also organizes things so it's more clear which parameters are used by what. In addition, my plan is to later add struct blorp_wm_inputs_indirect, which won't share anything that the others use, and would otherwise grow the struct even more. This change would reduce the size of struct blorp_wm_inputs from 96 to 80, but we have to add padding due to the assertion that compares it to cs_prog_data->push.cross_thread.size. Still good, though. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	a8dd4382bf	intel/blorp: generate the fast_clear_surf shaders later Because blorp_params_get_clear_kernel() calls blorp_params_get_clear_kernel_cs(), which reads params->num_samples, which we have not properly set yet at this point. I am also planning to have the functions that create the shader to rely on params.op, which we have not set yet either. I found this by inspection (when writing another patch), I'm not sure if this fixes something relevant, but it may be relevant to ver >= 30 multi-sampled cases. Fixes: `de0c547448` ("blorp: Handle 2D MSAA array image copies on compute shader") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	e360afdb8a	intel/blorp: blorp_blit_vars_init() doesn't need 'key' Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	39a78f764a	blorp: reorganize struct blorp_params When I first looked at this struct, my tiny little brain felt overwhelmed. - Add some white spaces in order to group the parameters into "logical" groups so it's easier to reason about everything. - Change the parameter order just a little bit - without breaking the logical groups - so the struct size decreases by 1.7% to 1864 bytes. - Add a comment explaining what the void * pointers point to. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00

1 2 3 4 5 ...

15285 commits