fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 15:38:19 +02:00

Author	SHA1	Message	Date
José Roberto de Souza	48b43157f8	intel/perf: Add Gfx 12.5 mdap_metrics struct and set it Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Gfx 12.5 struct has only one major difference with gfx9, that is OaCntr lenght, while on gfx 9 it is 36 uint64_t long on gfx 12.5 it is 38 uint64_t long. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:16 +00:00
José Roberto de Souza	a097a3d214	intel/perf: Change mdapi switch cases from ver to verx We are missing handling for gfx12.5 so to add it we will need a switch case over verx. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:16 +00:00
José Roberto de Souza	2d75b3b873	intel/perf: Extend Xe2 mdap_metrics to Xe3 Looking at the reference code, there is no new struct for Xe3 so it should use the same struct as Xe2. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	8e318e3246	intel/perf: Add Xe2 mdap_metrics struct and set it Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	0675a0da55	intel/perf: Nuke intel_perf_load_configuration() and related code With no more users of intel_perf_load_configuration() it can be removed with other i915 functions around it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	132bcbee74	anv/hasvk: Add intel_perf_get_configuration_id() and replace intel_perf_load_configuration() usage We have no usage of the information returned by intel_perf_load_configuration(). It is only used to add a copy of the configuration so we have the metric id but we could instead get the metric id from sysfs, that is added by mdapi. Xe KMD don't have a uAPI to query the metrics configuration, so using sysfs also fixes the integration of mdapi with Xe KMD. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
José Roberto de Souza	5b39137ba0	anv/hasvk: Nuke register_config from anv_performance_configuration_intel There is no usage for register_config outside of anv_AcquirePerformanceConfigurationINTEL(), so we don't need to store it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Lukasz Stalmirski <lukasz.stalmirski@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32842>	2026-01-19 19:24:15 +00:00
Georg Lehmann	0165175d4a	ci: update trace checksums Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:29 +00:00
Georg Lehmann	050507ab81	brw: make sure nir_opt_algebraic_late was called after late brw_nir_optimize Not only is it questionable for code quality to not call nir_opt_algebraic_late after nir_opt_algebraic, it also breaks correctness for late lowerings. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39180>	2026-01-19 16:11:28 +00:00
José Roberto de Souza	81a5512565	intel/blorp: Remove duplicated calls in blorp_exec_compute() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details We can have only one of those calls before the 'if GFX_VERx10 >= 125' block. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39362>	2026-01-19 15:09:29 +00:00
Tapani Pälli	ab9d3528dc	anv: fix queue check in anv_blorp_execute_on_companion on xe3 Fixes: dEQP-VK.api.copy_and_blit.dedicated_allocation.resolve_image.whole_copy_before_resolving_transfer.2_bit Otherwise we attempt to use blorp and hit various asserts later in: - blorp_copy_supports_blitter - blorp_xy_block_copy_blt Fixes: `61287b00f3` ("anv: Stop using RCS companion for MSAA copy/clear on Xe3+") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39346>	2026-01-18 17:19:05 +00:00
Piotr Masłowski	4ef73da70e	hasvk: promote VK_EXT_robustness2 to VK_KHR_robustness2 Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39340>	2026-01-16 22:39:10 +00:00
Alyssa Rosenzweig	a11aa3fc4e	brw: combine peephole select calls Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39361>	2026-01-16 21:24:15 +00:00
Tapani Pälli	fcbe987e10	anv: fix setting emitted_flush_bits Fixes a crash with: dEQP-VK.api.external.semaphore.opaque_fd.signal_export_import_wait_temporary when driver calls genX(CmdSetEvent2) -> emit_apply_pipe_flushes with having NULL in emitted_flush_bits. Fixes: `8834ef8bcd` ("anv: use flushing PIPE_CONTROL for event signaling") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39343>	2026-01-16 13:19:06 +00:00
Georg Lehmann	b9908bb165	hasvk: create a new intrinsic for push constant to uniform load lowering Just setting the intrinsic is bad practice and breaks when constant indices no longer match. Fixes: `a6330ed4d0` ("nir: add ACCESS to load_uniforms") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14639 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39331>	2026-01-16 13:02:15 +00:00
Calder Young	d69daf28d0	anv,brw: Add helper to get stack ids per dss for ray queries Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38778>	2026-01-16 09:21:50 +00:00
Calder Young	1f1de7ebd6	anv,brw: Allow multiple ray queries without spilling to a shadow stack Allows a shader to have multiple ray queries without spilling them to a shadow stack. Instead, the driver provides the shader with an array of multiple RTDispatchGlobals structs to give each query its own dedicated stack. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38778>	2026-01-16 09:21:50 +00:00
Caio Oliveira	b542ac4ca0	brw: Fix and properly use increment_a64_address() Since the move to MEMORY__LOGICAL the result value was being ignored, so change to use that. Since the conversion to use new registers, some issues were introduced: - Even with `has_64bit_int` ADD with 64-bit immediate value is not supported; - `dst_high` was not being filled if there was no overflow; - Only `dst_low` returned. Found when writing some new code involving large block loads. Fixes: `b79e85a93f` ("brw: always use new registers for load address increments") Fixes: `b55f77161d` ("intel/brw: Switch to emitting MEMORY__LOGICAL opcodes") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39282>	2026-01-15 19:47:23 +00:00
Dylan Baker	1055004693	anv: initialize anv_address to ensure that the protection field is set It is unconditionally used, but is uninitialized. CID: 1675079 Fixes: `b1e74a1bb1` ("anv: shrink image opaque data") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39245>	2026-01-15 10:19:12 +00:00
Dylan Baker	bc1ccebb0e	anv: Use { 0 } to initialize struct The previous approach does ensure that all entries are zero'd, but that may not be clear to the reader (i.e., me). Using `{ 0 }` is clearer. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39245>	2026-01-15 10:19:12 +00:00
Paulo Zanoni	b52b1a08bf	intel/blorp: add blorp_shaders.cl This gives us the infrastructure that allows us to slowly migrate pieces of blorp shaders from NIR to OpenCL, which, IMHO, are much easier to read. We can't fully migrate everything due to all the conditional building we do with these shaders, but I'm sure we'll find opportunities to replace some NIR with OpenCL eventually. The conversion of blorp_check_in_bounds() serves as the first example. I also plan to have the shaders from the new indirect copy extension be OpenCL shaders (mixed with some NIR as well), so having this patch merged now will reduce the diff for the extension later. Thanks to Alyssa Rosenzweig for her help here. v2: - Use SPDX (Alyssa). - Use nir_trim_vector() (Alyssa). - Adjust CL variable declaration (Alyssa). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	f047f0b1be	intel/blorp: unionize blorp_params->wm_inputs We have two distinct code paths sharing blorp_params->wm_inputs for different purposes: the code from blorp_blit.c and the code from blorp_clear.c. While blorp_blit.c uses most of the parameters (all except clear_color), blorp_clear.c only uses clear_color and bounds_rect. Split the parameters in two structs: one for blits and the other for clears. This not only helps save some space in the shader inputs, but it also organizes things so it's more clear which parameters are used by what. In addition, my plan is to later add struct blorp_wm_inputs_indirect, which won't share anything that the others use, and would otherwise grow the struct even more. This change would reduce the size of struct blorp_wm_inputs from 96 to 80, but we have to add padding due to the assertion that compares it to cs_prog_data->push.cross_thread.size. Still good, though. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	a8dd4382bf	intel/blorp: generate the fast_clear_surf shaders later Because blorp_params_get_clear_kernel() calls blorp_params_get_clear_kernel_cs(), which reads params->num_samples, which we have not properly set yet at this point. I am also planning to have the functions that create the shader to rely on params.op, which we have not set yet either. I found this by inspection (when writing another patch), I'm not sure if this fixes something relevant, but it may be relevant to ver >= 30 multi-sampled cases. Fixes: `de0c547448` ("blorp: Handle 2D MSAA array image copies on compute shader") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	e360afdb8a	intel/blorp: blorp_blit_vars_init() doesn't need 'key' Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	39a78f764a	blorp: reorganize struct blorp_params When I first looked at this struct, my tiny little brain felt overwhelmed. - Add some white spaces in order to group the parameters into "logical" groups so it's easier to reason about everything. - Change the parameter order just a little bit - without breaking the logical groups - so the struct size decreases by 1.7% to 1864 bytes. - Add a comment explaining what the void * pointers point to. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	c98f5e9994	blorp: replace magic '2' with BLORP_NUM_BT_ENTRIES If we ever add more entries, things won't explode. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	814cfa909d	blorp: fix argument indentation I'm sorry, but I have OCD and the rest of the file is nicely aligned. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39046>	2026-01-15 04:34:55 +00:00
Paulo Zanoni	548d172094	hasvk: restore anv_is_aligned() On Broadwell, using the debug mode, you can't create even a single VkImage: createimage: ../../src/util/u_math.h:829: util_is_aligned: Assertion `(a != 0) && ((a & (a - 1)) == 0)' failed. Thread 1 "createimage" received signal SIGABRT, Aborted. Download failed: Invalid argument. Continuing without source file ./nptl/./nptl/pthread_kill.c. __pthread_kill_implementation (threadid=<optimized out>, signo=signo@entry=6, no_tid=no_tid@entry=0) at ./nptl/pthread_kill.c:44 warning: 44 ./nptl/pthread_kill.c: No such file or directory (gdb) bt #0 __pthread_kill_implementation (threadid=<optimized out>, signo=signo@entry=6, no_tid=no_tid@entry=0) at ./nptl/pthread_kill.c:44 #1 0x00007ffff789573f in __pthread_kill_internal (threadid=<optimized out>, signo=6) at ./nptl/pthread_kill.c:89 #2 0x00007ffff7840462 in __GI_raise (sig=sig@entry=6) at ../sysdeps/posix/raise.c:26 #3 0x00007ffff78284ac in __GI_abort () at ./stdlib/abort.c:77 #4 0x00007ffff7828420 in __assert_fail_base (fmt=<optimized out>, assertion=<optimized out>, file=<optimized out>, line=829, function=<optimized out>) at ./assert/assert.c:118 #5 0x00007ffff5a5fb0c in util_is_aligned (n=0, a=0) at ../../src/util/u_math.h:829 #6 0x00007ffff5a6060d in memory_range_end (memory_range=...) at ../../src/intel/vulkan_hasvk/anv_image.c:51 #7 0x00007ffff5a61c52 in check_memory_range_s (p=0x7fffffffd800) at ../../src/intel/vulkan_hasvk/anv_image.c:779 #8 0x00007ffff5a61ef3 in check_memory_bindings (device=0x555555654d50, image=0x55555566e050) at ../../src/intel/vulkan_hasvk/anv_image.c:830 #9 0x00007ffff5a62ea3 in anv_image_init (device=0x555555654d50, image=0x55555566e050, create_info=0x7fffffffd9d0) at ../../src/intel/vulkan_hasvk/anv_image.c:1263 #10 0x00007ffff5a63147 in anv_image_init_from_create_info (device=0x555555654d50, image=0x55555566e050, pCreateInfo=0x7fffffffda80) at ../../src/intel/vulkan_hasvk/anv_image.c:1333 #11 0x00007ffff5a63211 in anv_CreateImage (_device=0x555555654d50, pCreateInfo=0x7fffffffda80, pAllocator=0x0, pImage=0x7fffffffdd20) at ../../src/intel/vulkan_hasvk/anv_image.c:1356 #12 0x00007ffff44ff376 in vvl::dispatch::Device::CreateImage (this=0x55555562c480, device=0x555555654d50, pCreateInfo=0x7fffffffdcb8, pAllocator=0x0, pImage=0x7fffffffdd20) at ./layers/vulkan/generated/dispatch_object.cpp:1160 #13 0x00007ffff43e8214 in vulkan_layer_chassis::CreateImage (device=0x555555654d50, pCreateInfo=0x7fffffffdcb8, pAllocator=0x0, pImage=0x7fffffffdd20) at ./layers/vulkan/generated/chassis.cpp:2181 #14 0x0000555555560af4 in vks::Image::init (this=0x7fffffffdcb0) at /home/przanoni/git/random-stuff/vk/vks/libvulkanscript.hpp:1298 #15 0x000055555556557d in main () at createimage.cpp:36 Since we haven't noticed this issue as quickly as I imagined we would, let's opt for what's mostly a revert of the behavior change in the original commit. Fixes: `7be63ef956` ("intel: do not NIH util_is_aligned") Acked-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39045>	2026-01-15 04:20:29 +00:00
Lionel Landwerlin	fd744b0c8a	brw: switch buffer/image size intrinsics lowering to NIR Fossil-db DG2: Totals from 127 (0.01% of 1799288) affected shaders: Instrs: 60593 -> 60508 (-0.14%); split: -0.15%, +0.01% Cycle count: 7099635 -> 7116148 (+0.23%); split: -0.12%, +0.35% Spill count: 468 -> 466 (-0.43%) Fill count: 224 -> 222 (-0.89%) Max live registers: 6418 -> 6424 (+0.09%); split: -0.06%, +0.16% Non SSA regs after NIR: 11228 -> 11220 (-0.07%); split: -0.20%, +0.12% Fossil-db LNL: Totals from 135 (0.01% of 1573226) affected shaders: Instrs: 55173 -> 55143 (-0.05%); split: -0.07%, +0.01% Cycle count: 9178338 -> 9157052 (-0.23%); split: -0.32%, +0.09% Spill count: 454 -> 452 (-0.44%) Fill count: 181 -> 179 (-1.10%) Max live registers: 12915 -> 12919 (+0.03%); split: -0.06%, +0.09% Non SSA regs after NIR: 10860 -> 10852 (-0.07%); split: -0.20%, +0.13% shader-db LNL: total instructions in shared programs: 16911578 -> 16911566 (<.01%) instructions in affected programs: 1602 -> 1590 (-0.75%) helped: 7 HURT: 0 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.71 x̃: 2 helped stats (rel) min: 0.48% max: 1.10% x̄: 0.75% x̃: 0.74% 95% mean confidence interval for instructions value: -2.17 -1.26 95% mean confidence interval for instructions %-change: -0.96% -0.55% Instructions are helped. total loops in shared programs: 5168 -> 5168 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total cycles in shared programs: 848964184 -> 848955094 (<.01%) cycles in affected programs: 1528020 -> 1518930 (-0.59%) helped: 9 HURT: 6 helped stats (abs) min: 2.0 max: 8484.0 x̄: 1212.89 x̃: 20 helped stats (rel) min: 0.02% max: 3.23% x̄: 0.57% x̃: 0.11% HURT stats (abs) min: 2.0 max: 1608.0 x̄: 304.33 x̃: 15 HURT stats (rel) min: <.01% max: 0.59% x̄: 0.19% x̃: 0.07% 95% mean confidence interval for cycles value: -1875.18 663.18 95% mean confidence interval for cycles %-change: -0.75% 0.23% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 3345 -> 3345 (0.00%) spills in affected programs: 0 -> 0 helped: 0 HURT: 0 total fills in shared programs: 1777 -> 1777 (0.00%) fills in affected programs: 0 -> 0 helped: 0 HURT: 0 total sends in shared programs: 869299 -> 869299 (0.00%) sends in affected programs: 0 -> 0 helped: 0 HURT: 0 LOST: 0 GAINED: 0 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39258>	2026-01-14 10:37:32 +00:00
Alyssa Rosenzweig	c339b55f92	brw/nir_lower_fs_load_output: unify texture builders Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39271>	2026-01-14 08:18:15 +00:00
Lionel Landwerlin	0a3f3fd193	brw: drop unused color_outputs_valid key Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39264>	2026-01-12 20:21:48 +00:00
Lionel Landwerlin	c3bd1a1688	brw: handle layer_id only through system value Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39259>	2026-01-12 19:53:36 +00:00
Lionel Landwerlin	081c5bc6a5	brw: fix derivatives on non 32bit floats Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/14600 Meh'd-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39226>	2026-01-12 15:18:46 +00:00
Lionel Landwerlin	a97b01801a	brw: enable SIMD32 compute shaders with ray queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11020 Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	527ae448e5	brw/nir/rt: ensure we can load 2 RT_DISPATCH_GLOBALS Each group of 16 lanes inside a SIMD32 shader will load different globals. In SIMD8/16 shaders, the divergence analysis will turn this load into nir_load_global_constant_uniform_block_intel. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	b996b03f21	brw: enable topology opcodes in SIMD32 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	286073f6eb	brw: handle lowering of a couple of opcodes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	2fa09500a2	brw: enable ray query spilling in SIMD32 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Lionel Landwerlin	6d19b898e7	anv/brw: prep work for SIMD32 ray queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36181>	2026-01-12 12:19:21 +00:00
Alyssa Rosenzweig	43efc1cc7e	brw: use nir_is_shared_access Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39219>	2026-01-09 20:51:12 +00:00
Caio Oliveira	d160b7726a	brw/scoreboard: Disable nomask workaround for Xe2+ Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The issue was caused by fused EU feature that is not used in Xe2+ anymore. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36659>	2026-01-09 17:25:00 +00:00
Caio Oliveira	47a6ef3fef	brw/scoreboard: Use a predicate helper for the nomask workaround If it wasn't for the workaround, it wouldn't be necessary to track the whether instructions are exec_all or not. The workaround affects results when mixing a dep and inst with different exec_all. Add the predicate so that, when the workaround is disabled, none of the effects of having different exec_all will kick in, all them will be considered `exec_all = true`. This patch don't change any behavior, just adds the predicate. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36659>	2026-01-09 17:25:00 +00:00
Lionel Landwerlin	faa857a061	intel: rework push constant handling Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details nr_params & params array are gone. brw_ubo_range is not stored on the prog_data structure anymore (Anv already stored a copy of that with its own additional information) The backend now only deals with load_push_data_intel. load_uniform & load_push_constant have to be lowered by the driver. Pre Gfx12.5 platforms have to provide a subgroup_id_param to specify where the subgroup_id value is located in the push constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:52 +00:00
Lionel Landwerlin	60e359412d	iris: manage TBIMR null push constant wa in driver Anv already manages this itself. This allows removing the logic from the compiler. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:52 +00:00
Lionel Landwerlin	f4a0e05970	anv/brw/iris: get rid of param array on prog_data Drivers can do all the lowering to push constants to find the only value useful in that array (subgroup_id). Then drivers call into brw_cs_fill_push_const_info() to get the cross/per thread constant layout computed in the prog_data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:51 +00:00
Lionel Landwerlin	05c3d427ba	anv: ensure internal compute kernels are run at SIMD16 Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:50 +00:00
Lionel Landwerlin	3dc7d71909	anv: stop going through push ranges on the first empty slot The way we build our ranges, the first empty one is the end of the ranges. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:50 +00:00
Lionel Landwerlin	ec456e99f2	brw: add a pass to lower ubo to push constant data Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:49 +00:00
Lionel Landwerlin	2c7254c131	brw: invert condition to reduce code nesting Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38975>	2026-01-09 14:19:48 +00:00
Caio Oliveira	dcefa0e6b3	brw: Rework UIP and JIP setting code Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details The current code walks the instructions, and when needed, it will scan to find the next "end of scope" and sometimes the next "end of block". It also has a separate patching logic for HALTs. The new code collects the necessary scope information up front, then walks the instruction backwards, making avoiding the need to scan for the end of scope. It will also walk only the relevant instructions that were previously collected. It also replaces the previous HALT-specific patching logic. With this new change, many cases that were jumping to intermediate HALTs, will now jump straight to the end of scope (or the "end of the program" section). E.g. in ``` if ... (...) HALT ... (...) HALT endif ``` both HALTs now will jump to the end of the scope, instead of the first HALT jumping into the second one. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38914>	2026-01-08 22:01:45 +00:00

1 2 3 4 5 ...

15260 commits