fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-22 13:08:09 +02:00

Author	SHA1	Message	Date
Georg Lehmann	79d02047b8	intel: switch to new subgroup size info Reviewed-by: Iván Briano <ivan.briano@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37258>	2025-09-12 21:05:17 +00:00
Georg Lehmann	95c2a65662	nir: remove unused shader_info param in nir_create_shader Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37258>	2025-09-12 21:05:17 +00:00
Lionel Landwerlin	a91e0e0d61	brw: add support for separate tessellation shader compilation Tessellation factors have to be written dynamically (based on the next shader primitive topology) and the builtins read using a dynamic offset (based on the preceeding shader's VUE). Anv is updated to use this new infrastructure for dynamic patch_control_points. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34872>	2025-09-05 07:46:17 +00:00
Ian Romanick	1ce90ad5e1	elk: Use nir_opt_sink and more nir_opt_move Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details I spent a bunch of time playing around with the various enable bits, and this was the best I could come up with. Enabling any of nir_move_comparisons or nir_move_load_ubo in nir_opt_sink helped instructions quite a bit, but it also caused a large pile of added spills and fills. shader-db: Broadwell total instructions in shared programs: 18428980 -> 18427957 (<.01%) instructions in affected programs: 425245 -> 424222 (-0.24%) helped: 1522 / HURT: 405 total cycles in shared programs: 954756705 -> 953755695 (-0.10%) cycles in affected programs: 623470486 -> 622469476 (-0.16%) helped: 17989 / HURT: 21175 total spills in shared programs: 8349 -> 8356 (0.08%) spills in affected programs: 285 -> 292 (2.46%) helped: 7 / HURT: 13 total fills in shared programs: 10426 -> 10192 (-2.24%) fills in affected programs: 675 -> 441 (-34.67%) helped: 25 / HURT: 1 LOST: 346 GAINED: 554 Haswell total instructions in shared programs: 16809730 -> 16801634 (-0.05%) instructions in affected programs: 772251 -> 764155 (-1.05%) helped: 3055 / HURT: 840 total cycles in shared programs: 945179935 -> 944315696 (-0.09%) cycles in affected programs: 549177588 -> 548313349 (-0.16%) helped: 34143 / HURT: 23605 total spills in shared programs: 7699 -> 7666 (-0.43%) spills in affected programs: 353 -> 320 (-9.35%) helped: 10 / HURT: 16 total fills in shared programs: 8184 -> 7671 (-6.27%) fills in affected programs: 1006 -> 493 (-50.99%) helped: 30 / HURT: 2 total sends in shared programs: 1016676 -> 1016682 (<.01%) sends in affected programs: 49 -> 55 (12.24%) helped: 0 / HURT: 6 LOST: 415 GAINED: 441 Ivy Bridge total instructions in shared programs: 15764955 -> 15757178 (-0.05%) instructions in affected programs: 707453 -> 699676 (-1.10%) helped: 2893 / HURT: 547 total cycles in shared programs: 430017934 -> 429720104 (-0.07%) cycles in affected programs: 251816726 -> 251518896 (-0.12%) helped: 33110 / HURT: 22056 total spills in shared programs: 1537 -> 1525 (-0.78%) spills in affected programs: 18 -> 6 (-66.67%) helped: 6 / HURT: 0 total fills in shared programs: 926 -> 905 (-2.27%) fills in affected programs: 24 -> 3 (-87.50%) helped: 6 / HURT: 0 total sends in shared programs: 816646 -> 816652 (<.01%) sends in affected programs: 49 -> 55 (12.24%) helped: 0 / HURT: 6 LOST: 332 GAINED: 417 Sandy Bridge total instructions in shared programs: 14055229 -> 14045281 (-0.07%) instructions in affected programs: 1436142 -> 1426194 (-0.69%) helped: 5858 / HURT: 757 total cycles in shared programs: 772123170 -> 813543451 (5.36%) cycles in affected programs: 521342483 -> 562762764 (7.94%) helped: 27928 / HURT: 35923 total spills in shared programs: 1742 -> 1741 (-0.06%) spills in affected programs: 66 -> 65 (-1.52%) helped: 1 / HURT: 0 total fills in shared programs: 970 -> 967 (-0.31%) fills in affected programs: 93 -> 90 (-3.23%) helped: 1 / HURT: 0 total sends in shared programs: 1239222 -> 1238992 (-0.02%) sends in affected programs: 6137 -> 5907 (-3.75%) helped: 342 / HURT: 112 LOST: 244 GAINED: 434 Iron Lake and GM45 had similar results. (Iron Lake shown) total instructions in shared programs: 8366385 -> 8363954 (-0.03%) instructions in affected programs: 162761 -> 160330 (-1.49%) helped: 600 / HURT: 195 total cycles in shared programs: 248992618 -> 252119334 (1.26%) cycles in affected programs: 50774708 -> 53901424 (6.16%) helped: 3435 / HURT: 5131 total sends in shared programs: 623693 -> 623681 (<.01%) sends in affected programs: 351 -> 339 (-3.42%) helped: 12 / HURT: 0 LOST: 0 GAINED: 6 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25463>	2025-09-04 15:01:18 -07:00
Konstantin Seurer	9df7b48d2f	nir: Use nir_def_as_* in more places Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36746>	2025-08-24 14:03:09 +00:00
Matt Turner	6fd4dc353c	elk/algebraic: Protect SHUFFLE from OOB indices Akin to `b67230de63` ("intel/fs: Protect opt_algebraic from OOB BROADCAST indices"), we need to protect SHUFFLE as well. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36779>	2025-08-19 09:15:19 +00:00
Qiang Yu	4847e0b380	all: rename gl_shader_stage_uses_workgroup to mesa_shader_stage_uses_workgroup Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:41 +08:00
Qiang Yu	7a91473192	all: rename gl_shader_stage_is_compute to mesa_shader_stage_is_compute Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:41 +08:00
Qiang Yu	196569b1a4	all: rename gl_shader_stage to mesa_shader_stage It's not only for GL, change to a generic name. Use command: find . -type f -not -path '/.git/' -exec sed -i 's/\bgl_shader_stage\b/mesa_shader_stage/g' {} + Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:40 +08:00
Qiang Yu	07a3a54d37	all: rename PIPE_SHADER_TYPES to MESA_SHADER_STAGES Use command: find . -type f -not -path '/.git/' -exec sed -i 's/\bPIPE_SHADER_TYPES\b/MESA_SHADER_STAGES/g' {} + Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:39 +08:00
Marek Olšák	ae5b168051	ralloc/linalloc: allow adding custom code to LINEAR_ALLOC new operator for GLSL IR Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36539>	2025-08-04 02:07:00 +00:00
Alyssa Rosenzweig	82ae8b1d33	treewide: simplify nir_def_rewrite_uses_after Most of the time with nir_def_rewrite_uses_after, you want to rewrite after the replacement. Make that the default thing to be more ergonomic and to drop parent_instr uses. We leave nir_def_rewrite_uses_after_instr defined if you really want the old signature with an arbitrary after point. Via Coccinelle patch: @@ expression a, b; @@ -nir_def_rewrite_uses_after(a, b, b->parent_instr) +nir_def_rewrite_uses_after_def(a, b) Followed by a bunch of sed. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Alyssa Rosenzweig	cc6e3b84cb	treewide: use nir_def_as_* Via Coccinelle patch: @@ expression definition; @@ -nir_instr_as_alu(definition->parent_instr) +nir_def_as_alu(definition) @@ expression definition; @@ -nir_instr_as_intrinsic(definition->parent_instr) +nir_def_as_intrinsic(definition) @@ expression definition; @@ -nir_instr_as_phi(definition->parent_instr) +nir_def_as_phi(definition) @@ expression definition; @@ -nir_instr_as_load_const(definition->parent_instr) +nir_def_as_load_const(definition) @@ expression definition; @@ -nir_instr_as_deref(definition->parent_instr) +nir_def_as_deref(definition) @@ expression definition; @@ -nir_instr_as_tex(definition->parent_instr) +nir_def_as_tex(definition) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Marek Olšák	db26597f8d	intel: fork exec_node/list -> brw_exec_node/list as a private Intel utility NIR is going to use exec_node/list without the C++ code, and may switch to a different linked list implementation in the future. GLSL is going to use ir_exec_node/list, which we want to keep private for GLSL, so that we can change it easily. Thus, it's better to fork the C++ version of list.h for Intel. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36425>	2025-07-31 20:23:02 +00:00
Antonio Ospite	ddf2aa3a4d	build: avoid redefining unreachable() which is standard in C23 In the C23 standard unreachable() is now a predefined function-like macro in <stddef.h> See https://android.googlesource.com/platform/bionic/+/HEAD/docs/c23.md#is-now-a-predefined-function_like-macro-in And this causes build errors when building for C23: ----------------------------------------------------------------------- In file included from ../src/util/log.h:30, from ../src/util/log.c:30: ../src/util/macros.h:123:9: warning: "unreachable" redefined 123 \| #define unreachable(str) \ \| ^~~~~~~~~~~ In file included from ../src/util/macros.h:31: /usr/lib/gcc/x86_64-linux-gnu/14/include/stddef.h:456:9: note: this is the location of the previous definition 456 \| #define unreachable() (__builtin_unreachable ()) \| ^~~~~~~~~~~ ----------------------------------------------------------------------- So don't redefine it with the same name, but use the name UNREACHABLE() to also signify it's a macro. Using a different name also makes sense because the behavior of the macro was extending the one of __builtin_unreachable() anyway, and it also had a different signature, accepting one argument, compared to the standard unreachable() with no arguments. This change improves the chances of building mesa with the C23 standard, which for instance is the default in recent AOSP versions. All the instances of the macro, including the definition, were updated with the following command line: git grep -l '[^_]unreachable(' -- "src/**" \| sort \| uniq \| \ while read file; \ do \ sed -e 's/$[^_]$unreachable(/\1UNREACHABLE(/g' -i "$file"; \ done && \ sed -e 's/#undef unreachable/#undef UNREACHABLE/g' -i src/intel/isl/isl_aux_info.c Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36437>	2025-07-31 17:49:42 +00:00
Alyssa Rosenzweig	ecfca8ec6f	util: crib SWAP macro from freedreno we have a bunch of copies across the tree, unify them. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36257>	2025-07-21 11:42:18 +00:00
jhananit	debd903a00	intel: Update all NIR_PASS_V to NIR_PASS Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35889>	2025-07-14 19:25:52 +00:00
Daniel Schürmann	2c51a8870d	nir: add nir_vectorize_cb callback parameter to nir_lower_phis_to_scalar() Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Similar to nir_lower_alu_width(), the callback can return the desired number of components for a phi, or 0 for no lowering. The previous behavior of nir_lower_phis_to_scalar() with lower_all=true can be elicited via nir_lower_all_phis_to_scalar() while the previous behavior with lower_all=false now corresponds to nir_lower_phis_to_scalar() with NULL callback. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35783>	2025-07-08 15:33:59 +00:00
Marek Olšák	8def3f865d	agx,freedreno,intel,lima,panfrost,svga,virgl,zink: fix supports_indirect_inputs The GLSL compiler always lowers inputs to temps for VS and GS, so exclude them from driver support because the GLSL compiler will no longer do that unconditionally. Thus, indirect VS and GS inputs are completely untested and broken in a lot of drivers. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35945>	2025-07-08 06:11:42 +00:00
Alyssa Rosenzweig	d31cb824df	treewide: use VARYING_BIT_* Some checks failed macOS-CI / macOS-CI (dri) (push) Has been cancelled Details macOS-CI / macOS-CI (xlib) (push) Has been cancelled Details Via Coccinelle patch generated by the following Python: varys = [ "POS", "COL0", "COL1", "FOGC", "TEX0", "TEX1", "TEX2", "TEX3", "TEX4", "TEX5", "TEX6", "TEX7", "PSIZ", "BFC0", "BFC1", "EDGE", "CLIP_VERTEX", "CLIP_DIST0", "CLIP_DIST1", "CULL_DIST0", "CULL_DIST1", "PRIMITIVE_ID", "PRIMITIVE_COUNT", "LAYER", "VIEWPORT", "FACE", "PRIMITIVE_SHADING_RATE", "PNTC", "TESS_LEVEL_OUTER", "TESS_LEVEL_INNER", "PRIMITIVE_INDICES", "BOUNDING_BOX0", "BOUNDING_BOX1", "VIEWPORT_MASK", "CULL_PRIMITIVE" ] t = """ @@ @@ -(1 << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -(1ull << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD64_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} """ for v in varys: from mako.template import Template print(Template(t).render(V = v)) Closes: #13453 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> [panfrost, common] Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> [broadcom] Reviewed-by: Corentin Noël <corentin.noel@collabora.com> [virgl] Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> [zink] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35917>	2025-07-04 19:01:04 +00:00
Marek Olšák	1754507d49	nir: rename nir_lower_io_to_temporaries -> nir_lower_io_vars_to_temporaries Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	1e03827c77	nir: rename nir_lower_io_arrays_to_elements -> nir_lower_io_array_vars_to_elements same for *_no_indirects Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:54 +00:00
Marek Olšák	12df9b3def	nir: rename nir_vectorize_tess_levels -> nir_lower_tess_level_array_vars_to_vec Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:50 +00:00
Marek Olšák	2aa94caf82	nir: rename nir_lower_io_to_vector -> nir_opt_vectorize_io_vars Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:50 +00:00
Marek Olšák	439d805291	nir: rename nir_lower_io_to_scalar_early -> nir_lower_io_vars_to_scalar Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:49 +00:00
Eric Engestrom	99e8d804bf	intel/compiler tests: fix variable type for getopt_long() return value Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details `getopt_long()` returns an `int`, not a `char`; putting the value in a `char` before comparing it to `-1` was making the comparison always fail, resulting in the invalid codepath taken that then fails with: option `-' is invalid: ignored cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34756>	2025-06-23 08:26:29 +00:00
Eric Engestrom	f545f9eed4	intel/compiler tests: fix "is there something after the options" check cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34756>	2025-06-23 08:26:29 +00:00
Eric Engestrom	729922cdae	intel/compiler tests: fix path-to-string conversion cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34756>	2025-06-23 08:26:29 +00:00
Eric Engestrom	de6ab1beda	intel/compiler tests: rewrite subprocess handling in run-test.py `subprocess.Popen()` returns immediately, and the subprocess might not have finished by the time `stdout` is read on the next line, spuriously failing the tests. `subprocess.check_output()` makes sure the output is available before returning, solving this issue; it additionally raises an error if the subprocess failed, giving a better error than a failed diff later in the script. cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34756>	2025-06-23 08:26:29 +00:00
Georg Lehmann	9da23499ff	compiler: add float8 glsl types e4m3fn: 8bit floating point format with 4bit exponent, 3bit mantissa and no infinities (finite only) e5m2: 8bit floating point format with 5bit exponent, 2bit mantissa and with infinities. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35434>	2025-06-23 07:59:24 +00:00
Emma Anholt	cd981e27f7	intel/elk: Move wpos_w setup right into nir_intrinsic_load_frag_w. Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Given that the intrinsic will be CSEed at the NIR level, we don't need to preemptively set it up at the top of the shader. No change in HSW shader-db. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:43 +00:00
Emma Anholt	269fbcb144	intel/elk: Use pixel_z for gl_FragCoord.z on pre-gen6. Unless I've seriously missed something, we have the Z in the payload (which we can always request if we need access to it and it's not already passed to us due other WM IZ settings). total instructions in shared programs: 4408303 -> 4408186 (<.01%) instructions in affected programs: 1164 -> 1047 (-10.05%) total cycles in shared programs: 142485036 -> 142484566 (<.01%) cycles in affected programs: 26820 -> 26350 (-1.75%) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:43 +00:00
Emma Anholt	dc55b47a58	intel/elk: Move pre-gen6 smooth interpolation 1/w multiply to NIR. NIR catches that if you're just doing something like adding two smooth inputs, we can do the multiply once on the result instead of on each input. BRW shader-db results: total instructions in shared programs: 4409146 -> 4408303 (-0.02%) instructions in affected programs: 800761 -> 799918 (-0.11%) total cycles in shared programs: 143203198 -> 142485036 (-0.50%) cycles in affected programs: 79081682 -> 78363520 (-0.91%) total sends in shared programs: 363044 -> 363042 (<.01%) sends in affected programs: 33 -> 31 (-6.06%) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:42 +00:00
Emma Anholt	fb9b2261a1	intel/elk: Move pre-gen6 gl_FragCoord.w -> interpolation lowering to NIR. BRW shader-db: total instructions in shared programs: 4409143 -> 4409146 (<.01%) instructions in affected programs: 330 -> 333 (0.91%) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:41 +00:00
Emma Anholt	17ab39fbf8	intel/elk: Fix some tabs in gen4 URB setup. This formatted terribly in my editor, just use spaces. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:40 +00:00
Emma Anholt	9d7a016ed1	intel/elk: Retire the global float pixel_x/y values. Nothing used them any more. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:40 +00:00
Emma Anholt	e1bf014b6e	intel/elk: Reduce this->pixel_x/y usage in gfx4 interp setup. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:40 +00:00
Emma Anholt	241bc5da70	intel/elk: Use the pixel_coord UW x/y values for noncoherent FB reads. No need to force generating the float cast just to turn it back to an int. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:39 +00:00
Emma Anholt	1134cdc198	intel/elk: Lower load_frag_coord to load_{pixel_coord,frag_coord_z/w} in NIR. This moves some conversions to NIR that may get eliminated, and also distinguishes gl_FragCoord.z/w loads at the shader info level so we don't need to flag uses_src_depth/uses_src_w when only gl_FragCoord.xy get used (as is typical). This reduces thread payload setup on many shaders. Also, interestingly, blorp shaders stop reserving space for z/w despite not putting them in the payload (since PS_EXTRA isn't filled out for z/w). HSW shader-db is noise: total instructions in shared programs: 9942649 -> 9942997 (<.01%) instructions in affected programs: 143167 -> 143515 (0.24%) total cycles in shared programs: 314768862 -> 314299112 (-0.15%) cycles in affected programs: 62951452 -> 62481702 (-0.75%) LOST: 44 GAINED: 26 Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:39 +00:00
Emma Anholt	5222c35924	intel/elk: Save the UW pixel x/y as a temp on gfx6+. This will be used for representing gl_FragCoord in NIR and reducing payload registers pushed. HSW results: total instructions in shared programs: 9940636 -> 9948574 (0.08%) instructions in affected programs: 852560 -> 860498 (0.93%) total cycles in shared programs: 314804525 -> 314900080 (0.03%) cycles in affected programs: 39786599 -> 39882154 (0.24%) LOST: 5 GAINED: 11 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:38 +00:00
Emma Anholt	af74abd68c	intel/fs: Don't bother checking if load_frag_coord uses interpolation. This was leftover dead code from `4bb6e6817e` ("intel: Use a system value for gl_FragCoord") -- the sysval doesn't do any interpolation and doesn't have sources that could use a barycentric. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25190>	2025-06-18 23:11:37 +00:00
Emma Anholt	0bf114736a	intel: Use the common NIR lowering for fquantize2f16. This generates one extra instruction to set the rounding mode to RTE due to f2f16_rtne in the lowering. This changes the result for fquantize2f16(65505.0) from 65536 to 65504, which fixes SPIR-V conformance for this value: If Value is positive with a magnitude too large to represent as a 16-bit floating-point value, the result is positive infinity. If Value is negative with a magnitude too large to represent as a 16-bit floating-point value, the result is negative infinity. SPIR-V doesn't specify whether this overflow check is before or after rounding, but IEEE specifies rounding first, which is what produces our 65504. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25552>	2025-06-18 22:45:08 +00:00
Dave Airlie	870b8717b2	Revert "hasvk/elk: stop turning load_push_constants into load_uniform" Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details This reverts commit `b036d2ded2`. This seems to break gtk4 and other stuff. Cc: mesa-stable (taking ack from Lionel saying we should revert) Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35407>	2025-06-09 09:20:19 +10:00
Matt Turner	37016468a5	intel/compiler: Align human-readable send message info This fprintf() was added in commit `cce3bea2a7` ("i965/disasm: Align send instruction meta-information with dst.")) to align the human-readable send message info (e.g. "render MsgDesc: RT write ...") with the destination register on the previous line. Two months later we disabled printing the instruction offset in commit `662f1ccc24` ("i965: Disable hex offset printing in disassembly."), thereby unaligning the human-readable send message info for the next 11 years. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35077>	2025-05-28 21:54:40 +00:00
Christian Gmeiner	41f2da1a6e	treewide: Do not use NIR_PASS_V for nir_divergence_analysis(..) Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35131>	2025-05-23 21:19:25 +00:00
Lionel Landwerlin	b036d2ded2	hasvk/elk: stop turning load_push_constants into load_uniform Those intrinsics have different semantics in particular with regards to divergence. Turning one into the other without invalidating the divergence information breaks NIR validation. But also the conversion means we get artificially less convergent values in the shaders. So just handle load_push_constants in the backend and stop changing things in Hasvk. Fixes a bunch of tests in dEQP-VK.descriptor_indexing.* dEQP-VK.pipeline..push_constant.graphics_pipeline.dynamic_index_ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34546>	2025-05-22 07:49:20 +00:00
Lionel Landwerlin	2d396f6085	intel: prepare VUE layout for more than 2 layouts Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:35 +00:00
Lionel Landwerlin	4717382f84	anv: lower input vertices for TCS unconditionally Take the opportunity to reuse the backend pass. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34109>	2025-05-08 06:48:34 +00:00
Rohan Garg	9e5d7eb88d	compiler/types: add a bfloat16 type Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34105>	2025-04-29 16:29:36 +00:00
Lionel Landwerlin	1f6cca0800	intel: fixup a few debugging option checks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ad328bc58d` ("intel: Switch uint64_t intel_debug to a bitset") Reviewed-by: Michael Cheng <michael.cheng@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34667>	2025-04-23 18:47:42 +00:00

1 2 3 4

197 commits