fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-18 05:08:06 +02:00

Author	SHA1	Message	Date
Iago Toral Quiroga	42bd467906	broadcom/compiler: inform NIR scheduler about 0 cost ALU instructions Some ALU instructions will likely end up being copy propagated in the backend, which means they would not have any cost. This helps the scheduler make better decisions for the new open-coded patterns produced in NIR for extracts (i.e. unpack_2x16) with MR#39511. With this (together with previous patches) we manage to produce similar shader-db results as with the unpack_2x16 NIR extract opcodes that MR#39511 will drop. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39687>	2026-02-05 11:29:42 +00:00
Iago Toral Quiroga	f93e8e76e9	broadcom/compiler: optimize alu(shr(x, 16).l) to alu(x.h) We need this to produce optimal code in the backend for sequences like this: 32 %10 = ushr %5.x, %9 (0x10) 16 %14 = u2u16 %10 32 %17 = f2f32 %14 With such code, our copy propagation pass will drop the u2u16 and with this patch we will be able to drop the ushr too. This pattern can show up for VK_KHR_16bit_storage when we successfully vectorize 16-bit loads into 32-bit loads, but will become a lot more common after MR#39511 lands, since that would also affect things like 16-bit TMU loads, which are more common. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39687>	2026-02-05 11:29:42 +00:00
Iago Toral Quiroga	4753a296f9	broadcom/compiler: don't always clear undefined bits from sub-32 integers We only really use sub-32bit integers in conversions, so we can skip clearing the MSB bits when we produce them by converting from larger types (leaving these bits undefined) and only clear them when we convert from them to larger types, since we don't have native opcodes to do these conversions that would only access relevant bits, at least on Pi4. Also, document the cases where we could do better for Pi5. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39687>	2026-02-05 11:29:42 +00:00
Iago Toral Quiroga	c589268b5c	broadcom/compiler: drop unnecessary MOV Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39687>	2026-02-05 11:29:41 +00:00
Faith Ekstrand	68d22b5a2a	nir/lower_blend: Move the format to nir_lower_blend_rt Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:14 +00:00
Juan A. Suarez Romero	13211eb2fc	broadcom/compiler: use skip_helpers with textures, UBOs and SSBOs Set the per-pixel mask based on the value of skip_helpers. This slightly increase the performance on several traces. fps_avg helped: gl_gfxbench_trex.trace: 22.30 -> 22.79 (2.20%) total fps_avg in all runs: 55.18 -> 55.71 (0.97%) total fps_avg in affected (through threshold) runs: 22.30 -> 22.79 (2.20%) helped: 1 HURT: 0 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38759>	2026-01-08 12:59:44 +00:00
Juan A. Suarez Romero	1e3da5c985	broadcom/compiler: enable skip_helpers It will be used with image loads to enable or disable helper invocations. This fixes a Vulkan CTS test that perform an imageLoad() inside a fwidth() operation. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38759>	2026-01-08 12:59:44 +00:00
Emma Anholt	059d301c79	nir: Drop the mode argument of nir_lower_vars_to_scratch(). It only makes sense for function temps, and that's the only way it's been used. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37245>	2025-12-17 19:50:28 +00:00
Jose Maria Casanova Crespo	40339ada9c	broadcom: Drop use of nir_lower_wrmasks v3d_nir_lower_load_store_bitsize that uses nir_lower_mem_access_bit_sizes already ensures that any writemask on store has consecutive bits set. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38921>	2025-12-15 11:34:27 +00:00
Ian Romanick	956a09b990	broadcom/compiler: only lower flrp once This is only compile tested. I have not collected any shader-db or fossil-db data. v2: Drop the calls to nir_opt_constant_folding. The builder in nir_lower_flrp will already take care of this. v3: NIR_PASS_V is gone. Noticed by Marge. Acked-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12526>	2025-12-02 21:28:05 +00:00
Marek Olšák	9a56672f56	nir: add shader_info::disable_input/output_offset_src_constant_folding and set it where needed to prevent nir_opt_constant_folding from breaking those drivers. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Iago Toral Quiroga	a643681dd5	broadcom/compiler: use nir_opt_uub Shows performance improvement on aztec/aztec_high fps_avg helped: gl_aztec.trace: 6.37 -> 6.45 (1.26%) fps_avg helped: gl_aztec_high.trace: 4.29 -> 4.33 (0.93%) And a significant instruction count reduction in the affected shaders. But some shaders show a huge reduction. gles_aztec/274.shader_test MESA_SHADER_COMPUTE: 1375 -> 1196 (-13.02%) gles_aztec_high/499.shader_test MESA_SHADER_COMPUTE: 1375 -> 1196 (-13.02%) master-of-orion/1253.shader_test MESA_SHADER_FRAGMENT: 305 -> 262 (-14.10%) blender/7.shader_test MESA_SHADER_FRAGMENT: 12389 -> 10455 (-15.61%) master-of-orion/1256.shader_test MESA_SHADER_VERTEX: 170 -> 131 (-22.94%) total instructions in shared programs: 14679696 -> 14675496 (-0.03%) instructions in affected programs: 196683 -> 192483 (-2.14%) helped: 430 HURT: 8 Instructions are helped. total uniforms in shared programs: 6775582 -> 6775495 (<.01%) uniforms in affected programs: 21155 -> 21068 (-0.41%) helped: 48 HURT: 2 Uniforms are helped. total max-temps in shared programs: 2709673 -> 2709710 (<.01%) max-temps in affected programs: 403 -> 440 (9.18%) helped: 2 HURT: 16 Max-temps are HURT. Signed-off-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38642>	2025-11-26 13:32:39 +00:00
Jose Maria Casanova Crespo	4234e7eed0	broadcom/compiler: enable umul24 and imul24 ALU opcodes For umul24 we expose the operation as UMUL24_RTOP0 so we can identify the difference between umul24 as part of a sequence generated from an imul as "multop+umul24" and a simple umul24 where rtop will always be 0. For umul24_rtop0 instructions we relax the scheduling restrictions, so they don't need to be serialized like the multop+umul24 ops. But we maintain the read dependency with the last_rtop. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38642>	2025-11-26 13:32:39 +00:00
Alyssa Rosenzweig	2c2dd835af	nir/lower_wrmasks: drop callback All drivers use the same callback and it is unlikely that new drivers will use this pass since it has better replacements today (lower_mem_bit_sizes for memory, and it never worked for I/O). This should discourage as much. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38533>	2025-11-26 03:20:39 +00:00
Dave Airlie	26eaba935d	nir: add a cmat call instruction type. This adds a new instruction type to handle cooperative matrix calls. This clones the call instr, drops callee, and adds a single metadata slot and a call operation (dummy only for now). (Not NACKed by Alyssa) Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38389>	2025-11-17 23:33:58 +00:00
Marek Olšák	e372365cf4	nir: rename nir_copy_prop -> nir_opt_copy_prop Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38411>	2025-11-15 02:16:38 +00:00
Konstantin Seurer	de32f9275f	treewide: add & use parent instr helpers We add a bunch of new helpers to avoid the need to touch ->parent_instr, including the full set of: * nir_def_is_* * nir_def_as_*_or_null * nir_def_as_* [assumes the right instr type] * nir_src_is_* * nir_src_as_* * nir_scalar_is_* * nir_scalar_as_* Plus nir_def_instr() where there's no more suitable helper. Also an existing helper is renamed to unify all the names, while we're churning the tree: * nir_src_as_alu_instr -> nir_src_as_alu ..and then we port the tree to use the helpers as much as possible, using nir_def_instr() where that does not work. Acked-by: Marek Olšák <maraeo@gmail.com> --- To eliminate nir_def::parent_instr we need to churn the tree anyway, so I'm taking this opportunity to clean up a lot of NIR patterns. Co-authored-by: Konstantin Seurer <konstantin.seurer@gmail.com> Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38313>	2025-11-12 21:22:13 +00:00
Faith Ekstrand	6ee4ea5ea3	nir: Add a type parameter to nir_lower_point_size() On Mali, we need not only clamp but also convert to float16 on Valhall+. We could have a separate pass for this but it fits in nicely with the rest of nir_lower_point_size() so we might as well put it there. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38379>	2025-11-12 01:34:36 +00:00
Konstantin Seurer	b962063d72	nir: Remove nir_parallel_copy_instr Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36483>	2025-11-04 18:51:51 +00:00
Marek Olšák	2f6b4803ab	nir/validate: expand IO intrinsic validation with nir_io_semantics There are many workarounds. v2: add more validation Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38113>	2025-11-02 02:21:46 +00:00
Daniel Schürmann	10be538851	tree-wide: don't call nir_opt_constant_folding after nir_lower_flrp Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37195>	2025-10-30 19:28:07 +00:00
Daivik Bhatia	cdef2c0b61	broadcom/common: Add subgroup support to CSD super-group packing Certain subgroup operations don’t impose constraints on CSD supergroup packing. Mark these as supported and account for them in v3d_csd_choose_workgroups_per_supergroup() so packing remains unchanged when they are present. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37836>	2025-10-13 08:25:24 +02:00
Alejandro Piñeiro	cea6d7ada5	v3d: expose GL_KHR_shader_subgroup for v71+ All the compiler support was implemented as part of the v3dv implementation (see commit `31e8740808` and MR#27211). We are using the same size/supported_stages and mostly the same supported features, so probably at some point it would be good to have a common place for that info. Zink reuses their definitions, but as far as I see it does that because the PIPE and equivalent VK definitions has the same values, that seems somewhat fragile. We don't support all features, and in order to support arithmetic we need to enable a lowering. Using CTS, right now we are passing 1023 tests out of 6053 (the rest are skipped). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37621>	2025-10-08 10:48:41 +00:00
Ella Stanforth	aaa858f958	v3d/compiler: Implement 16bit normalised render targets. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:42 +00:00
Ella Stanforth	c9e9d72cce	v3d/compiler: implement normalised to float conversions Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:42 +00:00
Ella Stanforth	9263e1838b	v3d/compiler: Lower load_output after logic operations Fixes: `42154029fc` ("v3d/compiler: Implement software blend lowering") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:42 +00:00
Ella Stanforth	0a640f42c5	v3d/compiler: Add unpacking instructions for normalised 16bit formats. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:41 +00:00
Ella Stanforth	ee48e81b26	v3d: Always lower frag color Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:39 +00:00
Simon Perretta	2a7ebf2ae0	nir/lower_alpha: extend to support dynamic a2c Signed-off-by: Simon Perretta <simon.perretta@imgtec.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37512>	2025-09-30 12:15:53 +00:00
Qiang Yu	c135ed1eb9	all: rename gl_shader_stage_name to mesa_shader_stage_name Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:41 +08:00
Qiang Yu	196569b1a4	all: rename gl_shader_stage to mesa_shader_stage It's not only for GL, change to a generic name. Use command: find . -type f -not -path '*/.git/*' -exec sed -i 's/\bgl_shader_stage\b/mesa_shader_stage/g' {} + Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:40 +08:00
Alyssa Rosenzweig	82ae8b1d33	treewide: simplify nir_def_rewrite_uses_after Most of the time with nir_def_rewrite_uses_after, you want to rewrite after the replacement. Make that the default thing to be more ergonomic and to drop parent_instr uses. We leave nir_def_rewrite_uses_after_instr defined if you really want the old signature with an arbitrary after point. Via Coccinelle patch: @@ expression a, b; @@ -nir_def_rewrite_uses_after(a, b, b->parent_instr) +nir_def_rewrite_uses_after_def(a, b) Followed by a bunch of sed. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Alyssa Rosenzweig	cc6e3b84cb	treewide: use nir_def_as_* Via Coccinelle patch: @@ expression definition; @@ -nir_instr_as_alu(definition->parent_instr) +nir_def_as_alu(definition) @@ expression definition; @@ -nir_instr_as_intrinsic(definition->parent_instr) +nir_def_as_intrinsic(definition) @@ expression definition; @@ -nir_instr_as_phi(definition->parent_instr) +nir_def_as_phi(definition) @@ expression definition; @@ -nir_instr_as_load_const(definition->parent_instr) +nir_def_as_load_const(definition) @@ expression definition; @@ -nir_instr_as_deref(definition->parent_instr) +nir_def_as_deref(definition) @@ expression definition; @@ -nir_instr_as_tex(definition->parent_instr) +nir_def_as_tex(definition) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Antonio Ospite	ddf2aa3a4d	build: avoid redefining unreachable() which is standard in C23 In the C23 standard unreachable() is now a predefined function-like macro in <stddef.h> See https://android.googlesource.com/platform/bionic/+/HEAD/docs/c23.md#is-now-a-predefined-function_like-macro-in And this causes build errors when building for C23: ----------------------------------------------------------------------- In file included from ../src/util/log.h:30, from ../src/util/log.c:30: ../src/util/macros.h:123:9: warning: "unreachable" redefined 123 | #define unreachable(str) \ | ^~~~~~~~~~~ In file included from ../src/util/macros.h:31: /usr/lib/gcc/x86_64-linux-gnu/14/include/stddef.h:456:9: note: this is the location of the previous definition 456 | #define unreachable() (__builtin_unreachable ()) | ^~~~~~~~~~~ ----------------------------------------------------------------------- So don't redefine it with the same name, but use the name UNREACHABLE() to also signify it's a macro. Using a different name also makes sense because the behavior of the macro was extending the one of __builtin_unreachable() anyway, and it also had a different signature, accepting one argument, compared to the standard unreachable() with no arguments. This change improves the chances of building mesa with the C23 standard, which for instance is the default in recent AOSP versions. All the instances of the macro, including the definition, were updated with the following command line: git grep -l '[^_]unreachable(' -- "src/**" | sort | uniq | \ while read file; \ do \ sed -e 's/\([^_]\)unreachable(/\1UNREACHABLE(/g' -i "$file"; \ done && \ sed -e 's/#undef unreachable/#undef UNREACHABLE/g' -i src/intel/isl/isl_aux_info.c Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36437>	2025-07-31 17:49:42 +00:00
Alejandro Piñeiro	fa8731b859	broadcom/compiler: update compact arrays comment PIPE_CAP_NIR_COMPACT_ARRAYS is gone since commit `2e5d49b3dd` v3d properly uses the compact_arrays option from nir_shader_compiler_options since commit `d694c1b094` Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36446>	2025-07-30 22:45:33 +00:00
Antonio Ospite	1b55314615	broadcom/compiler: prevent FALLTHROUGH error with C23 When building for the C23 standard, the compiler gives the following error: ----------------------------------------------------------------------- ../src/broadcom/compiler/vir_register_allocate.c ../src/broadcom/compiler/vir_register_allocate.c: In function ‘update_graph_and_reg_classes_for_inst’: ../src/broadcom/compiler/vir_register_allocate.c:1225:44: error: expected statement before ‘;’ token 1225 | FALLTHROUGH; | ^ ../src/broadcom/compiler/vir_register_allocate.c:1225:44: warning: ‘fallthrough’ attribute ignored [-Wattributes] ../src/broadcom/compiler/vir_register_allocate.c:1225:44: error: suggest braces around empty body in an ‘else’ statement [-Werror=empty-body] ../src/broadcom/compiler/vir_register_allocate.c:1222:28: warning: this statement may fall through [-Wimplicit-fallthrough=] 1222 | if (c->devinfo->ver >= 71) | ^ ../src/broadcom/compiler/vir_register_allocate.c:1226:17: note: here 1226 | case 1: | ^~~~ ----------------------------------------------------------------------- Fix that by doing what the compiler suggests, i.e. by using braces around empty body in the ‘else’ statement. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36323>	2025-07-29 14:07:07 +00:00
Georg Lehmann	3bc691f116	broadcom/compiler: use NIR_PASS for nir_schedule This should work now that the pass returns progress and invalidates metadata. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36291>	2025-07-23 06:47:58 +00:00
Daniel Schürmann	2c51a8870d	nir: add nir_vectorize_cb callback parameter to nir_lower_phis_to_scalar() Similar to nir_lower_alu_width(), the callback can return the desired number of components for a phi, or 0 for no lowering. The previous behavior of nir_lower_phis_to_scalar() with lower_all=true can be elicited via nir_lower_all_phis_to_scalar() while the previous behavior with lower_all=false now corresponds to nir_lower_phis_to_scalar() with NULL callback. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Mel Henning <mhenning@darkrefraction.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35783>	2025-07-08 15:33:59 +00:00
Marek Olšák	89285e25b6	nir: remove nir_shader_compiler_options::lower_all_io_to_temps All drivers should report support_indirect_* correctly, so this is redundant. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35945>	2025-07-08 06:11:43 +00:00
Alyssa Rosenzweig	d31cb824df	treewide: use VARYING_BIT_* Via Coccinelle patch generated by the following Python: varys = [ "POS", "COL0", "COL1", "FOGC", "TEX0", "TEX1", "TEX2", "TEX3", "TEX4", "TEX5", "TEX6", "TEX7", "PSIZ", "BFC0", "BFC1", "EDGE", "CLIP_VERTEX", "CLIP_DIST0", "CLIP_DIST1", "CULL_DIST0", "CULL_DIST1", "PRIMITIVE_ID", "PRIMITIVE_COUNT", "LAYER", "VIEWPORT", "FACE", "PRIMITIVE_SHADING_RATE", "PNTC", "TESS_LEVEL_OUTER", "TESS_LEVEL_INNER", "PRIMITIVE_INDICES", "BOUNDING_BOX0", "BOUNDING_BOX1", "VIEWPORT_MASK", "CULL_PRIMITIVE" ] t = """ @@ @@ -(1 << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -(1ull << VARYING_SLOT_${V}) +VARYING_BIT_${V} @@ @@ -BITFIELD64_BIT(VARYING_SLOT_${V}) +VARYING_BIT_${V} """ for v in varys: from mako.template import Template print(Template(t).render(V = v)) Closes: #13453 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Marek Olšák <maraeo@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> [panfrost, common] Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> [broadcom] Reviewed-by: Corentin Noël <corentin.noel@collabora.com> [virgl] Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> [zink] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35917>	2025-07-04 19:01:04 +00:00
Alejandro Piñeiro	0f830c5572	v3d/compiler: properly handle the RA debug option RA is intended to dump the vir when register allocation fails, so it should be checked when we set c->compilation_result to FAILED_REGISTER_ALLOCATION. But it seems that this option was forgotten when on some of the refactorings around compilation_result, as was let on an old condition that reported register allocation failures. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35823>	2025-06-30 09:36:04 +00:00
Marek Olšák	439d805291	nir: rename nir_lower_io_to_scalar_early -> nir_lower_io_vars_to_scalar Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35760>	2025-06-26 18:20:49 +00:00
Emma Anholt	eae86f455c	v3d: Stop advertising support for HW clip planes. The GL frontend is perfectly good at lowering it like we do. Cuts out a bunch of duplicate code. We still have ucp_enables for the FS due to lowering of CLIPDIST to discards in the FS. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8953>	2025-06-19 21:44:55 +00:00
Iago Toral Quiroga	c059c721fb	broadcom/compiler: handle moving last ubo load in the block correctly Before we move a UBO load to a previous location in the block we take a reference to the instruction after it so we can continue the loop from there, however, if the load we just moved was already the last instruction in the block we just want to break the loop right there. Fixes crashes with shaders from http://flightradar24.com Fixes: `8998666de7` ("broadcom/compiler: sort constant UBO loads by index and offset") Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35333>	2025-06-04 11:50:30 +00:00
Juan A. Suarez Romero	5505bb6c6d	v3d/compiler: don't use deprecated NIR_PASS_V macro We still keep it for the case of nir_scheduling, as this pass requires to be adapted to return the progress as well as update the metadata. Check more details at https://gitlab.freedesktop.org/mesa/mesa/-/issues/10409. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35127>	2025-05-30 14:24:24 +02:00
Christian Gmeiner	41f2da1a6e	treewide: Do not use NIR_PASS_V for nir_divergence_analysis(..) Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35131>	2025-05-23 21:19:25 +00:00
Ella Stanforth	be3ce07f58	v3d/compiler: Fix ub when using memcmp for texture comparisons. We need to zero out all memory in the struct otherwise memcmp ends up comparing padding bytes. Cc: mesa-stable Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34945>	2025-05-16 16:05:21 +00:00
Ella Stanforth	b3cc871b7c	v3d/compiler: remove requirement for format information for fbfetch Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34742>	2025-05-08 06:25:22 +00:00
Ella Stanforth	9a71e6dcc2	v3d/compiler: use mask for 16bit and 32bit return values There are only ever two possibilities here so lets use a mask. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34742>	2025-05-08 06:25:22 +00:00
Ella Stanforth	bb07364c54	v3d/compiler: remove num_samplers_used from shader key This is only ever used by assertions. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/34742>	2025-05-08 06:25:22 +00:00

988 commits