fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-17 05:18:12 +02:00

Author	SHA1	Message	Date
Jose Maria Casanova Crespo	8f06961bf5	broadcom/compiler: Eliminate redundant setnnmode instructions This new VIR optimization pass tracks the current NN signedness mode per block and removes duplicate setnnmode instructions. When consecutive dot products use the same signedness mode, the backend emits one setnnmode per dot product. This pass removes the redundant ones, keeping only the first. Assisted-by: Claude Opus 4.6 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41255>	2026-04-29 13:21:08 +00:00
Jose Maria Casanova Crespo	24ecc9cbcc	broadcom/compiler: Add v8dot and setnnmode scheduler dependencies. As nnmode register is read by v8dot instruction we need to add dependencies between setnnmode instructions and v8dot via the nnmode register, so they are scheduled correcty using last_nn_mode virtual register.. Add a last_nn_mode virtual register to the scheduler state and create: - Write dependencies for all SETNNMODE variants - Read dependencies for V8DOT. This follows the same pattern as the existing MULTOP/UMUL24 rtop tracking. Assisted-by: Claude Opus 4.6 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41255>	2026-04-29 13:21:08 +00:00
Jose Maria Casanova Crespo	33a700be91	broadcom/compiler: hardware-accelerated 4x8-bit dot products on V3D 7.1+ VIR instructions and nir_to_vir implementation of 4x8-bit dot products using native HW accelerated ALU instructions. setnnmode instructions are marked as having side effects. Assisted-by: Claude Opus 4.6 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41255>	2026-04-29 13:21:08 +00:00
Jose Maria Casanova Crespo	31c8e14df3	broadcom/compiler: MULTOP in branch delay slots doesn't generate RTOP hazard On unconditional branches qpu_set_branch_targets() can fill the delay slots with a copy of the first instructions of the successor block. As the qpu validator is sequential it would detect an incorrect hazard when the MULTOP was copied but the UMUL24 wasn't. This was identified in debug build when running gfxbench5.aztec_ruins_vk. Assisted-by: Claude Opus 4.6 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40923>	2026-04-14 16:34:54 +00:00
Jose Maria Casanova Crespo	dd6e7c8ef0	broadcom/compiler: really enable branch in delay slots validation The validation of branch instructions happening in branch and thrsw delay slots has been dead code since it was introduced as the check was after: if (inst->type != V3D_QPU_INSTR_TYPE_ALU) return; Now last_branch_ip is updated and checks in_branch_delay_slots() are active. Fixes in_branch_delay_slots, as for branch there are always 3 delay slots. As scheduler enforces this restrictions shader-db does not show any regression. Assisted-by: Claude Opus 4.6 Fixes: `90269ba353` ("broadcom/vc5: Use THRSW to enable multi-threaded shaders.") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40923>	2026-04-14 16:34:54 +00:00
Juan A. Suarez Romero	d4646cd444	broadcom: use Mesa logging functions Replace printf and nir_print_shaders by proper mesa_logX and nir_log_shaderX functions, that provides better features (like logging to a file, setting the logging verbosity, etc) and works better with Android. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40434>	2026-04-06 07:40:55 +00:00
Juan A. Suarez Romero	1e82e72039	broadcom/compiler: make some dump functions return strings instead of printf This will give better flexibility on how and where the dumps will be done. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40434>	2026-04-06 07:40:55 +00:00
Connor Abbott	22a061fb91	nir: Use better calculation for alpha-to-coverage mask The old calculation depended on the sample count, and gave subpar results for 8x MSAA with standard sample locations. The new calculation is based on the Intel pass, with some changing of the constants so that the sample count is always proportional to alpha for 2xMSAA and 4xMSAA and the addition of rotating the sample mask based on the pixel. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39335>	2026-03-20 18:09:48 +00:00
Faith Ekstrand	f2f792996d	Revert "nir: Add a type parameter to nir_lower_point_size()" This reverts commit `6ee4ea5ea3`. Reviewed-by: Lorenzo Rossi <lorenzo.rossi@collabora.com> Acked-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38681>	2026-03-12 22:59:13 +00:00
Daivik Bhatia	66c5c8fe19	broadcom/compiler: lower txf LOD for robustImageAccess2 on V3D 4.2 On V3D 4.2, txf instructions with an out of bounds LOD do not return robust values (zero) as required by robustImageAccess2. This commit introduces a NIR lowering pass that explicitly checks if the LOD is within bounds. If the LOD is out of bounds, the texture coordinate is replaced with an out of bounds value to force the hardware to return the robust value. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39430>	2026-03-12 19:14:24 +00:00
Daivik Bhatia	bd3e836046	v3dv: Implement robust_image_access_2 flag This flag is used to implement robustImageAccess2. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39430>	2026-03-12 19:14:24 +00:00
Juan A. Suarez Romero	675e5527ba	v3d: add support for GL_ARB_sample_shading Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Most of the work was already done for the Vulkan driver. The main difference to handle is that OpenGL request to ignore sample mask when the framebuffer is non-multisampled, while Vulkan applies it always. This also fixes KHR-GL31.frag_coord_conventions.multisample. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/40059>	2026-02-25 10:03:39 +00:00
Daivik Bhatia	026fa1799b	broadcom/compiler: Update comment clarifying OpTerminate implementation Explain why the driver uses demote instead of an immediate jump to the end of the shader for OpTerminate, noting that the jump approach showed no performance gains. Reference: !38381 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39703>	2026-02-10 06:20:25 +00:00
Maíra Canal	ba102224ab	broadcom/compiler: Don't lower to LCSSA before calling nir_divergence_analysis() Since commit `87cb42f9` ("treewide: don't lower to LCSSA before calling nir_divergence_analysis()"), NIR can calculate divergence without converting to LCSSA beforehand. Therefore, remove LCSSA lowering from Broadcom's compiler. Signed-off-by: Maíra Canal <mcanal@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39765>	2026-02-09 13:49:02 +00:00
Georg Lehmann	f414132399	broadcom/compiler: remove unpack_half support Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39511>	2026-02-06 06:12:36 +00:00
Georg Lehmann	d50f5387b4	broadcom/compiler: use f2f32 when lowering image load Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39511>	2026-02-06 06:12:36 +00:00
Iago Toral Quiroga	42bd467906	broadcom/compiler: inform NIR scheduler about 0 cost ALU instructions Some ALU instructions will likely end up being copy propagated in the backend, which means they would not have any cost. This helps the scheduler make better decisions for the new open-coded patterns produced in NIR for extracts (i.e. unpack_2x16) with MR#39511. With this (together with previous patches) we manage to produce similar shader-db results as with the unpack_2x16 NIR extract opcodes that MR#39511 will drop. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39687>	2026-02-05 11:29:42 +00:00
Iago Toral Quiroga	f93e8e76e9	broadcom/compiler: optimize alu(shr(x, 16).l) to alu(x.h) We need this to produce optimal code in the backend for sequences like this: 32 %10 = ushr %5.x, %9 (0x10) 16 %14 = u2u16 %10 32 %17 = f2f32 %14 With such code, our copy propagation pass will drop the u216 and with this patch we will be able to drop the ushr too. This pattern can show up for VK_KHR_16bit_storage when we successfully vectorize 16-bit loads into 32-bit loads, but will become a lot more common after MR#39511 lands, since that would also affect things like 16-bit TMU loads, which are more common. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39687>	2026-02-05 11:29:42 +00:00
Iago Toral Quiroga	4753a296f9	broadcom/compiler: don't always clear undefined bits from sub-32 integers We only really use sub-32bit integers in conversions, so we can skip clearing the MSB bits when we produce them by converting from larger types (leaving these bits undefined) and only clear them when we convert from them to larger types, since we don't have native opcodes to do these conversions that would only access relevant bits, at least on Pi4. Also, document the cases where we could do better for Pi5. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39687>	2026-02-05 11:29:42 +00:00
Iago Toral Quiroga	c589268b5c	broadcom/compiler: drop unnecessary MOV Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39687>	2026-02-05 11:29:41 +00:00
Faith Ekstrand	68d22b5a2a	nir/lower_blend: Move the format to nir_lower_blend_rt Reviewed-by: Christoph Pillmayer <christoph.pillmayer@arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/39367>	2026-01-19 21:33:14 +00:00
Juan A. Suarez Romero	13211eb2fc	broadcom/compiler: use skip_helpers with textures, UBOs and SSBOs Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Set the per-pixel mask based on the value of skip_helpers. This slightly increase the performance on several traces. fps_avg helped: gl_gfxbench_trex.trace: 22.30 -> 22.79 (2.20%) total fps_avg in all runs: 55.18 -> 55.71 (0.97%) total fps_avg in affected (through threshold) runs: 22.30 -> 22.79 (2.20%) helped: 1 HURT: 0 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38759>	2026-01-08 12:59:44 +00:00
Juan A. Suarez Romero	1e3da5c985	broadcom/compiler: enable skip_helpers It will be used with image loads to enable or disable helper invocations. This fixes a Vulkan CTS test that perform an imageLoad() inside a fwidth() operation. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38759>	2026-01-08 12:59:44 +00:00
Emma Anholt	059d301c79	nir: Drop the mode argument of nir_lower_vars_to_scratch(). It only makes sense for function temps, and that's the only way it's been used. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37245>	2025-12-17 19:50:28 +00:00
Jose Maria Casanova Crespo	40339ada9c	broadcom: Drop use of nir_lower_wrmasks v3d_nir_lower_load_store_bitsize that uses nir_lower_mem_access_bit_sizes already ensures that any writemask on store has consecutive bits set. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38921>	2025-12-15 11:34:27 +00:00
Ian Romanick	956a09b990	broadcom/compiler: only lower flrp once This is only compile tested. I have not collected any shader-db or fossil-db data. v2: Drop the calls to nir_opt_constant_folding. The builder in nir_lower_flrp will already take care of this. v3: NIR_PASS_V is gone. Noticed by Marge. Acked-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12526>	2025-12-02 21:28:05 +00:00
Marek Olšák	9a56672f56	nir: add shader_info::disable_input/output_offset_src_constant_folding and set it where needed to prevent nir_opt_constant_folding from breaking those drivers. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38277>	2025-11-29 00:16:38 +00:00
Iago Toral Quiroga	a643681dd5	broadcom/compiler: use nir_opt_uub Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Shows performance improvement on aztec/aztec_high fps_avg helped: gl_aztec.trace: 6.37 -> 6.45 (1.26%) fps_avg helped: gl_aztec_high.trace: 4.29 -> 4.33 (0.93%) And a significant instruction count reduction in the affected shaders. But some shaders show a huge reduction. gles_aztec/274.shader_test MESA_SHADER_COMPUTE: 1375 -> 1196 (-13.02%) gles_aztec_high/499.shader_test MESA_SHADER_COMPUTE: 1375 -> 1196 (-13.02%) master-of-orion/1253.shader_test MESA_SHADER_FRAGMENT: 305 -> 262 (-14.10%) blender/7.shader_test MESA_SHADER_FRAGMENT: 12389 -> 10455 (-15.61%) master-of-orion/1256.shader_test MESA_SHADER_VERTEX: 170 -> 131 (-22.94%) total instructions in shared programs: 14679696 -> 14675496 (-0.03%) instructions in affected programs: 196683 -> 192483 (-2.14%) helped: 430 HURT: 8 Instructions are helped. total uniforms in shared programs: 6775582 -> 6775495 (<.01%) uniforms in affected programs: 21155 -> 21068 (-0.41%) helped: 48 HURT: 2 Uniforms are helped. total max-temps in shared programs: 2709673 -> 2709710 (<.01%) max-temps in affected programs: 403 -> 440 (9.18%) helped: 2 HURT: 16 Max-temps are HURT. Signed-off-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38642>	2025-11-26 13:32:39 +00:00
Jose Maria Casanova Crespo	4234e7eed0	broadcom/compiler: enable umul24 and imul24 ALU opcodes For umul24 we expose the operation as UMUL24_RTOP0 so we can identify the difference between umul24 as part of a sequence generated from an imul as "multop+umul24" and a simple umul24 where rtop will always be 0. For umul24_rtop0 instructions we relax the scheduling restrictions, so they don't need to be serialized like the multop+umul24 ops. But we maintain the read dependency with the last_rtop. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38642>	2025-11-26 13:32:39 +00:00
Alyssa Rosenzweig	2c2dd835af	nir/lower_wrmasks: drop callback All drivers use the same callback and it is unlikely that new drivers will use this pass since it has better replacements today (lower_mem_bit_sizes for memory, and it never worked for I/O). This should discourage as much. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <maraeo@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38533>	2025-11-26 03:20:39 +00:00
Dave Airlie	26eaba935d	nir: add a cmat call instruction type. This adds a new instruction type to handle cooperative matrix calls. This clones the call instr, drops callee, and adds a single metadata slot and a call operation (dummy only for now). (Not NACKed by Alyssa) Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38389>	2025-11-17 23:33:58 +00:00
Marek Olšák	e372365cf4	nir: rename nir_copy_prop -> nir_opt_copy_prop Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38411>	2025-11-15 02:16:38 +00:00
Konstantin Seurer	de32f9275f	treewide: add & use parent instr helpers We add a bunch of new helpers to avoid the need to touch >parent_instr, including the full set of: * nir_def_is_* * nir_def_as__or_null nir_def_as_* [assumes the right instr type] * nir_src_is_* * nir_src_as_* * nir_scalar_is_* * nir_scalar_as_* Plus nir_def_instr() where there's no more suitable helper. Also an existing helper is renamed to unify all the names, while we're churning the tree: * nir_src_as_alu_instr -> nir_src_as_alu ..and then we port the tree to use the helpers as much as possible, using nir_def_instr() where that does not work. Acked-by: Marek Olšák <maraeo@gmail.com> --- To eliminate nir_def::parent_instr we need to churn the tree anyway, so I'm taking this opportunity to clean up a lot of NIR patterns. Co-authored-by: Konstantin Seurer <konstantin.seurer@gmail.com> Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38313>	2025-11-12 21:22:13 +00:00
Faith Ekstrand	6ee4ea5ea3	nir: Add a type parameter to nir_lower_point_size() On Mali, we need not only clamp but also convert to float16 on Valhall+. We could have a separate pass for this but it fits in nicely with the rest of nir_lower_point_size() so we might as well put it there. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38379>	2025-11-12 01:34:36 +00:00
Konstantin Seurer	b962063d72	nir: Remove nir_parallel_copy_instr Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36483>	2025-11-04 18:51:51 +00:00
Marek Olšák	2f6b4803ab	nir/validate: expand IO intrinsic validation with nir_io_semantics Some checks are pending macOS-CI / macOS-CI (dri) (push) Waiting to run Details macOS-CI / macOS-CI (xlib) (push) Waiting to run Details There are many workarounds. v2: add more validation Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/38113>	2025-11-02 02:21:46 +00:00
Daniel Schürmann	10be538851	tree-wide: don't call nir_opt_constant_folding after nir_lower_flrp Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37195>	2025-10-30 19:28:07 +00:00
Daivik Bhatia	cdef2c0b61	broadcom/common: Add subgroup support to CSD super-group packing Certain subgroup operations don’t impose constraints on CSD supergroup packing. Mark these as supported and account for them in v3d_csd_choose_workgroups_per_supergroup() so packing remains unchanged when they are present. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37836>	2025-10-13 08:25:24 +02:00
Alejandro Piñeiro	cea6d7ada5	v3d: expose GL_KHR_shader_subgroup for v71+ All the compiler support was implemented as part of the v3dv implementation (see commit `31e8740808` and MR#27211). We are using the same size/supported_stages and mostly the same supported features, so probably at some point it would be good to have a common place for that info. Zink reuses their definitions, but as far as I see it does that because the PIPE and equivalent VK definitions has the same values, that seems somewhat fragile. We don't support all features, and in order to support arithmetic we need to enable a lowering. Using CTS, right now we are passing 1023 tests out of 6053 (the rest are skipped). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37621>	2025-10-08 10:48:41 +00:00
Ella Stanforth	aaa858f958	v3d/compiler: Implement 16bit normalised render targets. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:42 +00:00
Ella Stanforth	c9e9d72cce	v3d/compiler: implement normalised to float conversions Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:42 +00:00
Ella Stanforth	9263e1838b	v3d/compiler: Lower load_output after logic operations Fixes: `42154029fc` ("v3d/compiler: Implement software blend lowering") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:42 +00:00
Ella Stanforth	0a640f42c5	v3d/compiler: Add unpacking instructions for normalised 16bit formats. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:41 +00:00
Ella Stanforth	ee48e81b26	v3d: Always lower frag color Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/35820>	2025-09-30 12:48:39 +00:00
Simon Perretta	2a7ebf2ae0	nir/lower_alpha: extend to support dynamic a2c Signed-off-by: Simon Perretta <simon.perretta@imgtec.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/37512>	2025-09-30 12:15:53 +00:00
Qiang Yu	c135ed1eb9	all: rename gl_shader_stage_name to mesa_shader_stage_name Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:41 +08:00
Qiang Yu	196569b1a4	all: rename gl_shader_stage to mesa_shader_stage It's not only for GL, change to a generic name. Use command: find . -type f -not -path '/.git/' -exec sed -i 's/\bgl_shader_stage\b/mesa_shader_stage/g' {} + Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36569>	2025-08-06 10:28:40 +08:00
Alyssa Rosenzweig	82ae8b1d33	treewide: simplify nir_def_rewrite_uses_after Most of the time with nir_def_rewrite_uses_after, you want to rewrite after the replacement. Make that the default thing to be more ergonomic and to drop parent_instr uses. We leave nir_def_rewrite_uses_after_instr defined if you really want the old signature with an arbitrary after point. Via Coccinelle patch: @@ expression a, b; @@ -nir_def_rewrite_uses_after(a, b, b->parent_instr) +nir_def_rewrite_uses_after_def(a, b) Followed by a bunch of sed. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Alyssa Rosenzweig	cc6e3b84cb	treewide: use nir_def_as_* Via Coccinelle patch: @@ expression definition; @@ -nir_instr_as_alu(definition->parent_instr) +nir_def_as_alu(definition) @@ expression definition; @@ -nir_instr_as_intrinsic(definition->parent_instr) +nir_def_as_intrinsic(definition) @@ expression definition; @@ -nir_instr_as_phi(definition->parent_instr) +nir_def_as_phi(definition) @@ expression definition; @@ -nir_instr_as_load_const(definition->parent_instr) +nir_def_as_load_const(definition) @@ expression definition; @@ -nir_instr_as_deref(definition->parent_instr) +nir_def_as_deref(definition) @@ expression definition; @@ -nir_instr_as_tex(definition->parent_instr) +nir_def_as_tex(definition) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <maraeo@gmail.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36489>	2025-08-01 15:34:24 +00:00
Antonio Ospite	ddf2aa3a4d	build: avoid redefining unreachable() which is standard in C23 In the C23 standard unreachable() is now a predefined function-like macro in <stddef.h> See https://android.googlesource.com/platform/bionic/+/HEAD/docs/c23.md#is-now-a-predefined-function_like-macro-in And this causes build errors when building for C23: ----------------------------------------------------------------------- In file included from ../src/util/log.h:30, from ../src/util/log.c:30: ../src/util/macros.h:123:9: warning: "unreachable" redefined 123 \| #define unreachable(str) \ \| ^~~~~~~~~~~ In file included from ../src/util/macros.h:31: /usr/lib/gcc/x86_64-linux-gnu/14/include/stddef.h:456:9: note: this is the location of the previous definition 456 \| #define unreachable() (__builtin_unreachable ()) \| ^~~~~~~~~~~ ----------------------------------------------------------------------- So don't redefine it with the same name, but use the name UNREACHABLE() to also signify it's a macro. Using a different name also makes sense because the behavior of the macro was extending the one of __builtin_unreachable() anyway, and it also had a different signature, accepting one argument, compared to the standard unreachable() with no arguments. This change improves the chances of building mesa with the C23 standard, which for instance is the default in recent AOSP versions. All the instances of the macro, including the definition, were updated with the following command line: git grep -l '[^_]unreachable(' -- "src/**" \| sort \| uniq \| \ while read file; \ do \ sed -e 's/$[^_]$unreachable(/\1UNREACHABLE(/g' -i "$file"; \ done && \ sed -e 's/#undef unreachable/#undef UNREACHABLE/g' -i src/intel/isl/isl_aux_info.c Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/36437>	2025-07-31 17:49:42 +00:00

1 2 3 4 5 ...

1004 commits