fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-20 22:30:12 +01:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	0aaf174e31	nir/lower_system_values: add ID to 32-bit lowering OpenCL has 64-bit global IDs, but for driver-internal OpenCL we only need 32-bit. Might as well lower in nir_lower_system_values instead of bringing up a whole new pass just for this. Will be used for asahi precomp Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32210>	2024-11-21 21:50:30 +00:00
Alyssa Rosenzweig	39afffe956	nir: split off some definitions for OpenCL we want some enum values on device for NIR->CL bindings. specifically, src_type/dest_type indices. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:51 +00:00
Alyssa Rosenzweig	3da8444be5	nir: add names to function parameters SPIR-V has this information. We should try to preserve it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:51 +00:00
Marek Olšák	25d4943481	nir: make use_interpolated_input_intrinsics a nir_lower_io parameter This will need to be set to true when the GLSL linker lowers IO, which can later be unlowered by st/mesa, and then drivers can lower it again without load_interpolated_input. Therefore, it can't be a global immutable option. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32229>	2024-11-20 02:45:37 +00:00
Marek Olšák	4da5b11ca9	nir: add nir_io_separate_clip_cull_distance_arrays to replace PIPE_CAP to make the flag available in NIR passes Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	23eb4f3454	nir: rename nir_io_glsl_opt_varyings to nir_io_dont_optimize and deprecate it The meaning is negated. This NIR option is deprecated and shouldn't be used. It means any IO optimizations can be disabled and it's a currently a workaround for zink, which is the only driver that asks for it by default. The original option is replaced by an environment variable for the GLSL linker. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	dacae272bf	nir: add nir_io_semantics::fb_fetch_output_coherent Lowering IO should preserve this. Freedreno needs it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Caterina Shablia	a5bcf566a9	nir: lower INSTANCE_{ID,INDEX} to an offset load_instance_{index,id} respectively If the hardware does not support INSTANCE_INDEX natively, it will be lowered to load_instance_id + base_instance. Otherwise, INSTANCE_ID will be lowered to load_instance_index - base_instance. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32158>	2024-11-19 09:18:47 +00:00
Marek Olšák	f9b03cf405	nir/opt_varyings: add nir_io_compaction_rotates_color_channels This was enabled by default in nir_opt_varyings, but vc4 can't handle when shader outputs write Y but not X. Add an option for it and enable it only for the driver that benefits from it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	8518e1cfd7	nir/opt_varyings: add nir_io_always_interpolate_convergent_fs_inputs for Asahi Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Rhys Perry	45c1280d2c	nir_lower_mem_access_bit_sizes: pass access to callback Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Rhys Perry	61752152f7	nir_lower_mem_access_bit_sizes: add nir_mem_access_shift_method Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Rhys Perry	80b76ba692	nir: add more intrinsics to nir_intrinsic_can_reorder Including nir_intrinsic_load_global. fossil-db (navi21): Totals from 2725 (3.43% of 79395) affected shaders: MaxWaves: 71972 -> 71964 (-0.01%); split: +0.01%, -0.02% Instrs: 2831052 -> 2819902 (-0.39%); split: -0.45%, +0.06% CodeSize: 15047548 -> 14973072 (-0.49%); split: -0.57%, +0.08% VGPRs: 108864 -> 108856 (-0.01%); split: -0.02%, +0.01% SpillSGPRs: 906 -> 926 (+2.21%) SpillVGPRs: 196 -> 1092 (+457.14%) Scratch: 729088 -> 741376 (+1.69%) Latency: 16621317 -> 16586551 (-0.21%); split: -0.34%, +0.13% InvThroughput: 4169987 -> 4164876 (-0.12%); split: -0.23%, +0.11% VClause: 63247 -> 63471 (+0.35%); split: -0.21%, +0.56% SClause: 56978 -> 55276 (-2.99%); split: -3.50%, +0.51% Copies: 252545 -> 252495 (-0.02%); split: -0.98%, +0.96% Branches: 91378 -> 91388 (+0.01%); split: -0.03%, +0.04% PreSGPRs: 112753 -> 126850 (+12.50%); split: -0.48%, +12.98% PreVGPRs: 90617 -> 90708 (+0.10%) VALU: 1709034 -> 1709368 (+0.02%); split: -0.01%, +0.03% SALU: 463554 -> 462253 (-0.28%); split: -0.57%, +0.29% VMEM: 115952 -> 116272 (+0.28%); split: -0.21%, +0.49% SMEM: 129097 -> 120538 (-6.63%); split: -6.64%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Georg Lehmann	34f41abe24	nir: add nir_def_all_uses_ignore_sign_bit Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31844>	2024-11-12 18:03:57 +00:00
Konstantin Seurer	f2c204daf0	nir: Add a first_line parameter to gather_debug_info Useful when the file contains multiple shaders. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29298>	2024-11-11 08:39:14 +00:00
Konstantin	4d09cd7fa5	nir/lower_non_uniform_access: Group accesses using the same resource Avoids emitting the waterfall loop for every access if they use the same resource: waterfall_loop { access } waterfall_loop { access } -> waterfall_loop { access access } Totals from 276 (0.33% of 84770) affected shaders: MaxWaves: 3360 -> 3356 (-0.12%) Instrs: 3759927 -> 3730650 (-0.78%) CodeSize: 21125784 -> 20899580 (-1.07%) VGPRs: 23096 -> 23104 (+0.03%) Latency: 35593716 -> 35315455 (-0.78%); split: -0.78%, +0.00% InvThroughput: 7353071 -> 7297309 (-0.76%); split: -0.76%, +0.00% VClause: 120983 -> 118579 (-1.99%) SClause: 113073 -> 110671 (-2.12%) Copies: 358272 -> 348686 (-2.68%) Branches: 166706 -> 159500 (-4.32%) PreSGPRs: 18598 -> 18596 (-0.01%) PreVGPRs: 21417 -> 21424 (+0.03%); split: -0.01%, +0.04% VALU: 2354862 -> 2350053 (-0.20%) SALU: 582291 -> 567638 (-2.52%) SMEM: 139875 -> 137473 (-1.72%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30509>	2024-11-11 07:53:13 +00:00
Alyssa Rosenzweig	23afe968ad	nir: add late_lower_int64 option Some drivers generally need int64 lowered, but prefer to do this lowering themselves late, to have a chance to optimize targeted int64 patterns before lowering the rest. This isn't currently possible since nir_lower_int64 takes no options except what's const* in the shader, and frontends call nir_lower_int64 before passing the shader off to the driver. Add an option to defer int64 lowering. This is a bit ugly but the alternative is replumbing nir_lower_int64's option handling cross-tree and no-thank-you-not-right-now. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31964>	2024-11-08 21:15:42 -04:00
Alyssa Rosenzweig	eaf75169ee	nir: add amul flag Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31964>	2024-11-08 21:15:42 -04:00
Marek Olšák	9d043e138d	nir: add nir_clear_divergence_info, use it in nir_opt_varyings nir_opt_varyings computes vertex divergence, which isn't exactly expected by any other passes. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31968>	2024-11-05 14:13:40 +00:00
Marek Olšák	2ca56376a4	nir: rename nir_io_glsl_lower_derefs -> nir_io_has_io_intrinsics Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31968>	2024-11-05 14:13:40 +00:00
Marek Olšák	adc40aee25	glsl: lower IO in the linker if enabled, don't lower it later This removes the useless codepath that kept IO derefs until st_finalize_nir. It was used before nir_opt_varyings existed. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31968>	2024-11-05 14:13:40 +00:00
Georg Lehmann	bedd6310dc	nir: add nir_opt_frag_coord_to_pixel_coord Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31864>	2024-11-04 12:34:31 +00:00
Alyssa Rosenzweig	b8624d5c6b	nir: correct comment Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: M Henning <drawoc@darkrefraction.com> Reviewed-by: Reviewed-by: Christian Gmeiner <cgmeiner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31892>	2024-10-30 12:59:11 +00:00
Georg Lehmann	695d2414cd	nir,radv: optimize shared atomic offsets Foz-DB Navi21: Totals from 87 (0.11% of 79395) affected shaders: Instrs: 140877 -> 140873 (-0.00%) CodeSize: 747760 -> 747164 (-0.08%); split: -0.09%, +0.01% Latency: 4528171 -> 4528162 (-0.00%) InvThroughput: 826358 -> 826349 (-0.00%) Copies: 10888 -> 10884 (-0.04%) VALU: 84634 -> 84630 (-0.00%) Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31080>	2024-10-29 09:31:08 +00:00
Daniel Schürmann	1a55d6c23b	nir/divergence: Introduce and set nir_def::loop_invariant Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30787>	2024-10-24 10:06:17 +00:00
Daniel Schürmann	c8348139fd	nir: change signature of nir_src_is_divergent() Now, it takes nir_src * instead of nir_src. Also move the implementation to nir_divergence_analysis.c. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30787>	2024-10-24 10:06:17 +00:00
Daniel Schürmann	421b42637d	nir: remove nir_update_instr_divergence() This function has obscure limitations. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30787>	2024-10-24 10:06:17 +00:00
Daniel Schürmann	c25c63ebc0	nir/divergence: separately indicate whether loops have divergent continues or breaks bool nir_loop_is_divergent(nir_loop *) replaces the previous loop->divergent indicator. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30787>	2024-10-24 10:06:17 +00:00
Marek Olšák	0226922384	nir: add nir_gather_tcs_info, new gathering/analysis pass This does shader analysis that is more niche than regular shader info. It's planned to be used by nir_restructure_tcs_flow as discussed here: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11910 It's also useful for driver-specific passes. The code for gathering "all_invocations_define_tess_levels" is copied from radeonsi. The rest is new. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31447>	2024-10-23 03:17:16 +00:00
Amber	a3afe22dc9	nir: add pass to lower atomic arithmetic to a loop with cmpxchg. Signed-off-by: Amber Harmonia <amber@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27776>	2024-10-21 21:47:44 +00:00
Mary Guillemard	84d57e1fb1	nir: Move atomic_op_to_alu to common code Signed-off-by: Mary Guillemard <mary.guillemard@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27776>	2024-10-21 21:47:44 +00:00
Georg Lehmann	dbf63a0788	nir: remove nir_op_is_derivative Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31014>	2024-10-17 09:50:19 +00:00
Connor Abbott	65c0846537	nir/lower_input_attachments: Handle unscaled input attachments with no index With VK_KHR_dynamic_rendering_local_read we can have input attachments with no index, which normally correspond to depth/stencil attachments, and we have to handle this here when determining whether we need to emit an unscaled fragcoord for FDM. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31261>	2024-10-17 00:30:44 +00:00
Connor Abbott	4bd506a7f3	spirv: Make the default input attachment index ~0 This will let us know when an input attachment doesn't have an InputAttachmentIndex, which used to be illegal but is now allowed and meaningful with VK_KHR_dynamic_rendering_local_read. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31261>	2024-10-17 00:30:44 +00:00
Marek Olšák	02923e237d	nir: add hole_size parameter into the vectorize callback It will be used to allow merging loads with a hole between them. Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29398>	2024-10-15 05:50:24 +00:00
Marek Olšák	8ce43b7765	nir/opt_load_store_vectorize: add entry::num_components We will represent vec6..vec7, vec9..vec15 loads with 8 and 16 components respectively, so we need to track how many components we really use. This is a prerequisite for optimal merging up to vec16. Example: Step 1: vec4 + vec3 ==> vec7as8 (last component unused) Step 2: vec1 + vec7as8 ==> vec8 (last unused component dropped) Without using the number of components read, the same example would end up doing: Step 1: vec4 + vec3 ==> vec8 Step 2: vec1 + vec8 ==> vec9 (fail) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29398>	2024-10-15 05:50:24 +00:00
Alyssa Rosenzweig	e9303c0952	nir: extract round component helper another nir pass will use this. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/29398>	2024-10-15 05:50:24 +00:00
Rhys Perry	67ad7359ff	nir/divergence_analysis: disable phi undef optimization by default If the backend does not implement this too, or some other future transform modifiess the phi so that this isn't the case (replace the phi with a bcsel or replace undef with zero), then it will not actually be uniform. This keeps it enabled to some degree for RADV/ACO. fossil-db (navi31): Totals from 76 (0.10% of 79395) affected shaders: Instrs: 195008 -> 195282 (+0.14%) CodeSize: 1012592 -> 1015884 (+0.33%) Latency: 3892826 -> 3898843 (+0.15%); split: -0.00%, +0.15% InvThroughput: 460681 -> 460964 (+0.06%) Copies: 13508 -> 13516 (+0.06%) Branches: 5244 -> 5412 (+3.20%) PreVGPRs: 5092 -> 5096 (+0.08%) VALU: 116177 -> 116197 (+0.02%) SALU: 23449 -> 23785 (+1.43%) fossil-db (navi21): Totals from 76 (0.10% of 79395) affected shaders: Instrs: 164471 -> 164981 (+0.31%) CodeSize: 883988 -> 888420 (+0.50%) Latency: 4074287 -> 4082043 (+0.19%) InvThroughput: 783783 -> 784276 (+0.06%); split: -0.00%, +0.06% Branches: 5262 -> 5430 (+3.19%) PreVGPRs: 5100 -> 5104 (+0.08%) VALU: 116375 -> 116381 (+0.01%) SALU: 23589 -> 23925 (+1.42%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30211>	2024-10-10 14:59:26 +00:00
Faith Ekstrand	62a4fe861a	nir: Add an option to lower quad vote Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31470>	2024-10-02 21:10:32 +00:00
Georg Lehmann	bb7e8d51b6	nir: delete nir_opt_reuse_constants Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31031>	2024-09-27 05:19:16 +00:00
Georg Lehmann	a9f8089240	nir: replace nir_opt_remove_phis_block with a single source version This is what callers actually want, and it simplifies nir_opt_remove_phis because we can assume dominance meta data is valid. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31031>	2024-09-27 05:19:16 +00:00
Timothy Arceri	60937b5286	nir: add implicit_conversion_prohibited field to nir_parameter Will be used in link time validation in following patches. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:44 +00:00
Timothy Arceri	5645495156	nir: store variable mode in nir_parameter This will be used by the nir glsl linker in following patches. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:44 +00:00
Timothy Arceri	6ff3e87e5f	nir: add function in/outs to variable modes Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:44 +00:00
Timothy Arceri	1cb115abd2	nir: add nir_function_impl_clone_remap_globals() This will be use by the glsl nir linker when we are combining different shaders from the same shader stage that might have multiple declarations of global variables across the different shaders. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:43 +00:00
Timothy Arceri	7a1061e0dd	nir: add max_ifc_array_access field to vars This will be used in following patches by the nir based glsl linker code. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:43 +00:00
Timothy Arceri	7c5b21c032	glsl: add support for converting global instructions to NIR NIR doesn't really support global instructions such as global val initilisation. So here we add functionality to glsl_to_nir() to put these instructions into a temporary function that will be later inlined into main. We give the function a name starting with gl_mesa_tmp_ as functions starting with gl_ are reserved and will not have any clashes with user functions, we finish the name with the blake3 of the shader source to avoid conflicts with multiple shaders attached to a single stage. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31137>	2024-09-25 09:39:43 +00:00
Boris Brezillon	eeb3512498	nir/lower_ssbo: Extend the load_ssbo_address intrinsic to pass an offset On Mali(Valhall), the bounds checking can be done when in hardware, but for this to work properly, we need to pass the offset to the nir_load_ssbo_address() intrinsic. Add an offset source to the intrinsic, and adjust the lowering pass to conditionally lower the offset addition. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31164>	2024-09-18 13:45:57 +00:00
Boris Brezillon	adadb097a3	nir/lower_ssbo: Add an option to conditionally lower loads On Mali(Valhall), we have a way to load SSBO data without going through an SSBO index -> global address translation, so let's provide a way to tell nir_lower_ssbo() when it shouldn't lower loads. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Eric R. Smith <eric.smith@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31164>	2024-09-18 13:45:57 +00:00
Ian Romanick	6a09d33549	nir: Add a pass to generate BFI instructions from logical operations Inspired by a commit message in !30934, I set about optimizing the code generated for nir_copysign. It would be possible to just implement an opt_algebraic pattern for the specific values used by nir_copysign, but this casts a slightly larger net. As noted in a comment in the code, there may be variations of the pattern that this pass misses. The opt_algebraic pattern would miss them too. v2: Use nir_def_replace. Suggested by Alyssa. Allow more "root" instruction types. Suggested by Georg. v3: Treat extract_u16(x, 0) as (x & 0x0000ffff), and treat extract_u8(x, 0) as (x & 0x000000ff). v4: Use nir_scalar. Suggested by Georg. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31006>	2024-09-13 00:21:00 +00:00

1 2 3 4 5 ...

1464 commits