fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-20 00:28:08 +02:00

Author	SHA1	Message	Date
Rhys Perry	37d77a12e9	nir/opt_move_discards_to_top: add more intrinsics to add_src_to_worklist fossil-db (navi21): Totals from 115 (0.14% of 79395) affected shaders: MaxWaves: 2882 -> 2886 (+0.14%); split: +0.62%, -0.49% Instrs: 71640 -> 71686 (+0.06%); split: -0.21%, +0.28% CodeSize: 395820 -> 395084 (-0.19%); split: -0.39%, +0.20% VGPRs: 5224 -> 5256 (+0.61%); split: -0.61%, +1.23% Latency: 1114025 -> 1145891 (+2.86%); split: -0.12%, +2.98% InvThroughput: 239149 -> 239028 (-0.05%); split: -0.07%, +0.02% VClause: 1289 -> 1291 (+0.16%); split: -0.62%, +0.78% SClause: 2267 -> 2203 (-2.82%); split: -5.38%, +2.56% Copies: 4359 -> 4372 (+0.30%); split: -2.18%, +2.48% Branches: 1215 -> 1225 (+0.82%) PreSGPRs: 4225 -> 4265 (+0.95%); split: -1.35%, +2.30% PreVGPRs: 4166 -> 4189 (+0.55%); split: -0.96%, +1.51% VALU: 53590 -> 53614 (+0.04%); split: -0.10%, +0.14% SALU: 6527 -> 6539 (+0.18%); split: -0.84%, +1.03% SMEM: 4120 -> 4117 (-0.07%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32145>	2024-11-21 14:50:45 +00:00
Rhys Perry	08e355a287	nir/opt_move_discards_to_top: use nir_intrinsic_can_reorder fossil-db (navi21): Totals from 2306 (2.90% of 79395) affected shaders: MaxWaves: 65920 -> 65952 (+0.05%); split: +0.22%, -0.17% Instrs: 1056765 -> 1058517 (+0.17%); split: -0.09%, +0.26% CodeSize: 5802396 -> 5808076 (+0.10%); split: -0.13%, +0.23% VGPRs: 79976 -> 79248 (-0.91%); split: -1.46%, +0.55% Latency: 17215154 -> 17527774 (+1.82%); split: -0.11%, +1.92% InvThroughput: 4911203 -> 4918838 (+0.16%); split: -0.06%, +0.22% VClause: 16214 -> 16268 (+0.33%); split: -0.44%, +0.78% SClause: 33208 -> 34167 (+2.89%); split: -1.02%, +3.91% Copies: 58352 -> 58343 (-0.02%); split: -1.20%, +1.18% Branches: 21857 -> 21863 (+0.03%); split: -0.02%, +0.05% PreSGPRs: 73666 -> 74298 (+0.86%); split: -0.82%, +1.67% PreVGPRs: 55234 -> 55720 (+0.88%); split: -0.41%, +1.29% VALU: 756386 -> 756329 (-0.01%); split: -0.06%, +0.05% SALU: 123838 -> 124320 (+0.39%); split: -0.35%, +0.74% VMEM: 25002 -> 25009 (+0.03%) SMEM: 60765 -> 60580 (-0.30%); split: -0.41%, +0.11% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32145>	2024-11-21 14:50:45 +00:00
Rhys Perry	fff3eb7848	nir/opt_move_discards_to_top: update variable name Discard doesn't exist anymore. There is only terminate. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32145>	2024-11-21 14:50:45 +00:00
Rhys Perry	eea5be2e28	nir/opt_move_discards_to_top: remove recursion This kind of recursion is unreliable with large shaders or small stack limits. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32145>	2024-11-21 14:50:45 +00:00
Rhys Perry	4c6fdb113f	nir: fix return value of nir_instr_move for some cases This fixes a potential issue where nir_opt_move_discards_to_top would always return progress. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Fixes: `f97fb1fa55` ("nir: Add a nir_instr_move helper") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32145>	2024-11-21 14:50:44 +00:00
Rhys Perry	8bbc8284d9	nir/opt_move_discards_to_top: use nir_tex_instr_has_implicit_derivative Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Fixes: `48158636bf` ("nir: add is_gather_implicit_lod") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32145>	2024-11-21 14:50:44 +00:00
Georg Lehmann	ec487d01e2	nir/opt_undef: handle unpack/pack like mov/vec Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32249>	2024-11-21 14:09:52 +00:00
Georg Lehmann	af974b5fe9	nir/opt_undef: keep undefs used by partial undef vectors Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32249>	2024-11-21 14:09:52 +00:00
Georg Lehmann	a9d3caf3bf	nir/opt_undef: use some nir helpers Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32249>	2024-11-21 14:09:52 +00:00
Georg Lehmann	6630c6d912	nir/opt_undef: replace undef in a separate pass Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32249>	2024-11-21 14:09:52 +00:00
Georg Lehmann	0776b56ad6	nir: cse terminate/demote Foz-DB Navi21: Totals from 32 (0.04% of 79206) affected shaders: MaxWaves: 984 -> 976 (-0.81%) Instrs: 7719 -> 7496 (-2.89%) CodeSize: 43220 -> 42264 (-2.21%) VGPRs: 856 -> 872 (+1.87%) Latency: 62689 -> 62453 (-0.38%); split: -0.72%, +0.34% InvThroughput: 8988 -> 8968 (-0.22%); split: -0.23%, +0.01% VClause: 248 -> 249 (+0.40%) SClause: 296 -> 293 (-1.01%) Copies: 580 -> 534 (-7.93%); split: -9.31%, +1.38% Branches: 181 -> 139 (-23.20%) PreSGPRs: 841 -> 834 (-0.83%) SALU: 1091 -> 933 (-14.48%) Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32235>	2024-11-20 23:54:04 +00:00
Georg Lehmann	a67ca0eb59	nir/instr_set: support instrs with no def Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32235>	2024-11-20 23:54:04 +00:00
Georg Lehmann	7097b705b5	nir/instr_set: replace nir_instr_get_def_def with nir_instr_def Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32235>	2024-11-20 23:54:04 +00:00
Georg Lehmann	4299809321	nir: return def for debug info in nir_instr_def Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32235>	2024-11-20 23:54:04 +00:00
Alyssa Rosenzweig	39afffe956	nir: split off some definitions for OpenCL we want some enum values on device for NIR->CL bindings. specifically, src_type/dest_type indices. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:51 +00:00
Alyssa Rosenzweig	d248618d81	nir/print: print parameter names in calls if we have them. example: call libagx_geometry_input_address %10, p %3, vtx %9, location %0 (0x0) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:51 +00:00
Alyssa Rosenzweig	6b35d7eb13	nir/print: annotate entrypoints we can have multiple in a collection of OpenCL kernels. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:51 +00:00
Alyssa Rosenzweig	eebfbf5ecd	nir/print: print function signature parameter dimensions and names if we have them. example: decl_function libagx_geometry_input_address (64 return, 64 p, 32 vtx, 32 location) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:51 +00:00
Alyssa Rosenzweig	3da8444be5	nir: add names to function parameters SPIR-V has this information. We should try to preserve it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:51 +00:00
Alyssa Rosenzweig	61862b209e	nir/opt_algebraic: optimize convert_uint_sat(ulong) I wrote this in my query copy shader, it didn't get the codegen I expected, so I investigated. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:50 +00:00
Alyssa Rosenzweig	07ba9335ae	nir/conversion_builder: avoid redundant uint->uint clamp algebraic will clean up but there's no reason to generate it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:50 +00:00
Alyssa Rosenzweig	76927a3b43	nir/lower_convert_alu_types: use intrinsics_pass Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32208>	2024-11-20 16:53:50 +00:00
Marek Olšák	25d4943481	nir: make use_interpolated_input_intrinsics a nir_lower_io parameter This will need to be set to true when the GLSL linker lowers IO, which can later be unlowered by st/mesa, and then drivers can lower it again without load_interpolated_input. Therefore, it can't be a global immutable option. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32229>	2024-11-20 02:45:37 +00:00
Marek Olšák	3affe3cb17	vc4/lower_blend: don't read non-existent channels nir_lower_texcoord_replace_late had swapped parameters in nir_undef. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	4da5b11ca9	nir: add nir_io_separate_clip_cull_distance_arrays to replace PIPE_CAP to make the flag available in NIR passes Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	23eb4f3454	nir: rename nir_io_glsl_opt_varyings to nir_io_dont_optimize and deprecate it The meaning is negated. This NIR option is deprecated and shouldn't be used. It means any IO optimizations can be disabled and it's a currently a workaround for zink, which is the only driver that asks for it by default. The original option is replaced by an environment variable for the GLSL linker. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	dacae272bf	nir: add nir_io_semantics::fb_fetch_output_coherent Lowering IO should preserve this. Freedreno needs it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	5d5a7bd221	nir/lower_two_sided_color: fix for lowered IO 1-bit input loads are illegal in NIR. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	65d32b96cf	nir/lower_fragcoord_wtrans: handle trimmed fragcoord loads Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	dcca0e590c	nir/lower_clip: rewrite find_output to handle vec2/3 and make it readable Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	45b20c8249	nir/lower_clip: fixes for lowered IO without compact arrays Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	878d23e171	nir/lower_pntc_ytransform: handle lowered IO Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Marek Olšák	18f3c92b87	nir/print: print fb_fetch_output for variables Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32173>	2024-11-19 23:48:38 +00:00
Rhys Perry	65a54b4ec4	nir/lcssa: fix premature exit of loop after rematerializing derefs If we have NIR such as: 32x4 %48 = @load_vulkan_descriptor (%47) (desc_type=SSBO) 32x4 %76 = deref_cast (tint_symbol_11 )%48 (ssbo tint_symbol_11) (ptr_stride=0, align_mul=4, align_offset=0) 32x4 %77 = deref_struct &%76->tint_symbol_10 (ssbo int) // &((tint_symbol_11 )%48)->tint_symbol_10 A single nir_rematerialize_deref_in_use_blocks() will rematerialize the deref_struct and then it's deref_cast. However, nir_foreach_instr_reverse_safe is not safe if the next iteration's instruction is removed. This can result in the instruction loop exiting and the load_vulkan_descriptor never having an LCSSA phi. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Fixes: `439e8c42cc` ("nir/lcssa: Fix rematerializing derefs") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11770 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32225>	2024-11-19 18:59:05 +00:00
Rhys Perry	327e5465fc	nir/algebraic: check bit sizes in lowered unpack(pack()) optimization Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Fixes: `894f7f4387` ("nir_opt_algebraic: Add a couple optimizations for lowered unpack(pack())") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32157>	2024-11-19 18:17:18 +00:00
Rhys Perry	ecd6ae12fb	nir/algebraic: fix iabs(ishr(iabs(a), b)) optimization iabs(a) is not positive if "a" is the minimum signed value, so this is incorrect in that case for some values of "b". Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Fixes: `2b76de9b5d` ("nir/algebraic: Add a couple optimizations for iabs and ishr") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32157>	2024-11-19 18:17:17 +00:00
Matt Turner	ba5c65f10b	nir: Get correct number of components The code wants the number of components used by the variable in the current attribute slot, not the total number of components. For e.g. a 4x3 matrix, glsl_get_components() returns 12, leading to the following error reported by AddressSanitizer: ``` Test case 'dEQP-VK.tessellation.shader_input_output.cross_invocation_per_patch_mat4x3'.. ../src/compiler/nir/nir_lower_io_to_vector.c:265:16: runtime error: index 4 out of bounds for type 'nir_variable *[4]' ``` Fixes: `5ef2b8f1f2` ("nir: Add a pass for lowering IO back to vector when possible") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32193>	2024-11-19 16:35:17 +00:00
Caterina Shablia	a5bcf566a9	nir: lower INSTANCE_{ID,INDEX} to an offset load_instance_{index,id} respectively If the hardware does not support INSTANCE_INDEX natively, it will be lowered to load_instance_id + base_instance. Otherwise, INSTANCE_ID will be lowered to load_instance_index - base_instance. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32158>	2024-11-19 09:18:47 +00:00
Caterina Shablia	b9be1f1f20	nir: introduce instance_index system value The semantics of this newly introduced system value match Vulkan's InstanceIndex exactly, and are equivalent to instance_id + base_instance. Some hardware, such as Mali Valhall or later, only provides instance id offset by base_instance. Introducing a new system value to represent this, rather than handling the mismatch when lowering to BIR lets us use NIR to eliminate redundant arithmetic that would follow from mismatched semantics, e.g. instance_id could be lowered to instance_index - base_instance, so expressions such as instance_id + base_instance would be optimized to a simple instance_index. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32158>	2024-11-19 09:18:47 +00:00
Dave Airlie	6714689613	nir/functions: force inlining for barriers. A recent algebraic opt made a function that used to inline with llvmpipe CL not inline anymore. However that function has a barrier in it. Handling barriers from inside a callstack is hard for llvmpipe coroutines, so just force functions with barriers to be inlined. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32204>	2024-11-19 12:26:28 +10:00
Karol Herbst	fa379a9495	nir/lower_cl_images: lower scalar image_loads to vec4 This will be required for supporting depth images as the rest of mesa assumes those to always return vec4. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30831>	2024-11-18 17:57:28 +00:00
Marek Olšák	899bee4af8	nir/opt_varyings: don't count the cost of the same instruction multiple times Use pass_flags to indicate whether the instruction has already been added to the total cost of the expression. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	405e9d9b74	nir/opt_varyings: implement compaction without flexible interpolation We have to honor drivers when they say that different interpolation qualifiers can't be mixed in the same vec4, indicated by nir_io_has_flexible_input_interpolation_except_flat not being set. This is a prerequisite for enabling nir_opt_varyings for all drivers. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	a7c671efc6	nir/opt_varyings: fix packing color varyings BITSET_TEST_RANGE_INSIDE_WORD uses first_bit .. last_bit, same as BITSET_RANGE, not first_bit .. size like BITFIELD_RANGE. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	f9b03cf405	nir/opt_varyings: add nir_io_compaction_rotates_color_channels This was enabled by default in nir_opt_varyings, but vc4 can't handle when shader outputs write Y but not X. Add an option for it and enable it only for the driver that benefits from it. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Marek Olšák	8518e1cfd7	nir/opt_varyings: add nir_io_always_interpolate_convergent_fs_inputs for Asahi Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32174>	2024-11-18 13:39:08 +00:00
Kenneth Graunke	95bc42af74	nir: Use load_global_constant for reorderable nir_var_mem_global access The main difference between load_global and load_global_constant is that the latter can be reordered arbitrarily. If the access being lowered is already tagged as being reorderable, then we can preserve that by using the load_global_constant intrinsics instead of load_global. This gives us more flexibility. On Intel, this lets us use the load_global_constant_uniform_block_intel intrinsic for doing convergent block loads in more cases. This nets us significant reductions in spill/fills: Borderlands 3 on Lunarlake sees spills/fills reduced by 53%. Alchemist sees a 13% reduction. Improves performance of Borderlands 3 DX12 on Intel Battlemage by around 44%. Improves Hogwarts Legacy by around 14%. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31995>	2024-11-18 12:55:47 +00:00
Danylo Piliaiev	b501cbf153	nir/nir_opt_offsets: Do not fold load/store with const offset > max When (off_const > max) there is a wrap around uint when calling try_extract_const_addition. Exit early since folding doesn't make sense in this case. Cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32118>	2024-11-14 10:22:39 +00:00
Rhys Perry	d3ae1842a2	aco,ac/nir: flag loads to use smem in NIR This pass will be re-used later. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00
Rhys Perry	7fe4f4c14c	nir_lower_mem_access_bit_sizes: support load_constant Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31904>	2024-11-13 12:59:26 +00:00

1 2 3 4 5 ...

5751 commits