fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-02-26 07:00:31 +01:00

Author	SHA1	Message	Date
Qiang Yu	9a6416b374	nir,ac/llvm,radv: add stream id index to nir_load_ring_gsvs_amd For used by legacy GS to store output to different ring according to stream id. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20158>	2022-12-13 11:43:45 +08:00
Qiang Yu	796a150196	nir: add nir_load_ring_gs2vs_offset_amd Used by legacy GS output lowering. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20158>	2022-12-13 11:42:33 +08:00
Qiang Yu	fd240f759f	nir,radv,radeonsi: add nir_atomic_add_gs_invocation_count_amd For shader query emulation. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20156>	2022-12-13 01:26:42 +00:00
Timothy Arceri	9e9b8dc7f8	glsl: fix function inlining for images Here we skip replacing parameters with their actual values for images as glsl_to_nir() expects them to be copied to temps first. Tree grafting has a similiar rule to avoid this happening also. Fixes: `8d10a6835f` ("glsl: dont create temps for builtin function inputs") Tested-by: Martin Roukala <martin.roukala@mupuf.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20274>	2022-12-12 21:28:44 +00:00
Konstantin Seurer	7a994d92ff	spirv: Add a debug option to force non uniform texture sampling Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20243>	2022-12-12 18:18:32 +00:00
Friedrich Vock	e20564cfdb	nir/lower_shader_calls: Remove phis after dead control flow This potentially gets rid of some more phis without sources. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19960>	2022-12-11 22:13:32 +00:00
Friedrich Vock	a54c2c8289	nir: Do not consider phis with incompatible dests equal CSE tries to collapse equal instructions, and collapsing two phis with incompatible dests is illegal. Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `6bdce55c` ("nir: Add a basic CSE pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19960>	2022-12-11 22:13:32 +00:00
Rhys Perry	907fbf22dd	nir/gather_info: use nir_ssa_scalar_resolved This lets us skip copies. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rhys Perry	085828ea4d	vtn: add mesh output and task_payload to vtn_mode_is_cross_invocation This fixes a potential race condition, and removes output loads (which should not exist in the EXT_mesh_shader). Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7391 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Rhys Perry	e1f5100311	nir: add task_payload and shader_out to nir_var_vec_indexable_modes Since these can be cross-invocation, we need this to write individual components without race conditions or loads. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7391 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19597>	2022-12-09 20:56:52 +00:00
Georg Lehmann	4dff3ff005	nir/opt_algebraic: Optimize open coded bfm. Foz-DB Navi21: Totals from 1553 (1.15% of 134913) affected shaders: SpillVGPRs: 2246 -> 2223 (-1.02%); split: -1.42%, +0.40% CodeSize: 10409156 -> 10410720 (+0.02%); split: -0.03%, +0.04% Instrs: 1899725 -> 1898773 (-0.05%); split: -0.07%, +0.02% Latency: 71225814 -> 71118314 (-0.15%); split: -0.21%, +0.06% InvThroughput: 13384926 -> 13330369 (-0.41%); split: -0.47%, +0.06% VClause: 38309 -> 38284 (-0.07%); split: -0.17%, +0.11% SClause: 70743 -> 70706 (-0.05%) Copies: 167296 -> 167230 (-0.04%); split: -0.28%, +0.24% Branches: 42446 -> 42444 (-0.00%); split: -0.01%, +0.00% PreVGPRs: 95191 -> 95188 (-0.00%) Some minor instructions count regressions in parallel-rdp because v_bfm_b32 can't use SDWA, but overall an improvement. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18887>	2022-12-09 14:59:16 +00:00
Konstantin Seurer	36125598c8	nir: Add intrinsics for hit attribute io Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19866>	2022-12-09 07:07:10 +00:00
Konstantin Seurer	5bfc4c293f	nir/split_vars: Handle ray hit attributes Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19866>	2022-12-09 07:07:10 +00:00
Timothy Arceri	8d10a6835f	glsl: dont create temps for builtin function inputs It's not valid to be copying input variables to temps when inlining atomic memory, interpolateAt functions, etc. We got away with this previously because tree grafting would clean up the mess but we shouldn't depend on an optimisation to clean up invalid IR. Also I hope to remove tree grafting in a follow up merge request. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19890>	2022-12-08 05:22:27 +00:00
Timothy Arceri	7b9ec592aa	glsl: use ir_rvalue_visitor for function inlining This allows us to drop some duplicate code that is already in the ir_rvalue_visitor. It also allows us to better replace rvalues and handle swizzle in the following patch without having to add even more duplicate code. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19890>	2022-12-08 05:22:27 +00:00
Mihai Preda	613e9b8e7a	nir: fix digit order in print_bitset() Also fix the leading curly for the new function definitions. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	0320dbaff5	nir: print shader_info bools with the value Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	da2d36a9d5	nir: print shader_info inputs/outputs as bit ranges e.g. inputs_read: 15-17 outputs_written: 0,32 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	e9f3f80b1d	nir: print_shader_info(): brief output Make the shader_info printing less verbose by skipping the fields that are likely not used (being zero). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	814ba7d13d	nir: print_shader_info: print stage-specific shader info Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	37b7233c15	nir: print_shader_info() print bitsets Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	4ed85c16f9	nir: print more in print_shader_info() Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Mihai Preda	185e65f0f5	nir: extract print_shader_info() from nir_print_shader_annotated() This is a refactoring, it is not supposed to change the printed output. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19570>	2022-12-07 12:59:33 +00:00
Jason Ekstrand	9d43aebcad	nir: Use nir_component_mask_t for nir_alu_dst::write_mask Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20193>	2022-12-06 18:37:19 -06:00
Konstantin Seurer	91ed8fb13a	nir: Add missing includes Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14831>	2022-12-06 20:17:58 +00:00
Marcin Ślusarz	ffefa386fd	nir/lower_task_shader: fix task payload corruption when shared memory workaround is enabled We were not taking into account that when all invocations within workgroup are active, we'll copy more data than needed, corrupting task payload of other workgroups. Fixes: `8aff8d3dd4` ("nir: Add common task shader lowering to make the backend's job easier.") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20080>	2022-12-06 16:31:11 +00:00
Rhys Perry	9b7217d12e	nir/range_analysis: unsigned upper bound analysis for b2i fossil-db (navi21): Totals from 93 (0.07% of 135636) affected shaders: Instrs: 133949 -> 133899 (-0.04%); split: -0.05%, +0.01% CodeSize: 708124 -> 707528 (-0.08%); split: -0.09%, +0.01% Latency: 2451564 -> 2450158 (-0.06%); split: -0.06%, +0.00% InvThroughput: 398282 -> 397345 (-0.24%) SClause: 4441 -> 4437 (-0.09%); split: -0.18%, +0.09% Copies: 7578 -> 7546 (-0.42%); split: -0.55%, +0.13% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20117>	2022-12-06 15:23:38 +00:00
Chia-I Wu	7244d88516	nir: fix nir_link_varying_precision link_varyings ignores precisions and can assign the same location to variables with different precisions. nir_link_varying_precision should check location_frac as well. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20113>	2022-12-06 02:00:36 +00:00
Jason Ekstrand	e6de164e03	nir: Use nir_const_value_for_int in nir_lower_subgroups Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7670 Fixes: `e4e79de2a4` ("nir/subgroups: Support > 1 ballot components") Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19689>	2022-12-02 23:12:30 +00:00
Jesse Natalie	d4c70e483d	compiler: Handle nested arrays correctly for computing CL size/alignment Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20111>	2022-12-02 09:52:44 -08:00
Karol Herbst	e22491c832	clc: fetch clang resource dir at runtime Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19617>	2022-12-02 15:38:44 +00:00
Karol Herbst	cd2609b12c	clc: generate sources only with with_microsoft_clc Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19617>	2022-12-02 15:38:44 +00:00
Danylo Piliaiev	5d025f4003	nir/nir_opt_offsets: Prevent offsets going above max In try_fold_load_store when trying to extract const addition from non-const offset source, we should take into account that there is already a constant base offset, which should count towards the limit. The issue was found in "Monster Hunter: World" running on Turnip. Fixes: `cac6f633b2` ("nir/opt_offsets: Use nir_ssa_scalar to chase offset additions.") Well, the issue was present before this commit but it made a lot of changes in surrounding code. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20099>	2022-12-02 15:04:52 +00:00
Qiang Yu	bb837bf6ef	nir,ac/llvm: add nir_buffer_atomic_add_amd Used by radeonsi for lower nir_atomic_add_gen/xfb_prim_count_amd. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18010>	2022-12-02 07:34:31 +00:00
Qiang Yu	8030fbcf16	nir,ac/llvm: add nir_load_smem_buffer_amd Used by radeonsi to load const buffer. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18010>	2022-12-02 07:34:31 +00:00
Qiang Yu	73ea7d651a	ac/llvm: nir_load_smem_amd support 32bit base address For radeonsi which use 32bit address in ac_build_load_to_sgpr(). Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18010>	2022-12-02 07:34:31 +00:00
Alyssa Rosenzweig	0af08acca5	nir: Add intrinsics for lowering UBOs/VBOs on AGX We'll use formatted loads and some system values to lower UBOs and VBOs to global memory in NIR, using the AGX-specific format support and addressing arithmetic to optimize the emitted code. Add the intrinsics and teach nir_opt_preamble how to move them so we don't regress UBO pushing. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19996>	2022-12-02 06:25:20 +00:00
Eric Engestrom	8140eca23b	meson: replace deprecated meson.get_cross_property(...) with meson.get_external_property(...) According to the deprecation note: > It's a pure subset of meson.get_external_property, and works strangely > in host == build configurations, since it would be more accurately > described as get_host_property. Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19904>	2022-12-01 22:09:55 +00:00
Marcin Ślusarz	f6adfd6278	nir/lower_task_shader: allow offsetting of the start of payload We need this, because on Intel task payload starts with private header, followed by user-accessible data. Fixes: `37e78803d7` ("intel/compiler: use nir_lower_task_shader pass") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19409>	2022-12-01 11:19:47 +00:00
Jason Ekstrand	4fb33124c3	nir/divergence: Handle base_workgroup_id and workgrpu_id_zero_base Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00
Jason Ekstrand	0531630658	nir/builder: Also short-circuit for auto-generated nir_t2t<N>() This makes nir_i2i32(b, x) behave exactly like nir_i2iN(b, x, 32) etc. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7787 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	e67e2293fa	nir/builder: Rework the boolean conversion helpers Move them up to where the other conversion helpers. For nir_b2<T>(), suffix them with N like all the others and make them use nir_type_convert() as well. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	d9a24632d3	nir/builder: Drop nir_i2i and nir_u2u in favor of nir_x2xN Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	ccf19e0956	nir/builder: Move conversions higher in nir_builder.h Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	9a225415e3	nir/builder: Short-circuit in nir_type_convert if no conversion happens If both types are the same or both are integer types with the same bit size, no actual conversion happens and nir_type_conversion_op() will return nir_op_mov. In this case, there's no point in emitting the move and we can just return src instead. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	c5fbcab803	nir/builder: Fix indentation of nir_type_convert Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Jason Ekstrand	8a406fe055	nir: Fix builder usage in lower_mediump_vars() In our handling of load_deref, we were calling builder helpers to create conversions and then adjusting the destination bit size of the load. We should adjust the bit size first because the builder sometimes looks at the bit sizes of SSA values passed in as arguments. Even though it's not strictly necessary, adjust the store_deref case as well to make it fully symmetric with the load_deref case. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20067>	2022-12-01 01:10:12 +00:00
Erik Faye-Lund	d0342e28b3	nir: Add helper to create passthrough GS shader Based on nir_create_passthrough_tcs and d3d12_make_passthrough_gs, this creates a passthrough geometry shader that can be used by drivers that needs to emulate some graphics features in the geometry shader. Reviewed-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19987>	2022-11-30 08:08:25 +00:00
Lionel Landwerlin	9d0560fe87	nir/lower_shader_calls: enable vectorizer We cannot fully use the vectorizer outside of this pass because once stack load/store operations have been lower to global load/store, the robustness rule applies to those as they would to application load/store. But this is all internal and we know it doesn't require out of bound checking. So doing the vectorizing here is the best solution. We just have to teach the vectorizer about our intrinsics. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20058>	2022-11-30 07:23:30 +00:00
Lionel Landwerlin	9c76cda7f0	nir/lower_shader_calls: add a pass to split load/store into scalars We'll run this pass prior to opt_load_store_vectorize to maximize the effect of the optimization. At the moment opt_load_store_vectorize is unable to pack this : store vec3 store vec3 store vec2 into this : store vec4 store vec3 If your backend can only do vec4 stores max. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20058>	2022-11-30 07:23:30 +00:00

1 2 3 4 5 ...

7492 commits