NetworkManager

mirror of https://gitlab.freedesktop.org/NetworkManager/NetworkManager.git synced 2026-05-07 06:08:02 +02:00

Author	SHA1	Message	Date
Thomas Haller	09d5c4e22e	platform: fix handling the onlink route attribute for routes without gateway For IPv6, kernel doesn't care. If the gateway is ::, you may or may not set the onlink attribute. But for IPv4 routes, that gets rejected: # ip route add 1.2.3.4/32 dev v onlink Error: Invalid flags for nexthop - PERVASIVE and ONLINK can not be set. Silently suppress setting the flag in that case and ignore the user request. After all, the effect is probably the same (that is, the route is onlink anyway). (cherry picked from commit `8b14849877`)	2023-02-07 14:26:44 +01:00
Thomas Haller	ae906e42da	platform: detect EINVAL as failure to set the MTU Some drivers will reject an invalid MTU size with EINVAL. Quote from [1]: While investigating, I did notice that do_change_link in nm-linux-platform.c really ought to count -EINVAL as an MTU out-of-range error and not just -ERANGE. Even if the hardware supports a large MTU, if the transmit FIFO is set too small, stmmac_change_mtu [2] will return -EINVAL. For example, on my device, the maxmtu is 9000 but in practice I can't set an MTU larger than 4096 unless I first run ethtool --set-channels eno1 tx 3. [1] https://gitlab.freedesktop.org/NetworkManager/NetworkManager/-/issues/1198#note_1738311 [2] https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/drivers/net/ethernet/stmicro/stmmac/stmmac_main.c?h=v6.1#n5577 (cherry picked from commit `621b41ebfa`)	2023-02-01 10:50:11 +01:00
Thomas Haller	5579fca916	platform: allow setting multi_idx instance for NMPlatform The major point of NMDedupMultiIndex is that it can de-duplicate the objects. It thus makes sense the everybody is using the same instance. Make the multi-idx instance of NMPlatform configurable. This is not used outside of unit tests, because the daemon currently always creates one platform instance and everybody then re-uses the instance of the platform. While this is (currently) only used by tests, and that the performance optimization of de-duplicating is irrelevant for tests, this is still useful. The test can then check whether two separate NMPlatform objects shared the same instance and whether it was de-duplicated.	2023-01-19 08:56:21 +01:00
Thomas Haller	2c22c96235	platform: add NMP_OBJECT_TYPE_NAME() macro	2023-01-19 08:56:21 +01:00
Thomas Haller	7752b2e059	platform: abort handling routes in _rtnl_handle_msg() when resync is required There really is nothing left to do. Skip the rest and do a resync.	2023-01-19 08:56:21 +01:00
Thomas Haller	6fc0dc3fcb	platform: resync route cache upon NLM_F_REPLACE flag There really is no way around this. As we don't cache all the routes (e.g. ignored based on rtm_protocol or rtm_type), we cannot know which route was replaced, when we get a NLM_F_REPLACE message. We need to request a new dump in that case, which can be expensive, if there are a lot of routes or if replace happens frequently. The only possible solutions would be: 1) NetworkManager caches all routes, but it also needs to make sure to get everything right. In particular, to understand every relevant route attribute (including those added in the future, which is impossible). 2) kernel provides a reasonable API (rhbz#1337855, rhbz#1337860) that allows to sufficiently understand what is going on based on the netlink notifications.	2023-01-19 08:56:21 +01:00
Thomas Haller	4ec2123aa2	platform: parse routes of any type to handle replace When you issue ip route replace broadcast 1.2.3.4/32 dev eth0 then this route may well replace a (unicast) route that we have in the cache. Previously, we would right away ignore such messages in _new_from_nl_route(), which means we miss the fact that a route gets replaced. Instead, we need to parse the message at least so far, that we can detect and handle the replace.	2023-01-19 08:56:21 +01:00
Thomas Haller	854f2cc1fc	platform: better handle `ip route replace` for ignored routes We don't cache certain routes, for example based on the protocol. This is a performance optimization to ignore routes that we usually don't care about. Still, if the user does `ip route replace` with such a route, then we need to pass it to nmp_cache_update_netlink_route(), so that we can properly remove the replaced route. Knowing which route was replaces might be impossible, as our cache does not contain all routes. Likely all that nmp_cache_update_netlink_route() can to is to set "resync_required" for NLM_F_REPLACE. But for that it should see the object first. This also means, if we ever write a BPF filter to filter out messages that contain NLM_F_REPLACE, because that would lead to cache inconsistencies.	2023-01-19 08:56:21 +01:00
Thomas Haller	c64053e6e6	platform: minor cleanup in nmp_cache_update_netlink_route() It reads nicer. It will also work better with the change that follows.	2023-01-19 08:56:21 +01:00
Thomas Haller	a3cea7f6fb	platform: fix nmp_lookup_init_route_by_weak_id() to honor the route-table The route table is part of the weak-id. You can see that with: ip route replace unicast 1.2.3.4/32 dev eth0 table 57 ip route replace unicast 1.2.3.4/32 dev eth0 table 58 afterwards, `ip route show table all` will list both routes. The replace operation is only per-table. Note that NMP_CACHE_ID_TYPE_ROUTES_BY_WEAK_ID already got this right. Fixes: `10ac675299` ('platform: add support for routing tables to platform cache')	2023-01-19 08:56:21 +01:00
Thomas Haller	0d458dbf07	platform: avoid printing raw pointer values in log	2023-01-19 08:56:21 +01:00
Lubomir Rintel	38d3834e2c	merge: branch 'lr/nl-retry' https://gitlab.freedesktop.org/NetworkManager/NetworkManager/-/merge_requests/1501	2023-01-17 19:25:51 +01:00
Thomas Haller	3cd02b6ed6	libnm,platform: fix range for "weight" property of next hops for routes In kernel, the valid range for the weight is 1-256 (on netlink this is expressed as u8 in rtnh_hops, ranging 0-255). We need an additional value, to represent - unset weight, for non-ECMP routes in kernel. - in libnm API, to express routes that should not be merged as ECMP routes (the default). Extend the type in NMPlatformIP4Route.weight to u16, and fix the code for the special handling of the numeric range. Also the libnm API needs to change. Modify the type of the attribute on D-Bus from "b" to "u", to use a 32 bit integer. We use 32 bit, because we already have common code to handle 32 bit unsigned integers, despite only requiring 257 values. It seems better to stick to a few data types (u32) instead of introducing more, only because the range is limited. Co-Authored-By: Fernando Fernandez Mancera <ffmancera@riseup.net> Fixes: `1bbdecf5e1` ('platform: manage ECMP routes')	2023-01-17 14:05:13 +01:00
Lubomir Rintel	b8738002ed	platform: retry link change on RESULT_FAILED_RESYNC Sometimes the buffer space of the netlink socket runs out and we lose the response to our link change: <info> [1670321010.2952] platform-linux: netlink[rtnl]: read: too many netlink events. Need to resynchronize platform cache <warn> [1670321010.3467] platform-linux: do-change-link[2]: failure changing link: internal failure 3 With 3 above being WAIT_FOR_NL_RESPONSE_RESULT_FAILED_RESYNC. Let's try harder. https://bugzilla.redhat.com/show_bug.cgi?id=2154350	2023-01-16 12:52:40 +01:00
Lubomir Rintel	1e6fd1288d	platform: log something nice about RESULT_FAILED_RESYNC This is not nice: <warn> [1670321010.3467] platform-linux: do-change-link[2]: failure changing link: internal failure 3 Let's explain what "internal failure 3" is.	2023-01-16 08:30:35 +01:00
Lubomir Rintel	ad659de3ba	platform: remove log_result from do_change_link() It conveys no useful information beyond what wait_for_nl_response_to_string() returns.	2023-01-16 08:30:35 +01:00
Lubomir Rintel	3f6d040274	platform: don't negate lefthand argument in set comparison This 1.) was ugly, 2.) makes it cumbersome to check for both positive and negative elements in one go.	2023-01-16 08:30:35 +01:00
Beniamino Galvani	2883203df4	platform: fix NULL pointer dereference src/libnm-platform/nmp-object.c:930: var_deref_op: Dereferencing null pointer "klass->cmd_plobj_to_string_id". Fixes: `8feeb199ad` ('platform: drop redundant hook implementations from NMPObject classes')	2022-12-22 11:34:09 +01:00
Beniamino Galvani	115102efe9	platform: fix build failures due to missing VTI definitions Older kernel headers don't ship definitions for IFLA_VTI_*, redefine them. Fixes: `1cf8df2f35` ('platform: support VTI tunnels') Fixes: `b669a3ae46` ('platform: support VTI6 tunnels')	2022-12-22 09:57:28 +01:00
Beniamino Galvani	b669a3ae46	platform: support VTI6 tunnels	2022-12-21 14:04:44 +01:00
Beniamino Galvani	1cf8df2f35	platform: support VTI tunnels	2022-12-21 14:04:43 +01:00
Thomas Haller	2191e739ae	platform: fix "-Wcast-align" warning for NMPlatformQdisc cast	2022-12-16 10:55:04 +01:00
Thomas Haller	0b1177cb18	all: use _NM_G_TYPE_CHECK_INSTANCE_CAST() for internal uses G_TYPE_CHECK_INSTANCE_CAST() can trigger a "-Wcast-align": src/core/devices/nm-device-macvlan.c: In function 'parent_changed_notify': /usr/include/glib-2.0/gobject/gtype.h:2421:42: error: cast increases required alignment of target type [-Werror=cast-align] 2421 \| # define _G_TYPE_CIC(ip, gt, ct) ((ct*) ip) \| ^ /usr/include/glib-2.0/gobject/gtype.h:501:66: note: in expansion of macro '_G_TYPE_CIC' 501 \| #define G_TYPE_CHECK_INSTANCE_CAST(instance, g_type, c_type) (_G_TYPE_CIC ((instance), (g_type), c_type)) \| ^~~~~~~~~~~ src/core/devices/nm-device-macvlan.h:13:6: note: in expansion of macro 'G_TYPE_CHECK_INSTANCE_CAST' 13 \| (G_TYPE_CHECK_INSTANCE_CAST((obj), NM_TYPE_DEVICE_MACVLAN, NMDeviceMacvlan)) \| ^~~~~~~~~~~~~~~~~~~~~~~~~~ Avoid that by using _NM_G_TYPE_CHECK_INSTANCE_CAST(). This can only be done for our internal usages. The public headers of libnm are not changed.	2022-12-16 10:55:03 +01:00
Beniamino Galvani	cf11884a85	macsec: fix tracking of parent ifindex For MACsec interfaces, kernel announces the parent ifindex in the generic IFLA_LINK netlink attribute, which we save in NMPlatformLink.parent. There is no need to have a dedicate member in NMPlatformLnkMacsec. The dedicate member was never set and during a restart of NetworkManager the parent of the MACsec device could be unset leading to a failed assertion: act_stage2_config: assertion 'parent' failed Fixes: `85103656e9` ('platform: add support for macsec links') https://bugzilla.redhat.com/show_bug.cgi?id=2122564 https://gitlab.freedesktop.org/NetworkManager/NetworkManager/-/merge_requests/1481	2022-12-15 16:30:29 +01:00
Beniamino Galvani	bd24e0b274	platform: support VLAN protocol Add support for the "protocol" attribute of VLAN links.	2022-12-14 11:33:03 +01:00
Thomas Haller	052ed480a6	platform: fix "-Wcast-align" warning on i686 in nmp_object_ref() With gcc-12.2.1-4.fc37 on i686 we get: ./src/libnm-platform/nmp-object.h: In function 'nmp_object_ref': ./src/libnm-platform/nmp-object.h:626:12: error: cast increases required alignment of target type [-Werror=cast-align] 626 \| return (const NMPObject ) nm_dedup_multi_obj_ref((const NMDedupMultiObj ) obj); \| ^ cc1: all warnings being treated as errors Work around that be increasing the alignment of NMDedupMultiObj. It has no downsides, because we usually put a NMDedupMultiObj in heap allocated memory, which is already suitably aligned. Or we put it on the stack, where wasting a few bytes for the alignment doesn't matter. We basically never embed NMDedupMultiObj in an array where the increase of alignment would waste additional space.	2022-12-14 09:46:33 +01:00
Thomas Haller	36f8de25c4	all: fix various "-Wcast-align=strict" warnings The warning "-Wcast-align=strict" seems useful and will be enabled next. Fix places that currently cause the warning by using the new macro NM_CAST_ALIGN(). This macro also nm_assert()s that the alignment is correct.	2022-12-09 09:15:56 +01:00
Thomas Haller	6996fa64b6	platform: ensure all NMPlatform* structs have same alignment We put all these structs inside the tagged union NMPObject. Also, in a sense NMPlatformObject is the base "type" of all these structs, meaning, it should be able to up and downcast. Ensure the alignment matches. This helps to avoid "-Wcast-align" warnings when trying to cast a (NMPlatformObject) to another (NMPlatformXXX ) type. Something we commonly do.	2022-12-09 09:15:54 +01:00
Thomas Haller	4ae5f7f76b	platform: move "struct _NMPlatformObject" to "nmp-plobj.h" All our platform structs should move there. For now, just move struct _NMPlatformObject because it will be needed there.	2022-12-09 09:15:54 +01:00
Wen Liang	e8618f03d7	support loopback interface Support managing the loopback interface through NM as the users want to set the proper mtu for loopback interface when forwarding the packets. Additionally, the IP addresses, DNS, route and routing rules are also allowed to configure for the loopback connection profiles. https://bugzilla.redhat.com/show_bug.cgi?id=2060905	2022-11-23 20:51:22 +01:00
Thomas Haller	2afadee27f	platform: workaround build error in nm_platform_ip4_route_hash_update() with old clang clang-3.4.2-9.el7 does not like nesting NM_MAX() macro inside nm_hash_update_vals() macro. Workaround by using MAX() instead. NM_MAX() uses an expression statement and NM_UNIQ() to evaluate the arguments only once. We don't need that here and glib's MAX() suffices. CC src/libnm-platform/src_libnm_platform_libnm_platform_la-nm-platform.lo ../src/libnm-platform/nm-platform.c:8247:53: error: in-class initializer for static data member is not a constant expression (guint8) NM_MAX(obj->weight, 1u)); ^ ../src/libnm-std-aux/nm-std-aux.h:399:40: note: expanded from macro 'NM_MAX' #define NM_MAX(a, b) __NM_MAX(NM_UNIQ, a, NM_UNIQ, b) ^ ../src/libnm-std-aux/nm-std-aux.h:402:39: note: expanded from macro '__NM_MAX' typeof(a) NM_UNIQ_T(A, aq) = (a); \ ^ ../src/libnm-glib-aux/nm-hash-utils.h:124:36: note: expanded from macro 'nm_hash_update_vals' NM_HASH_COMBINE_VALS(_val, __VA_ARGS__); \ ^ Fixes: `8cc41d41fe` ('platform: add NM_PLATFORM_IP_ROUTE_CMP_TYPE_ECMP_ID for comparing ECMP base route')	2022-11-23 16:28:34 +01:00
Thomas Haller	3fb8c0f614	clang-format: reformat code with clang-format 15.0.4-1.fc37 This is the version shipped in Fedora 37. As Fedora 37 is now out, the core developers switch to it. Our gitlab-ci will also use that as base image for the check-{patch.tree} tests and to generate the pages. There is a need that everybody agrees on which clang-format version to use, and that version should be the one of the currently used Fedora release. Also update the used Fedora image in "contrib/scripts/nm-code-format-container.sh" script. The gitlab-ci still needs update in the following commit. The change in isolation will break the "check-tree" test.	2022-11-23 09:17:21 +01:00
Thomas Haller	48d7d1d78e	platform: drop inline cmp() wrappers around "full" versions We sometimes have functions foo() and foo_full(), in which case foo() has fewer arguments and just calls foo_full(). The "full" function here is the more powerful one, and foo() is implemented in terms of the former. nm_platform_ip4_route_cmp_full() and m_platform_ip4_route_cmp() inverted that pattern. The "_full" there stands for the full comparison, to not allowing to select the comparison type. That inconsistency is ugly. Also, these wrappers were used at only few places. Let's drop them. While at it, also drop nm_platform_qdisc_cmp() and rename nm_platform_qdisc_cmp_full(). Here cmp()/cmp_full() followed the common pattern foo()/foo_full(), but it's still hardly used and unnecessary.	2022-11-21 17:56:48 +01:00
Thomas Haller	8cc41d41fe	platform: add NM_PLATFORM_IP_ROUTE_CMP_TYPE_ECMP_ID for comparing ECMP base route	2022-11-21 17:46:34 +01:00
Thomas Haller	9270bf611f	platform: add nm_platform_ip4_route_hash() helper	2022-11-21 11:19:39 +01:00
Fernando Fernandez Mancera	151b2bed36	platform: pass extra_hops to ip_route_add function When adding a new route we need to consider it contains extra nexthops i.e it is a ECMP route. As we cannot modify the NMPObject once created, we need to pass the extra nexthops as an argument. We cannot use the original NMPObject because normalization is happening during when adding the route.	2022-11-21 11:19:19 +01:00
Fernando Fernandez Mancera	1bbdecf5e1	platform: manage ECMP routes When reading from netlink an ECMP IPv4 route, we need to parse the multiple nexthops. In order to do that, we are introducing NMPlatformIP4RtNextHop struct. The first nexthop information will be kept at the original NMPlatformIP4Route and the new property n_nexthops will indicate how many nexthops we need to consider.	2022-11-21 11:18:03 +01:00
Thomas Haller	57b23c12cc	platform: only initialize actual data for stackinit NMPObject The NMPObject is a tagged union. There is no need to initialize anything after the size of the actually used union field. Change this, so maybe we get a valgrind warning about uninitialized memory if we wrongly try to access it. On the other hand, the object really is supposed to be a full NMPObject. Previously, we would get a valgrind warning, if we tried to pass fewer data there. It really doesn't matter much, but all other functions don't assume that there is any important data after the size indicated by the class.	2022-11-08 12:57:24 +01:00
Thomas Haller	dd2c5044f6	platform: add internal helper function to get full NMPObject size	2022-11-08 12:54:44 +01:00
Thomas Haller	c9123c2ece	platform: extend cmd_obj_{hash_update,cmp}() hooks to check for identity We will extend IPv4 routes with the list of next hops. This field will be heap allocated and be part of the NMPObjectIP4Route object, while also being part of the identity. To support the ID operator that checks fields of the NMPObject, add a "for_id" argument to the hash/cmp hooks. Also, a function that sets cmd_obj_{hash_update,cmp}() MUST not set cmd_plobj_id_{hashupdate,cmp}(), as it would have overlapping functionality. Therefore, the objects that define cmd_obj_{hash_update,cmp}() need to fully implement the ID comparison.	2022-11-08 12:54:44 +01:00
Thomas Haller	ff63b2eb6e	platform: unify full/id hash/cmp implementations for NMPObject	2022-11-08 12:54:44 +01:00
Thomas Haller	5da0d18fbe	platform/tests: add unit test checking consistency of NMPClass	2022-11-08 12:54:35 +01:00
Thomas Haller	8feeb199ad	platform: drop redundant hook implementations from NMPObject classes A NMPClass that has data outside the plobj part, needs to implement the cmd_obj_() hooks, instead of cmd_plobj_(). For those objects, reasoning only about the plobj part is not sufficient. Implementing both hooks is also unnecessary and confusing. Ensure that if we have cmd_obj_() hooks set, that the corresponding cmd_plobj_() hooks are unset.	2022-11-08 12:53:46 +01:00
Thomas Haller	ee34eeafb9	platform: fix nmp_object_copy(id_only) for object that don't implement cmd_plobj_id_copy() The if-else-if was wrong. It meant that if an object did not implement cmd_plobj_id_copy(), nothign was copied (for id-only). I think this code path was not actually hit, because we never clone an object only by ID. Fixes: `c91a4617a1` ('nmp-object: allow missing implementations for certain virtual functions')	2022-11-08 12:53:41 +01:00
Beniamino Galvani	9feffe7ad4	platform: detect dadfailed IPv6 addresses during pruning If an address is removed during pruning and it had the TENTATIVE flag before, the most likely cause of the removal is that it failed DAD. It could also be that the user removed it at the same time we needed to resync the platform cache, but that seems more unlikely.	2022-10-26 08:54:29 +02:00
Beniamino Galvani	3f84ee27a0	platform: add mechanism to report removed IPv6 addresses that failed DAD	2022-10-26 08:54:29 +02:00
Thomas Haller	ad7d5887cd	all: cleanup close() handling and clarify nm_close()/nm_close_with_error() Cleanup the handling of close(). First of all, closing an invalid (non-negative) file descriptor (EBADF) is always a serious bug. We want to catch that. Hence, we should use nm_close() (or nm_close_with_error()) which asserts against such bugs. Don't ever use close() directly, to get that additional assertion. Also, our nm_close() handles EINTR internally and correctly. Recent POSIX defines that on EINTR the close should be retried. On Linux, that is never correct. After close() returns, the file descriptor is always closed (or invalid). nm_close() gets this right, and pretends that EINTR is a success (without retrying). The majority of our file descriptors are sockets, etc. That means, often an error from close isn't something that we want to handle. Adjust nm_close() to return no error and preserve the caller's errno. That is the appropriate reaction to error (ignoring it) in most of our cases. And error from close may mean that there was an IO error (except EINTR and EBADF). In a few cases, we may want to handle that. For those cases we have nm_close_with_error(). TL;DR: use almost always nm_close(). Unless you want to handle the error code, then use nm_close_with_error(). Never use close() directly. There is much reading on the internet about handling errors of close and in particular EINTR. See the following links: https://lwn.net/Articles/576478/ https://askcodes.net/coding/what-to-do-if-a-posix-close-call-fails- https://www.austingroupbugs.net/view.php?id=529 https://sourceware.org/bugzilla/show_bug.cgi?id=14627 https://news.ycombinator.com/item?id=3363819 https://peps.python.org/pep-0475/	2022-10-25 13:12:48 +02:00
Beniamino Galvani	f7ac887502	platform: set custom netlink buffer size when adding SR-IOV VFs When there are many VFs the default buffer size of 1 memory page is not enough. Each VF can take up to ~120 bytes and so when the page size is 4KiB at most ~34 VFs can be added. Specify the buffer size when allocating the message.	2022-10-17 10:30:44 +02:00
Beniamino Galvani	a4767ad771	platform: add length argument to _nl_msg_new_link_full() Add a new argument to specify the netlink buffer length.	2022-10-17 10:30:44 +02:00
Beniamino Galvani	f12d96f0fa	platform: change nlmsg_alloc*() functions Add a len argument to nlmsg_alloc() and nlmsg_alloc_simple(). After that, nlmsg_alloc_size() can be dropped. Also, rename nlmsg_alloc_simple() to nlmsg_alloc_new().	2022-10-17 10:30:44 +02:00

1 2 3 4 5 ...

438 commits