j3d1/tinc - Neuland Labor e.V.: Git

j3d1/tinc

Author	SHA1	Message	Date
Florian Klink	91355b9ac5	(read\|append)_config_file: log open errors as LOG_DEBUG In a "decentrally managed vpn" it is very likely that host config files for some reachable nodes do not exist. Currently, tinc fills the logs with "Cannot open config file" messages. This commit changes the log level to LOG_DEBUG so syslog doesn't get filled by default.	2015-07-02 21:22:47 +02:00
Etienne Dechamps	ebffa40aa7	Protect against callbacks removing items from the io tree. The definition of the splay_each() macro is somewhat complicated for syntactic reasons. Here's what it does in a more readable way: for (splay_node_t* node = tree->head; node;) { type* item = node->data; splay_node_t* next = node->next; // RUN USER BLOCK with (item) node = next; } list_each() works in the same way. Since node->next is saved before the user block runs, this construct supports removing the current item from within the user block. However, what it does not support is removing other items from within the user block, especially the next item. Indeed, that will invalidate the next pointer in the above loop and therefore result in an invalid pointer dereference. Unfortunately, there is at least one code path where that unsupported operation happens. It is located in ack_h(), where the authentication protocol code detects a double connection (i.e. being connected to another node twice). Running in the context of a socket read event, this code will happily terminate the other metaconnection, resulting in its socket being removed from the io tree. If, by misfortune, this other metaconnection happened to have the next socket FD number (which is quite possible due to FD reuse - albeit unlikely), and was part of the io tree (which is quite likely because if that connection is stuck, it will most likely have pending writes) then this will result in the next pending io item being destroyed. Invalid pointer dereference ensues. I did a quick audit of other uses of splay_each() and list_each() and I believe this is the only scenario in which this "next pointer invalidation" problem can occur in practice. While this bug has been there since at least `6bc5d626a8` (November 2012), if not sooner, it happens quite rarely due to the very specific set of conditions required to trigger it. Nevertheless, it does manage to crash my central production nodes every other week or so.	2015-06-20 14:09:00 +01:00
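The "next pointer invalidation" problem described in this message can be reproduced on a plain linked list. The sketch below is not tinc's code: `list_t`, `list_visit()` and the generation counter are hypothetical names, and the counter is only one possible way to guard such a loop. Every removal bumps the counter, and the loop stops as soon as it sees a change instead of following a saved `next` pointer that may have been freed.

```c
#include <assert.h>
#include <stdlib.h>

typedef struct node {
	struct node *next;
	int value;
} node_t;

typedef struct list {
	node_t *head;
	unsigned int generation; /* bumped every time an item is removed */
} list_t;

/* Insert a new item at the head of the list. */
static void list_push(list_t *l, int value) {
	assert(l);
	node_t *n = malloc(sizeof(*n));

	if(!n)
		abort();

	n->value = value;
	n->next = l->head;
	l->head = n;
}

/* Remove the first item holding the given value, if any. */
static void list_remove(list_t *l, int value) {
	for(node_t **p = &l->head; *p; p = &(*p)->next) {
		if((*p)->value == value) {
			node_t *dead = *p;
			*p = dead->next;
			free(dead);
			l->generation++;
			return;
		}
	}
}

typedef void (*visit_fn)(list_t *l, int value);

/* Run cb on each item and return how many items were visited in this pass.
   The pass ends early if cb removed anything, because the saved next
   pointer can no longer be trusted. */
static int list_visit(list_t *l, visit_fn cb) {
	unsigned int gen = l->generation;
	int visited = 0;

	for(node_t *n = l->head; n;) {
		node_t *next = n->next; /* saved first, so removing n itself is fine */
		cb(l, n->value);
		visited++;

		if(gen != l->generation)
			break; /* something was removed; next may be dangling */

		n = next;
	}

	return visited;
}

/* Example callback: while handling item 1, remove item 2 (the next one). */
static void remove_two_cb(list_t *l, int value) {
	if(value == 1)
		list_remove(l, 2);
}

/* Example callback that leaves the list alone. */
static void noop_cb(list_t *l, int value) {
	(void)l;
	(void)value;
}
```

With the items 1, 2, 3 in the list, a pass using `remove_two_cb` stops after the first item, and the remaining item is picked up by the next pass. In an event loop that trade-off is usually acceptable, since pending events are simply handled on the following iteration.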
Dato Simó	7f020cf456	Fix typo in tinc.texi.	2015-06-16 20:53:16 -03:00
Guus Sliepen	45a46f068c	Fix crash in sptps_logger(). Unfortunately, sptps_logger() cannot know if s->handle is pointing to a connection_t or a node_t. But it needs to print name and hostname in both cases. So make sure both types have name and hostname fields at the start with the same offset.	2015-06-10 23:42:17 +02:00
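A minimal sketch of the layout idea in this message, under stated assumptions. The struct definitions below are simplified stand-ins, not tinc's real `connection_t` and `node_t`, and `peer_id_t` and `format_peer()` are hypothetical. The sketch also uses a stricter variant of "same fields at the same offset": both structs embed a shared header as their first member, which C guarantees sits at offset 0.

```c
#include <stddef.h>
#include <stdio.h>

/* Shared header; hypothetical, used only for this illustration. */
typedef struct peer_id {
	const char *name;
	const char *hostname;
} peer_id_t;

/* Simplified stand-ins for the two handle types. */
typedef struct connection {
	peer_id_t id; /* must stay the first member */
	int socket;
} connection_t;

typedef struct node {
	peer_id_t id; /* must stay the first member */
	int mtu;
} node_t;

/* Fail the build if someone moves the header away from offset 0. */
_Static_assert(offsetof(connection_t, id) == 0, "id must come first in connection_t");
_Static_assert(offsetof(node_t, id) == 0, "id must come first in node_t");

/* The logger only receives an opaque handle, but can still read the header. */
static int format_peer(const void *handle, char *buf, size_t len) {
	const peer_id_t *id = handle;
	return snprintf(buf, len, "%s (%s)", id->name, id->hostname);
}
```

Passing either a `connection_t *` or a `node_t *` to `format_peer()` yields the same "name (hostname)" text, which is all a logging callback needs.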
Guus Sliepen	bfe231b977	Fix alignment of output of sptps_speed.	2015-06-07 23:20:14 +02:00
Guus Sliepen	a797b4a192	Fix receiving SPTPS data in sptps_speed and sptps_test. The sptps_receive_data() was changed in commit `d237efd` to only process one SPTPS record from a stream input. So now we have to put a loop around it to ensure we process everything.	2015-06-07 23:17:54 +02:00
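The loop this message mentions can be sketched as follows. `receive_one()` is a hypothetical stand-in for a receiver that consumes at most one record per call and returns the number of bytes it used. The record format here (one length byte followed by the payload) is invented for the example and is not the SPTPS wire format.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical receiver: handles at most one record per call.
   Returns the number of bytes consumed, or 0 on error or truncation. */
static size_t receive_one(const unsigned char *data, size_t len, int *records) {
	if(len < 1)
		return 0;

	size_t reclen = 1 + (size_t)data[0]; /* length byte plus payload */

	if(len < reclen)
		return 0;

	(*records)++;
	return reclen;
}

/* Keep calling the receiver until the whole buffer has been processed. */
static bool receive_all(const unsigned char *data, size_t len, int *records) {
	while(len) {
		size_t done = receive_one(data, len, records);

		if(!done)
			return false;

		data += done;
		len -= done;
	}

	return true;
}
```

A caller that invokes `receive_one()` only once silently drops every record after the first one in the buffer, which is the kind of breakage the commit describes.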
Guus Sliepen	d8d1ab4ee1	Fix warnings about missing return value checks. In some harmless places, checks for the return value of ECDSA and RSA key generation and verification were omitted. Add them to keep the compiler happy and to warn end users in case something is wrong.	2015-06-07 22:50:05 +02:00
Guus Sliepen	ab0576a203	Fix autoconf check for function attributes. GCC warns when a function attribute has no effect. The autoconf check turns warnings about attributes into errors, therefore thinking that they did not work. The reason was that the test function returned void, which is not suitable for checking both __malloc__ and __warn_unused_result__.	2015-06-07 22:25:22 +02:00
Guus Sliepen	84ecc972e5	Fix missing return value caused by the previous commit.	2015-05-31 23:51:39 +02:00
Etienne Dechamps	eca357ed91	Don't try to relay packets to unreachable nodes. It is not unusual for tinc to receive SPTPS packets to be relayed to nodes that just became unreachable, due to state propagation delays in the metagraph. Unfortunately, the current code doesn't handle that situation correctly, and still tries to relay the packet to the unreachable node. This typically ends up segfaulting. This commit fixes the issue by checking for reachability before relaying the packet.	2015-05-31 20:19:48 +01:00
Etienne Dechamps	9e3adef5cb	Fix invalid pointer use in get_my_hostname(). clang-3.7 warnings surfaced an actual bug: invitation.c:185:5: error: address of array 'filename' will always evaluate to 'true' [-Werror,-Wpointer-bool-conversion] if(filename) { ~~ ^~~~~~~~ The regression was introduced in `3ccdf50beb`.	2015-05-24 09:49:16 +01:00
Etienne Dechamps	7fcfbe2bd2	Fix wrong format string type in send_sptps_tcppacket(). This issue was found through a clang-3.7 warning: protocol_misc.c:167:46: error: format specifies type 'short' but the argument has type 'int' [-Werror,-Wformat] if(!send_request(c, "%d %hd", SPTPS_PACKET, len)) ~~~ ^~~ %d	2015-05-24 09:45:09 +01:00
Etienne Dechamps	3e61c7233b	Don't set up an ongoing connection to myself. It is entirely possible that the configuration file could contain a ConnectTo statement referring to its own name; that's a reasonable scenario when one deploys semi-automatically generated tinc.conf files. Amusingly, tinc does not like that at all, and actually sets up an outgoing_t structure to myself (which obviously makes no sense). This is mostly benign, though it does result in nonsensical "Already connected to myself" messages every retry interval. However, that also makes things blow up in close_network_connections(), because there we delete the entire outgoing list and then the myself node, which still has a reference to the freshly deleted outgoing structure. Boom.	2015-05-23 17:33:32 +01:00
Etienne Dechamps	8587e8c0d9	Fix crashes when trying unreachable nodes. timeout_handler() calls try_tx(c->node) when c->edge exists. Unfortunately, the existence of c->edge is not enough to conclude that the node is reachable. In fact, during connection establishment, there is a short period of time where we create an edge for the node at the other end of the metaconnection, but we don't have one from the other side yet. Unfortunately, if timeout_handler() runs during that short time window, it will call try_tx() on an unreachable node, which makes things explode because that function is not prepared to handle that case. A typical symptom of this race condition is a hard SEGFAULT while trying to send packets using metaconnections that don't exist, due to n->nexthop containing garbage. This patch fixes the issue by making try_tx() check for reachability, and then making all code paths use try_tx() instead of the more specialized methods so that they go through the check. This regression was introduced in `eb7a0db18e`.	2015-05-23 10:24:00 +01:00
Guus Sliepen	537a936671	Update copyright notices.	2015-05-21 11:09:01 +02:00
Guus Sliepen	0a786ffbb9	Set the CLOEXEC flag on the umbilical socket.	2015-05-21 11:06:38 +02:00
Guus Sliepen	87e0952773	Use socketpair() instead of pipe() for the umbilical. This prepares for a possible conversion of the umbilical socket to a control socket.	2015-05-20 21:28:54 +02:00
Guus Sliepen	19e0d449eb	Don't write log messages to the umbilical pipe if we don't detach. If we run in the foreground and are started by the CLI, this would otherwise cause the first few log messages to appear twice.	2015-05-20 21:25:06 +02:00
Guus Sliepen	11868b890d	Ensure "tinc start" knows if the daemon really started successfully. We do this by creating an umbilical between the CLI and the daemon. The daemon pipes log messages to the CLI until it starts the main loop. The daemon then cuts the umbilical. The CLI copies all the received log messages to stderr, and the last byte indicates whether the daemon started successfully or not, so the CLI can exit with a useful exit code.	2015-05-20 16:59:43 +02:00
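The umbilical mechanism can be illustrated with a small POSIX sketch. This is not tinc's implementation: `start_with_umbilical()` is a hypothetical function, the child process merely pretends to be a daemon, and the status byte values (0 for success, 1 for failure) are assumptions made for the example. The parent collects everything the child writes and treats the last byte before EOF as the status.

```c
#include <stdbool.h>
#include <string.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Fork a pretend daemon and read its startup log over a socket pair.
   Returns 0 if the child reported success, 1 if it reported failure,
   and -1 if the umbilical could not be set up or carried no data. */
static int start_with_umbilical(bool child_succeeds, char *log, size_t logsize) {
	int fds[2];

	if(socketpair(AF_UNIX, SOCK_STREAM, 0, fds) < 0)
		return -1;

	pid_t pid = fork();

	if(pid < 0) {
		close(fds[0]);
		close(fds[1]);
		return -1;
	}

	if(pid == 0) {
		/* Child: send a log line, then one status byte, then cut the cord. */
		close(fds[0]);
		const char *msg = "starting up\n";
		char status = child_succeeds ? 0 : 1; /* assumed encoding */

		if(write(fds[1], msg, strlen(msg)) < 0 || write(fds[1], &status, 1) < 0)
			_exit(1);

		close(fds[1]);
		_exit(0);
	}

	/* Parent: read until EOF. The most recent byte is held back in
	   `pending`, so at EOF it holds the status rather than log text. */
	close(fds[1]);
	char buf[256];
	size_t used = 0;
	int pending = -1;
	ssize_t n;

	while((n = read(fds[0], buf, sizeof(buf))) > 0) {
		for(ssize_t i = 0; i < n; i++) {
			if(pending >= 0 && used + 1 < logsize)
				log[used++] = (char)pending;

			pending = (unsigned char)buf[i];
		}
	}

	if(logsize)
		log[used] = '\0';

	close(fds[0]);
	waitpid(pid, NULL, 0);

	if(pending < 0)
		return -1;

	return pending == 0 ? 0 : 1;
}
```

A CLI built this way could print the collected log text to stderr and use the return value as its exit code.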
Guus Sliepen	7f96ef081d	Fix check for LOCALSTATEDIR accessibility for the CLI. The CLI does not need write access to the directory where the PID file is stored, it just needs to be able to read the PID file.	2015-05-20 11:11:12 +02:00
Guus Sliepen	3ccdf50beb	Allocate temporary filenames on the stack. This gets rid of xasprintf() in a number of places, and removes the need to free() the temporary strings. A few potential memory leaks have been fixed.	2015-05-20 00:58:00 +02:00
Guus Sliepen	58e8f598f3	Allow dumping a list of outstanding invitations. This dumps the name of the invitation file, as well as the name of the node that is being invited. This can make it easier to find the invitation file belonging to a given node.	2015-05-20 00:12:01 +02:00
Guus Sliepen	7c8f54cdb2	Add "list" as an alias for "dump" in the CLI.	2015-05-20 00:02:53 +02:00
Guus Sliepen	69ba5f621e	Quit with an error message if ioctl(TUNSETIFF) fails. It is possible that opening /dev/net/tun works but that interface creation itself fails, for example if a non-root user tries to create a new interface, or if the desired interface is already opened by another process. In this case, the ioctl() fails, but we actually silently ignored this condition.	2015-05-19 22:26:32 +02:00
Guus Sliepen	60fbdb3f2c	If LOCALSTATEDIR is inaccessible, store the pid and socket files in the configuration directory. The compile time local state directory is usually /var or /usr/local/var. If this is not accessible for some reason, for example because someone ./configured tinc without --localstatedir and /usr/local/var does not exist, or if tinc is started by a non-root user, then tinc will fall back to the directory where tinc.conf is stored. A warning is logged when this happens.	2015-05-19 22:17:18 +02:00
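The fallback rule in this message can be sketched with the POSIX `access()` call. `choose_state_dir()` and its parameters are hypothetical and only illustrate the decision; how tinc actually performs the check is not shown in this log.

```c
#include <stdio.h>
#include <unistd.h>

/* Return `preferred` if it is accessible with the requested mode
   (for example R_OK | W_OK | X_OK), otherwise warn and return `fallback`. */
static const char *choose_state_dir(const char *preferred, const char *fallback, int mode) {
	if(access(preferred, mode) == 0)
		return preferred;

	fprintf(stderr, "Warning: %s is not accessible, using %s instead\n", preferred, fallback);
	return fallback;
}
```

Making the mode a parameter lets a caller that only needs to read the PID file ask for less than a caller that has to create it.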
Guus Sliepen	dece2db78e	Don't log seqno failures in sptps_verify_datagram(). This function is not used for normal traffic, only when a packet from an unknown source is received and we need to check against candidates. No failures should be logged in this case; if the packet is really not valid this will be logged by handle_incoming_vpn_data().	2015-05-19 21:32:30 +02:00
Guus Sliepen	a752211801	Add source of SPTPS errors to log messages.	2015-05-19 21:23:35 +02:00
Guus Sliepen	d89f37eb17	Add newline at end of precomp_data.h and sc.h.	2015-05-19 14:25:20 +02:00
Guus Sliepen	d8a3a182de	Fix src/Makefile.am for *BSD. Apparently the BSDs don't like $(srcdir) but want to see ${srcdir} in their rules.	2015-05-19 14:09:53 +02:00
Guus Sliepen	96a323e16a	Remove info-in-builddir option from AM_INIT_AUTOMAKE(). This option is not supported by older, but still widely used versions of automake. The drawback is that when doing multiple VPATH builds in a row, the info manual may mention incorrect paths, but it doesn't affect the executables at all.	2015-05-19 13:31:26 +02:00
Sven-Haegar Koch	51b5aab9b0	Fix check for public key in invite-join.test. Small fix to test/invite-join.test, comparing no-longer-existing ECDSAPublicKey does not make sense.	2015-05-19 13:30:42 +02:00
Etienne Dechamps	a196e9b0fd	Fix direct UDP communication with pre-relaying 1.1 nodes. try_tx_sptps() gives up on UDP communication if the recipient doesn't support relaying. This is too restrictive - we only need the other node to support relaying if we actually want to relay through them. If the packet is sent directly, it's fine to send it to an old pre-node-IDs tinc-1.1 node.	2015-05-18 21:08:43 +01:00
Etienne Dechamps	fef29d0193	Don't parse node IDs if the sending node doesn't support them. Currently, tinc tries to parse node IDs for all SPTPS packets, including ones sent from older, pre-node-IDs tinc-1.1 nodes, and therefore doesn't recognize packets from these nodes. This commit fixes that. It also makes code slightly clearer by reducing the amount of fiddling around packet offset/length.	2015-05-18 20:56:16 +01:00
Etienne Dechamps	643149b449	Fix SPTPS condition in try_harder(). A condition in try_harder() is always evaluating to false when talking to a SPTPS node because n->status.validkey_in is always false in that case. Fix the condition so that the SPTPS status is correctly checked. This prevented recent tinc-1.1 nodes from talking to older, pre-node-ID tinc-1.1 nodes. The regression was introduced in `6056f1c13b`.	2015-05-18 20:38:01 +01:00
Etienne Dechamps	01d2519862	Don't pollute the system header directory namespace. Since commit `13f9bc1ff1`, tinc passes the -I. option to the preprocessor so that version_git.h can be found during out-of-tree ("VPATH") builds. The problem is, this option also affects the directory search for files included from system headers. For example, on MinGW, unistd.h contains the following line: #include <process.h> Which, due to -I. putting the tinc directory at the head of the search order, results in tinc's process.h being included instead of the file from MinGW. Hilarity ensues. This commit fixes the issue by using -iquote, which doesn't affect system headers.	2015-05-17 22:40:48 +01:00
Etienne Dechamps	c1154bf696	Make sure the MIN() macro is defined. On MinGW this is not automatically the case, thereby breaking the build.	2015-05-17 22:21:11 +01:00
Guus Sliepen	5c32bd1578	Merge remote-tracking branches 'dechamps/sptpsrestart' and 'dechamps/keychanged' into 1.1	2015-05-17 21:07:45 +02:00
Etienne Dechamps	2cb216d83d	Don't send KEY_CHANGED messages if we don't support the legacy protocol. KEY_CHANGED messages are only useful to invalidate keys for non-SPTPS nodes; SPTPS nodes use a different internal mechanism (forced KEX) for that purpose. Therefore, if we know we can't talk to legacy nodes, there's no point in sending them these messages.	2015-05-17 19:27:20 +01:00
Etienne Dechamps	1a7a9078c0	Proactively restart the SPTPS tunnel if we get receive errors. There are a number of ways a SPTPS tunnel can get into a corrupt state. For example, during key regeneration, the KEX and SIG messages from other nodes might arrive out of order, which confuses the hell out of the SPTPS code. Another possible scenario is not noticing another node crashed and restarted because there was no point in time where the node was seen completely disconnected from all nodes; this could result in using the wrong (old) key. There are probably other scenarios which have not even been considered yet. Distributed systems are hard. When SPTPS got confused by a packet, it used to crash the entire process; fortunately that was fixed by commit `2e7f68ad2b`. However, the error handling (or lack thereof) leaves a lot to be desired. Currently, when SPTPS encounters an error when receiving a packet, it just shrugs it off and continues as if nothing happened. The problem is, sometimes getting receive errors mean the tunnel is completely stuck and will not recover on its own. In that case, the node will become unreachable - possibly indefinitely. The goal of this commit is to improve SPTPS error handling by taking proactive action when an incoming packet triggers a failure, which is often an indicator that the tunnel is stuck in some way. When that happens, we simply restart SPTPS entirely, which should make the tunnel recover quickly. To prevent "storms" where two buggy nodes flood each other with invalid packets and therefore spend all their time negotiating new tunnels, we limit the frequency at which tunnel restarts happen to ten seconds. It is likely this commit will solve the "Invalid KEX record length during key regeneration" issue that has been seen in the wild. It is difficult to be sure though because we do not have a full understanding of all the possible conditions that can trigger this problem.	2015-05-17 19:21:50 +01:00
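The ten-second limit on tunnel restarts can be sketched as a small rate limiter. `tunnel_t`, `on_receive_error()` and `RESTART_INTERVAL` are hypothetical names for this illustration; only the ten-second figure comes from the message.

```c
#include <stdbool.h>
#include <time.h>

#define RESTART_INTERVAL 10 /* seconds, as stated in the commit message */

typedef struct tunnel {
	time_t last_restart; /* time of the most recent restart */
	int restarts;        /* number of restarts so far */
} tunnel_t;

/* Called when an incoming packet triggers a receive error.
   Returns true if the tunnel should be restarted now, and false if a
   restart happened too recently and this error should be ignored. */
static bool on_receive_error(tunnel_t *t, time_t now) {
	if(t->restarts && now - t->last_restart < RESTART_INTERVAL)
		return false;

	t->last_restart = now;
	t->restarts++;
	return true;
}
```

Two misbehaving peers can then trigger at most one renegotiation per interval, however many invalid packets they exchange.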
Etienne Dechamps	aa52300b2b	Trivial: make sptps_receive_data_datagram() a little more readable. The new code updates variables as stuff is being consumed, so that the reader doesn't have to do that in his head.	2015-05-17 17:52:15 +01:00
Guus Sliepen	30e839b0a1	Don't send local_address in ADD_EDGE messages if it's AF_UNSPEC.	2015-05-17 18:44:09 +02:00
Sven-Haegar Koch	23fda4db6d	Let sockaddr2hostname() handle AF_UNSPEC addresses.	2015-05-17 18:43:34 +02:00
Etienne Dechamps	1e89a63f16	Prevent SPTPS key regeneration packets from entering a UDP relay path. Commit `10c1f60c64` introduced a mechanism by which a packet received by REQ_KEY could continue its journey over UDP. This was based on the assumption that REQ_KEY messages would never be used for handshake packets (which should never be sent over UDP, because SPTPS currently doesn't handle lost handshake packets very well). Unfortunately, there is one case where handshake packets are sent using REQ_KEY: when regenerating the SPTPS key for a pre-established channel. With the current code, such packets risk getting relayed over UDP. When processing a REQ_KEY message, it is impossible for the receiving end to distinguish between a data SPTPS packet and a handshake packet, because this information is stored in the type field which is encrypted with the end-to-end key. This commit fixes the issue by making tinc use ANS_KEY for all SPTPS handshake messages. This works because ANS_KEY messages are never forwarded using the SPTPS relay mechanisms, therefore they are guaranteed to stick to TCP.	2015-05-17 17:09:56 +01:00
Guus Sliepen	eecfeadeb4	Let sockaddr2str() handle AF_UNSPEC addresses.	2015-05-16 02:01:54 +02:00
Guus Sliepen	613c121cdc	Try all addresses for the hostname in an invitation URL.	2015-05-15 23:35:46 +02:00
Guus Sliepen	54a8bd78e3	Be more liberal accepting ADD_EDGE messages with conflicting local address information. If the ADD_EDGE is for one of the edges we own, and if it is not the same as we actually have, send a correcting ADD_EDGE back. Otherwise, if the ADD_EDGE contains new information, update our idea of the local address for that edge. If the ADD_EDGE does not contain local address information, then we never make a correction nor log a warning.	2015-05-15 23:08:53 +02:00
Guus Sliepen	8028e01100	Use AF_UNSPEC instead of AF_UNKNOWN for unspecified local address in add_edge_h(). AF_UNKNOWN is reserved for valid addresses that the local node cannot parse, but remote nodes possibly can.	2015-05-15 23:01:06 +02:00
Guus Sliepen	fd1cff6df2	Fix receiving UDP packets from tinc 1.0.x nodes. In try_mac(), the wrong offsets were used into the packet buffer, causing the digest verification to always fail.	2015-05-15 00:21:48 +02:00
Guus Sliepen	44e9f1e1d8	Fix invitations. These were broken due to a change in behaviour of sptps_receive_data() introduced in commit `d237efd325`.	2015-05-13 14:28:28 +02:00
Etienne Dechamps	7e6b2dd1ea	Introduce raw TCP SPTPS packet transport. Currently, SPTPS packets are transported over TCP metaconnections using extended REQ_KEY requests, in order for the packets to pass through tinc-1.0 nodes unaltered. Unfortunately, this method presents two significant downsides: - An already encrypted SPTPS packet is decrypted and then encrypted again every time it passes through a node, since it is transported over the SPTPS channels of the metaconnections. This double-encryption is unnecessary and wastes CPU cycles. - More importantly, the only way to transport binary data over standard metaconnection messages such as REQ_KEY is to encode it in base64, which has a 33% encoding overhead. This wastes 25% of the network bandwidth. This commit introduces a new protocol message, SPTPS_PACKET, which can be used to transport SPTPS packets over a TCP metaconnection in an efficient way. The new message is appropriately protected through a minor protocol version increment, and extended REQ_KEY messages are still used with nodes that do not support the new message, as well as for the initial handshake packets, for which efficiency is not a concern. The way SPTPS_PACKET works is very similar to how the traditional PACKET message works: after the SPTPS_PACKET message, the raw binary packet is sent directly over the metaconnection. There is one important difference, however: in the case of SPTPS_PACKET, the packet is sent directly over the TCP stream completely bypassing the SPTPS channel of the metaconnection itself for maximum efficiency. This is secure because the SPTPS packet that is being sent is already encrypted with an end-to-end key.	2015-05-10 21:08:57 +01:00
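The two percentages in this message describe the same cost from different angles, and a short calculation shows why. The helpers below are hypothetical and assume standard padded base64, where every 3 input bytes become 4 output characters; whether tinc's encoder pads its output is not stated here.

```c
#include <stddef.h>

/* Length of the padded base64 encoding of n bytes. */
static size_t base64_encoded_len(size_t n) {
	return 4 * ((n + 2) / 3);
}

/* Extra bytes relative to the original payload (requires n > 0). */
static double overhead_ratio(size_t n) {
	return (double)(base64_encoded_len(n) - n) / (double)n;
}

/* Share of the transmitted bytes that carries no payload (requires n > 0). */
static double wasted_fraction(size_t n) {
	size_t enc = base64_encoded_len(n);
	return (double)(enc - n) / (double)enc;
}
```

For a 1500-byte packet the encoded form is 2000 bytes. The 500 extra bytes are one third of the payload (the 33% overhead) and one quarter of what goes on the wire (the 25% of bandwidth).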

2577 commits