Make FindNode according to spec #205

kdeme · 2020-04-02T08:12:01Z

The current FindNode call will respond with ENRs closest to distance. According to spec it should only be ENRs with that distance. This is because we reuse the discovery v4 code where it should indeed respond with all closest nodes (<= k)

Next to that, there are some weird findings in vacp2p/research#19 . It should be verified if this is due to bugs in the findNode implementation.

The text was updated successfully, but these errors were encountered:

kdeme · 2020-04-10T16:00:32Z

After further testing the FindNode call I noticed that indeed sometimes nodes would be returned that were not the closest. A whole bucket could be missing even from the returned values.
Turns out the idAtDistance calculation was wrong. This is fixed here: 99c68d4 / #219

Now, why do we need this idAtDistance? This too comes from the fact that we use the Kademlia implementation of our discovery v4 code.

The Kademlia paper describes an implementation that starts off from one k-bucket, and keep splitting the bucket as more nodes are discovered and added. The bucket splits only on the part of the binary tree that our own id belongs too (same prefix). Resulting eventually in k-buckets per log base 2 distance. Well, not really, as nodes with ids in the lower distance ranges will never be found. And because of this an optimisation is done where buckets will also split even if the nodes own id does not have the same prefix (to avoid highly unbalanced branches).

Now some implementations take a more simplified approach. They simply directly create buckets for each possible logarithmic distance (e.g. here 1->256). Some implementations also just don't create buckets with logarithmic distance < x, as the lower the distances goes, the less chance there is that you will find nodes for it.

Now, discoveryv4 its FindNode call would request k closest nodes. As does original Kademlia.
This effectively puts the work at the node that gets the request. This node will have to check its buckets and gather the closest. Some implementations go over all the nodes in all the buckets for this (e.g. go-ethereum discv4). However, in our bucket splitting approach, this search is improved.

Now, in Discoveryv5 the approach was taken to pass the logarithmic distance instead of the NodeId as parameter for the FindNode call. And to only return nodes that match that logarithmic distance.

This was done to not put the trust at the node for selecting the closest nodes. Possibly a difference in implementation, but probably more important a security issue. See also: https://github.com/ethereum/devp2p/blob/master/discv5/discv5-rationale.md#115-guard-against-kademlia-implementation-flaws

The result is, that in an implementation which just stores buckets per logarithmic distance, it simply has to return the bucket. In our split-bucket implementation, this can't be done and we are still doing the closest search, which is also why we need the idAtDistance proc. Next we will need to filter out the nodes with invalid distances, to be compliant to the specification (This still needs to be added still). Perhaps some things can get optimised there, but, it sounds better to just switch away from the split-bucket approach. As I think that the main benefit it has is improved lookups (due to no unbalanced branches), and I believe this will be negated by limiting the returned nodes to the ones of the requested logarithmic distance for the FindNode call.

This FindNode change will also have an effect on the efficiency of the network. Work will be moved from the receiver of FindNodes to the requester. But this also means more network traffic, as less nodes will potentially be passed around, and thus more requests will be needed for a lookup (adding bandwidth and latency). This might be a concern for mobile devices.

arnetheduck · 2020-04-10T21:15:04Z

this fine description belongs in the source code ;)

kdeme · 2020-04-21T19:36:13Z

According to spec now after #223 + tests added.

kdeme added the discoveryv5 Discovery v5 related issues label Apr 2, 2020

kdeme mentioned this issue Apr 2, 2020

Discv5 Improvements status-im/nimbus-eth2#848

Closed

9 tasks

kdeme mentioned this issue Apr 10, 2020

discv5 vacp2p/research#19

Merged

kdeme closed this as completed Apr 21, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make FindNode according to spec #205

Make FindNode according to spec #205

kdeme commented Apr 2, 2020 •

edited

Loading

kdeme commented Apr 10, 2020 •

edited

Loading

arnetheduck commented Apr 10, 2020

kdeme commented Apr 21, 2020

Make FindNode according to spec #205

Make FindNode according to spec #205

Comments

kdeme commented Apr 2, 2020 • edited Loading

kdeme commented Apr 10, 2020 • edited Loading

arnetheduck commented Apr 10, 2020

kdeme commented Apr 21, 2020

kdeme commented Apr 2, 2020 •

edited

Loading

kdeme commented Apr 10, 2020 •

edited

Loading