engine: tag peers based on usefulness #191

Stebalien · 2019-09-07T02:09:18Z

This patch tracks two usefulness metrics: short-term usefulness and long-term usefulness. Short-term usefulness is sampled frequently and weights new observations heavily. Long-term usefulness is sampled less frequently and weights long-term trends heavily.

In practice, we do this by keeping two EWMAs (exponentially weighted moving averages). If we see an interaction within the sampling period, we record the score; otherwise, we record a 0. The short-term EWMA has a high alpha and is sampled once every shortTerm period. The long-term EWMA has a low alpha and is sampled once every longTermRatio*shortTerm period.
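A minimal sketch of the two-EWMA sampling described above. The `ewma` helper matches decision/ewma.go; everything else (the `usefulness` type, its `observe` and `tick` methods, and the constant values) is hypothetical and chosen only for illustration, not taken from the patch.

```go
package main

import "fmt"

// ewma blends a new sample into the running average. A higher alpha
// gives more weight to the new sample.
func ewma(old, new, alpha float64) float64 {
	return new*alpha + (1-alpha)*old
}

// Illustrative values only; the real constants live in the patch.
const (
	shortTermAlpha = 0.5  // high alpha: reacts quickly
	longTermAlpha  = 0.05 // low alpha: tracks long-term trends
	longTermRatio  = 10   // long-term sample every 10 short-term periods
)

// usefulness is a hypothetical per-peer tracker.
type usefulness struct {
	shortTerm, longTerm float64
	sawShort, sawLong   bool // interaction seen in the current period?
	ticks               int
}

// observe records that the peer interacted with us.
func (u *usefulness) observe() {
	u.sawShort, u.sawLong = true, true
}

// tick is called once per short-term period. It samples score if an
// interaction was seen during the period and 0 otherwise.
func (u *usefulness) tick(score float64) {
	sample := 0.0
	if u.sawShort {
		sample = score
	}
	u.shortTerm = ewma(u.shortTerm, sample, shortTermAlpha)
	u.sawShort = false

	u.ticks++
	if u.ticks%longTermRatio == 0 {
		sample = 0.0
		if u.sawLong {
			sample = score
		}
		u.longTerm = ewma(u.longTerm, sample, longTermAlpha)
		u.sawLong = false
	}
}

func main() {
	u := &usefulness{}
	for i := 0; i < 30; i++ {
		if i < 10 {
			u.observe() // peer is active for the first 10 periods only
		}
		u.tick(1.0)
	}
	fmt.Printf("short=%.4f long=%.4f\n", u.shortTerm, u.longTerm)
}
```

With these assumed values, the short-term score decays within a few idle periods, while the long-term score retains most of its value across several long-term samples.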

To calculate the final score, we sum the short-term and long-term scores, then adjust the sum by ±25% based on our debt ratio. Peers that have historically been more useful to us than we are to them get the highest score.
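A rough sketch of the score calculation above. The ±25% range comes from this description; the way the debt ratio is mapped onto that range here (the fraction of exchanged bytes that we received) is an assumption, and `receivedFraction` and `peerScore` are hypothetical names rather than functions from the patch.

```go
package main

import "fmt"

// receivedFraction returns the share of all exchanged bytes that we
// received from the peer, in [0, 1]. This is an assumed stand-in for
// the debt ratio; the patch may compute it differently.
func receivedFraction(bytesSent, bytesRecv uint64) float64 {
	total := bytesSent + bytesRecv
	if total == 0 {
		return 0.5 // no history yet: neutral adjustment
	}
	return float64(bytesRecv) / float64(total)
}

// peerScore sums the two usefulness scores and scales the sum by a
// factor between 0.75 and 1.25, i.e. an adjustment of ±25%.
func peerScore(shortTerm, longTerm float64, bytesSent, bytesRecv uint64) float64 {
	adjustment := 0.75 + 0.5*receivedFraction(bytesSent, bytesRecv)
	return (shortTerm + longTerm) * adjustment
}

func main() {
	// Same usefulness scores, different exchange histories.
	fmt.Println(peerScore(1, 1, 100, 0)) // we only sent to the peer
	fmt.Println(peerScore(1, 1, 0, 0))   // no exchange history
	fmt.Println(peerScore(1, 1, 0, 100)) // we only received from the peer
}
```

Under this assumption, a peer that has only ever sent us data gets the full +25%, and a peer that has only ever taken data from us gets the full -25%.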

The number of hard-coded constants makes me really uncomfortable. I'm wondering if we should:

- Instead of using a long-term and a short-term metric, use a vector. There has to be a paper on this.
- Somehow try to learn these values at runtime.

What I'd like to do is try this on the gateways and cluster and see what happens. We'll probably need to somehow feed connection manager information into grafana to get a good picture of whether or not this is working.



lanzafame · 2019-09-10T00:53:40Z

decision/ewma.go

+package decision
+
+func ewma(old, new, alpha float64) float64 {
+	return new*alpha + (1-alpha)*old


@Stebalien I know I am late to this PR, but if you are willing to shoulder the import, I highly recommend using gonum/floats for the multiplications, as it is significantly faster than stdlib.

I'd rather keep this case simple as I don't think this is going to be a bottleneck.

lanzafame · 2019-09-10T00:56:18Z

> What I'd like to do is try this on the gateways and cluster and see what happens. We'll probably need to somehow feed connection manager information into grafana to get a good picture of whether or not this is working.

@Stebalien I didn't see anywhere that exposed the values as metrics for them to be able to be collected and exposed in grafana?

Stebalien · 2019-09-10T17:16:37Z

> I didn't see anywhere that exposed the values as metrics for them to be able to be collected and exposed in grafana?

The connection manager exposes them but we'd need to find some way to pipe them out to grafana. Once we do that, I'd like to pipe dht routing tables and bitswap ledgers out as well so we can compare values and see how accurate our measurements are.

engine: tag peers based on usefulness This commit was moved from ipfs/go-bitswap@fef4be2

Stebalien added 3 commits September 6, 2019 19:01

engine(test): make the test peer tagger more reliable

cdc87be

engine(test): test peer usefulness tagging

1f09ef5

Stebalien mentioned this pull request Sep 7, 2019

Expose connection manager metrics over Prometheus ipfs/kubo#6634

Open

Stebalien commented Sep 7, 2019


Stebalien requested a review from dirkmc September 7, 2019 02:32

dirkmc reviewed Sep 9, 2019


engine(doc): comment on why we have the score adjustment

fcb13fc

dirkmc approved these changes Sep 9, 2019


Stebalien merged commit fef4be2 into master Sep 9, 2019

Stebalien deleted the feat/tag-ewma branch September 9, 2019 15:18

lanzafame reviewed Sep 10, 2019


This was referenced Jan 17, 2020

Release v0.4.23 ipfs/kubo#6836

Closed

Release v0.4.23 ipfs/kubo#6837

Closed

Jorropo pushed a commit to Jorropo/go-libipfs that referenced this pull request Jan 26, 2023

Merge pull request ipfs/go-bitswap#191 from ipfs/feat/tag-ewma

3aea503

engine: tag peers based on usefulness This commit was moved from ipfs/go-bitswap@fef4be2
