
[SUBGRAPH] Validate Balance-centric Subgraph #728

Closed
vmichalik opened this issue Apr 6, 2022 · 14 comments
vmichalik commented Apr 6, 2022

Describe the issue:
Try building a balance-centric subgraph, per the below design.
[diagram: proposed balance-centric subgraph design]

The intended benefits are as follows:

  • Web3 calls are no longer required for querying balances
  • The notifications and automation backend (platform) can be more minimal
  • Data dashboards for solvency health are simpler to produce
  • Data queries are more human-readable
  • Data queries are better suited to data visualization

Timebox:
This spike should be capped at 3 days and reviewed at the end.

Additional context:

  • A primary concern for this subgraph is that it may create DDOS attack vectors.
vmichalik added the Project: SUBGRAPH and Type: Spike labels on Apr 6, 2022
hellwolf commented Apr 6, 2022

Not sure I like it being called "Solvency-centric"; we will find a better name for it during the investigation.

vmichalik changed the title from "[SUBGRAPH] Validate Solvency-centric Subgraph" to "[SUBGRAPH] Validate Balance-centric Subgraph" on Apr 11, 2022
kasparkallas commented Apr 13, 2022

Updated to a more granular diagram:
[diagram: updated, more granular subgraph entity/event design]

Most notable changes:

  • IndexUnitsUpdatedEvent corrected to IndexUpdatedEvent
  • Added the missing SubscriptionDistributionClaimedEvent & SubscriptionApprovedEvent

@kasparkallas

Addition to the latest diagram:
[diagram: previous diagram with the liquidation events added]

  • Added missing AgreementLiquidatedByEvent & AgreementLiquidatedV2Event

msoni89 commented Apr 14, 2022

[image]

msoni89 commented Apr 21, 2022

[screenshots: load-test results]

Reason the distribute failed:

    ERRO Subgraph failed with non-deterministic error: failed to process trigger: block #11032 (0x7803…db85), transaction ec1963108703eb7038d81ece686410008e3d0ee146c59776b8d0b2560e0851c0: Gas limit exceeded. Used: 10000025994697
    wasm backtrace:
      0: 0x4ac5 - <unknown>!src/mappingHelpers/getOrInitAccountTokenBalance
      1: 0x6c89 - <unknown>!src/mappings/idav1/createIndexSubscribedEntity

Total records: 10,000
Run time: 4.5 hours (3 hours of injection; indexing of all transactions completed after a further 1.5 hours).
Transactions: createIndex, updateSubscriptionUnits (batches of 10), approveSubscription (in a loop), and distribute.

kasparkallas commented Apr 21, 2022

The Graph Node has an internal concept of gas that measures the resources used by an event handler (mapper). This "gas" is in no way related to blockchain gas. In the load test, the event handler is running into one of those limits. Here is a file specifying some of them: https://github.com/graphprotocol/graph-node/blob/master/graph/src/runtime/gas/costs.rs
For example, the absolute theoretical maximum would be saving 250k entities in one event handler.

Looking through the code, I don't think there's any way to override these limits, so self-hosting wouldn't help. It's obviously reasonable for Graph Node to put a limit somewhere, and from the load testing we can conclude that up to roughly 5k subscriptions we're fine...
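
For illustration, here is a minimal sketch of the pattern that runs into this limit (TypeScript/AssemblyScript-style; the entity classes and fields are hypothetical, not the actual Superfluid subgraph schema): a distribution-style event handler that loads and saves one entity per subscriber, so its handler "gas" cost grows linearly with the subscriber count.

```typescript
// Hypothetical sketch only; the entity classes (Index, Subscription,
// AccountTokenBalance) and fields (subscriptionIds, token, subscriber, units)
// are assumptions for illustration, not the real Superfluid schema.
import { BigInt, ethereum } from "@graphprotocol/graph-ts";
import { Index, Subscription, AccountTokenBalance } from "../generated/schema";

export function handleIndexUpdated(event: ethereum.Event): void {
  // In the real mapping the index id would be derived from event params.
  let index = Index.load("<index-id>");
  if (index == null) return;

  // Placeholder for the pro-rata amount; would come from event params.
  let amountPerUnit = BigInt.fromI32(0);

  // One entity load + save per subscriber. Graph Node charges handler
  // "gas" for every store operation, so the total cost is O(subscribers)
  // and a large enough index blows past the per-handler limit.
  let subscriptionIds = index.subscriptionIds; // hypothetical field
  for (let i = 0; i < subscriptionIds.length; i++) {
    let sub = Subscription.load(subscriptionIds[i]);
    if (sub == null) continue;

    let balance = AccountTokenBalance.load(sub.subscriber + "-" + index.token);
    if (balance == null) continue;

    balance.balance = balance.balance.plus(amountPerUnit.times(sub.units));
    balance.save();
  }
}
```

Work of roughly this shape is what the load test above pushed past the limit.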

The question now is whether we go forward knowing that this attack vector clearly exists. I don't think it matters too much for practical uses whether the limit is at 2k, 20k, or 200k. We could still limit the number of subscribers in the protocol to enable this awesome off-chain capability without the attack vector.

I also wonder how the current solvency system handles a situation with 100k-subscriber indexes doing updates. I haven't read the code, but my guess is that it would make 100k calls to get the realtime balance (though not for every event)? There are probably some DDoS attack vectors there as well...

msoni89 commented Apr 21, 2022

I think there is a way to set it, but the question is whether we really need to support that many subscribers.

[screenshots]

msoni89 commented Apr 21, 2022

[screenshot: results with 5,000 subscribers]

hellwolf commented Apr 22, 2022

Also note that you can issue 100 distributions to the same index batched in the same block, and that would only require, say, at most 1000 subscribers to trigger the issue.

@hellwolf

This is not a good design; it abuses the current subgraph architecture. We should not bake undocumented limitations into the system.

Note on the solvency model:

The solvency model is aware of this, but since all subscribers get more liquidity rather than less, a "lazy evaluation" of checking their balances is an optimization that can be done, and is done, in our solvency implementations. So solvency does not have the same scalability issue.
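
To illustrate the idea, here is a conceptual sketch of that lazy evaluation (plain TypeScript with made-up names; not the actual sentinel code):

```typescript
// Conceptual sketch of the "lazy evaluation" idea; not the actual sentinel
// implementation. All names and shapes here are illustrative assumptions.

interface MonitoredAccount {
  address: string;
  lastKnownBalance: bigint;
  nextCheckAt: number; // unix time of the next scheduled solvency check
}

// Hypothetical RPC helper signature for fetching a realtime balance.
type GetRealtimeBalance = (address: string) => Promise<bigint>;

async function onDistribution(
  subscribers: MonitoredAccount[],
  getRealtimeBalance: GetRealtimeBalance,
  lazy: boolean = true,
): Promise<void> {
  if (lazy) {
    // A distribution only adds liquidity, so no subscriber's liquidation
    // deadline can move earlier than what is already scheduled. The
    // already-scheduled checks will observe the larger balance whenever
    // they run, so there is nothing to do eagerly here.
    return;
  }
  // Eager alternative: one realtime-balance RPC call per subscriber,
  // i.e. O(subscribers) calls per distribution event.
  for (const account of subscribers) {
    account.lastKnownBalance = await getRealtimeBalance(account.address);
  }
}
```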

@hellwolf

This spike can be considered done, and we will take these findings to the subgraph team.

@kasparkallas

> Also note that you can issue 100 distributions to the same index batched in the same block, and that would only require, say, at most 1000 subscribers to trigger the issue.

The limitation might be per event, not per block. I'll ask in Discord.

kasparkallas commented Apr 22, 2022

> Note on the solvency model:
>
> The solvency model is aware of this, but since all subscribers get more liquidity rather than less, a "lazy evaluation" of checking their balances is an optimization that can be done, and is done, in our solvency implementations. So solvency does not have the same scalability issue.

Yep, I get it; probably not an issue here. The solvency sentinel is in no rush to account for events that cannot affect a balance negatively.

That said, the sentinel is probably going to make an RPC call for all the subscribers at some point, to account for the new liquidation date?

> We should not bake undocumented limitations into the system.

Limitations can be documented and governed.

> This is not a good design; it abuses the current subgraph architecture.

The way the Subgraph maps events is the most common and straightforward approach for any event-sourced system: going through the events linearly, one by one, and projecting as it goes. It's also normal for an event to create many entities -- the Subgraph has it documented that theoretically 250k is fine -- but obviously, as long as Superfluid is unbounded, this can be broken and the mapper would stop. There always has to be a solution / fallback for this scenario going forward anyway.

The solution?

Because putting a cap in the protocol seems to be a definite no-go, we can put a subscriber cap in the Subgraph (we're already counting the subscribers). Once an index hits, e.g., 2k subscribers, we stop updating the subscribers' balances for distributions and stop creating the IndexSubscriptionDistributions. The visualizations and activity history will not be perfect for accounts subscribed to these huge indexes... but that's a non-issue when it works 99%+ of the time and nothing critical depends on it.

The biggest negative effect of this (or loss of a positive effect) is that when calculating balances, we always have to make an RPC call. If the Subgraph had a "perfect history" of balances, then in theory we would not have to rely on RPC calls anymore. RPC calls are what slow down indexing the most.
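
For what it's worth, a rough sketch of what such a cap guard could look like in the mapping (hypothetical entity, field, and helper names; the 2k value is just the example figure from above):

```typescript
// Hypothetical sketch of the proposed subscriber cap; not actual subgraph
// code. The Index entity, its totalSubscriptions field (assumed to be a
// plain integer count), and the commented-out helpers are illustrative.
import { Index } from "../generated/schema";

// Example cap; the exact value would be documented and governed.
const SUBSCRIBER_CAP = 2000;

export function projectDistribution(indexId: string): void {
  let index = Index.load(indexId);
  if (index == null) return;

  // We already count subscribers on the Index entity.
  if (index.totalSubscriptions > SUBSCRIBER_CAP) {
    // Above the cap: skip all per-subscriber work so handler "gas" stays
    // bounded. Subscriber balances and IndexSubscriptionDistribution
    // entities are simply not materialized for these huge indexes;
    // clients fall back to an RPC call when they need an exact balance.
    return;
  }

  // Below the cap: the O(subscribers) projection is safe to run, e.g.:
  //   updateSubscriberBalances(index);              // hypothetical helper
  //   createIndexSubscriptionDistributions(index);  // hypothetical helper
}
```

The point is only that the guard is evaluated before any per-subscriber work, so the handler's cost stays bounded no matter how large an index grows.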

This would still give us:

  • A Subgraph-based solvency sentinel
  • Visualizations in products
  • Up-to-date balances in the Subgraph for Ricochet users
  • A more natural data model:
    • Would enable querying all balance-affecting activities of an account in one go, and having them ordered

@kasparkallas

> Also note that you can issue 100 distributions to the same index batched in the same block, and that would only require, say, at most 1000 subscribers to trigger the issue.
>
> The limitation might be per event, not per block. I'll ask in Discord.

Got a confirmation that it's per event: graphprotocol/graph-node#2414 (comment)

hellwolf added this to the [email protected] milestone on May 9, 2022
hellwolf closed this as completed on May 9, 2022