
Added balances for Yield Yak vaults #5737

Merged: 30 commits merged into duneanalytics:main on Apr 25, 2024
Conversation

yy-analytics (Contributor)

Thank you for contributing to Spellbook!

Thank you for taking the time to submit code to Spellbook. A few things to consider:

  • If you are a first-time contributor, please sign the CLA by copy-pasting exactly what the bot requests in the PR comment
  • Refer to the docs section below for answers to common questions
  • The Dune team will review submitted PRs as soon as possible

Best practices

To speed up your development process in PRs, keep these tips in mind:

  • Each commit to your feature branch will rerun the CI tests (see example)
    • This includes all modified models on your branch
    • This includes the full history of the data
  • Two tips for faster development iteration:
    • Ensure dbt is installed locally (refer to the main readme) and run dbt compile
      • This outputs raw SQL in the target/ directory that you can copy/paste and run on Dune directly for initial query testing
    • Hardcode a WHERE filter for only ~7 days of history on large source tables, e.g. ethereum.transactions
      • This speeds up the CI tests and returns results quicker -- whether that's an error or a fully successful run
      • Once comfortable with the small timeframe, remove the filter and let the full history run
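The temporary-filter tip above can be sketched as follows. This is a minimal example, not code from this PR; the column selection follows the ethereum.transactions example, and the filter line is the part you would remove before the final full-history run:

```sql
select
    block_time
    , block_number
    , hash
    , "from"
    , "to"
from {{ source('ethereum', 'transactions') }}
-- temporary filter for faster CI iteration; remove before the full-history run
where block_time > now() - interval '7' day
```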

Incremental model setup

  • Make sure your unique key columns are exactly the same in the model config block, the schema yml file, and the seed match columns (where applicable)
  • There cannot be nulls in the unique key columns
    • Double-check that the key columns are correct, or COALESCE() as needed on the key column(s); otherwise the tests may fail on duplicates
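As a sketch of the setup above, the same key columns would appear in the model's config block and be kept non-null in the select. Model and column names here are hypothetical, not from this PR:

```sql
-- models/example/example_balances.sql (hypothetical model)
{{ config(
    materialized = 'incremental',
    unique_key = ['blockchain', 'block_date', 'wallet_address', 'token_address']
) }}

select
    blockchain
    , block_date
    -- COALESCE a nullable key column so the uniqueness tests don't fail
    , coalesce(wallet_address, 0x0000000000000000000000000000000000000000) as wallet_address
    , token_address
    , balance
from {{ ref('example_balances_source') }}   -- hypothetical upstream spell
```

The same four columns would then be listed for the uniqueness test in the schema yml file and as the seed match columns.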

🪄 Use the built CI tables for testing 🪄

Once CI completes, you can query the CI tables (and inspect any errors) on Dune.

  • For example:
    • In the run initial models and test initial models jobs, there will be a schema that looks like this: test_schema.git_dunesql_4da8bae_sudoswap_v2_base_pool_creations
    • This can be temporarily queried on Dune for ~24 hours

Leverage these tables to perform QA testing in the Dune query editor -- or even build full test dashboards!

Spellbook contribution docs

The docs directory was created to answer as many questions as possible. Please take the time to read each .md file within this directory to understand how to contribute efficiently and why the repo is designed the way it is 🪄

Please navigate through the docs directory to find as much info as you can.

Note: we're happy to take PRs to improve the docs, let us know 🤝

@dune-eng commented Apr 4, 2024: Workflow run id 8552515453 approved.

@dune-eng commented Apr 4, 2024: Workflow run id 8552515769 approved.

@dune-eng commented Apr 4, 2024: Workflow run id 8552521437 approved.

@dune-eng commented Apr 4, 2024: Workflow run id 8552521711 approved.

@yy-analytics (Contributor, Author)

I'm expecting some issues with this PR but would like to discuss the approach. I opened a thread in Discord here but haven't heard anything back yet. I've not used the FROM {{ source(schema, name) }} style when querying certain tables, because the list of tables to UNION is generated dynamically using run_query combined with the yield_yak_strategies(blockchain) macro, and I can't use macros in my sources.yml file, so I have no way to dynamically update the sources when using FROM {{ source(schema, name) }} statements. I've done it this way to avoid having to hardcode strategy contract names (new ones are sometimes added fairly often, and I'm trying to avoid opening a PR every time a new strategy is added).
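The pattern being described looks roughly like the sketch below. This is not the PR's actual code; the query and table names are hypothetical, and only the shape of the run_query-plus-loop approach is taken from the comment:

```sql
{% macro yield_yak_balances(blockchain) %}
    {# Fetch the list of strategy tables at compile time.
       run_query executes SQL against the warehouse during compilation,
       which is what the reviewer flags below as unsupported. #}
    {% set strategy_query %}
        select distinct namespace, name
        from {{ blockchain }}.contracts   -- hypothetical source of strategy names
        where name like 'YakStrategy%'
    {% endset %}

    {% set results = run_query(strategy_query) %}

    {# Build one UNION ALL branch per strategy table #}
    {% for row in results.rows %}
        select * from {{ row[0] }}.{{ row[1] }}
        {% if not loop.last %} union all {% endif %}
    {% endfor %}
{% endmacro %}
```

Because the table list is only known at compile time, none of these tables can be declared statically in sources.yml, which is the tension this comment raises.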

@jeff-dude jeff-dude added the WIP work in progress label Apr 5, 2024
@jeff-dude (Member)

> I've not used the FROM {{ source(schema, name) }} style when querying certain tables because the list of tables to UNION is generated dynamically by using run_query combined with the yield_yak_strategies(blockchain) macro, and I can't use macros in my sources.yml file […]

i believe i responded yesterday. unfortunately, i don't think your approach here will work with run_query. does it make sense to build a separate spell solely that outputs the list of tables you need, then call that spell in the main spell to obtain the list? then dbt will handle the dependencies for you.

we don't allow direct access to the db, so run_query will break some things.

@yy-analytics (Contributor, Author)

> i don't think your approach here will work with run_query. does it make sense to build a separate spell solely that outputs a list of tables you need, then call that spell in the main spell to obtain the list? […] we don't allow direct access to db, so run_query will break some things.

Thanks @jeff-dude. I could be wrong on this, but the issue I see with what you're describing is that I need the result of the query as a Jinja variable (specifically, a list) that I can then loop over to create the "main" query. If I were to have a separate spell and select from it using ref, isn't that just the same as having it in a CTE? The main approach I'm going for is in the yield_yak_balances.sql macro file.

@yy-analytics (Contributor, Author)

> i don't think your approach here will work with run_query. […] we don't allow direct access to db, so run_query will break some things.

> Thanks @jeff-dude. I could be wrong on this, but the issue I see with what you're describing is that I need the result of the query as a jinja variable (specifically, a list) that I can then loop over to create the "main" query. […]

@jeff-dude, do you know if the "no direct access to db" restriction also means that the dbt_utils.get_column_values macro cannot be run? From the source code it looks like it relies on a statement block (same as run_query), so I'm guessing not, but I thought I'd confirm, as I could make a separate spell as you suggested if dbt_utils.get_column_values works, and then point to that spell in the arguments for this macro. If that's not possible, then I think the only solution is going to be hardcoding the list of strategies and submitting a new PR each time that list changes.
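The fallback being asked about, assuming dbt_utils.get_column_values were usable, would look roughly like this. The helper-spell and column names are hypothetical, not from this PR:

```sql
{% macro yield_yak_balances(blockchain) %}
    {# dbt_utils.get_column_values also queries the warehouse at compile time
       (via a statement block, like run_query), so it would hit the same
       "no direct access to db" restriction discussed above.
       The ref() below points at a hypothetical helper spell. #}
    {% set strategy_addresses = dbt_utils.get_column_values(
        table=ref('yield_yak_avalanche_strategies'),
        column='strategy_address'
    ) %}

    {% for address in strategy_addresses %}
        select * from {{ blockchain }}.some_strategy_table  -- hypothetical table
        where contract_address = {{ address }}
        {% if not loop.last %} union all {% endif %}
    {% endfor %}
{% endmacro %}
```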

@dune-eng: Workflow run id 8649689954 approved.

@dune-eng: Workflow run id 8649690036 approved.

@dune-eng: Workflow run id 8649698543 approved.

@dune-eng: Workflow run id 8649698631 approved.

@dune-eng: Workflow run id 8649927884 approved.

@dune-eng: Workflow run id 8649928099 approved.

@dune-eng: Workflow run id 8650272962 approved.

@dune-eng: Workflow run id 8650272744 approved.

@yy-analytics (Contributor, Author)

Given the "QUERY HAS TOO MANY STAGES" error that's coming up now, I'm going to try breaking the balances model/macro into separate Deposits, Withdraws and Reinvests models/macros, which can then hopefully be combined to produce the balances model/macro without this error.
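Structurally, the split described above would make balances a thin model over separately materialized intermediate spells, so no single query plan carries all the stages. A rough illustration, with hypothetical model names (the actual PR also includes a reinvests model, omitted here for brevity):

```sql
-- illustrative only: a balances spell combining the split models
with flows as (
    select block_time, strategy_address, depositor, amount
    from {{ ref('yield_yak_avalanche_deposits') }}       -- hypothetical model name

    union all

    select block_time, strategy_address, depositor, -amount
    from {{ ref('yield_yak_avalanche_withdraws') }}      -- hypothetical model name
)

select
    strategy_address
    , depositor
    , sum(amount) as balance
from flows
group by 1, 2
```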

@dune-eng: Workflow run id 8691125503 approved.

@dune-eng: Workflow run id 8691125823 approved.

@dune-eng: Workflow run id 8783217293 approved.

@dune-eng: Workflow run id 8783217639 approved.

@dune-eng: Workflow run id 8784212730 approved.

@dune-eng: Workflow run id 8784212812 approved.

@yy-analytics yy-analytics requested a review from Hosuke April 22, 2024 12:57
@jeff-dude jeff-dude self-assigned this Apr 22, 2024
@jeff-dude jeff-dude added in review Assignee is currently reviewing the PR and removed ready-for-review this PR development is complete, please review labels Apr 23, 2024
@jeff-dude (Member) left a comment

fantastic code here, thank you for applying all the best practices (and even teaching me some more)!

left a few thoughts below prior to finalizing and merging

Comment on lines +60 to +61

{{ source(blockchain, 'transactions') }} t
ON t.hash = c.tx_hash

@jeff-dude (Member)

even if we can't apply an incremental filter, we should be able to enhance the join.

can you add block_number to the join conditions?

transactions is also partitioned on block_date -- we could likely add a condition for block_date = cast(date_trunc('day', block_time) as date) (assuming block_date doesn't exist in the decoded tables; if it does, then simply join on block_date)
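Putting the review suggestions together, the enhanced join would look roughly like this. The aliases follow the snippet above, but the decoded-events side and its columns are assumed, not taken from the PR:

```sql
select
    c.tx_hash
    , c.amount
    , t."from" as tx_sender
from decoded_events c                       -- placeholder for the decoded-events CTE/table
inner join {{ source(blockchain, 'transactions') }} t
    on t.hash = c.tx_hash
    and t.block_number = c.block_number     -- extra equi-join key suggested above
    -- prune partitions: transactions is partitioned on block_date
    and t.block_date = cast(date_trunc('day', c.block_time) as date)
```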

Comment on lines +60 to +61

{{ source(blockchain, 'transactions') }} t
ON t.hash = c.tx_hash

@jeff-dude (Member)

also, can it be an inner join? that's more efficient on the dunesql engine.

if an inner join isn't possible, it's usually best to first read from the larger of the two tables, then left/right join as needed to the smaller tables

@jeff-dude (Member)

also, there is some complex incremental logic here. are you aware of the output tables built via the CI pipelines attached, so you can query them on dune and test the data to ensure everything looks good prior to merge?

@yy-analytics (Contributor, Author)

> are you aware of the output tables built via CI pipelines attached, so you can query them on dune and test the data and ensure all looks good prior to merge?

@jeff-dude, no I'm not sure about this part. I know about the tables generated by dbt compile and have been testing those to make sure the output looks correct, but as far as I can tell, that's for the "initial model". Where do I find the attached CI pipelines/tables to check that the incremental logic is working correctly?

@dune-eng: Workflow run id 8812336139 approved.

@dune-eng: Workflow run id 8812336260 approved.

@jeff-dude (Member)

> Where do I find the attached CI pipelines/tables to check the incremental logic is working correctly?

yep, dbt compile is useful for testing the query in general for full-history or initial runs. however, when you introduce complex incremental logic in spellbook, sometimes that can bring bugs into subsequent runs. in order to test the data quality post-incremental run, we open up the CI test tables (from the attached gh actions on PRs) to be queried on dune for ~24 hours before they are deleted. please note that tables are dropped once a day at a set time, not 24 hours after the build, so a rerun of the gh action may be needed if bad timing happens and tables are dropped before you finish your testing.

check out these docs, hopefully they help (if not, let me know, we can modify those docs as needed):
https://github.com/duneanalytics/spellbook/blob/main/docs/ci_test/ci_test_overview.md

@jeff-dude jeff-dude added ready-for-merging and removed in review Assignee is currently reviewing the PR labels Apr 24, 2024
@jeff-dude (Member)

i'll let this sit for a little bit, if you wanted to query those CI output tables for data quality. let me know when ready to merge 🤝

@yy-analytics (Contributor, Author)

> we open up the CI test tables (from attached gh actions on PRs) to be queried on dune for ~24 hours before being deleted. […]

I think the 24 hr window has passed. To trigger the actions again I was just going to "Update branch", but it's been showing "Checking for ability to merge automatically..." for a while now (see screenshot below). Is there any other way to rerun the gh action?

[screenshot]

@jeff-dude (Member)

> I think the 24 hr window passed. […] Is there any other way to rerun the gh action?

i was able to merge in main. i think there is a universal gh issue right now, so it's been slow today. other ways to do it:

  • on your local setup, merge main from spellbook into your main branch, then push the changes via git locally (assuming there are new changes on main)
  • if there are no new changes, you can also go into the action itself and manually rerun it

[screenshot]

  • final approach: push a dummy change to your branch so a commit is made. each commit will rerun the actions.

@yy-analytics (Contributor, Author)

> i was able to merge in main. […] other ways to do it: merge main locally and push, manually rerun the action, or push a dummy commit.

Thanks for the tips! I've been able to review the CI tables and I'm happy with how it all looks, so am happy for you to merge when ready 🤝

@jeff-dude jeff-dude merged commit 236d382 into duneanalytics:main Apr 25, 2024
3 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Apr 25, 2024