feat:profiling:state summary and visualization #11012

ZenGround0 · 2023-06-28T15:31:12Z

Related Issues

Proposed Changes

One lotus-shed command that takes a non-running repo and generates a summary of data usage by protocol area
A Python script for visualizing the data

Additional Info

Testplan

I've done many manual runs on calibration and mainnet repos, and both the lotus-shed command and the Python script generate plausible output. I've cross-checked the output data and it appears legitimate: it matches the lotus chain store size and provides an explanation for pre-nv19 snapshot bloat. I think this is good enough for running a monitoring service. If we find problems, we can fix them as they show up.

Speed

While reviewing, please keep an eye out for optimization opportunities in the shed command. Today it takes about 5 hours to run, so a 20% improvement is an hour saved.

Halp

@jennijuju where should documentation for this go? I have the lotus-shed command help, but we probably want something a little more thorough. Is https://github.com/filecoin-project/lotus/discussions/categories/tutorials enough?

Extension ideas

I am hoping to tweak the Python script to take in chart type information so we can visualize the same data in a few different ways.

The state summary shed command is readily extensible because it uses a path identifier for protocol areas. We could extend this to break down message usage by message type, for example.
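As a rough illustration of the path-identifier idea, here is a minimal sketch of accumulating sizes under slash-separated paths so that each parent area includes its children. The type names, the path strings, and the message-type names are all hypothetical and are not taken from the shed command itself.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// Summary holds the running totals for one protocol-area path.
// Hypothetical type: the real command may track different fields.
type Summary struct {
	Blocks int
	Bytes  uint64
}

// PathStats maps a slash-separated path identifier to its totals.
type PathStats map[string]*Summary

// Record adds one block of the given size to path and to every ancestor
// of path, so "/messages" also counts "/messages/SomeMessageType".
func (ps PathStats) Record(path string, size uint64) {
	parts := strings.Split(strings.Trim(path, "/"), "/")
	for i := 1; i <= len(parts); i++ {
		key := "/" + strings.Join(parts[:i], "/")
		s, ok := ps[key]
		if !ok {
			s = &Summary{}
			ps[key] = s
		}
		s.Blocks++
		s.Bytes += size
	}
}

// Paths returns all recorded paths in sorted order for stable output.
func (ps PathStats) Paths() []string {
	keys := make([]string, 0, len(ps))
	for k := range ps {
		keys = append(keys, k)
	}
	sort.Strings(keys)
	return keys
}

func main() {
	stats := PathStats{}
	// Illustrative paths only; a per-message-type breakdown would just be
	// one more path segment under a hypothetical "/messages" area.
	stats.Record("/messages/PublishStorageDeals", 1200)
	stats.Record("/messages/SubmitWindowedPoSt", 800)
	stats.Record("/actors/miner/sectors", 4096)

	for _, p := range stats.Paths() {
		fmt.Printf("%s blocks=%d bytes=%d\n", p, stats[p].Blocks, stats[p].Bytes)
	}
}
```

With this shape, adding a new breakdown only requires choosing a new path for each block that is visited; the aggregation code stays unchanged.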

We could use a similar visualization on gas traces. In particular, cron gas traces would be useful to keep track of in the same way.

Checklist

Before you mark the PR ready for review, please make sure that:

Commits have a clear commit message. (they will after squash)
PR title is in the form of <PR type>: <area>: <change being made>
- example: fix: mempool: Introduce a cache for valid signatures
- PR type: fix, feat, build, chore, ci, docs, perf, refactor, revert, style, test
- area, e.g. api, chain, state, market, mempool, multisig, networking, paych, proving, sealing, wallet, deps
New features have usage guidelines and / or documentation updates in
- Lotus Documentation
- Discussion Tutorials
Tests exist for new functionality or change in behavior
CI is green

cmd/lotus-shed/state-stats.go

magik6k · 2023-06-29T10:08:23Z

cmd/lotus-shed/state-stats.go

+	closer func()
+}
+
+func loadChainStore(ctx context.Context, repoPath string) (*StoreHandle, error) {


Probably could have a mode where this loads a snapshot into a memory-mapped blockstore. It would need a decent amount of RAM, but should be much faster.

snissn · 2023-06-30T18:21:50Z

Having this in two branches is causing an issue on the deployment infra. I think this code is good to merge, and all of the comments above are good suggestions that we should open a second issue for.

ZenGround0 · 2023-07-03T15:34:38Z

I think this code is good to merge

yup, I just need a green check mark, can you approve @snissn ?

snissn

looks good to me, has been working and generated expected output for us!

arajasek

Sorry, I never submitted this (partial) review!

arajasek · 2023-07-05T15:42:12Z

cmd/lotus-shed/state-stats.go

+type StoreHandle struct {
+	bs     blockstore.Blockstore
+	cs     *store.ChainStore
+	sm     *stmgr.StateManager


Can we drop the StateManager here? Conceptually, there isn't any state management you're doing, just a lot of reading of the StateTree. I suspect you'll be able to easily replace calls to sm.StateTree or sm.LoadActor with equivalent operations on the StateTree itself.

Motivation here is that it tends to make things a little more future-proof, and drops a lot of unrelated logic that the StateManager has (migrations, drand, network versions, etc.)

arajasek · 2023-07-05T15:53:33Z

cmd/lotus-shed/state-stats.go

+		},
+		&cli.BoolFlag{
+			Name:  "pretty",
+			Usage: "print formated output instead of ldjson",


Suggested change

-			Usage: "print formated output instead of ldjson",
+			Usage: "print formatted output instead of ldjson",
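For context on the flag being discussed, here is a minimal sketch of switching between line-delimited JSON (one object per line) and a human-readable layout. The `Row` type, its fields, and the column widths are assumptions for illustration and do not reflect the command's actual output schema.

```go
package main

import (
	"encoding/json"
	"fmt"
	"io"
	"os"
	"strings"
)

// Row is a hypothetical summary record; field names are illustrative only.
type Row struct {
	Path   string `json:"path"`
	Blocks int    `json:"blocks"`
	Bytes  uint64 `json:"bytes"`
}

// WriteRows emits rows as line-delimited JSON by default, or as aligned
// text columns when pretty is true.
func WriteRows(w io.Writer, rows []Row, pretty bool) error {
	if pretty {
		for _, r := range rows {
			if _, err := fmt.Fprintf(w, "%-30s %10d blocks %14d bytes\n", r.Path, r.Blocks, r.Bytes); err != nil {
				return err
			}
		}
		return nil
	}
	// json.Encoder.Encode writes a newline after each value, which is
	// what makes the stream line-delimited.
	enc := json.NewEncoder(w)
	for _, r := range rows {
		if err := enc.Encode(r); err != nil {
			return err
		}
	}
	return nil
}

// RenderRows is a small helper that returns the output as a string.
func RenderRows(rows []Row, pretty bool) (string, error) {
	var sb strings.Builder
	err := WriteRows(&sb, rows, pretty)
	return sb.String(), err
}

func main() {
	rows := []Row{
		{Path: "/actors/miner", Blocks: 3, Bytes: 4096},
		{Path: "/messages", Blocks: 2, Bytes: 2000},
	}
	if err := WriteRows(os.Stdout, rows, false); err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
}
```

Line-delimited JSON is convenient for a downstream plotting script because each line can be parsed independently, while the pretty form is meant for reading in a terminal.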

ZenGround0 added 8 commits June 25, 2023 15:55

WIP

d599833

Output is buggy but halfway there

36a88f4

Churn json output is working

8f3123d

Touch up pathing

4aa977f

Tweak path stuff

77ea7ef

Refactor for cleanup + measure top level HAMT churn

0cfdc9b

Cleanup

3897bf1

More cleanup

016661b

ZenGround0 requested a review from a team as a code owner June 28, 2023 15:31

ZenGround0 added 3 commits June 28, 2023 09:52

Lint fixes

3cacbdf

Remove debug pprof serving

d2b2fba

Lint

ab72f2e

magik6k reviewed Jun 29, 2023


Cleanup plotting script

3f0ddcc

snissn approved these changes Jul 6, 2023


ZenGround0 merged commit 1358d70 into master Jul 6, 2023

ZenGround0 deleted the feat/stat-snapshot branch July 6, 2023 20:28

arajasek reviewed Jul 10, 2023


ZenGround0 mentioned this pull request Jul 31, 2023

Filecoin state profiling #10884

Closed
