feat: run multidim-interop.yml on self-hosted runners #154

galargh · 2023-03-17T07:32:16Z

This PR revives the proposal from #92 (c53be4c).

Please note that the first build on self-hosted runners had to populate the cache https://github.com/libp2p/test-plans/actions/runs/4447196185/attempts/1. This suggests that some part of the machine config influences the cache keys. It might need some tweaks to make it interoperable with hosted GitHub Actions runners; or it might be a good idea to have another job which populates the cache for hosted GitHub Actions runners.

The subsequent runs were using cache as expected:

I did not try to increase worker count nor modify docker run parameters to accommodate the more powerful environment.

This is the self-hosted runner definition: https://github.com/pl-strflt/tf-aws-gh-runner/blob/main/runners.tf#L28
This is the packer config the AMI was built from: https://github.com/pl-strflt/tf-aws-gh-runner/blob/main/images/ubuntu-jammy/kubo.pkrvars.hcl

We can modify the runners as you wish of course.

.github/actions/run-interop-ping-test/action.yml

galargh · 2023-03-17T13:41:30Z

.github/actions/run-interop-ping-test/action.yml

+      env:
+        AWS_BUCKET: ${{ inputs.s3-cache-bucket }}
+        AWS_REGION: ${{ inputs.aws-region }}
+        AWS_ACCESS_KEY_ID: ${{ inputs.s3-access-key-id }}
+        AWS_SECRET_ACCESS_KEY: ${{ inputs.s3-secret-access-key }}


This way, we modify the env only for this step.

galargh · 2023-03-17T13:42:08Z

.github/actions/run-interop-ping-test/action.yml

+      with:
+        config: ${{ steps.buildkit.outputs.config }}


On standard, hosted GitHub Actions runners this will be an empty string. Which is the default.

thomaseizinger · 2023-03-17T14:46:27Z

Half an hour is still pretty slow. This job usually finishes in 15min on PRs for us? Is that because we filter?

Will this mean we will also be able to have a cache for the PR docker builds that use RUN --type=cache (forgot the exact syntax).

MarcoPolo · 2023-03-17T15:17:20Z

m5.large is 2 cpus and 8GB of ram. Isn't that roughly the same as the hosted runners? https://docs.github.com/en/actions/using-github-hosted-runners/about-github-hosted-runners#supported-runners-and-hardware-resources

galargh · 2023-03-20T08:45:29Z

I moved you to c5.4xlarge, but, ultimately, you can pick any instance type that suits your workload characteristic the best - https://instances.vantage.sh/.

I requested the job that runs on self-hosted (the one with cache) to use 16 workers here - 255238e

I also had to request the build part to remain sequential - 0626f93

The job - https://github.com/libp2p/test-plans/actions/runs/4465794780/jobs/7843205397?pr=154 - took 20 minutes to finish. It spent ~7 minutes building stuff and then ~13 minutes running the tests.

thomaseizinger · 2023-03-20T17:59:43Z

The job - libp2p/test-plans/actions/runs/4465794780/jobs/7843205397?pr=154 - took 20 minutes to finish. It spent ~7 minutes building stuff and then ~13 minutes running the tests.

That is an amazing improvement! With the filtering that we do in the pull-request workflows, that is likely to be even quicker!

thomaseizinger · 2023-03-24T14:11:46Z

Is anything blocking this from merging? I'd be great to improve the runtime of this job.

MarcoPolo · 2023-04-11T19:12:15Z

defer until after ipfs thing

thomaseizinger

Thanks for the work here! I'd suggest we split this into two PRs so we can land parts of it quicker and discuss if we can perhaps avoid introducing the "input".

.github/actions/run-interop-ping-test/action.yml

MarcoPolo · 2023-05-22T21:16:16Z

rebased, fyi

MarcoPolo · 2023-05-22T22:03:19Z

26 minutes on this branch versus 54 minutes on master. @galargh What's the cost to running this? It's nice it's twice as fast, but it's also currently free.

MarcoPolo

One open question around cost, but changes look good. Please request another review when the question has been answered.

MarcoPolo · 2023-06-13T20:50:01Z

@galargh Friendly ping on my open question :)

galargh · 2023-06-23T08:05:59Z

I totally missed the question about cost! Let me do a quick calculation for you.

The monthly cost of moving on with this should be around $38/month (see the AWS resource calculation - https://calculator.aws/#/estimate?id=9a5a201262a70d9f491015a5aa81a4aeaf1ae966). I based this on:

the current configuration of 4xlarge runner
66 libp2p multidimensional interop test workflow runs in the past 30 days
36 minutes run-multidim-interop job took to complete on this PR
~3GB of new data in the S3 bucket in the past 30 days

thomaseizinger · 2023-06-27T11:59:04Z

With the recently added implementations, the interop test are now by far our longest CI job at > 20 minutes: https://github.com/libp2p/rust-libp2p/actions/runs/5387398505/jobs/9778664726

What is the process around deciding whether the spend is worth it?

.github/workflows/multidim-interop.yml

Co-authored-by: Piotr Galar <[email protected]>

MarcoPolo · 2023-07-07T18:26:11Z

Will merge after CI passes. Thank you so much @galargh! ❤️

This reverts commit 7e868ea.

galargh force-pushed the feat/self-hosted-runners branch 3 times, most recently from a2cd76d to 0360dee Compare March 17, 2023 11:51

galargh commented Mar 17, 2023

View reviewed changes

.github/actions/run-interop-ping-test/action.yml Show resolved Hide resolved

galargh commented Mar 17, 2023

View reviewed changes

galargh requested a review from MarcoPolo March 17, 2023 13:42

galargh marked this pull request as ready for review March 17, 2023 13:42

thomaseizinger approved these changes Mar 20, 2023

View reviewed changes

thomaseizinger linked an issue Mar 29, 2023 that may be closed by this pull request

ci: better caching for interop-tests docker build libp2p/rust-libp2p#3481

Closed

galargh mentioned this pull request May 8, 2023

ci: run interop tests on bigger hardware libp2p/rust-libp2p#3861

Merged

4 tasks

thomaseizinger reviewed May 8, 2023

View reviewed changes

.github/actions/run-interop-ping-test/action.yml Show resolved Hide resolved

.github/actions/run-interop-ping-test/action.yml Show resolved Hide resolved

galargh and others added 7 commits May 22, 2023 14:07

feat: run multidim-interop.yml on self-hosted runners

2b45189

feat: make worker count configurable

0b1b648

feat: enable docker.io proxy on self-hosted runners

6ac2e6d

chore: provide s3 creds through env

e362d41

Try 4 workers

a4aaa50

ci: run-multidim-interop on 16 workers

797e97f

chore: change self hosted labels

bc94d5f

MarcoPolo force-pushed the feat/self-hosted-runners branch from f4b07dd to bc94d5f Compare May 22, 2023 21:16

MarcoPolo reviewed May 22, 2023

View reviewed changes

Merge branch 'master' into feat/self-hosted-runners

4a473e4

galargh requested a review from MarcoPolo June 23, 2023 08:06

galargh commented Jul 6, 2023

View reviewed changes

.github/workflows/multidim-interop.yml Show resolved Hide resolved

Update .github/workflows/multidim-interop.yml

7e868ea

Co-authored-by: Piotr Galar <[email protected]>

MarcoPolo approved these changes Jul 7, 2023

View reviewed changes

Use self-hosted runners for now

8659beb

This reverts commit 7e868ea.

MarcoPolo merged commit 35ab35b into master Jul 7, 2023

MarcoPolo deleted the feat/self-hosted-runners branch July 7, 2023 22:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: run multidim-interop.yml on self-hosted runners #154

feat: run multidim-interop.yml on self-hosted runners #154

galargh commented Mar 17, 2023 •

edited

Loading

galargh Mar 17, 2023

galargh Mar 17, 2023

thomaseizinger commented Mar 17, 2023

MarcoPolo commented Mar 17, 2023

galargh commented Mar 20, 2023

thomaseizinger commented Mar 20, 2023

thomaseizinger commented Mar 24, 2023

MarcoPolo commented Apr 11, 2023

thomaseizinger left a comment

MarcoPolo commented May 22, 2023

MarcoPolo commented May 22, 2023

MarcoPolo left a comment

MarcoPolo commented Jun 13, 2023

galargh commented Jun 23, 2023

thomaseizinger commented Jun 27, 2023

MarcoPolo commented Jul 7, 2023

feat: run multidim-interop.yml on self-hosted runners #154

feat: run multidim-interop.yml on self-hosted runners #154

Conversation

galargh commented Mar 17, 2023 • edited Loading

galargh Mar 17, 2023

Choose a reason for hiding this comment

galargh Mar 17, 2023

Choose a reason for hiding this comment

thomaseizinger commented Mar 17, 2023

MarcoPolo commented Mar 17, 2023

galargh commented Mar 20, 2023

thomaseizinger commented Mar 20, 2023

thomaseizinger commented Mar 24, 2023

MarcoPolo commented Apr 11, 2023

thomaseizinger left a comment

Choose a reason for hiding this comment

MarcoPolo commented May 22, 2023

MarcoPolo commented May 22, 2023

MarcoPolo left a comment

Choose a reason for hiding this comment

MarcoPolo commented Jun 13, 2023

galargh commented Jun 23, 2023

thomaseizinger commented Jun 27, 2023

MarcoPolo commented Jul 7, 2023

galargh commented Mar 17, 2023 •

edited

Loading