
Cloud computing guide #21

Closed
3 tasks done
wlandau opened this issue Aug 20, 2020 · 8 comments
@wlandau
Member

wlandau commented Aug 20, 2020

Prework

  • I understand and agree to this repository's code of conduct.
  • I understand and agree to this repository's contributing guidelines.
  • New features take time and effort to create, and they take even more effort to maintain. So if the purpose of the feature is to resolve a struggle you are encountering personally, please consider first posting a "trouble" or "other" issue so we can discuss your use case and search for existing solutions first.

Proposal

There is a community misconception that targets (and drake) do not have HPC capabilities beyond parallel computing over the cores of a single local machine. On the contrary, both tools support distributed computing on clusters (guides here and here), and the workers do not necessarily need access to the file system of the master process. (In fact, I designed targets with an efficient dynamic branching model to go beyond the inherent limitations of map-reduce-like scheduling algorithms and conserve computing resources.)

However, I do realize that data scientists at smaller institutions do not always have access to clusters, and an increasing number of folks use AWS. AWS ParallelCluster could be a way to deploy pipelines to the cloud without any need to modify targets itself. If it works, we should probably write a tutorial, either in the existing HPC chapter or in a chapter of its own.

ropensci/tarchetypes#8 could be an alternative way to deploy to AWS. The advantage of ropensci/tarchetypes#8 is that we should also get the data versioning capabilities of Metaflow for free, and Metaflow may take care of a lot of the AWS setup. However, each new tar_metaflow() will require its own local R worker in order to avoid blocking the master process, which is not ideal.

@wlandau
Member Author

wlandau commented Aug 31, 2020

The biggest potential difference I see here (relative to Metaflow's approach to AWS) is that the targets data store will probably live locally. But that's not so bad: drake users want this behavior anyway so they can explore data interactively (example: ropensci/drake#1295). And as @noamross pointed out, aws.s3::s3sync() can upload the data store to an S3 bucket. _targets/ is super light relative to .drake/, so this shouldn't be too painful for most projects.
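For concreteness, here is a minimal sketch of the s3sync() idea: run the pipeline locally, then mirror the data store up to a bucket. The bucket name is a hypothetical placeholder, and this assumes AWS credentials are already configured in the environment.

```r
# Sketch: push the local targets data store to S3 after a pipeline run.
# "my-pipeline-bucket" is hypothetical; credentials come from the usual
# AWS environment variables or config files.
library(aws.s3)
library(targets)

tar_make()                        # run the pipeline; results land in _targets/

s3sync(
  path = "_targets",              # the local data store
  bucket = "my-pipeline-bucket",
  direction = "upload"            # mirror local files up to the bucket
)
```

A matching `direction = "download"` call on another machine would pull the store back down for interactive exploration.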

@wlandau
Member Author

wlandau commented Sep 4, 2020

If cloudyr packages still work, tar_make_future() can probably already talk to multiple AWS instances: https://gist.github.com/DavisVaughan/5aac4a2757c0947a499d25d28a8ca89b. But the data will still live locally.
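Roughly, the approach in that gist amounts to pointing a future plan at remote EC2 workers over SSH and letting tar_make_future() farm targets out to them. A hedged sketch, where the instance IPs, user, and key path are all hypothetical placeholders:

```r
# Sketch: run tar_make_future() against remote EC2 instances via a
# PSOCK cluster. IPs, user, and key path are hypothetical; the
# instances need R and the pipeline's packages installed.
library(future)
library(targets)

plan(cluster, workers = makeClusterPSOCK(
  c("34.201.0.10", "34.201.0.11"),        # public IPs of EC2 workers
  user = "ubuntu",
  rshopts = c("-i", "~/.ssh/my-key.pem")  # SSH identity file
))

tar_make_future(workers = 2)  # one future per outdated target, up to 2 at a time
```

As noted above, the data store still lives on the machine that launches the pipeline; only the computation moves to the cloud.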

@wlandau
Member Author

wlandau commented Sep 4, 2020

@MilesMcBain, your team uses AWS, right? What's your preferred way to interact with it?

@wlandau
Member Author

wlandau commented Sep 6, 2020

I read up more on AWS ParallelCluster, AWS Batch, and Metaflow's HPC, and targets' capabilities are not ready for a cloud computing guide yet. But I think development on top of paws can get us much of the way there.

@wlandau wlandau closed this as completed Sep 6, 2020
@MilesMcBain

Hey @wlandau, so far we have preferred to call the AWS CLI directly. This is mainly due to a combination of very simple workflows and uncertainty about the stability of the AWS-R ecosystem.

@wlandau
Member Author

wlandau commented Sep 8, 2020

That's helpful, I know cloudyr has had a rough time. What do you think about paws? I'm hoping it can help with ropensci/targets#152 either directly or through futureverse/future#415 or mschubert/clustermq#102 (comment).

@wlandau
Member Author

wlandau commented Sep 28, 2020

Reopening. I plan to write about ropensci/targets#176 at least.

I will keep my eye on R + cloud packages. It looks like aws.s3 was updated in May, which is a good sign. And paws is under constant development but isn't quite there yet for the new S3 feature set in targets.

@wlandau-lilly
Collaborator

Just wrote about S3 integration in the new cloud chapter. Will reopen after ropensci/targets#152.
