Skip to content
This repository has been archived by the owner on Nov 21, 2023. It is now read-only.

Runtime for Pipelines #11

Open
rabernat opened this issue Oct 12, 2020 · 1 comment
Open

Runtime for Pipelines #11

rabernat opened this issue Oct 12, 2020 · 1 comment

Comments

@rabernat
Copy link
Collaborator

rabernat commented Oct 12, 2020

Right now, the pipelines are very coupled to their "runtime," by which I mean environment context that varies between location.

Example: https://github.com/pangeo-forge/terraclimate-feedstock/blob/master/recipe/pipeline.py

We want to refactor this so that the runtime is specified separately from the pipeline.

Things attached to the runtime:

  • Prefect stuff:
    • execution environment (e.g. DaskKubernetesEnvironment, recipe/job.yaml, recipe/worker_pod.yaml)
    • flow storage (e.g. Docker)
    • flow registration (everything in the __main__)
  • Data storage stuff:
    • cache location
  • Credentials
    • docker
    • cloud storage
    • prefect token

What am I missing?

@rabernat rabernat changed the title Runtime for Piplines Runtime for Pipelines Oct 12, 2020
@rabernat
Copy link
Collaborator Author

Make we want to make the runtime a cli, like conda smithy.

pangeo smithy run pipeline.py

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant