Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

S-D26 Machine Learning Support (continuous) Y4M1-12 #332

Open
12 tasks
KZzizzle opened this issue Oct 5, 2020 · 0 comments
Open
12 tasks

S-D26 Machine Learning Support (continuous) Y4M1-12 #332

KZzizzle opened this issue Oct 5, 2020 · 0 comments
Labels
PO issue Created by Product owners

Comments

@KZzizzle
Copy link
Contributor

KZzizzle commented Oct 5, 2020

Dockerized JupyterLabservice with integrated machine learning packages

please, synchronize closely with alessandro and taylor

Items that are already part of ongoing work:

  • Resource allocation / scheduler.
  • Access to filesystem mount.
  • API for running (AI) jobs in the cloud.
  • Execution of AI python scripts as "computational services" on data-lake.

Items that are ML-specific and/or new:

  • Re-attaching to running services / ability to keep dynamic service alive
  • Tensorflow/pytorch-specific solver with GPU capabilities.
  • Deployment of tensorflow/pytorch-enabled jupyterlab services.
  • Find out whether we need to add a monitor entrypoint to API for interfacing with Tensorboard.
  • Creation of a specialized ML computational runner that takes a trained network and data as inputs and gives predictions as output. May or may not require GPU acceleration.
  • Creation of example ML jupyter notebooks with skeleton scripts and explanations.
  • Support for connections to external databases. May require credentials.
  • Capability to publish shareable/reusable models.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
PO issue Created by Product owners
Projects
None yet
Development

No branches or pull requests

1 participant