Add docs for KIND deployment #266

ckadner · 2021-11-28T23:07:46Z

Add first draft document for deploying MLX on KIND (Kubernetes in Docker).

https://github.com/ckadner/mlx/blob/add_kind_deployment_doc/docs/install-mlx-on-kind.md

Deploy MLX on KIND

Kubernetes in Docker (KIND) provides an easy way to deploy MLX locally including
Kubeflow Pipelines which makes it possible to run generated sample pipelines for
any of the registered MLX assets.

Installation

Docker

Homebrew (on macOS)

Kustomize

kubectl

KIND

Install Required CLIs (macOS)

After installing Docker and Homebrew (linked above) you can install the kind,
kustomize, and kubectl CLIs with brew install. For Windows and Linux follow
the respective home pages for installation instructions.
brew install kind
kind --version

brew install kubectl
kubectl version --client

brew install kustomize
kustomize version
Note: We successfully tested this KIND deployment with the latest version of kustomize v4.4.0.
However, there have been issues in the past with versions later then v3.2.0. To be on the safe side
you could download the kustomize v3.2.0 binary as described
here

Docker Resources

Increase the default resources for Docker:

CPUs: 4 Cores

Memory: 8 GB RAM

Disk: 32+ GB

Note: We found that on older laptops, like a 2016 15 in MacBook Pro (2.7 GHz i7, 16 GB) the MLX
deployment on KIND may require to give all available resources to the Docker daemon in order to be
able to deploy the manifests and run basic pipelines. Even then, trying to run notebooks or deploying
a model, will cause the laptop to get very slow with fans running full throttle. It may even cause
other application to crash.

Create KIND Cluster
kind create cluster --name mlx
kubectl cluster-info --context kind-mlx
kubectl get pods --all-namespaces
Deploy MLX (Single-User)
git clone https://github.com/IBM/manifests -b v1.4.0-mlx
cd manifests

# run the below command two times if the CRDs take too long to provision.
while ! kustomize build mlx-single-kind | \
  kubectl apply -f -; do echo "Retrying to apply resources"; sleep 10; done

# check pod status
kubectl get pods --all-namespaces

# make the MLX UI available to your local browser on http://localhost:3000/
kubectl port-forward -n istio-system svc/istio-ingressgateway 3000:80
Now paste the URL http://localhost:3000/ into your browser and proceed to
import the MLX catalog.

Delete the mlx cluster when it is no longer needed:
kind delete cluster --name mlx
Install Kubeflow Pipelines (for reference, optional)
kind create cluster --name kfp
kubectl cluster-info --context kind-kfp

# env/platform-agnostic-pns hasn't been publically released, so you will install it from master
export PIPELINE_VERSION=1.7.1
kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/cluster-scoped-resources?ref=$PIPELINE_VERSION"
kubectl wait --for condition=established --timeout=60s crd/applications.app.k8s.io
kubectl apply -k "github.com/kubeflow/pipelines/manifests/kustomize/env/platform-agnostic-pns?ref=$PIPELINE_VERSION"

kubectl get pods --all-namespaces

# make the Kubeflow Pipelines UI available on http://localhost:8080/#/pipelines
kubectl port-forward -n kubeflow svc/ml-pipeline-ui 8080:80

kind delete cluster --name kfp

@Tomcli @yhwang

Resolves machine-learning-exchange#73 Signed-off-by: Christian Kadner <[email protected]>

mlx-bot · 2021-11-28T23:07:51Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ckadner

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [ckadner]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Tomcli · 2021-11-29T22:25:17Z

I have the 2016 15 inch macbook pro. Here are my spec
2.7 GHz Quad-Core Intel Core i7
16 GB 2133 MHz LPDDR3

By giving all my resources to the Docker daemon, I'm able to deploy the manifests and run basic pipelines. However, if I try to run notebooks or after the model is deployed, my laptop will get very slow and loud to a point that start to crash my apps.

ckadner · 2021-11-29T22:58:58Z

Thanks @Tomcli @yhwang -- I updated the doc with your comments

Signed-off-by: Christian Kadner <[email protected]>

Tomcli · 2021-11-29T23:47:35Z

docs/install-mlx-on-kind.md

+brew install kubectl
+kubectl version --client
+
+brew install kustomize


We need kustomize 3.2 for this manifest. Can you point to the kustomize instruction like the one on ibm kubeflow?
https://www.kubeflow.org/docs/distributions/ibm/deploy/deployment-process/#kubeflow-installation

I added this para:

Note: We successfully tested this KIND deployment with the latest version of kustomize v4.4.0.
However, there have been issues in the past with versions later then v3.2.0. To be on the safe side
you could download the kustomize v3.2.0 binary as described
here

Signed-off-by: Christian Kadner <[email protected]>

ckadner · 2021-11-30T20:45:35Z

docs/install-mlx-on-kind.md

@@ -67,7 +67,7 @@ git clone https://github.com/IBM/manifests -b v1.4.0-mlx
 cd manifests

 # run the below command two times if the CRDs take too long to provision.
-while ! kustomize build mlx-single | \
+while ! kustomize build mlx-single-kind | \


Thanks @yhwang and @Tomcli for the manifest updates

ckadner · 2021-12-02T17:56:09Z

@yhwang @Tomcli -- I think the install-mlx-on-kind.md is ready to be merged, unless you have further change requests.

We can improve it in follow up PRs, like coming up with a better alternative to the port-forwarding:

Instead of:

kubectl port-forward -n istio-system svc/istio-ingressgateway 3000:80

Use a kind cluster configuration like this

apiVersion: kind.x-k8s.io/v1alpha4
kind: Cluster
nodes:
- role: control-plane
  extraPortMappings:
  - containerPort: 3000
    hostPort: 80
- role: worker

ckadner · 2021-12-02T20:01:06Z

@phu since you got the most powerful laptop in the team now, would you mind giving this KIND deployment a spin and let us know your feedback? Deployment takes about 20 minutes on older machines. I am curious what deployment times you'll get :-) Thanks!

Tomcli · 2021-12-02T20:33:18Z

/lgtm

* Add link to KIND deployment doc * Update the script that checks for broken links * Fix broken links in usage steps and API README * Update the resource requirements for KIND * Add a deployment wait-loop for KIND * Update the import assets documentation Related machine-learning-exchange#266 Signed-off-by: Christian Kadner <[email protected]>

* Add link to KIND deployment doc * Update the script that checks for broken links * Fix broken links in usage steps and API README * Update the resource requirements for KIND * Add a deployment wait-loop for KIND * Update the import assets documentation Related #266 Signed-off-by: Christian Kadner <[email protected]>

Add docs for KIND deployment

c27edc3

Resolves machine-learning-exchange#73 Signed-off-by: Christian Kadner <[email protected]>

ckadner requested review from yhwang and Tomcli November 28, 2021 23:07

ckadner self-assigned this Nov 28, 2021

mlx-bot added the do-not-merge/work-in-progress label Nov 28, 2021

mlx-bot requested a review from animeshsingh November 28, 2021 23:07

mlx-bot added the approved label Nov 28, 2021

ckadner linked an issue Nov 28, 2021 that may be closed by this pull request

Create KIND deployment option for MLX including KFP #73

Closed

Update after review comments

25f4de2

Signed-off-by: Christian Kadner <[email protected]>

ckadner force-pushed the add_kind_deployment_doc branch from 053ca5b to 25f4de2 Compare November 29, 2021 23:01

Tomcli reviewed Nov 29, 2021

View reviewed changes

ckadner added 2 commits November 29, 2021 17:11

Update kustomize install instructions

38d4c14

Signed-off-by: Christian Kadner <[email protected]>

Use mlx-single-kind deployment

5abfaaa

Signed-off-by: Christian Kadner <[email protected]>

ckadner commented Nov 30, 2021

View reviewed changes

ckadner changed the title ~~[WIP] Add docs for KIND deployment~~ Add docs for KIND deployment Dec 2, 2021

mlx-bot removed the do-not-merge/work-in-progress label Dec 2, 2021

ckadner requested a review from Tomcli December 2, 2021 19:52

mlx-bot assigned Tomcli Dec 2, 2021

mlx-bot added the lgtm label Dec 2, 2021

mlx-bot merged commit 680e34b into machine-learning-exchange:main Dec 2, 2021

ckadner mentioned this pull request Dec 5, 2021

Link KIND Deployment Option on Main README #283

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add docs for KIND deployment #266

Add docs for KIND deployment #266

ckadner commented Nov 28, 2021 •

edited

Loading

mlx-bot commented Nov 28, 2021

Tomcli commented Nov 29, 2021

ckadner commented Nov 29, 2021

Tomcli Nov 29, 2021

ckadner Nov 30, 2021

ckadner Nov 30, 2021

ckadner commented Dec 2, 2021

ckadner commented Dec 2, 2021

Tomcli commented Dec 2, 2021

Add docs for KIND deployment #266

Add docs for KIND deployment #266

Conversation

ckadner commented Nov 28, 2021 • edited Loading

Deploy MLX on KIND

Installation

Install Required CLIs (macOS)

Docker Resources

Create KIND Cluster

Deploy MLX (Single-User)

Install Kubeflow Pipelines (for reference, optional)

mlx-bot commented Nov 28, 2021

Tomcli commented Nov 29, 2021

ckadner commented Nov 29, 2021

Tomcli Nov 29, 2021

Choose a reason for hiding this comment

ckadner Nov 30, 2021

Choose a reason for hiding this comment

ckadner Nov 30, 2021

Choose a reason for hiding this comment

ckadner commented Dec 2, 2021

ckadner commented Dec 2, 2021

Tomcli commented Dec 2, 2021

ckadner commented Nov 28, 2021 •

edited

Loading