
Kfctl go E2E test should deploy kubeflow #2705

Merged
merged 1 commit into kubeflow:master
Mar 16, 2019

Conversation

@jlewi (Contributor) commented Mar 15, 2019

E2E test for kfctl should deploy kubeflow with basic auth

  • The E2E test attempts to call delete but delete isn't working.
  • kfctl delete fails because there is no kubeconfig file created.
  • The delete step is marked as an expected failure.

Improve logging in the kfctl Go binary

  • Log the name and status of GCP DM operations
  • Print out DM creation errors so users see things like quota issues.

Fix #2706 - The coordinator commands should only invoke apply/generate/delete for the resources specified.

  • There was a bug in the switch statements: we were always calling
    generate/apply/delete for platform & k8s and not respecting the parameter.

Attempt to fix #2706

  • We want to eliminate the need to talk to a K8s server when calling ks init

  • Hardcode the server address to 127.0.0.1

  • Hardcode K8s version to 1.11.7

  • The communication with the K8s server was coming because we were trying
    to talk to the K8s master to get the K8s version but we don't need to
    do that if we specify it.

  • Add filename log hook to kfctl so that we can emit the filename and line
    number where errors occur.

  • On init we don't need to call GetConfig, which tries to read the kubeconfig file on generate.

  • Add retries to generate and apply because, running under Argo, kfctl seems
    to randomly exit.

Related to #2610 E2E test for kfctl go binary


This change is Reviewable

@@ -487,13 +487,15 @@ func (ksApp *ksApp) Init(resources kftypes.ResourceEnum, options map[string]inte
func (ksApp *ksApp) initKs(config *rest.Config) error {
newRoot := path.Join(ksApp.KsApp.Spec.AppDir, ksApp.KsName)
ksApp.KsEnvName = kstypes.KsEnvName
k8sSpec := kftypes.GetServerVersion(kftypes.GetClientset(config))
// We hard code the K8s spec because we won't have a cluster to talk to when calling init.
Contributor:

If we add serverVersion to the ClientSpec then it could be set by the platform, and the package managers would use this value rather than attempting to contact the server by using credentials.

Similarly if the platform sets client credentials in the ClientSpec then the package managers would use this rather than $HOME/.kube/config's context.

This would align with how the platform updates other values in ClientSpec like components, componentParameters and the package managers use this updated ClientSpec.

I had discussed with @gabrielwen at the summit about this approach for fetching credentials.
I have the PR for just using the ClientSpec about ready. (#2691)

Contributor Author:

Only ksonnet needs to know the K8s version and that's so it can generate an appropriate k8s.libsonnet file from the corresponding swagger spec for the K8s API.

I think we can just hard code it.

  • Only ksonnet needs to know the K8s API version; kustomize doesn't.
  • ksonnet will likely be removed before the K8s API version is an issue.
  • I suspect ksonnet might even generate a new k8s lib file when a new environment is added (which we do later).

Similarly if the platform sets client credentials in the ClientSpec then the package managers would use this rather than $HOME/.kube/config's context.

Why would client credentials need to be set in ClientSpec? How is that different from using a .kubeconfig file?

Contributor:

Different platforms will set different credentials. If the package managers use the credentials set by the platform then we avoid package managers using .kube/config when they should be using the platform credentials.

@jlewi (Contributor Author) commented Mar 15, 2019

When running kfctl under Argo as part of the test, I'm seeing random failures where it just appears to exit
on generate or apply. It doesn't seem like an error occurred or that it was preempted.

If we look at the logs, the last line just indicates it was waiting for the deployment operation.

2019-03-15T14:32:04.473205048Z  main            util.py                     71 INFO     time="2019-03-15T14:31:40Z" level=warning msg="Deployment operation name: operation-1552660191586-58422df2d2a4e-6bf2031f-027c70b5 status: RUNNING" filename="gcp/gcp.go:331"

jlewi-kfctl-test-2705-0315-072101-3799861166.log.txt

jlewi-kfctl-test-2705-0315-072101-3799861166.event.log.txt

@jlewi (Contributor Author) commented Mar 15, 2019

In the most recent run kfctl delete all failed because there is no kubeconfig file.

time="2019-03-15T17:48:33Z" level=fatal msg="could not open /mnt/test-data-volume/kubeflow-presubmit-kfctl-go-2705-05adf28-6330-dfdf/kfctl_test/.kube/kubeconfig Error stat /mnt/test-data-volume/kubeflow-presubmit-kfctl-go-2705-05adf28-6330-dfdf/kfctl_test/.kube/kubeconfig: no such file or directory" filename="apps/group.go:180"

@jlewi jlewi changed the title [WIP] Kfctl go E2E test should deploy kubeflow Kfctl go E2E test should deploy kubeflow Mar 15, 2019
@jlewi (Contributor Author) commented Mar 15, 2019

/assign @gabrielwen @kkasravi

@kkasravi (Contributor):

/lgtm

@kkasravi (Contributor):

/approve

1 similar comment
@jlewi (Contributor Author) commented Mar 16, 2019

/approve

@k8s-ci-robot (Contributor):

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jlewi, kkasravi

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot merged commit b0c20bb into kubeflow:master Mar 16, 2019
saffaalvi pushed a commit to StatCan/kubeflow that referenced this pull request Feb 11, 2021
saffaalvi pushed a commit to StatCan/kubeflow that referenced this pull request Feb 12, 2021
Development

Successfully merging this pull request may close these issues.

kfctl tries to call ks init even on ks generate platform
5 participants