-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use KFP lite deployment for presubmit tests #1808
Conversation
…e run from a different project
lightweight deployment.
WIP, I want to see test results on kubeflow-pipeline-e2e-test here first. |
Verified tests can run successfully in my own gcp project. Going to debug permissions issues with prow run. |
Briefly introducing new workflow of presubmit tests to give a better picture of what additional permissions are needed.
Step 4 needs to know default service account which I got by
Alternatives:
Thoughts? |
@Bobgy The workflow looks good. For step 4, you might run into the issue that tests fail to download token because there are limit of <10 token to be downloaded. |
|
||
# when we reuse a cluster when debugging, clean up its kfp installation first | ||
# this does nothing with a new cluster | ||
kubectl delete namespace ${NAMESPACE} --wait || echo "No need to delete ${NAMESPACE} namespace. It doesn't exist." |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we create a new namespace with random value suffix, and delete the namespace at end of test.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure which intention you are having.
If you are thinking about running multiple tests in the same cluster, yes we can do this, but I worry conflicting CRD versions might be a problem.
If you are commenting for readability/structure, I didn't delete this namespace at end of test because there are two scenarios for the script:
- when it runs in prow, it creates a new cluster each time, and the cluster will be cleaned up when the test finishes
- when it runs in a developer's project, it can be configured to keep using the same cluster. Leaving namespace uncleaned enable the developer to debug what failed when running the test.
So I think this setup suits both scenarios.
Good point, I was also thinking about this. I worried there might be security issues, but this is a test project with no real data at all. I think there won't be risks storing the key, right? |
…nto e2e_on_kfp_light
@IronPan I figured out the permission issue. Some of my commands were added earlier than |
@IronPan for service account key, I stored it in gcs because gms seemed too complex to setup for me. For test cluster service account, I think gcs is good enough (I changed the bucket to be only readable from owners + editors). |
ecf2512
to
f10d247
Compare
@IronPan tests are passing and ready to review now. |
/test kubeflow-pipeline-sample-test |
/test kubeflow-pipeline-sample-test |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: IronPan The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Great work! Thank you @Bobgy |
commit 2592307 Author: Christian Clauss <[email protected]> Date: Wed Aug 21 04:15:19 2019 +0200 Undefined name 'e' in openvino (kubeflow#1876) Discovered in kubeflow#1721 ``` ./contrib/components/openvino/ovms-deployer/containers/evaluate.py:62:16: F821 undefined name 'e' except e: ^ ./contrib/components/openvino/ovms-deployer/containers/evaluate.py:63:50: F821 undefined name 'e' print("Can not read the image file", e) ^ ``` Your review please @Ark-kun commit 7b442f4 Author: Kirin Patel <[email protected]> Date: Tue Aug 20 18:27:19 2019 -0700 Created extensible code editor based on react-ace (kubeflow#1855) * Created extensible code editor based on react-ace * Installed dependencies * Updated unit tests for Editor.tsx to test placeholder and value in simplified manner * Updated Editor unit tests to use snapshot testing where applicable commit d11fae7 Author: Yuan (Bob) Gong <[email protected]> Date: Wed Aug 21 08:25:20 2019 +0800 Use KFP lite deployment for presubmit tests (kubeflow#1808) * Refactor presubmit-tests-with-pipeline-deployment.sh so that it can be run from a different project * Simplify getting service account from cluster. * Migrate presubmit-tests-with-pipeline-deployment.sh to use kfp lightweight deployment. * Add option to cache built images to make debugging faster. * Fix cluster set up * Copy image builder image instead of granting permission * Add missed yes command * fix stuff * Let other usages of image-builder image become configurable * let test workflow use image builder image * Fix permission issue * Hide irrelevant error logs * Use shared service account key instead * Move test manifest to test folder * Move build-images.sh to a different script file * Update README.md * add cluster info dump * Use the same cluster resources as kubeflow deployment * Remove cluster info dump * Add timing to test log * cleaned up code * fix tests * address cr comments * Address cr comments * Enable image caching to improve retest speed commit 0864faf Author: IronPan <[email protected]> Date: Tue Aug 20 14:09:19 2019 -0700 Use single part as default (kubeflow#1893) The data stored in artifact storage are usually small. Using multi-part is not strictly a requirement. Change the default to true to better support more platform out of box. commit 6284dc1 Author: Christian Clauss <[email protected]> Date: Tue Aug 20 22:01:18 2019 +0200 IBM Watson samples: from six.moves import xrange (kubeflow#1877) Discovered in kubeflow#1721 __xrange()__ was removed in Python 3 in favor of an improved version of __range()__. This PR ensures equivalent functionality in both Python 2 and Python 3. ``` ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:100:40: F821 undefined name 'xrange' fake_image = [1.0 for _ in xrange(784)] ^ ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:102:41: F821 undefined name 'xrange' return [fake_image for _ in xrange(batch_size)], [ ^ ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:103:37: F821 undefined name 'xrange' fake_label for _ in xrange(batch_size)] ^ ``` @gaoning777 @Ark-kun Your reviews please. commit 79c7bda Author: Ning <[email protected]> Date: Tue Aug 20 09:24:56 2019 -0700 fix unit tests and address some comments (kubeflow#1892) commit 6a7b28f Author: Aakash Bajaj <[email protected]> Date: Tue Aug 20 14:07:33 2019 +0530 gcp cred bug fix for multiple credentials in single pipeline (kubeflow#1384) * gcp cred bug fix for multiple credentials in single pipeline * squash to remove conflict commit 60fd70c Author: Andy Wei <[email protected]> Date: Tue Aug 20 15:33:33 2019 +0800 Let backend apiserver mysql dbname configurable (kubeflow#1714) * let mysql dbname configurable * solve conflict * move declaration out to fix scope commit 101a346 Author: olegchorny <[email protected]> Date: Tue Aug 20 08:33:47 2019 +0300 'core' folder included to parameters related On-Premise cluster (kubeflow#1751) * 'core' folder included to parameters related On-Premise cluster Update is required because this sample was migrated to the samples/core folder * Update README.md commit 4c5d34f Author: dushyanthsc <[email protected]> Date: Mon Aug 19 18:57:33 2019 -0700 test/project-cleanup: Support to cleanup gke-clusters in test project (kubeflow#1857) Change to add base framework for cleaning up resources in a GCP project. The resource specification is specified declaratively using a YAML file. As per current requirements this change only adds cleaning up of GKE clusters.
commit 41d39c1 Author: Kirin Patel <[email protected]> Date: Wed Aug 21 19:38:31 2019 -0700 Add run with json data as input within fixed-data.ts for UI testing and development (kubeflow#1895) * Added run with json data as input * Changed run and uid to not be duplicates of hello-world-runtime commit 851e7c8 Author: Kirin Patel <[email protected]> Date: Wed Aug 21 19:04:31 2019 -0700 Replace codemirror editor react component with react-ace editor component (kubeflow#1890) * Replaced CodeMirror with Editor in PipelineDetails.tsx * Replaced codemirror in DetailsTable with react-ace * Removed codemirror * Updated unit tests for Editor.tsx to test placeholder and value in simplified manner * Updated DetailsTable.test.tsx.snap to reflect changes made in DetailsTable.tsx * Updated PipelineDetails test snapshot * Changed width of Editor in DetailsTable to be 100% instead of 300px * Revert "Updated unit tests for Editor.tsx to test placeholder and value in simplified manner" This reverts commit 40103f2. commit 8c3d6fe Author: Kirin Patel <[email protected]> Date: Wed Aug 21 18:30:33 2019 -0700 Add visualization-server service to lightweight deployment (kubeflow#1844) * Add visualization-server service to lightweight deployment * Addressed PR suggestions * Added field to determine if visualization service is active and fixed unit tests for visualization_server.go * Additional small fixes * port change from 88888 -> 8888 * version change from 0.1.15 -> 0.1.26 * removed visualization-server from base/kustomization.yaml * Fixed visualization_server_test.go to reflect new changes * Changed implementation to be fail fast * Changed host name to be constant provided by environment * Added retry and extracted isVisualizationServiceAlive logic to function * Fixed deployment.yaml file * Fixed serviceURL configuration issuse serviceURL is now properly obtained from the environment, the service ip address and port are used rather than service name and namespace * Added log message to indicate when visualization service is unreachable * Addressed PR comments * Removed _HTTP commit ad307db Author: Eterna2 <[email protected]> Date: Thu Aug 22 08:52:32 2019 +0800 [Bug Fix] Delete ResourceOp should not have output parameters (kubeflow#1822) * Fix bug where delete resource op should not have success_condition, failure_condition, and output parameters * remove unnecessary whitespace * compiler test for delete resource ops should retrieve templates from spec instead of root commit 593f25a Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 17:16:33 2019 -0700 Collecting coverage when running python tests (kubeflow#898) * Collecting coiverage when running python tests * Added coveralls to python unit tests * Try removing the PATH modification * Specifying coverage run --source * Using the installed package * Try getting the correct coverage paths commit 553885f Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 16:38:12 2019 -0700 SDK - Components - Fixed ModelBase comparison bug (kubeflow#1874) commit 2622c67 Author: IronPan <[email protected]> Date: Wed Aug 21 16:37:53 2019 -0700 cleanup test dir (kubeflow#1914) commit 203307d Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 16:37:21 2019 -0700 SDK - Lightweight - Fixed custom types in multi-output case (kubeflow#1875) The type was mistakenly serialized as `_ForwardRef('CustomType')`. The input parameter types and single-output types were not affected. commit 2e7f2d4 Author: IronPan <[email protected]> Date: Wed Aug 21 16:36:35 2019 -0700 Add cloud sql and gcs connection for pipeline-lite deployment (kubeflow#1910) * restructure * working example * working example * move mysql * moving minio and mysql out * add gcp * add files * fix test commit 9adf163 Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 16:29:54 2019 -0700 SDK - Airflow - Fixed bug in airflow op creation (kubeflow#1911) This PR fixes a bug in AirFlow op creation. The `_run_airflow_op` helper function was not captured along with the `_run_airflow_op_closure` function, because they belong to different modules (`_run_airflow_op_closure` was module-less). This was not discovered during the notebook testing of the code since in that environment the `_run_airflow_op` was also module-less as it was defined in a notebook (not in .py file). commit 7ec5697 Author: hongye-sun <[email protected]> Date: Wed Aug 21 16:06:31 2019 -0700 Release 151c534 (kubeflow#1916) * Updated component images to version 151c534 * Updated components to version a97f1d0 * Update setup.py * Update setup.py commit 7e062ce Author: IronPan <[email protected]> Date: Wed Aug 21 15:14:31 2019 -0700 Update README.md commit 8e1e823 Author: Christian Clauss <[email protected]> Date: Thu Aug 22 00:04:31 2019 +0200 Lint Python code for undefined names (kubeflow#1721) * Lint Python code for undefined names * Lint Python code for undefined names * Exclude tfdv.py to workaround an overzealous pytest * Fixup for tfdv.py * Fixup for tfdv.py * Fixup for tfdv.py commit 2399348 Author: dushyanthsc <[email protected]> Date: Wed Aug 21 13:44:31 2019 -0700 apiserver: Remove TFX output artifact recording to metadatastore (kubeflow#1904) commit 151c534 Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 12:56:33 2019 -0700 Simplified the build_docker_image function (kubeflow#1887) * Simplified the build_docker_image function * Extracted 'Dockerfile' into a variable Addressed PR feedback * Fixed conflict with updated master. * Addressed the feedback commit 3b7340f Author: IronPan <[email protected]> Date: Wed Aug 21 12:28:31 2019 -0700 Change the type of resource reference payload column (kubeflow#1905) Gorm doesn't automatically change the type of a column. This changes introduced a column type change which might not be effective for an existing cluster doing upgrade. kubeflow@4e43750#diff-c4afa92d7e54eecff0a482cf57490aa8R40 /assign @hongye-sun commit a4991fd Author: Kirin Patel <[email protected]> Date: Wed Aug 21 11:56:31 2019 -0700 Enable error propagation from nbconvert to frontend (kubeflow#1909) commit 94aa078 Author: Ning <[email protected]> Date: Wed Aug 21 10:22:31 2019 -0700 remove tfx notebook sample in favor of the TFX OSS sample (kubeflow#1908) commit 93e3121 Author: Ning <[email protected]> Date: Wed Aug 21 09:20:39 2019 -0700 remove kubeflow training (kubeflow#1902) commit 2592307 Author: Christian Clauss <[email protected]> Date: Wed Aug 21 04:15:19 2019 +0200 Undefined name 'e' in openvino (kubeflow#1876) Discovered in kubeflow#1721 ``` ./contrib/components/openvino/ovms-deployer/containers/evaluate.py:62:16: F821 undefined name 'e' except e: ^ ./contrib/components/openvino/ovms-deployer/containers/evaluate.py:63:50: F821 undefined name 'e' print("Can not read the image file", e) ^ ``` Your review please @Ark-kun commit 7b442f4 Author: Kirin Patel <[email protected]> Date: Tue Aug 20 18:27:19 2019 -0700 Created extensible code editor based on react-ace (kubeflow#1855) * Created extensible code editor based on react-ace * Installed dependencies * Updated unit tests for Editor.tsx to test placeholder and value in simplified manner * Updated Editor unit tests to use snapshot testing where applicable commit d11fae7 Author: Yuan (Bob) Gong <[email protected]> Date: Wed Aug 21 08:25:20 2019 +0800 Use KFP lite deployment for presubmit tests (kubeflow#1808) * Refactor presubmit-tests-with-pipeline-deployment.sh so that it can be run from a different project * Simplify getting service account from cluster. * Migrate presubmit-tests-with-pipeline-deployment.sh to use kfp lightweight deployment. * Add option to cache built images to make debugging faster. * Fix cluster set up * Copy image builder image instead of granting permission * Add missed yes command * fix stuff * Let other usages of image-builder image become configurable * let test workflow use image builder image * Fix permission issue * Hide irrelevant error logs * Use shared service account key instead * Move test manifest to test folder * Move build-images.sh to a different script file * Update README.md * add cluster info dump * Use the same cluster resources as kubeflow deployment * Remove cluster info dump * Add timing to test log * cleaned up code * fix tests * address cr comments * Address cr comments * Enable image caching to improve retest speed commit 0864faf Author: IronPan <[email protected]> Date: Tue Aug 20 14:09:19 2019 -0700 Use single part as default (kubeflow#1893) The data stored in artifact storage are usually small. Using multi-part is not strictly a requirement. Change the default to true to better support more platform out of box. commit 6284dc1 Author: Christian Clauss <[email protected]> Date: Tue Aug 20 22:01:18 2019 +0200 IBM Watson samples: from six.moves import xrange (kubeflow#1877) Discovered in kubeflow#1721 __xrange()__ was removed in Python 3 in favor of an improved version of __range()__. This PR ensures equivalent functionality in both Python 2 and Python 3. ``` ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:100:40: F821 undefined name 'xrange' fake_image = [1.0 for _ in xrange(784)] ^ ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:102:41: F821 undefined name 'xrange' return [fake_image for _ in xrange(batch_size)], [ ^ ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:103:37: F821 undefined name 'xrange' fake_label for _ in xrange(batch_size)] ^ ``` @gaoning777 @Ark-kun Your reviews please. commit 79c7bda Author: Ning <[email protected]> Date: Tue Aug 20 09:24:56 2019 -0700 fix unit tests and address some comments (kubeflow#1892) commit 6a7b28f Author: Aakash Bajaj <[email protected]> Date: Tue Aug 20 14:07:33 2019 +0530 gcp cred bug fix for multiple credentials in single pipeline (kubeflow#1384) * gcp cred bug fix for multiple credentials in single pipeline * squash to remove conflict commit 60fd70c Author: Andy Wei <[email protected]> Date: Tue Aug 20 15:33:33 2019 +0800 Let backend apiserver mysql dbname configurable (kubeflow#1714) * let mysql dbname configurable * solve conflict * move declaration out to fix scope commit 101a346 Author: olegchorny <[email protected]> Date: Tue Aug 20 08:33:47 2019 +0300 'core' folder included to parameters related On-Premise cluster (kubeflow#1751) * 'core' folder included to parameters related On-Premise cluster Update is required because this sample was migrated to the samples/core folder * Update README.md commit 4c5d34f Author: dushyanthsc <[email protected]> Date: Mon Aug 19 18:57:33 2019 -0700 test/project-cleanup: Support to cleanup gke-clusters in test project (kubeflow#1857) Change to add base framework for cleaning up resources in a GCP project. The resource specification is specified declaratively using a YAML file. As per current requirements this change only adds cleaning up of GKE clusters. commit 0ed5819 Author: Yuan (Bob) Gong <[email protected]> Date: Tue Aug 20 08:13:31 2019 +0800 Refactor presubmit-tests-with-pipeline-deployment.sh to run in other projects (kubeflow#1732) * Refactor presubmit-tests-with-pipeline-deployment.sh so that it can be run from a different project * Simplify getting service account from cluster. * Copy image builder image instead of granting permission * Add missed yes command * fix stuff * Let other usages of image-builder image become configurable * let test workflow use image builder image commit 0369a4c Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 21:02:07 2019 -0700 Update manifests to point to 0.26 release. (kubeflow#1870) commit 2b246bc Author: Alexey Volkov <[email protected]> Date: Fri Aug 16 19:54:07 2019 -0700 SDK - Tests - Improved the "ContainerOp.set_retry" test (kubeflow#1843) Properly testing the feature isntead of just comparing with golden data. commit d66508d Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 18:42:48 2019 -0700 Update changelog for 0.1.26 (kubeflow#1872) * Update changelog for 0.1.26 * Update changelog. commit 1d99704 Author: Alexey Volkov <[email protected]> Date: Fri Aug 16 18:24:09 2019 -0700 SDK - Fixed string comparisons (kubeflow#1756) commit 6d5ffa2 Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 16:57:33 2019 -0700 Remove copying of tfx data for cloudbuild release steps. (kubeflow#1871) commit de538aa Author: Ning <[email protected]> Date: Fri Aug 16 15:53:07 2019 -0700 add compile step in the samples to generate zip files (kubeflow#1866) * add compile step in the samples to generate zip files commit 5bcc665 Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 13:58:23 2019 -0700 Update Python SDK versions for release. (kubeflow#1860) commit 0d898cb Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 13:16:11 2019 -0700 Release 0517114 (kubeflow#1859) * Updated component images to version 0517114 * Updated components to version 48dd338 commit 7dbca1a Author: Ning <[email protected]> Date: Fri Aug 16 12:06:37 2019 -0700 update gcloud ml-engine to ai-platform (kubeflow#1863) commit e849f22 Author: Ryan Dawson <[email protected]> Date: Fri Aug 16 19:21:23 2019 +0100 Seldon examples (kubeflow#1405) commit a6f3f40 Author: sina chavoshi <[email protected]> Date: Fri Aug 16 09:56:09 2019 -0700 Adding a sample for serving component (kubeflow#1830) * Adding a sample for serving component * removed typo / updated based on PR feedback * fixing the jupyter rendering issue * adding pip3 for tensorflow * Fixed spelling error in VERSION * fix indentation based on review feedback commit 54ff3e6 Author: Alexey Volkov <[email protected]> Date: Fri Aug 16 01:22:31 2019 -0700 SDK - Cleanup - Serialized PipelineParamTuple does not need value or type (kubeflow#1469) * SDK - Refactoring - Serialized PipelineParam does not need type Only the types in non-serialized PipelineParams are ever used. * SDK - Refactoring - Serialized PipelineParam does not need value Default values are only relevant when PipelineParam is used in the pipeline function signature and even in this case compiler captures them explicitly from the pipelineParam objects in the signature. There is no other uses for them. commit d2e94e4 Author: Riley Bauer <[email protected]> Date: Thu Aug 15 19:52:34 2019 -0700 Fix run duration bug (kubeflow#1827) * Allows durations >=24h and renames 'showLink' in RunList * Update, fix tests commit d8eaeaa Author: Alexey Volkov <[email protected]> Date: Thu Aug 15 17:25:59 2019 -0700 SDK - Preserving the pipeline input information in the compiled Workflow (kubeflow#1381) * SDK - Preserving the pipeline metadata in the compiled Workflow * Stabilizing the DSL compiler tests commit afe8a69 Author: Kirin Patel <[email protected]> Date: Thu Aug 15 17:00:28 2019 -0700 Reduce API usage by utilizing reference name in reference resource API (kubeflow#1824) * Regenerated run api for frontend * Added support for reference name to resource reference API in frontend * Revert "Regenerated run api for frontend" * Addressed PR comments * Removed extra if statement by setting default value of parameter * Removed the whole comment * Addressed PR feedback * Addressed PR feedback * Simplified logic after offline discussion commit ea67c99 Author: Kirin Patel <[email protected]> Date: Thu Aug 15 16:18:35 2019 -0700 Change how Variables are Provided to Visualizations (kubeflow#1754) * Changed way visualization variables are passed from request to NotebookNode Visualization variables are now saved to a json file and loaded by a NotebookNode upon execution. * Updated roc_curve visualization to reflect changes made to dependency injection * Fixed bug where checking if is_generated is provided to roc_curve visualization would crash visualizaiton Also changed ' -> " * Changed text_exporter to always sort variables by key for testing * Addressed PR suggestions commit d238bef Author: Jiaxiao Zheng <[email protected]> Date: Thu Aug 15 12:55:29 2019 -0700 Add back coveralls. (kubeflow#1849) * Remove redundant import. * Simplify sample_test.yaml by using withItem syntax. * Simplify sample_test.yaml by using withItem syntax. * Change dict to str in withItems. * Add back coveralls. commit 0517114 Author: Riley Bauer <[email protected]> Date: Thu Aug 15 12:28:35 2019 -0700 Reduce getPipeline calls in RunList (kubeflow#1852) * Skips calling getPipeline in RunList if the pipeline name is in the pipeline_spec * Update fixed data to include pipeline names in pipeline specs * Remove redundant getRuns call commit 39e5840 Author: IronPan <[email protected]> Date: Thu Aug 15 11:04:34 2019 -0700 Add retry button in Pipeline UI (kubeflow#1782) * add retry button * add retry button * add retry button * address comments * fix tests * fix tests * update image * Update StatusUtils.test.tsx * Update RunDetails.test.tsx * Update Buttons.ts * update test * update frontend * update * update * addrerss comments * update test
) * Remove redundant import. * Simplify sample_test.yaml by using withItem syntax. * Simplify sample_test.yaml by using withItem syntax. * Change dict to str in withItems. * Add back coveralls. * Squashed commit of the following: commit 7dbca1a Author: Ning <[email protected]> Date: Fri Aug 16 12:06:37 2019 -0700 update gcloud ml-engine to ai-platform (#1863) commit e849f22 Author: Ryan Dawson <[email protected]> Date: Fri Aug 16 19:21:23 2019 +0100 Seldon examples (#1405) commit a6f3f40 Author: sina chavoshi <[email protected]> Date: Fri Aug 16 09:56:09 2019 -0700 Adding a sample for serving component (#1830) * Adding a sample for serving component * removed typo / updated based on PR feedback * fixing the jupyter rendering issue * adding pip3 for tensorflow * Fixed spelling error in VERSION * fix indentation based on review feedback commit 54ff3e6 Author: Alexey Volkov <[email protected]> Date: Fri Aug 16 01:22:31 2019 -0700 SDK - Cleanup - Serialized PipelineParamTuple does not need value or type (#1469) * SDK - Refactoring - Serialized PipelineParam does not need type Only the types in non-serialized PipelineParams are ever used. * SDK - Refactoring - Serialized PipelineParam does not need value Default values are only relevant when PipelineParam is used in the pipeline function signature and even in this case compiler captures them explicitly from the pipelineParam objects in the signature. There is no other uses for them. commit d2e94e4 Author: Riley Bauer <[email protected]> Date: Thu Aug 15 19:52:34 2019 -0700 Fix run duration bug (#1827) * Allows durations >=24h and renames 'showLink' in RunList * Update, fix tests commit d8eaeaa Author: Alexey Volkov <[email protected]> Date: Thu Aug 15 17:25:59 2019 -0700 SDK - Preserving the pipeline input information in the compiled Workflow (#1381) * SDK - Preserving the pipeline metadata in the compiled Workflow * Stabilizing the DSL compiler tests commit afe8a69 Author: Kirin Patel <[email protected]> Date: Thu Aug 15 17:00:28 2019 -0700 Reduce API usage by utilizing reference name in reference resource API (#1824) * Regenerated run api for frontend * Added support for reference name to resource reference API in frontend * Revert "Regenerated run api for frontend" * Addressed PR comments * Removed extra if statement by setting default value of parameter * Removed the whole comment * Addressed PR feedback * Addressed PR feedback * Simplified logic after offline discussion commit ea67c99 Author: Kirin Patel <[email protected]> Date: Thu Aug 15 16:18:35 2019 -0700 Change how Variables are Provided to Visualizations (#1754) * Changed way visualization variables are passed from request to NotebookNode Visualization variables are now saved to a json file and loaded by a NotebookNode upon execution. * Updated roc_curve visualization to reflect changes made to dependency injection * Fixed bug where checking if is_generated is provided to roc_curve visualization would crash visualizaiton Also changed ' -> " * Changed text_exporter to always sort variables by key for testing * Addressed PR suggestions commit d238bef Author: Jiaxiao Zheng <[email protected]> Date: Thu Aug 15 12:55:29 2019 -0700 Add back coveralls. (#1849) * Remove redundant import. * Simplify sample_test.yaml by using withItem syntax. * Simplify sample_test.yaml by using withItem syntax. * Change dict to str in withItems. * Add back coveralls. commit 0517114 Author: Riley Bauer <[email protected]> Date: Thu Aug 15 12:28:35 2019 -0700 Reduce getPipeline calls in RunList (#1852) * Skips calling getPipeline in RunList if the pipeline name is in the pipeline_spec * Update fixed data to include pipeline names in pipeline specs * Remove redundant getRuns call commit 39e5840 Author: IronPan <[email protected]> Date: Thu Aug 15 11:04:34 2019 -0700 Add retry button in Pipeline UI (#1782) * add retry button * add retry button * add retry button * address comments * fix tests * fix tests * update image * Update StatusUtils.test.tsx * Update RunDetails.test.tsx * Update Buttons.ts * update test * update frontend * update * update * addrerss comments * update test * WIP: rewrite sample test infra step 1, presubmit check triggered by fire. * WIP: rewrite sample test infra step 2, add ComponentTest class * WIP: rewrite sample test infra Rewrite dockerfile * Fixing * Refactor the logic of run_test.sh into sample_test_launcher.py * Fix exit_code * Fix imports and add todo * Squashed commit of the following: commit 0ed5819 Author: Yuan (Bob) Gong <[email protected]> Date: Tue Aug 20 08:13:31 2019 +0800 Refactor presubmit-tests-with-pipeline-deployment.sh to run in other projects (#1732) * Refactor presubmit-tests-with-pipeline-deployment.sh so that it can be run from a different project * Simplify getting service account from cluster. * Copy image builder image instead of granting permission * Add missed yes command * fix stuff * Let other usages of image-builder image become configurable * let test workflow use image builder image * Clean up. * Fix launcher flag. * switch to sys.execxutable. * Fix bugs * Add copy blob util function fix issues, and Squashed commit of the following: commit 0ed5819 Author: Yuan (Bob) Gong <[email protected]> Date: Tue Aug 20 08:13:31 2019 +0800 Refactor presubmit-tests-with-pipeline-deployment.sh to run in other projects (#1732) * Refactor presubmit-tests-with-pipeline-deployment.sh so that it can be run from a different project * Simplify getting service account from cluster. * Copy image builder image instead of granting permission * Add missed yes command * fix stuff * Let other usages of image-builder image become configurable * let test workflow use image builder image commit 0369a4c Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 21:02:07 2019 -0700 Update manifests to point to 0.26 release. (#1870) commit 2b246bc Author: Alexey Volkov <[email protected]> Date: Fri Aug 16 19:54:07 2019 -0700 SDK - Tests - Improved the "ContainerOp.set_retry" test (#1843) Properly testing the feature isntead of just comparing with golden data. commit d66508d Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 18:42:48 2019 -0700 Update changelog for 0.1.26 (#1872) * Update changelog for 0.1.26 * Update changelog. commit 1d99704 Author: Alexey Volkov <[email protected]> Date: Fri Aug 16 18:24:09 2019 -0700 SDK - Fixed string comparisons (#1756) commit 6d5ffa2 Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 16:57:33 2019 -0700 Remove copying of tfx data for cloudbuild release steps. (#1871) commit de538aa Author: Ning <[email protected]> Date: Fri Aug 16 15:53:07 2019 -0700 add compile step in the samples to generate zip files (#1866) * add compile step in the samples to generate zip files commit 5bcc665 Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 13:58:23 2019 -0700 Update Python SDK versions for release. (#1860) commit 0d898cb Author: Ajay Gopinathan <[email protected]> Date: Fri Aug 16 13:16:11 2019 -0700 Release 0517114 (#1859) * Updated component images to version 0517114 * Updated components to version 48dd338 commit 7dbca1a Author: Ning <[email protected]> Date: Fri Aug 16 12:06:37 2019 -0700 update gcloud ml-engine to ai-platform (#1863) commit e849f22 Author: Ryan Dawson <[email protected]> Date: Fri Aug 16 19:21:23 2019 +0100 Seldon examples (#1405) commit a6f3f40 Author: sina chavoshi <[email protected]> Date: Fri Aug 16 09:56:09 2019 -0700 Adding a sample for serving component (#1830) * Adding a sample for serving component * removed typo / updated based on PR feedback * fixing the jupyter rendering issue * adding pip3 for tensorflow * Fixed spelling error in VERSION * fix indentation based on review feedback commit 54ff3e6 Author: Alexey Volkov <[email protected]> Date: Fri Aug 16 01:22:31 2019 -0700 SDK - Cleanup - Serialized PipelineParamTuple does not need value or type (#1469) * SDK - Refactoring - Serialized PipelineParam does not need type Only the types in non-serialized PipelineParams are ever used. * SDK - Refactoring - Serialized PipelineParam does not need value Default values are only relevant when PipelineParam is used in the pipeline function signature and even in this case compiler captures them explicitly from the pipelineParam objects in the signature. There is no other uses for them. commit d2e94e4 Author: Riley Bauer <[email protected]> Date: Thu Aug 15 19:52:34 2019 -0700 Fix run duration bug (#1827) * Allows durations >=24h and renames 'showLink' in RunList * Update, fix tests commit d8eaeaa Author: Alexey Volkov <[email protected]> Date: Thu Aug 15 17:25:59 2019 -0700 SDK - Preserving the pipeline input information in the compiled Workflow (#1381) * SDK - Preserving the pipeline metadata in the compiled Workflow * Stabilizing the DSL compiler tests commit afe8a69 Author: Kirin Patel <[email protected]> Date: Thu Aug 15 17:00:28 2019 -0700 Reduce API usage by utilizing reference name in reference resource API (#1824) * Regenerated run api for frontend * Added support for reference name to resource reference API in frontend * Revert "Regenerated run api for frontend" * Addressed PR comments * Removed extra if statement by setting default value of parameter * Removed the whole comment * Addressed PR feedback * Addressed PR feedback * Simplified logic after offline discussion commit ea67c99 Author: Kirin Patel <[email protected]> Date: Thu Aug 15 16:18:35 2019 -0700 Change how Variables are Provided to Visualizations (#1754) * Changed way visualization variables are passed from request to NotebookNode Visualization variables are now saved to a json file and loaded by a NotebookNode upon execution. * Updated roc_curve visualization to reflect changes made to dependency injection * Fixed bug where checking if is_generated is provided to roc_curve visualization would crash visualizaiton Also changed ' -> " * Changed text_exporter to always sort variables by key for testing * Addressed PR suggestions commit d238bef Author: Jiaxiao Zheng <[email protected]> Date: Thu Aug 15 12:55:29 2019 -0700 Add back coveralls. (#1849) * Remove redundant import. * Simplify sample_test.yaml by using withItem syntax. * Simplify sample_test.yaml by using withItem syntax. * Change dict to str in withItems. * Add back coveralls. commit 0517114 Author: Riley Bauer <[email protected]> Date: Thu Aug 15 12:28:35 2019 -0700 Reduce getPipeline calls in RunList (#1852) * Skips calling getPipeline in RunList if the pipeline name is in the pipeline_spec * Update fixed data to include pipeline names in pipeline specs * Remove redundant getRuns call commit 39e5840 Author: IronPan <[email protected]> Date: Thu Aug 15 11:04:34 2019 -0700 Add retry button in Pipeline UI (#1782) * add retry button * add retry button * add retry button * address comments * fix tests * fix tests * update image * Update StatusUtils.test.tsx * Update RunDetails.test.tsx * Update Buttons.ts * update test * update frontend * update * update * addrerss comments * update test * Fix default image prefix. * Fix dependencies. * Fix gs op * Lint * Fix papermill exec * Init for componentTest * Debugging. * Try to fix the issue. Possible cause: nteract/papermill#410 * Fix exit code * Another fix. * Squashed commit of the following: commit 2592307 Author: Christian Clauss <[email protected]> Date: Wed Aug 21 04:15:19 2019 +0200 Undefined name 'e' in openvino (#1876) Discovered in #1721 ``` ./contrib/components/openvino/ovms-deployer/containers/evaluate.py:62:16: F821 undefined name 'e' except e: ^ ./contrib/components/openvino/ovms-deployer/containers/evaluate.py:63:50: F821 undefined name 'e' print("Can not read the image file", e) ^ ``` Your review please @Ark-kun commit 7b442f4 Author: Kirin Patel <[email protected]> Date: Tue Aug 20 18:27:19 2019 -0700 Created extensible code editor based on react-ace (#1855) * Created extensible code editor based on react-ace * Installed dependencies * Updated unit tests for Editor.tsx to test placeholder and value in simplified manner * Updated Editor unit tests to use snapshot testing where applicable commit d11fae7 Author: Yuan (Bob) Gong <[email protected]> Date: Wed Aug 21 08:25:20 2019 +0800 Use KFP lite deployment for presubmit tests (#1808) * Refactor presubmit-tests-with-pipeline-deployment.sh so that it can be run from a different project * Simplify getting service account from cluster. * Migrate presubmit-tests-with-pipeline-deployment.sh to use kfp lightweight deployment. * Add option to cache built images to make debugging faster. * Fix cluster set up * Copy image builder image instead of granting permission * Add missed yes command * fix stuff * Let other usages of image-builder image become configurable * let test workflow use image builder image * Fix permission issue * Hide irrelevant error logs * Use shared service account key instead * Move test manifest to test folder * Move build-images.sh to a different script file * Update README.md * add cluster info dump * Use the same cluster resources as kubeflow deployment * Remove cluster info dump * Add timing to test log * cleaned up code * fix tests * address cr comments * Address cr comments * Enable image caching to improve retest speed commit 0864faf Author: IronPan <[email protected]> Date: Tue Aug 20 14:09:19 2019 -0700 Use single part as default (#1893) The data stored in artifact storage are usually small. Using multi-part is not strictly a requirement. Change the default to true to better support more platform out of box. commit 6284dc1 Author: Christian Clauss <[email protected]> Date: Tue Aug 20 22:01:18 2019 +0200 IBM Watson samples: from six.moves import xrange (#1877) Discovered in #1721 __xrange()__ was removed in Python 3 in favor of an improved version of __range()__. This PR ensures equivalent functionality in both Python 2 and Python 3. ``` ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:100:40: F821 undefined name 'xrange' fake_image = [1.0 for _ in xrange(784)] ^ ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:102:41: F821 undefined name 'xrange' return [fake_image for _ in xrange(batch_size)], [ ^ ./samples/contrib/ibm-samples/watson/source/model-source-code/tf-model/input_data.py:103:37: F821 undefined name 'xrange' fake_label for _ in xrange(batch_size)] ^ ``` @gaoning777 @Ark-kun Your reviews please. commit 79c7bda Author: Ning <[email protected]> Date: Tue Aug 20 09:24:56 2019 -0700 fix unit tests and address some comments (#1892) commit 6a7b28f Author: Aakash Bajaj <[email protected]> Date: Tue Aug 20 14:07:33 2019 +0530 gcp cred bug fix for multiple credentials in single pipeline (#1384) * gcp cred bug fix for multiple credentials in single pipeline * squash to remove conflict commit 60fd70c Author: Andy Wei <[email protected]> Date: Tue Aug 20 15:33:33 2019 +0800 Let backend apiserver mysql dbname configurable (#1714) * let mysql dbname configurable * solve conflict * move declaration out to fix scope commit 101a346 Author: olegchorny <[email protected]> Date: Tue Aug 20 08:33:47 2019 +0300 'core' folder included to parameters related On-Premise cluster (#1751) * 'core' folder included to parameters related On-Premise cluster Update is required because this sample was migrated to the samples/core folder * Update README.md commit 4c5d34f Author: dushyanthsc <[email protected]> Date: Mon Aug 19 18:57:33 2019 -0700 test/project-cleanup: Support to cleanup gke-clusters in test project (#1857) Change to add base framework for cleaning up resources in a GCP project. The resource specification is specified declaratively using a YAML file. As per current requirements this change only adds cleaning up of GKE clusters. * Fix working dir * Fix exit code format * Remove kfp notebook sample using tfx:oss components. * Refactor to reduce potential dup code. * Clear unused const. * Fix redundant check * Add image injection for component test. * Fix unused import * Squash and merge from master * Squashed commit of the following: commit 41d39c1 Author: Kirin Patel <[email protected]> Date: Wed Aug 21 19:38:31 2019 -0700 Add run with json data as input within fixed-data.ts for UI testing and development (#1895) * Added run with json data as input * Changed run and uid to not be duplicates of hello-world-runtime commit 851e7c8 Author: Kirin Patel <[email protected]> Date: Wed Aug 21 19:04:31 2019 -0700 Replace codemirror editor react component with react-ace editor component (#1890) * Replaced CodeMirror with Editor in PipelineDetails.tsx * Replaced codemirror in DetailsTable with react-ace * Removed codemirror * Updated unit tests for Editor.tsx to test placeholder and value in simplified manner * Updated DetailsTable.test.tsx.snap to reflect changes made in DetailsTable.tsx * Updated PipelineDetails test snapshot * Changed width of Editor in DetailsTable to be 100% instead of 300px * Revert "Updated unit tests for Editor.tsx to test placeholder and value in simplified manner" This reverts commit 40103f2. commit 8c3d6fe Author: Kirin Patel <[email protected]> Date: Wed Aug 21 18:30:33 2019 -0700 Add visualization-server service to lightweight deployment (#1844) * Add visualization-server service to lightweight deployment * Addressed PR suggestions * Added field to determine if visualization service is active and fixed unit tests for visualization_server.go * Additional small fixes * port change from 88888 -> 8888 * version change from 0.1.15 -> 0.1.26 * removed visualization-server from base/kustomization.yaml * Fixed visualization_server_test.go to reflect new changes * Changed implementation to be fail fast * Changed host name to be constant provided by environment * Added retry and extracted isVisualizationServiceAlive logic to function * Fixed deployment.yaml file * Fixed serviceURL configuration issuse serviceURL is now properly obtained from the environment, the service ip address and port are used rather than service name and namespace * Added log message to indicate when visualization service is unreachable * Addressed PR comments * Removed _HTTP commit ad307db Author: Eterna2 <[email protected]> Date: Thu Aug 22 08:52:32 2019 +0800 [Bug Fix] Delete ResourceOp should not have output parameters (#1822) * Fix bug where delete resource op should not have success_condition, failure_condition, and output parameters * remove unnecessary whitespace * compiler test for delete resource ops should retrieve templates from spec instead of root commit 593f25a Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 17:16:33 2019 -0700 Collecting coverage when running python tests (#898) * Collecting coiverage when running python tests * Added coveralls to python unit tests * Try removing the PATH modification * Specifying coverage run --source * Using the installed package * Try getting the correct coverage paths commit 553885f Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 16:38:12 2019 -0700 SDK - Components - Fixed ModelBase comparison bug (#1874) commit 2622c67 Author: IronPan <[email protected]> Date: Wed Aug 21 16:37:53 2019 -0700 cleanup test dir (#1914) commit 203307d Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 16:37:21 2019 -0700 SDK - Lightweight - Fixed custom types in multi-output case (#1875) The type was mistakenly serialized as `_ForwardRef('CustomType')`. The input parameter types and single-output types were not affected. commit 2e7f2d4 Author: IronPan <[email protected]> Date: Wed Aug 21 16:36:35 2019 -0700 Add cloud sql and gcs connection for pipeline-lite deployment (#1910) * restructure * working example * working example * move mysql * moving minio and mysql out * add gcp * add files * fix test commit 9adf163 Author: Alexey Volkov <[email protected]> Date: Wed Aug 21 16:29:54 2019 -0700 SDK - Airflow - Fixed bug in airflow op creation (#1911) This PR fixes a bug in AirFlow op creation. The `_run_airflow_op` helper function was not captured along with the `_run_airflow_op_closure` function, because they belong to different modules (`_run_airflow_op_closure` was module-less). This was not discovered during the notebook testing of the code since in that environment the `_run_airflow_op` was also module-less as it was defined in a notebook (not in .py file). commit 7ec5697 Author: hongye-sun <[email protected]> Date: Wed Aug 21 16:06:31 2019 -0700 Release 151c534 (#1916) * Updated component images to version 151c534 * Updated components to version a97f1d0 * Update setup.py * Update setup.py commit 7e062ce Author: IronPan <[email protected]> Date: Wed Aug 21 15:14:31 2019 -0700 Update README.md commit 8e1e823 Author: Christian Clauss <[email protected]> Date: Thu Aug 22 00:04:31 2019 +0200 Lint Python code for undefined names (#1721) * Lint Python code for undefined names * Lint Python code for undefined names * Exclude tfdv.py to workaround an overzealous pytest * Fixup for tfdv.py * Fixup for tfdv.py * Fixup for tfdv.py * # Conflicts: # sdk/python/tests/components/test_python_op.py * Fix version for papermill * Squashed commit of the following: commit af456bc Author: Eterna2 <[email protected]> Date: Fri Aug 23 02:08:33 2019 +0800 [front-end-server] Allow viewer:tensorboard podTemplateSpec to be customizable (#1906) * Allow front-end server to provide custom viewer podTemplateSpec via path/configmap * Fix JSON.parse input to string commit 06b7ad6 Author: IronPan <[email protected]> Date: Thu Aug 22 10:03:28 2019 -0700 Cleanup pipeline-lite deployment (#1921) * restructure * working example * working example * move mysql * moving minio and mysql out * add gcp * add files * fix test * extract parameters to single place * update * update readme * update readme * address pr comment commit 56160f1 Author: Kirin Patel <[email protected]> Date: Thu Aug 22 09:26:33 2019 -0700 Added support for environment specified kernel timeout (#1920) * Fix google-cloud-storage and google fire version info. * Delete run_test.sh * Merge from kfp master * Squashed commit of the following: commit 7b6825d Author: hongye-sun <[email protected]> Date: Thu Aug 22 18:23:18 2019 -0700 Update changelog for 0.1.27 (#1935) * Update changelog for 0.1.27 * Revert unexpected changes. commit 336760c Author: IronPan <[email protected]> Date: Thu Aug 22 17:47:18 2019 -0700 sync namespaced install file (#1932) commit 55d62fe Author: Hamed <[email protected]> Date: Fri Aug 23 01:09:18 2019 +0100 Support Affinity for ContainerOps (#1886) commit d15697b Author: Animesh Singh <[email protected]> Date: Thu Aug 22 16:29:17 2019 -0700 Initial kfserving pipeline component (#1838) * initial kfserving pipeline component * changing some descriptions * adding action as an input, and pinning versions * moving Dockerfile in correct location * adding namespace to delete commit 119405d Author: hongye-sun <[email protected]> Date: Thu Aug 22 15:49:29 2019 -0700 upgrade backend image versions (#1918) commit c01315a Author: Alexey Volkov <[email protected]> Date: Thu Aug 22 15:31:24 2019 -0700 SDK - Refactoring - Replaced the TypeMeta class (#1930) * SDK - Refactoring - Replaced the TypeMeta class The PipelineParam no longer exposes the private TypeMeta class Fixes #1420 The refactoring PR is part of a series of PR which unifies the metadata and specification types. commit 736c7a5 Author: IronPan <[email protected]> Date: Thu Aug 22 15:29:19 2019 -0700 clean up owner file (#1928) commit 5abba66 Author: Kirin Patel <[email protected]> Date: Thu Aug 22 14:45:18 2019 -0700 Add pipeline id to pipeline summary card (#1927) * Added pipeline id to pipeline summary card * Updated PipelineDetails.test.tsx.snap commit 3c8952e Author: Kirin Patel <[email protected]> Date: Thu Aug 22 13:57:20 2019 -0700 Add TFDV, TFMA, and Table visualization support for Python based visualizations (#1898) * Added table and tfdv visualization Also fixed issue surrounding ApiVisualizationType enum * Fixed table visualization * Removed byte limit * Fixed issue where headers would not properly be applied * Fixed issue where table would not be intractable * Updated table visualizaiton to reflect changes made to dependency injection * Fixed bug where checking if headers is provided to table visualizations could crash visualization * Added TFMA visualization * Updated new visualizations to match syntax of #1878 * Updated test snapshots to account for TFMA visualization * Small if statement synax changes * Add flake8 noqa comments to table.py and tfma.py commit db6c9b4 Author: dushyanthsc <[email protected]> Date: Thu Aug 22 13:11:18 2019 -0700 pipeline-lite: Introduce metadata component to pipeline-lite (#1840) This change introduces the metadata component to pipeline-lite installation. This installation: 1. Does not include metadata-ui 2. mysql installation is used instead of metadata-db 3. Replica count has been reduced 1 instead of 3 * Fix check_notebook_results args. * Fix. * Fix work dir path * Fix global const naming convention * Remove redundant check. * Extract project name from result gcs dir. * Add error catching * Name change. * Improve error catching. * Improve error catching. * naming correction.
Parts of #1639
This change is