Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(sdk): ImporterSpec v2 #6917

Merged
merged 10 commits into from
Nov 18, 2021
Merged

Conversation

ji-yaqi
Copy link
Contributor

@ji-yaqi ji-yaqi commented Nov 17, 2021

Add Importer Spec for v2.

The generated IR has the following difference with v1:

  1. Generate PipelineSpec, instead of PipelineJob
  2. Default name change from param to channel, e.g. pipelineparam--dataset2 to pipelinechannel--dataset2
  3. Keyword changes for schema 2.1.0, e.g. type to parameterType, adding defaultValue etc.
  4. Artifact types in Output definitions is default to general types like system.Artifact (will investigate further)

@google-oss-prow
Copy link

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

@ji-yaqi ji-yaqi marked this pull request as ready for review November 17, 2021 11:34
Copy link
Member

@chensun chensun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, Yaqi!

pipeline.yaml Outdated
@@ -0,0 +1,124 @@
apiVersion: argoproj.io/v1alpha1
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

revert this file?

@@ -1,164 +0,0 @@
# Copyright 2020 The Kubeflow Authors
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: ideally we should still keep a unit test file for this. Maybe test calling dsl.importer() with some value and then assert we got the right pipeline task object?
Given that we have the some test coverage in the compiler cli test, we could delay this a bit in a later PR.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay, will add some tests in following PRs.

deployment_config.executors[
executor_label].container.CopyFrom(
subgroup_container_spec)
if hasattr(subgroup, 'container_spec'):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Given that we document both container_spec and importer_spec as class attributes (in the docstring of PipelineTask), hasattr seems to be a bit contradictory.
How about we initialize them to None. and check if subgroup.container_spec is not None here?

Copy link
Member

@chensun chensun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm
/approve

Thanks!

@google-oss-prow
Copy link

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: chensun

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@google-oss-prow google-oss-prow bot merged commit c4d8dfd into kubeflow:master Nov 18, 2021
abaland pushed a commit to abaland/pipelines that referenced this pull request May 29, 2022
* importer

* compiler

* check diff

* fix format

* remove v1 node test

* remove importer v2 compat test

* release note

* address comments
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants