From 1cf3d5914cec9a7d03b4713d6847a1ccd66641d4 Mon Sep 17 00:00:00 2001 From: Hongchao Deng Date: Sun, 17 May 2020 11:02:54 -0700 Subject: [PATCH] update design --- ...5-spec-v1alpha2-dependency-parampassing.md | 102 -------- design/20200516-dependency.md | 233 ++++++++++++++++++ 2 files changed, 233 insertions(+), 102 deletions(-) delete mode 100644 design/20200305-spec-v1alpha2-dependency-parampassing.md create mode 100644 design/20200516-dependency.md diff --git a/design/20200305-spec-v1alpha2-dependency-parampassing.md b/design/20200305-spec-v1alpha2-dependency-parampassing.md deleted file mode 100644 index 9e9c926..0000000 --- a/design/20200305-spec-v1alpha2-dependency-parampassing.md +++ /dev/null @@ -1,102 +0,0 @@ -# Component and Trait Dependency with Parameter Passing - -* Author: @wonderflow @hongchaodeng - -## Background - -In https://github.com/oam-dev/spec/issues/171, we have discussed our dependency stories and started to explore solutions. -Now that we have implemented and evolved the design to solve real world problems, we are proposing the design to OAM spec. - -## Goals - -In this design, we want to help applications reduce the burden of handling fault tolerance and dependency health. -By putting this work in the OAM platform layer, OAM users can define explicit dependency and parameter passing in a standard way -without carign about the implementation details. - -## Use Cases - -### Case 1: MySQL and Web - -Two components, MySQL and Web exists in the same AppConfig and Web depends on MySQL that: - -1. MySQL needs to start first; -2. Web depends on connection credentials from MySQL Component. Once MySQL has started, the credentials will be injected into Web component's environment variables via service binding. - -### Case 2: NAS FileSystem and MountTarget - -Two components, NAS_FileSystem and NAS_MountTarget exists in the same _ApplicationConfiguration_ and NAS_MountTarget depends on NAS_FileSystem that: - -1. NAS_FileSystem needs to be created first; -2. NAS_MountTarget depends on filesystemID from NAS_FileSystem to fill in NAS_MountTarget's spec. - -## Proposal - -### Component Dependency - -For use case 1 and 2, we are proposing to add `dependencies` in _ApplicationConfiguration_ which a component can describe its dependent components. - -```yaml -apiVersion: core.oam.dev/v1alpha2 -kind: ApplicationConfiguration -spec: -- components: - - componentName: mysql - - componentName: web - dependencies: - - mysql ---- -kind: ApplicationConfiguration -spec: -- components: - - componentName: nas-file-system - - componentName: nas-mount-target - dependencies: - - nas-file-system -``` - -### Trait Ordering - -For use case 1, we are using a trait called [ServiceBinding](https://github.com/oam-dev/trait-injector) to auto-bind credentials. ServiceBinding trait should be applied before component starts. We propose to add `stage` in TraitDefinition to express ordering relationship with the workload: - -```yaml -apiVersion: core.oam.dev/v1alpha2 -kind: TraitDefinition -spec: - appliesToWorkloads: - - core.oam.dev/v1alpha2.ContainerizedWorkload - stage: preStart # other options: postStart | preStop | postStop -``` - -### Parameter Passing - -For use case 2, the filesystemID from NasFileSystem's status needs to be passed to NasMountTarget's spec. To solve this problem, we propose to extend _ParameterValue_ to accept value from another component's field: - -```yaml -kind: ApplicationConfiguration -spec: - components: - - componentName: nas-file-system # The Workload of nas-file-system output a filesystemID at .status.filesystemID - - - componentName: nas-mount-target - dependencies: - - nas-file-system - parameterValues: - - name: filesystem-id - from: - compoent: nas-file-system - fieldPath: ".status.filesystemID" # take the field's value from nas-file-system's workload ---- -apiVersion: core.oam.dev/v1alpha2 -kind: Component -metadata: - name: nas-mount-target -spec: - workload: - apiVersion: ros.aliyun.com/v1alpha2 - kind: NASMountTarget - spec: - filesystemID: "" # to be filled - parameters: - - name: filesystem-id - fieldPath: ".spec.filesystemID" -``` diff --git a/design/20200516-dependency.md b/design/20200516-dependency.md new file mode 100644 index 0000000..c6dfa85 --- /dev/null +++ b/design/20200516-dependency.md @@ -0,0 +1,233 @@ +# Resource Dependency in OAM + +Owner: Hongchao Deng (@hongchaodeng) +Date: 05/17/2020 + +## Background + +Before creating some application services or resources, we often need to rely on +other services or resources to be ready or have some data available first. +This is a well-known pattern called dependency. + +In OAM, we talk about dependency in two different contexts: + +1. **Inter ApplicationConfiguration**: + There is dependency workflow between multiple ApplicationConfigurations. This usually can be achieved + via external workflow engine such as Tekton/Argo Workflow. +2. **Intra ApplicationConfiguration**: + There is dependency between Components and Traits within the same ApplicationConfiguration. + Since they are applied at the same time, external workflow engine couldn't do anything to control their ordering. + It is the OAM runtime that needs to satisfy this requirement. + +In this proposal, we are handling the second case, *Intra ApplicationConfiguration* dependency. + +## Goals + +This proposal includes the following goals: + +1. **Describe ordering for k8s-style resources**. + One key part of depedency is ordering, i.e. creating resources one after another. + For example, resource A could not be created unless resource B has been ready (we will describe the meaning of ready below). + We want to and could achieve this by generic k8s resource design. + Since both components and traits are k8s-style resources, as long as we achieve this goal it should satisfy OAM requirements too. +2. **Describe field data passing between resources**. + If we think dependency as ordering, it is still not enough. We also need to handle field data passing. + For example, resource A's field "spec.connSecret" relies on resource B's field "status.connSecret". + This means the field data of one resource is passed from another field of other resource and needs to wait the field data to be ready. + In fact, ordering in the first goal can be modeled as data dependency as well except that the receiver doesn't use the data. +3. **OAM native experience**. + OAM has application concepts like Components, Traits, etc. + The solution will provide OAM native experience built on top of the generic solution. For example, modeling as traits. + +In the following we will first go through the use cases and then propose a solution to satisfy them. + + +## Use Cases + +## 1. Web app and database + +An applicationConfiguration consists of two components: web and db (database). +Web component should not be created until db component is created, +including public endpoint and connection secret to be made ready. + +## 2. ML app workflow + +An ML applicationConfiguration from [Hypercycle ML](http://www.4paradigm.com/product/hypercycleml) +has three components: + +- preprocess +- training +- serving + +They have strict ordering: + +``` +preprocess -> training -> serving +``` + +Without the previous stage finished, the next couldn't start due to lack of data. +Today, ML frameworks including Tensorflow do not handle such dependency issue. +They rely on external tools (e.g. Argo Workflow) to handle it. +If we want to model them as Components in the same ApplicationConfiguration, the external tools could not help in this case. + +## 3. Service-binding Trait + +Service binding is an OAM trait that we develop to bind user specified input such as env or file path +to specific data sources such as secrets or events. +More background introduction can be found in this [slides](https://docs.google.com/presentation/d/1PseN_8_zZH8clWZJzP8tuRMpz51SxQb-mKeG1f4QVZQ/edit?usp=sharing). + +A service binding trait needs to be applied before creating the component. +Moreover, this kind of trait ordering is quite general and applies to many other traits we have seen. + +## 4. NSQ deployment + +An [NSQ](https://github.com/nsqio/nsq) cluster is composed with three components, that is nsqd nsqlookup and nsqadmin. +If we want to setup a nsq cluster, we should make sure nsqd is up before nsqlookup, then run a nsqadmin workload at last. +That is, nsqlookup consumes the result of nsqd component instance, nsqadmin consumes the result of nsqlookup component instance. +If we couldn't describe the dependencies order between the components, the component couldn't run properly. + +This is similar to [kubernetes#65502](https://github.com/kubernetes/kubernetes/issues/65502) , the issue describes dependencies between containers on the same Pod, while we are discussing dependencies between components. + + +## Proposal + +The overall idea is to have each resource object specify data inputs which are from fields of other resources. +The OAM runtime will build the dependency graph accordingly and only create resources once the data inputs are ready. +To achieve this, we propose: + +1. Add the following `DataInputs` into each resource. + + ```go + type DataInputs []DataInput + + type DataInput struct { + // FromDataOutput specifies the name of the DataOutput object in the same namespace. + FromDataOutput string `json:"fromDataOutput"` + + // ToFieldPaths specifies an array of fields within this resource object + // that will be overwritten by the value of given DataOutput. The type of the + // parameter (e.g. int, string) is inferred from the type of these fields; + // All fields must be of the same type. Fields are specified as JSON field + // paths without a leading dot, for example 'spec.replicas'. + // If empty, it means this input is not used at all. + // This is allowed for resource ordering purpose. + ToFieldPaths []string `json:"toFieldPaths"` + } + ``` + + For example, a web application may specify input of mysql secret: + + ```yaml + datainputs: + - toFieldPaths: ["spec.connSecret"] + fromDataOutput: mysql-conn-secret # check below DataOutput of the same name + ``` + + The DataInputs object will be json-marshaled and put into an annotation key `core.oam.dev/datainputs`. + + ```yaml + kind: Deployment + metadata: + name: my-web-app + annotations: + "core.oam.dev/datainputs": "[{\"toFieldPaths\": [\"spec.connSecret\"],\"fromDataOutput\": \"mysql-conn-secret\"}]" + ``` + +2. Add a new CRD `DataOutput` with the following definition: + + ```go + type DataOutput struct { + metav1.TypeMeta `json:",inline"` + metav1.ObjectMeta `json:"metadata,omitempty"` + + Spec DataOutputSpec `json:"spec,omitempty"` + Status DataOutputStatus `json:"status,omitempty"` + } + + type DataOutputSpec struct { + // Resource specifies the reference of specific resource, which points to a unique resource. + Resource ResourceReference `json:"resource"` + + // FieldPath specifies the field path of a given k8s-style resource. + FieldPath string `json:"fieldPath"` + } + + type ResourceReference struct { + APIVersion string `json:"apiVersion"` + + Kind string `json:"kind"` + + // Name indicates the name of the object in the same namespace. + Name string `json:"name,omitempty"` + } + + type DataOutputStatus struct { + // Ready indicates whether the field data of the source resource is ready. + Ready bool `json:"ready"` + } + ``` + + For example, a mysql connection secret data output would look like: + + ```yaml + apiVersion: core.oam.dev/v1alpha2 + kind: DataOutput + metadata: + name: mysql-conn-secret + spec: + resource: + apiVersion: alibaba.oam.crossplane.io/v1alpha1 + kind: RDSInstance + name: my-rds + fieldPath: "status.connSecret" + ``` + + The OAM runtime will automatically build a dependency between the `my-web-app` and `my-rds`. + It will wait until `my-rds` object's field `status.connSecret` has value, + and then creates `my-web-app` object with corresponding values passed to its fields. + +3. In order to integrate with OAM, we could model this feature as an OAM Trait as follows: + + ```yaml + components: + - componentName: my-web-app + traits: + - trait: + apiVersion: core.oam.dev/v1alpha2 + kind: DataOutput + metadata: + name: mysql-conn-secret + spec: + resource: + apiVersion: alibaba.oam.crossplane.io/v1alpha1 + kind: RDSInstance + name: my-rds + fieldPath: "status.connSecret" + ``` + + In the component object, we could set a convention that the name of the DataOutput object + will correspond to the name of the Parameter. + + ```yaml + kind: Component + metadata: + name: my-web-app + spec: + parameters: + - name: mysql-conn-secret + fieldPaths: ["spec.connSecret"] + ``` + + In this case, the OAM runtime will match the parameter with the DataOutput trait, and use it to build dependency graph. + +## Solution examples + +In the proposal section we walk through how the proposal solves the web and db use case. +In this section we are providing more solution examples for the rest of use cases: + +1. ML app: TODO. + +2. Service binding: TODO. + +3. NSQ: TODO. +