[WIP] Add component type to spec #221

PhilippeMoussalli · 2023-06-20T15:18:49Z

PR that implement a component type per spec as discussed here to explicitly define the kubeflow/fondant input and output arguments per component type. This is still work in progress, current changes include adding the necessary changes to the schema and component spec script.

What still needs to be done:

Add type field to all the affected components
Add relevant tests

RobbeSneyders · 2023-06-21T09:26:11Z

Will close this for now. Feel free to reopen in the future.

@ChristiaensBert

PR for running the controlnet pipeline end-to-end on KFP. Some observations when doing the pipeline testing: - Tested with @ChristiaensBert VM and it runs really nice and much faster than the public clip service. - I could not test everything end to end locally since the GPU component are difficult to run locally -> switched to KFP to leverage the GPU VMs - I had to rebuild images using the build and tag images in the `scripts` folder. I think we still need to modify the script to enable only building specified components since it currently default to all components in the `components` directory which might take some time to build - The local runner does not seem to do the subset checking yet and we still need to expand the CLI to be able to use the kfp runner (currently not supported). Although the CLI is really nice overall :) - Pipeline runs fine and writes the dataset to the hub but fails at the end since it expects an output manifest. This can be resolved with this [ticket](#221). We should prioritize this. Notes: - Changed the segmentation to output a segmentation image instead of a segmentation array since that's the output expected for controlnet training Things to do: - Estimate how much the job would cost

@ChristiaensBert

PR for running the controlnet pipeline end-to-end on KFP. Some observations when doing the pipeline testing: - Tested with @ChristiaensBert VM and it runs really nice and much faster than the public clip service. - I could not test everything end to end locally since the GPU component are difficult to run locally -> switched to KFP to leverage the GPU VMs - I had to rebuild images using the build and tag images in the `scripts` folder. I think we still need to modify the script to enable only building specified components since it currently default to all components in the `components` directory which might take some time to build - The local runner does not seem to do the subset checking yet and we still need to expand the CLI to be able to use the kfp runner (currently not supported). Although the CLI is really nice overall :) - Pipeline runs fine and writes the dataset to the hub but fails at the end since it expects an output manifest. This can be resolved with this [ticket](#221). We should prioritize this. Notes: - Changed the segmentation to output a segmentation image instead of a segmentation array since that's the output expected for controlnet training Things to do: - Estimate how much the job would cost

PhilippeMoussalli added 3 commits June 20, 2023 16:58

Add component type to schema

f677ac1

Add component type to component spec

f943d38

modify component spec tests

60e3fa7

PhilippeMoussalli requested review from RobbeSneyders and GeorgesLorre June 20, 2023 15:18

RobbeSneyders closed this Jun 21, 2023

PhilippeMoussalli mentioned this pull request Jul 3, 2023

Large scale controlnet #260

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Add component type to spec #221

[WIP] Add component type to spec #221

PhilippeMoussalli commented Jun 20, 2023

RobbeSneyders commented Jun 21, 2023

[WIP] Add component type to spec #221

[WIP] Add component type to spec #221

Conversation

PhilippeMoussalli commented Jun 20, 2023

RobbeSneyders commented Jun 21, 2023