Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[GSoC] Update
tune
API for LLM hyperparameters optimization (#2393)
* update tune api for llm hyperparameters optimization Signed-off-by: helenxie-bit <[email protected]> * resolve conflict Signed-off-by: helenxie-bit <[email protected]> * fix the problem of dependency Signed-off-by: helenxie-bit <[email protected]> * fix the format of import statement Signed-off-by: helenxie-bit <[email protected]> * adjust the blank lines Signed-off-by: helenxie-bit <[email protected]> * delete the trainer to reuse it in Training Operator Signed-off-by: helenxie-bit <[email protected]> * update constants Signed-off-by: helenxie-bit <[email protected]> * update metrics format Signed-off-by: helenxie-bit <[email protected]> * update the type of and Signed-off-by: helenxie-bit <[email protected]> * update the message of 'ImportError' Signed-off-by: helenxie-bit <[email protected]> * add TODO of PVC creation Signed-off-by: helenxie-bit <[email protected]> * update the name of pvc Signed-off-by: helenxie-bit <[email protected]> * reuse constants from Training Operator Signed-off-by: helenxie-bit <[email protected]> * keep 'parameters' and update validation Signed-off-by: helenxie-bit <[email protected]> * update for test Signed-off-by: helenxie-bit <[email protected]> * reuse 'get_container_spec' and 'get_pod_template_spec' from Training Operator Signed-off-by: helenxie-bit <[email protected]> * format with black Signed-off-by: helenxie-bit <[email protected]> * fix Lint error Signed-off-by: helenxie-bit <[email protected]> * fix Lint errors Signed-off-by: helenxie-bit <[email protected]> * delete types Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * update format Signed-off-by: helenxie-bit <[email protected]> * update format Signed-off-by: helenxie-bit <[email protected]> * fix e2e test error Signed-off-by: helenxie-bit <[email protected]> * add TODO Signed-off-by: helenxie-bit <[email protected]> * format with max line length Signed-off-by: helenxie-bit <[email protected]> * format docstring Signed-off-by: helenxie-bit <[email protected]> * update format Signed-off-by: helenxie-bit <[email protected]> * add helper functions Signed-off-by: helenxie-bit <[email protected]> * update format Signed-off-by: helenxie-bit <[email protected]> * update format Signed-off-by: helenxie-bit <[email protected]> * run test again Signed-off-by: helenxie-bit <[email protected]> * run test again Signed-off-by: helenxie-bit <[email protected]> * run test again Signed-off-by: helenxie-bit <[email protected]> * fix dict substitution in training_parameters Signed-off-by: helenxie-bit <[email protected]> * fix typo Signed-off-by: helenxie-bit <[email protected]> * resolve conflicts and add check for case of no parameters Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * fix flake8 error Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * update isort file to black and fix typo Signed-off-by: helenxie-bit <[email protected]> * modify the set of metrics format Signed-off-by: helenxie-bit <[email protected]> * update tune API Signed-off-by: helenxie-bit <[email protected]> * add types.TrainerResources class Signed-off-by: helenxie-bit <[email protected]> * fix flake8 error Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * resolve conflict Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * delete properties of 'TrainerResources' Signed-off-by: helenxie-bit <[email protected]> * fix format error Signed-off-by: helenxie-bit <[email protected]> * update types Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * add import of 'TrainerResources' in '__init__.py' of katib Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * revert changes and rerun tests Signed-off-by: helenxie-bit <[email protected]> * check pvc and pv status of katib deployments Signed-off-by: helenxie-bit <[email protected]> * check pvc and pv status of katib deployments Signed-off-by: helenxie-bit <[email protected]> * recommit changes Signed-off-by: helenxie-bit <[email protected]> * update minikube version when setup Signed-off-by: helenxie-bit <[email protected]> * delete the code that disables formatting for the tune function Signed-off-by: helenxie-bit <[email protected]> * update according to andrey's feedback Signed-off-by: helenxie-bit <[email protected]> * add helper function in utils Signed-off-by: helenxie-bit <[email protected]> * fix format Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * move metrics_collector_spec back & update helper functions & add return type for helper functions Signed-off-by: helenxie-bit <[email protected]> * rerun tests Signed-off-by: helenxie-bit <[email protected]> * fix some typos Signed-off-by: helenxie-bit <[email protected]> * simplify the definition of 'TrainerResources' Signed-off-by: helenxie-bit <[email protected]> --------- Signed-off-by: helenxie-bit <[email protected]>
- Loading branch information