[suggestion] Implement more algorithms #15

Closed
3 of 14 tasks
gaocegege opened this issue Apr 7, 2018 · 11 comments

Comments

@gaocegege
Member

gaocegege commented Apr 7, 2018

Katib has an extensible architecture and three search algorithms, thanks to @YujiOshima:

  • vizier-suggestion-random
  • vizier-suggestion-grid
  • vizier-suggestion-hyperband

We could implement more algorithms on top of this architecture, which would help us support more scenarios.

ref https://github.com/tobegit3hub/advisor#algorithms

  • Random Search Algorithm
  • 2x Random Search Algorithm
  • Grid Search Algorithm
  • Bayesian Optimization
  • Gaussian Process Bandit
  • Batched Gaussian Process Bandits
  • SMAC Algorithm
  • CMA-ES Algorithm
  • No Early Stop Algorithm
  • Early Stop First Trial Algorithm
  • Early Stop Descending Algorithm
  • Performance Curve Stop Algorithm
  • Median Stop Algorithm
  • Latin hypercube sampling (LHS)

/cc @ddutta

@libbyandhelen
Contributor

I am implementing the Bayesian Optimization algorithm in Python, but I have run into a question. Since I use one-hot encoding to handle the categorical parameters, and embed the integer and discrete parameters into a continuous space, the final suggestion is different from the values that need to be used for training. For example:
this is the study config:

configs=[
    api_pb2.ParameterConfig(
        name="param1",
        parameter_type=api_pb2.INT,
        feasible=api_pb2.FeasibleSpace(max="5", min="1", list=[]),
    ),
    api_pb2.ParameterConfig(
        name="param2",
        parameter_type=api_pb2.CATEGORICAL,
        feasible=api_pb2.FeasibleSpace(max=None, min=None, list=["cat1", "cat2", "cat3"]),
    ),
    api_pb2.ParameterConfig(
        name="param3",
        parameter_type=api_pb2.DISCRETE,
        feasible=api_pb2.FeasibleSpace(max=None, min=None, list=["3", "2", "6"]),
    ),
    api_pb2.ParameterConfig(
        name="param4",
        parameter_type=api_pb2.DOUBLE,
        feasible=api_pb2.FeasibleSpace(max="5", min="1", list=[]),
    ),
],

and this is the intermediate result generated by the algorithm, which needs to be used in the following iterations:

lowerbound [ 1.  0.  0.  0.  2.  1.]
upperbound [ 5.  1.  1.  1.  6.  5.]
[[ 2.    0.23  0.56  0.77  6.    4.5 ]]

and this is the final result, which is generated from the intermediate result (see the decoding sketch after this block):

parameter_set {
  name: "param1"
  parameter_type: INT
  value: "3"
}
parameter_set {
  name: "param2"
  parameter_type: CATEGORICAL
  value: "cat1"
}
parameter_set {
  name: "param3"
  parameter_type: DISCRETE
  value: "3"
}
parameter_set {
  name: "param4"
  parameter_type: DOUBLE
  value: "3.0"
}
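
For readers following along, here is a minimal decoding sketch, assuming the same 6-dimensional layout as the intermediate vector above (param1 as an integer, a 3-way one-hot block for param2, a continuous embedding for param3, and param4 as a double). It is illustrative only, not Katib's actual code, and the helper name decode is hypothetical:

import numpy as np

# Illustrative only: map a continuous suggestion vector back to the study's
# parameter types. Layout assumption: [param1, param2 one-hot (3 dims), param3, param4].
def decode(x):
    params = []
    # param1: INT in [1, 5] -> round to the nearest integer
    params.append(("param1", "INT", str(int(round(x[0])))))
    # param2: CATEGORICAL -> argmax over the one-hot block
    categories = ["cat1", "cat2", "cat3"]
    params.append(("param2", "CATEGORICAL", categories[int(np.argmax(x[1:4]))]))
    # param3: DISCRETE over {3, 2, 6} -> snap to the closest feasible value
    feasible = np.array([3.0, 2.0, 6.0])
    params.append(("param3", "DISCRETE", str(int(feasible[np.abs(feasible - x[4]).argmin()]))))
    # param4: DOUBLE in [1, 5] -> use the continuous value directly
    params.append(("param4", "DOUBLE", str(x[5])))
    return params

print(decode(np.array([2.0, 0.23, 0.56, 0.77, 6.0, 4.5])))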

So my question is: how can we store this intermediate result? Also, it would be nice if someone could point me to the scripts that call the suggestion services (like generate_trials), so that the workflow is clearer to me. @gaocegege @ddutta @YujiOshima

@YujiOshima
Contributor

YujiOshima commented Apr 11, 2018

@libbyandhelen Cool!
Ideally, the intermediate results should be saved in the DB.
But the Katib DB does not have such an interface.
So currently the suggestions store the intermediate information in their own memory.
I understand it's a big problem because it makes the services stateful.
SuggestTrial is called from trialIteration in the manager: https://github.com/kubeflow/hp-tuning/blob/master/manager/main.go#L106
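
For context, a minimal sketch of what "stateful in memory" means here; the class and method names are illustrative, not the real gRPC service:

# Illustrative only: intermediate BO state lives in a dict keyed by study_id,
# so it is lost if the suggestion service restarts -- which is why keeping it
# in memory makes the service stateful.
class BayesOptService:
    def __init__(self):
        self._state = {}  # study_id -> {"X": encoded params, "y": objective values}

    def SuggestTrial(self, study_id, completed_trials):
        state = self._state.setdefault(study_id, {"X": [], "y": []})
        for trial in completed_trials:
            state["X"].append(trial["encoded_params"])
            state["y"].append(trial["objective_value"])
        # ...a GP would be fit on state["X"], state["y"] here to propose new points...
        return state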

@libbyandhelen
Contributor

libbyandhelen commented Apr 11, 2018

@YujiOshima Thank you!
Just as you said, I stored all the necessary information in the suggestion service's memory, so the service can run and be tested independently now. The testing script uses Frank's function as an example.
So here are some further questions:

  1. I saw that there are some MySQL interfaces in kubeflow/db/interface.go. Are we going to use these to store the data in the database?
  2. If I understand correctly, the generate_trials function in the suggestion service uses the completed_trials to report the objective value to the service, right?
  3. Do you think it is time to make a pull request?

@gaocegege
Member Author

@libbyandhelen

Feel free to open a WIP PR (work-in-progress PR) and let us see your work, to make sure that you are on the right track.

@YujiOshima
Contributor

@libbyandhelen

I agree with @gaocegege, feel free to open a PR.

For your other questions:

  1. Yes. If we want to develop in Python, we need a similar interface to MySQL (a rough sketch follows below).
  2. It's not recommended. I want the suggestion services to get information about Trials from the DB, not from completed_trials and running_trials.
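
A rough sketch of what such a Python DB interface could look like, mirroring db/interface.go; the table and column names are assumptions, not the real Katib schema:

import pymysql

# Illustrative only: a thin Python wrapper over the same MySQL database the Go
# manager uses, so a Python suggestion service could read trial results from
# the DB instead of from completed_trials in the request.
class KatibDB:
    def __init__(self, host, user, password, database="vizier"):
        self.conn = pymysql.connect(host=host, user=user,
                                    password=password, database=database)

    def get_completed_trials(self, study_id):
        # "trials", "objective_value", and "status" are hypothetical names.
        with self.conn.cursor() as cur:
            cur.execute(
                "SELECT id, objective_value FROM trials "
                "WHERE study_id = %s AND status = 'completed'",
                (study_id,))
            return cur.fetchall()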

@gaocegege
Member Author

FYI I added some possible algorithms in #15 (comment)

@libbyandhelen
Contributor

I am now trying to add more kernels to the Gaussian process and more types of acquisition functions. In this case, the service itself needs some parameters, such as the kernel type and the acquisition function type. Does SetSuggestionParameters act as setting the parameters of the service itself?
@YujiOshima

@YujiOshima
Contributor

@libbyandhelen Yes, it is. The SuggestionParameters consist of key-value pairs (string Name : string Value).
You can parse the parameters like this: https://github.com/kubeflow/hp-tuning/blob/master/suggestion/hyperband_service.go#L92
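
In Python, the equivalent parsing could look like the sketch below; the parameter names (kernel_type, acquisition, burn_in) and their defaults are illustrative, not a fixed Katib contract:

# Illustrative only: SuggestionParameters arrive as name/value string pairs
# and are parsed into typed settings, falling back to defaults.
DEFAULTS = {"kernel_type": "matern", "acquisition": "ei", "burn_in": 10}

def parse_suggestion_parameters(param_list):
    """param_list: objects with .name and .value string fields."""
    settings = dict(DEFAULTS)
    for p in param_list:
        if p.name == "burn_in":
            settings["burn_in"] = int(p.value)
        elif p.name in settings:
            settings[p.name] = p.value
    return settings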

@libbyandhelen
Contributor

@YujiOshima Got it. I just pushed a commit to the pull request that adds one more GP kernel and two more acquisition functions. May I ask what the next step is, and how I should integrate it with the system?

gaocegege added the "help wanted" and "good first issue" labels on May 6, 2018
@Franky12

Hi guys, I'm exploring Katib and looking for a good first issue. Is there something I can start with?

@gaocegege
Member Author

Now I think we can close this issue, since we have already implemented many algorithms. @Franky12 Thanks for your interest. If you want to contribute to Katib, please have a look at our roadmap.
