Suggestion for Neural Architecture Search with Reinforcement Learning #339
Conversation
We found a Contributor License Agreement for you (the sender of this pull request), but were unable to find agreements for all the commit author(s) or Co-authors. If you authored these, maybe you used a different email address in the git commits than was used to sign the CLA (login here to double check)? If these were authored by someone else, then they will need to sign a CLA as well, and confirm that they're okay with these being contributed to Google.
I signed it!
Force-pushed from d9b388c to a8709d0
CLAs look good, thanks!
/assign @YujiOshima
/assign @richardsliu @gaocegege |
/assign @johnugeorge |
pkg/suggestion/nasrl_service.py
Outdated
for w in worker_list:
    if w.Worker.status == api_pb2.COMPLETED:
        for ml in w.metrics_logs:
            if ml.name != self.objective_name:
If w.metrics_logs is like [objective_metrics, other_metrics], line 214 will return the other_metrics value (ml will be other_metrics).
How about:

    if ml.name == self.objective_name:
        self.logger.info("Evaluation result of previous candidate: {}".format(ml.values[-1].value))
        return float(ml.values[-1].value)

and keep the lines below at the same indent as line 204 (completed_count can be removed):

    self.logger.warning("Error. No trial has completed.")
    return None
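To make the suggested control flow concrete, here is a minimal, self-contained sketch of it. The class and constant are stand-ins: `SuggestionSketch`, `COMPLETED`, and the simplified method signature are hypothetical, while `worker_list`, `metrics_logs`, and `objective_name` follow the diff above.

```python
import logging

COMPLETED = "COMPLETED"  # stand-in for api_pb2.COMPLETED

class SuggestionSketch:
    """Hypothetical container mirroring the suggestion service's state."""

    def __init__(self, objective_name):
        self.objective_name = objective_name
        self.logger = logging.getLogger(__name__)

    def get_evaluation_result(self, worker_list):
        # Scan completed trials and return the objective metric of the
        # first one found; skip non-objective metrics entirely.
        for w in worker_list:
            if w.Worker.status == COMPLETED:
                for ml in w.metrics_logs:
                    if ml.name == self.objective_name:
                        self.logger.info(
                            "Evaluation result of previous candidate: {}".format(
                                ml.values[-1].value))
                        return float(ml.values[-1].value)
        # No completed trial yet: warn and return None so the caller can wait.
        self.logger.warning("Error. No trial has completed.")
        return None
```

With this shape, a non-objective metric appearing first in `metrics_logs` can no longer be returned by mistake.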
I believe w.metrics_logs will only return the one selected metric defined in https://github.com/kubeflow/katib/blob/master/examples/nasjob-example-RL.yaml#L12. The metrics collector parses the pattern <metrics_name>: <metrics_value> from the log of the training container and passes it to the suggestion.
Currently, the algorithm only spawns one trial at a time and uses the metrics of the very last trial to update. That is why I go through all the available trials and index the last one. In the future, the suggestion will be able to spawn multiple trials, so I think it is better to store all the metrics first and process the data afterwards. Currently the processing just takes the last value and returns it; in the future, it may take the average instead.
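The "store all metrics first, process afterwards" idea can be sketched as a collection step plus a pluggable reduction. All names here (`collect_results`, `reduce_last`, `reduce_mean`) are hypothetical, not from the actual codebase; the worker/metric shapes follow the diff above.

```python
def collect_results(completed_workers, objective_name):
    """Gather the objective value of every completed trial, in order."""
    results = []
    for w in completed_workers:
        for ml in w.metrics_logs:
            if ml.name == objective_name:
                results.append(float(ml.values[-1].value))
    return results

def reduce_last(results):
    # Current behavior: use only the most recent trial's metric.
    return results[-1] if results else None

def reduce_mean(results):
    # Possible future behavior once multiple trials run per step.
    return sum(results) / len(results) if results else None
```

Swapping the reduction function is then the only change needed when the controller starts consuming more than one trial per update.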
w.metrics_logs will return a collection of https://github.com/kubeflow/katib/blob/master/examples/nasjob-example-RL.yaml#L12 and https://github.com/kubeflow/katib/blob/master/examples/nasjob-example-RL.yaml#L15. Since only_latest_log=True, each of them only returns the last value, that is, ml.values[-1].value == ml.values[0].value.
You can verify this with a debugger.
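A tiny illustration of the only_latest_log=True behavior described above: each metrics log carries a single (latest) value, so indexing the first or last element is equivalent. The `SimpleNamespace` stand-in is hypothetical, mimicking the protobuf message shape.

```python
from types import SimpleNamespace

# With only_latest_log=True the collector keeps just one value per metric.
ml = SimpleNamespace(name="accuracy",
                     values=[SimpleNamespace(value="0.93")])

# First and last element are the same object, so either index works.
assert ml.values[-1].value == ml.values[0].value
```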
Yes, you are right! The program has been modified as suggested.
However, in my test run, writing it this way made the program repeatedly print "Error. No trial has completed." So I removed the return None and the error message, and everything works fine. It is OK not to return anything, so that GetEvaluationResult yields no result and the suggestion does not give any trials. And it is not an error if the status is not completed, so there is no need to handle that case.
pkg/suggestion/nasrl_service.py
Outdated
# However, if the user wants to minimize the metrics, we can take the negative of the result
if self.opt_direction == api_pb2.MINIMIZE:
    result = -result
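In isolation, the sign flip in this diff looks like the following sketch: the RL controller maximizes its reward, so a metric the user wants minimized is negated before being used as the reward. `MINIMIZE`, `MAXIMIZE`, and `as_reward` are hypothetical stand-ins for the api_pb2 enum values and the surrounding method.

```python
# Stand-ins for api_pb2.MAXIMIZE / api_pb2.MINIMIZE enum values.
MAXIMIZE = 0
MINIMIZE = 1

def as_reward(result, opt_direction):
    """Convert a metric value into a reward the controller maximizes."""
    if opt_direction == MINIMIZE:
        # Minimizing the metric == maximizing its negative.
        return -result
    return result
```

This way a lower loss produces a higher reward, and accuracy-style metrics pass through unchanged.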
Why delete the code above?
Sorry, I made a mistake. I should have deleted the code above that instead. Already fixed.
ok, thanks
/lgtm
/approve
[APPROVALNOTIFIER] This PR is APPROVED. This pull request has been approved by: hougangliu. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
Fixes: #338.
The first suggestion we are working on is reinforcement learning. The algorithm follows the paper Neural Architecture Search with Reinforcement Learning by Zoph & Le (https://arxiv.org/abs/1611.01578), and the implementation is based on the GitHub repository of Efficient Neural Architecture Search via Parameter Sharing (https://github.com/melodyguan/enas).
Related to #257