Normalized gym env #125

zhanpenghe · 2018-06-06T22:37:55Z

This is basically a rewrite of normalized_env and it conforms to the interface of gym.Env.

See issue #64

jonashen · 2018-06-06T23:41:56Z

rllab/envs/normalized_gym_env.py

+    else:
+        raise NotImplementedError
+
+


Maybe add bounds(), flatten_n_gym_space(), unflatten_gym_space(), and unflatten_n_gym_space()?

Or we can move these functions into a new helper class. I think that will be beneficial when porting the other rllab.Envs.

I agree these would be useful in something like rllab.env.utils

eric-heiden · 2018-06-07T02:05:10Z

rllab/envs/normalized_gym_env.py

+        raise NotImplementedError
+
+
+class NormalizedGymEnv(gym.Env, Serializable):


Consider inheriting from Wrapper because this will take care of the render, close, seed, etc. functions.

eric-heiden · 2018-06-07T02:15:45Z

rllab/envs/normalized_gym_env.py

+    def step(self, action):
+        if isinstance(self._env.action_space, gym.spaces.Box):
+            # rescale the action
+            lb, ub = self._env.action_space.bounds


I think the bounds can sometimes be (-np.inf, np.inf) so you shouldn't normalize in this case.

eric-heiden · 2018-06-07T02:24:20Z

rllab/envs/normalized_gym_env.py

+
+    def _apply_normalize_obs(self, obs):
+        self._update_obs_estimate(obs)
+        return (flatten_gym_space(obs, self._env.observation_space) -


For completeness, the returned obs should be "unflattened" again to be consistent with how the env behaved originally. You should probably support unflattening for Discrete, Box and Tuple spaces (given that your flatten function handles these cases).
Just in case this results in a significant computing overhead (benchmarking would help clarify), I would suggest you let the user set a constructor parameter, e.g. flatten_obs (False by default), to manually deactivate this costly unflattening.

eric-heiden · 2018-06-07T02:25:24Z

rllab/envs/normalized_gym_env.py

+            lb, ub = self._env.action_space.bounds
+            scaled_action = lb + (action + 1.) * 0.5 * (ub - lb)
+            scaled_action = np.clip(scaled_action, lb, ub)
+        else:


I think Discrete is also a common action space and should be handled here.

I think Discrete does not need to be scaled.

ryanjulian · 2018-06-07T17:40:44Z

rllab/envs/normalized_gym_env.py

+    else:
+        raise NotImplementedError
+
+


I agree these would be useful in something like rllab.env.utils

ryanjulian · 2018-06-07T17:41:25Z

tests/test_normalized_gym.py

@@ -0,0 +1,30 @@
+import gym
+from rllab.envs.normalized_gym_env import NormalizedGymEnv


PEP8: import grouping

Yes, I agree about the utils. I asked @jonashen to add that into his pr since he is doing the refactoring of gym.Env so let me reopen this after his pr.

ryanjulian

Since all envs will be gym.Envs as of #118, shouldn't this just replace normalized?

zhanpenghe · 2018-06-09T19:42:15Z

Yes, the normalized class will be deleted when pr #129 is done.

ryanjulian

Please make a Github issue to rename/replaced normalized with this once the gym.Env change has posted.

ryanjulian · 2018-06-11T04:54:05Z

Please reopen this PR against https://github.com/rlworkgroup/garage

zhanpenghe added 5 commits June 6, 2018 15:20

Add normalized gym env

604f3d2

yapf

cef11af

more yapf

a0326d1

more yapf

c0f2d2c

more yapf

47cff85

zhanpenghe requested review from ryanjulian, eric-heiden, jonashen and hejia-zhang June 6, 2018 22:40

jonashen reviewed Jun 6, 2018

View reviewed changes

jonashen mentioned this pull request Jun 7, 2018

Normalization for gym envs #64

Closed

eric-heiden requested changes Jun 7, 2018

View reviewed changes

ryanjulian reviewed Jun 7, 2018

View reviewed changes

ryanjulian added this to the Week of June 4th milestone Jun 7, 2018

zhanpenghe added 2 commits June 8, 2018 11:54

Add unflatten option

8b8bdb8

fix arg name

954d496

zhanpenghe requested a review from eric-heiden June 8, 2018 19:04

eric-heiden approved these changes Jun 8, 2018

View reviewed changes

zhanpenghe requested review from ryanjulian, jonashen, a user and CatherineSue June 8, 2018 23:24

ryanjulian approved these changes Jun 9, 2018

View reviewed changes

ryanjulian mentioned this pull request Jun 9, 2018

Replaced rllab.envs.Env with gym.Env #129

Open

zhanpenghe force-pushed the normalized_gym_env branch from 67b9ee5 to 954d496 Compare June 9, 2018 20:46

ryanjulian approved these changes Jun 9, 2018

View reviewed changes

zhanpenghe mentioned this pull request Jun 10, 2018

Cleanup normalized env #131

Closed

ryanjulian mentioned this pull request Jun 11, 2018

Cleanup normalized env rlworkgroup/garage#2

Closed

ghost approved these changes Jun 15, 2018

View reviewed changes

zhanpenghe closed this Jun 18, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Normalized gym env #125

Normalized gym env #125

zhanpenghe commented Jun 6, 2018 •

edited

Loading

jonashen Jun 6, 2018 •

edited

Loading

jonashen Jun 6, 2018

ryanjulian Jun 7, 2018

eric-heiden Jun 7, 2018

eric-heiden Jun 7, 2018

eric-heiden Jun 7, 2018

eric-heiden Jun 7, 2018

zhanpenghe Jun 8, 2018

ryanjulian Jun 7, 2018

ryanjulian Jun 7, 2018

zhanpenghe Jun 8, 2018

ryanjulian left a comment

zhanpenghe commented Jun 9, 2018

ryanjulian left a comment

ryanjulian commented Jun 11, 2018

		raise NotImplementedError


		class NormalizedGymEnv(gym.Env, Serializable):

		@@ -0,0 +1,30 @@
		import gym
		from rllab.envs.normalized_gym_env import NormalizedGymEnv

Normalized gym env #125

Normalized gym env #125

Conversation

zhanpenghe commented Jun 6, 2018 • edited Loading

jonashen Jun 6, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ryanjulian left a comment

Choose a reason for hiding this comment

zhanpenghe commented Jun 9, 2018

ryanjulian left a comment

Choose a reason for hiding this comment

ryanjulian commented Jun 11, 2018

zhanpenghe commented Jun 6, 2018 •

edited

Loading

jonashen Jun 6, 2018 •

edited

Loading