[RLlib] Attention Net prep PR #2: Smaller cleanups. #12449

sven1977 · 2020-11-26T15:16:36Z

The current attention net trajectory view PR (#11729) is too large (>1000 lines added).
Therefore, I'm moving the smaller preparatory and cleanup changes into 3 pre-PRs, of which this is the second. Please review it only after #12447 has been merged.

Why are these changes needed?

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(


sven1977 · 2020-11-28T10:48:08Z

rllib/evaluation/collectors/simple_list_collector.py

@@ -133,7 +140,7 @@ def build(self, view_requirements: Dict[str, ViewRequirement]) -> \
                continue
            # OBS are already shifted by -1 (the initial obs starts one ts
            # before all other data columns).
-            shift = view_req.shift - \
+            shift = view_req.data_rel_pos - \


Renamed this because, in the upcoming PRs, it will support not just a single shift (int), but also:

- a list of ints (include not just one timestep in this view, but several)
- a range string, e.g. "-50:-1" (will be used by attention nets and Atari framestacking).
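To make the three planned forms concrete, here is a minimal sketch of how such a spec could be expanded into explicit timestep offsets. The helpers `expand_rel_pos` and `gather` are hypothetical and not part of RLlib; treating both ends of the range string as inclusive is also an assumption, since the comment above does not specify it.

```python
from typing import List, Sequence, Union

# The three forms mentioned above: a single int, a list of ints,
# or a "start:end" range string.
RelPos = Union[int, List[int], str]


def expand_rel_pos(spec: RelPos) -> List[int]:
    """Hypothetical helper (NOT RLlib API): expand a relative-position
    spec into an explicit list of timestep offsets."""
    if isinstance(spec, int):
        return [spec]
    if isinstance(spec, str):
        start_str, end_str = spec.split(":")
        start, end = int(start_str), int(end_str)
        if start > end:
            raise ValueError(f"Empty range in spec {spec!r}")
        # Assumption: both ends of the range are inclusive.
        return list(range(start, end + 1))
    return [int(pos) for pos in spec]


def gather(column: Sequence, t: int, spec: RelPos) -> list:
    """Collect the values of `column` at the requested offsets relative
    to timestep `t` (illustrative only)."""
    indices = [t + offset for offset in expand_rel_pos(spec)]
    if any(i < 0 or i >= len(column) for i in indices):
        raise IndexError("Requested offsets fall outside the column")
    return [column[i] for i in indices]
```

For example, under these assumptions `expand_rel_pos("-50:-1")` yields the 50 offsets from -50 to -1, which is the kind of window an attention net or a frame-stacking model would request.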

sven1977 · 2020-11-28T10:48:36Z

rllib/evaluation/collectors/simple_list_collector.py

@@ -52,17 +52,19 @@ def __init__(self, shift_before: int = 0):
        # each time a (non-initial!) observation is added.
        self.count = 0

-    def add_init_obs(self, episode_id: EpisodeID, agent_id: AgentID,
-                     env_id: EnvID, init_obs: TensorType,
+    def add_init_obs(self, episode_id: EpisodeID, agent_index: int,


Using agent_id instead of agent_idx here was a bug.

Also added timestep.


ericl · 2020-11-30T21:10:08Z

rllib/policy/view_requirement.py

@@ -29,7 +29,7 @@ class ViewRequirement:
    def __init__(self,
                 data_col: Optional[str] = None,
                 space: gym.Space = None,
-                 shift: Union[int, List[int]] = 0,
+                 data_rel_pos: Union[int, List[int]] = 0,


Why not keep it as shift? It seems intuitive.

sven1977 (Dec 1, 2020): I liked shift, too. The problem is that there will also be an abs_pos soon (see the attention net PRs), so I wanted to distinguish between these two concepts.

ericl · 2020-11-30T21:43:27Z

rllib/evaluation/rollout_worker.py

+                whether to create those new envs in remote processes instead of
+                in the current process. This adds overheads, but can make sense
+                if your envs are expensive to step/reset (e.g., for StarCraft).
+                Use this cautiously, overheads are significant!


sven1977 added 3 commits November 26, 2020 14:24

WIP.

b5a4bc1

Fix.

0437680

WIP.

86911d9

sven1977 changed the title ~~[RLlib] Attention Net prep PR #2: Smaller cleanups.~~ [ONHOLD RLlib] Attention Net prep PR #2: Smaller cleanups. Nov 26, 2020

sven1977 mentioned this pull request Nov 26, 2020

[RLlib] Attention Net prep PR #3. #12450

Merged

6 tasks

sven1977 added 5 commits November 26, 2020 16:37

Fix.

5e269c4

Fix.

787810d

WIP.

7113c32

Merge branch 'attention_nets_prep_0' into attention_nets_prep_2

2040c93

Fixes and LINT.

f79faf7

sven1977 added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Nov 27, 2020

Merge branch 'master' of https://github.com/ray-project/ray into atte…

b5a31b3

…ntion_nets_prep_2

sven1977 removed the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Nov 27, 2020

sven1977 changed the title ~~[ONHOLD RLlib] Attention Net prep PR #2: Smaller cleanups.~~ [RLlib] Attention Net prep PR #2: Smaller cleanups. Nov 28, 2020

sven1977 requested a review from ericl November 28, 2020 10:42

sven1977 assigned ericl Nov 28, 2020

Merge branch 'master' of https://github.com/ray-project/ray into atte…

bc084a2

…ntion_nets_prep_2

sven1977 commented Nov 28, 2020


sven1977 added tests-ok The tagger certifies test failures are unrelated and assumes personal liability. and removed tests-ok The tagger certifies test failures are unrelated and assumes personal liability. labels Nov 28, 2020

Merge branch 'master' of https://github.com/ray-project/ray into atte…

7241d82

…ntion_nets_prep_2

ericl reviewed Nov 30, 2020


ericl approved these changes Nov 30, 2020


ericl added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Nov 30, 2020

ericl reviewed Nov 30, 2020


sven1977 merged commit 3ad9365 into ray-project:master Dec 1, 2020

sven1977 deleted the attention_nets_prep_2 branch March 27, 2021 11:39
