[Feature] Max Value Writer #1622

Merged

vmoens merged 22 commits into pytorch:main from PyTorchRL:max_val_writer

Oct 18, 2023

Contributor

albertbou92 commented Oct 10, 2023 •

Description

A Writer class for composable replay buffers that keeps the top elements based on some ranking key.
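The idea of keeping only the top-ranked elements can be illustrated with a min-heap. The following is a minimal pure-Python sketch, not the code from this PR: the class name `TopKKeeper`, its `add` signature, and the dict-based storage are all hypothetical, and ties with the current minimum are assumed to be rejected.

```python
import heapq


class TopKKeeper:
    """Hypothetical stand-in for a writer that keeps the k highest-ranked items."""

    def __init__(self, capacity):
        self.capacity = capacity
        self._heap = []    # min-heap of (rank, slot) pairs
        self.storage = {}  # slot -> stored item

    def add(self, rank, item):
        """Store `item` if it ranks in the top `capacity`.

        Returns the slot that was written, or None if the item was rejected.
        """
        if len(self._heap) < self.capacity:
            # Buffer not full yet: take the next free slot.
            slot = len(self._heap)
            heapq.heappush(self._heap, (rank, slot))
        elif rank > self._heap[0][0]:
            # Evict the lowest-ranked item and reuse its slot.
            _, slot = self._heap[0]
            heapq.heapreplace(self._heap, (rank, slot))
        else:
            # Not better than the current minimum (assumption: ties are rejected).
            return None
        self.storage[slot] = item
        return slot
```

The heap root is always the lowest stored rank, so each insertion costs O(log k) and a rejected item costs O(1).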

Motivation and Context


I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes


Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Example (update in the folder of examples)

Checklist


I have read the CONTRIBUTION guide (required)
My change requires a change to the documentation.
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.

albertbou92 added 3 commits

October 10, 2023 12:48


          max value writer

a8b29ac


          max value writer

c18db48


          max value writer

3d6d9eb

facebook-github-bot added the CLA Signed label

albertbou92 changed the title ~~[Feature] Max Value Writer~~ [Feature, WIP] Max Value Writer


          default key

6f07af1

vmoens reviewed

Contributor

vmoens left a comment

Looks promising, thanks a lot for this!
Can we add it to the docs?

torchrl/data/replay_buffers/writers.py Outdated

Comment on lines 155 to 156

    for sample in data:
        self.add(sample)

Contributor

vmoens Oct 10, 2023

Is this efficient when data is a TensorDict?
Shouldn't we batch these ops?

Contributor Author

albertbou92 Oct 11, 2023 •

How? If we need to check whether the new values are higher than the currently stored values, we have to check them one by one.

Maybe we can pre-sort the td values or something similar to make it a bit more efficient.
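One way the pre-sorting idea could work is sketched below in plain Python. This is an illustration only, not the batched extension that was later committed: the function name `select_batch` and its arguments are hypothetical. Candidates are visited from highest to lowest rank, so the loop can stop at the first candidate that fails to beat the lowest stored rank.

```python
import heapq


def select_batch(stored_ranks, batch_ranks, capacity):
    """Return the indices of batch items that would enter a top-`capacity` buffer."""
    heap = list(stored_ranks)
    heapq.heapify(heap)  # heap[0] is the lowest stored rank
    kept = []
    # Visit candidates from highest to lowest rank.
    order = sorted(range(len(batch_ranks)), key=lambda i: batch_ranks[i], reverse=True)
    for i in order:
        rank = batch_ranks[i]
        if len(heap) < capacity:
            heapq.heappush(heap, rank)
            kept.append(i)
        elif rank > heap[0]:
            heapq.heapreplace(heap, rank)
            kept.append(i)
        else:
            # Remaining candidates rank no higher, and the stored minimum
            # never decreases once the buffer is full, so none can enter.
            break
    return sorted(kept)
```

With this ordering an accepted batch item is never evicted by a later item from the same batch, so the comparisons against the stored minimum end as soon as one candidate is rejected.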

vmoens added the enhancement label

albertbou92 added 6 commits

October 11, 2023 11:31


          added comments

a02c2d8


          added comments

2b12160

fix

bd32b3c


          add to docs

b6e1be8


          batched extension

e90f4f0


          test

b383615

albertbou92 changed the title ~~[Feature, WIP] Max Value Writer~~ [Feature] Max Value Writer

albertbou92 added 3 commits

October 17, 2023 10:57


          test

2aa659b


          test

ee93f20

fix

23e1888

vmoens approved these changes

Contributor

vmoens left a comment

LGTM, thanks for this! I left some suggestions.

torchrl/data/replay_buffers/writers.py Outdated

+                      rank_data = data.get("_data").get(self._rank_key)
+                      # Sum the rank key, in case it is a whole trajectory
+                      rank_data = rank_data.sum().item()

Contributor

vmoens Oct 18, 2023

Is this safe?
Maybe we should document the expected shapes for this class, e.g.

[B, T]

but not

[B1, B2, T]

Another option is to check the number of dimensions of the ranking key OR the name of the last dim of the input tensordict (which should be "time").

Not raising any exception and just doing a plain sum could lead to surprising results, I think.

Contributor Author

albertbou92 Oct 18, 2023 •

I added the first option. Since the ranking value has to be a single float we only allow data of the shape [] and [T] for the add method and [B] and [B, T] for the extend method. If data has a time dimension, we sum along it. If too many dimensions are provided, an error is raised.

I did not go for checking the dimension names because it seemed too restrictive. I don't think the time dimension is always labelled.
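The shape rule described in this comment can be sketched in plain Python, using nested lists in place of tensors. This is a hedged illustration rather than the merged code: the helper name `rank_values` and its `batched` flag are hypothetical, inputs are assumed to be non-empty and rectangular, and rejecting a bare scalar in the batched case is an assumption the comment does not state.

```python
def rank_values(rank_data, batched):
    """Reduce rank data to one float per stored item.

    batched=False (add):    accepts shape [] or [T]   -> returns one float
    batched=True  (extend): accepts shape [B] or [B, T] -> returns a list of B floats
    """

    def ndim(x):
        # Depth of nesting; assumes non-empty, rectangular nested lists.
        depth = 0
        while isinstance(x, (list, tuple)):
            depth += 1
            x = x[0]
        return depth

    n = ndim(rank_data)
    min_dims, max_dims = (1, 2) if batched else (0, 1)
    if not min_dims <= n <= max_dims:
        raise ValueError(
            f"rank data has {n} dimension(s), expected {min_dims} to {max_dims}"
        )

    if not batched:
        # [] -> the value itself; [T] -> sum over time.
        return float(sum(rank_data)) if n == 1 else float(rank_data)
    if n == 1:
        # [B] -> one value per batch element.
        return [float(r) for r in rank_data]
    # [B, T] -> sum over time for each batch element.
    return [float(sum(row)) for row in rank_data]
```

Raising on unexpected shapes, instead of summing over every dimension, avoids the surprising results raised earlier in this thread.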

Contributor

vmoens Oct 18, 2023

Not always, but mostly.
If you get your data from env.rollout or a collector, it will be labelled.
If you then store the data in a replay buffer, it will keep the tag.
But if you reshape the data or apply other transformations, the label could be lost.

albertbou92 added 7 commits

October 18, 2023 09:18


          comments feedback

466ee57


          comments feedback

7f6ca5f


          comments feedback

8ec741b


          comments feedback

36063dd


          comments feedback

da8eb3a


          comments feedback

846170c


          Merge branch 'main' into max_val_writer

faad875

albertbou92 requested a review from vmoens

October 18, 2023 14:43

vmoens approved these changes

Contributor

vmoens left a comment

LGTM thanks so much!

vmoens added 2 commits

October 18, 2023 18:23


          Update torchrl/data/replay_buffers/writers.py

48a7f31


          Update torchrl/data/replay_buffers/writers.py

9d00f10

vmoens merged commit 55d667e into pytorch:main

52 of 59 checks passed

vmoens deleted the max_val_writer branch

October 18, 2023 17:51

vmoens mentioned this pull request

[Feature] step_and_maybe_reset in env #1611

Merged

Labels

CLA Signed enhancement