[BugFix] Account for terminating data in SAC losses #2606

vmoens · 2024-11-25T13:34:29Z

Stack from ghstack (oldest at bottom):

-> [BugFix] Account for terminating data in SAC losses #2606

[ghstack-poisoned]

pytorch-bot · 2024-11-25T13:34:33Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2606

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 17 New Failures, 1 Unrelated Failure

As of commit 18bf7f5 with merge base d90b9e3:

NEW FAILURES - The following jobs have failed:

Continuous Benchmark (PR) / CPU Pytest benchmark (gh)
FAILED ../../../../../../tmp/test_objectives_benchmarks.py::test_sac_speed[True-None] - torch._dynamo.exc.Unsupported: Graph break under GenericContextWrappingVariable
Continuous Benchmark (PR) / GPU Pytest benchmark (gh)
FAILED ../../../../tmp/test_objectives_benchmarks.py::test_sac_speed[True-None] - torch._dynamo.exc.Unsupported: Graph break under GenericContextWrappingVariable
Generate documentation / build-docs (3.10, 12.1) / linux-job (gh)
##[error]No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.
Libs Tests on Linux / unittests-gym (3.9, 12.1) / linux-job (gh)
##[error]fatal: couldn't find remote ref refs/pull/2606/merge
Libs Tests on Linux / unittests-sklearn (3.9, 12.1) / linux-job (gh)
##[error]fatal: couldn't find remote ref refs/pull/2606/merge
RLHF Tests on Linux / unittests (3.9, 12.1) / linux-job (gh)
##[error]fatal: couldn't find remote ref refs/pull/2606/merge
SOTA Tests on Linux / tests (3.9, 12.1) / linux-job (gh)
RuntimeError: Command docker exec -t 8dac7cfc5db38e1d67820c6303efeda4bf5502f6ce61268622773dd975b0276d /exec failed with exit code 1
Unit-tests on Linux / tests-cpu (3.10) / linux-job (gh)
test/test_cost.py::TestDiscreteSAC::test_discrete_sac_terminating[terminated2-done2-reward2-observation2-action2]
Unit-tests on Linux / tests-cpu (3.11) / linux-job (gh)
test/test_cost.py::TestDiscreteSAC::test_discrete_sac_terminating[terminated2-done2-reward2-observation2-action2]
Unit-tests on Linux / tests-cpu (3.12) / linux-job (gh)
test/test_cost.py::TestDiscreteSAC::test_discrete_sac_terminating[terminated2-done2-reward2-observation2-action2]
Unit-tests on Linux / tests-cpu (3.9) / linux-job (gh)
test/test_cost.py::TestDiscreteSAC::test_discrete_sac_terminating[terminated2-done2-reward2-observation2-action2]
Unit-tests on Linux / tests-cpu-oldget (3.12) / linux-job (gh)
test/test_cost.py::TestDiscreteSAC::test_discrete_sac_terminating[terminated2-done2-reward2-observation2-action2]
Unit-tests on Linux / tests-gpu (3.11, 12.1) / linux-job (gh)
test/test_cost.py::TestDiscreteSAC::test_discrete_sac_terminating[terminated2-done2-reward2-observation2-action2]
Unit-tests on Linux / tests-olddeps (3.8, 11.6) / linux-job (gh)
##[error]fatal: couldn't find remote ref refs/pull/2606/merge
Unit-tests on Linux / tests-optdeps (3.11, 12.1) / linux-job (gh)
test/test_cost.py::TestDiscreteSAC::test_discrete_sac_terminating[terminated2-done2-reward2-observation2-action2]
Unit-tests on Linux / tests-stable-gpu (3.10, 11.8) / linux-job (gh)
test/test_cost.py::TestDiscreteSAC::test_discrete_sac_terminating[terminated2-done2-reward2-observation2-action2]
Unit-tests on Windows / unittests-cpu / windows-job (gh)
##[error]fatal: couldn't find remote ref refs/pull/2606/merge

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Habitat Tests on Linux / tests (3.9, 12.1) / linux-job (gh) (trunk failure)
AttributeError: _ARRAY_API not found

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: dc1870292786c262b4ab6a221b3afb551e0efb9b Pull Request resolved: #2606

matteobettini · 2024-11-25T21:46:49Z

torchrl/objectives/sac.py

+                # Check done state and avoid passing these to the actor
+                done = next_tensordict.get(self.tensor_keys.done)
+                if done is not None and done.any():
+                    next_tensordict_select = next_tensordict[~done.squeeze(-1)]


The `done` shape can have more dimensions than the batch shape, so this line breaks in multi-agent settings.
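To illustrate the shape concern above, here is a minimal sketch using plain `torch` tensors rather than a TensorDict. The helper `batch_done_mask`, the shapes, and the choice to collapse the extra dimensions with `any` are assumptions made for illustration only; they are not the fix adopted in this PR, and collapsing over agents may not be the right semantics for every multi-agent setup.

```python
import torch


def batch_done_mask(done: torch.Tensor, batch_ndim: int) -> torch.Tensor:
    """Hypothetical helper: reduce `done` to the batch dims.

    Every dimension past the first `batch_ndim` dims is flattened and
    collapsed with `any`, so a batch element counts as done if any of its
    trailing entries (e.g. any agent) is done. This is an assumption.
    """
    if done.ndim > batch_ndim:
        done = done.flatten(batch_ndim).any(-1)
    return done


batch_size, n_agents, obs_dim = 4, 3, 5
obs = torch.randn(batch_size, obs_dim)  # stands in for a batch-level entry
done = torch.zeros(batch_size, n_agents, 1, dtype=torch.bool)
done[1, 0, 0] = True  # one agent in batch element 1 is done

# squeeze(-1) leaves a [4, 3] mask, which does not match the [4, 5] data.
try:
    obs[~done.squeeze(-1)]
    failed = False
except IndexError:
    failed = True

mask = batch_done_mask(done, batch_ndim=1)  # shape [4]
kept = obs[~mask]  # rows of batch elements that are not done
```

With a single-agent `done` of shape `[batch_size, 1]`, the same helper reduces to the `squeeze(-1)` behaviour in the diff, which is why the problem only shows up once `done` carries extra dimensions.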

Then we need a test that covers this use case!
Can you draft one for me?

The SOTA CI picked up on this. Both SAC scripts are failing.

Ah, I didn't see that (SOTA is broken because of Dreamer, so I didn't check).
We should have tests that are not in SOTA: SOTA is there to test that the scripts run smoothly, not to test features. The scripts are not part of the core lib; we can arbitrarily decide to ditch them, and the rest of the lib should still work.

Yeah, I have long wanted to write some tests for multi-agent data in losses; I will get to it when I have time.

Right now I'm just crunching on writing my thesis and satisfying BenchMARL users in my free time.

Update

18bf7f5

[ghstack-poisoned]

vmoens added a commit that referenced this pull request Nov 25, 2024

[BugFix] Account for terminating data in SAC losses

c1ad347

ghstack-source-id: dc1870292786c262b4ab6a221b3afb551e0efb9b Pull Request resolved: #2606

facebook-github-bot added the CLA Signed label Nov 25, 2024

vmoens merged commit 18bf7f5 into gh/vmoens/49/base Nov 25, 2024
40 of 58 checks passed

vmoens added a commit that referenced this pull request Nov 25, 2024

[BugFix] Account for terminating data in SAC losses

c8676f4

ghstack-source-id: dc1870292786c262b4ab6a221b3afb551e0efb9b Pull Request resolved: #2606

vmoens deleted the gh/vmoens/49/head branch November 25, 2024 13:35

vmoens mentioned this pull request Nov 25, 2024

[BUG] torchrl.objectives.SACLoss internal objective does not mask out terminated states #2590

Closed

3 tasks

matteobettini reviewed Nov 25, 2024


vmoens added the bug label Nov 25, 2024
