polish(nyz): polish dqn and ppo comments #732
Conversation
""" | ||
# Data preprocessing operations, such as stack data, cpu to cuda device | ||
data = default_preprocess_learn( |
Perhaps the data preprocessing here could be explained in more detail, i.e. which operations are performed inside.
The detailed comments inside will be updated later.
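In reply to the question above, a minimal sketch of what such a preprocessing step typically covers: stacking a list of transition dicts into batched tensors and moving them to the training device. This is an illustration of the idea only, not DI-engine's actual `default_preprocess_learn`; the function name and field names below are assumptions.

```python
import torch
from typing import Dict, List


def preprocess_learn_sketch(data: List[Dict[str, torch.Tensor]], device: str = 'cpu') -> Dict[str, torch.Tensor]:
    """Illustrative only: stack per-transition fields into batched tensors and move them to ``device``."""
    batch = {}
    for key in data[0].keys():
        # Stack each field (obs, action, reward, ...) along a new batch dimension,
        # then transfer the batched tensor to the target device (e.g. cpu -> cuda).
        batch[key] = torch.stack([torch.as_tensor(d[key]) for d in data], dim=0).to(device)
    return batch


# Hypothetical usage with two toy transitions.
transitions = [
    {'obs': torch.randn(4), 'action': torch.tensor(1), 'reward': torch.tensor(0.5)},
    {'obs': torch.randn(4), 'action': torch.tensor(0), 'reward': torch.tensor(-0.1)},
]
batch = preprocess_learn_sketch(transitions, device='cuda' if torch.cuda.is_available() else 'cpu')
print({k: v.shape for k, v in batch.items()})
```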
R2D2 proposes that several tricks should be used to improve upon DRQN,
namely some recurrent experience replay tricks such as burn-in.
R2D2 proposes that several tricks should be used to improve upon DRQN, namely some recurrent experience replay \
tricks and the burn-in mechanism for off-policy training.
The R2D2 policy class is inspired by the paper "Recurrent Experience Replay in Distributed Reinforcement Learning". R2D2 suggests the incorporation of several enhancements over DRQN, specifically the application of novel recurrent experience replay strategies and the implementation of a burn-in mechanism for off-policy training.
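The burn-in mechanism mentioned here can be illustrated with a short, hedged sketch using a generic PyTorch LSTM (not DI-engine's R2D2 code): the first part of each stored sequence is replayed without gradients purely to warm up the recurrent hidden state, and only the remainder contributes to the loss. The sequence lengths and dimensions below are made up.

```python
import torch
import torch.nn as nn

# Made-up dimensions for illustration.
seq_len, burn_in, batch_size, obs_dim, hidden_dim = 40, 20, 8, 16, 32
rnn = nn.LSTM(obs_dim, hidden_dim)
obs_seq = torch.randn(seq_len, batch_size, obs_dim)  # one replayed sequence (time-major)

with torch.no_grad():
    # Burn-in phase: refresh the recurrent state, which may be stale in the replay buffer.
    _, hidden_state = rnn(obs_seq[:burn_in])

# Training phase: gradients only flow through the post-burn-in segment.
train_output, _ = rnn(obs_seq[burn_in:], hidden_state)
print(train_output.shape)  # torch.Size([20, 8, 32])
```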
- data (:obj:`List[Dict[str, Any]`): The trajectory data(a list of transition), each element is the same \
  format as the return value of ``self._process_transition`` method.
- transitions (:obj:`List[Dict[str, Any]`): The trajectory data (a list of transition), each element is \
  the same format as the return value of ``self._process_transition`` method.
Returns:
The trajectory data, which is a list of transitions. Each element is in the same format as the return value of the `self._process_transition` method.
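For concreteness, one element of that trajectory list might look like the hypothetical dict below; the exact keys returned by `_process_transition` depend on the concrete policy, so treat the field names as assumptions.

```python
import torch

# Hypothetical single transition; real keys depend on the concrete policy's _process_transition.
transition = {
    'obs': torch.randn(4),        # observation at time t
    'next_obs': torch.randn(4),   # observation at time t + 1
    'action': torch.tensor(1),    # action taken at time t
    'reward': torch.tensor(0.5),  # scalar reward
    'done': False,                # episode-termination flag
}
trajectory = [transition]  # get_train_sample-style methods receive a list of such dicts
```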
And the user can customize this data processing procedure by overriding these two methods and the collector \
itself.
- samples (:obj:`List[Dict[str, Any]]`): The processed train samples, each element is the similar format \
  as input transitions, but may contain more data for training, such as nstep reward and target obs.
"""
The processed training samples. Each element is similar in format to the input transitions, but may contain additional data for training, such as n-step reward and target observations.
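As a rough illustration of the "n-step reward" extra field mentioned above, the sketch below computes a discounted n-step return over a reward window; it is a generic example with assumed values, not DI-engine's actual n-step helper.

```python
import torch


def nstep_return_sketch(rewards: torch.Tensor, gamma: float = 0.99) -> torch.Tensor:
    # Discounted n-step return: sum_k gamma^k * r_{t+k} over the given window.
    discounts = gamma ** torch.arange(rewards.shape[0], dtype=rewards.dtype)
    return (discounts * rewards).sum()


# Hypothetical 3-step reward window collected after time t.
rewards = torch.tensor([1.0, 0.0, 0.5])
print(nstep_return_sketch(rewards))  # 1.0 + 0.99 ** 2 * 0.5 ≈ 1.49
```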
Description
Related Issue
TODO
Check List