I am replicating a TD3 implementation in DI-engine and noticed the actor and critic hidden sizes are forced to be integers for vector observations. Is there a reason for not allowing the MLP to have heterogeneous hidden layer sizes?
In our TD3 experiments, we mainly tested performance on the classic MuJoCo environments. For these tasks, the simple design is sufficient to achieve excellent performance. More complex networks also tend to need more elaborate normalization and network initialization techniques. For these reasons, we keep the current design for ContinuousQAC.
If your environment has other requirements, you can use the ContinuousQAC class as a template and implement your own QAC network.
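As a rough illustration of that suggestion, here is a minimal plain-PyTorch sketch of an actor and twin critics whose hidden layers can each have a different width. The names `build_mlp`, `HeteroQAC`, `compute_actor`, and `compute_critic`, along with the default sizes, are hypothetical and chosen only for this example. They are not DI-engine's API, and a network meant to plug into DI-engine's TD3 policy would still need to match whatever forward interface ContinuousQAC exposes.

```python
from typing import List, Sequence

import torch
import torch.nn as nn


def build_mlp(in_dim: int, hidden_sizes: Sequence[int], out_dim: int) -> nn.Sequential:
    """Stack Linear+ReLU layers with the given widths, then a linear output head."""
    layers: List[nn.Module] = []
    prev = in_dim
    for width in hidden_sizes:
        layers += [nn.Linear(prev, width), nn.ReLU()]
        prev = width
    layers.append(nn.Linear(prev, out_dim))
    return nn.Sequential(*layers)


class HeteroQAC(nn.Module):
    """Hypothetical actor plus twin critics; NOT DI-engine's ContinuousQAC interface."""

    def __init__(
        self,
        obs_dim: int,
        action_dim: int,
        actor_hidden: Sequence[int] = (400, 300),   # illustrative defaults only
        critic_hidden: Sequence[int] = (400, 300),  # illustrative defaults only
    ) -> None:
        super().__init__()
        # Tanh keeps actions in [-1, 1]; rescale outside if the env needs other bounds.
        self.actor = nn.Sequential(build_mlp(obs_dim, actor_hidden, action_dim), nn.Tanh())
        # TD3 trains two Q-networks and uses the smaller estimate for targets.
        self.critics = nn.ModuleList(
            [build_mlp(obs_dim + action_dim, critic_hidden, 1) for _ in range(2)]
        )

    def compute_actor(self, obs: torch.Tensor) -> torch.Tensor:
        return self.actor(obs)

    def compute_critic(self, obs: torch.Tensor, action: torch.Tensor) -> List[torch.Tensor]:
        x = torch.cat([obs, action], dim=-1)
        # One Q-value per sample from each critic, shape (batch,).
        return [critic(x).squeeze(-1) for critic in self.critics]
```

With this layout, something like `HeteroQAC(17, 6, actor_hidden=(256, 128), critic_hidden=(400, 300, 64))` gives the actor and the critics different depths and widths. As noted in the reply above, deeper or wider networks may need extra care with normalization and initialization, which this sketch leaves at PyTorch's defaults.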