No Hidden Size List for ContinuousQAC? #826

MarkHolmstrom · 2024-08-29T08:16:57Z

Hello,

I am replicating a TD3 implementation in DI-engine and noticed that, for vector observations, the actor and critic hidden sizes must each be a single integer. Is there a reason the MLP cannot take a list of heterogeneous hidden layer sizes?

PaParaZz1 · 2024-09-03T08:48:39Z

In our TD3 experiments, we mainly tested performance on the classic MuJoCo environments. For these, the simple design is sufficient to achieve excellent performance. More complex networks also tend to need more elaborate normalization and network initialization techniques. That is why ContinuousQAC uses the current design.

If your environment has other requirements, you can use the ContinuousQAC class as a template and implement your own QAC network.
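As a rough illustration of that suggestion, here is a minimal sketch in plain PyTorch of an actor-critic network whose hidden layers are given as lists. The names `build_mlp`, `ListHiddenQAC`, `actor_hidden_sizes` and `critic_hidden_sizes` are hypothetical and are not part of DI-engine. The ReLU activations, the tanh-bounded actor output and the twin critics are assumptions based on a typical TD3 setup, not a copy of ContinuousQAC.

```python
from typing import List, Sequence

import torch
import torch.nn as nn


def build_mlp(in_dim: int, hidden_sizes: Sequence[int], out_dim: int) -> nn.Sequential:
    """Stack one Linear+ReLU pair per entry in hidden_sizes, then a linear output layer."""
    layers: List[nn.Module] = []
    prev = in_dim
    for width in hidden_sizes:
        layers += [nn.Linear(prev, width), nn.ReLU()]
        prev = width
    layers.append(nn.Linear(prev, out_dim))
    return nn.Sequential(*layers)


class ListHiddenQAC(nn.Module):
    """Hypothetical TD3-style actor-critic for vector observations (not a DI-engine class)."""

    def __init__(
        self,
        obs_dim: int,
        action_dim: int,
        actor_hidden_sizes: Sequence[int],
        critic_hidden_sizes: Sequence[int],
        twin_critic: bool = True,
    ) -> None:
        super().__init__()
        # Actor: obs -> action, squashed to [-1, 1] by tanh (assumed action range).
        self.actor = nn.Sequential(build_mlp(obs_dim, actor_hidden_sizes, action_dim), nn.Tanh())
        # Critics: concat(obs, action) -> scalar Q; TD3 normally uses two of them.
        n_critics = 2 if twin_critic else 1
        self.critics = nn.ModuleList(
            [build_mlp(obs_dim + action_dim, critic_hidden_sizes, 1) for _ in range(n_critics)]
        )

    def compute_actor(self, obs: torch.Tensor) -> torch.Tensor:
        return self.actor(obs)

    def compute_critic(self, obs: torch.Tensor, action: torch.Tensor) -> List[torch.Tensor]:
        x = torch.cat([obs, action], dim=-1)
        # One Q-value tensor of shape (batch,) per critic.
        return [critic(x).squeeze(-1) for critic in self.critics]


# Usage with heterogeneous layer widths (dimensions chosen arbitrarily).
model = ListHiddenQAC(
    obs_dim=17,
    action_dim=6,
    actor_hidden_sizes=[400, 300],
    critic_hidden_sizes=[256, 256, 128],
)
obs = torch.randn(4, 17)
action = model.compute_actor(obs)
q_values = model.compute_critic(obs, action)
```

To use such a network with DI-engine's TD3 policy, it would still need to match the call interface and return format that the policy expects from ContinuousQAC. The ContinuousQAC source is the reference for those details.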

PaParaZz1 added the discussion and algo labels Aug 30, 2024

PaParaZz1 closed this as completed Sep 4, 2024

PaParaZz1 mentioned this issue Sep 20, 2024

Roadmap for DI-engine #548

Open
