Algorithm Problem #16

ShelyH · 2022-10-16T01:39:35Z

Hi, sorry to bother you again! This experiment uses the PPO algorithm to update the policy, but this is an online policy. Can I use an offline policy, such as SAC, to replace the PPO algorithm in my experiment?

Shuijing725 · 2022-10-16T19:32:51Z

Yes, I think so. Please let me know what results you get!

ShelyH · 2022-10-17T01:32:21Z

Sorry, so far I haven't learned a successful navigation strategy. I wonder if it is the way the policy is updated. Could you give me some suggestions? Thanks!

Shuijing725 · 2022-10-17T04:34:52Z

Without knowing your implementation details, I apologize that I cannot give very useful advice. The issue might be a bug in your code, unsuitable hyperparameters, or something else.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Algorithm Problem #16

Algorithm Problem #16

ShelyH commented Oct 16, 2022

Shuijing725 commented Oct 16, 2022

ShelyH commented Oct 17, 2022

Shuijing725 commented Oct 17, 2022

Algorithm Problem #16

Algorithm Problem #16

Comments

ShelyH commented Oct 16, 2022

Shuijing725 commented Oct 16, 2022

ShelyH commented Oct 17, 2022

Shuijing725 commented Oct 17, 2022