Issues Running MAPPO #38
Comments
@eugenevinitsky Could you help take a look?
Hi, sorry this bug is here! I am out today but this will be definitively fixed by tomorrow afternoon.
I believe the fixes that you have there are correct though.
Thanks for your patience, working on getting this merged but the relevant fixes are in:
Heads up though, that code has not been extensively hyper-parameter tuned.
No rush at all but let us know if this resolves your issue?
Hi @eugenevinitsky, everything is running now, thanks for the fixes. Just out of curiosity before we close this issue, what should the FPS be during training? I'm getting 25-30:
It's hard to say what the normal FPS is. It depends on lots of things. Could you provide more details, such as what machine you are using, what and how many CPU cores you have, what and how many GPUs you have, etc.?
Hey @roggirg, it depends on the number of rollout threads you're using and whether you are using a GPU or just CPU; the MAPPO code uses an RNN by default and includes the time for backprop when computing the FPS. Can you try increasing the value of algorithm.n_rollout_threads? It should basically scale linearly in the number of threads or workers.
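As a rough illustration of the point above, here is a minimal sketch of an FPS figure that counts backprop time in the denominator. The function name and all timings are hypothetical and are not taken from the MAPPO code. It also assumes an idealized case where rollouts run fully in parallel and the update time does not grow with the amount of collected data.

```python
def training_fps(n_rollout_threads, steps_per_thread, rollout_seconds, update_seconds):
    """Environment steps per second, counting both rollout and backprop time."""
    total_steps = n_rollout_threads * steps_per_thread
    return total_steps / (rollout_seconds + update_seconds)


# Hypothetical timings: wall-clock rollout time is assumed constant because
# the threads collect their steps in parallel.
fps_one_thread = training_fps(1, 100, rollout_seconds=3.0, update_seconds=1.0)
fps_four_threads = training_fps(4, 100, rollout_seconds=3.0, update_seconds=1.0)
print(fps_one_thread, fps_four_threads)
```

Under these assumptions the FPS scales linearly with the thread count; in practice the update step and any shared CPU/GPU contention would likely flatten the curve.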
Ah cool, thanks @eugenevinitsky @xiaomengy. I played around with n_rollout_threads=4 (did not know of its existence) and the FPS jumped up to ~50ish.
We're going to re-open this because that's a good deal slower than we expect it to be. @xiaomengy, any chance you could run the line
Hi Folks,
I'm trying to run "on-policy PPO" using
`python examples/on_policy_files/nocturne_runner.py algorithm=ppo`
and there are a couple of issues I'm encountering.
- `algo` vs. `algorithm`: The config.yml file uses `algorithm` whereas the script uses `cfg.algo`. Switching `algo` to `algorithm` seems to fix the issue.
- `wandb_name` seems to be missing from the cfg. To make it work, I just disabled use of wandb.
- `len(self.vehicles)` on line 30 throws `AttributeError: 'BaseEnv' object has no attribute 'vehicles'`. Replacing `self.vehicles` with `self.controlled_vehicles` seems to solve the issue. Is this the correct way to fix it?

Thanks for your help.
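For anyone hitting the same `algo` vs. `algorithm` mismatch before the fix lands, a defensive lookup along these lines may help. This is only a sketch: it is not the fix that was merged, the helper name `get_algorithm_cfg` is hypothetical, and `SimpleNamespace` stands in for the real config object.

```python
from types import SimpleNamespace


def get_algorithm_cfg(cfg):
    """Return the algorithm sub-config, accepting either key name."""
    for key in ("algorithm", "algo"):
        if hasattr(cfg, key):
            return getattr(cfg, key)
    raise AttributeError("config has neither 'algorithm' nor 'algo'")


# Stand-in configs (assumed shape): one uses the config.yml key, one the script's key.
cfg_new = SimpleNamespace(algorithm=SimpleNamespace(n_rollout_threads=4))
cfg_old = SimpleNamespace(algo=SimpleNamespace(n_rollout_threads=1))

print(get_algorithm_cfg(cfg_new).n_rollout_threads)
print(get_algorithm_cfg(cfg_old).n_rollout_threads)
```

Renaming `cfg.algo` to `cfg.algorithm` in the script, as described above, is the simpler change; the fallback is only useful if both spellings need to keep working.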