You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am currently unable to recreate the results of the stable transformer on the Pong environment. I believe from the paper the last 100 episode returns should be ~17.62 for this model and environment.
I am running the train program with arguments as specified in README for Best Performing Stable Transformer on Pong.
In train.py line 731 I changed ctx = mp.get_context("fork") to ctx = mp.get_context("spawn")
Hello,
I am currently unable to recreate the results of the stable transformer on the Pong environment. I believe from the paper the last 100 episode returns should be ~17.62 for this model and environment.
I am running the train program with arguments as specified in README for Best Performing Stable Transformer on Pong.
In train.py line 731 I changed
ctx = mp.get_context("fork")
toctx = mp.get_context("spawn")
The final results I obtained one one run:
Results from another run:
I am on Ubuntu 18.04.4, using Cuda 10.2, cudnn 7, torch 1.6.0.
Thanks in advance for any help.
Best,
Sean
The text was updated successfully, but these errors were encountered: