Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
CachedEnvWrapper
by @heiner that uses two queues to emulate a single environment. By making use of several threads, this allows to swap the underlying environment during resetting. Consequently, the tasks wherereset()
could be a bottleneck can be made substantially faster.Also added a script that compares the SPS of environments with or without the wrapper. On
MiniHack-MultiRoom-N2-Lava-v0
which features comparatively longreset()
* ,CachedEnvWrapper
speeds up the SPS approximately two times on a random policy.*This is due to the fact that it triggers a
reset()
on MiniGrid's side, recompiles the des-file, and generally has short episodes given all the lava tiles on the grid that instantly kill the agent.