Source from :

POMCP 1.0

This is the release of POMCP used in the NIPS 2010 paper "Online Monte-Carlo Planning in Large POMDPs" by David Silver and Joel Veness

Added Code

We introduce human knowledge palyer into the game by introducing new modes for choosing of legal actions at each stage. We show this human knowledge can contribute to better policy design than that of the base line model

Future work

The current human knowledge works only on cells of distance one from the node we plan to introuc more complex examples
We plan to do more in depth experiments with less restrictions

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Source from :

Added Code

Future work

Files

README.md

Latest commit

History

README.md

File metadata and controls

Source from :

Added Code

Future work