Tour De Flags Maze solved by deep reinforcement learning (Q-learning) technique. The Tour De Flags maze game is similar to the classical Mouse/Cheese maze game, except that the mouse is replaced by an agent whose mission is to collect several flags before arriving to the target cell (were the "Cheese" used to be in the previous maze game). For simplicity sake we will assume that the agent always starts from cell (0,0) and the destination cell is always at the bottom right cell of the maze. A more elaborate description: http://www.samyzaf.com/ML/tdf/tdf.html
-
Notifications
You must be signed in to change notification settings - Fork 5
samyzaf/tdfmaze
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Tour De Flags Maze solved by deep reinforcement learning technique (Q-learning)
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published