Skip to content

Latest commit

 

History

History
21 lines (18 loc) · 1.64 KB

README.md

File metadata and controls

21 lines (18 loc) · 1.64 KB

Lecture slides - here

Materials

  • Russian materials:
  • English materials:
    • Lecture by David Silver (english) - video part I, video part II
    • Alternative lecture by Pieter Abbeel (english) - video
    • Alternative lecture by John Schulmann (english) - video
    • Blog post on q-learning Vs SARSA - url

More materials

  • N-step temporal difference from Sutton's book - suttonbook chapter 7
  • Eligibility traces from Sutton's book - suttonbook chapter 12
  • Blog post on eligibility traces - url

Assignments

Just as usual, start with homework.ipynb Open In Colab For seminar, implement q-learning agent and test it on Taxi and CartPole with binarizer. And then, implement EV-SARSA agent, experience replay + bonus tasks for homework.