Oppai>///<


  • Home

  • Tags

  • Archives

  • Search

Reinforcement Learning NoteTag

Chapter13 Policy Gradient Methods

12-28

Chapter12 Eligibility Traces

12-24

Chapter11 Off-policy Methods with Approximation

12-19

Chapter10 On-policy Control with Approximation

12-15

Chapter09 On-policy Prediction with Approximation

12-08

Chapter08 Planning and Learning with Tabular Methods

12-07

Chapter07 n-step Bootstrapping

12-03

Chapter06 Temporal-Difference Learning

12-01

Chapter05 Monte Carlo Methods

11-28

Chapter04 Dynamic Programming

11-23
12
xingE650

xingE650

46 posts
7 tags
Helpful Link
  • numpy-reference
  • RL-book-code
  • latex-common-grammer
  • common-pretrained-models
© 2019 xingE650
Powered by Hexo
|
Theme — NexT.Gemini v5.1.4
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6