Reinforcement learning 强化学习
阅读量:
[[ Reinforcement Learning 强化学习.canvas ]]
Terminology 术语
RL Observation and State 观察和状态
RL Exploration and Exploitation 探索和利用
RL Off-policy and On-policy 异策和同策
Markov Decision Process 马尔可夫决策过程
Algorithm 算法
Courses
Resources
- http://incompleteideas.net/book/RLbook2020.pdf
- http://huangc.top/2018/05/03/RL-2018/
- http://huangc.top/2018/05/12/RL2-2018/
#待整理笔记
反向链接
到头儿啦~