Chapter02 Multi-armed Bandits

Posted on 2018-11-14

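The k-armed bandit problem is arguably the simplest task in reinforcement learning, because it only involves choosing an action in a single state. This chapter gives a first look at the objective of reinforcement learning, how it is evaluated, and how it is trained.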
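As a rough illustration of "action selection in a single state", here is a minimal sketch of an epsilon-greedy agent with sample-average value estimates. The function name `run_bandit`, its parameters, and the unit-variance Gaussian rewards are assumptions made for this example; they are not taken from the post or from ShangtongZhang's code.

```python
import random


def run_bandit(true_means, steps=1000, epsilon=0.1, seed=0):
    """Epsilon-greedy on a k-armed bandit (hypothetical helper for illustration)."""
    rng = random.Random(seed)
    k = len(true_means)
    estimates = [0.0] * k  # estimated value of each arm
    counts = [0] * k       # number of times each arm was pulled

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            action = rng.randrange(k)
        else:
            # Exploit: pick the arm with the highest current estimate.
            action = max(range(k), key=lambda a: estimates[a])

        # Assumed reward model: Gaussian noise around the arm's true mean.
        reward = rng.gauss(true_means[action], 1.0)

        # Incremental sample-average update of the chosen arm's estimate.
        counts[action] += 1
        estimates[action] += (reward - estimates[action]) / counts[action]

    return estimates, counts


if __name__ == "__main__":
    estimates, counts = run_bandit([0.0, 1.0, 2.0], steps=500)
    print(estimates, counts)
```

There is only one state here, so the agent's whole job is to balance exploring arms against exploiting the arm that currently looks best.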

Chapter01 Tic-Tac-Toe

Posted on 2018-11-14

Based on ShangtongZhang's code, chapter01/tic_tac_toe.py

A Python implementation of Tic-Tac-Toe

1. Import modules and define the Tic-Tac-Toe constants

Chapter01 Introduction

Posted on 2018-11-12

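I hope that keeping this blog updated will help motivate me. My current plan is to read the introductory reinforcement learning book "Reinforcement Learning: An Introduction" and take reading notes. Below is the content of the Introduction chapter.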

some config on next theme

Posted on 2018-11-10

Last time I set up a personal blog with hexo+github; this time I made some configuration changes to suit my preferences.

my first try of hexo+github

Posted on 2018-11-09

Today I learned how to set up Hexo to write a blog and publish it to GitHub, but I still don't know Markdown. I'm still such a beginner QAQ.

Hello World

Posted on 2018-11-08

Welcome to Hexo! This is your very first post. Check documentation for more info. If you get any problems when using Hexo, you can find the answer in troubleshooting or you can ask me on GitHub.
