强化学习
CS294-112 Deep Reinforcement Learning Sp17
prerequisites
Jan 30-Learning dynamical system models from data Levine
Jan 27 -Review section autodiff backpropagation optimization Finn
Jan 25-Optimal control and planning Levine
Jan 23-Supervised learning and decision making Levine
Jan 18-Introduction and course overview
Feb 8-RL definitions value iteration policy iteration Schulman
Feb 6-Guest lecture-Igor Mordatch OpenAI
Feb 15-Learning Q-functions Q-learning SARSA and others Schulman
Feb 13-Reinforcement learning with policy gradients Schulman
Feb 1-Learning policies by imitating optimal controllers Levine
CS294-131-02-27.mp4 177.7MB
CS294-112-04-26.mp4 36.3MB
CS294-112-04-24.mp4 333.0MB
CS294-112-04-19.mp4 528.0MB
CS294-112-04-12.mp4 249.8MB
CS294-112-04-10.mp4 249.2MB
CS294-112-04-03.mp4 221.1MB
CS294-112-03-22.mp4 305.6MB
CS294-112-03-20.mp4 194.8MB
CS294-112-03-15.mp4 321.4MB
CS294-112-03-13.mp4 291.9MB
CS294-112-03-08.mp4 210.1MB
CS294-112-03-06.mp4 235.3MB
CS294-112-03-01.mp4 264.6MB
CS294-112-02-27.mp4 297.9MB
CS294-112-02-22.mp4 262.5MB
CS294-112-02-15.mp4 248.1MB
CS294-112-02-15未校正.srt 0.1MB
CS294-112-02-13.mp4 187.7MB
CS294-112-02-13-未校正.srt 0.1MB
CS294-112-02-08.mp4 132.9MB
CS294-112-02-08-未校正.srt 0.1MB
CS294-112-02-06.mp4 234.4MB
CS294-112-02-06-未校正.srt 0.1MB
CS294-112-02-01.mp4 325.1MB
CS294-112-02-01-未校正.srt 0.1MB
CS294-112-01-30.mp4 538.0MB
CS294-112-01-30-未校正.srt 0.2MB
CS294-112-01-25.mp4 362.1MB
CS294-112-01-25-未校正.srt 0.1MB
CS294-112-01-18.srt 0.1MB
CS294-112-01-18.mp4 227.9MB
CS 294-112-04-05.mp4 336.0MB
Lecture 9 Markov Decision Processes II.srt 0.1MB
Lecture 9 Markov Decision Processes II.mp4 120.9MB
Lecture 8 MDPs I.srt 0.1MB
Lecture 8 MDPs I.mp4 120.1MB
网盘链接有效,可以访问
《强化学习|CS294-112 Deep Reinforcement Learning Sp17》来源于网盘资源爬虫采集。盘搜搜不复制、传播、储存任何网盘资源,也不提供资源下载服务,链接会跳转至百度网盘,资源的安全性与有效性请您自行辨别。