Chevron Left
返回到 Sample-based Learning Methods

学生对 University of Alberta 提供的 Sample-based Learning Methods 的评价和反馈

4.8
1,253 个评分

课程概述

In this course, you will learn about several algorithms that can learn near optimal policies based on trial and error interaction with the environment---learning from the agent’s own experience. Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior. We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning. We will wrap up this course investigating how we can get the best of both worlds: algorithms that can combine model-based planning (similar to dynamic programming) and temporal difference updates to radically accelerate learning. By the end of this course you will be able to: - Understand Temporal-Difference learning and Monte Carlo as two strategies for estimating value functions from sampled experience - Understand the importance of exploration, when using sampled experience rather than dynamic programming sweeps within a model - Understand the connections between Monte Carlo and Dynamic Programming and TD. - Implement and apply the TD algorithm, for estimating value functions - Implement and apply Expected Sarsa and Q-learning (two TD methods for control) - Understand the difference between on-policy and off-policy control - Understand planning with simulated experience (as opposed to classic planning strategies) - Implement a model-based approach to RL, called Dyna, which uses simulated experience - Conduct an empirical study to see the improvements in sample efficiency when using Dyna...

热门审阅

DP

Feb 14, 2021

Excellent course that naturally extends the first specialization course. The application examples in programming are very good and I loved how RL gets closer and closer to how a living being thinks.

DC

Aug 23, 2020

The material discussed is very clear, and the graded quizzes and programming assignments force you to really understand what you have just heard. I enjoyed this course a lot, and learned even more.

筛选依据:

76 - Sample-based Learning Methods 的 100 个评论(共 244 个)

创建者 Hannes

Sep 10, 2021

Enlightening explanations, well-structured content and challenging assignments. Very engaging course I thoroughly enjoyed!

创建者 Nathaniel W

Dec 23, 2020

Well done course that covers the different basic aspects of to do reinforcement learning and how models work into it.

创建者 Lik M C

Jan 9, 2020

Again, the course is excellent. The assignments are even better than Course 1. A really great course worth to take!

创建者 Zhang d

Apr 6, 2020

It is a wonderful and meanningful course, which can teach us the knowledge of Q-learning, expected Sarsa and so on.

创建者 Xingbei W

Mar 8, 2020

Although I have learned q learning and td, this course still give me a lot of new feeling and understanding on it.

创建者 Mathew

Jun 7, 2020

Very well structured and a great compliment to the Reinforcement Learning (2nd Edition) book by Sutton and Barto.

创建者 George M

Feb 24, 2021

Very well defined course.

Exercises are fairly challenging and provide useful intuition into common problems.

创建者 Alaaeldin Z

Dec 10, 2020

The course is amazing. The lectures are well organized. Quizes and assignments are very useful for learning.

创建者 maryam t

Nov 16, 2021

A very good course for understanding basic concepts of RL. It is not enough for doing projects with coding.

创建者 Stewart A

Sep 2, 2019

Great course! Lots of hands-on RL algorithms. I'm looking forward to the next course in the specialization.

创建者 Casey S S

Feb 11, 2021

I thought this was an excellent sequel, introducing the reader to the fundamental innovations of RL.

创建者 Martin P

May 30, 2020

A very interesting topic presented in an easy to consume form. It was fun learning with this course.

创建者 김한준

Apr 7, 2020

The course is spectacular! I've learned countless material on Reinforcement learning! Thank you!

创建者 Roberto M

Mar 28, 2020

The course is well organized and teachers provide a lot of examples to facilitate comprehension.

创建者 Chintan K

Jul 22, 2020

the course videos were short and precise , which makes the learning content fun and informative

创建者 Wang G

Oct 18, 2019

Very Nice Explanation and Assignment! Look forward the next 2 courses in this specialization!

创建者 Sodagreenmario

Sep 17, 2019

Great course, but there are still some little bugs that can be fixed in notebook assignments.

创建者 Floris v R

Jan 4, 2022

Very clear explanations in the videos, good tests & asignments. Complex stuff well explained

创建者 Chris D

Apr 18, 2020

Very good. Minor issues with inconsistency between parameter naming in different exercises.

创建者 Alden C

Nov 1, 2022

Compressing this much complication into such a tight package is a tremendous achievement.

创建者 Sirusala N S

Jul 30, 2020

The concepts were explained very clearly. The assignments were helpful in understanding.

创建者 高橋耕司

Oct 6, 2019

I made a lot of mistakes, but I learned a lot because of that.

It ’s a wonderful course.

创建者 Sérgio V C

Mar 15, 2021

A good course to learn the basics of Monte Carlo methods for RL, as well as TD-methods!

创建者 Jau-Jie Y

Jul 7, 2021

I am happy of the history talking of Barto and Sutton.

The others teachers were good.

创建者 Louis S

Jun 5, 2020

Excellent content. The fact that it follows Sutton and Barto's TextBook is a must.