Modern Reinforcement Learning: Deep Q Learning in PyTorch

Focused View

Phil Tabor

5:41:59

512 View

1 - Introduction

1 - M1.pdf

1 - What You Will Learn In This Course.mp4

04:07

2 - M2.pdf

2 - Required Background software and hardware.mp4

03:46

3 - How to Succeed in this Course.mp4

04:45

2 - Fundamentals of Reinforcement Learning

4 - Agents Environments and Actions.mp4

10:04

4 - M4.pdf

5 - M5.pdf

5 - Markov Decision Processes.mp4

11:30

6 - M6.pdf

6 - Value Functions Action Value Functions and the Bellman Equation.mp4

08:10

7 - M7.pdf

7 - Model Free vs Model Based Learning.mp4

03:34

8 - M8.pdf

8 - The ExploreExploit Dilemma.mp4

05:27

9 - M9.pdf

9 - Temporal Difference Learning.mp4

22:01

3 - Deep Learning Crash Course

10 - Dealing with Continuous State Spaces with Deep Neural Networks.mp4

18:53

10 - M10.pdf

11 - M11.pdf

11 - Naive Deep Q Learning in Code Step 1 Coding the Deep Q Network.mp4

07:55

12 - Naive Deep Q Learning in Code Step 2 Coding the Agent Class.mp4

10:10

13 - Naive Deep Q Learning in Code Step 3 Coding the Main Loop and Learning.mp4

09:21

14 - Naive Deep Q Learning in Code Step 4 Verifying the Functionality of Our Code.mp4

02:13

15 - Naive Deep Q Learning in Code Step 5 Analyzing Our Agents Performance.mp4

02:42

16 - Dealing with Screen Images with Convolutional Neural Networks.mp4

03:52

16 - M12.pdf

4 - Human Level Control Through Deep Reinforcement Learning From Paper to Code

17 - How to Read Deep Learning Papers.mp4

07:15

17 - M13.pdf

18 - Analyzing the Paper.mp4

20:33

18 - human level control through deep reinforcement learning nature paper.zip

19 - How to Modify the OpenAI Gym Atari Environments.mp4

14:29

19 - M15.pdf

20 - How to Preprocess the OpenAI Gym Atari Screen Images.mp4

02:55

21 - How to Stack the Preprocessed Atari Screen Images.mp4

03:26

22 - How to Combine All the Changes.mp4

01:30

23 - How to Add Reward Clipping Fire First and No Ops.mp4

04:49

24 - How to Code the Agents Memory.mp4

10:55

24 - M16a.pdf

25 - How to Code the Deep Q Network.mp4

11:44

25 - M16b.pdf

26 - Coding the Deep Q Agent Step 1 Coding the Constructor.mp4

07:48

26 - M16c.pdf

27 - Coding the Deep Q Agent Step 2 EpsilonGreedy Action Selection.mp4

02:22

28 - Coding the Deep Q Agent Step 3 Memory Model Saving and Network Copying.mp4

04:24

29 - Coding the Deep Q Agent Step 4 The Agents Learn Function.mp4

07:54

30 - Coding the Deep Q Agent Step 5 The Main Loop and Analyzing the Performance.mp4

14:14

5 - Deep Reinforcement Learning with Double Q Learning

31 - Analyzing the Paper.mp4

15:39

31 - deep reinforcement learning with double q learning.zip

32 - Coding the Double Q Learning Agent and Analyzing Performance.mp4

08:51

32 - M18.pdf

6 - Dueling Network Architectures for Deep Reinforcement Learning

33 - Analyzing the Paper.mp4

14:01

33 - dueling network architectures for deep reinforcement learning the paper.zip

34 - Coding the Dueling Deep Q Network.mp4

03:21

34 - M20.pdf

35 - Coding the Dueling Deep Q Learning Agent and Analyzing Performance.mp4

10:10

35 - M21a.pdf

36 - Coding the Dueling Double Deep Q Learning Agent and Analyzing Performance.mp4

05:36

36 - M21b.pdf

7 - Improving On Our Solutions

37 - Implementing a Command Line Interface for Rapid Model Testing.mp4

09:30

37 - M22.pdf

38 - Consolidating Our Code Base for Maximum Extensability.mp4

18:32

38 - M23.pdf

39 - How to Test Our Agent and Watch it Play the Game in Real Time.mp4

07:39

8 - Conclusion

40 - M24.pdf

40 - Summarizing What Weve Learned.mp4

04:40

9 - Bonus Lecture

41 - Bonus Video Where to Go From Here.mp4

01:12

41 - M25.pdf

Description

How to Turn Deep Reinforcement Learning Research Papers Into Agents That Beat Classic Atari Games

What You'll Learn?

How to read and implement deep reinforcement learning papers
How to code Deep Q learning agents
How to Code Double Deep Q Learning Agents
How to Code Dueling Deep Q and Dueling Double Deep Q Learning Agents
How to write modular and extensible deep reinforcement learning software
How to automate hyperparameter tuning with command line arguments

Who is this for?

Python developers eager to learn about cutting edge deep reinforcement learning

What You Need to Know?

Some College Calculus

Exposure To Deep Learning

Comfortable with Python

More details

Description
In this complete deep reinforcement learning course you will learn a repeatable framework for reading and implementing deep reinforcement learning research papers. You will read the original papers that introduced the Deep Q learning, Double Deep QÂ learning, and Dueling Deep Q learning algorithms. You will then learn how to implement these in pythonic and concise PyTorch code, that can be extended to include any future deep Q learning algorithms. These algorithms will be used to solve a variety of environments from the Open AIÂ gym's Atari library, including Pong, Breakout, and Bankheist.

You will learn the key to making these Deep Q Learning algorithms work, which is how to modify the Open AIÂ Gym's Atari library to meet the specifications of the original Deep QÂ Learning papers. You will learn how to:
Repeat actions to reduce computational overhead
Rescale the Atari screen images to increase efficiency
Stack frames to give the Deep Q agent a sense of motion
Evaluate the Deep Q agent's performance with random no-ops to deal with model over training
Clip rewards to enable the Deep Q learning agent to generalize across Atari games with different score scales

If you do not have prior experience in reinforcement or deep reinforcement learning, that's no problem. Included in the course is a complete and concise course on the fundamentals of reinforcement learning. The introductory course in reinforcement learning will be taught in the context of solving the Frozen Lake environment from the Open AIÂ Gym.
We will cover:
Markov decision processes
Temporal difference learning
The original Q learning algorithm
How to solve the Bellman equation
Value functions and action value functions
Model free vs. model based reinforcement learning
Solutions to the explore-exploit dilemma, including optimistic initial values and epsilon-greedy action selection
Also included is a mini course in deep learning using the PyTorch framework. This is geared for students who are familiar with the basic concepts of deep learning, but not the specifics, or those who are comfortable with deep learning in another framework, such as Tensorflow or Keras. You will learn how to code a deep neural network in Pytorch as well as how convolutional neural networks function. This will be put to use in implementing a naive Deep Q learning agent to solve the Cartpole problem from the Open AIÂ gym.Â
Who this course is for:
Python developers eager to learn about cutting edge deep reinforcement learning

User Reviews

Rating

average 0

Total votes0

Focused display

In 2012 I received my PhD in experimental condensed matter physics from West Virginia University. Following that I was a dry etch process engineer for Intel Corporation, where I leveraged big data to make essential process improvements for mission critical products. After leaving Intel in 2015, I have worked as a contract and freelance deep learning and artificial intelligence engineer.

Udemy

View courses Udemy

Students take courses primarily to improve job-related skills.Some courses generate credit toward technical certification. Udemy has made a special effort to attract corporate trainers seeking to create coursework for employees of their company.