Companies Home Search Profile

Modern Reinforcement Learning: Deep Q Learning in PyTorch

Focused View

Phil Tabor

5:41:59

512 View
  • 1 - M1.pdf
  • 1 - What You Will Learn In This Course.mp4
    04:07
  • 2 - M2.pdf
  • 2 - Required Background software and hardware.mp4
    03:46
  • 3 - How to Succeed in this Course.mp4
    04:45
  • 4 - Agents Environments and Actions.mp4
    10:04
  • 4 - M4.pdf
  • 5 - M5.pdf
  • 5 - Markov Decision Processes.mp4
    11:30
  • 6 - M6.pdf
  • 6 - Value Functions Action Value Functions and the Bellman Equation.mp4
    08:10
  • 7 - M7.pdf
  • 7 - Model Free vs Model Based Learning.mp4
    03:34
  • 8 - M8.pdf
  • 8 - The ExploreExploit Dilemma.mp4
    05:27
  • 9 - M9.pdf
  • 9 - Temporal Difference Learning.mp4
    22:01
  • 10 - Dealing with Continuous State Spaces with Deep Neural Networks.mp4
    18:53
  • 10 - M10.pdf
  • 11 - M11.pdf
  • 11 - Naive Deep Q Learning in Code Step 1 Coding the Deep Q Network.mp4
    07:55
  • 12 - Naive Deep Q Learning in Code Step 2 Coding the Agent Class.mp4
    10:10
  • 13 - Naive Deep Q Learning in Code Step 3 Coding the Main Loop and Learning.mp4
    09:21
  • 14 - Naive Deep Q Learning in Code Step 4 Verifying the Functionality of Our Code.mp4
    02:13
  • 15 - Naive Deep Q Learning in Code Step 5 Analyzing Our Agents Performance.mp4
    02:42
  • 16 - Dealing with Screen Images with Convolutional Neural Networks.mp4
    03:52
  • 16 - M12.pdf
  • 17 - How to Read Deep Learning Papers.mp4
    07:15
  • 17 - M13.pdf
  • 18 - Analyzing the Paper.mp4
    20:33
  • 18 - human level control through deep reinforcement learning nature paper.zip
  • 19 - How to Modify the OpenAI Gym Atari Environments.mp4
    14:29
  • 19 - M15.pdf
  • 20 - How to Preprocess the OpenAI Gym Atari Screen Images.mp4
    02:55
  • 21 - How to Stack the Preprocessed Atari Screen Images.mp4
    03:26
  • 22 - How to Combine All the Changes.mp4
    01:30
  • 23 - How to Add Reward Clipping Fire First and No Ops.mp4
    04:49
  • 24 - How to Code the Agents Memory.mp4
    10:55
  • 24 - M16a.pdf
  • 25 - How to Code the Deep Q Network.mp4
    11:44
  • 25 - M16b.pdf
  • 26 - Coding the Deep Q Agent Step 1 Coding the Constructor.mp4
    07:48
  • 26 - M16c.pdf
  • 27 - Coding the Deep Q Agent Step 2 EpsilonGreedy Action Selection.mp4
    02:22
  • 28 - Coding the Deep Q Agent Step 3 Memory Model Saving and Network Copying.mp4
    04:24
  • 29 - Coding the Deep Q Agent Step 4 The Agents Learn Function.mp4
    07:54
  • 30 - Coding the Deep Q Agent Step 5 The Main Loop and Analyzing the Performance.mp4
    14:14
  • 31 - Analyzing the Paper.mp4
    15:39
  • 31 - deep reinforcement learning with double q learning.zip
  • 32 - Coding the Double Q Learning Agent and Analyzing Performance.mp4
    08:51
  • 32 - M18.pdf
  • 33 - Analyzing the Paper.mp4
    14:01
  • 33 - dueling network architectures for deep reinforcement learning the paper.zip
  • 34 - Coding the Dueling Deep Q Network.mp4
    03:21
  • 34 - M20.pdf
  • 35 - Coding the Dueling Deep Q Learning Agent and Analyzing Performance.mp4
    10:10
  • 35 - M21a.pdf
  • 36 - Coding the Dueling Double Deep Q Learning Agent and Analyzing Performance.mp4
    05:36
  • 36 - M21b.pdf
  • 37 - Implementing a Command Line Interface for Rapid Model Testing.mp4
    09:30
  • 37 - M22.pdf
  • 38 - Consolidating Our Code Base for Maximum Extensability.mp4
    18:32
  • 38 - M23.pdf
  • 39 - How to Test Our Agent and Watch it Play the Game in Real Time.mp4
    07:39
  • 40 - M24.pdf
  • 40 - Summarizing What Weve Learned.mp4
    04:40
  • 41 - Bonus Video Where to Go From Here.mp4
    01:12
  • 41 - M25.pdf
  • Description


    How to Turn Deep Reinforcement Learning Research Papers Into Agents That Beat Classic Atari Games

    What You'll Learn?


    • How to read and implement deep reinforcement learning papers
    • How to code Deep Q learning agents
    • How to Code Double Deep Q Learning Agents
    • How to Code Dueling Deep Q and Dueling Double Deep Q Learning Agents
    • How to write modular and extensible deep reinforcement learning software
    • How to automate hyperparameter tuning with command line arguments

    Who is this for?


  • Python developers eager to learn about cutting edge deep reinforcement learning
  • What You Need to Know?


  • Some College Calculus
  • Exposure To Deep Learning
  • Comfortable with Python
  • More details


    Description

    In this complete deep reinforcement learning course you will learn a repeatable framework for reading and implementing deep reinforcement learning research papers. You will read the original papers that introduced the Deep Q learning, Double Deep Q learning, and Dueling Deep Q learning algorithms. You will then learn how to implement these in pythonic and concise PyTorch code, that can be extended to include any future deep Q learning algorithms. These algorithms will be used to solve a variety of environments from the Open AI gym's Atari library, including Pong, Breakout, and Bankheist.


    You will learn the key to making these Deep Q Learning algorithms work, which is how to modify the Open AI Gym's Atari library to meet the specifications of the original Deep Q Learning papers. You will learn how to:

    • Repeat actions to reduce computational overhead

    • Rescale the Atari screen images to increase efficiency

    • Stack frames to give the Deep Q agent a sense of motion

    • Evaluate the Deep Q agent's performance with random no-ops to deal with model over training

    • Clip rewards to enable the Deep Q learning agent to generalize across Atari games with different score scales


    If you do not have prior experience in reinforcement or deep reinforcement learning, that's no problem. Included in the course is a complete and concise course on the fundamentals of reinforcement learning. The introductory course in reinforcement learning will be taught in the context of solving the Frozen Lake environment from the Open AI Gym.

    We will cover:

    • Markov decision processes

    • Temporal difference learning

    • The original Q learning algorithm

    • How to solve the Bellman equation

    • Value functions and action value functions

    • Model free vs. model based reinforcement learning

    • Solutions to the explore-exploit dilemma, including optimistic initial values and epsilon-greedy action selection

    Also included is a mini course in deep learning using the PyTorch framework. This is geared for students who are familiar with the basic concepts of deep learning, but not the specifics, or those who are comfortable with deep learning in another framework, such as Tensorflow or Keras. You will learn how to code a deep neural network in Pytorch as well as how convolutional neural networks function. This will be put to use in implementing a naive Deep Q learning agent to solve the Cartpole problem from the Open AI gym. 

    Who this course is for:

    • Python developers eager to learn about cutting edge deep reinforcement learning

    User Reviews
    Rating
    0
    0
    0
    0
    0
    average 0
    Total votes0
    Focused display
    In 2012 I received my PhD in experimental condensed matter physics from West Virginia University. Following that I was a dry etch process engineer for Intel Corporation, where I leveraged big data to make essential process improvements for mission critical products. After leaving Intel in 2015, I have worked as a contract and freelance deep learning and artificial intelligence engineer.
    Students take courses primarily to improve job-related skills.Some courses generate credit toward technical certification. Udemy has made a special effort to attract corporate trainers seeking to create coursework for employees of their company.
    • language english
    • Training sessions 41
    • duration 5:41:59
    • English subtitles has
    • Release Date 2022/11/17