Too Long; Didn't Read
This is part 2 of my hands-on course on reinforcement learning, which takes you from zero to HERO. Today we will learn about Q-learning, a classic RL algorithm born in the 90s. We will use the `Taxi-v3` environment from OpenAI Gym. In each episode the taxi starts at a random square, drives to the passenger’s location (one of four designated locations), picks up the passenger, drives to the passenger’s destination (another of those four locations), and then drops off the passenger.
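To give a first taste of what Q-learning does, here is a minimal sketch of its tabular update rule in plain Python, with no Gym dependency. The table is sized for `Taxi-v3` (500 discrete states, 6 actions). The helper names `make_q_table` and `q_update`, and the values of `alpha` and `gamma`, are assumptions chosen for illustration, not code from this course.

```python
# Sketch of the tabular Q-learning update, independent of any RL library.
# Table sizes match Taxi-v3 (500 states, 6 actions). The function names and
# the alpha/gamma defaults are illustrative assumptions.

N_STATES, N_ACTIONS = 500, 6


def make_q_table(n_states=N_STATES, n_actions=N_ACTIONS):
    """Return a table of Q-values, one row per state, initialised to zero."""
    return [[0.0] * n_actions for _ in range(n_states)]


def q_update(q, state, action, reward, next_state, done, alpha=0.1, gamma=0.9):
    """Apply one Q-learning step and return the updated Q(state, action).

    Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    # A terminal transition has no future value to bootstrap from.
    best_next = 0.0 if done else max(q[next_state])
    td_target = reward + gamma * best_next
    q[state][action] += alpha * (td_target - q[state][action])
    return q[state][action]


q = make_q_table()

# One ordinary step: Taxi-v3 gives a reward of -1 for each time step.
step_value = q_update(q, state=0, action=2, reward=-1, next_state=1, done=False)

# One terminal step: a successful drop-off is rewarded with +20.
final_value = q_update(q, state=1, action=5, reward=20, next_state=0, done=True)

print(step_value, final_value)
```

In a real training loop, `state`, `reward`, `next_state` and `done` would come from stepping the environment, and the action would usually be picked with an exploration strategy such as epsilon-greedy.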