see more

Intuitive RL: Intro to Advantage-Actor-Critic (A2C) by@rudygilman

79,952 reads

79,952 reads

Intuitive RL: Intro to Advantage-Actor-Critic (A2C)

by Rudy GilmanJanuary 9th, 2018

Read on Terminal Reader

Read this story w/o Javascript

EN

Too Long; Didn't Read

Reinforcement learning (RL) practitioners have produced a number of excellent tutorials. Most, however, describe RL in terms of mathematical equations and abstract diagrams. We like to think of the field from a different perspective. RL itself is inspired by how animals learn, so why not translate the underlying RL machinery back into the natural phenomena they’re designed to mimic? Humans learn best through stories.

featured image - Intuitive RL: Intro to Advantage-Actor-Critic (A2C)

Reinforcement learning (RL) practitioners have produced a number of excellent tutorials. Most, however, describe RL in terms of mathematical equations and abstract diagrams. We like to think of the field from a different perspective. RL itself is inspired by how animals learn, so why not translate the underlying RL machinery back into the natural phenomena they’re designed to mimic? Humans learn best through stories.

This is a story about the Actor Advantage Critic (A2C) model. Actor-Critic models are a popular form of Policy Gradient model, which is itself a vanilla RL algorithm. If you understand the A2C, you understand deep RL.

After you’ve gained an intuition for the A2C, check out:

Our simple code implementation of the A2C (for learning) or our industrial-strength PyTorch version based on OpenAI’s TensorFlow Baselines model
Barto & Sutton’s Introduction to RL, David Silver’s canonical course, Yuxi Li’s overview and Denny Britz’ GitHub repo for a deep dive in RL
fast.ai’s awesome course for intuitive and practical coverage of deep learning in general, implemented in PyTorch
Arthur Juliani’s tutorials on RL, implemented in TensorFlow.

Illustrations by @embermarke

Stellar

L O A D I N G
. . . comments & more!

About Author

Rudy Gilman@rudygilman

Read my stories

TOPICS

#machine-learning #reinforcement-learning #neural-networks #deep-learning #advantage-actor-critic

Languages

hackernoon-top-story

hackernoon-es

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave

Read on Terminal Reader

Read this story w/o Javascript

Join HackerNoon

Latest technology trends. Customized Experience. Curated Stories. Publish Your Ideas