paint-brush
Using Optuna to Search for Tiny RL Policiesby@jfpettit
442 reads
442 reads

Using Optuna to Search for Tiny RL Policies

by Jacob Pettit7mApril 29th, 2021
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Using Optuna to Search for Tiny RL Policies, I used the Optuna framework to search for trivial policies in an environment. I decided to use CMA-ES1 as my optimization method to find a faster solution and find a solution faster and faster. I used Optuna directly optimize each parameter in the weight array. This approach scales really poorly, since Optuna is designed to optimize hyperparameters and it suggests one number at a time. Check out my code at this link if you’re interested.

Companies Mentioned

Mention Thumbnail
Mention Thumbnail
featured image - Using Optuna to Search for Tiny RL Policies
Jacob Pettit HackerNoon profile picture
Jacob Pettit

Jacob Pettit

@jfpettit

ML researcher, blogging about reinforcement learning, machine learning, and AI.

About @jfpettit
LEARN MORE ABOUT @JFPETTIT'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Jacob Pettit HackerNoon profile picture
Jacob Pettit@jfpettit
ML researcher, blogging about reinforcement learning, machine learning, and AI.

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite