Using Optuna to Search for Tiny RL Policies
by @jfpettit


Too Long; Didn't Read

I used the Optuna framework to search for tiny RL policies in an environment, and I chose CMA-ES as the optimization method so that the search would find a solution faster. I used Optuna to directly optimize each parameter in the policy's weight array. This approach scales really poorly, since Optuna is designed to optimize hyperparameters and suggests one number at a time. Check out my code at this link if you’re interested.
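
To make the "one number at a time" point concrete, here is a minimal sketch of the idea. It is not the code linked above. The toy 1-D balancing task, the weight range of [-2, 2], and the names `rollout`, `objective`, `run_search`, and `N_WEIGHTS` are all assumptions made up for illustration. Only `trial.suggest_float`, `optuna.samplers.CmaEsSampler`, `optuna.create_study`, and `study.optimize` are real Optuna calls.

```python
N_WEIGHTS = 2    # one weight per observation dimension (position, velocity)
MAX_STEPS = 200  # episode length cap, chosen arbitrarily for this sketch


def rollout(weights, max_steps=MAX_STEPS):
    """Run one episode of a toy balancing task and return the total reward.

    The task is a hypothetical stand-in, not the article's environment:
    a point is repelled from the origin and the policy pushes it left or right.
    """
    pos, vel = 0.1, 0.0
    total = 0.0
    for _ in range(max_steps):
        # Linear policy: push right if w . s > 0, otherwise push left.
        score = weights[0] * pos + weights[1] * vel
        force = 1.0 if score > 0 else -1.0
        # Unstable dynamics: the further from the origin, the stronger the drift.
        acc = 0.5 * pos + force
        vel += 0.05 * acc
        pos += 0.05 * vel
        if abs(pos) > 1.0:
            break  # the point left the allowed region, so the episode ends
        total += 1.0
    return total


def objective(trial):
    """Optuna objective: suggest every policy weight individually, then score it."""
    weights = [trial.suggest_float(f"w{i}", -2.0, 2.0) for i in range(N_WEIGHTS)]
    return rollout(weights)


def run_search(n_trials=50, seed=0):
    """Search the weight space with Optuna's CMA-ES sampler."""
    # Imported here so the helpers above can be reused without Optuna installed.
    import optuna

    # CmaEsSampler relies on the separate `cmaes` package being installed.
    sampler = optuna.samplers.CmaEsSampler(seed=seed)
    study = optuna.create_study(direction="maximize", sampler=sampler)
    study.optimize(objective, n_trials=n_trials)
    return study.best_params, study.best_value
```

Calling `run_search()` returns the best weights found and their episode reward. The list comprehension in `objective` is where the scaling problem shows up: every weight becomes its own named Optuna parameter, so a policy with thousands of weights would need thousands of `suggest_float` calls per trial.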


Jacob Pettit
