Using Optuna to Search for Tiny RL Policies
by @jfpettit

Too Long; Didn't Read

I used the Optuna framework to search for tiny RL policies in an environment. I chose CMA-ES as the optimization method so that the search would find a solution faster. I had Optuna directly optimize each parameter in the weight array. This approach scales really poorly, since Optuna is designed to optimize hyperparameters and suggests one number at a time. Check out my code at this link if you’re interested.
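To make the "one number at a time" point concrete, here is a minimal sketch of what treating each policy weight as its own Optuna parameter might look like. This is not the code from the linked repository. The toy point-mass environment, the two-weight linear policy, and the names `suggest_weights`, `rollout`, and `run_search` are assumptions made up for illustration. Only `optuna.create_study`, `optuna.samplers.CmaEsSampler`, and `trial.suggest_float` are real Optuna calls, and the CMA-ES sampler also needs the `cmaes` package installed.

```python
import random

N_WEIGHTS = 2  # hypothetical: one weight per observation dimension


def suggest_weights(trial, n_weights=N_WEIGHTS, low=-1.0, high=1.0):
    """Ask Optuna for the policy weights one scalar at a time."""
    # One suggest_float call per weight: this is why the approach scales
    # poorly as the weight array grows.
    return [trial.suggest_float(f"w{i}", low, high) for i in range(n_weights)]


def rollout(weights, steps=50, seed=0):
    """Toy stand-in environment (an assumption, not the article's).

    The observation is (position, velocity) of a point mass. The linear
    policy pushes left or right depending on the sign of its output, and
    the return is the negative sum of |position| over the episode.
    """
    rng = random.Random(seed)
    pos, vel = rng.uniform(-1.0, 1.0), 0.0
    total = 0.0
    for _ in range(steps):
        score = weights[0] * pos + weights[1] * vel
        action = 1.0 if score > 0 else -1.0
        vel += 0.1 * action
        pos += 0.1 * vel
        total -= abs(pos)
    return total


def objective(trial):
    """Optuna objective: the episode return of the suggested policy."""
    return rollout(suggest_weights(trial))


def run_search(n_trials=100, seed=0):
    """Maximize the episode return using Optuna's CMA-ES sampler."""
    import optuna  # requires `pip install optuna cmaes`

    sampler = optuna.samplers.CmaEsSampler(seed=seed)
    study = optuna.create_study(direction="maximize", sampler=sampler)
    study.optimize(objective, n_trials=n_trials)
    return study.best_params, study.best_value


if __name__ == "__main__":
    best_params, best_value = run_search()
    print(best_params, best_value)
```

With only two weights this stays cheap, but every extra weight adds another named parameter to each trial, which is the scaling problem described above.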

Jacob Pettit

