paint-brush
profile-img

The beautiful humans of HackerNoon have collectively read @languagemodels's 11 stories for and 1 minutes.

#Interests

reinforcement-learning

in-context-learning

preference-learning

large-language-models

reward-functions

rlhf-efficiency

in-context-preference-learning

human-in-the-loop-rl

Related HackerNoon Humans: