paint-brush
profile-img

#Interests

reinforcement-learning

in-context-learning

preference-learning

large-language-models

reward-functions

rlhf-efficiency

in-context-preference-learning

human-in-the-loop-rl

Related HackerNoon Humans: