#ai | Share How You Collect Data to Train Your AI, Win From $2500 in the AI Writing Contest | HackerNoon Writing Contests Announcements | Oct 31, 2024 | 2m
#llm-fine-tuning | YaFSDP - An LLM Training Tool That Cuts GPU Usage by 20% - Is Out Now | Yandex | Jun 22, 2024 | 4m
#reinforcement-learning | Objective Mismatch in Reinforcement Learning from Human Feedback: Acknowledgments, and References | The FeedbackLoop: #1 in PM Education | Jan 16, 2024 | 4m
#reinforcement-learning | Objective Mismatch in Reinforcement Learning from Human Feedback: Conclusion | The FeedbackLoop: #1 in PM Education | Jan 16, 2024 | 1m
#reinforcement-learning | The Iterative Deployment of RLHF in Language Models | The FeedbackLoop: #1 in PM Education | Jan 16, 2024 | 2m
#reinforcement-learning | Understanding Objective Mismatch | The FeedbackLoop: #1 in PM Education | Jan 16, 2024 | 7m
#reinforcement-learning | The Mechanics of Reward Models in RLHF | The FeedbackLoop: #1 in PM Education | Jan 16, 2024 | 2m
#reinforcement-learning | Related Work on Reinforcement Learning from Human Feedback | The FeedbackLoop: #1 in PM Education | Jan 16, 2024 | 3m
#reinforcement-learning | The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback | The FeedbackLoop: #1 in PM Education | Jan 16, 2024 | 4m