405 讀數

直接偏好优化:你的语言模型其实是一个奖励模型

by
2024/08/25
featured image - 直接偏好优化:你的语言模型其实是一个奖励模型

About Author

Writings, Papers and Blogs on Text Models HackerNoon profile picture

We publish the best academic papers on rule-based techniques, LLMs, & the generation of text that resembles human text.

註釋

avatar

標籤

这篇文章刊登在

Related Stories