Run Llama Without a GPU! Quantized LLM with LLMWare and Quantized Dragon
by @shanglun
2,627 reads

By Shanglun Wang | 12 min read | 2024/01/07
Too Long; Didn't Read

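As GPU resources become more constrained, miniaturized and specialized LLMs are slowly gaining prominence. Today we explore quantization, a cutting-edge miniaturization technique that lets us run high-parameter models without specialized hardware.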
Quant, technologist, occasional economist, cat lover, and tango organizer.

STORY’S CREDIBILITY

Original Reporting

This story contains new, firsthand information uncovered by the writer.
