Table of Links
- Background and Motivation
- PowerInfer-2 Overview
- Neuron-Aware Runtime Inference
- Execution Plan Generation
- Implementation
- Evaluation
- Related Work
- Conclusion and References
6 Implementation
PowerInfer-2 is developed on top of PowerInfer [30], a state-of-the-art serving framework designed for sparsely-activated LLMs, by integrating an additional 12K lines of C++ code. These enhancements cover several key areas, including the polymorphic neuron engine, the neuron cache, flexible neuron loading, and the neuron-cluster-level I/O pipeline.
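The paper does not include code for these subsystems, so the following C++ sketch is only a rough illustration of one plausible way they could fit together. Every type and member name below (NeuronCache, NeuronLoader, NeuronEngine, pipeline_step, and so on) is hypothetical, not PowerInfer-2's actual interface, and asynchronous flash I/O is reduced to a synchronous stub.

```cpp
#include <cstdint>
#include <unordered_map>
#include <utility>
#include <vector>

// All names below are hypothetical; PowerInfer-2's real interfaces are not public.
using NeuronId = uint32_t;

// Neuron cache: keeps hot neuron weights resident in memory.
class NeuronCache {
public:
    const std::vector<float>* lookup(NeuronId id) const {
        auto it = cache_.find(id);
        return it == cache_.end() ? nullptr : &it->second;
    }
    void insert(NeuronId id, std::vector<float> weights) {
        cache_[id] = std::move(weights);
    }
private:
    std::unordered_map<NeuronId, std::vector<float>> cache_;
};

// Flexible neuron loading: fetches neurons missing from the cache. Here a
// synchronous stub stands in for asynchronous reads from flash storage.
class NeuronLoader {
public:
    void load_cluster(const std::vector<NeuronId>& cluster, NeuronCache& cache) {
        for (NeuronId id : cluster)
            if (!cache.lookup(id))
                cache.insert(id, std::vector<float>(128, 0.0f));  // placeholder weights
    }
};

// Polymorphic neuron engine: in the real system this would pick a compute
// path (e.g., CPU cores vs. NPU) suited to the current inference phase;
// reduced here to a placeholder traversal.
class NeuronEngine {
public:
    void compute(const std::vector<NeuronId>& activated, const NeuronCache& cache) {
        for (NeuronId id : activated) (void)cache.lookup(id);
    }
};

// Neuron-cluster-level I/O pipeline: overlap loading of the next cluster with
// computation on the current one, so storage latency hides behind compute.
void pipeline_step(NeuronEngine& engine, NeuronCache& cache, NeuronLoader& loader,
                   const std::vector<NeuronId>& compute_now,
                   const std::vector<NeuronId>& needed_next) {
    loader.load_cluster(needed_next, cache);  // in the real system: async I/O
    engine.compute(compute_now, cache);
}
```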
Since PowerInfer-2 depends on privileged system APIs (e.g., mlock, which locks pages in memory) that require root permission, we built it on the Android [5] platform. Although no kernel modifications are needed, a rooted Android system still gives us considerable flexibility for developing and debugging. Furthermore, because PowerInfer-2 is designed to require no kernel changes, it is easily portable to other operating systems, including the iOS [14] platform.
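To make the mlock dependency concrete, here is a minimal sketch of pinning a buffer in physical memory via the POSIX mlock call; it shows illustrative use of the system call, not PowerInfer-2's actual cache code, and the buffer size is an arbitrary example. On Android (a Linux kernel), locking large regions typically fails without root or a raised RLIMIT_MEMLOCK.

```cpp
#include <sys/mman.h>
#include <cstddef>
#include <cstdio>
#include <cstdlib>

int main() {
    // Illustrative size only; a real neuron cache would be far larger.
    const std::size_t kBytes = 16 * 1024 * 1024;
    void* buf = std::malloc(kBytes);
    if (!buf) return 1;

    // Pin the buffer so the kernel cannot page it out. On Android this
    // usually needs root (or an increased RLIMIT_MEMLOCK limit).
    if (mlock(buf, kBytes) != 0) {
        std::perror("mlock");  // e.g., EPERM without root, ENOMEM over the limit
        std::free(buf);
        return 1;
    }

    // ... use the pinned region, e.g., as backing memory for cached neurons ...

    munlock(buf, kBytes);  // release the pin before freeing
    std::free(buf);
    return 0;
}
```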
The current implementation of PowerInfer-2 supports a diverse array of LLMs of varying sizes, including the Llama-2 family [27] (7B, 13B), TurboSparse-Mistral [31] (7B), and TurboSparse-Mixtral [31] (47B).
Authors:
(1) Zhenliang Xue, Co-first author from Institute of Parallel and Distributed Systems (IPADS), Shanghai Jiao Tong University;
(2) Yixin Song, Co-first author from Institute of Parallel and Distributed Systems (IPADS), Shanghai Jiao Tong University;
(3) Zeyu Mi, Institute of Parallel and Distributed Systems (IPADS), Shanghai Jiao Tong University ([email protected]);
(4) Le Chen, Institute of Parallel and Distributed Systems (IPADS), Shanghai Jiao Tong University;
(5) Yubin Xia, Institute of Parallel and Distributed Systems (IPADS), Shanghai Jiao Tong University;
(6) Haibo Chen, Institute of Parallel and Distributed Systems (IPADS), Shanghai Jiao Tong University.