Lessons for Improving Training Performance — Part 1by@emwatz
976 reads

Lessons for Improving Training Performance — Part 1

tldt arrow
Read on Terminal Reader
Read this story w/o Javascript

Too Long; Didn't Read

Pure Storage published TensorFlow deep learning performance results in March. In Part 2 we’ll investigate how input pipelines affect overall training throughput. Performance gains came from ten months of application developments, not a single factor. With FP16 support, developers can take advantage of Tensor Cores present on Nvidia GPUs, trading lower precision for higher training throughput. With larger batch sizes, more samples are processed together, amortizing coordination work. The input pipeline during training, previously a performance limiter, is more efficient.

Companies Mentioned

Mention Thumbnail
Mention Thumbnail
featured image - Lessons for Improving Training Performance — Part 1
Emily Watkins HackerNoon profile picture

@emwatz

Emily Watkins

Receive Stories from @emwatz

react to story with heart

RELATED STORIES

L O A D I N G
. . . comments & more!
Hackernoon hq - po box 2206, edwards, colorado 81632, usa