How to Use ASR System for Accurate Transcription Properties of Your Digital Product
by @zilunpeng

by Georgian.io, April 7th, 2021 (8 min read)
Too Long; Didn't Read

Facebook’s wav2vec 2.0 lets you pre-train transcription systems on audio alone, with no corresponding transcription, and then fine-tune them on just a tiny transcribed dataset. LibriSpeech is one of the most commonly used datasets in speech research. In this blog, we share how we worked with wav2vec 2.0 and the results we obtained. We show the transcription for one audio sample in LibriSpeech's dev-clean set. In this example, the ASR system has inserted an “a”, transcribed “John” as “Jones”, and deleted the word “are” relative to the ground truth.
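
The three kinds of mistakes named above (an inserted word, a substituted word, and a deleted word) are the ingredients of word error rate (WER), a common way to score an ASR transcript against its ground truth. The sketch below is only an illustration, not the code used in this post: the function name `word_error_counts` is hypothetical, and the two sentences are made up to mimic the errors described, since the actual dev-clean sample is not reproduced here. It aligns the two word sequences with a standard edit-distance table and counts each error type.

```python
def word_error_counts(reference: str, hypothesis: str) -> dict:
    """Count substitutions, insertions and deletions between two transcripts.

    Hypothetical helper for illustration; uses word-level Levenshtein alignment.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    n, m = len(ref), len(hyp)
    # dp[i][j] = (total_errors, substitutions, insertions, deletions)
    # for aligning the first i reference words with the first j hypothesis words.
    dp = [[(0, 0, 0, 0)] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dp[i][0] = (i, 0, 0, i)  # every reference word deleted
    for j in range(1, m + 1):
        dp[0][j] = (j, 0, j, 0)  # every hypothesis word inserted

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            c, s, a, d = dp[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diagonal = (c, s, a, d)          # words match, no new error
            else:
                diagonal = (c + 1, s + 1, a, d)  # substitution
            c, s, a, d = dp[i][j - 1]
            insertion = (c + 1, s, a + 1, d)     # extra word in hypothesis
            c, s, a, d = dp[i - 1][j]
            deletion = (c + 1, s, a, d + 1)      # reference word missing
            # Pick the cheapest path; ties prefer the diagonal move.
            dp[i][j] = min(diagonal, insertion, deletion, key=lambda t: t[0])

    total, subs, ins, dels = dp[n][m]
    return {
        "substitutions": subs,
        "insertions": ins,
        "deletions": dels,
        "wer": total / n,  # errors divided by number of reference words
    }


# Made-up sentences that mimic the error pattern described in the summary.
ground_truth = "John said the dogs are here"
asr_output = "Jones said a the dogs here"
print(word_error_counts(ground_truth, asr_output))
```

For this made-up pair the alignment finds one substitution (“John” to “Jones”), one insertion (“a”) and one deletion (“are”), so the WER is 3 errors over 6 reference words, or 0.5.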

About Author

Georgian.io (@zilunpeng), fin tech company
