paint-brush
10 Best African Language Datasets for Data Science Projectsby@davisdavid
974 reads
974 reads

10 Best African Language Datasets for Data Science Projects

by Davis David6mMay 29th, 2021
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

These datasets can be used in numerous NLP tasks such as text classification, named entity recognition, machine translation, sentiment analysis, speech recognition, and topic modeling. These datasets have been made public to give you an opportunity to use your skills and help solving different challenges. The Swahili news dataset contains more than 31,000 news articles from different news categories such as Local, International, Business or Financial, health, sports, and Entertainment. The Chichewa news dataset has been categorized into 19 categories such a. education, law/order politics, culture, arts and crafts, farming, economy, and wildlife.

Companies Mentioned

Mention Thumbnail
Mention Thumbnail
featured image - 10 Best African Language  Datasets for Data Science Projects
Davis David HackerNoon profile picture
Davis David

Davis David

@davisdavid

Data Scientist | AI Practitioner | Software Developer| Technical Writer

About @davisdavid
LEARN MORE ABOUT @DAVISDAVID'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Davis David HackerNoon profile picture
Davis David@davisdavid
Data Scientist | AI Practitioner | Software Developer| Technical Writer

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite