Too Long; Didn't Read
These datasets can be used in numerous NLP tasks such as text classification, named entity recognition, machine translation, sentiment analysis, speech recognition, and topic modeling. They have been made public so that you can apply your skills and help solve different challenges. The Swahili news dataset contains more than 31,000 news articles from different news categories such as Local, International, Business or Financial, Health, Sports, and Entertainment. The Chichewa news dataset is organized into 19 categories such as education, law/order, politics, culture, arts and crafts, farming, economy, and wildlife.