paint-brush
How to Solve the Problem of Imbalanced Datasetsby@modzy
183 reads

How to Solve the Problem of Imbalanced Datasets

by Modzy3mMay 26th, 2021
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Data imbalance refers to when the classes in a dataset are not equally distributed, which can then lead to potential risks in training a model. There are several methods to balancing training data and overcoming imbalanced data, including resampling and weight balancing. In a world where AI is proliferating, it is important that we place a particular focus on training data to reduce the risk of biased outputs. An imbalanced crime dataset would perpetuate racial and gender biases that exist in the dataset when using artificial intelligence to predict criminal behavior.

Company Mentioned

Mention Thumbnail
featured image - How to Solve the Problem of Imbalanced Datasets
Modzy HackerNoon profile picture
Modzy

Modzy

@modzy

A software platform for organizations and developers to responsibly deploy, monitor, and get value from AI - at scale.

L O A D I N G
. . . comments & more!

About Author

Modzy HackerNoon profile picture
Modzy@modzy
A software platform for organizations and developers to responsibly deploy, monitor, and get value from AI - at scale.

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite
Also published here
Twisave