paint-brush
Use Up-Sampling and Weights to Address Imbalance Data Problemby@ryan-yu
754 reads
754 reads

Use Up-Sampling and Weights to Address Imbalance Data Problem

by Ryan Yu2mMarch 24th, 2020
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

Imbalance data means the classes we want to predict are disproportional. Classes that make up a large proportion of the data are called majority classes. Those that make a smaller portion are minority classes. The true positive rate drops from 97% to 33% for class 1. Using balanced class weight improves recall from 33% to 96%, but incurs many false positive and precision decreases from 100% to 36%. Another approach is to apply up-sampling. This means we randomly sample with replacement from minority class to increase proportion of minority class.

Company Mentioned

Mention Thumbnail
featured image - Use Up-Sampling and Weights to Address Imbalance Data Problem
Ryan Yu HackerNoon profile picture
Ryan Yu

Ryan Yu

@ryan-yu

L O A D I N G
. . . comments & more!

About Author

Ryan Yu HackerNoon profile picture
Ryan Yu@ryan-yu

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite