paint-brush
Multicollinearity and Its Importance in Machine Learningby@nikolao
11,178 reads
11,178 reads

Multicollinearity and Its Importance in Machine Learning

by Nikola O.4mJanuary 4th, 2022
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow
EN

Too Long; Didn't Read

Multicollinearity is a well-known challenge in multiple regression. The term refers to the high correlation between two or more explanatory variables, i.e. predictors. It can be an issue in machine learning, but what really matters is your specific use case. In many cases, multiple regression is used with the purpose of understanding something. For example, an ecologist might want to know what kind of environmental and biological factors lead to changes in the population size of chimpanzees. We think of machine learning algorithms as black boxes that need to predict, but that black box sometimes needs to be understood as well. That's when multicollinearity is an issue.

Company Mentioned

Mention Thumbnail
featured image - Multicollinearity and Its Importance in Machine Learning
Nikola O. HackerNoon profile picture
Nikola O.

Nikola O.

@nikolao

Combines ideas from data science, humanities and social sciences. Enjoys thinking, science fiction and design.

About @nikolao
LEARN MORE ABOUT @NIKOLAO'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Nikola O. HackerNoon profile picture
Nikola O.@nikolao
Combines ideas from data science, humanities and social sciences. Enjoys thinking, science fiction and design.

TOPICS

Languages

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite