Anscombe’s Quartet And Importance of Data Visualization

Written by imsparsh | Published 2020/08/24
Tech Story Tags: machine-learning | data-science | artificial-intelligence | programming | python | data-visualization | exploratory-data-analysis | python3 | web-monetization

TLDR Anscombe’s Quartet comprises four data sets that have nearly identical simple descriptive statistics, yet have very different distributions and appear very different when plotted on scatter plots. It was constructed in 1973 by statistician Francis Anscombe to illustrate the importance of plotting the graphs before analyzing and model building, and the effect of other observations on statistical properties. The Linear Regression can be only be considered a fit for the data with linear relationships and is incapable of handling any other kind of datasets. All important features in the dataset must be visualised before implementing any machine learning algorithm on them.via the TL;DR App

no story

Written by imsparsh | Technologist. Programmer. Musician. Explorer.
Published by HackerNoon on 2020/08/24