Too Long; Didn't Read
In this post, we look at how to encode categorical data so that it can be used as features for our ML models. Categorical variables hold label values (text) rather than numerical values, so we have to convert them to numbers before we can train an ML model. The two most popular encoding techniques are Ordinal Encoding and One-Hot Encoding. Other preprocessing techniques, such as feature scaling, feature selection and dimensionality reduction, are topics for another post… Let’s get started with encoding!
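As a quick preview of the two techniques named above, here is a minimal sketch in plain Python. The `colors` feature and the helper names `ordinal_encode` and `one_hot_encode` are hypothetical, chosen only for illustration, and the alphabetical ordering of categories is an assumption. In practice you would more likely use a library encoder such as scikit-learn's `OrdinalEncoder` and `OneHotEncoder`.

```python
# Hypothetical toy feature; the values are illustrative, not from a real dataset.
colors = ["red", "green", "blue", "green"]


def ordinal_encode(values):
    """Replace each label with an integer index (categories sorted alphabetically)."""
    categories = sorted(set(values))
    index = {label: i for i, label in enumerate(categories)}
    return [index[v] for v in values], categories


def one_hot_encode(values):
    """Replace each label with a binary vector holding a single 1 at its category's index."""
    categories = sorted(set(values))
    index = {label: i for i, label in enumerate(categories)}
    rows = []
    for v in values:
        row = [0] * len(categories)
        row[index[v]] = 1
        rows.append(row)
    return rows, categories


ordinal, categories = ordinal_encode(colors)
one_hot, _ = one_hot_encode(colors)

print(categories)  # ['blue', 'green', 'red']
print(ordinal)     # [2, 1, 0, 1]
print(one_hot)     # [[0, 0, 1], [0, 1, 0], [1, 0, 0], [0, 1, 0]]
```

Note the difference in shape: ordinal encoding keeps one column but implies an order between categories (here an arbitrary alphabetical one), while one-hot encoding adds one column per category and implies no order.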