Implementation of Data Preprocessing on Titanic Dataset
Too Long; Didn't Read
Machine learning model is supposed to predict who survived during the titanic shipwreck. Preprocessing is necessary to convert raw data into a clean data set and dataset must be converted to numeric data. Machine learning models need data for training to perform well, so we preserve the data and make use of it as much as possible. We use Python, Numpy, Pandas, Scikit and numpy to preprocess the data for machine learning models. We then split the data set into training and test set using scikit model selection.