Creating a structured data classifier to predict Titanic survivors
This model will predict whether a Titanic passenger will survive the sinking of the ship based on characteristics that have been extracted from the Titanic Kaggle dataset. Although luck was an important factor in survival, some groups of people were more likely to survive than others.
There are a train dataset and a test dataset in this dataset. Both are similar datasets that include passenger information such as name, age, sex, socioeconomic class, and so on.
The train dataset (train.csv
) contains details about a subset of the passengers on board (891, to be exact), revealing if they survived or not in the survived
column.
The test dataset (test.csv
) will be used in the final evaluation and contains similar information for the other 418 passengers.
AutoKeras will find patterns in the train data to predict whether these other 418 passengers on board (found in test.csv
) survived.
The full source code notebook...