There is no machine learning project without data, so the first step in our analysis is to load the input file (titanic_small.csv) into AMLS. This is a simplified version of the Titanic dataset, which contains three features and one target variable:
- Features:
- pclass: The class in which the passenger traveled (values 1, 2, or 3 corresponding to 1st, 2nd, and 3rd class)
- sex: Passenger's gender (female or male)
- Age group: Infant, child, teenager, adult, elderly, or unknown
- Target variable:
- Survived: 1 if the passenger survived the shipwreck, 0 if they didn't.
To load the file, follow these steps:
- From the home page, click on DATASETS. You will see an empty list of datasets:
![](https://static.packt-cdn.com/products/9781789345377/graphics/assets/0b60a787-1009-48ff-aaae-5e9d7c3c6476.png)
- Click on +NEW to get a link to upload a local data file:
![](https://static.packt-cdn.com/products/9781789345377/graphics/assets/0a214150-da00-40b2-bd77-7662073dff79.png)
- Click on FROM LOCAL FILE and you will see the following dialog box:
![](https://static.packt-cdn.com/products/9781789345377/graphics/assets/2f55592d-79d7-4293-8a12-02493ab6975f.png)
- Click on Choose File and navigate...