Importing the Data
Before starting the analysis, import the required packages:
# Import basic libraries
import numpy as np
import pandas as pd
# Import visualization libraries
import seaborn as sns
import matplotlib.pyplot as plt
%matplotlib inline
Next, read the dataset into the working environment:
df = pd.read_excel('default_credit.xls')
df.head(5)
The output will be as follows:
Figure 6.2: Top five rows of the DataFrame
Check the metadata of the DataFrame:
# Getting metadata information about the dataset
df.info()
The output will be similar to the image shown below:
Figure 6.3: Information of the DataFrame
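As a small illustration of what df.info() summarizes, the sketch below builds a hypothetical three-row DataFrame and retrieves the same facts programmatically. The column names and values are invented for the example and are not taken from the credit dataset.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data; NOT the credit default dataset
toy = pd.DataFrame({
    "limit": [20000, 120000, 90000],
    "age": [24.0, np.nan, 34.0],  # one missing value on purpose
    "education": ["university", "graduate", "university"],
})

n_rows, n_cols = toy.shape  # number of rows and columns
non_null = toy.count()      # non-null entries per column
dtypes = toy.dtypes         # storage type of each column

toy.info()  # prints row count, per-column non-null counts, and dtypes
```

A column whose non-null count is lower than the number of rows, such as the invented age column here, contains missing values.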
Check the descriptive statistics for the numerical columns in the DataFrame:
df.describe().T
The output will be as follows:
Figure 6.4: Descriptive statistics of the DataFrame
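To make the transposed layout concrete, here is a minimal sketch on a hypothetical four-row DataFrame (the columns and values are invented, not drawn from the credit dataset). With default arguments, describe() summarizes only the numeric columns, and .T turns each of them into a row.

```python
import pandas as pd

# Hypothetical toy data; NOT the credit default dataset
toy = pd.DataFrame({
    "limit": [10000, 20000, 30000, 40000],
    "age": [25, 35, 45, 55],
    "education": ["a", "b", "a", "c"],  # non-numeric, skipped by default
})

summary = toy.describe().T  # one row per numeric column

print(summary.index.tolist())      # names of the summarized columns
print(summary.columns.tolist())    # names of the reported statistics
print(summary.loc["age", "mean"])  # look up a single statistic
```

Transposing is a readability choice: when a DataFrame has many columns, one row per column is easier to scan than one very wide table.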
Next, check for null values:
...