Test your knowledge
To test your knowledge, use the loan_data.csv dataset in the Chapter11/test_your_knowledge/data folder in this book's GitHub repository to fit ML classification models that predict the TARGET column. You will need to clean some of the columns (for example, converting strings to numbers) before using the data with ML algorithms. A TARGET value of 1 means a default or late payment on the loan, while 0 means no late payments or defaults. Examine the p-values from a logistic regression fit and see whether any features could potentially be dropped. Check the accuracy of each model you try and compare it to the majority class fraction, which is the accuracy you would get by always predicting the most common class. Write a short analysis of the results and your process.
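As a starting point, the cleaning step and the baseline comparison might look something like the following minimal sketch. It does not read loan_data.csv. Instead it builds a small synthetic stand-in, and the income and years_employed columns (and their string formats) are hypothetical, chosen only to illustrate converting strings to numbers. The real dataset's columns will differ, and the p-value step of the exercise is not covered here.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the loan data. Only the TARGET column name comes
# from the exercise; the other columns are hypothetical.
rng = np.random.default_rng(42)
n = 1000
income = rng.uniform(20_000, 100_000, n).round()
years = rng.integers(0, 11, n)
# Make defaults more likely at lower incomes so there is some signal to find.
p_default = 1 / (1 + np.exp((income - 40_000) / 10_000))
df = pd.DataFrame({
    "income": [f"${v:,.0f}" for v in income],         # strings like "$52,310"
    "years_employed": [f"{y} years" for y in years],  # strings like "3 years"
    "TARGET": rng.binomial(1, p_default),
})

# Cleaning: strip currency symbols and separators, then convert to numbers.
df["income"] = df["income"].str.replace(r"[$,]", "", regex=True).astype(float)
df["years_employed"] = (
    df["years_employed"].str.extract(r"(\d+)", expand=False).astype(int)
)

X = df.drop(columns="TARGET")
y = df["TARGET"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

# Baseline: the accuracy of always predicting the most common class.
majority_fraction = y_test.value_counts(normalize=True).max()

# Scale the features, then fit a logistic regression classifier.
model = make_pipeline(StandardScaler(), LogisticRegression())
model.fit(X_train, y_train)
accuracy = model.score(X_test, y_test)

print(f"Majority class fraction: {majority_fraction:.3f}")
print(f"Logistic regression accuracy: {accuracy:.3f}")
```

A model is only useful for this task if its test accuracy clearly exceeds the majority class fraction. With imbalanced targets, an accuracy that looks high can still be no better than always predicting 0.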