The dataset for building the diabetic retinopathy detection application is obtained from Kaggle and can be downloaded from the following link: https://www.kaggle.com/c/classroom-diabetic-retinopathy-detection-competition/data.
Both the training and the holdout test datasets are present within the train_dataset.zip file, which is available at the preceding link.
We will use the labeled training data to build the model through cross-validation. We will evaluate the model on the holdout dataset.
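As a rough illustration of the cross-validation step, the following sketch splits sample indices into k folds, so that each fold serves once as the validation set while the remaining folds are used for training. The function name `k_fold_indices`, the choice of five folds, and the fixed seed are assumptions made for this example; the text does not specify them, and a real pipeline might prefer a stratified split that preserves class proportions.

```python
import random
from typing import Iterator, List, Tuple


def k_fold_indices(
    n_samples: int, k: int = 5, seed: int = 0
) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train_idx, val_idx) pairs for k-fold cross-validation.

    Hypothetical helper for illustration; k and seed are assumed values.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")

    # Shuffle once with a fixed seed so the folds are reproducible.
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)

    # Fold i takes every k-th shuffled index, so fold sizes differ by at most one.
    folds = [indices[i::k] for i in range(k)]

    for i in range(k):
        val_idx = folds[i]
        train_idx = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train_idx, val_idx


# Example: 10 labeled training samples split into 5 folds.
for fold, (train_idx, val_idx) in enumerate(k_fold_indices(n_samples=10, k=5)):
    print(fold, len(train_idx), len(val_idx))
```

The holdout dataset takes no part in these splits. It is set aside and used only for the final evaluation.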
Since we are dealing with class prediction, accuracy will be a useful validation metric. Accuracy is defined as follows:

$$\text{accuracy} = \frac{c}{N}$$
Here, c is the number of correctly classified samples, and N is the total number of evaluated samples.
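The accuracy definition, with c correctly classified samples out of N evaluated samples, can be sketched in a few lines of Python. The function name `accuracy` and the sample labels below are hypothetical. The integer grades 0 to 4 are assumed here only for illustration.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Return c / N, the fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("cannot compute accuracy on zero samples")

    c = sum(1 for t, p in zip(y_true, y_pred) if t == p)  # correctly classified
    n = len(y_true)  # total evaluated samples
    return c / n


# Hypothetical labels and predictions for five evaluated images.
labels = [0, 2, 1, 4, 0]
predictions = [0, 2, 2, 4, 1]
print(accuracy(labels, predictions))  # 3 of 5 correct -> 0.6
```

Accuracy treats every misclassification equally, regardless of how far the predicted class is from the true one.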
We will also use the quadratic weighted kappa statistic to determine the quality of the model, and to have a benchmark as to how...