Training a model and not evaluating its performance makes no sense at all. Hence, we will now be evaluating our deep learning model's performance on the test dataset, which has a total of 1,000 different images from the Flickr8K dataset. We start off by loading up the usual dependencies in case you don't already have them:
import pandas as pd import numpy as np import matplotlib.pyplot as plt pd.options.display.max_colwidth = 500
%matplotlib inline