Localization algorithms are an extension of the materials learned in Chapter 2, Image Classification and Chapter 3, Image Retrieval. In image classification, an image is passed through several layers of a CNN (convolutional neural network). The final layer of CNN outputs the probabilistic value, belonging to each of the labels. This can be extended to localize the objects. We will see these ideas in the following sections.
Localizing algorithms
Localizing objects using sliding windows
An intuitive way of localization is to predict several cropped portions of an image with an object. The cropping of the images can be done by moving a window across the image and predicting for every window. The method of moving a...