Categorizing Images of Clothing with Convolutional Neural Networks
The previous chapter wrapped up our coverage of the best practices for general machine learning. Starting from this chapter, we will dive into the more advanced topics of deep learning and reinforcement learning.
When we deal with image classification, we usually flatten the images, get vectors of pixels, and feed them to a neural network (or another model). Although this might do the job, we lose critical spatial information. In this chapter, we will use Convolutional Neural Networks (CNNs) to extract rich and distinguishable representations from images. You will see how CNN representations make a “9” a “9”, a “4” a “4”, a cat a cat, or a dog a dog.
We will start by exploring individual building blocks in the CNN architecture. Then, we will develop a CNN classifier in PyTorch to categorize clothing images and demystify the convolutional mechanism. Finally...