Video
In this section, we move from image processing to video processing. We'll start our look at video by discussing six ways in which to classify videos with pretrained nets.
Classifying videos with pretrained nets in six different ways
Classifying videos is an area of active research because of the large amount of data needed for processing this type of media. Memory requirements are frequently reaching the limits of modern GPUs and a distributed form of training on multiple machines might be required. Researchers are currently exploring different directions of investigation, with increased levels of complexity from the first approach to the sixth, described next. Let's review them.
The first approach consists of classifying one video frame at a time by considering each one of them as a separate image processed with a 2D CNN. This approach simply reduces the video classification problem to an image classification problem. Each video frame "emits" a classification...