Introducing computer vision problems
In this book, we mentioned computer vision several times, but since this chapter is focused on this particular domain, we will look at it in more detail now. There are several practical tasks related to image and video processing, which are referred to as computer vision domain. While working on some computer vision task, it's important to know these names, to be able to find what you need in the vast ocean of computer vision publications:
- Object recognition: The same as classification. Assigning labels to the images. This is a cat. Age estimation. Facial expression recognition.
- Object localization: Finding frame of object in the image. The cat is in this frame.
- Object detection: Finding frames of objects in the image. The cat is in this frame.
- Semantic segmentation: Each point in the picture is assigned to one class. If the picture contains several cats, each cat's pixel would be assigned to the cat class.
- Instance segmentation: Each point in the picture...