Image classification is the task of predicting labels or categories. Object detection is the task of predicting a list of several deep learning-based algorithms with its corresponding bounding box. The bounding box may have objects other than the detected object inside it. In some applications, labeling every pixel to a label is important rather than bounding box which may have multiple objects. Semantic segmentation is the task of predicting pixel-wise labels.
Here is an example of an image and its corresponding semantic segmentation:
data:image/s3,"s3://crabby-images/c44a6/c44a64037dd0e2cc26906177c0baf9df2a7b51cf" alt=""
data:image/s3,"s3://crabby-images/8a0d3/8a0d3738ceba066b55a044425fa3a80a8098fae1" alt=""
As shown in the image, an input image is predicted with labels for every pixel. The labels could be the sky, tree, person, mountain, and bridge. Rather than assigning a label to the whole image, labels are assigned to each pixel. Semantic segmentation labels pixels independently. You will notice that every...