In this chapter, we will cover one of the most famous application domains in the artificial intelligence (AI) community, computer vision. Applying AI to images and videos has been going on for over two decades now. With better computational power, algorithms such as convolutional neural networks (CNNs) and its variants have worked fairly well in object detection tasks. Advanced steps have been taken towards automated image captioning, diabetic retinopathy, video object detection, captioning, and a lot more.
Due to its promising results and more generalized approach, applying reinforcement learning to computer vision successfully forms challenging tasks for researchers. We have seen how AlphaGo and AlphaGo Zero have outperformed professional human Go players, where the deep reinforcement learning approach is applied to the image of the...