In this chapter, we covered the study of multimedia processing by breaking it into the core components of image, audio, and video processing, and discussed some of the most common tools for working on them. We saw how easy it becomes for us to perform image or video processing using OpenCV. Also, we saw a quick example of generating drum music using Magenta. In the later part of this chapter, we covered how LSTMs work with time series data, and built an API that could generate instrumental music from sample files provided. Finally, we used this API with a Flutter app, which is cross-platform and can be deployed on Android, iOS, and the web simultaneously.
In the next chapter, we will be looking at how we can use deep reinforcement learning (DRL) to create agents that can play board games such as chess.Â