Technical requirements
Image captioning is a problem that requires vast amounts of resources in terms of memory, storage, and computing power. My recommendation is that you use a cloud-based solution such as AWS or FloydHub to run the recipes in this chapter unless you have sufficiently capable hardware. As expected, a GPU is of paramount importance to complete the recipes in this chapter. In the Getting ready section of each recipe, you'll find what you'll need to prepare. The code of this chapter is available here: https://github.com/PacktPublishing/Tensorflow-2.0-Computer-Vision-Cookbook/tree/master/ch7.
Check out the following link to see the Code in Action video: