Downloading the data
The MS-COCO dataset we will be using is quite large. Therefore, we will manually download these datasets. To do that, follow the instructions below:
- Create a folder called
data
in theCh11-Image-Caption-Generation
folder - Download the 2014 Train images set (http://images.cocodataset.org/zips/train2014.zip) containing 83K images (
train2014.zip
) - Download the 2017 Val images set (http://images.cocodataset.org/zips/val2017.zip) containing 5K images (
val2017.zip
) - Download the annotation sets for 2014 (
annotations_trainval2014.zip
) (http://images.cocodataset.org/annotations/annotations_trainval2014.zip) and 2017 (annotations_trainval2017.zip
) (http://images.cocodataset.org/annotations/annotations_trainval2017.zip) - Copy the downloaded zip files to the
Ch11-Image-Caption-Generation/data
folder - Extract the zip files using the Extract to option so that it unzips the content within a sub-folder
Once you complete the above...