Once the data has finished downloading, let's take a look at what we've got. Run ls -al amazon* to make sure the files actually downloaded:
If you have anything else in this directory whose name starts with amazon, it will show up as well. Now that the files are downloaded, let's introduce a new command called file. Run file amazon*:
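To see how file behaves without waiting on the full download, here is a minimal sketch that builds a tiny gzip file in a scratch directory. The names sample.tsv, sample.tsv.gz, and mystery.dat are hypothetical stand-ins, not part of the Amazon dataset, and the call to file is guarded in case the utility is not installed on your system.

```shell
# Work in a scratch directory so the real download folder is untouched.
scratch=$(mktemp -d)
cd "$scratch" || exit 1

# Hypothetical stand-in data: a two-line TSV and a gzip-compressed copy of it.
printf 'id\ttitle\n1\tExample\n' > sample.tsv
gzip -c sample.tsv > sample.tsv.gz

# Copy the archive under a misleading name to show that the extension is irrelevant.
cp sample.tsv.gz mystery.dat

# file examines the contents of each file, so mystery.dat is still
# reported as compressed data (skipped if file is not installed).
if command -v file >/dev/null 2>&1; then
    file sample.tsv sample.tsv.gz mystery.dat
fi

# Independent check: every gzip stream begins with the magic bytes 1f 8b.
magic=$(head -c 2 mystery.dat | od -An -tx1 | tr -d ' \n')
echo "first two bytes: $magic"
```

The exact wording that file prints varies between versions, so treat its output as descriptive rather than something to parse strictly.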
Wow, without any options set, the file command was able to figure out that this is a compressed archive. It does this by inspecting the contents of the file rather than trusting its extension. You'll use the file command a lot to determine the type of files you're working with. Let's decompress the files so we can work with them. This might take a while, depending on the speed of your system.
To do so, run the following:
zcat amazon_reviews_us_Digital_Ebook_Purchase_v1_00.tsv.gz > amazon_reviews_us_Digital_Ebook_Purchase_v1_00.tsv...
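The choice of redirection operator matters if a decompression command is ever run more than once. The following sketch illustrates the difference on a tiny, hypothetical archive (reviews_sample.tsv.gz is a stand-in, not one of the Amazon files). It uses gzip -dc, which writes the decompressed data to standard output in the same way zcat does for .gz files, and gzip -t to test the archive first.

```shell
# Work in a scratch directory so the real data is untouched.
scratch=$(mktemp -d)
cd "$scratch" || exit 1

# Hypothetical stand-in archive: a header row plus two review rows.
printf 'review_id\tstar_rating\nR1\t5\nR2\t4\n' | gzip -c > reviews_sample.tsv.gz

# Test the archive's integrity before decompressing it.
gzip -t reviews_sample.tsv.gz && echo "archive OK"

# '>' truncates the target first, so running the command twice is harmless.
gzip -dc reviews_sample.tsv.gz > reviews_sample.tsv
gzip -dc reviews_sample.tsv.gz > reviews_sample.tsv
once=$(wc -l < reviews_sample.tsv | tr -d ' ')

# '>>' appends, so running the command twice duplicates every row.
gzip -dc reviews_sample.tsv.gz >> appended.tsv
gzip -dc reviews_sample.tsv.gz >> appended.tsv
twice=$(wc -l < appended.tsv | tr -d ' ')

echo "overwrite: $once lines, append: $twice lines"
```

On the sample data, the overwritten file keeps its 3 lines while the appended file grows to 6. With multi-gigabyte review files, an accidental second run with >> would silently double the dataset.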