Sqoop working example
We will be using Google Cloud Platform for running the whole use case that we will be covering in this book. Screenshots and code would be covered throughout this book with this in mind so that the reader at the end of this book would have a fully functioning Data Lake in the cloud which slowly could be connected to the real database existing in the enterprise.
Being the first chapter, which is now dealing with installation and code, this chapter will install certain softwares/tools/technologies/libraries that will be referred to in subsequent chapters. In the context of Sqoop, some installations and commands won't be required butare needed for running all of these in the cloud having a clean node with nothing installed on it.
These examples have been prepared and tested on CentOS 7, and this would be our platform for all the examples covered in this book.
Installation and Configuration
For all the installations discussed in this book, we are following some basic conventions...