Say you have a few hundred TB of on-premise data and a really slow network connection. Getting the data onto the cloud is a formidable task—particularly if you are using a VPN to connect on-premise to the cloud, further slowing your connectivity. To make things easier in such situations, Google offers offline ingestion options that roughly correspond to Snowball and Snowmobile from AWS.
It may be sufficient for you to merely know that such options exist, if you care about the fine print, please read on, otherwise feel free to skip to the end of the chapter:
- Data Preparation: Store and/or prepare your data. This can mean arranging it in servers, creating backup images, network settings, or even databases for machine learning:
- The data should be arranged in the form of non-nested directories (in case of GCS), which then would...