Preparing the dataset
Before starting the ML implementation, it's necessary to analyze and prepare the data for our use case. Since the dataset has been already used in Chapter 4, Predicting Numerical Values with Linear Regression, we will not start the analysis from the beginning, but we will focus exclusively on the queries relevant for our use case.
To start the preparation of our data, we need to do the following:
- Log into Google Cloud Console and access the BigQuery user interface from the navigation menu.
- Create a new dataset under the project that we created in Chapter 2, Setting Up Your GCP and BigQuery Environment. For this use case, we'll create the dataset
11_nyc_bike_sharing_dnn
with the default options. - Now we're ready to create the table that will contain the training dataset. Let's execute the following SQL statement:
CREATE OR REPLACE TABLE `11_nyc_bike_sharing_dnn.training_table` AS Â Â Â Â Â Â Â Â ...