You're reading from Codeless Time Series Analysis with KNIME A practical guide to implementing forecasting models for time series analysis applications

Product type Paperback

Published in Aug 2022

Publisher Packt

ISBN-13 9781803232065

Length 392 pages

Edition 1st Edition

Languages

Java

Tools

Knime

Concepts

Data Analysis

Authors (4):

Daniele Tonini

Maarit Widmann

Corey Weisinger

KNIME AG


Table of Contents (20) Chapters

Preface

1. Part 1: Time Series Basics and KNIME Analytics Platform

2. Chapter 1: Introducing Time Series Analysis

3. Chapter 2: Introduction to KNIME Analytics Platform

4. Chapter 3: Preparing Data for Time Series Analysis

5. Chapter 4: Time Series Visualization

6. Chapter 5: Time Series Components and Statistical Properties

7. Part 2: Building and Deploying a Forecasting Model

8. Chapter 6: Humidity Forecasting with Classical Methods

9. Chapter 7: Forecasting the Temperature with ARIMA and SARIMA Models

10. Chapter 8: Audio Signal Classification with an FFT and a Gradient-Boosted Forest

11. Chapter 9: Training and Deploying a Neural Network to Predict Glucose Levels

12. Chapter 10: Predicting Energy Demand with an LSTM Model

13. Chapter 11: Anomaly Detection – Predicting Failure with No Failure Examples

14. Part 3: Forecasting on Mixed Platforms

15. Chapter 12: Predicting Taxi Demand on the Spark Platform

16. Chapter 13: GPU Accelerated Model for Multivariate Forecasting

17. Chapter 14: Combining KNIME and H2O to Predict Stock Prices

18. Answers

19. Other Books You May Enjoy

Training a random forest model on Spark

In this section, we will explore and preprocess the historical taxi trip data, and then train and evaluate a random forest model for taxi demand prediction on Spark. We will cover these steps in the following subsections:

Exploring the seasonalities via line plots and auto-correlation plots
Preprocessing the data
Training and testing the Spark random forest model
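To make the first step more concrete, here is a minimal sketch of what an auto-correlation plot measures. It is not part of the KNIME workflow: the `autocorrelation` function and the `hourly_counts` series are hypothetical, the data is synthetic rather than the taxi trip data, and plain Python is used only for illustration. A strong positive value at a lag of 24 hours points to daily seasonality.

```python
import math


def autocorrelation(series, lag):
    """Sample autocorrelation of `series` at a positive integer `lag`."""
    n = len(series)
    if not 0 < lag < n:
        raise ValueError("lag must satisfy 0 < lag < len(series)")
    mean = sum(series) / n
    # Denominator: total sum of squared deviations from the mean.
    denom = sum((x - mean) ** 2 for x in series)
    # Numerator: co-movement between the series and its lagged copy.
    numer = sum(
        (series[t] - mean) * (series[t - lag] - mean) for t in range(lag, n)
    )
    return numer / denom


# Synthetic hourly "trip counts" with a daily (24-hour) cycle; not real taxi data.
hourly_counts = [
    100 + 50 * math.sin(2 * math.pi * hour / 24) for hour in range(24 * 14)
]

# Compare a short lag, a half-day lag, and a full-day lag.
acf_by_lag = {lag: autocorrelation(hourly_counts, lag) for lag in (1, 12, 24)}
for lag, value in acf_by_lag.items():
    print(f"lag {lag:>2}: {value:+.3f}")
```

In this synthetic series, the value at lag 24 is strongly positive and the value at lag 12 is strongly negative, which is the pattern a daily cycle produces in an auto-correlation plot.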

The steps in the application are also depicted in the training workflow in Figure 12.6 (available on the KNIME Hub at https://kni.me/w/13wY0Bz-2wUAxffc):

Figure 12.6 – The workflow training a Spark random forest model for demand prediction

The first part of the workflow loads the Parquet files onto Spark, as introduced in the "Accessing the data and loading it into Spark" subsection. The downstream parts of the workflow – data exploration, preprocessing, model training and testing, and model evaluation – are introduced...
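This excerpt does not show how the workflow prepares the model inputs, so the following is only a sketch under stated assumptions. A common way to feed a time series to a tree-based model such as a random forest is to use past values as input columns and to split the rows in time order for training and testing. The helper names `make_lagged_rows` and `chronological_split`, the lag count, the split fraction, and the `demand` values are all hypothetical and are not taken from the book.

```python
def make_lagged_rows(series, n_lags):
    """Turn a series into (features, target) rows from the previous `n_lags` values."""
    rows = []
    for t in range(n_lags, len(series)):
        features = series[t - n_lags:t]  # values at t-n_lags .. t-1
        target = series[t]               # value to predict at time t
        rows.append((features, target))
    return rows


def chronological_split(rows, train_fraction):
    """Split rows in time order so the test rows come after the training rows."""
    cut = int(len(rows) * train_fraction)
    return rows[:cut], rows[cut:]


# Made-up hourly trip counts, used only to show the shape of the output.
demand = [12, 15, 14, 18, 21, 19, 24, 27, 25, 30]

rows = make_lagged_rows(demand, n_lags=3)
train, test = chronological_split(rows, train_fraction=0.8)

print("first row:", rows[0])
print("train rows:", len(train), "test rows:", len(test))
```

Splitting in time order, rather than at random, keeps the model from being evaluated on rows that precede the ones it was trained on.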
