Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Codeless Time Series Analysis with KNIME

You're reading from   Codeless Time Series Analysis with KNIME A practical guide to implementing forecasting models for time series analysis applications

Arrow left icon
Product type Paperback
Published in Aug 2022
Publisher Packt
ISBN-13 9781803232065
Length 392 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Authors (4):
Arrow left icon
Daniele Tonini Daniele Tonini
Author Profile Icon Daniele Tonini
Daniele Tonini
Maarit Widmann Maarit Widmann
Author Profile Icon Maarit Widmann
Maarit Widmann
Corey Weisinger Corey Weisinger
Author Profile Icon Corey Weisinger
Corey Weisinger
KNIME AG KNIME AG
Author Profile Icon KNIME AG
KNIME AG
Arrow right icon
View More author details
Toc

Table of Contents (20) Chapters Close

Preface 1. Part 1: Time Series Basics and KNIME Analytics Platform
2. Chapter 1: Introducing Time Series Analysis FREE CHAPTER 3. Chapter 2: Introduction to KNIME Analytics Platform 4. Chapter 3: Preparing Data for Time Series Analysis 5. Chapter 4: Time Series Visualization 6. Chapter 5: Time Series Components and Statistical Properties 7. Part 2: Building and Deploying a Forecasting Model
8. Chapter 6: Humidity Forecasting with Classical Methods 9. Chapter 7: Forecasting the Temperature with ARIMA and SARIMA Models 10. Chapter 8: Audio Signal Classification with an FFT and a Gradient-Boosted Forest 11. Chapter 9: Training and Deploying a Neural Network to Predict Glucose Levels 12. Chapter 10: Predicting Energy Demand with an LSTM Model 13. Chapter 11: Anomaly Detection – Predicting Failure with No Failure Examples 14. Part 3: Forecasting on Mixed Platforms
15. Chapter 12: Predicting Taxi Demand on the Spark Platform 16. Chapter 13: GPU Accelerated Model for Multivariate Forecasting 17. Chapter 14: Combining KNIME and H2O to Predict Stock Prices 18. Answers 19. Other Books You May Enjoy

Training a random forest model on Spark

In this section, we will explore and preprocess the historical taxi trip data and train and evaluate a random forest model for taxi demand prediction on Spark. We will introduce these steps in the following subsections:

  1. Exploring the seasonalities via line plots and auto-correlation plots
  2. Preprocessing the data
  3. Training and testing the Spark random forest model

The steps in the application are also depicted in the training workflow in Figure 12.6 (accessible on the KNIME Hub under https://kni.me/w/13wY0Bz-2wUAxffc):

Figure 12.6 – The workflow training a Spark random forest model for demand prediction

The first part of the workflow loads the Parquet files onto Spark as introduced in the Accessing the data and loading it into Spark subsection. The downstream parts of the workflow – data exploration, preprocessing, model training and testing, and model evaluation – are introduced...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime