Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Arrow up icon
GO TO TOP
Hands-On Deep Learning with R

You're reading from   Hands-On Deep Learning with R A practical guide to designing, building, and improving neural network models using R

Arrow left icon
Product type Paperback
Published in Apr 2020
Publisher Packt
ISBN-13 9781788996839
Length 330 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Authors (2):
Arrow left icon
Rodger Devine Rodger Devine
Author Profile Icon Rodger Devine
Rodger Devine
Michael Pawlus Michael Pawlus
Author Profile Icon Michael Pawlus
Michael Pawlus
Arrow right icon
View More author details
Toc

Table of Contents (16) Chapters Close

Preface 1. Section 1: Deep Learning Basics
2. Machine Learning Basics FREE CHAPTER 3. Setting Up R for Deep Learning 4. Artificial Neural Networks 5. Section 2: Deep Learning Applications
6. CNNs for Image Recognition 7. Multilayer Perceptron for Signal Detection 8. Neural Collaborative Filtering Using Embeddings 9. Deep Learning for Natural Language Processing 10. Long Short-Term Memory Networks for Stock Forecasting 11. Generative Adversarial Networks for Faces 12. Section 3: Reinforcement Learning
13. Reinforcement Learning for Gaming 14. Deep Q-Learning for Maze Solving 15. Other Books You May Enjoy

Formatting data using tokenization

The first step we will take to begin analyzing text is loading text files and then tokenizing our data by transforming the text from sentences into smaller pieces, such as words or terms. A text object can be tokenized in a number of ways. In this chapter, we will tokenize text into words, although other sized terms could also be tokenized. These are referred to as n-grams, so we can get two-word terms (2-grams), three-word terms, or a term of any arbitrary size.

To get started with the process of creating one-word tokens from our text objects, we will use the following steps:

  1. Let's load the libraries that we will need. For this project, we will use tidyverse for data manipulation, tidytext for special functions to manipulate text data, spacyr for extracting text metadata, and textmineR for word embeddings. To load these libraries, we run...
lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image