The first thing we need to do when working with text data is to extract the tokens that will make up our corpus. Simply put, these tokens are all the terms found across the texts in our data, taken together, with their ordering and grammatical context removed. To create them, we use the tokens() function and its related functions from the quanteda package. As you can imagine, our data contains not only words, but also punctuation marks, numbers, symbols, and other characters such as hyphens. Depending on the problem you're working on, you may find it useful to remove all of them, as we do here. Keep in mind, however, that in some contexts these special characters can be meaningful (for example, the hashtag symbol (#) can be relevant when analyzing Twitter data):
tokens <- tokens(
    our_texts,               # placeholder: replace with your character vector or corpus
    remove_punct   = TRUE,   # remove punctuation marks
    remove_numbers = TRUE,   # remove numbers
    remove_symbols = TRUE,   # remove symbols
    split_hyphens  = TRUE    # split hyphenated words (quanteda >= 2; formerly remove_hyphens)
)
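To make the effect of each option concrete, here is a small self-contained sketch you can run directly; the sample text and object names are invented for illustration, and the exact tokenizer behavior may vary slightly across quanteda versions:

library(quanteda)

# A made-up sentence containing punctuation, a number, a symbol,
# a hyphenated word, and a hashtag
sample_text <- "Text-mining is fun! It costs 10 dollars & works on #rstats data."

sample_tokens <- tokens(
    sample_text,
    remove_punct   = TRUE,   # drops "!" and "."
    remove_numbers = TRUE,   # drops "10"
    remove_symbols = TRUE,   # drops "&"
    split_hyphens  = TRUE    # splits "Text-mining" into "Text" and "mining"
)
print(sample_tokens)         # "#rstats" is typically kept as a single token

Printing the result lets you verify which tokens survive before you build the rest of your pipeline, which is especially useful when deciding whether characters like # carry meaning in your data.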