You're reading from The Statistics and Machine Learning with R Workshop Unlock the power of efficient data science modeling with this hands-on guide

Product type Paperback

Published in Oct 2023

Publisher Packt

ISBN-13 9781803240305

Length 516 pages

Edition 1st Edition

Languages

Concepts

Data Science

Author (1):

Liu Peng

View More author details

Table of Contents (20) Chapters

Preface

1. Part 1:Statistics Essentials

2. Chapter 1: Getting Started with R FREE CHAPTER

3. Chapter 2: Data Processing with dplyr

4. Chapter 3: Intermediate Data Processing

5. Chapter 4: Data Visualization with ggplot2

6. Chapter 5: Exploratory Data Analysis

7. Chapter 6: Effective Reporting with R Markdown

8. Part 2:Fundamentals of Linear Algebra and Calculus in R

9. Chapter 7: Linear Algebra in R

10. Chapter 8: Intermediate Linear Algebra in R

11. Chapter 9: Calculus in R

12. Part 3:Fundamentals of Mathematical Statistics in R

13. Chapter 10: Probability Basics

14. Chapter 11: Statistical Estimation

15. Chapter 12: Linear Regression in R

16. Chapter 13: Logistic Regression in R

17. Chapter 14: Bayesian Statistics

18. Index

Why subscribe?

19. Other Books You May Enjoy

Case study – working with the Stack Overflow dataset

This section will cover an exercise to help you practice different data transformation, aggregation, and merging techniques based on the public Stack Overflow dataset, which contains a set of tables related to technical questions and answers posted on the Stack Overflow platform. The supporting raw data has been uploaded to the accompanying Github repository of this book. We will directly download it from the source GitHub link using the readr package, another tidyverse offering that provides an easy, fast, and friendly way to read a wide range of data sources, including those from the web.

Exercise 2.11 – working with the Stack Overflow dataset

Let’s begin this exercise:

Download three data sources on questions, tags, and their mapping table from GitHub:

library(readr)
df_questions = read_csv("https://raw.githubusercontent.com/PacktPublishing/The-Statistics-and-Machine-Learning-with-R-Workshop...

The rest of the chapter is locked

You're reading from The Statistics and Machine Learning with R Workshop Unlock the power of efficient data science modeling with this hands-on guide

Table of Contents (20) Chapters

Case study – working with the Stack Overflow dataset

Exercise 2.11 – working with the Stack Overflow dataset

Authors (1)

Personalised recommendations for you

You're reading from The Statistics and Machine Learning with R Workshop Unlock the power of efficient data science modeling with this hands-on guide

Table of Contents (20) Chapters

Case study – working with the Stack Overflow dataset

Exercise 2.11 – working with the Stack Overflow dataset

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you