Summary
In this chapter, we delved into two pivotal processes: data cleaning and exploratory data analysis (EDA) using R and Python, with a specific focus on Excel data.
Data cleaning is a fundamental step. We learned how to address missing data, whether through imputation, removal, or interpolation. Dealing with duplicates was another key focus, since Excel data, often sourced from multiple places, can be riddled with redundant rows. We also emphasized assigning the correct data type to each column, which prevents analysis errors later on.
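As a quick recap, the following pandas sketch pulls these cleaning steps together. The workbook name (sales_data.xlsx) and column names (region, order_id, revenue, order_date, quantity) are hypothetical placeholders for your own data:

```python
import pandas as pd

# Hypothetical Excel workbook, used purely for illustration.
df = pd.read_excel("sales_data.xlsx")

# Missing data: impute, remove, or interpolate, depending on the column.
df["region"] = df["region"].fillna("Unknown")    # impute with a constant
df = df.dropna(subset=["order_id"])              # remove rows missing a key field
df["revenue"] = df["revenue"].interpolate()      # interpolate a numeric series

# Duplicates: data merged from several sources often repeats rows.
df = df.drop_duplicates()

# Data types: enforce the types the analysis expects.
df["order_date"] = pd.to_datetime(df["order_date"])
df["quantity"] = df["quantity"].astype("Int64")  # nullable integer dtype
```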
In the realm of EDA, we started with summary statistics. Metrics such as the mean, median, standard deviation, and percentiles of numerical features give an initial sense of a dataset's central tendency and variability. We then explored data distributions, an understanding of which is critical for subsequent analysis and modeling decisions. Lastly, we examined relationships between variables, employing scatter plots and correlation matrices to uncover correlations.
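These EDA steps can be recapped in a similar pandas sketch; again, the workbook and column names are hypothetical, and matplotlib is assumed for the plots:

```python
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_excel("sales_data.xlsx")  # hypothetical workbook

# Summary statistics: mean, std, percentiles, etc. for numeric columns.
print(df.describe())

# Distribution of a single numeric feature.
df["revenue"].plot.hist(bins=30, title="Revenue distribution")
plt.show()

# Relationships between variables: scatter plot plus correlation matrix.
df.plot.scatter(x="quantity", y="revenue", title="Quantity vs. revenue")
plt.show()
print(df.corr(numeric_only=True))
```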