You're reading from Extending Power BI with Python and R Perform advanced analysis using the power of analytical languages

Product type Paperback

Published in Mar 2024

Publisher Packt

ISBN-13 9781837639533

Length 814 pages

Edition 2nd Edition

Languages

Python

Tools

Power BI

Concepts

Business Intelligence

Author (1):

Luca Zavarella

Table of Contents (27) Chapters

Preface

1. Where and How to Use R and Python Scripts in Power BI

2. Configuring R with Power BI

3. Configuring Python with Power BI

4. Solving Common Issues When Using Python and R in Power BI

5. Importing Unhandled Data Objects

6. Using Regular Expressions in Power BI

7. Anonymizing and Pseudonymizing Your Data in Power BI

8. Logging Data from Power BI to External Sources

9. Loading Large Datasets Beyond the Available RAM in Power BI

10. Boosting Data Loading Speed in Power BI with Parquet Format

11. Calling External APIs to Enrich Your Data

12. Calculating Columns Using Complex Algorithms: Distances

13. Calculating Columns Using Complex Algorithms: Fuzzy Matching

14. Calculating Columns Using Complex Algorithms: Optimization Problems

15. Adding Statistical Insights: Associations

16. Adding Statistical Insights: Outliers and Missing Values

17. Using Machine Learning without Premium or Embedded Capacity

18. Using SQL Server External Languages for Advanced Analytics and ML Integration in Power BI

19. Exploratory Data Analysis

20. Using the Grammar of Graphics in Python with plotnine

21. Advanced Visualizations

22. Interactive R Custom Visuals

23. Other Books You May Enjoy

24. Index

Appendix 1: Answers

Appendix 2: Glossary

From CSV to the Parquet file format

Storing structured data in CSV files has long been the method of choice for many organizations. For example, the dataset used in Chapter 9, Loading Large Datasets Beyond the Available RAM in Power BI, which contains monthly U.S. flight data from 1987 to 2012, consists of many CSV files. However, this approach has several significant limitations that can negatively impact data processing and analysis:

CSV is a row-based format and is not optimized for columnar storage. As a result, reading and writing CSV files can be slow, especially for large datasets, which in turn slows query execution and makes data processing and analysis less efficient.
Although CSV files can handle basic data types such as integers and strings, they struggle with more complex data structures such as arrays and nested data types...
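
The two limitations above can be seen in a small round trip. The following is a minimal sketch that uses only Python's standard `csv` module; it is not code from the book, and the record layout (`year`, `carrier`, `dep_delay`, `stops`) is a hypothetical stand-in for flight data. It shows that pulling out a single column still means parsing every row, and that typed and nested values come back as plain strings.

```python
import csv
import io

# Hypothetical flight records; the column names are illustrative only.
rows = [
    {"year": 1987, "carrier": "AA", "dep_delay": 12.5, "stops": ["ORD", "DFW"]},
    {"year": 1988, "carrier": "DL", "dep_delay": -3.0, "stops": []},
]

# Write the records to an in-memory CSV file.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
writer.writeheader()
writer.writerows(rows)

# Read them back: the reader parses every field of every row,
# even when only one column is of interest.
buffer.seek(0)
loaded = list(csv.DictReader(buffer))

# Every value comes back as a string, so types must be restored by hand.
value_types = {name: type(value).__name__ for name, value in loaded[0].items()}

# Extracting a single column still means walking all rows.
delays = [float(record["dep_delay"]) for record in loaded]

# The nested list survives only as its text representation.
stops_as_text = loaded[0]["stops"]

print(value_types)
print(delays)
print(repr(stops_as_text))
```

A columnar format that stores a schema alongside the data avoids both steps: a reader can fetch one column without touching the others, and types do not need to be re-inferred on load.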
