Extending Power BI with Python and R: Perform advanced analysis using the power of analytical languages

Product type Paperback

Published in Mar 2024

Publisher Packt

ISBN-13 9781837639533

Length 814 pages

Edition 2nd Edition

Languages

Python

Tools

Power BI

Concepts

Business Intelligence

Author (1):

Luca Zavarella


Table of Contents (27) Chapters

Preface

1. Where and How to Use R and Python Scripts in Power BI

2. Configuring R with Power BI

3. Configuring Python with Power BI

4. Solving Common Issues When Using Python and R in Power BI

5. Importing Unhandled Data Objects

6. Using Regular Expressions in Power BI

7. Anonymizing and Pseudonymizing Your Data in Power BI

8. Logging Data from Power BI to External Sources

9. Loading Large Datasets Beyond the Available RAM in Power BI

10. Boosting Data Loading Speed in Power BI with Parquet Format

11. Calling External APIs to Enrich Your Data

12. Calculating Columns Using Complex Algorithms: Distances

13. Calculating Columns Using Complex Algorithms: Fuzzy Matching

14. Calculating Columns Using Complex Algorithms: Optimization Problems

15. Adding Statistical Insights: Associations

16. Adding Statistical Insights: Outliers and Missing Values

17. Using Machine Learning without Premium or Embedded Capacity

18. Using SQL Server External Languages for Advanced Analytics and ML Integration in Power BI

19. Exploratory Data Analysis

20. Using the Grammar of Graphics in Python with plotnine

21. Advanced Visualizations

22. Interactive R Custom Visuals

23. Other Books You May Enjoy

24. Index

Appendix 1: Answers

Appendix 2: Glossary

Importing large datasets with Python

In Chapter 3, Configuring Python with Power BI, we suggested that you install some of the most commonly used data management packages in your environment, including NumPy, pandas, and scikit-learn. The biggest limitation of these packages is that they work in memory on a single machine: they cannot handle datasets larger than the RAM of the machine on which they run, and they cannot scale out to more than one machine. To overcome this limitation, distributed systems based on Spark, which has become a dominant tool in the big data analytics landscape, are often used. However, moving to these systems forces developers to rewrite code they have already written so that it uses PySpark, the API created to work with Spark objects from Python. This rework is generally seen as delaying project delivery and frustrating developers who are much more comfortable with the libraries available for standard Python.

In response to the above issues, the community has developed a new library...
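To make the memory limitation described above concrete, here is a minimal sketch of one common workaround that stays within pandas: reading a CSV in fixed-size chunks and aggregating as you go, so that only one chunk is held in RAM at a time. This is not the library the chapter goes on to introduce; it only illustrates the problem that library addresses. The column names (`store`, `sales`), the sample data, and the helper `total_sales_by_store` are hypothetical and chosen purely for illustration.

```python
import io

import pandas as pd

# Hypothetical stand-in for a large CSV file; a real file path works the same way.
csv_data = io.StringIO(
    "store,sales\n"
    "A,10\n"
    "B,20\n"
    "A,30\n"
    "B,40\n"
    "A,50\n"
)


def total_sales_by_store(source, chunksize=2):
    """Sum `sales` per `store`, loading at most `chunksize` rows at a time."""
    totals = {}
    # With `chunksize` set, read_csv yields DataFrames instead of one big frame.
    for chunk in pd.read_csv(source, chunksize=chunksize):
        partial = chunk.groupby("store")["sales"].sum()
        # Merge this chunk's partial sums into the running totals.
        for store, value in partial.items():
            totals[store] = totals.get(store, 0) + int(value)
    return totals


result = total_sales_by_store(csv_data)
print(result)
```

This approach works for aggregations that can be combined from partial results, such as sums and counts. Operations that need the whole dataset at once, such as a global sort or a join, are much harder to express this way, which is part of the motivation for the alternatives discussed in the text.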
