Section 2: Data Ingestion and Transformation with R and Python in Power BI
The data ingestion and data transformation phases are the most common in any system such as Power BI that allows the integration of data from heterogeneous external data sources. The introduction of analytical languages in Power BI allows analysts to add powerful and sophisticated tools to these phases, facilitating operations in scenarios that seem impossible to address with the common tools provided by Power BI. We are talking about tools such as some techniques of reading and writing files directly from analytical languages, the manipulation of serialized files through analytical languages, regular expressions, and sophisticated techniques of natural language processing and distributed systems that allow you to work with very large datasets without generating memory problems. All of this will be the subject of this section of the book and will grant the analyst knowledge of data manipulation above the standard...