Packt+ | Advance your knowledge in tech

You're reading from Learning Pentaho Data Integration 8 CE An end-to-end guide to exploring, transforming, and integrating your data across multiple sources

Product type Paperback

Published in Dec 2017

Publisher Packt

ISBN-13 9781788292436

Length 500 pages

Edition 3rd Edition

Languages

Java

Tools

Pentaho

Concepts

Data Processing

Author (1):

María Carina Roldán

View More author details

Table of Contents (17) Chapters

Preface

1. Getting Started with Pentaho Data Integration FREE CHAPTER

2. Getting Started with Transformations

3. Creating Basic Task Flows

4. Reading and Writing Files

5. Manipulating PDI Data and Metadata

6. Controlling the Flow of Data

7. Cleansing, Validating, and Fixing Data

8. Manipulating Data by Coding

9. Transforming the Dataset

10. Performing Basic Operations with Databases

11. Loading Data Marts with PDI

12. Creating Portable and Reusable Transformations

13. Implementing Metadata Injection

14. Creating Advanced Jobs

15. Launching Transformations and Jobs from the Command Line

16. Best Practices for Designing and Deploying a PDI Project

Going forward and backward across rows

Besides the common use cases explained in the previous sections, there are other use cases that work with groups of rows, looking for rows before or after the current one within each group.

Some examples of this are as follows:

You have a dataset with monthly sales, group by product line. For each product line, you want to calculate the variation of sales from one month to the next.
You have daily sales and want to infer the number of days without sales. (This is the gap in days between a date and the next in your dataset.)
You have a dataset with a list of sales amounts and sales commissions. The fields in your dataset are sales_amount_from, sales_amount_to, and commission_%. You detected that there are overlaps in the data:

sales_amount_from, sales_amount_to, commission_%
0, 1000, %5
1001, 5000, %15
4500, 9999, %15

You want to automatically fix these overlaps. In this case, you want to change the second row to the following:

1001, 4499, %15

In all these examples...

The rest of the chapter is locked

You're reading from Learning Pentaho Data Integration 8 CE An end-to-end guide to exploring, transforming, and integrating your data across multiple sources

Table of Contents (17) Chapters

Going forward and backward across rows

Authors (1)

Other recommended products

Personalised recommendations for you

You're reading from Learning Pentaho Data Integration 8 CE An end-to-end guide to exploring, transforming, and integrating your data across multiple sources

Table of Contents (17) Chapters

Going forward and backward across rows

Unlock this book and the full library FREE for 7 days

Authors (1)

Other recommended products

Personalised recommendations for you