Duplicating and merging dataflows
Our final section in this chapter looks at how to duplicate and merge dataflows. Duplicating dataflows is particularly useful because it lets us apply different processing to the same data without reading a file or querying a database twice. Merging dataflows lets us take data from different sources and consolidate it into a single dataflow.
Duplicating data
Open the job DuplicatingData from the Resources directory.
It starts with a simple database query. The resulting dataflow is duplicated using a tReplicate component, and the same data is then passed to two processing streams. In this case the processing is very simple: each stream applies a filter, selecting rows from region1 or region3 respectively. As noted previously, the processing on each dataflow could be completely different; for example, one flow might be extracted to a CSV file while the other is transformed and loaded into a different database.
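The replicate-then-filter pattern can be sketched outside Talend as well. The following Python snippet is a conceptual illustration only, not Talend's generated code; the row data and region values are made-up assumptions that mirror the job described above:

```python
# Conceptual sketch of tReplicate feeding two filter streams.
# The rows and region names below are illustrative, not from the job.
rows = [
    {"name": "Alice", "region": "region1"},
    {"name": "Bob", "region": "region2"},
    {"name": "Carol", "region": "region3"},
]

# "Replicate": each downstream flow receives its own copy of the full
# row set, so the source is read only once.
flow_a = list(rows)
flow_b = list(rows)

# Each flow is then filtered independently, like two tFilterRow steps.
region1_rows = [r for r in flow_a if r["region"] == "region1"]
region3_rows = [r for r in flow_b if r["region"] == "region3"]

print(region1_rows)  # only the region1 rows
print(region3_rows)  # only the region3 rows
```

The key point the sketch captures is that the copies are independent: filtering one flow has no effect on the other, which is why each branch of the Talend job can process the data in a completely different way.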
Tip
The tReplicate...