Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
IBM Cloud Pak for Data

You're reading from   IBM Cloud Pak for Data An enterprise platform to operationalize data, analytics, and AI

Arrow left icon
Product type Paperback
Published in Nov 2021
Publisher Packt
ISBN-13 9781800562127
Length 336 pages
Edition 1st Edition
Arrow right icon
Authors (3):
Arrow left icon
Hemanth Manda Hemanth Manda
Author Profile Icon Hemanth Manda
Hemanth Manda
Sriram Srinivasan Sriram Srinivasan
Author Profile Icon Sriram Srinivasan
Sriram Srinivasan
Deepak Rangarao Deepak Rangarao
Author Profile Icon Deepak Rangarao
Deepak Rangarao
Arrow right icon
View More author details
Toc

Table of Contents (17) Chapters Close

Preface 1. Section 1: The Basics
2. Chapter 1: The AI Ladder – IBM's Prescriptive Approach FREE CHAPTER 3. Chapter 2: Cloud Pak for Data: A Brief Introduction 4. Section 2: Product Capabilities
5. Chapter 3: Collect – Making Data Simple and Accessible 6. Chapter 4: Organize – Creating a Trusted Analytics Foundation 7. Chapter 5: Analyzing: Building, Deploying, and Scaling Models with Trust and Transparency 8. Chapter 6: Multi-Cloud Strategy and Cloud Satellite 9. Chapter 7: IBM and Partner Extension Services 10. Chapter 8: Customer Use Cases 11. Section 3: Technical Details
12. Chapter 9: Technical Overview, Management, and Administration 13. Chapter 10: Security and Compliance 14. Chapter 11: Storage 15. Chapter 12: Multi-Tenancy 16. Other Books You May Enjoy

Data virtualization versus ETL – when to use what?

Historically, Data warehouses and data lakes are built by moving data in bulk using ETL. One of the leading ETL products in the market happens to be from IBM and is called IBM DataStage. So, it begs the question as to when someone should use data virtualization versus an ETL offering. The answer depends on the use case. If the intent is to explore and analyze small sets of data in real time and where data can change every few minutes or hours, data virtualization is recommended. Please note that the reference to small sets of data alludes to the actual data that's transferred, not the dataset that a query is performed on. On the flip side, if the use case requires processing huge datasets across multiple sources and where data is more or less static over time (historical datasets), an ETL-based solution is highly recommended.

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image