You're reading from Scalable Data Architecture with Java Build efficient enterprise-grade data architecting solutions using Java

Product type Paperback

Published in Sep 2022

Publisher Packt

ISBN-13 9781801073080

Length 382 pages

Edition 1st Edition

Languages

Java

Tools

Deeplearning4j

Concepts

Data Science

Author (1):

Sinchan Banerjee

View More author details

Table of Contents (19) Chapters

Preface

1. Section 1 – Foundation of Data Systems

2. Chapter 1: Basics of Modern Data Architecture FREE CHAPTER

3. Chapter 2: Data Storage and Databases

4. Chapter 3: Identifying the Right Data Platform

5. Section 2 – Building Data Processing Pipelines

6. Chapter 4: ETL Data Load – A Batch-Based Solution to Ingesting Data in a Data Warehouse

7. Chapter 5: Architecting a Batch Processing Pipeline

8. Chapter 6: Architecting a Real-Time Processing Pipeline

9. Chapter 7: Core Architectural Design Patterns

10. Chapter 8: Enabling Data Security and Governance

11. Section 3 – Enabling Data as a Service

12. Chapter 9: Exposing MongoDB Data as a Service

13. Chapter 10: Federated and Scalable DaaS with GraphQL

14. Section 4 – Choosing Suitable Data Architecture

15. Chapter 11: Measuring Performance and Benchmarking Your Applications

16. Chapter 12: Evaluating, Recommending, and Presenting Your Solutions

17. Index

Why subscribe?

18. Other Books You May Enjoy

Designing the solution

To design the solution for the current problem statement, let’s analyze the data points or facts that are available to us right now:

The current problem is a batch-based data engineering problem
The problem at hand is a data ingestion problem
Our source is CSV files containing structured data
Our target is a PostgreSQL data warehouse
Our data warehouse follows a star schema, with one fact table, two dynamic dimension tables, and three static dimension tables
We should choose a technology that is independent of the deployment platform, considering that our solution can be migrated to the cloud in the future
For the context and scope of this book, we will explore optimum solutions based on Java-based technologies

Based on the preceding facts, we can conclude that we have to build three similar data ingestion pipelines – one for the fact table and two others for the dynamic dimension tables. At this point, we...

The rest of the chapter is locked

You're reading from Scalable Data Architecture with Java Build efficient enterprise-grade data architecting solutions using Java

Table of Contents (19) Chapters

Designing the solution

Authors (1)

Personalised recommendations for you

You're reading from Scalable Data Architecture with Java Build efficient enterprise-grade data architecting solutions using Java

Table of Contents (19) Chapters

Designing the solution

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you