Practical Machine Learning on Databricks: Seamlessly transition ML models and MLOps on Databricks

Product type: Paperback

Published: Nov 2023

Publisher: Packt

ISBN-13: 9781801812030

Length: 244 pages

Edition: 1st Edition

Languages: Python

Tools: MLOps

Concepts: Data Science

Author: Debu Sinha

Table of Contents (16 chapters)

Preface

1. Part 1: Introduction

2. Chapter 1: The ML Process and Its Challenges

3. Chapter 2: Overview of ML on Databricks

4. Part 2: ML Pipeline Components and Implementation

5. Chapter 3: Utilizing the Feature Store

6. Chapter 4: Understanding MLflow Components on Databricks

7. Chapter 5: Create a Baseline Model Using Databricks AutoML

8. Part 3: ML Governance and Deployment

9. Chapter 6: Model Versioning and Webhooks

10. Chapter 7: Model Deployment Approaches

11. Chapter 8: Automating ML Workflows Using Databricks Jobs

12. Chapter 9: Model Drift Detection and Retraining

13. Chapter 10: Using CI/CD to Automate Model Retraining and Redeployment

14. Index

15. Other Books You May Enjoy

Wikipedia, Hyperparameter (machine learning) (https://en.wikipedia.org/wiki/Hyperparameter_(machine_learning)).
Matt Asay, 2017, 85% of big data projects fail, TechRepublic, November (https://www.techrepublic.com/article/85-of-big-data-projects-fail-but-your-developers-can-help-yours-succeed/).
Rackspace Technology, New Global Rackspace Technology Study Uncovers Widespread Artificial Intelligence and Machine Learning Knowledge Gap, January 2021 (https://www.rackspace.com/newsroom/new-global-rackspace-technology-study-uncovers-widespread-artificial-intelligence-and).
Gartner, Gartner Data Shows 87 Percent of Organizations Have Low BI and Analytics Maturity, December 2018 (https://www.gartner.com/en/newsroom/press-releases/2018-12-06-gartner-data-shows-87-percent-of-organizations-have-low-bi-and-analytics-maturity).
Learning Spark: Lightning-Fast Big Data Analysis, by Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia: This comprehensive guide covers the fundamentals of Spark, including RDDs, the DataFrame API, Spark Streaming, MLlib, and GraphX. With practical examples and use cases, it will help you become proficient in using Spark for data analytics.
Spark: The Definitive Guide, by Bill Chambers and Matei Zaharia: This acclaimed book provides a deep dive into Spark’s core concepts and advanced features. It covers Spark’s architecture, data processing techniques, ML, graph processing, and deployment considerations. Suitable for beginners and experienced users, it offers a comprehensive understanding of Spark.
High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark, by Holden Karau and Rachel Warren: This book explores strategies for optimizing Spark applications to achieve maximum performance and scalability. It offers insights into tuning Spark configurations, improving data locality, leveraging advanced features, and designing efficient data pipelines.
Spark in Action, by Jean-Georges Perrin: This practical guide takes you through the entire Spark ecosystem, covering data ingestion, transformation, ML, real-time processing, and integration with other technologies. With hands-on examples and real-world use cases, it enables you to apply Spark to your specific projects.
Get Started using Unity Catalog (https://docs.databricks.com/data-governance/unity-catalog/get-started.html).
Databricks documentation (https://docs.databricks.com/introduction/index.html).