Milestone 5 – Defining training parameters and hardware configurations
Now that our data is ready, we can start training our model. We'll use the Hugging Face Trainer class to do most of the work. Trainer provides a feature-complete training and evaluation loop for PyTorch models, optimized for Transformers. It supports distributed training across multiple GPUs/TPUs and mixed precision, and it offers extensive customization. Because Trainer abstracts away the complexities of the training loop, we can focus on providing the essential components, such as a model and a dataset. Here's what we need to do:
- Set up a data collator: This component converts our prepared data into batched PyTorch tensors that the model can use.
- Choose evaluation metrics: We want to see how well the model performs using the word error rate (WER) metric. To perform this calculation, we’ll create a function called compute_metrics...
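To make the first step concrete, here is a minimal sketch of a data collator. It assumes (hypothetically) that each prepared example is a dict with `input_values` (audio features) and `labels` (token IDs); your actual feature names depend on the processor you used in the earlier milestones. The key idea is the same regardless: pad variable-length examples to a common length so they can be stacked into a batch, and pad labels with `-100` so the loss function ignores those positions.

```python
import torch
from torch.nn.utils.rnn import pad_sequence


def collate_batch(features):
    """Pad a list of variable-length examples into batched tensors.

    Assumes each feature dict has 'input_values' (float sequence) and
    'labels' (integer token IDs) -- hypothetical names for illustration.
    """
    inputs = [torch.as_tensor(f["input_values"], dtype=torch.float) for f in features]
    labels = [torch.as_tensor(f["labels"], dtype=torch.long) for f in features]
    return {
        # Pad audio features with zeros (silence) to the longest example.
        "input_values": pad_sequence(inputs, batch_first=True, padding_value=0.0),
        # -100 is the index PyTorch's cross-entropy loss ignores, so padded
        # label positions do not contribute to the training loss.
        "labels": pad_sequence(labels, batch_first=True, padding_value=-100),
    }
```

In practice you would pass a collator like this to Trainer via its `data_collator` argument; models that use attention masks or a processor's own padding method need a slightly richer version.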
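To show what the WER computation involves, here is a self-contained sketch. WER is the word-level edit distance (insertions, deletions, and substitutions) between the predicted and reference transcripts, divided by the number of reference words. In practice you would typically use a ready-made metric (for example, the `wer` metric from the `evaluate` library), and the real `compute_metrics` callback receives token IDs that must first be decoded with the tokenizer; this version assumes already-decoded strings for illustration.

```python
def word_edit_distance(ref_words, hyp_words):
    """Levenshtein distance over words, via classic dynamic programming."""
    d = list(range(len(hyp_words) + 1))  # distances for an empty reference
    for i, r in enumerate(ref_words, 1):
        prev, d[0] = d[0], i
        for j, h in enumerate(hyp_words, 1):
            cur = d[j]
            d[j] = min(
                d[j] + 1,            # delete a reference word
                d[j - 1] + 1,        # insert a hypothesis word
                prev + (r != h),     # substitute (free if words match)
            )
            prev = cur
    return d[-1]


def compute_metrics(predictions, references):
    """Corpus-level WER: total word edits over total reference words.

    Assumes `predictions` and `references` are lists of decoded strings
    (hypothetical signature for illustration; the Trainer callback
    receives an EvalPrediction of token IDs instead).
    """
    total_edits = 0
    total_words = 0
    for hyp, ref in zip(predictions, references):
        ref_words = ref.split()
        total_edits += word_edit_distance(ref_words, hyp.split())
        total_words += len(ref_words)
    return {"wer": total_edits / max(total_words, 1)}
```

A WER of 0.0 means a perfect transcript; values above 1.0 are possible when the hypothesis contains many spurious insertions.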