You're reading from Building AI Applications with OpenAI APIs: Leverage ChatGPT, Whisper, and DALL-E APIs to build 10 innovative AI projects

Product type Paperback

Published in Oct 2024

Publisher Packt

ISBN-13 9781835884003

Length 252 pages

Edition 2nd Edition

Languages

Python

Concepts

GPT/LLMs

Author (1):

Martin Yanev

Table of Contents (19) Chapters

Preface

1. Part 1: Getting Started with OpenAI APIs

2. Chapter 1: Getting Started with the ChatGPT API for NLP Tasks

3. Chapter 2: Building a ChatGPT Clone

4. Part 2: Build Web Applications with ChatGPT API

5. Chapter 3: Creating and Deploying a Code Bug-Fixing Application Using Flask

6. Chapter 4: Integrating the Code Bug-Fixing Application with a Payment Service

7. Chapter 5: Quiz Generation App with ChatGPT and Django

8. Part 3: ChatGPT, DALL-E, and Whisper APIs for Desktop Apps Development

9. Chapter 6: Language Translation Desktop App with the ChatGPT API and Microsoft Word

10. Chapter 7: Building an Outlook Email Reply Generator

11. Chapter 8: Essay Generation Tool with PyQt and the ChatGPT API

12. Chapter 9: Integrating the ChatGPT and DALL-E APIs: Building an End-to-End PowerPoint Presentation Generator

13. Chapter 10: Speech Recognition and Text-to-Speech with the Whisper API

14. Part 4: Advanced Concepts for Powering ChatGPT Apps

15. Chapter 11: Choosing the Right ChatGPT API Model

16. Chapter 12: Fine-Tuning ChatGPT to Create Unique API Models

17. Index

18. Other Books You May Enjoy

Speech Recognition and Text-to-Speech with the Whisper API

Welcome to Chapter 10. This chapter explores the Whisper API, which uses advanced speech recognition and translation to turn audio into text. With it, you can transcribe conversations, interviews, podcasts, or other spoken content, whether your goal is to extract insights from multilingual audio files or to create accessible content for a global audience.

In this chapter, we'll take a deep dive into the core functionality of the Whisper API by developing a language transcription project in Python. We'll get acquainted with its two essential endpoints, transcriptions and translations, which form the backbone of its speech-to-text capabilities. With its state-of-the-art open source...
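As a rough preview of the two endpoints named above, the following is a minimal sketch rather than the chapter's own project code. It assumes the official `openai` Python package (version 1 or later), an `OPENAI_API_KEY` environment variable, and the `whisper-1` model name. The helper names `transcribe` and `translate_to_english` and the file name `interview.mp3` are hypothetical and used only for illustration.

```python
from pathlib import Path


def transcribe(client, audio_path, model="whisper-1"):
    """Return the speech in an audio file as text in its original language."""
    # The transcriptions endpoint expects the audio as a binary file object.
    with Path(audio_path).open("rb") as audio_file:
        result = client.audio.transcriptions.create(model=model, file=audio_file)
    return result.text


def translate_to_english(client, audio_path, model="whisper-1"):
    """Return the speech in an audio file as English text."""
    # The translations endpoint takes the same inputs but outputs English.
    with Path(audio_path).open("rb") as audio_file:
        result = client.audio.translations.create(model=model, file=audio_file)
    return result.text


# Example usage (assumption: `openai` is installed and OPENAI_API_KEY is set;
# "interview.mp3" is a hypothetical file name):
#
#   from openai import OpenAI
#   client = OpenAI()
#   print(transcribe(client, "interview.mp3"))
#   print(translate_to_english(client, "interview.mp3"))
```

Passing the client in as an argument keeps the helpers independent of how the API key is configured, and makes them easy to exercise with a stand-in object before spending API credits.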