Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Newsletter Hub

Free Learning

You're reading from Neural Networks with Keras Cookbook Over 70 recipes leveraging deep learning techniques across image, text, audio, and game bots

Product type Paperback

Published in Feb 2019

Publisher Packt

ISBN-13 9781789346640

Length 568 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Deep Learning

Authors (2):

V Kishore Ayyadevara

Srinivas Pradeep

View More author details

Table of Contents (18) Chapters

Preface

1. Building a Feedforward Neural Network FREE CHAPTER

2. Building a Deep Feedforward Neural Network

3. Applications of Deep Feedforward Neural Networks

4. Building a Deep Convolutional Neural Network

5. Transfer Learning

6. Detecting and Localizing Objects in Images

7. Image Analysis Applications in Self-Driving Cars

8. Image Generation

9. Encoding Inputs

10. Text Analysis Using Word Vectors

11. Building a Recurrent Neural Network

12. Applications of a Many-to-One Architecture RNN

13. Sequence-to-Sequence Learning

14. End-to-End Learning

15. Audio Analysis

16. Reinforcement Learning

17. Other Books You May Enjoy

Leave a review - let other readers know what you think

Transcribing audio into text

In Chapter 14, End-to-End Learning, we learned about transcribing handwritten text images into text. In this section, we will be leveraging a similar end-to-end model to transcribe voices into text.

Getting ready

The strategy that we'll adopt to transcribe voices is as follows:

Download a dataset that contains the audio file and its corresponding transcriptions (ground truths)
Specify a sampling rate while reading the audio files:
- If the sampling rate is 16,000, we'll be extracting 16,000 data points per second of audio.

Extract a Fast Fourier Transformation of the audio array:
- An FFT ensures that we have only the most important features of a signal.
- By default, the FFT gives...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (1)

V Kishore Ayyadevara

V Kishore Ayyadevara leads a team focused on using AI to solve problems in the healthcare space. He has 10 years' experience in data science, solving problems to improve customer experience in leading technology companies. In his current role, he is responsible for developing a variety of cutting edge analytical solutions that have an impact at scale while building strong technical teams. Prior to this, Kishore authored three books — Pro Machine Learning Algorithms, Hands-on Machine Learning with Google Cloud Platform, and SciPy Recipes. Kishore is an active learner with keen interest in identifying problems that can be solved using data, simplifying the complexity and in transferring techniques across domains to achieve quantifiable results.

See other products by V Kishore Ayyadevara