Now that we understand the data we are using and the DeepSpeech model architecture, let's set up the environment to train the model. Creating a virtual environment for the project is optional, but always recommended. It is also recommended to use GPUs to train these models.
Along with Python 3.5 and TensorFlow 1.7+, the following packages are prerequisites:
- python-Levenshtein: To compute the character error rate (CER), which is essentially the edit distance between the reference and predicted transcripts (see the short sketch after this list)
- python_speech_features: To extract MFCC features from raw audio data
- pysoundfile: To read FLAC files
- scipy: Helper functions for windowing
- tqdm: For displaying a progress bar
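As a quick sanity check for the first two packages, here is a minimal sketch that computes a character-level edit distance with python-Levenshtein and extracts MFCC features with python_speech_features; the file name sample.flac and the sample strings are placeholders for illustration:

import Levenshtein
import soundfile as sf
from python_speech_features import mfcc

# Character-level edit distance; dividing by the reference length
# gives the character error rate (CER)
ref, hyp = "speech recognition", "speach recognitin"
print("CER: %.3f" % (Levenshtein.distance(ref, hyp) / len(ref)))

# Read a FLAC file (placeholder path) and compute the default
# 13 MFCCs per 25 ms window with a 10 ms step
signal, sample_rate = sf.read("sample.flac")
features = mfcc(signal, samplerate=sample_rate)
print(features.shape)  # (number of frames, 13)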
Let's create the virtual environment and install all the dependencies:
conda create -n SpeechProject python=3.5.0
source activate SpeechProject
Install the following dependencies inside the activated environment:
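A sketch of the pip commands for the packages listed earlier; the exact TensorFlow pin is an assumption based on the 1.7+ requirement (use tensorflow-gpu instead if you are training on GPUs):

pip install python-Levenshtein
pip install python_speech_features
pip install pysoundfile
pip install scipy
pip install tqdm
pip install 'tensorflow>=1.7,<2.0'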