You're reading from Python Deep Learning Projects: 9 projects demystifying neural network and deep learning models for building intelligent systems

Product type Paperback

Published in Oct 2018

Publisher Packt

ISBN-13 9781788997096

Length 472 pages

Edition 1st Edition

Languages

Python

Concepts

Deep Learning

Authors (3):

Rahul Kumar

Matthew Lamons

Abhishek Nagaraja

Table of Contents (17) Chapters

Preface

1. Building Deep Learning Environments

2. Training NN for Prediction Using Regression

3. Word Representation Using word2vec

4. Building an NLP Pipeline for Building Chatbots

5. Sequence-to-Sequence Models for Building Chatbots

6. Generative Language Model for Content Creation

7. Building Speech Recognition with DeepSpeech2

8. Handwritten Digits Classification Using ConvNets

9. Object Detection Using OpenCV and TensorFlow

10. Building Face Recognition Using FaceNet

11. Automated Image Captioning

12. Pose Estimation on 3D models Using ConvNets

13. Image Translation Using GANs for Style Transfer

14. Develop an Autonomous Agent with Deep R Learning

15. Summary and Next Steps in Your Deep Learning Career

16. Other Books You May Enjoy

Training the captioning model

Now, let's train the model. First, we load the image features stored in the respective .npy files and pass them through the CNN encoder.

The encoder output, the hidden state (initialized to 0), and the decoder input (the start token) are passed to the decoder, which returns the predictions and the decoder hidden state.

The decoder hidden state is then passed back into the model and the predictions are used to calculate the loss. While training, we use the teacher forcing technique to decide the next input to the decoder.

Teacher forcing is the technique where the target word, rather than the decoder's own prediction, is passed as the next input to the decoder. This helps the model learn the correct sequence, or the correct statistical properties of the sequence, quickly.
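The decoding loop with teacher forcing can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: `toy_decoder`, the token ids, and the two-element feature vector are hypothetical stand-ins, not the chapter's model or data. The point it shows is that at every step the decoder is fed the ground-truth word from the caption, while its prediction is used only to compute the loss.

```python
import math

START, END = 0, 3  # hypothetical token ids for the start and end markers


def toy_decoder(token_id, hidden, features, vocab_size=4):
    """Hypothetical stand-in for a trained decoder (not the chapter's model).

    Returns (probabilities over the vocabulary, new hidden state).
    """
    # Mix the input token, the previous hidden state and the image features.
    new_hidden = [math.tanh(h + 0.1 * token_id + f)
                  for h, f in zip(hidden, features)]
    # Made-up logits: one score per vocabulary entry, derived from the state.
    logits = [sum(new_hidden) * (v + 1) * 0.1 for v in range(vocab_size)]
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps], new_hidden


def teacher_forced_loss(target_ids, features, decoder=toy_decoder):
    """Average cross-entropy over one caption, using teacher forcing."""
    hidden = [0.0] * len(features)  # hidden state initialized to 0
    dec_input = target_ids[0]       # the first decoder input is the start token
    inputs_seen, loss = [], 0.0
    for t in range(1, len(target_ids)):
        inputs_seen.append(dec_input)
        probs, hidden = decoder(dec_input, hidden, features)
        # The prediction is only used to score the step against the target.
        loss += -math.log(probs[target_ids[t]])
        # Teacher forcing: feed the ground-truth word, not the prediction.
        dec_input = target_ids[t]
    return loss / (len(target_ids) - 1), inputs_seen


caption = [START, 2, 1, END]  # hypothetical tokenized caption
loss, inputs_seen = teacher_forced_loss(caption, features=[0.5, -0.2])
print(inputs_seen)  # the decoder inputs are the caption shifted right by one
```

Without teacher forcing, the last line of the loop would instead feed the index of the largest entry in `probs`, so an early mistake would propagate into every later step.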

The final step is to calculate the gradients and apply them with the optimizer...
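As a rough illustration of what a gradient step does, the sketch below computes the gradient of a softmax cross-entropy loss by hand and applies one plain gradient-descent update. It makes two simplifying assumptions that are not from the chapter: the logits themselves are treated as the trainable parameters, and the optimizer is basic SGD with a hypothetical learning rate of 0.5. In a real captioning model, the framework computes the gradients with respect to the encoder and decoder weights, and the optimizer may be a different one.

```python
import math


def softmax(logits):
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, target):
    # Negative log-probability assigned to the correct word.
    return -math.log(softmax(logits)[target])


def gradient(logits, target):
    # For softmax + cross-entropy, dLoss/dlogit_i = p_i - 1[i == target].
    probs = softmax(logits)
    return [p - (1.0 if i == target else 0.0) for i, p in enumerate(probs)]


def sgd_step(params, grads, learning_rate=0.5):
    # Plain gradient descent (an assumption; the chapter's optimizer may differ).
    return [p - learning_rate * g for p, g in zip(params, grads)]


logits = [0.0, 0.0, 0.0]  # toy trainable parameters
target = 2                # index of the correct word
before = cross_entropy(logits, target)
logits = sgd_step(logits, gradient(logits, target))
after = cross_entropy(logits, target)
print(before > after)  # the update lowers the loss on this example
```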

Authors (3)

Lamons

Matthew Lamons's background is in experimental psychology and deep learning. Founder and CEO of Skejul, the AI platform to help people manage their activities. Named by Gartner, Inc. as a "Cool Vendor" in the "Cool Vendors in Unified Communication, 2017" report. He founded The Intelligence Factory to build AI strategy, solutions, insights, and talent for enterprise clients and to incubate AI tech startups based on the success of his Applied AI MasterMinds group. Matthew's global community of more than 85K members includes leaders in AI, forecasting, robotics, autonomous vehicles, marketing tech, NLP, computer vision, reinforcement learning, and deep learning. Matthew invites you to join him on his mission to simplify the future and to build AI for good.

Nagaraja

Abhishek Nagaraja was born and raised in India. He graduated magna cum laude from the University of Illinois at Chicago, United States, with a master's degree in Mechanical Engineering with a concentration in Mechatronics and Data Science. Abhishek specializes in Keras and TensorFlow for building and evaluating custom architectures in deep learning recommendation models. His deep learning skills and interests range from computational linguistics and NLP for building chatbots to computer vision and reinforcement learning. He has been working as a Data Scientist for Skejul Inc., building an AI-powered activity forecast engine, and is engaged as a Deep Learning Data Scientist with The Intelligence Factory, building solutions for enterprise clients.

Kumar

Ashish Kumar is a seasoned data science professional, a published author, and a thought leader in the field of data science and machine learning. An IIT Madras graduate and a Young India Fellow, he has around 7 years of experience in implementing and deploying data science and machine learning solutions for challenging industry problems in both hands-on and leadership roles. Natural Language Processing, IoT analytics, R Shiny product development, and ensemble ML methods are among his core areas of expertise. He is fluent in Python and R and teaches a popular ML course at Simplilearn. When not crunching data, Ashish sneaks off to the next hip beach around and enjoys the company of his Kindle. He also trains and mentors data science aspirants and fledgling start-ups.
