Search icon CANCEL
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Intelligent Mobile Projects with TensorFlow

You're reading from   Intelligent Mobile Projects with TensorFlow Build 10+ Artificial Intelligence apps using TensorFlow Mobile and Lite for iOS, Android, and Raspberry Pi

Arrow left icon
Product type Paperback
Published in May 2018
Publisher Packt
ISBN-13 9781788834544
Length 404 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Jeff Tang Jeff Tang
Author Profile Icon Jeff Tang
Jeff Tang
Arrow right icon
View More author details
Toc

Table of Contents (14) Chapters Close

Preface 1. Getting Started with Mobile TensorFlow FREE CHAPTER 2. Classifying Images with Transfer Learning 3. Detecting Objects and Their Locations 4. Transforming Pictures with Amazing Art Styles 5. Understanding Simple Speech Commands 6. Describing Images in Natural Language 7. Recognizing Drawing with CNN and LSTM 8. Predicting Stock Price with RNN 9. Generating and Enhancing Images with GAN 10. Building an AlphaZero-like Mobile Game App 11. Using TensorFlow Lite and Core ML on Mobile 12. Developing TensorFlow Apps on Raspberry Pi 13. Other Books You May Enjoy

Image captioning – how it works

The model that won the first MSCOCO Image Captioning Challenge in 2015 is described in the paper, Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge (https://arxiv.org/pdf/1609.06647.pdf). Before we talk about the training process, which is also covered pretty well in TensorFlow's im2txt model documentation website at https://github.com/tensorflow/models/tree/master/research/im2txt, let's first get a basic understanding of how the model works. This will also help you understand training and inference code in Python, as well as the inference code in iOS and Android you'll see later in the chapter.

The winning Show and Tell model is trained using an end-to-end method, similar to the latest deep learning-based speech recognition models we covered briefly in the previous chapter. It uses the MSCOCO image...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime