Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Conferences

Free Learning

You're reading from Hands-On Generative Adversarial Networks with Keras Your guide to implementing next-generation generative adversarial networks

Product type Paperback

Published in May 2019

Publisher Packt

ISBN-13 9781789538205

Length 272 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Neural Networks

Author (1):

Rafael Valle

View More author details

Table of Contents (14) Chapters

Preface

1. Section 1: Introduction and Environment Setup

2. Deep Learning Basics and Environment Setup FREE CHAPTER

3. Introduction to Generative Models

4. Section 2: Training GANs

5. Implementing Your First GAN

6. Evaluating Your First GAN

7. Improving Your First GAN

8. Section 3: Application of GANs in Computer Vision, Natural Language Processing, and Audio

9. Progressive Growing of GANs

10. Generation of Discrete Sequences Using GANs

11. Text-to-Image Synthesis with GANs

12. TequilaGAN - Identifying GAN Samples

13. Whats next in GANs

Improving the baseline model

In this example, we improve the baseline model without doing any modifications to the architecture. The authors propose changing the optimization problem such that the Discriminator also has access to mismatched pairs of text embeddings and images.

This approach is called the Matching-Aware Discriminator and is designed to separate the error sources in this task. During training, the discriminator has access to real images with proper text and synthetic images with arbitrary text. In this context, the discriminator implicitly has two sources of error: fake images that look real but do not match the text description, and unrealistic images for any text.

In this context, the authors explicitly provide the discriminator with pairs of real images and unmatched texts, and empirically find that this helps during training. We'll provide a slice of the...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (1)

Rafael Valle

Rafael Valle is a research scientist at NVIDIA focusing on audio applications. He has years of experience developing high performance machine learning models for data/audio analysis, synthesis and machine improvisation with formal specifications. Dr. Valle was the first to generate speech samples from scratch with GANs and to show that simple yet efficient techniques can be used to identify GAN samples. He holds an Interdisciplinary PhD in Machine Listening and Improvisation from UC Berkeley, a Masters degree in Computer Music from the MH-Stuttgart in Germany and a Bachelors degree in Orchestral Conducting from UFRJ in Brazil.

See other products by Rafael Valle