In Chapter 6, Language Modeling, we introduced several different language models (word2vec, GloVe, and fastText) that use the context of a word (its surrounding words) to create word vectors (embeddings). These models share some common properties:
- They are context-free (which may seem to contradict the previous statement) because they create a single, global word vector for each word, based on all of its occurrences in the training text. For example, lead can have completely different meanings in the phrases lead the way and lead atom, yet the model will try to embed both meanings in the same word vector, as the sketch after this list illustrates.
- They are position-free because they don't take the order of the contextual words into account when training the embedding vectors.
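The following is a minimal sketch of the context-free property, assuming the gensim library is available; the toy sentences, vector size, and window are illustrative values rather than settings from Chapter 6. A static model such as word2vec keeps exactly one vocabulary entry (and therefore one vector) for lead, no matter which sentence the word appears in:

```python
from gensim.models import Word2Vec

# Two toy sentences in which "lead" has different meanings
sentences = [
    ["she", "will", "lead", "the", "way"],
    ["the", "lead", "atom", "is", "heavy"],
]

# min_count=1 keeps every word; vector_size and window are illustrative
model = Word2Vec(sentences, vector_size=50, window=2, min_count=1, seed=42)

# The vocabulary contains a single entry for "lead", so both meanings
# share the same global, context-free embedding vector
vector = model.wv["lead"]
print(vector.shape)   # (50,)
```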
In contrast, it's possible to create transformer-based language models, which are both context- and position-dependent...
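As a rough illustration of that contrast, here is a sketch assuming the Hugging Face transformers library and the bert-base-uncased checkpoint (not models introduced in Chapter 6). Because the transformer attends to the surrounding words and their positions, the two occurrences of lead receive different vectors:

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

def lead_vector(sentence):
    # Encode the sentence and return the hidden state of the "lead" token
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state[0]
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
    return hidden[tokens.index("lead")]

v1 = lead_vector("she will lead the way")
v2 = lead_vector("the lead atom is heavy")

# Cosine similarity below 1.0: the same word gets two different,
# context-dependent embeddings
print(torch.cosine_similarity(v1, v2, dim=0).item())
```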