If we look at how neural networks learn, a typical architecture consists of a large number of parameters and is optimized with a gradient-descent algorithm, which takes many iterative steps over many examples to reach good performance. Gradient descent generally trains models well, but there are scenarios where it fails. Let's look at such scenarios in the coming sections.
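As a quick refresher, here is a minimal sketch of what those iterative gradient-descent steps look like on a toy one-parameter regression problem (all the data and names here are illustrative, not from any particular library):

```python
import numpy as np

# Toy data: fit w in y = w * x by minimizing the mean squared error.
x = np.array([1.0, 2.0, 3.0, 4.0])
y = 2.0 * x  # the true weight is 2.0

w = np.random.randn()  # random initialization of the parameter
lr = 0.01              # learning rate (step size)

for step in range(1000):                   # many iterative steps
    y_pred = w * x
    grad = np.mean(2 * (y_pred - y) * x)   # dL/dw for the MSE loss
    w -= lr * grad                         # gradient-descent update

print(w)  # approaches 2.0 only after many such updates
```

Even on this trivial problem, the parameter only converges after hundreds of updates, which hints at why gradient descent struggles when very few examples are available.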
There are two main reasons why the gradient-descent algorithm fails to optimize a neural network when only a limited amount of data is available:
- For each new task, the neural network has to start from a random initialization of its parameters, which results in slow convergence. Transfer learning has been used to alleviate this problem by starting from a pretrained network, but it is constrained in that the data for the new task must be similar to the data the network was originally trained on.