You're reading from Modern Computer Vision with PyTorch A practical roadmap from deep learning fundamentals to advanced applications and Generative AI

Product type Paperback

Published in Jun 2024

Publisher Packt

ISBN-13 9781803231334

Length 746 pages

Edition 2nd Edition

Languages

Python

Tools

PyTorch

Concepts

Computer Vision

Authors (2):

V Kishore Ayyadevara

Yeshwanth Reddy

View More author details

Table of Contents (26) Chapters

Preface

1. Section 1: Fundamentals of Deep Learning for Computer Vision

2. Artificial Neural Network Fundamentals FREE CHAPTER

3. PyTorch Fundamentals

4. Building a Deep Neural Network with PyTorch

5. Section 2: Object Classification and Detection

6. Introducing Convolutional Neural Networks

7. Transfer Learning for Image Classification

8. Practical Aspects of Image Classification

9. Basics of Object Detection

10. Advanced Object Detection

11. Image Segmentation

12. Applications of Object Detection and Segmentation

13. Section 3: Image Manipulation

14. Autoencoders and Image Manipulation

15. Image Generation Using GANs

16. Advanced GANs to Manipulate Images

17. Section 4: Combining Computer Vision with Other Techniques

18. Combining Computer Vision and Reinforcement Learning

19. Combining Computer Vision and NLP Techniques

20. Foundation Models in Computer Vision

21. Applications of Stable Diffusion

22. Moving a Model to Production

23. Other Books You May Enjoy

24. Index

Appendix

Summarizing the training process of a neural network

Training a neural network is a process of coming up with optimal weights for a neural network architecture by repeating the two key steps, forward propagation and backpropagation with a given learning rate.

In forward propagation, we apply a set of weights to the input data, pass it through the defined hidden layers, perform the defined non-linear activation on the hidden layers’ output, and then connect the hidden layer to the output layer by multiplying the hidden layer node values with another set of weights to estimate the output value. Finally, we calculate the overall loss corresponding to the given set of weights. For the first forward propagation, the values of the weights are initialized randomly.

In backpropagation, we decrease the loss value (error) by adjusting weights in a direction that reduces the overall loss. Furthermore, the magnitude of the weight update is the gradient times the learning rate.

The process of feedforward propagation and backpropagation is repeated until we achieve as minimal a loss as possible. This implies that, at the end of the training, the neural network has adjusted its weights such that it predicts the output that we want it to predict. In the preceding toy example, after training, the updated network will predict a value of 0 as output when {1,1} is fed as input as it is trained to achieve that.

You're reading from Modern Computer Vision with PyTorch A practical roadmap from deep learning fundamentals to advanced applications and Generative AI

Table of Contents (26) Chapters Close

Summarizing the training process of a neural network

Authors (2)

Personalised recommendations for you

Table of Contents (26) Chapters