
You're reading from Mastering Computer Vision with TensorFlow 2.x: Build advanced computer vision applications using machine learning and deep learning techniques

Product type Paperback

Published in May 2020

Publisher Packt

ISBN-13 9781838827069

Length 430 pages

Edition 1st Edition

Languages

Python

Tools

OpenCV

Concepts

Computer Vision

Author (1):

Krishnendu Kar



YOLO versus YOLO v2 versus YOLO v3

A comparison of the three YOLO versions is shown in this table:

Feature	YOLO	YOLO v2	YOLO v3
Input size	224 x 224	448 x 448
Framework	Darknet, trained on ImageNet (1,000 classes).	Darknet-19: 19 convolutional layers and 5 max-pooling layers.	Darknet-53: 53 convolutional layers. For detection, 53 more layers are added, giving a total of 106 layers.
Small object detection	Cannot detect small objects.	Better than YOLO at detecting small objects.	Better than YOLO v2 at detecting small objects.
Key addition		Uses anchor boxes.	Uses residual blocks.
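
The table notes only that YOLO v2 uses anchor boxes. As a rough illustration of what that means, here is a minimal sketch of how one raw prediction might be decoded relative to an anchor box, following the formulation in the YOLO v2 paper. The function and parameter names are hypothetical, and all values are expressed in grid-cell units; this is not the book's implementation.

```python
import math


def sigmoid(x):
    """Logistic function, squashing x into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def decode_box(t_x, t_y, t_w, t_h, cell_x, cell_y, anchor_w, anchor_h):
    """Decode one raw prediction into a box, in grid-cell units.

    Hypothetical helper for illustration:
    - (t_x, t_y, t_w, t_h) are the raw network outputs for one anchor,
    - (cell_x, cell_y) is the top-left corner of the predicting grid cell,
    - (anchor_w, anchor_h) is the prior (anchor) box size.
    """
    # The sigmoid keeps the predicted center inside the predicting cell.
    b_x = cell_x + sigmoid(t_x)
    b_y = cell_y + sigmoid(t_y)
    # Width and height rescale the anchor box rather than being predicted directly.
    b_w = anchor_w * math.exp(t_w)
    b_h = anchor_h * math.exp(t_h)
    return b_x, b_y, b_w, b_h
```

With all raw outputs at zero, the decoded box sits at the center of its grid cell and has exactly the anchor's width and height, which is why well-chosen anchors give the network a sensible starting point.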

The following diagram compares the architectures of YOLO v2 and YOLO v3:

The basic convolution layers are similar, but YOLO v3 carries out detection at three separate layers: 82, 94, and 106.
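
To make the three detection layers more concrete, the sketch below computes the output grid produced at each one. Several values are assumptions not stated in the text above: a 416 x 416 input, strides of 32, 16, and 8 at layers 82, 94, and 106 respectively, three anchor boxes per grid cell, and 80 classes (as in the commonly used COCO configuration). The function name is hypothetical.

```python
# Assumed mapping of detection layer index -> downsampling stride.
DETECTION_STRIDES = {82: 32, 94: 16, 106: 8}


def detection_grids(input_size=416, boxes_per_cell=3, num_classes=80):
    """Return {layer: (grid_side, channels, num_boxes)} for each detection layer.

    All default values are assumptions chosen for illustration.
    """
    grids = {}
    for layer, stride in DETECTION_STRIDES.items():
        # Each detection layer sees the input downsampled by its stride.
        grid_side = input_size // stride
        # Per box: 4 coordinates + 1 objectness score + class scores.
        channels = boxes_per_cell * (5 + num_classes)
        num_boxes = grid_side * grid_side * boxes_per_cell
        grids[layer] = (grid_side, channels, num_boxes)
    return grids


if __name__ == "__main__":
    for layer, (side, channels, boxes) in detection_grids().items():
        print(f"layer {layer}: {side} x {side} x {channels}, {boxes} boxes")
```

Under these assumptions, the coarsest grid (layer 82) covers large objects and the finest grid (layer 106) covers small ones, which is consistent with the table's note that YOLO v3 improves on small object detection.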

The most critical item that you should take from YOLO v3 is its object detection...


Authors (1)

Kar

Krishnendu (Krish) is passionate about computer vision research and about solving AI problems to make our lives simpler. His core expertise is deep learning (computer vision), IoT, and agile software development. Krish is also a passionate app developer and has a dash cam-based object and lane detection, turn-by-turn navigation, and fitness app in the iOS App Store: Nity Map AI Camera & Run timer.
