Let's say we have scanned images of several handwritten digits and want to build software that recognizes the digit in a given scan. For simplicity, assume each image contains exactly one digit. The target software takes an image as input and outputs the number it depicts. We could write an algorithm as a series of hand-crafted checks, such as: if the image contains a single vertical line, output 1; if it contains an oval shape, output 0. However, this is a naive and brittle solution, because other digits, such as 7 and 9, also contain vertical lines. The following figure explains the overall process, taking in one of the samples from the MNIST handwritten digit dataset:
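To make the brittleness of such hand-crafted checks concrete, here is a minimal sketch of a rule-based classifier on a toy binary image. The rules ("a single column of dark pixels means 1", "a dark ring with a blank interior means 0") are hypothetical, invented only for illustration, and they fail for most real digits:

```python
import numpy as np

def naive_classify(img):
    """Toy rule-based digit classifier on a binary (0/1) image array.

    Hypothetical rules for illustration only:
    - if all dark pixels fall in a single column, call it a 1
    - if dark pixels surround a blank interior region, call it a 0
    - otherwise, give up and return None
    """
    rows, cols = np.nonzero(img)
    if len(cols) == 0:
        return None
    # Rule for "1": every dark pixel lies in the same column.
    if cols.min() == cols.max():
        return 1
    # Rule for "0": the bounding box of the dark pixels has a blank interior,
    # suggesting an oval-like ring.
    r0, r1 = rows.min(), rows.max()
    c0, c1 = cols.min(), cols.max()
    interior = img[r0 + 1:r1, c0 + 1:c1]
    if interior.size > 0 and (interior == 0).any():
        return 0
    return None

# A "1": a single vertical stroke.
one = np.zeros((5, 5), dtype=int)
one[:, 2] = 1

# A "0": a ring of dark pixels around a blank center.
zero = np.ones((5, 5), dtype=int)
zero[1:4, 1:4] = 0

print(naive_classify(one))   # 1
print(naive_classify(zero))  # 0
```

These rules work on the two toy inputs above, but a slightly slanted 1, or a 7 whose horizontal top bar encloses some blank space, would immediately break them, which is exactly why we turn to learned models instead of hand-written checks.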
There are several ways to model such a problem. We know that an image is made up of arrays of pixels and each pixel...