Python Image Processing Cookbook

Over 60 recipes to help you perform complex image processing and computer vision tasks with ease

Product type: Paperback

Published in: Apr 2020

Publisher: Packt

ISBN-13: 9781789537147

Length: 438 pages

Edition: 1st Edition

Author (1):

Sandipan Dey

Table of Contents (11) Chapters

Preface

1. Image Manipulation and Transformation

2. Image Enhancement

3. Image Restoration

4. Binary Image Processing

5. Image Registration

6. Image Segmentation

7. Image Classification

8. Object Detection in Images

9. Face Recognition, Image Captioning, and More

10. Other Books You May Enjoy

Object detection with Mask R-CNN

The Mask R-CNN algorithm (He et al., 2017) improves on the Faster R-CNN algorithm for region-based object detection in several ways, with the following two primary contributions:

RoI pooling is replaced with an RoIAlign module, which is more accurate.
An additional mask branch is inserted at the output of the RoIAlign module. This branch receives the RoIAlign output and feeds it through two successive convolutional layers; the output of the last convolutional layer forms the object mask.
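The first change in the list above can be illustrated with a minimal NumPy sketch. It is not the book's code or any library's implementation: the function names `quantized_sample` and `bilinear_sample` are hypothetical, and the floor-based snapping is a simplification of the coordinate quantization done by RoI pooling. The sketch only shows the core idea for a single sampling point, which is that RoIAlign reads the feature map at a real-valued location through bilinear interpolation instead of snapping to the nearest grid cell. An actual RoIAlign layer samples several such points per output bin and aggregates them.

```python
import numpy as np


def quantized_sample(feature_map, y, x):
    """RoI-pooling-style lookup (simplified): snap (y, x) to an integer cell."""
    h, w = feature_map.shape
    yi = min(max(int(np.floor(y)), 0), h - 1)
    xi = min(max(int(np.floor(x)), 0), w - 1)
    return float(feature_map[yi, xi])


def bilinear_sample(feature_map, y, x):
    """RoIAlign-style lookup: bilinear interpolation at a real-valued (y, x)."""
    h, w = feature_map.shape
    # Clamp the sampling point so all four neighbours stay inside the map.
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(np.floor(y)), int(np.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    # Interpolate along x on the two neighbouring rows, then along y.
    top = (1 - dx) * feature_map[y0, x0] + dx * feature_map[y0, x1]
    bottom = (1 - dx) * feature_map[y1, x0] + dx * feature_map[y1, x1]
    return float((1 - dy) * top + dy * bottom)


# Toy feature map whose value equals its column index, so the "true"
# value at a fractional location (y, x) is simply x.
feature_map = np.tile(np.arange(5, dtype=float), (4, 1))

y, x = 1.25, 2.6
snapped = quantized_sample(feature_map, y, x)   # loses the fractional offset
aligned = bilinear_sample(feature_map, y, x)    # stays close to x = 2.6

print("quantized:", snapped)
print("bilinear: ", aligned)
```

On this toy ramp, the quantized lookup returns the value of the cell at column 2, while the bilinear lookup recovers a value close to 2.6. That sub-cell error is small for a bounding box but matters when a mask has to line up with the image pixel by pixel.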

The RoIAlign module provides a more precise correspondence between the selected regions of the feature map and the corresponding regions of the input image. Pixel-level segmentation needs much finer alignment than computing bounding boxes alone. The following screenshot shows the architecture of Mask R-CNN:

In...

Authors (1)

Sandipan Dey

Sandipan Dey is a data scientist with a wide range of interests, covering topics such as machine learning, deep learning, image processing, and computer vision. He has worked in numerous data science fields, including recommender systems, predictive models for the events industry, sensor localization models, sentiment analysis, and device prognostics. He earned his master's degree in computer science from the University of Maryland, Baltimore County, and has published in a few IEEE Data Mining conferences and journals. He has earned certifications from 100+ MOOCs on data science, machine learning, deep learning, image processing, and related topics. He is a regular blogger (sandipanweb) and a machine learning education enthusiast.
