You're reading from 3D Deep Learning with Python Design and develop your computer vision model with 3D data using PyTorch3D and more

Product type Paperback

Published in Oct 2022

Publisher Packt

ISBN-13 9781803247823

Length 236 pages

Edition 1st Edition

Languages

Python

Tools

PyTorch3D

Concepts

Computer Vision

Authors (4):

Xudong Ma

David Farrugia

Vishakh Hegde

Lilit Yolyan

View More author details

Table of Contents (16) Chapters

Preface

1. PART 1: 3D Data Processing Basics

2. Chapter 1: Introducing 3D Data Processing FREE CHAPTER

3. Chapter 2: Introducing 3D Computer Vision and Geometry

4. PART 2: 3D Deep Learning Using PyTorch3D

5. Chapter 3: Fitting Deformable Mesh Models to Raw Point Clouds

6. Chapter 4: Learning Object Pose Detection and Tracking by Differentiable Rendering

7. Chapter 5: Understanding Differentiable Volumetric Rendering

8. Chapter 6: Exploring Neural Radiance Fields (NeRF)

9. PART 3: State-of-the-art 3D Deep Learning Using PyTorch3D

10. Chapter 7: Exploring Controllable Neural Feature Fields

11. Chapter 8: Modeling the Human Body in 3D

12. Chapter 9: Performing End-to-End View Synthesis with SynSin

13. Chapter 10: Mesh R-CNN

14. Index

Why subscribe?

15. Other Books You May Enjoy

Understanding camera models

In this section, we will learn about camera models. In 3D deep learning, usually we need to use 2D images for 3D detection. Either 3D information is detected solely from 2D images, or 2D images are fused with depth for high accuracy. Nevertheless, camera models are essential to build correspondence between the 2D space and the 3D world.

In PyTorch3D, there are two major camera models, the orthographic camera defined by the OrthographicCameras class and the perspective camera model defined by the PerspectiveCameras class. The following figure shows the differences between the two camera models.

Figure 1.5 – Two major camera models implemented in PyTorch3D, perspective and orthographic

The orthographic cameras use orthographic projections to map objects in the 3D world to 2D images, while the perspective cameras use perspective projections to map objects in the 3D world to 2D images. The orthographic projections map objects to 2D images, disregarding the object depth. For example, just as shown in the figure, two objects with the same geometric size at different depths would be mapped to 2D images of the same size. On the other hand, in perspective projections, if an object moved far away from the camera, it would be mapped to a smaller size on the 2D images.

Now that we have learned about the basic concept of camera models, let us look at some coding examples to see how we can create and use these camera models.

Tech Concepts

Programming languages

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

You're reading from 3D Deep Learning with Python Design and develop your computer vision model with 3D data using PyTorch3D and more

Table of Contents (16) Chapters

Understanding camera models

Authors (4)

Personalised recommendations for you

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access