Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Kinect for Windows SDK Programming Guide

You're reading from   Kinect for Windows SDK Programming Guide

Arrow left icon
Product type Paperback
Published in Dec 2012
Publisher Packt
ISBN-13 9781849692380
Length 392 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Abhijit Jana Abhijit Jana
Author Profile Icon Abhijit Jana
Abhijit Jana
Arrow right icon
View More author details
Toc

Table of Contents (19) Chapters Close

Kinect for Windows SDK Programming Guide
Credits
About the Author
Acknowledgement
About the Reviewers
www.PacktPub.com
Preface
1. Understanding the Kinect Device 2. Getting Started FREE CHAPTER 3. Starting to Build Kinect Applications 4. Getting the Most out of Kinect Camera 5. The Depth Data – Making Things Happen 6. Human Skeleton Tracking 7. Using Kinect's Microphone Array 8. Speech Recognition 9. Building Gesture-controlled Applications 10. Developing Applications Using Multiple Kinects 11. Putting Things Together Index

The Kinect SDK architecture for Audio


The SDK installs the Kinect USB Audio components that actually interact with the microphone array of the Kinect sensor and the SDK components. For speech recognition, Kinect uses the underlying speech API of the Windows operating system. Kinect has its own internal pipeline that processes the captured audio data; however, when it comes under the operating system level, the audio API is built on existing audio framework components. From the following diagram, you can see that the captured audio from the Kinect microphone array is passed to the application via the Kinect and Windows Audio Components:

Along with the device drivers, the following are the two major components:

  • DirectX Media Object (DMO)

  • Windows Speech Recognition API (SAPI)

The majority of audio functionality, such as as Noise Suppression (NS), Acoustic Echo Cancellation (AEC), and Automatic Gain Control (AGC) is controlled by the DMO. However, these are not new functionalities for DMO; the...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image