You're reading from Artificial Intelligence with Python Cookbook: Proven recipes for applying AI algorithms and deep learning techniques using TensorFlow 2.x and PyTorch 1.6

Product type Paperback

Published in Oct 2020

Publisher Packt

ISBN-13 9781789133967

Length 468 pages

Edition 1st Edition

Languages

Python

Tools

PyTorch

Concepts

Artificial Intelligence

Authors (2):

Ritesh Kumar

Ben Auffarth

Table of Contents (13) Chapters

Preface

1. Getting Started with Artificial Intelligence in Python

2. Advanced Topics in Supervised Machine Learning

3. Patterns, Outliers, and Recommendations

4. Probabilistic Modeling

5. Heuristic Search Techniques and Logical Inference

6. Deep Reinforcement Learning

7. Advanced Image Applications

8. Working with Moving Images

9. Deep Learning in Audio and Speech

10. Natural Language Processing

11. Artificial Intelligence in Production

12. Other Books You May Enjoy

Deep Learning in Audio and Speech

In this chapter, we'll deal with sounds and speech. Sound data comes in the form of waves, and therefore requires different preprocessing than other types of data.
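To make the point about wave-shaped data concrete, here is a minimal sketch of one common preprocessing step: turning a raw waveform into a magnitude spectrogram with a short-time Fourier transform. It is not taken from the book. The synthetic 440 Hz tone, the 16 kHz sample rate, the frame length of 512, the hop of 256, and the helper names `make_tone` and `spectrogram` are all assumptions chosen for illustration, and only NumPy is used.

```python
import numpy as np


def make_tone(freq_hz, duration_s=1.0, sample_rate=16000):
    """Synthesize a pure sine tone as a 1-D float array (illustrative input)."""
    t = np.arange(int(duration_s * sample_rate)) / sample_rate
    return np.sin(2 * np.pi * freq_hz * t)


def spectrogram(signal, frame_len=512, hop=256):
    """Magnitude spectrogram from overlapping, Hann-windowed frames."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    # One FFT per frame; result has shape (n_frames, frame_len // 2 + 1)
    return np.abs(np.fft.rfft(frames, axis=1))


sample_rate = 16000  # assumed sample rate in Hz
frame_len = 512      # assumed frame length in samples

wave = make_tone(440.0, sample_rate=sample_rate)
spec = spectrogram(wave, frame_len=frame_len)

# The frequency bin with the most energy should sit close to the tone's pitch.
peak_bin = spec.mean(axis=0).argmax()
peak_hz = peak_bin * sample_rate / frame_len
print(spec.shape, peak_hz)
```

The resulting two-dimensional array (time frames by frequency bins) can be treated much like an image, which is one reason this representation is a popular input for neural networks on audio.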

Machine learning on audio signals has commercial applications in speech enhancement (for example, in hearing aids), speech-to-text and text-to-speech, noise cancellation (as in headphones), music recommendation based on user preferences (as on Spotify), and audio generation. Audio also offers many fun problems, including music genre classification, music transcription, and music generation.

We'll implement several applications with sound and speech in this chapter. We'll first do a simple example of a classification task, where we try to distinguish different words. This would be a typical application in a smart...

Authors (2)

Kumar

Ashish Kumar is a seasoned data science professional, a published author, and a thought leader in the field of data science and machine learning. An IIT Madras graduate and a Young India Fellow, he has around 7 years of experience in implementing and deploying data science and machine learning solutions for challenging industry problems in both hands-on and leadership roles. His core areas of expertise include natural language processing, IoT analytics, R Shiny product development, and ensemble ML methods. He is fluent in Python and R and teaches a popular ML course at Simplilearn. When not crunching data, Ashish sneaks off to the next hip beach around and enjoys the company of his Kindle. He also trains and mentors data science aspirants and fledgling start-ups.

Ben Auffarth

Ben Auffarth is a full-stack data scientist with more than 15 years of work experience. With a background and Ph.D. in computational and cognitive neuroscience, he has designed and conducted wet lab experiments on cell cultures, analyzed experiments with terabytes of data, run brain models on IBM supercomputers with up to 64k cores, built production systems processing hundreds and thousands of transactions per day, and trained language models on a large corpus of text documents. He co-founded and is the former president of Data Science Speakers, London.
