We will now consider another kind of decomposition that is extremely helpful when working with text documents (that is, in NLP). The theory is not straightforward, because it requires a solid knowledge of probability theory and statistical learning (the complete treatment can be found in the original paper: Blei D., Ng A., and Jordan M., Latent Dirichlet Allocation, Journal of Machine Learning Research, 3, (2003), 993-1022); therefore, we are only going to discuss the main elements, without going into the mathematical details (a more compact description is also present in Bonaccorso G., Machine Learning Algorithms, Second Edition, Packt Publishing, 2018).

Let's consider a set of text documents, dj (called a corpus), whose atoms (or components) are the words, wi.
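Following the conventional notation of the LDA literature (a sketch only; the symbols M and Nj, denoting the number of documents and the length of the j-th document, are introduced here for clarity), the corpus and its documents can be written as:

Corpus = {d1, d2, ..., dM} with dj = (w1, w2, ..., wNj)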
After collecting all of the words, we can build a dictionary containing every distinct term that appears in the corpus.
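As a minimal sketch of these two steps (the toy corpus and the naive whitespace tokenization are illustrative assumptions, not material from the original example), the documents can be split into words and the dictionary collected in a few lines of Python:

```python
# Toy corpus: a list of documents d_j, each one a short string (illustrative only).
corpus = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "cats and dogs can be friends",
]

# Each document is reduced to its atoms (words) with a trivial whitespace split;
# a real pipeline would also lowercase, strip punctuation, and remove stop words.
documents = [document.split() for document in corpus]

# The dictionary collects every distinct word w_i appearing in the corpus.
dictionary = sorted({word for document in documents for word in document})

print(len(dictionary), "distinct words")
print(dictionary)
```

In practice, the same result is usually obtained with a vectorizer (for example, scikit-learn's CountVectorizer), whose fitted vocabulary plays exactly the role of this dictionary.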
We can also state the following...