Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Free Learning

The Unsupervised Learning Workshop

You're reading from The Unsupervised Learning Workshop Get started with unsupervised learning algorithms and simplify your unorganized data to help make future predictions

Product type Paperback

Published in Jul 2020

Publisher Packt

ISBN-13 9781800200708

Length 550 pages

Edition 1st Edition

Languages

Python

Tools

Jupyter

Concepts

Machine Learning

Authors (3):

Benjamin Johnston

Christopher Kruger

Aaron Jones

View More author details

Table of Contents (11) Chapters

Preface

1. Introduction to Clustering

2. Hierarchical Clustering FREE CHAPTER

3. Neighborhood Approaches and DBSCAN

4. Dimensionality Reduction Techniques and PCA

5. Autoencoders

6. t-Distributed Stochastic Neighbor Embedding

7. Topic Modeling

8. Market Basket Analysis

9. Hotspot Analysis

Appendix

Cleaning Text Data

A key component of all successful modeling exercises is a clean dataset that has been appropriately and sufficiently preprocessed for the specific data type and analysis being performed. Text data is no exception, as it is virtually unusable in its raw form. It does not matter what algorithm is being run: if the data isn't properly prepared, the results will be at best meaningless and at worst misleading. As the saying goes, garbage in, garbage out. For topic modeling, the goal of data cleaning is to isolate the words in each document that could be relevant by removing everything that could be obstructive.

Data cleaning and preprocessing is almost always specific to the dataset, meaning that each dataset will require a unique set of cleaning and preprocessing steps selected to specifically handle the issues in it. With text data, cleaning and preprocessing steps can include language filtering, removing URLs and screen names, lemmatizing, and stop word removal...

The rest of the chapter is locked

Register for a free Packt account to unlock a world of extra content!

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (3)

Aaron Jones

Aaron Jones

Aaron Jones is a full-time senior data scientist and consultant. He has built models and data products while working in retail, media, and environmental science. Aaron is based in Seattle, Washington and has a particular interest in clustering algorithms, natural language processing, and Bayesian statistics.

See other products by Aaron Jones

Christopher Kruger

Christopher Kruger

Christopher Kruger is a practicing data scientist and AI researcher. He has managed applied machine learning projects across multiple industries while mentoring junior team members on best practices. His primary focus is on pushing both business practicality as well as academic rigor in every project. Chris is currently developing research in the computer vision space.

See other products by Christopher Kruger

Benjamin Johnston

Benjamin Johnston

Benjamin Johnston is a senior data scientist for one of the world's leading data-driven MedTech companies and is involved in the development of innovative digital solutions throughout the entire product development pathway, from problem definition to solution research and development, through to final deployment. He is currently completing his Ph.D. in machine learning, specializing in image processing and deep convolutional neural networks. He has more than 10 years of experience in medical device design and development, working in a variety of technical roles, and holds first-class honors bachelor's degrees in both engineering and medical science from the University of Sydney, Australia.

See other products by Benjamin Johnston

Other recommended products

Related to this chapter

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Applied Unsupervised Learning with R

Applied Unsupervised Learning with R

Starting with the basics, Applied Unsupervised Learning with R explains clustering methods, distribution analysis, data encoders, and all features of R that enable you to understand your data better and get answers to all your business questions.

Mar 2019 10h 40m

Hands-On Unsupervised Learning with Python

Hands-On Unsupervised Learning with Python

Unsupervised learning is a key required block in both machine learning and deep learning domains. You will explore how to make your models learn, grow, change, and develop by themselves whenever they are exposed to a new set of data. With this book, you will learn the art of unsupervised learning for different real-world challenges.

Feb 2019 12h 52m

Hands-On Unsupervised Learning with Python

Hands-On Unsupervised Learning with Python

Unsupervised learning is a key required block in both machine learning and deep learning domains. You will explore how to make your models learn, grow, change, and develop by themselves whenever they are exposed to a new set of data. With this book, you will learn the art of unsupervised learning for different real-world challenges.

Feb 2019 12h 52m

Personalised recommendations for you

Based on your interests and search pattern

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m

Modern Computer Vision with PyTorch

Modern Computer Vision with PyTorch

This book provides a hands-on approach to solving over 30 prominent real-world computer vision problems using PyTorch 2.x on actual datasets. Here you'll learn to build a neural network from scratch and optimize hyperparameters, perform image classification, multi-object detection, segmentation, and more. You'll also explore facial expression manipulation and combining CV with NLP and RL techniques, build generative AI applications, and take your model to production on AWS. By the end of this book, you'll master modern NN architectures and confidently solve real-world CV problems.

Jun 2024 24h 52m