You're reading from Machine Learning for OpenCV Intelligent image processing with Python

Product type Paperback

Published in Jul 2017

Publisher Packt

ISBN-13 9781783980284

Length 382 pages

Edition 1st Edition

Languages

Python

Tools

OpenCV

Concepts

Machine Learning

Authors (2):

Michael Beyeler


Table of Contents (13) Chapters

Preface

1. A Taste of Machine Learning

2. Working with Data in OpenCV and Python

3. First Steps in Supervised Learning

4. Representing Data and Engineering Features

5. Using Decision Trees to Make a Medical Diagnosis

6. Detecting Pedestrians with Support Vector Machines

7. Implementing a Spam Filter with Bayesian Learning

8. Discovering Hidden Structures with Unsupervised Learning

9. Using Deep Learning to Classify Handwritten Digits

10. Combining Different Algorithms into an Ensemble

11. Selecting the Right Model with Hyperparameter Tuning

12. Wrapping Up

Representing text features

As with categorical features, scikit-learn offers an easy way to encode another common feature type: text features. When working with text features, it is often convenient to encode individual words or phrases as numerical values.

Let's consider a dataset that contains a small corpus of text phrases:

In [1]: sample = [
...        'feature engineering',
...        'feature selection',
...        'feature extraction'
...     ]

One of the simplest methods of encoding such data is by word count; for each phrase, we simply count the occurrences of each word within it. In scikit-learn, this is easily done using CountVectorizer, which functions akin to DictVectorizer:

In [2]: from sklearn.feature_extraction.text import CountVectorizer
...     vec = CountVectorizer()
...     X = vec.fit_transform(sample)
...     X
Out[2]: <3x4...
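To make the idea of word counts concrete, here is a minimal sketch in plain Python of what such an encoding computes. It is not scikit-learn's implementation: the whitespace tokenization, the alphabetically sorted vocabulary, and the names tokenized, vocabulary, and counts are assumptions made purely for illustration.

```python
# Sketch of word-count encoding in plain Python
# (an illustration only, not how CountVectorizer is implemented).
sample = [
    'feature engineering',
    'feature selection',
    'feature extraction',
]

# Assumption: lowercase each phrase and split on whitespace to get tokens.
tokenized = [phrase.lower().split() for phrase in sample]

# Build a sorted vocabulary so that every word maps to a fixed column.
vocabulary = sorted({word for tokens in tokenized for word in tokens})

# One row per phrase, one column per vocabulary word; entries are counts.
counts = [[tokens.count(word) for word in vocabulary] for tokens in tokenized]

print(vocabulary)
for row in counts:
    print(row)
```

With three phrases and four distinct words, the result has three rows and four columns. Only the third column, which corresponds to the word 'feature', is nonzero in every row.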


Authors (2)

Michael Beyeler

Michael Beyeler is a postdoctoral fellow in neuroengineering and data science at the University of Washington, where he is working on computational models of bionic vision in order to improve the perceptual experience of blind patients implanted with a retinal prosthesis (bionic eye). His work lies at the intersection of neuroscience, computer engineering, computer vision, and machine learning. He is also an active contributor to several open source software projects, and has professional programming experience in Python, C/C++, CUDA, MATLAB, and Android. Michael received a PhD in computer science from the University of California, Irvine, and an MSc in biomedical engineering and a BSc in electrical engineering from ETH Zurich, Switzerland.
