Subscription

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Newsletter Hub

Free Learning

You're reading from Hands-On Machine Learning for Algorithmic Trading Design and implement investment strategies based on smart algorithms that learn from data using Python

Product type Paperback

Published in Dec 2018

Publisher Packt

ISBN-13 9781789346411

Length 684 pages

Edition 1st Edition

Languages

Python

Tools

TensorFlow

Concepts

Design

Authors (2):

Yau

Stefan Jansen

View More author details

Table of Contents (23) Chapters

Preface

1. Machine Learning for Trading

2. Market and Fundamental Data FREE CHAPTER

3. Alternative Data for Finance

4. Alpha Factor Research

5. Strategy Evaluation

6. The Machine Learning Process

7. Linear Models

8. Time Series Models

9. Bayesian Machine Learning

10. Decision Trees and Random Forests

11. Gradient Boosting Machines

12. Unsupervised Learning

13. Working with Text Data

14. Topic Modeling

15. Word Embeddings

16. Deep Learning

17. Convolutional Neural Networks

18. Recurrent Neural Networks

19. Autoencoders and Generative Adversarial Nets

20. Reinforcement Learning

21. Next Steps

22. Other Books You May Enjoy

Leave a review - let other readers know what you think

From tokens to numbers – the document-term matrix

In this section, we first introduce how the BoW model converts text data into a numeric vector space representation that permits the comparison of documents using their distance. We then proceed to illustrate how to create a document-term matrix using the sklearn library.

The BoW model

The BoW model represents a document based on the frequency of the terms or tokens it contains. Each document becomes a vector with one entry for each token in the vocabulary that reflects the token's relevance to the document.

The document-term matrix is straightforward to compute given the vocabulary. However, it is also a crude simplification because it abstracts from word order...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Yau

See other products by Yau

Stefan Jansen

Stefan is the founder and CEO of Applied AI. He advises Fortune 500 companies, investment firms, and startups across industries on data & AI strategy, building data science teams, and developing end-to-end machine learning solutions for a broad range of business problems. Before his current venture, he was a partner and managing director at an international investment firm, where he built the predictive analytics and investment research practice. He was also a senior executive at a global fintech company with operations in 15 markets, advised Central Banks in emerging markets, and consulted for the World Bank. He holds Master's degrees in Computer Science from Georgia Tech and in Economics from Harvard and Free University Berlin, and a CFA Charter. He has worked in six languages across Europe, Asia, and the Americas and taught data science at Datacamp and General Assembly.

See other products by Stefan Jansen