Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Free Learning

You're reading from Hands-On Machine Learning for Algorithmic Trading Design and implement investment strategies based on smart algorithms that learn from data using Python

Product type Paperback

Published in Dec 2018

Publisher Packt

ISBN-13 9781789346411

Length 684 pages

Edition 1st Edition

Languages

Python

Tools

TensorFlow

Concepts

Design

Authors (2):

Jeffrey Yau

Stefan Jansen

View More author details

Table of Contents (23) Chapters

Preface

1. Machine Learning for Trading FREE CHAPTER

2. Market and Fundamental Data

3. Alternative Data for Finance

4. Alpha Factor Research

5. Strategy Evaluation

6. The Machine Learning Process

7. Linear Models

8. Time Series Models

9. Bayesian Machine Learning

10. Decision Trees and Random Forests

11. Gradient Boosting Machines

12. Unsupervised Learning

13. Working with Text Data

14. Topic Modeling

15. Word Embeddings

16. Deep Learning

17. Convolutional Neural Networks

18. Recurrent Neural Networks

19. Autoencoders and Generative Adversarial Nets

20. Reinforcement Learning

21. Next Steps

22. Other Books You May Enjoy

Leave a review - let other readers know what you think

Topic Modeling

In the last chapter, we converted unstructured text data into a numerical format using the bag-of-words model. This model abstracts from word order and represents documents as word vectors, where each entry represents the relevance of a token to the document.

The resulting document-term matrix (DTM), (you may also come across the transposed term-document matrix) is useful to compare documents to each other or to a query vector based on their token content, and quickly find a needle in a haystack or classify documents accordingly.

However, this document model is both high-dimensional and very sparse. As a result, it does little to summarize the content or get closer to understanding what it is about. In this chapter, we will use unsupervised machine learning in the form of topic modeling to extract hidden themes from documents. These themes can produce detailed insights...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Yau

See other products by Yau

Stefan Jansen

Stefan is the founder and CEO of Applied AI. He advises Fortune 500 companies, investment firms, and startups across industries on data & AI strategy, building data science teams, and developing end-to-end machine learning solutions for a broad range of business problems. Before his current venture, he was a partner and managing director at an international investment firm, where he built the predictive analytics and investment research practice. He was also a senior executive at a global fintech company with operations in 15 markets, advised Central Banks in emerging markets, and consulted for the World Bank. He holds Master's degrees in Computer Science from Georgia Tech and in Economics from Harvard and Free University Berlin, and a CFA Charter. He has worked in six languages across Europe, Asia, and the Americas and taught data science at Datacamp and General Assembly.

See other products by Stefan Jansen