You're reading from Hands-On Machine Learning for Algorithmic Trading: Design and implement investment strategies based on smart algorithms that learn from data using Python

Product type Paperback

Published in Dec 2018

Publisher Packt

ISBN-13 9781789346411

Length 684 pages

Edition 1st Edition

Languages Python

Tools TensorFlow

Concepts Design

Authors (2):

Jeffrey Yau

Stefan Jansen

Table of Contents (23) Chapters

Preface

1. Machine Learning for Trading

2. Market and Fundamental Data

3. Alternative Data for Finance

4. Alpha Factor Research

5. Strategy Evaluation

6. The Machine Learning Process

7. Linear Models

8. Time Series Models

9. Bayesian Machine Learning

10. Decision Trees and Random Forests

11. Gradient Boosting Machines

12. Unsupervised Learning

13. Working with Text Data

14. Topic Modeling

15. Word Embeddings

16. Deep Learning

17. Convolutional Neural Networks

18. Recurrent Neural Networks

19. Autoencoders and Generative Adversarial Nets

20. Reinforcement Learning

21. Next Steps

22. Other Books You May Enjoy

Working with Text Data

This is the first of three chapters dedicated to extracting signals for algorithmic trading strategies from text data using natural language processing (NLP) and machine learning (ML).

Text data is rich in content yet unstructured in format, so it requires more preprocessing than structured data before an ML algorithm can extract any potential signal. The key challenge lies in converting text into a numerical format that an algorithm can use while still expressing the semantics, or meaning, of the content. We will cover several techniques that capture nuances of language that humans readily understand, so that these nuances can become inputs for ML algorithms.
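As a rough illustration of what converting text into a numerical format can look like, the sketch below counts word occurrences per document and arranges them into a matrix. It is a minimal example under stated assumptions, not the chapter's own method: the helper names `tokenize` and `bag_of_words` and the two example headlines are hypothetical and invented for this illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def bag_of_words(docs):
    """Return (vocabulary, matrix); matrix[i][j] counts vocabulary[j] in docs[i]."""
    tokenized = [tokenize(doc) for doc in docs]
    # The vocabulary is the sorted set of all distinct tokens across documents.
    vocabulary = sorted({token for tokens in tokenized for token in tokens})
    counts = [Counter(tokens) for tokens in tokenized]
    # A Counter returns 0 for terms that do not occur in a document.
    matrix = [[count[term] for term in vocabulary] for count in counts]
    return vocabulary, matrix


# Hypothetical headlines, made up purely for illustration.
docs = [
    "Shares rise as earnings beat forecasts",
    "Shares fall as earnings miss forecasts",
]
vocabulary, matrix = bag_of_words(docs)
```

Each row of `matrix` is a numerical representation of one headline. Note that this simple count-based encoding ignores word order, which is one reason why capturing meaning, and not just the presence of words, is the harder part of the problem.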

In this chapter, we introduce fundamental feature extraction techniques that focus on individual semantic units; that is, words or short groups of words called tokens. We will show how to represent...

Authors (2)

Jeffrey Yau

Stefan Jansen

Stefan is the founder and CEO of Applied AI. He advises Fortune 500 companies, investment firms, and startups across industries on data and AI strategy, building data science teams, and developing end-to-end machine learning solutions for a broad range of business problems. Before his current venture, he was a partner and managing director at an international investment firm, where he built the predictive analytics and investment research practice. He was also a senior executive at a global fintech company with operations in 15 markets, advised central banks in emerging markets, and consulted for the World Bank. He holds Master's degrees in Computer Science from Georgia Tech and in Economics from Harvard and Free University Berlin, and is a CFA charterholder. He has worked in six languages across Europe, Asia, and the Americas and has taught data science at DataCamp and General Assembly.
