You're reading from Python for Finance Cookbook – Second Edition Over 80 powerful recipes for effective financial data analysis

Product type Paperback

Published in Dec 2022

Publisher Packt

ISBN-13 9781803243191

Length 740 pages

Edition 2nd Edition

Languages

Python

Tools

Prophet

Concepts

Data Analysis

Author (1):

Eryk Lewinson

View More author details

Table of Contents (18) Chapters

Preface

1. Acquiring Financial Data

2. Data Preprocessing FREE CHAPTER

3. Visualizing Financial Time Series

4. Exploring Financial Time Series Data

5. Technical Analysis and Building Interactive Dashboards

6. Time Series Analysis and Forecasting

7. Machine Learning-Based Approaches to Time Series Forecasting

8. Multi-Factor Models

9. Modeling Volatility with GARCH Class Models

10. Monte Carlo Simulations in Finance

11. Asset Allocation

12. Backtesting Trading Strategies

13. Applied Machine Learning: Identifying Credit Default

14. Advanced Concepts for Machine Learning Projects

15. Deep Learning in Finance

16. Other Books You May Enjoy

17. Index

Acquiring Financial Data

The first chapter of this book is dedicated to a very important (some may say the most important) part of any data science/quantitative finance project—gathering data. In line with the famous adage “garbage in, garbage out,” we should strive to obtain data of the highest possible quality and then correctly preprocess it for later use with statistical and machine learning algorithms. The reason for this is simple—the results of our analyses are highly dependent on the input data and no sophisticated model will be able to compensate for that. That is also why in our analyses, we should be able to use our (or someone else’s) understanding of the economic/financial domain to motivate certain data for, for example, modeling stock returns.

One of the most frequently reported issues among the readers of the first edition of this book was getting high-quality data. That is why in this chapter we spend more time exploring different sources of financial data. While quite a few of these vendors offer similar information (prices, fundamentals, and so on), they also offer additional, unique data that can be downloaded via their APIs. An example could be company-related news articles or pre-computed technical indicators. That is why we will download different types of data depending on the recipe. However, be sure to inspect the documentation of the library/API, as most likely its vendor also provides standard data such as prices.

Additional examples are also covered in the Jupyter notebooks, which you can find in the accompanying GitHub repository.

The data sources in this chapter were selected intentionally not only to showcase how easy it can be to gather high-quality data using Python libraries but also to show that the gathered data comes in many shapes and sizes.

Sometimes we will get a nicely formatted pandas DataFrame, while other times it might be in JSON format or even bytes that need to be processed and then loaded as a CSV. Hopefully, these recipes will sufficiently prepare you to work with any kind of data you might encounter online.

Something to bear in mind while reading this chapter is that data differs among sources. This means that the prices we downloaded from two vendors will most likely differ, as those vendors also get their data from different sources and might use other methods to adjust the prices for corporate actions. The best practice is to find a source you trust the most concerning a particular type of data (based on, for example, opinion on the internet) and then use it to download the data you need. One additional thing to keep in mind is that when building algorithmic trading strategies, the data we use for modeling should align with the live data feed used for executing the trades.

This chapter does not cover one important type of data—alternative data. This could be any type of data that can be used to generate some insights into predicting asset prices. Alternative data can include satellite images (for example, tracking shipping routes, or the development of a certain area), sensor data, web traffic data, customer reviews, etc. While there are many vendors specializing in alternative data (for example, Quandl/Nasdaq Data Link), you can also get some by accessing publicly available information via web scraping. As an example, you could scrape customer reviews from Amazon or Yelp. However, those are often bigger projects and are unfortunately outside of the scope of this book. Also, you need to make sure that web scraping a particular website is not against its terms and conditions!

Using the vendors mentioned in this chapter, you can get quite a lot of information for free. But most of those providers also offer paid tiers. Remember to do thorough research on what the data suppliers actually provide and what your needs are before signing up for any of the services.

In this chapter, we cover the following recipes:

Getting data from Yahoo Finance
Getting data from Nasdaq Data Link
Getting data from Intrinio
Getting data from Alpha Vantage
Getting data from CoinGecko

The rest of the chapter is locked

You're reading from Python for Finance Cookbook – Second Edition Over 80 powerful recipes for effective financial data analysis

Table of Contents (18) Chapters

Acquiring Financial Data

Authors (1)

Personalised recommendations for you

You're reading from Python for Finance Cookbook – Second Edition Over 80 powerful recipes for effective financial data analysis

Table of Contents (18) Chapters

Acquiring Financial Data

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you