Chapter 2: Analyzing Open Source Software
You can't have a grasp of data science unless you understand open source. It is the oxygen that has fueled the explosion of artificial intelligence (AI) growth in the last two decades. You will be hard-pressed to find any software product or tool being used today that does not make use of open source or is not open source itself.
In this chapter, we will learn what it means for a tool to be open source and how that limits (or does not) how you can use it. We will then walk through how to find and start using different open source tools in your projects today. Finally, we will put these skills to use by evaluating and using one of the most popular open source tools for data science, scikit-learn.
We will focus on the following topics:
- Understanding open source
- Understanding the top four OSS licenses
- Evaluating a new tool or library
- Importing packages using the Anaconda distibution and conda-forge
- Evaluating...