Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases now! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Conferences
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Machine Learning in Biotechnology and Life Sciences

You're reading from   Machine Learning in Biotechnology and Life Sciences Build machine learning models using Python and deploy them on the cloud

Arrow left icon
Product type Paperback
Published in Jan 2022
Publisher Packt
ISBN-13 9781801811910
Length 408 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Saleh Alkhalifa Saleh Alkhalifa
Author Profile Icon Saleh Alkhalifa
Saleh Alkhalifa
Arrow right icon
View More author details
Toc

Table of Contents (17) Chapters Close

Preface 1. Section 1: Getting Started with Data
2. Chapter 1: Introducing Machine Learning for Biotechnology FREE CHAPTER 3. Chapter 2: Introducing Python and the Command Line 4. Chapter 3: Getting Started with SQL and Relational Databases 5. Chapter 4: Visualizing Data with Python 6. Section 2: Developing and Training Models
7. Chapter 5: Understanding Machine Learning 8. Chapter 6: Unsupervised Machine Learning 9. Chapter 7: Supervised Machine Learning 10. Chapter 8: Understanding Deep Learning 11. Chapter 9: Natural Language Processing 12. Chapter 10: Exploring Time Series Analysis 13. Section 3: Deploying Models to Users
14. Chapter 11: Deploying Models with Flask Applications 15. Chapter 12: Deploying Applications to the Cloud 16. Other Books You May Enjoy

Exploring machine learning software

Before we start developing models, we will need to few tools to help us. The good news is that regardless of whether you are using a Mac, PC, or Linux, almost everything we will use is compatible with all platforms. There are three main items we will need to install: a language to develop our models in, a database to store our data in, and a cloud computing space to deploy our models in. Luckily for us, there is a fantastic technology stack ready to support our needs. We will be using the Python programming language to develop our models, MySQL to store our data, and AWS to run our cloud computing processes. Let's take a closer look at these three items.

Python (programming language)

Python is one of the most commonly used programming languages and sought-after skills in the data science industry today. It was first developed in 1991 and is regarded today as the most common language for data science. For this book, we will be using Python 3.7. There are several ways you can install Python on your computer. You can install the language in its standalone form from Python.org. This will provide you with a Python interpreter in its most basic form where you can run commands and execute scripts.

An alternative installation process that would install Python, pip (a package to help you install and manage Python libraries), and a collection of other useful libraries can be done by using Anaconda, which can be retrieved from anaconda.com. To have a working version of Python and its associated libraries on your computer as quickly as possible, using Anaconda is highly recommended. In addition to Python, we will need to install libraries to assist in a few areas. Think of libraries as nicely packaged portions of code that we can import and use as we see fit. Anaconda will, by default, install a few important libraries for us, but there will be others that we will need. We can install those on-the-go using pip. We will look at this in more detail in the next chapter. For the time being, go ahead and install Anaconda on your computer by navigating to the aforementioned website, downloading the installation that best matches your machine, and following the installation instructions provided.

MySQL (database)

When handling vast quantities of information, we will need a place to store and save all of our data throughout the analysis and preprocessing phases of our projects. For this, we will use MySQL, one of the most common relational databases used to store and retrieve data. We will take a closer look at the use of MySQL by using SQL. In addition to the MySQL relational database, we will also explore the use of DynamoDB, a non-relational and NoSQL database that has gained quite a bit of popularity in recent years. Don't worry about getting these setups right now – we will talk about getting them set up later on.

AWS and GCP (Cloud Computing)

Finally, after developing our machine learning models in Python and training them using the data in our databases, we will deploy our models to the cloud using both Amazon Web Services (AWS), and Google Cloud Platform (GCP). In addition to deploying our models, we will also explore a number of useful tools and resources such as Sagemaker, EC2, and AutoPilot (AWS), and Notebooks, App Engine, and AutoML (GCP).

You have been reading a chapter from
Machine Learning in Biotechnology and Life Sciences
Published in: Jan 2022
Publisher: Packt
ISBN-13: 9781801811910
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime