You're reading from Machine Learning in Biotechnology and Life Sciences Build machine learning models using Python and deploy them on the cloud

Product type Paperback

Published in Jan 2022

Publisher Packt

ISBN-13 9781801811910

Length 408 pages

Edition 1st Edition

Languages

Python

Tools

AWS

Concepts

Bioinformatics

Author (1):

Saleh Alkhalifa

View More author details

Table of Contents (17) Chapters

Preface

1. Section 1: Getting Started with Data

2. Chapter 1: Introducing Machine Learning for Biotechnology FREE CHAPTER

3. Chapter 2: Introducing Python and the Command Line

4. Chapter 3: Getting Started with SQL and Relational Databases

5. Chapter 4: Visualizing Data with Python

6. Section 2: Developing and Training Models

7. Chapter 5: Understanding Machine Learning

8. Chapter 6: Unsupervised Machine Learning

9. Chapter 7: Supervised Machine Learning

10. Chapter 8: Understanding Deep Learning

11. Chapter 9: Natural Language Processing

12. Chapter 10: Exploring Time Series Analysis

13. Section 3: Deploying Models to Users

14. Chapter 11: Deploying Models with Flask Applications

15. Chapter 12: Deploying Applications to the Cloud

16. Other Books You May Enjoy

Working with unstructured data

In the previous section, we explored some of the most common tasks and processes that are conducted when handing text-based data. More often than not, you will find that the data you work with is generally not of a structured nature, or perhaps not of a digital nature. Take, for example, a company that has decided to move all printed documents to a digital state. Or perhaps a company that maintains a large repository of documents, none of which are structured or organized. For tasks such as these, we can rely on several AWS products to come to our rescue. We will explore two of the most useful NLP tools in the next few sections.

OCR using AWS Textract

In my opinion, one of the most useful tools available within AWS is an Optical Character Recognition (OCR) tool known as AWS Textract. The main idea behind this tool is to enable users to extract text, tables, and other useful items from images or static PDF documents using pre-built machine learning...

The rest of the chapter is locked

You're reading from Machine Learning in Biotechnology and Life Sciences Build machine learning models using Python and deploy them on the cloud

Table of Contents (17) Chapters

Working with unstructured data

OCR using AWS Textract

Authors (1)

Personalised recommendations for you

You're reading from Machine Learning in Biotechnology and Life Sciences Build machine learning models using Python and deploy them on the cloud

Table of Contents (17) Chapters

Working with unstructured data

OCR using AWS Textract

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you