You're reading from Data Augmentation with Python Enhance deep learning accuracy with data augmentation methods for image, text, audio, and tabular data

Product type Paperback

Published in Apr 2023

Publisher Packt

ISBN-13 9781803246451

Length 394 pages

Edition 1st Edition

Languages

Python

Tools

BERT

Concepts

Data Science

Author (1):

Duc Haba

View More author details

Table of Contents (17) Chapters

Preface

1. Part 1: Data Augmentation

2. Chapter 1: Data Augmentation Made Easy FREE CHAPTER

3. Chapter 2: Biases in Data Augmentation

4. Part 2: Image Augmentation

5. Chapter 3: Image Augmentation for Classification

6. Chapter 4: Image Augmentation for Segmentation

7. Part 3: Text Augmentation

8. Chapter 5: Text Augmentation

9. Chapter 6: Text Augmentation with Machine Learning

10. Part 4: Audio Data Augmentation

11. Chapter 7: Audio Data Augmentation

12. Chapter 8: Audio Data Augmentation with Spectrogram

13. Part 5: Tabular Data Augmentation

14. Chapter 9: Tabular Data Augmentation

15. Index

Why subscribe?

16. Other Books You May Enjoy

Data augmentation role

Data is paramount in any AI project. This is especially true when using the artificial neural network (ANN) algorithm, also known as DL. The success or failure of a DL project is primarily due to the input data quality.

One primary reason for the significance of data augmentation is that it is relatively too easy to develop an AI for prediction and forecasting, and those models require robust data input. With the remarkable advancement in developing, training, and deploying a DL project, such as using the FastAI framework, you can create a world-class DL model in a handful of Python code lines. Thus, expanding the dataset is an effective option to improve the DL model’s accuracy over your competitor.

The traditional method of acquiring additional data is difficult, expensive, and impractical. Sometimes, the only available option is to use data augmentation techniques to extend the dataset.

Fun fact

Data augmentation methods can increase the data’s size tenfold. For example, it is relatively challenging to acquire additional skin cancer images. Thus, using a random combination of image transformations, such as vertical flip, horizontal flip, rotating, and skewing, is a practical technique that can expand the skin cancer photo data.

Without data augmentation, sourcing new skin cancer photos and labeling them is expensive and time-consuming. The International Skin Imaging Collaboration (ISIC) is the authoritative data source for skin diseases, where a team of dermatologists verified and classified the images. ISIC made the datasets available to the public to download for free. If you can’t find a particular dataset from ISIC, it is difficult to find other means, as accessing hospital or university labs to acquire skin disease images is laced with legal and logistic blockers. After obtaining the photos, hiring a team of dermatologists to classify the pictures to correct diseases would be costly.

Another example of the impracticality of attaining additional images instead of augmentation is when you download photos from social media or online search engines. Social media is a rich source of image, text, audio, and video data. Search engines, such as Google or Bing, make it relatively easy to download additional data for a project, but copyrights and legal usage are a quagmire. Most images, texts, audio, and videos on social media, such as YouTube, Facebook, TikTok, and Twitter, are not clearly labeled as copyrights or public domain material.

Furthermore, social media promotes popular content, not unfavorable or obscure material. For example, let’s say you want to add more images of parrots to your parrot classification AI system. Online searches will return a lot of blue-and-yellow macaws, red-and-green macaws, or sulfur-crested cockatoos, but not as many Galah, Kea, or the mythical Norwegian-blue parrot – a fake parrot from the Monty Python comedy skit.

Insufficient data for AI training is exacerbated for text, audio, and tabular data types. Generally, obtaining additional text, audio, and tabular data is expensive and time-consuming. There are strong copyright laws protecting text data. Audio files are less common online, and tabular data is primarily from private company databases.

The following section will define the four commonly used data types.

You're reading from Data Augmentation with Python Enhance deep learning accuracy with data augmentation methods for image, text, audio, and tabular data

Table of Contents (17) Chapters

Data augmentation role

Authors (1)

Personalised recommendations for you