Part 3: Labeling Text, Audio, and Video Data
In this part of the book, you will explore how to read text, audio, and video data using Python, analyze the data, and extract features. The content delves into various methods for programmatically labeling text, video, and audio data in Python, leveraging OpenAI’s large language models, as well as semi-supervised and unsupervised techniques such as K-means clustering. Additionally, this section aids in understanding different open source data annotation tools such as Label Studio, CVAT, pyOpenAnnotate, and Azure Machine Learning for image, video, audio, and text data, providing a comprehensive comparison between them.
This part comprises the following chapters:
- Chapter 7, Labeling Text Data
- Chapter 8, Exploring Video Data
- Chapter 9, Labeling Video Data
- Chapter 10, Exploring Audio Data
- Chapter 11, Labeling Audio Data
- Chapter 12, Hands-On Exploring Data Labeling Tools