Machine learning on unstructured data
Traditionally, machine learning has been applied to structured data – neat, tabular data that fits nicely into the rows and columns of a spreadsheet. However, most of the data being created and stored is unstructured – it doesn’t come pre-packaged in a friendly, orderly format. Examples of unstructured data include text documents, images, audio files, and videos. Machine learning can handle this type of data too, and the effort to do so has given rise to specialized fields such as natural language processing (NLP) and computer vision.
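To make the contrast concrete, the following minimal sketch turns a few pieces of free text into a table of word counts, often called a bag-of-words representation. This is only one simple way to give text a tabular shape, not the approach of any particular library. The function names `tokenize` and `bag_of_words` and the sample reviews are hypothetical and chosen for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def bag_of_words(documents):
    """Return (vocabulary, rows) with one row of word counts per document."""
    counts = [Counter(tokenize(doc)) for doc in documents]
    # The vocabulary is every distinct word seen, in alphabetical order.
    vocabulary = sorted(set().union(*counts))
    # A Counter returns 0 for words that a document does not contain.
    rows = [[c[word] for word in vocabulary] for c in counts]
    return vocabulary, rows


# Hypothetical reviews, made up for illustration only.
reviews = [
    "Great phone, great battery.",
    "The battery died fast.",
]

vocabulary, rows = bag_of_words(reviews)
print(vocabulary)
for row in rows:
    print(row)
```

Each row now has the same columns, one per word in the vocabulary, so the text can be fed to the same kinds of models used for spreadsheet-style data. The sketch ignores word order and meaning, which is one reason more specialized NLP techniques exist.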
Natural language processing (NLP)
NLP is a field at the intersection of computer science, AI, and linguistics. It focuses on how computers can understand, interpret, and generate human language in ways that are useful for practical tasks.
The following are applications of NLP:
- Sentiment analysis: Companies often use sentiment analysis to understand customer opinions about their products or services. For instance, NLP can analyze...