0

Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Free Learning

Natural Language Processing with Java Cookbook

You're reading from Natural Language Processing with Java Cookbook Over 70 recipes to create linguistic and language translation applications using Java libraries

Product type Paperback

Published in Apr 2019

Publisher Packt

ISBN-13 9781789801156

Length 386 pages

Edition 1st Edition

Languages

Java

Tools

Deeplearning4j

Concepts

Mobile Application Development

Authors (2):

Richard M. Reese

Richard M Reese

View More author details

Table of Contents (14) Chapters

Preface

1. Preparing Text for Analysis and Tokenization FREE CHAPTER

2. Isolating Sentences within a Document

3. Performing Name Entity Recognition

4. Detecting POS Using Neural Networks

5. Performing Text Classification

6. Finding Relationships within Text

7. Language Identification and Translation

8. Identifying Semantic Similarities within Text

9. Common Text Processing and Generation Tasks

10. Extracting Data for Use in NLP Analysis

11. Creating a Chatbot

12. Installation and Configuration

13. Other Books You May Enjoy

Leave a review - let other readers know what you think

Tokenization using the Java SDK

Tokenization can be achieved using a number of Java classes, including the String, StringTokenizer, and StreamTokenizer classes. In this recipe, we will demonstrate the use of the Scanner class. While frequently used for console input, it can also be used to tokenize a string.

Getting ready

To prepare, we need to create a new Java project.

How to do it...

Let's go through the following steps:

Add the following import statement to your project's class:

import java.util.ArrayList;
import java.util.Scanner;

Add the following statements to the main method to declare the sample string, create an instance of the Scanner class, and add a list to hold the tokens:

String sampleText = 
    "In addition, the rook was moved too far to be effective.";
 Scanner scanner = new Scanner(sampleText);
 ArrayList<String> list = new ArrayList<>();

Insert the following loops to populate the list and display the tokens:

while (scanner.hasNext()) {
    String token = scanner.next();
    list.add(token);
}

for (String token : list) {
    System.out.println(token);
}

Execute the program. You should get the following output:

In
addition,
the
rook
was
moved
too
far
to
be
effective.

How it works...

The Scanner class's constructor took a string as an argument. This allowed us to apply the Scanner class's methods against the text we used in the next method, which returns a single token at a time, delimited by white spaces. While it was not necessary to store the tokens in a list, this permits us to use it later for different purposes.

You have been reading a chapter from

Natural Language Processing with Java Cookbook

Published in: Apr 2019

Publisher: Packt

ISBN-13: 9781789801156

© 2019 Packt Publishing Limited All Rights Reserved

Register for a free Packt account to unlock a world of extra content!

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at €18.99/month. Cancel anytime

Authors (2)

Richard M Reese

Richard M Reese

Richard M. Reese has worked in both industry and academia. For 17 years, he worked in the telephone and aerospace industries, serving in several capacities, including research and development, software development, supervision, and training. He currently teaches at Tarleton State University, where he has the opportunity to apply his years of industry experience to enhance his teaching. Richard has written several Java books and a C pointer book. He uses a concise and easy-to-follow approach to the topics at hand. His Java books have addressed EJB 3.1, updates to Java 7 and 8, certification, jMonkeyEngine, natural language processing, functional programming, networks, and data science.

See other products by Richard M Reese

Richard M. Reese

Richard M. Reese

Richard Reese has worked in the industry and academics for the past 29 years. For 10 years he provided software development support at Lockheed and at one point developed a C based network application. He was a contract instructor providing software training to industry for 5 years. Richard is currently an Associate Professor at Tarleton State University in Stephenville Texas. Richard is the author of various books and video courses some of which are as follows: Natural Language Processing with Java. Java for Data Science Getting Started with Natural Language Processing in Java

See other products by Richard M. Reese

Other recommended products

Related to this chapter

Natural Language Processing with Java

Natural Language Processing with Java

Natural Language Processing with Java will explore how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. You will leverage the power of Java to extract relationships within different elements of text and documents.

Jul 2018 10h 36m

Natural Language Processing with Java

Natural Language Processing with Java

Natural Language Processing with Java will explore how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. You will leverage the power of Java to extract relationships within different elements of text and documents.

Jul 2018 10h 36m

Natural Language Processing with Java

Natural Language Processing with Java

Natural Language Processing with Java will explore how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. You will leverage the power of Java to extract relationships within different elements of text and documents.

Jul 2018 10h 36m

Java for Data Science

Java for Data Science

Harness the incredible power of Java-based approaches to data science and create new, innovative applications to explore, visualise and analyse big data. With its tutorial approach and step-by-step instructional style, Java for Data Science is the ultimate data science book for Java developers interested in Java-based data science solutions.

Jan 2017 12h 52m

Java Data Science Cookbook

Java Data Science Cookbook

Java has been one of the most popular languages for developers for several decades and yet the potential of the Java ecosystem still remains untapped when it comes to using JVM-based languages and platforms to solve data science related problems. A variety of tools and libraries are available such as Spark, Hadoop, and Mahout for computation and libraries such as MLlib, Weka, DL4j to implement smart data models. This book uncovers practically all these techniques in the form of recipes showing you how these tools and libraries can solve statistical, analytical, data mining, and information science related problems.

Mar 2017 12h 24m