You're reading from Elasticsearch 7 Quick Start Guide: Get up and running with the distributed search and analytics capabilities of Elasticsearch

Product type Paperback

Published in Oct 2019

Publisher Packt

ISBN-13 9781789803327

Length 186 pages

Edition 1st Edition

Tools

Elasticsearch

Concepts

Enterprise Search

Authors (2):

Douglas Miller

Anurag Srivastava

Table of Contents (10) Chapters

Preface

1. Introduction to Elastic Stack

2. Installing Elasticsearch

3. Many as One – the Distributed Model

4. Prepping Your Data – Text Analysis and Mapping

5. Let's Do a Search!

6. Performance Tuning

7. Aggregating Datasets

8. Best Practices

9. Other Books You May Enjoy

Data sparsity

In previous versions of Elasticsearch, sparse documents were to be avoided because of Lucene's structure. Lucene identifies documents internally with document IDs, which are then used for communication between its internal APIs. To retrieve the norm value for a document matched by a search query, Lucene reads the byte at the index given by that document's ID.

Lucene is a full-featured text search engine that is written in Java, and Elasticsearch is built on top of Lucene.

This is very efficient for lookups, because Lucene can quickly access the norm values, but it is costly in storage, because documents that have no value for a field still use one byte of storage each. This means that if an index has x documents, the norms require x bytes of storage per field. This not only affects the sparsity requirements, but also the indexing...
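The lookup and its storage cost can be sketched with a toy model in Python. This is only an illustration of the idea described above, not Lucene's actual implementation: the names `build_norms` and `read_norm`, the document count, and the norm values are all hypothetical.

```python
# Toy model of one-byte-per-document norms for a single field.
# Assumption: norms live in a dense byte array indexed by document ID.

def build_norms(num_docs, values_by_doc_id):
    """Return a dense byte array with one norm byte per document."""
    # Every document gets a byte, even if it has no value for the field.
    norms = bytearray(num_docs)
    for doc_id, norm in values_by_doc_id.items():
        norms[doc_id] = norm
    return norms

def read_norm(norms, doc_id):
    """Look up a norm by reading the byte at the document ID's index."""
    return norms[doc_id]

num_docs = 1000                    # x documents in the index
sparse_field = {3: 120, 42: 97}    # only two documents have this field

norms = build_norms(num_docs, sparse_field)
storage_bytes = len(norms)                      # x bytes for this field
used_fraction = len(sparse_field) / num_docs    # share of bytes holding a real value
```

In this sketch the lookup is a single array read, which is why it is fast, while the array still occupies 1,000 bytes even though only two documents carry a value for the field.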

Authors (2)

Srivastava

Anurag Srivastava is a senior technical lead in a multinational software company. He has more than 12 years' experience in web-based application development. He is proficient in designing architecture for scalable and highly available applications. He has handled development teams and multiple clients from all over the globe over the past 10 years of his professional career. He has significant experience with the Elastic Stack (Elasticsearch, Logstash, and Kibana) for creating dashboards using system metrics data, log data, application data, and relational databases. He has authored three other books: Mastering Kibana 6.x, Kibana 7 Quick Start Guide, and Learning Kibana 7 - Second Edition, all published by Packt.

Miller

Preston Miller is a consultant at an internationally recognized risk management firm. Preston holds an undergraduate degree from Vassar College and a master's degree in digital forensics from Marshall University. While at Marshall, Preston unanimously received the prestigious J. Edgar Hoover Foundation's scientific scholarship. Preston is a published author, recently of Python Digital Forensics Cookbook, which won the Forensic 4:cast Digital Forensics Book of the Year award in 2018. Preston is a member of the GIAC advisory board and holds multiple industry-recognized certifications in his field.
