You're reading from Expert C++: Become a proficient programmer by learning coding best practices with C++17 and C++20's latest features

Product type Paperback

Published in Apr 2020

Publisher Packt

ISBN-13 9781838552657

Length 606 pages

Edition 1st Edition

Languages

C++

Concepts

Programming Language

Authors (2):

Vardan Grigoryan

Shunguang Wu

Table of Contents (22) Chapters

Preface

1. Section 1: Under the Hood of C++ Programming

2. Introduction to Building C++ Applications

3. Low-Level Programming with C++

4. Details of Object-Oriented Programming

5. Understanding and Designing Templates

6. Memory Management and Smart Pointers

7. Section 2: Designing Robust and Efficient Applications

8. Digging into Data Structures and Algorithms in STL

9. Functional Programming

10. Concurrency and Multithreading

11. Designing Concurrent Data Structures

12. Designing World-Ready Applications

13. Designing a Strategy Game Using Design Patterns

14. Networking and Security

15. Debugging and Testing

16. Graphical User Interface with Qt

17. Section 3: C++ in the AI World

18. Using C++ in Machine Learning Tasks

19. Implementing a Dialog-Based Search Engine

20. Assessments

21. Other Books You May Enjoy

Indexing documents

The key functionality of search engines is indexing. The following diagram shows how documents downloaded by the crawler are processed to build the index file:

The preceding diagram shows the index as an inverted index, and user queries are directed to it. Although this chapter uses the terms index and inverted index interchangeably, inverted index is the more accurate name. First, let's see what an index is for a search engine. The whole reason for indexing documents is to provide fast search. The idea is simple: each time the crawler downloads a document, the search engine processes its contents and divides them into words, each of which refers back to that document. This process is called tokenization. Let's say we have a document downloaded from Wikipedia containing the following text (for brevity...
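The idea of tokenizing documents and mapping each word back to the documents that contain it can be sketched in a few lines of C++. This is a minimal illustration, not the book's implementation: the names `tokenize`, `index_document`, `lookup`, `DocId`, and `InvertedIndex` are hypothetical, tokens are assumed to be lowercase runs of ASCII letters and digits, and documents are assumed to be identified by plain integers.

```cpp
#include <cctype>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical types for this sketch: a document is identified by an integer,
// and the inverted index maps each word to the set of documents containing it.
using DocId = int;
using InvertedIndex = std::map<std::string, std::set<DocId>>;

// Split text into lowercase tokens made of ASCII letters and digits.
// Every other character is treated as a separator (an assumption of this sketch).
std::vector<std::string> tokenize(const std::string& text) {
    std::vector<std::string> tokens;
    std::string current;
    for (unsigned char ch : text) {
        if (std::isalnum(ch)) {
            current += static_cast<char>(std::tolower(ch));
        } else if (!current.empty()) {
            tokens.push_back(current);
            current.clear();
        }
    }
    if (!current.empty()) {
        tokens.push_back(current);
    }
    return tokens;
}

// Tokenize a downloaded document and record, for each word, that it occurs in `id`.
void index_document(InvertedIndex& index, DocId id, const std::string& text) {
    for (const auto& token : tokenize(text)) {
        index[token].insert(id);
    }
}

// Return the documents containing a single query word (empty set if none).
std::set<DocId> lookup(const InvertedIndex& index, const std::string& word) {
    const auto tokens = tokenize(word);
    if (tokens.empty()) {
        return {};
    }
    const auto it = index.find(tokens.front());
    return it == index.end() ? std::set<DocId>{} : it->second;
}
```

Answering a one-word query then becomes a single map lookup instead of a scan over every document, which is the speed-up that indexing is meant to provide. A real search engine would also store word positions, handle non-ASCII text, and persist the index to a file; none of that is shown here.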

Authors (2)

Vardan Grigoryan is a senior backend engineer and C++ developer with more than 9 years of experience. Vardan started his career as a C++ developer and then moved to the world of server-side backend development. While being involved in designing scalable backend architectures, he always tries to incorporate the use of C++ in critical sections that require the fastest execution time. Vardan loves tackling computer systems and program structures on a deeper level. He believes that true excellence in programming can be achieved by means of a detailed analysis of existing solutions and by designing complex systems.

Cheng-Yang Wu has been tackling infrastructure and system reliability since he received his master's degree in computer science from National Taiwan University. His laziness prompted him to master DevOps skills to maximize his efficiency at work so as to squeeze in writing code for fun. He enjoys cooking, as it's just like working with software: a perfect dish always comes from balanced flavors and fine-tuned tastes.