Need for distributed search
At the beginning of the chapter, we saw some of the reasons that lead to the need for distributed search. Any search engine has two important functions: first, to index the data, and second, to provide real-time search. As the data grows, single-node enterprise search applications face the following issues:
- Sometimes an index on a single machine is insufficient because it cannot accommodate all of the enterprise's information. This mainly applies to enterprises with growing data, which require large indexes.
- As more and more users start using enterprise search, the volume of search requests grows sharply. A single node can serve only a limited number of requests within a stipulated time, even if the data is not huge.
- For frequently changing data, the indexer has to index the data swiftly to avoid lag and further delays. Often, fast index generation is one of the primary expectations of enterprises...
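
The first two limits in the list above can be illustrated with a rough, back-of-the-envelope capacity estimate. The following is only a sketch under a deliberately simple model: the function name `nodes_needed`, its parameters, and all the numbers are hypothetical, and it assumes that every query touches every shard and that throughput scales linearly with the number of copies of a shard. Real deployments would need measurements rather than this arithmetic.

```python
import math

def nodes_needed(index_gb, node_capacity_gb, peak_qps, node_qps):
    """Return (shards, replicas_per_shard) under a simple capacity model.

    All parameters are illustrative assumptions, not measured values.
    """
    # Shards: split the index until each piece fits on a single node.
    shards = max(1, math.ceil(index_gb / node_capacity_gb))
    # Replicas: copy each shard until the copies together absorb peak traffic.
    replicas = max(1, math.ceil(peak_qps / node_qps))
    return shards, replicas

# A large index with heavy traffic needs both splitting and copying.
shards, replicas = nodes_needed(index_gb=1200, node_capacity_gb=500,
                                peak_qps=900, node_qps=400)
print(shards, replicas, shards * replicas)

# A small index can still outgrow one node when traffic is high.
print(nodes_needed(index_gb=50, node_capacity_gb=500,
                   peak_qps=900, node_qps=400))
```

In this model, index size drives the number of pieces the index is split into, while request volume drives how many copies of each piece are needed, which is why a single node can fall short even when the data is not huge.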