Searching Solr documents in near real time
Apache Solr performs indexing on the updated data, and the data is available for search. In many cases, the index-generation job is run during offline hours (late nights, weekends) to update the search with the newer data. Until this point, Apache Solr cannot search for documents added in the customer document repository. Many times, the demand is to make a document available for search as and when it is uploaded to the customer repository. Apache Solr can perform search on these documents in near real time. There is a delay to generate index for a document and to make it available for search; hence, Apache Solr can support near real-time search on documents.
Strategies for near real-time search in Apache Solr
In Apache Solr, a commit operation is required to enable the document to be made available for searching. Commit operation in Apache Solr involves Solr access transaction logs (or update logs), which pick the identifiers and sync the index files...