The source code for Concierge has a lot of moving parts, and jumping straight into it without any prior understanding is unlikely to be productive. Instead, we will take the tasks defined in the previous sections and present them as flow diagrams. First, however, let's take a brief look at the symbols and naming conventions used in the diagrams and code.
Conventions
Code conventions
The following are the entities in Concierge:
- payload (p): This represents the payload received to add a new document to the index.
- document (d): This represents all the metadata for a unique document.
- line (l): This represents all the metadata for a single line within a document.
- token (t): This represents all the metadata...
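To make the relationship between these entities concrete, here is a minimal sketch in Go of how they might nest. It is not Concierge's actual code: every field name (Title, Body, Index, Text, Lines, Tokens) and the newDocument helper are hypothetical, and it assumes that a token is a whitespace-separated word within a line. The short names p, d, l, and t are used as variable names to mirror the convention listed above.

```go
package main

import (
	"fmt"
	"strings"
)

// NOTE: all field names below are hypothetical and exist only to
// illustrate how the four entities relate to one another.

// payload (p): the request received to add a new document to the index.
type payload struct {
	Title string
	Body  string
}

// token (t): metadata for a single token (assumed here to be one word).
type token struct {
	Text  string // lowercased word
	Index int    // position of the token within its line
}

// line (l): metadata for a single line within a document.
type line struct {
	Index  int // position of the line within its document
	Tokens []token
}

// document (d): metadata for a unique document.
type document struct {
	Title string
	Lines []line
}

// newDocument is a hypothetical helper that splits a payload into
// lines, and each line into whitespace-separated tokens.
func newDocument(p payload) document {
	d := document{Title: p.Title}
	for li, raw := range strings.Split(p.Body, "\n") {
		l := line{Index: li}
		for ti, word := range strings.Fields(raw) {
			t := token{Text: strings.ToLower(word), Index: ti}
			l.Tokens = append(l.Tokens, t)
		}
		d.Lines = append(d.Lines, l)
	}
	return d
}

func main() {
	p := payload{Title: "Sample", Body: "Hello World\nfoo bar baz"}
	d := newDocument(p)
	for _, l := range d.Lines {
		fmt.Printf("line %d has %d tokens\n", l.Index, len(l.Tokens))
	}
}
```

The point of the sketch is the containment order: a payload produces one document, a document holds lines, and a line holds tokens. Each level carries its own index so that a token can be traced back to its position in the original text.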