In this section, we are going to introduce built-in analyzers. Each built-in analyzer contains a tokenizer and zero or more token filters. The corresponding parameter of the token filter that is used can be applied to the analyzer just like in the previous section. No more character filters or token filters are added in the testing. We'll cover all the supported analyzers and compile the testing results in the following table. The input text for all testing will be In Elasticsearch 7.0:
Analyzer | Composed of | Output tokens | |
Tokenizer | Token filter | ||
standard | standard | lowercase + stop (disable in default) | ["in", "elasticsearch", "7.0"] |
simple | lowercase | ["in", "elasticsearch"] | |
whitespace | whitespace | ["In", "Elasticsearch", "7.0"] | |
stop | lowercase | stop | ["elasticsearch... |