Elasticsearch Ngram and Query String Query

Question

I am using Elasticsearch 1.2.1.

I am using Ngram tokenizer to tokenize my docs. I have a special use case, where my field may be very long (200-500 chars) and I would like to support lengthy (up to 200 chars) "contains" queries from any point of the field.

I started with Ngram analyzer with up to 260 chars and quickly discovered index time is too slow and capacity is too large, so I reduced the size to about 30 chars.

Now, I would like to be able to break tokens larger than 30 chars into smaller tokens and replace the user search with the broken tokens (knowing that I might be getting more results than I might have if I were to use a larger Ngram index).

What is the recommended way of achieving this functionality? Note that I am using query string query.

Elasticsearch Ngram and Query String Query

Answers (1)

Related Questions