Choosing query analyzer at query time in Solr

Question

I am indexing documents that have a large, textual content field. Most of the time I want to do special processing on that data, as well as on the incoming queries. (My current fieldType definition is at the bottom.)

However, sometimes, like when the user passes in something in quotation marks, I'd like to essentially use a different query analyzer than the one defined for the field. Maybe use a KeywordTokenizerFactory instead of a WhitespaceTokenizerFactory, so that I can match "multiple words in a phrase" without them being split apart.

How can I choose a different query analyzer at query time?

I understand that I can use copyField and setup an entirely different field definition, but this would essentially double the space used for my Solr index, which isn't feasible.

kkrugler · Accepted Answer

It is actually possible to dynamically change the analyzer used, but it requires some custom code. Check out slide 30 in http://www.slideshare.net/treygrainger/semantic-multilingual-strategies-in-lucenesolr, where Trey is talking about using this approach to support different analyzers for multi-lingual fields. His approach has to do this for both indexing and query analysis, whereas for you it's just the query.

Here's the JIRA feature request that Trey is referencing.

Choosing query analyzer at query time in Solr

Answers (1)

Related Questions