Anirudh Jadhav

Reputation: 1007

Not able to exclude a partial string in Solr 5.3.1?

The string is:

<GET:notes/count><GET:notes/search_note><GET:util/codemaps/([^/]+?)><GET:users/pending_requests><GET:users/pending_activation><GET:users/firstnames><GET:users/profile><GET:tasks/tasks/count><GET:school/schools/count><GET:school/classrooms/count><GET:quiz/count><GET:quiz/quizset/count><GET:notes/([^/]+?)><GET:locations/counties/count><GET:lesson/books/count><GET:general/codemaps/([^/]+?)><GET:discussions/topics/count><GET:admin/sessions><GET:admin/sessions/count><GET:admin/sessions/([^/]+?)><PUT:content/actions><POST:content/html/totext><GET:content/multimedia/images/([^/]+?)/([^/]+?)>

My query is:

log_message:"*emaps/\(\[\^/\]\+\?\)\>*"

Here log_message is a field and its type is text_std_token_lower_case. The field type definition is:

<fieldType name="text_std_token_lower_case" class="solr.TextField" positionIncrementGap="100" multiValued="true">
  <analyzer>
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory" />
  </analyzer>
</fieldType>

Upvotes: 3

Views: 108

Answers (1)

Gus

Reputation: 6871

The tokenizer you have chosen (StandardTokenizerFactory) ignores punctuation characters. You can see this on the analysis page in the Solr admin UI. This affects the tokenization of both your query and your field, so you will need a tokenizer that does not omit punctuation.
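A quick way to see the effect outside the admin UI is to run one of the fragments through the tokenizer directly, using the Lucene library Solr is built on. A minimal sketch (class name mine, Lucene 5.x API):

import java.io.StringReader;

import org.apache.lucene.analysis.standard.StandardTokenizer;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;

public class StandardTokenizerDemo {
    public static void main(String[] args) throws Exception {
        // Feed one of the problematic fragments through StandardTokenizer,
        // the same tokenizer text_std_token_lower_case uses.
        StandardTokenizer tokenizer = new StandardTokenizer();
        tokenizer.setReader(new StringReader("util/codemaps/([^/]+?)"));

        CharTermAttribute term = tokenizer.addAttribute(CharTermAttribute.class);
        tokenizer.reset();
        while (tokenizer.incrementToken()) {
            // Prints "util" and "codemaps": the slashes and all of the
            // regex punctuation are silently discarded.
            System.out.println(term.toString());
        }
        tokenizer.end();
        tokenizer.close();
    }
}

Query terms get the same treatment, so the escaped punctuation in your wildcard query is searching for tokens that were never indexed.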

One possible option is the Regular Expression Tokenizer documented on the Solr wiki (https://cwiki.apache.org/confluence/display/solr/Tokenizers). Perhaps you are looking for something like this?

<analyzer>
  <tokenizer class="solr.PatternTokenizerFactory" pattern="(>?<(PUT|GET|POST):)|>\s"/>
</analyzer>
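Since PatternTokenizerFactory with no group attribute splits the input on matches of the pattern, you can preview the tokens it should produce with plain java.util.regex. A minimal sketch (sample input shortened):

import java.util.Arrays;
import java.util.regex.Pattern;

public class PatternTokenDemo {
    public static void main(String[] args) {
        // The same regex the analyzer above uses as its split pattern.
        Pattern splitter = Pattern.compile("(>?<(PUT|GET|POST):)|>\\s");

        String log = "<GET:notes/count><GET:notes/search_note><GET:util/codemaps/([^/]+?)> ";

        // Prints: [, notes/count, notes/search_note, util/codemaps/([^/]+?)]
        // The leading empty string comes from the match at position 0;
        // PatternTokenizer itself skips zero-length tokens, so the indexed
        // field should hold only the three path tokens, punctuation intact.
        System.out.println(Arrays.toString(splitter.split(log)));
    }
}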

That may require some tweaking if the URLs can contain > characters that are not percent-encoded, if HEAD requests are possible, and so on. I am not confident this will perform well, however, since regular expressions can become expensive; if it bogs things down you might need to write your own tokenizer.
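For what it's worth, a hand-rolled tokenizer for this shape of input does not have to be much code. A minimal sketch, assuming Lucene 5.x and a hypothetical class name; it would still need a small TokenizerFactory wrapper before schema.xml could reference it:

import org.apache.lucene.analysis.util.CharTokenizer;

// Treat '<', '>' and whitespace as delimiters and keep every other
// character, so "<GET:notes/count>" becomes the single token
// "GET:notes/count" with its punctuation intact.
public final class AngleBracketTokenizer extends CharTokenizer {
    @Override
    protected boolean isTokenChar(int c) {
        return c != '<' && c != '>' && !Character.isWhitespace(c);
    }
}

Note that unlike the pattern above this keeps the GET:/PUT:/POST: prefix inside the token, so pick whichever granularity your queries need.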

Upvotes: 1
