ranjit
ranjit

Reputation: 51

Special character indexing

i am creating a Lucene 3.0.3 index using StandardAnalyzer.

when searching is made on index using query like C, C# or C++ it gives same result for all these three term. As, i know while creating index analyzer ignore special character and do not create index for same.

Need to be able to differentiate between "C", "C#" and "C++"

please suggest me that, Is any existing analyzer will resolve this issue?

Any suggestion will be appreciated!!!

Upvotes: 2

Views: 1320

Answers (2)

Marko Bonaci
Marko Bonaci

Reputation: 5708

I guess that happens because of the fact that StandardAnalyzer uses StandardFilter, which uses StandardTokenizer, which removes special characters.
You could create your own Analyzer implementation.

Upvotes: 3

Related Questions