Reputation: 51
I am creating a Lucene 3.0.3 index using StandardAnalyzer.
When I search the index with a query such as C, C# or C++, I get the same results for all three terms. As far as I know, the analyzer ignores special characters while building the index and does not index them.
I need to be able to differentiate between "C", "C#" and "C++".
Is there an existing analyzer that resolves this issue?
Any suggestions will be appreciated!
Upvotes: 2
Views: 1320
Reputation: 5708
I guess that happens because StandardAnalyzer wraps a StandardTokenizer in a StandardFilter, and the tokenizer removes special characters, so "C#" and "C++" are both indexed as plain "c".
You could create your own Analyzer implementation.
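To illustrate the idea, here is a minimal plain-Java sketch that does not use Lucene at all. The class `TokenizerSketch` and its two methods are hypothetical names made up for this example: `dropSymbols` only roughly models a tokenizer that discards punctuation, and `keepSymbols` models one that splits on whitespace alone. In Lucene 3.0.x the corresponding building blocks would presumably be a custom Analyzer whose `tokenStream` method returns a `LowerCaseFilter` wrapped around a `WhitespaceTokenizer`, but treat that as an assumption to check against your version.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; this is not Lucene code.
class TokenizerSketch {

    // Rough model of a tokenizer that drops punctuation:
    // only runs of letters and digits survive as tokens.
    static List<String> dropSymbols(String text) {
        List<String> tokens = new ArrayList<String>();
        for (String part : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!part.isEmpty()) {
                tokens.add(part);
            }
        }
        return tokens;
    }

    // Model of whitespace-only tokenization plus lowercasing:
    // '#' and '+' stay attached to the token.
    static List<String> keepSymbols(String text) {
        List<String> tokens = new ArrayList<String>();
        for (String part : text.trim().toLowerCase(Locale.ROOT).split("\\s+")) {
            if (!part.isEmpty()) {
                tokens.add(part);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(dropSymbols("C C# C++")); // prints [c, c, c]
        System.out.println(keepSymbols("C C# C++")); // prints [c, c#, c++]
    }
}
```

With whitespace-only splitting the three terms stay distinct, at the cost that punctuation glued to a word (for example "C++," in running text) is kept as well. Also note that the same analyzer has to be used at query time, and that "+" is an operator in the QueryParser syntax, so a query for C++ likely needs escaping, for instance with `QueryParser.escape`.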
Upvotes: 3
Reputation: 12843
See http://www.gossamer-threads.com/lists/lucene/java-user/91747?do=post_view_threaded#91747
Upvotes: 0