Lucene is not returning the results if I am searching with special characters

Question

I am using Lucene 6.6.0 version, and I am indexing my data using StandardAnalyzer.

I am indexing following data of words.

a&e networks
a&e

After indexing , when I am searching with a&e it is not returning any results. this is my sample code.

    Directory dir = new RAMDirectory();
    IndexWriterConfig iwc = new IndexWriterConfig(new StandardAnalyzer());
    iwc.setOpenMode(IndexWriterConfig.OpenMode.CREATE);
    IndexWriter writer = new IndexWriter(dir, iwc);

    Document doc = new Document();
    doc.add(new TextField("text", "a&e networks", Field.Store.YES));
    writer.addDocument(doc);
    doc = new Document();
    doc.add(new TextField("text", "a&e", Field.Store.YES));
    writer.addDocument(doc);
    writer.close();

    IndexReader reader = DirectoryReader.open(dir);

    IndexSearcher searcher = new IndexSearcher(reader);

    Query query = new TermQuery(new Term("text", "a&e"));

    TopDocs results = searcher.search(query, 5);
    final ScoreDoc[] scoreDocs = results.scoreDocs;
    for (ScoreDoc scoreDoc : scoreDocs) {
        System.out.println(scoreDoc.doc + " " + scoreDoc.score + " " + searcher.doc(scoreDoc.doc).get("text"));
    }
    System.out.println("Hits: " + results.totalHits);
    System.out.println("Max score:" + results.getMaxScore());

I am getting output as Hits: 0 Max score:NaN

Even I am searching for a also it is not giving any results in this case.

but if I add stopwords set to StandardAnalyzer like this

    List stopWords = Arrays.asList("&");
    CharArraySet stopSet = new CharArraySet(stopWords, false);
    IndexWriterConfig iwc = new IndexWriterConfig(new StandardAnalyzer(stopSet));

and after that if i search for a then I am able to get the results. but even in that case also if i search for a&e , then I am not getting any results.

please suggest me how to achieve this, my goal here is if I search for a&e I should be able to get the results. do I need to any CustomAnalyzer ? If so please explain what should I add in CustomAnalyzer?

Lucene is not returning the results if I am searching with special characters

Answers (1)

Related Questions