HTML parser not working in SOLR 3.6

Question

Using the solr.jar with the example in the download for Apache Solr 3.6, the HTML tags are not getting stripped.

In schema.xml I added the following:

Also, I posted the following JSON to SOLR:

[
{
    "id" : "978-064172344522",

    "title":"my link  power-shot PowerShot USC Utility 
hello
 Rejections Under 35 U.S.C. 101 and 35 U.S.C. 112, First Paragraph Petitions to correct inventorship of an issued patent are decided by the Supervisory Patent Examiner, as set forth"

}

]

After restarting SOLR, I conducted a search for power-shot and the results still show the HTML tags

 
 
 0.13561106
 978-064172344522
 
 my link power-shot PowerShot USC Utility 
hello

What is missing here?

HTML parser not working in SOLR 3.6

Answers (1)

Related Questions