user3260152

Reputation: 1

Tika 1.7 integration to Solr 5.1.0

I want to parse many RSS/Atom/RDF feeds using Tika 1.7 (which works pretty well, but not perfectly) and upload the data into Solr 5.1.0 automatically.

I can see the parsed data in my terminal, and it looks fine: each item is parsed into title, link, description, etc. But I don't know how to load that data into Solr automatically.

Any help is welcome. Kind regards, Christian

Upvotes: 0

Views: 203

Answers (1)

Andrea

Reputation: 2764

Solr has a contrib module called "Solr Cell" (the ExtractingRequestHandler), which is a content extraction library. It should do what you want, and it uses Tika behind the scenes for text and metadata extraction.

More information here: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika
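
As a rough illustration of what "uploading with Solr Cell" looks like, here is a minimal sketch in Python using only the standard library. It assumes Solr runs at the default `http://localhost:8983/solr`, that the extraction handler is enabled at `/update/extract` in your `solrconfig.xml`, and that you have a core named `feeds` (the core name, the file name and the helper function names are made up for this example). `literal.id` and `commit` are request parameters of the extracting handler.

```python
import urllib.parse
import urllib.request

SOLR_BASE = "http://localhost:8983/solr"  # assumption: default local Solr address
CORE = "feeds"                            # assumption: hypothetical core name


def build_extract_url(doc_id, base=SOLR_BASE, core=CORE, commit=True):
    """Build the Solr Cell (/update/extract) URL for one uploaded file."""
    params = {
        "literal.id": doc_id,                     # sets the document's unique key
        "commit": "true" if commit else "false",  # make the document searchable right away
    }
    return f"{base}/{core}/update/extract?{urllib.parse.urlencode(params)}"


def post_file(path, doc_id, content_type="application/octet-stream"):
    """POST the raw file bytes to Solr; Tika runs on the Solr side."""
    with open(path, "rb") as fh:
        body = fh.read()
    request = urllib.request.Request(
        build_extract_url(doc_id),
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.status  # HTTP status code of Solr's reply


# Example call (needs a running Solr, so it is left commented out):
# post_file("feed.xml", "feed-1", "application/rss+xml")
```

To automate it, you could call `post_file` in a loop over your downloaded feeds. One caveat: each request to this handler creates a single Solr document, so a whole feed posted as one file would likely end up as one document rather than one per item. If you need one document per item, it may be simpler to keep your own Tika parsing and send the parsed items to the regular `/update` handler instead.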

Upvotes: 1
