Reputation: 1
I want to parse (many) rss/atom/rdf feeds using Tika 1.7 (works pretty well but not perfect) and upload data into Solr 5.1.0. automatically.
I can see the data in my terminal - looks pretty nice, each item parsed: title, link, description etc - but I don't know how to load data automatically into Solr.
Any help is welcome, Kind regards, Christian
Upvotes: 0
Views: 203
Reputation: 2764
There's a contrib module called "SolrCell" which is a content extraction library. That works exactly as you want, and it uses Tika behind the scenes, for text and metadata extraction.
More information here: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika
Upvotes: 1