Reputation: 154
We have the index files which are provided by the client and now want to reindex them by removing the duplicate documents
for this i have to read the data from the index and have to generate an XML file or CSV file and then index them with eliminating duplicates
(correct me if im wrong i didnt find any other way to do this)
I know this is possible using Lucene but i wanted to know how can i do this using solr
Upvotes: 1
Views: 1228
Reputation: 13394
You can change the export format by setting the wt
parameter to xml
...&wt=xml
at you Solr query.
This will export the results to xml. So if you search for * you will get the full content - depending on the limit
parameter.
http://wiki.apache.org/solr/CoreQueryParameters#wt
That will also work with csv: http://wiki.apache.org/solr/CSVResponseWriter
Upvotes: 1