DavidVdd
DavidVdd

Reputation: 1018

Solr indexing increasing speed?

I'm trying to figure out whats holding the index speed back. I'm extracting text from pdf's to index each page seperatly to solr to get page hit results.

I was using commit after every "document". Then I noticed its spend loads of time rebuilding the index euch time I used commit.

Now I use this:

      <autoCommit> <maxDocs>10000</maxDocs> <maxTime>60000</maxTime> </autoCommit>

To get a commit every minute.

But then I was calculating and found out it indexed around 30 'documents'(pages as solrDoc)/sec or 10 real documents/sec. This seems pretty slow compared to other setups.

How could I increase my speed?

Extra info:(request if needed)

Upvotes: 1

Views: 372

Answers (1)

Persimmonium
Persimmonium

Reputation: 15791

one aspect is whether your process is multithreaded or not, if not, test by having several threads extracting text from pdf and then hand over to solr for indexing.

Upvotes: 1

Related Questions