Architecture for handling multiple user searching query at same time

Question

I am working on a problem similar to shopping online at Amazon.

There are many products and their information are stored in a database, the users can enter a search term and a list of closest matched products will be returned.

Currently I am using Lucene to handle the searching process. It's a very simple architecture:

When a user enters a search term, Lucene will go through the whole database to index every product
Then Lucene will return a list of best matched products

The problem with the prototype is, when there are many user querying, for each user, lucene needs to loop through entire database to index. And if the user queries, then logs out, then queries again, Lucene also needs to loop through again.. The speed is pretty slow.

So what are some way to improve this (or technology choice)?

Rob Conklin · Accepted Answer

When you create your lucene index (using an indexwriter), you should use a FSDirectory object to get a file. This is where your index is stored. Users should use a shared IndexSearcher to search this index.

IndexSearcher is thread safe (and relatively expensive to create), so you definitely want to keep it around after you use it.

I think you are going to be very impressed with the performance of this once you keep these things around.

Please take a look at this tutorial:http://oak.cs.ucla.edu/cs144/projects/lucene/ , it looks fairly good.

Architecture for handling multiple user searching query at same time

Answers (2)

Related Questions