Similarity between attributes in database

Question

I'm creating a new search system for my application. The idea now is to use query relaxation in order to get wider results from the database and then calculate it's relevance to the user. The problem is in the algorithm. I'm considering using something like nearest neighbor algorithm but I'm a little uncertain on how to use it.

How can I get the relevance, in %, of a record in databse, to the user search?

I need to do this operation in the attributes distance and category. In other words, when I'm querying the DB, the distance is multiplied by 2 and category is relaxed by selecting it's parent category.

An example: if the user searches for something that is up to 30km away and the category is 'soccer', I'll get from the DB all the records up till 60km and 'ballSports' (in a tree like: sports->fullContact->ballSports->soccer, so I'd get sports like soccer, football, rugby, and so on).

This % also needs to be calculated having in mind the weight of the attribute for the user. If the user considers category more important than distance, this has to be taken in to account when calculating the relevance.

A good example of a category tree and a formula to calculate distances can be found here on page 3: http://reference.kfupm.edu.sa/content/d/i/a_distributed_case_based_reasoning_appli_58512.pdf

How can I apply that formula to attributes? BTW, I'm using MongoDB so all data is in the document, no relations to other tables.

Thank you

Similarity between attributes in database

Answers (1)

Related Questions