MLlib Item Based Collaborative Filtering with No Ratings

Question

I am building a recommender system from query logs. For each query log I have data for what links were clicked by user. Users do not provide any ratings for the links they visit. I am trying to create a recommendation system that will suggest "If you have clicked this one, try this one which another similar user has tried". I am exploring Apache Spark - MLLib to use collaborative filtering for the purpose. Unfortunately the ALS algorithm takes "ratings" data.

Here is one of the solutions I got online:

"For each page we want recommendations for, we search for all the users who have viewed that page. Then, for each of those users, we look up all other pages they have viewed. We then count the number of users which have viewed each page in this data set, and use those with the highest count as our recommendations."

The user suggests that this approach is slow.

I was wondering if there is a good way to 'fake' the ranking data, or is there a popular open source implementation which does not requires the ranking data?

MLlib Item Based Collaborative Filtering with No Ratings

Answers (1)

Related Questions