System database design for article personalized recommendation system

Question

Hi I am designing a system which takes in article links from an API, sorts the articles into categories, and then sends a list of recommended article links to users based on users' specified filtering parameters.

The initial approach I've planned out is to use SQL databases to store the sorted articles as well as user info. Then each day I will run a SQL query on the article database for each user to fetch relevant article links. One thing I need to figure out is handling duplicate articles/users, but even assuming that there are unique instances this approach seems pretty inefficient.

I was wondering if there is a better way to design the system for scale, i.e., if the system has to handle the scope of millions of articles and millions of users?

Would grouping users together based on similar article filtering parameters be helpful (so potentially less queries need to be run if two or more users have the same article database querying)? Or would this effort be too complicated and not worthwhile?

System database design for article personalized recommendation system

Answers (1)

Related Questions