Beanstalkd queue to index documents in Elasticsearch

Question

I have an REST API which is composed by different resources. Some of those resources are also indexed and kept in sync in ES and I'm implementing a queue system to manage these operations in an async way. I decided to go for Beanstalkd as a queue system.

My tought

For each resource I will have a different tube and I will split indexing job by resources. For example I will have tubes like "index_users", "index_posts" which will receive jobs with the ids of resources to index in ES:

->useTube('index_users')->put( json_encode( [ 'ids' => [ 33, 35, 66 ] ] ) );

Have different tube for different resources helps me keeping things separated ( for example I can decide to stop indexing users just deleting the tube index_users ), job will be analyzed faster because there will be less amount of jobs per queue and a huge amount of indexing operations on one resource will not effect indexing other resources

My questions

Could be this a good way to procede?
What kind of cons this solution can have?
Someone told me that in beanstalk is better to have 1 tube with 1000000 jobs ( related to 2 resources ) rather than 2 tubes ( 1 for each resource ) with 5000 jobs each. They suggest me to go for a solution with just 1 tube also for memory consumption. Is this true?

Beanstalkd queue to index documents in Elasticsearch

Answers (1)

Related Questions