Reputation: 17402
I want to chain celery tasks in a STANDARD way.
I have a JSON file. Inside that file, there are many hardcoded URLs. I need to scrape those links, plus scrape the links that are found while scraping those links.
Currently, I'm doing it like this:
for each_news_source, news_categories in rss_obj.iteritems():
    for each_category in news_categories:
        category = each_category['category']
        rss_link = each_category['feed']
        json_id = each_category['json']
        try:
            list_of_links = getrsslinks(rss_link)
            for link in list_of_links:
                scrape_link.delay(link, json_id, category)
        except Exception as e:
            print "Invalid url", str(e)
I want something where `getrsslinks` is also a Celery task, and the scraping of the list of URLs returned by `getrsslinks` should also be another Celery task.
It follows this pattern:
hardcodedJSONURL1
  --`getrsslinks` (celery task)
    --scrape link 1 (celery task)
    --scrape link 2 (celery task)
    --scrape link 3 (celery task)
    --scrape link 4 (celery task)
hardcodedJSONURL2
  --`getrsslinks` (celery task)
    --scrape link 1 (celery task)
    --scrape link 2 (celery task)
    --scrape link 3 (celery task)
    --scrape link 4 (celery task)
and so on...
How can I do this?
Upvotes: 0
Views: 2877
Reputation: 4129
Take a look at the subtask options in Celery. In your case, a group should help. You just need to call a group of `scrape_link` subtasks inside `getrsslinks`.
from celery import group

@app.task
def getrsslinks(rsslink, json_id, category):
    # do processing here to build link_list from the feed
    # fan out one scrape_link task per discovered link, as a group
    scrape_jobs = group(scrape_link.s(link, json_id, category) for link in link_list)
    scrape_jobs.apply_async()
    ...
You might want `getrsslinks` to return `scrape_jobs` to make monitoring the jobs easier. Then, when parsing your JSON file, you would call `getrsslinks` like so:
for each_news_source, news_categories in rss_obj.iteritems():
    for each_category in news_categories:
        category = each_category['category']
        rss_link = each_category['feed']
        json_id = each_category['json']
        getrsslinks.delay(rss_link, json_id, category)
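As one way to keep a handle on the group (a minimal sketch, assuming a configured result backend; `GroupResult.save`/`GroupResult.restore` are Celery's helpers for persisting a group's task ids, and `group_id` is an illustrative variable holding the id you stored):

from celery.result import GroupResult

# inside getrsslinks, instead of firing and forgetting:
result = scrape_jobs.apply_async()
result.save()      # persist the group's task ids in the result backend
return result.id

# later, from any process, rebuild the handle and inspect progress:
saved = GroupResult.restore(group_id)
print "completed scrape jobs:", saved.completed_count()
print "all done?", saved.ready()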
Finally, to monitor which links were invalid (since we replaced the try/except block), you need to store all the `getrsslinks` tasks and watch for success or failure. You could use `apply_async` with `link_error` for this.
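For example (a sketch in the Celery 3.x style, where an errback linked via `link_error` is called with the id of the failed task; `log_invalid_url` is an illustrative name, and a result backend must be configured for the lookup):

@app.task(bind=True)
def log_invalid_url(self, uuid):
    # look up the failed getrsslinks task; result.result holds the exception
    result = self.app.AsyncResult(uuid)
    print "Invalid url", result.result

# dispatch getrsslinks with the errback linked:
getrsslinks.apply_async((rss_link, json_id, category),
                        link_error=log_invalid_url.s())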
Upvotes: 1