Scrapy Item Loaders - understanding input and output processors?

Question

So from the scrapy docs I see:

The input processor processes the extracted data as soon as it’s received .... and the result of the input processor is collected and kept inside the ItemLoader. After collecting all data, the ItemLoader.load_item() method is called to populate and get the populated item object. That’s when the output processor is called with the data previously collected (and processed using the input processor). The result of the output processor is the final value that gets assigned to the item.

I get the idea of the input processor. For example, have some data that you want to clean up? Just run it through the appropriate input processor. What I don't understand is the purpose of the output processor. How is this functionally even different from the input processor? Couldn't you just include whatever data transformation you want in the first input processor?

Scrapy Item Loaders - understanding input and output processors?

Answers (1)

Related Questions