Reputation: 21
I need to join a huge sales table (more than 600 million rows) with additional information from other tables. The first part of the flow looks like this:
It starts sorting the data, but after 190M rows it fails with an error about a lack of disk space:
Information: 0x4004300C at load NT_sales by customers, SSIS.Pipeline: Execute phase is beginning.
Error: 0x80070070 at load NT_sales by customers:
Error: 0xC004704A at load NT_sales by customers: The buffer manager cannot extend the file "D:\Users\BUB2523\AppData\Local\Temp\17\DTS{1CF72EC9-E25F-49B1-AE1E-52296F16E0F2}.tmp" to length 4640004 bytes. There was insufficient disk space.
Error: 0xC0047048 at load NT_sales by customers:
Information: 0x4004800D at load NT_sales by customers: The buffer manager failed a memory allocation call for 4640000 bytes, but was unable to swap out any buffers to relieve memory pressure. 1 buffer was considered and 0 were locked. Either not enough memory is available to the pipeline because not enough are installed, other processes were using it, or too many buffers are locked.
Information: 0x4004800F at load NT_sales by customers: Buffer manager allocated 2801 megabyte(s) in 603 physical buffer(s).
Information: 0x40048010 at load NT_sales by customers: Component "Sort" (1325) owns 2585 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "Merge Join 4" (572) owns 27 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "Merge Join 2" (415) owns 23 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "SUM and GROUP BY" (1930) owns 23 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "Sort by cat_2" (1375) owns 21 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "Merge Join 3" (495) owns 21 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "Sort by cat_3" (1505) owns 21 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "Sort by EAN" (1640) owns 13 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "Merge Join 1" (349) owns 13 megabyte(s) physical buffer.
Information: 0x40048010 at load NT_sales by customers: Component "Sort by FK_store_code" (1770) owns 12 megabyte(s) physical buffer.
I tried to avoid the sorting by ordering the data at the source, but then it fails in a similar way on a Merge Join. I've decreased the DefaultBufferMaxRows property, but it doesn't help. The process runs slower, but it fails at the same point and consumes all of the available disk space (approximately 80 GB).
Can I split it into batches somehow, or is there maybe a mechanism in SSIS to execute this project multiple times over smaller slices of the data?
P.S. Thanks for your answers. I've partitioned the data by weeks and pushed the sorting into the databases with ORDER BY queries, but there is still a problem: the sorting never ends. 25M rows is not that many in my opinion, but for some reason the sort doesn't finish even after a few hours, and I have to do four of them in this flow. Maybe there are some properties to change or something? I have no idea what's going on.
Upvotes: 1
Views: 1306
Reputation: 61269
If I hand you a bag of numbers and ask you to sort them, you can't finish the task until you've taken every number out of the bag, because the last one might affect the ordering of the rest. The same holds true here: the Sort operation cannot complete until all 600M rows have arrived. In SSIS parlance, a Sort is an asynchronous, fully blocking component. Asynchronous components effectively halve the amount of memory available to your process because the data is physically copied from one memory address to another; as a general rule, SSIS otherwise operates on pointers to an address, which is how SSIS set the ETL record back in the day. You have five Sorts visible in your screenshot, so you either need an amazingly large amount of memory available or you need to restructure your workflow.
The easy and correct answer is to write a query that does the joins. You (or your DBAs) can tune a query all day long to make it faster and less resource-intensive. The database engine can use indexes, statistics, etc. for efficient data access; SSIS has none of that available to it. It's going to pull every bit of data into the pipeline, and only then will it discover "oh, I didn't need that" — on top of the network, memory, and CPU costs along the way. As always, do as much as possible in your source systems. For trivial workloads on commodity hardware your pattern, while slow and inefficient, is fine; half a billion rows puts you into a different category where these things start to matter.
You might find that you still need to apply the next set of optimizations to your data flow, but rewriting the data extraction into a proper query is where the work starts.
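As a rough sketch of what that consolidation looks like (table and column names here are invented, since I can only guess at your schema), the whole chain of merge joins and sorts collapses into something like:

```sql
-- Hypothetical tables and keys; substitute your own.
-- The ORDER BY lets you mark the source output as pre-sorted
-- (IsSorted = True on the output, SortKeyPosition on the key
-- columns in the source's Advanced Editor), so no Sort
-- component is needed downstream.
SELECT
    s.sale_id
,   s.FK_store_code
,   s.EAN
,   s.amount
,   c.customer_name
,   st.store_name
FROM
    dbo.NT_sales AS s
    INNER JOIN dbo.customers AS c
        ON c.customer_id = s.customer_id
    INNER JOIN dbo.stores AS st
        ON st.store_code = s.FK_store_code
ORDER BY
    s.FK_store_code;
```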
If the data supports it, segment the number of rows that flow into the system. If the destination only needs 2021 data, filter the source by adding a WHERE clause: WHERE SRC.insert_date >= '2021-01-01' AND SRC.insert_date < '2022-01-01';
If you need all of the data, this still applies, but you'll need to add a ForEach enumerator outside the Data Flow and cycle through the years, as in the sketch below.
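Inside the loop, the OLE DB Source query takes the loop's current value via the ? placeholders, which you map to your SSIS variables on the source's Parameters page. Again a sketch with invented names:

```sql
-- The ? markers are mapped to SSIS variables, e.g. hypothetical
-- @[User::RangeStart] and @[User::RangeEnd], which the ForEach
-- enumerator advances on every pass.
SELECT
    s.sale_id
,   s.customer_id
,   s.amount
FROM
    dbo.NT_sales AS s
WHERE
    s.insert_date >= ?
    AND s.insert_date < ?;
```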
A Merge Join is needed only if one source row can match multiple reference rows. If it's always a one-to-zero or one-to-one match, a Lookup is going to be far more effective. With a Lookup you can also apply the horizontal segmentation from above to limit the amount of reference data it pulls in. The Lookup eliminates the costly sort operations and produces the same result, as long as you're not dealing with a one-to-many relationship.
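A quick way to find out whether a reference table is safe for a Lookup is to test it for duplicate keys (hypothetical names once more):

```sql
-- Zero rows returned means each key appears at most once in the
-- reference table, so a Lookup is safe. Any rows returned mean a
-- one-to-many relationship, and the Merge Join is still needed.
SELECT
    c.customer_id
,   COUNT(*) AS matches
FROM
    dbo.customers AS c
GROUP BY
    c.customer_id
HAVING
    COUNT(*) > 1;
```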
Upvotes: 1