Delta Ingestion in druid from s3

Question

I am doing a POC over druid. I am ingesting data from s3 it takes ~7 mins for 289MB of data with default configurations. Now I have set "maxNumConcurrentSubTasks": 2 and "appendToExisting": true. When try to ingest the same data into druid from s3 it is taking almost equal time as above. I was expecting much lesser time as I have not updated any data and I am trying to append instead of overwriting the complete.

Am I misunderstanding concept of append in druid and also is there any optimum way to do delta ingestion from s3? Any leads would be appreciated.

Delta Ingestion in druid from s3

Answers (1)

Related Questions