Deepthi

Reputation: 79

Performance issue with JSON input

I am loading a MySQL table from a MongoDB source through Kettle. The MongoDB collection has more than 4 million records, and when I run the Kettle job the first full load takes 17 hours to finish. Even an incremental load takes more than an hour. I tried increasing the commit size and giving more memory to the job, but performance still does not improve. I think the JSON Input step takes a very long time to parse the data, which is why it is so slow. I have these steps in my transformation:

  1. MongoDB Input
  2. JSON Input
  3. Strings cut
  4. If field value is null
  5. Concat fields
  6. Select values
  7. Table output

Extracting the same 4 million records from Postgres was much faster than from MongoDB. Is there a way I can improve the performance? Please help me.

Thanks, Deepthi

Upvotes: 1

Views: 1423

Answers (1)

Codek

Reputation: 5164

Run multiple copies of the step. It sounds like you have a MongoDB Input step followed by a JSON Input step to parse the JSON results, right? So launch 4 or 8 copies of the JSON Input step (right-click the step and change the number of copies to start), or more depending on how many CPUs you have, and it will speed up.

Alternatively, do you really need to parse the full JSON? Maybe you can extract the data you need with a regex or something simpler.
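In case it helps to see the idea outside of Kettle, here is a minimal sketch of the difference between a full JSON parse and a targeted regex extraction. The document shape and the `name`/`city` field names are made up for illustration; inside PDI you would do the equivalent with something like a Regex Evaluation step (or a scripting step) in place of the JSON Input step.

```python
import json
import re
import timeit

# A sample document as it might come out of the MongoDB Input step
# (shape and field names are assumptions for this sketch).
doc = '{"_id": "507f1f77bcf86cd799439011", "name": "Deepthi", "city": "Bangalore", "amount": 42.5}'

# Full parse: builds a dict for every field, even the ones you never use.
def full_parse(s):
    d = json.loads(s)
    return d["name"], d["city"]

# Targeted regex: pulls out only the two fields you actually need.
NAME_RE = re.compile(r'"name"\s*:\s*"([^"]*)"')
CITY_RE = re.compile(r'"city"\s*:\s*"([^"]*)"')

def regex_extract(s):
    return NAME_RE.search(s).group(1), CITY_RE.search(s).group(1)

# Rough comparison of the two approaches over many rows.
print("full parse:   ", timeit.timeit(lambda: full_parse(doc), number=100_000))
print("regex extract:", timeit.timeit(lambda: regex_extract(doc), number=100_000))
```

The regex approach only holds up if the documents are flat and predictably formatted; once you have nested objects or escaped quotes, you are better off keeping the JSON Input step and parallelising it as described above.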

Upvotes: 0
