Reputation: 61
We are receiving Json messages from upstream system via Kafka topic. Requirement is to store these messages into HDFS at certain interval. Since we are storing into HDFS we want to merge certain number of these Records in to single file. As per NiFi documentation we are using "MergeRecords" processor for that.
Below is the snapshot of the Processor Configuration. NiFi version: 1.8
For the Above configuration its expected that MergeRecords should have weighted for one of the thresholds i.e. Maximum records(100000) or Maximum Bean size(100KBs).
But its observed that bean is getting bundled pretty before either of the threshold is reached. It is triggering the bean formation only for 2 records of 5KB size.
If you could help with analysis and/or pointers as why MergeRecord processor is not behaving as per the configuration?
Upvotes: 2
Views: 906
Reputation: 226
Perhaps it is not waiting for Maximum records(100000) or Maximum Bean size(100KBs) because it hits the Max Bin Age that you specified first (1 minute).
Max Bin Age is defined in the docs as:
The maximum age of a Bin that will trigger a Bin to be complete.
Upvotes: 1