Reputation: 4109
Assume a Kafka cluster with a topic named MyTopic. According to the business logic I am implementing, adjacent records are considered equal whenever some subset of the value's properties, rather than the key's, are equal. Thus the built-in compaction, which is driven by key equality, doesn't work for my scenario. I could implement pseudocompaction on the consumer side, but that is not an option either, due to performance. The whole idea is to maintain the right compaction on the broker side. In addition, such compaction has to be applied only within some special consumer group; all other groups have to get the entire log of records as it is now.
To my knowledge, there is no way to implement such compaction. Am I wrong?
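For reference, the consumer-side pseudocompaction I am ruling out would look roughly like the sketch below, where adjacent records are dropped whenever the equality-relevant subset of the value repeats (equalitySubset is a placeholder for the business-specific projection; the topic, group, and serde choices are made up):

```java
import java.time.Duration;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Objects;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerConfig;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.common.serialization.StringDeserializer;

public class PseudoCompactingConsumer {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        props.put(ConsumerConfig.GROUP_ID_CONFIG, "special-group");
        props.put(ConsumerConfig.KEY_DESERIALIZER_CLASS_CONFIG, StringDeserializer.class);
        props.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG, StringDeserializer.class);

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(List.of("MyTopic"));
            // Adjacency only holds within a partition, so track the last
            // seen value subset per partition.
            Map<TopicPartition, String> lastSubset = new HashMap<>();
            while (true) {
                for (ConsumerRecord<String, String> record : consumer.poll(Duration.ofMillis(500))) {
                    TopicPartition tp = new TopicPartition(record.topic(), record.partition());
                    String subset = equalitySubset(record.value());
                    String previous = lastSubset.put(tp, subset);
                    if (Objects.equals(subset, previous)) {
                        continue; // adjacent duplicate by value subset: skip it
                    }
                    process(record);
                }
            }
        }
    }

    // Placeholder: project the value onto the property subset that defines equality.
    private static String equalitySubset(String value) {
        return value;
    }

    private static void process(ConsumerRecord<String, String> record) {
        System.out.printf("%d: %s%n", record.offset(), record.value());
    }
}
```

The drawback is exactly the performance problem above: every consumer still has to fetch and deserialize the full, uncompacted log.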
Upvotes: 1
Views: 752
Reputation: 26865
This question has already been answered correctly, i.e. it's not currently possible. But it's worth noting that KIP-280 has been approved and will add new compaction policies. It is currently targeted for Kafka 2.5.
It looks like your goal would be achieved with the new header policy.
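Under that policy, each record would carry a header whose value the log cleaner compares, keeping the record with the highest value. A minimal producer-side sketch, assuming the strategy ships as proposed in the KIP; the header name "version" and the big-endian long encoding are assumptions, not finalized API:

```java
import java.nio.ByteBuffer;
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.serialization.StringSerializer;

public class HeaderPolicyProducer {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class);
        props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, StringSerializer.class);

        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            ProducerRecord<String, String> record =
                    new ProducerRecord<>("MyTopic", "some-key", "some-value");
            // Hypothetical compaction header: a cleaner configured with the
            // header strategy would keep the record with the highest value here.
            record.headers().add("version",
                    ByteBuffer.allocate(Long.BYTES).putLong(42L).array());
            producer.send(record);
        }
    }
}
```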
Upvotes: 2
Reputation: 1096
You cannot have custom log compaction. The cleanup policy is either delete or compact based on keys: https://kafka.apache.org/documentation/#compaction
However, since your case only concerns some special consumer groups, you might create a stream that reads your topic, derives a new key (e.g. a hash based on the value subset), and writes to another topic with the cleanup policy set to compact, as sketched below.
This will obviously hold almost-duplicated data, which might not suit your case.
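A minimal sketch of that idea with Kafka Streams: re-key each record by the value subset and write to a second, compacted topic that only the special groups consume. The topic name "MyTopic.compacted", the String serdes, and extractEqualityKey are assumptions for illustration:

```java
import java.util.Properties;
import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.KafkaStreams;
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.StreamsConfig;
import org.apache.kafka.streams.kstream.KStream;

public class RekeyForCompaction {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "rekey-for-compaction");
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        props.put(StreamsConfig.DEFAULT_KEY_SERDE_CLASS_CONFIG, Serdes.String().getClass());
        props.put(StreamsConfig.DEFAULT_VALUE_SERDE_CLASS_CONFIG, Serdes.String().getClass());

        StreamsBuilder builder = new StreamsBuilder();
        KStream<String, String> source = builder.stream("MyTopic");

        // Re-key by a hash of the value properties that define business-level
        // equality, so the broker's key-based compaction collapses duplicates.
        source.selectKey((key, value) -> extractEqualityKey(value))
              .to("MyTopic.compacted"); // create this topic with cleanup.policy=compact

        KafkaStreams streams = new KafkaStreams(builder.build(), props);
        streams.start();
        Runtime.getRuntime().addShutdownHook(new Thread(streams::close));
    }

    // Placeholder: hash the subset of value properties that defines equality.
    private static String extractEqualityKey(String value) {
        return Integer.toHexString(value.hashCode()); // stand-in for real logic
    }
}
```

Note that key-based compaction collapses all records with the same key, not only adjacent ones, and only runs eventually, so the result approximates rather than exactly matches adjacent-only deduplication.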
Upvotes: 3