Reputation: 1
We are struggling to find some "best practice" regarding the use of Kafka Connect for CDC. What we are trying to achieve: extract online redo logs from Oracle through Kafka Connect. We have ~700 different tables, ranging from a couple of rows to ~40M rows for the biggest ones.
What we thought of using:
Which is the better approach: one connector per table, meaning we end up with 700 connectors (plus 700 topics for the related DDL under "database.server.name")? Because if we keep only one connector for all tables, the problem is that the work will not be parallelised.
I tried adding 3 Kafka Connect workers, but the issue is still the same: only one table is processed at a time.
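For context, here is roughly what a single Debezium Oracle connector covering many tables looks like; hostnames, credentials, and table names are placeholders, and property names follow the Debezium 1.x convention that matches the "database.server.name" mentioned above:

```json
{
  "name": "oracle-cdc-connector",
  "config": {
    "connector.class": "io.debezium.connector.oracle.OracleConnector",
    "tasks.max": "1",
    "database.hostname": "oracle-host.example.com",
    "database.port": "1521",
    "database.user": "c##dbzuser",
    "database.password": "********",
    "database.dbname": "ORCLCDB",
    "database.server.name": "myserver",
    "table.include.list": "INVENTORY.CUSTOMERS,INVENTORY.ORDERS",
    "database.history.kafka.bootstrap.servers": "kafka:9092",
    "database.history.kafka.topic": "schema-changes.myserver"
  }
}
```

Note that a log-based source connector like this reads a single ordered redo stream, which is why adding workers does not spread one connector's tables across them.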
Any best practice or real-world experience reports would be much appreciated.
Many thanks,
Upvotes: 0
Views: 1204
Reputation: 1
For starters, you don't want one connector per table. Debezium does log-based replication, so additional connectors will probably not be faster than a single connector with many tables, and will likely put undue stress on your database. The bigger factor in performance is whether you use the LogMiner interface or a method that extracts from the redo logs directly. Going through the LogMiner SQL interface has some performance overhead. I think there is now an OSS tool that can work with Debezium to read the redo logs directly, but traditionally doing so required licensing GoldenGate.
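The LogMiner-versus-direct-extraction choice above surfaces in the Debezium Oracle connector as a single property. A minimal sketch (other required connection properties omitted); `logminer` is the default, while `xstream` relies on the Oracle XStream API, which requires a GoldenGate license:

```json
{
  "database.connection.adapter": "logminer"
}
```

So before multiplying connectors, it is worth measuring whether the LogMiner mining sessions themselves are the bottleneck.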
Upvotes: 0