Reputation: 541
We are enabling CDC on specific tables in our MSSQL database. Our pipeline migrates data through MSSQL -> CDC -> Debezium -> Kafka Connect.
One table has more than a million rows, but we need only a few thousand of them to be included in the snapshot created when CDC is enabled. The reason I don't want to handle this in our Kafka consumer is that only about 1% of the data needs to be written to Mongo; the other 99% would hit the consumer without any use. A sketch of our source connector config is shown below.
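For reference, the source side looks roughly like this (a minimal sketch; hostnames, credentials, and the table name are placeholders, and exact property names vary by Debezium version):

{
  "name": "mssql-source",
  "config": {
    "connector.class": "io.debezium.connector.sqlserver.SqlServerConnector",
    "database.hostname": "mssql-host",
    "database.port": "1433",
    "database.user": "debezium",
    "database.password": "******",
    "database.dbname": "inventory",
    "database.server.name": "mssql",
    "table.include.list": "dbo.big_table",
    "database.history.kafka.bootstrap.servers": "kafka:9092",
    "database.history.kafka.topic": "schema-changes.inventory"
  }
}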
Questions: Is there a way to restrict which rows are included in the snapshot, so that only the required subset ever reaches Kafka and the consumer?
Upvotes: 2
Views: 1091
Reputation: 39880
You can use a Kafka Connect Single Message Transform (SMT). More precisely, you need the Filter SMT:

The filter.condition is a predicate, specified as a JSON path, that is applied to each record processed; when the predicate successfully matches, the record is either included (when filter.type=include) or excluded (when filter.type=exclude).
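For example, suppose a record's value looks like the following flat row (field names here are illustrative; this shape assumes the Debezium envelope has been flattened, e.g. with Debezium's ExtractNewRecordState transform, since in the raw envelope the row fields sit under "after"):

{
  "id": 42,
  "product": "widget",
  "modified_date": "2/15/2020"
}

The predicate $.value[?(@.modified_date > "1/1/2020")] matches this record, so with filter.type=include it would be kept. Note that the comparison is done on strings, so with M/D/YYYY dates the ordering is lexicographic rather than chronological; a sortable format such as ISO 8601 (2020-02-15) makes the condition behave as expected.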
In your case, you can include rows that satisfy your desired condition:
transforms=filterExample1
transforms.filterExample1.type=io.confluent.connect.transforms.Filter$Value
transforms.filterExample1.filter.condition=$.value[?(@.modified_date > "1/1/2020")]
transforms.filterExample1.filter.type=include
transforms.filterExample1.missing.or.null.behavior=fail
Alternatively, you can decide which rows to exclude:
transforms=filterExample1
transforms.filterExample1.type=io.confluent.connect.transforms.Filter$Value
transforms.filterExample1.filter.condition=$.value[?(@.modified_date <= "1/1/2020")]
transforms.filterExample1.filter.type=exclude
transforms.filterExample1.missing.or.null.behavior=fail
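If you manage the connector through the Kafka Connect REST API rather than a properties file, the same settings become keys in the connector's "config" object; only the transform-related keys are shown here, with the connection settings from the source config in the question unchanged:

"transforms": "filterExample1",
"transforms.filterExample1.type": "io.confluent.connect.transforms.Filter$Value",
"transforms.filterExample1.filter.condition": "$.value[?(@.modified_date > \"1/1/2020\")]",
"transforms.filterExample1.filter.type": "include",
"transforms.filterExample1.missing.or.null.behavior": "fail"

One caveat: this Filter SMT comes from Confluent's connect-transforms plugin, which must be installed on your Connect workers. It is not the same as the org.apache.kafka.connect.transforms.Filter that ships with Apache Kafka, which can only drop records and has to be combined with a separate predicate.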
Upvotes: 3